The Devil is in the Data

The Devil is in the Data is a blog about practical and fun data science in the R language by Dr Peter Prevos. This website goes beyond the hype, given that 99.9% of problems can be solved without machine learning. Good data science is:

  • Useful: Increases value.
  • Sound: Valid, reliable and reproducible.
  • Aesthetic: Easy to understand.

Proudly associated with R Bloggers

Top 40 R Programming Blogs to follow in 2020

Top 20 Programming Blogs

Discourse Network Analysis: Undertaking Literature Reviews in R
Literature review using R with RQDA and igraph. This method uses qualitative coding and quantitative discourse network analysis.
Celebrate Halloween with Creepy Computer Games in R
This article translates code from the 1983 Creepy Computer Games book by Usborne Publishing. Be careful not to get caught by the skeletons!
Mapping and Geocoding with ggmap and the Google API
Geocoding transforms a description to a location on the Earth's surface. This article explains geocoding with ggmap and the Google API.
Flat Earth Mathematics with examples in the R Language
The idea that the Earth is flat is unfortunately gaining popularity. This article discusses flat earth mathematics using the ggplot2 package in R.
Call Centre Workforce Planning Using Erlang C in R language
This article explains how to undertake Call Centre Workforce Planning using Erlang C in R, plus a Monte Carlo simulation to manage uncertainty.
Project Euler 33: Digit Cancelling Fractions
Solutions to Project Euler 33 in R: a Farey sequence to list all fractions between 0 and 1, visualised with Ford circles.
Analyse Definitions of Marketing with Rvest and Tidytext
This article describes how to scrape a website to analyse 72 definitions of marketing using the rvest and tidytext packages in the R language.
Qualitative Data Science: Using RQDA to analyse interviews
This article analyses interviews using the RQDA package. Qualitative data science is a valuable methodology because it preserves meaning in the data.