The Devil is in the Data

The Devil is in the Data is a blog about practical and fun data science in the R language by Dr Peter Prevos. This website goes beyond the hype, given that 99.9% of problems can be solved without machine learning. Good data science is:

  • Useful: Increases value.
  • Sound: Valid, reliable and reproducible.
  • Aesthetic: Easy to understand.

Proudly associated with

Proudly associated with R Bloggers

Top 40 R Programming Blogs to follow in 2020

Top 20 Programming Blogs

Top 20 Programming Blogs

Tap Water Sentiment Analysis using Twitter and Tidytext
This tap water sentiment analysis looks at a corpus of tweets about tap water to better understand people's attitudes to tap water.
Analyse Digital Water Meter Data using the Tidyverse
This article presents a series of functions, such as a diurnal curve, to analyse digital water meter data using the tidyverse libraries.
Simulating Water Consumption to Develop Analysis Tools
This article simulates water consumption to assist with developing leak detection algorithms. Simulating water consumption helps to develop business tools.
Project Euler 32: Pandigital Products
Solutions to Project Euler 32: Products whose multiplicand/multiplier/product identity is one through nine pandigital
Analysing soil moisture data with the ncdf4 library
netCFD is a popular data format using geospatial time series in any science. The ncdf4 library provides functionality to use the netCDF format in R.
Pacific Island Hopping using R and the iGraph package
Go on a virtual holiday to the tropical islands of the pacific. Go Pacific island hopping and create your perfect trip.
Project Euler 30: Digit Fifth Powers
Solutions to Project Euler 30 in R: All numbers that can be written as the sum of the fifth powers of their digits.
Visualise Water Consumption with a Geographic Bubble Chart
This article shows how to plot a geographic bubble chart to visualise water consumption. The data was obtained from Phú Thọ Water in Vietnam.