The Devil is in the Data

The Devil is in the Data is a blog about practical and fun data science in the R language by Dr Peter Prevos. This website goes beyond the hype, given that 99.9% of problems can be solved without machine learning. Good data science is:

  • Useful: Increases value.
  • Sound: Valid, reliable and reproducible.
  • Aesthetic: Easy to understand.

Proudly associated with

Proudly associated with R Bloggers

Top 40 R Programming Blogs to follow in 2020

Top 20 Programming Blogs

Top 20 Programming Blogs

GEDCOM Reader for the R Language: Analysing Family History
This article describes how to build a GEDCOM reader for the R langugage. The resulting data frame can be analysed in many ways.
Project Euler 144: Laser Beams and Elliptical Billiards
Solutions to Project Euler 144: This artcle provides a solution in the R language to investigate multiple reflections of a laser beam
The Secret of Landusia: A Text Adventure in the R Language
Write your own text adventure in the R programming langauge. How could we use machine learning to write interactive fiction?
Analyse Site Structure networks with R and igraph
This article discusses how to analyse site structure and internal linking structure of a Wordpress website using the iGraph package in R.
Decode Lyrics in Pop Music with the Songsim algorithm
The lyrics of songs are more and more repetitive. Wihin this repetitiveness hides a summetrical beauty. The Songsim algorithm helps to decode lyrics.
Strategic Data Science: Create Value from Data Big and Small
This article looks behind the hype and presents a model for strategic data science. Summary of the book Principles of Strategic Data Science.
Factor Analysis in R: Measuring Consumer Involvement
Consumer involvement measures how much customers care. This article explains measuring the Personal Involvement Inventory using factor analysis in R.
5½ Reasons to Ditch Spreadsheets for Data Science
Should you use spreadsheets for data science? This article argues that spreadsheets invite chaos. Writing code is poetry for data science.