The Devil is in the Data

The Devil is in the Data is a blog about practical and fun data science in the R language by Dr Peter Prevos. This website goes beyond the hype, given that 99.9% of problems can be solved without machine learning. Good data science is:

  • Useful: Increases value.
  • Sound: Valid, reliable and reproducible.
  • Aesthetic: Easy to understand.

Proudly associated with

Proudly associated with R Bloggers

Top 40 R Programming Blogs to follow in 2020

Top 20 Programming Blogs

Top 20 Programming Blogs

Project Euler 29: Distinct Powers
Solutions to Project Euler 29: How many distinct terms are in the sequence generated by a^b?
How Virtual Tags have transformed SCADA data analysis
International keynote at the Asset Data & Insights Conference in Auckland, 26 July 2017. Using virtual tags to create value from SCADA data.
Project Euler 28: Number Spiral Diagonals
Solutions to Project Euler 28: What is the sum of the numbers on the diagonals in a 1001 by 1001 spiral? Plus bonus Ulam Spiral.
Using the iGraph package to Analyse the Enron Corpus
Instructions on how to use R and iGraph to analyse the Enron corpus. The Enron corpus gives detailed insight into how employees communicate.
Project Euler 27: Quadratic Primes
Solutions to Project Euler 27 in the R language. Code to find the quadratic expression that generate the most prime numbers.
Project Euler 26: Reciprocal Cycles
Solutions to Project Euler 26: Find the value of d<1000 for which 1/d contains the longest cycle in its decimal fraction part.
Project Euler 25: Fibonacci number with 1000 digits
Solutions to Project Euler 25 in the R language. What is the first Fibonacci number with 1000 digits?
Project Euler 24: Lexicographic Permutations
Solutions to Project Euler 24 in the R language: What is the millionth lexicographic permutation of the digits zero to nine?