The Devil is in the Data

The Devil is in the Data is a blog about practical and fun data science in the R language by Dr Peter Prevos. This website goes beyond the hype, given that 99.9% of problems can be solved without machine learning. Good data science is:

  • Useful: Increases value.
  • Sound: Valid, reliable and reproducible.
  • Aesthetic: Easy to understand.

Proudly associated with R Bloggers

Top 40 R Programming Blogs to follow in 2020

Top 20 Programming Blogs

Project Euler 11: Largest product in a grid
Solutions to Project Euler 11: What is the greatest product of four adjacent numbers in the same direction (up, down, left, right, or diagonally) in a 20×20 grid?
Project Euler 10: Summation of Primes
Solutions to Project Euler 10 in the R language. Find the sum of all primes below two million, using the Sieve of Eratosthenes.
Spike Detection in Water Treatment Data in the R Language
This article explains how to implement spike detection for time series data using the run-length encoding function in the R language.
Project Euler 9: Special Pythagorean Triplet
Solutions to Project Euler 9: Find the Pythagorean triplet where a^2 + b^2 = c^2 and a + b + c = 1000.
Trumpworld Analysis: Ownership Relations in his Business Network
Trumpworld analysis of ownership relationships between businesses in Trumpland: the web of connections in the world of Donald J. Trump.
Mapping antipodes using the ggmap package
The antipode of any place on Earth is the point on the Earth's surface that is diametrically opposite to it. How can you use ggmap for mapping antipodes?
Project Euler 8: Largest product in a series
Solutions to Project Euler 8: Find the thirteen adjacent digits in the 1,000-digit number that have the greatest product. What is the value of this product?
Project Euler 7: 10,001st Prime Number
Solution to Project Euler 7 in the R language: What is the 10,001st prime number and what is the distribution of prime gaps?