Data Science for Water Utilities: Book Resources

Welcome to the website for "Data Science for Water Utilities". Here, you'll find summaries of each chapter and additional information. Each page on this website includes a screencast to demonstrate the code presented in the book.

Data Science for Water Utilities is an applied, practical guide that shows water professionals how to use data science to solve urban water management problems.

This guide features four case studies that progressively increase in complexity. The first case study analyses water quality to protect public health, while the second explores customer feedback. The third case study delves into the use of smart meter data. Finally, the guide analyses concrete curing data utilising machine learning techniques. Each case study builds upon fundamental principles and includes practical examples of code.

Readers will be familiar with analysing data but do not need coding experience to follow this course.

The data and code used in this book are available on GitHub:

Data Science for Water Utilities

Data Science for Water Utilities

Data Science for Water Utilities published by CRC Press is an applied, practical guide that shows water professionals how to use data science to solve urban water management problems using the R language for statistical computing.

Analysing the Customer Experience
This chapter of the Data Science for Utilities describes the principle of analysing the customer experience and using the R language
Basic Linear Regression
This chapter of the Data Science for Utilities discusses how to undertake basic linear regression as a foundation for statistical modelling
Basics of the R Language
This chapter of the Data Science for Water Utilities book introduces the basic principles of the R language with a case study about flow in open channel flow
Clustering Customers to Define Segments
This Data Science for Water Utilities chapter implements cluster analysis to segment customers using hierarchical clustering and k-means.
Descriptive Statistics in Water Quality
This chapter of Data Science for Water Utilities teaches how to generate various types of descriptive statistics and grouped analysis with R
Detecting Outliers and Anomalies in Time Series Data
This chapter of Data Science for Utilities discusses detecting outliers and anomalies in time series data using, including leak detection
Introduction to Machine Learning
This chapter of the Data Science for Utilities is an introduction to machine learning using multiple linear regression and decision trees.
Introduction to R for Utilities
This chapter of the Data Science for Utilities introduces the basic principles of using R for water utilities and advice on how to learn to code