Lifting the Big Data Veil: Data Science Strategy for Water Utilities

Data Science Strategy for water utilities: The Data Science Venn Diagram (Conway, 2010).

The Data Science Venn Diagram (Conway, 2010).

In my job as manager data science for a medium-sized water utility in Australia, I have developed a strategy to increased the amount of value we extract from data.

Many businesses that seek the promised benefits of Big Data don’t achieve those because they don’t start with the basics.

The most important data science strategy advice is to spend a lot of time getting to know and to improve data quality.

Good data science needs to comply with these four basic principles:

  • Utility: The analysis needs to be able to improve reality, otherwise we end with ‘analysis-paralysis‘. Although we speak of data science, it is really data engineering because we are not seeking the truth, we seek improvement of reality.
  • Soundness: The analysis needs to be scientifically valid so that managers can make reliable decisions.
  • Aesthetics: Visualisations need to be pleasing to the eye, not as a beautification but to ensure users draw correct conclusions.
  • Reproducibility: Analysts need to be able to repeat the work of other people to ensure quality control. This is where the science comes into data analytics.

I have recently published a paper about data science strategy for water utilities to share some of my thoughts on this topic.

Data Science Strategy for Water Utilities

Abstract: Big Data promises future benefits by using smart algorithms to improve the customer experience. Many organisations struggle leveraging the benefits of the data revolution. This paper summarises how water utilities can use the emerging field of data science to create value from information. The paper explains the principles of data science and illustrates these using examples from water utilities. This paper closes with recommendations on how to implement data science projects to maximise value from data. These benefits are realised using existing investments in information technology infrastructure and existing competencies.

You can read an extract of the paper on the Australian Water Association website. The full version is behind their paywall.

Furthermore, I am interested in creating an alliance with other water utility professionals that write code in R. Feel free to comment below to discuss any thoughts you might have on this issue.

12 thoughts on “Lifting the Big Data Veil: Data Science Strategy for Water Utilities

  1. Pingback: Lifting the Big Data Veil: Data Science Strategy for Water Utilities - Use-R!Use-R!

  2. Pingback: Lifting the Big Data Veil: Data Science Strategy for Water Utilities – Mubashir Qasim

      • Hi Peter,

        How nice to meet another dutchman here!
        Are there some more dutch companies interested? Is it possible to have a phonecall anytime?

        Greetz from the homeland <;-)

  3. Hi,
    I am a Business Consultant living in Madrid (Spain), with experience with data integration and business intelligence, and several years coding in R. Also with training in chemical process simulation and water desalination design. I am pursuing a changing professional career and work in water industry. I am happy to help and learn…

  4. Hi Peter,

    Thank you for this blog post; I recently started using R to help tackle non-revenue water/apparent losses for a utility in Florida (USA). I recently submitted a request to your Linkedin group and connect to your personal Linkedin and am interested in learning how other professionals are leveraging to potential of R in the water industry.

  5. Pingback: Data Science for Water Utilities Using R | The Devil is in the Data

Feel free to share your thoughts about this article