Predicting Pipe Breaks
in the City of Atlanta
Thach Tran
Joseph Moravitz
Shweta Shalini
Anum Alimohammed
Portia Essuman
Presentation Outline
The Problem
City of Atlanta’s Water Distribution
America's Aging Water Infrastructure
Pipe Breaks in ATLANTA are an Issue
About the Data
Overview of Pipe Failures in Atlanta
Graph of Pipe Failures Over Time: 1997-2017
Trends in Pipe Failures
SEASONALITY
Trends in Pipe Failures
AGE
Trends in Pipe Failures
MATERIALS
Why does it matter?
Cost to Fix a Break
Cost to Inspect a Pipe before Break
Existing Solutions
Existing solutions
Our Solution
GOAL:
Identify water mains that are at high risk of failure in order to prevent breaks and reduce repair costs
Our Solution
A machine learning algorithm that predicts which pipes are most likely to break
Benefits:
Data Cleaning Process
Machine Learning Model
Model Performance
Confusion Matrix
Accuracy : 80%
| True Negative | False Positive |
NOT BROKEN | 42306 | 10294 |
| False Negative | True Positive |
BROKEN | 675 | 2737 |
Model Performance
Classification Report
| precision | recall | f1-score | support |
Not-Broken | 1.00 | 0.82 | 0.90 | 15780 |
Broken | 0.25 | 0.94 | 0.40 | 1024 |
avg / total | 0.95 | 0.83 | 0.87 | 16804 |
Key Findings:
Factors that Affect Likelihood of a Pipe Break
We considered 57 unique factors
Top Factors--by feature importance:
Shortcomings of Our Model
Thank You!
Shweta wants to know. . .
“Do you wanna play in the rain of the broken water main?”