Statistical Analysis of Pump Failure
Analysis by Allison Fultz
1
Project Overview
Problem:
Understand what variables may drive an asset failure in Southern Water Corporation’s water plant pumps
Goals:
Process Using Excel for Analysis:
Product:
Statistical Alarm Signal to Predict Pump Failure
2
Descriptive and inferential statistical methodologies have proven effective in creating a proactive ‘alarm’, accurately identifying Pump Failures with Horse Power (HP) and Pump Efficiency (PE) emerging as key variables of interest with deviations of 15 HP and > 3 % PE being our core signal thresholds.
3
Descriptive Analysis has enabled us to clearly identify particular signature abnormalities showing clear signature changes in both Rolling Standard Deviation and Rolling Mean Datasets when observed over the respective failure period of interest.
4
Further segmentation of the data via binary means (Pump Failure = 0 or 1) illustrated through Box Plots, show a clear signature difference between that of normal behaviour and that of Failure with Pump Torque, Pump Speed, and Pump Efficiency showing the 3 largest variances.
5
Correlation analyses across datasets yield interesting insights with Pump Efficiency, Volumetric Flow Meter 1, and Volumetric Flow Meter 2 negatively correlated with Pump Failure in the Rolling Mean Data, whilst Horse Power, Pump Speed, and Pump Torque show a subsequently strong positive correlation in the Rolling Stdev Dataset.
6
Lastly, analysis of the model fit reveals that with a R Squared of .78, a linear model is a good fit for the data with variables Horse Power and Pump Efficiency having the largest coefficients, indicative that these variables have the most immediate relationship with respect to Pump Failure behaviour.
7