Forecasting New Jersey
Train Delays for Rail Track Upgrades
Project by Stephanie Cheng and Shreya Bansal
ON TRACK
NJ Transit Delays
Data Range:
May-August 2018
Newark Penn Station experienced over 2,000 hours of delays cumulatively over 4 months
Crowded trains
Source: Kevin R. Wexler-NorthJersey.com
Climate stress on train tracks
Source: Railroadrails.com
Proposal: Prevention is better than cure
NJT Capital Use program (2010)
Source: Transit.dot.gov
Assumption: trains are delayed mostly due to climatic stress and forecasting when delays will occur will allow NJ Transit to optimize repair funding more efficiently and prevent delays.
Source: Bloomberg
Model: Variables
CLIMATE
Temperature
Precipitation
Wind Speed
Visibility
TIME
Day of the week
Hour
Time of Day
TIME LAG
hour lags
day lag
holiday lag
SPACE
Station Name
Line
Models: Regression
Regression 3 <- lm (delay_minutes ~ station + hour + day of the week + Temperature + Precipitation + Visibility + Wind_Speed + lagHour + lag2Hours +lag3Hours +lag12Hours + lag1day + holidayLag + holiday
Model Testing:
K-Fold Cross Validation
Model Accuracy
Further Explorations: Heat Island Effect, Tree Cover
Benefits:
Conclusion