Hoops and Algorithms: Predicting NBA Winners
Big Picture
Expanding Accessibility & Fan Engagement –
Feature Variables
Real time predictions of an NBA game
Inputs Live data to calculate win probability
Assists in informed strategy decisions
Enhances Live Sports Broadcasting
Historical Model Process
Clean Data
Training
Historical Model
Historical Model
Data Source
The input data is from the airflow, which came from a public API
Key Features
Used feature variables of the score difference of the game, time left in the game and the pregame win probability
Model Training
Trained the model on past in-game data
Visualization
Built a matplotlib that shows the changes of win probability based on the time left in the game
Building a Real-Time Model
Real-Time Analysis
Implementing an
ELT Pipeline
Snowflake
dbt
DAGs and Cosmos DAG
Challenges
Data Types
Different Data Types
RAM
Ram Limitations
Integrating DAGS
Integrating multiple DAGs into Cosmos' single DAG structure
Next Steps
Limitations in our current model:
Future Improvements:
Thank You!