Identifying Routes in the NFL
Dani Chu
M.Sc. Student - Statistics, Simon Fraser University
Graduate Intern - Basketball Strategy & Analytics, NBA
1
Collaborators
2
NFL Big Data Bowl
3
NFL Big Data Bowl
4
The Data
5
Example - Eligible Receivers Only
6
Example - Eligible Receivers Only
7
Receiver Performance
8
Receiving Statistics
How can we account for usage?
9
Route Identification
10
Why is it Useful?
11
In the NBA
12
What is a Route?
13
*Star Tribune
Example - Eligible Receivers Only
14
Functional Data
15
Functional Data
16
Functional Data
17
Bezier Curves
18
Bezier Curve
Defined on with control points
Where
In 2 dimensions and degree 9
19
Curve Clustering
20
Route Tree Reminder
21
*Star Tribune
Pre-processing: Data
22
Curve Clustering
Assume
Then
23
Expectation Step
In our case:
Observations are 2 dimensional curves of different lengths
Parameters are control points of bezier curve and variance parameter
For zjk = 1 when observation j is from cluster k
Update the expected cluster membership for each observation
24
Maximization Step
Update parameters for cluster k through
25
Cluster Means
26
Route Labelling
27
Cluster 20 - Post
28
WRs with most Routes per Team
29
Full Process
30
Route Profiles
31
Position Comparisons
32
WR Usage
33
Air Yards Over Expectation
34
Air Yards Over Expectation (>100 Routes)
35
Who’s the Best at Running Routes?
36
Play Database
37
Play Database
Can identify play description, results and other information based on:
For example we can pull film for:
38
Thank You!
Any Questions?
Email:
Twitter:
Email:
Twitter:
39
40
RB Usage
41
Targets Over Expectation
42
Targets Over Expectation (> 100 Routes)
43
Who’s the Best at Running Routes?
44