Predicting Restaurant Traffic
Art or Science?
Manu Lohiya
August 2017
Data Science Bootcamp, General Assembly
The Problem Statement
I am an independent restaurant owner with a very small marketing budget. How can I bring in more customers?
Hypothesis
The number of “check-ins” at a restaurant has a relationship with various attributes of the restaurant’s profile
The Data: DataSF + Foursquare
2. DataSF + Foursquare Mapping (Foursquare API Search)
3. Foursquare Attributes (Foursquare API Venues)
5,568 Rows
EDA - Health Scores
HIGH RISK Restaurants (Lowest Quartile Scores)
LOW RISK Restaurants (Highest Quartile Scores)
EDA - Collinearity
Training - Linear Regression (checkinsPerDay ~ X)
Feature | Adj. R^2 | t score | Result |
Number of Ratings | 0.655 | 54 | Significant |
Avg Rating Score | 0.21 | 21 | Significant |
Avg Health Score | 0.01 | 3 | Significant |
Has Menu | 0.02 | 5 | Significant |
Price | 0.05 | 9 | Significant |
Testing: Using Sklearn and cross-validation (RatingSignals vs CheckinsPerDay) to predict
Takeaways: A restaurant owner should focus on trying to get as many ratings as possible. This feature alone predicts 65% of checkins.
It is interesting to note that the quality of the ratings matters less. In other words, customers are more likely to come to a restaurant with more ratings and a lower score than a restaurant with high score and few ratings.
Method | Score |
Sklearn on df_test | 0.65 |
Cross-validation on df | 0.67 |
Next Steps
Thank You!