Data Science �Project Cycles & Common Pitfalls
Disclaimer: the opinions expressed in this presentation is presenters’ own and do not represent the view of presenters’ employers.
Data Science Project Cycles
Data Science Project Cycle (Overview)
Product
Concept
Soft skills: communication, leadership, collaboration and business insights
Big data infrastructure and tool sets.
Strong modeling background
Data
Information
Knowledge
Insight
Decision & Action
Business problem and value
Resources, milestone and timeline
Project Cycle
Planning
Formulation
Modeling
Production
Post-
Production
Cross Team
Collaboration
Agile-Style Project Management
Online vs Offline Training
Model trained in batch using offline data
Make features used in the model available online
Model use online data to make real time decisions
There are also offline-only models with regular batch process; and models training using online real time data and deploy in real time depending on different applications.
Common Pitfalls of Data Science Projects
Project Planning Stage
Problem Formulation Stage
Modeling Stage
Productionization Stage
Post-Production Stage
Soft Skills
Leading With Statistics
Communication:
Speaking the Same Language
Communication: Different Styles
Business Domain Knowledge
Keep on Track for Data Science Career
Fun Video: THE EXPERT
Hilarious but sadly true for many data science projects!
Probably you are the only data scientist in the room next time,
be prepared to fight back!