Purple 7: Andrew, Billy, Pallavi, Shivangi, Slim, Vanessa
DATA ANALYSIS
MSIS 510
About Kickstarter
Visualization Tools
Multiple Linear Regression
Decision Tree Modeling
Logistic Regression
Conclusion
TABLE OF CONTENTS
01
02
03
04
05
06
Introduction
01
01
Kickstarter’s Mission
To bring more creativity into the world
What is Kickstarter?
ABOUT US
How Kickstarter Works
Creators (fund seekers)
Backers (Community members)
Micro-payments
Funding minus 5% commission
Exclusive products/
services
Exposure on platform
Successfully funded
Unsuccessfully funded
Dataset & Market Trends
Visualization Tools
06
02
Understand & Clean Data
Num.
DATA
Cate.
DATA
Launch Duration
Prepare Duration
Title
Blurb
Staff Picked
Category
(popularity..)
Location
(City, Country)
(Language..)
Deadline
(Week, Month…)
Supporter Number
Goal
Pledge Amount
Status
More Data?
COVID?
TEXT
DATA
What are the important variables?
Text | Classification | Time | Others |
Title Length | Category | Launch Time | COVID |
Blurb length | Country | Project Duration | Goal |
| City | Prepare Duration | Staff Pick |
| | Deadline | Spotlight |
| | | Pledged |
Correlation Coefficient Matrix
Successful vs. Failed Projects Across Categories
Success Rate Comparison Across the Top 10 Countries
Success Rate According to Different Word Counts of Names for the Projects
Percentage of the Success and Failure Based on Launch Day of the Week
COVID’s Effects on Project Performances
Multiple Linear Regression
06
03
What is Multiple Linear Regression?
THE MODEL
Pre-Processing our Data
Launch Season
Project Duration
Goal
Deadline Season
Staff Pick
Results
Coefficient: 0.04472
Pr(>|t|): <2e-16
Coefficient: 0.02777
Pr(>|t|): <2e-16
Coefficient: -0.00389
Pr(>|t|): 0.14
Coefficient: 0.01482
Pr(>|t|):<2e-16
Coefficient: 0.09114
Pr(>|t|):<2e-16
Decision Tree Modeling
06
04
Predictors Considered
____________________________________________________________________________________________
Target�_______________________________________
Decision Tree
Significant Variables
____________________________________________________________________________________________
Accuracy
___________________________________________________
73.3%
Logistic Regression
06
05
Logistic Regression 1
Question
When is the best time to launch a product and for how long?
Outcome Variable
______________________________________________________________________________________________________
Input Variables
______________________________________________________________________________________________________
Launch Season
Successful
Project
Failed
Project
Project
Duration
Launch
Day of Week
Results & Analysis
Significant Variables
______________________________________________________________________________________________________
Project Duration: 60-70 days
Launch Season: Winter
Launch Season: Summer
Launch Day: Tuesday
← -1.5
← 0.14
← -0.32
← 0.16
Accuracy: 67%
Winter
Tuesday
Shorter
Logistic Regression 2
Question
How should you categorize and describe your product?
Outcome Variable
______________________________________________________________________________________________________
Input Variables
______________________________________________________________________________________________________
Successful
Project
Failed
Project
Project
Category
Project Name Word Count
Results & Analysis
Significant Variables
______________________________________________________________________________________________________
12/15 Categories
12/16 Word Count Values
← -1 to 1.6
← 0.2 to 1.85
Accuracy: 60%
Comics
Dance
Design
Music
More words
in the title
(up to 16)
More Success
Conclusion
06
06
Conclusion
Kickstarter should take advantage of their data to give suggestions and instructions to the project creators and help them avoid unnecessary mistakes
A project’s success largely depends on the idea and delivery.
Our goal is to help make decisions to improve decisions towards launching a product on Kickstarter
Thank You!
Questions?