Sonya Lawrence
07/31/2022
2
Outline
3
Executive Summary
4
Introduction
Project background and context
SpaceX advertises Falcon 9 rocket launches on its website, with a cost of 62 million dollars; other providers cost upward of 165 million dollars each, much of the savings is because SpaceX can reuse the first stage. Therefore, if we can determine if the first stage will land, we can determine the cost of a launch. This information can be used if an alternate company wants to bid against SpaceX for a rocket launch.
Problem questions:
5
Section 1
6
Methodology
7
Data Collection
SpaceX API
Organized Data
Saved File
Wikipedia
Organized Data
Saved File
8
Data Collection – SpaceX API
9
Data Collection - Web Scraping
`1` meaning it was successful
`0` meaning it was unsuccessful.
10
Data Wrangling
Perform exploratory data analysis and determined training labels
Calculate the number of launches at each site
Calculate the number of occurrences of each orbit type
Create a landing outcome label called Case from the Outcomes column
Exported the cleaned data to a .csv file
11
EDA with Data Visualization
12
EDA with SQL in Jupyter Notebook
13
Folium Interactive Map
14
Plotly Dash Dashboard
15
Predictive Analysis (Classification)
Imported .csv data file into a Pandas dataframe
Created a Numpy array containing the 'Class' column
Standardized, fitted and transformed the data using StandardScalar
Split the data in training and testing sets using the train_test_split function
Created a GridSearchCV object with cv=10 to find the best parameters
Applied GridSearchCV to Logistic Regression, Support Vector Machine, Decision Tree, K-Nearest Neighbor
Calculated the accuracy of the test data using the .score() method for each model
Created and assessed the confusion matrix for each model
Found the model with the best parameters by comparing the model accuracy score
16
Results
Section 2
18
Flight Number vs. Launch Site
19
Payload vs. Launch Site
20
Success Rate vs. Orbit Type
21
Flight Number vs. Orbit Type
22
MOST SUCCESSFUL Orbit: LEO, ISS, and PO
23
Launch Success Yearly Trend Upwards since 2013
24
All Launch Site Names
25
Launch Site Names Begin with 'KSC'
26
Total Payload Mass
27
Average Payload Mass by F9 v1.1
28
First Successful Ground Landing Date
29
Successful Drone Ship Landing with Payload between 4000 and 6000
30
Total Number of Successful and Failure Mission Outcomes
31
Boosters Carrying Maximum Payload
32
2017 Launch Records
33
Rank Landing Outcomes Between 2010-06-04 and 2017-03-20
Section 3
SpaceX launch sites are located in the states of Florida and California in the Unites States of America.
35
Launch Site Locations – Global Map
36
Success Rates at Launch Sites
37
Distance from CCAFS SLC-40 to Landmarks
The distance between CCAFS SLC-40 and:
Section 4
39
Launch Success Count for all Sites
40
Most Successful Launch Site
41
Low Payload Mass = High Success Rate
Section 5
The most accurate classification algorithm for predicting a successful landing, from the models tested, is the Decision Tree Classifier (DTC) with an accuracy score of 0.889 and the following parameters:
43
Supervised Machine Learning Classification Accuracy
The confusion matrix for the DTC shows that the classifier can distinguish between the different landing classes.
The major problem is the false positives .i.e., unsuccessful landings marked as successful by the classifier.
44
Confusion Matrix for DTC
46
Conclusions
47
Appendix