Titanic - Machine Learning from Disaster
Meet the workshop leads
Sissy He
Second Year SE
Aisha Khatun
Masters CS
Vivian Guo
Second Year CFM
Sabina Gorbachev
Third Year CS/BBA
Molly Xu
Third Year CS
Problem Introduction
Brief intro to the Titanic Challenge
What is this Challenge about?
The Titanic
What is this Challenge about?
The Challenge
The Data
Technologies
Poll:
Are you familiar with Python?
Python
PANDAS
SCIKIT LEARN
Example Code
PANDAS
import pandas as pd
train_data = pd.read_csv("...")
SCIKIT LEARN
from sklearn... import ...
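A minimal sketch of how the two libraries are typically used together. The CSV path is elided on the slide, so this example builds a small made-up table instead; the rows, the choice of columns, and the choice of `train_test_split` as the scikit-learn import are all assumptions for illustration.

```python
import pandas as pd
from sklearn.model_selection import train_test_split

# Hypothetical stand-in for the workshop's training data (values are made up).
train_data = pd.DataFrame({
    "Pclass":   [3, 1, 3, 1, 2, 3, 2, 1],
    "Parch":    [0, 0, 0, 1, 0, 2, 1, 0],
    "Survived": [0, 1, 0, 1, 1, 0, 0, 1],
})

# pandas holds the table; scikit-learn splits it into training and validation parts.
X = train_data[["Pclass", "Parch"]]
y = train_data["Survived"]
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.25, random_state=0
)

print(X_train.shape, X_valid.shape)
```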
What is AI and ML? (And the difference)
AI
(Artificial
Intelligence)
It's the quest to build machines that can reason, learn, and act intelligently, and it has barely begun. It covers the latest advances in machine learning, neural networks, and robots. (MIT)
Machine Learning
Machine learning is a subfield of artificial intelligence that gives computers the ability to learn without explicitly being programmed (MIT)
Data Exploration
Data Exploration
What kind of information does each feature hold?
Let us explore a few features:
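One possible way to take a first look at the features with pandas. The table below is a small made-up sample, not the real Titanic data; only the column names follow the dataset.

```python
import pandas as pd

# Hypothetical sample rows (values are made up for illustration).
train_data = pd.DataFrame({
    "Pclass":   [3, 1, 3, 1, 2, 3],
    "Sex":      ["male", "female", "female", "female", "male", "male"],
    "Age":      [22.0, 38.0, 26.0, 35.0, None, 54.0],
    "Survived": [0, 1, 1, 1, 0, 0],
})

column_types = train_data.dtypes              # data type of each feature
missing_counts = train_data.isna().sum()      # missing values per feature
sex_counts = train_data["Sex"].value_counts() # how often each category occurs
age_summary = train_data["Age"].describe()    # count, mean, min, max, quartiles

print(column_types)
print(missing_counts)
print(sex_counts)
print(age_summary)
```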
Poll:
How many rows are there with Pclass = 3?
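A row count like this can be sketched with a boolean filter. The data here is a made-up six-row sample, so the number it prints is not the answer to the poll.

```python
import pandas as pd

# Hypothetical sample; run the same lines on the real training data for the poll.
train_data = pd.DataFrame({"Pclass": [3, 1, 3, 1, 2, 3]})

# Comparing a column to a value gives True/False per row; summing counts the Trues.
third_class_rows = int((train_data["Pclass"] == 3).sum())

# value_counts() gives the count for every class at once.
rows_per_class = train_data["Pclass"].value_counts()

print(third_class_rows)
print(rows_per_class)
```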
Prediction
Prediction
Imagine you are taken back in time and are aboard the Titanic. You see lots of people around you.
Can you look at someone, and predict whether or not they will survive the Titanic sinking?
Other use cases:
Prediction
Let's predict survival based on only one feature.
Try Yourself
Survival prediction by Parch
Demo
Survival prediction by Gender
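A minimal sketch of a one-feature prediction by gender, assuming the simple rule "predict that every female passenger survives and every male passenger does not". The rule and the eight sample rows are assumptions for illustration and may differ from the live demo.

```python
import pandas as pd

# Hypothetical sample rows (values are made up).
train_data = pd.DataFrame({
    "Sex":      ["male", "female", "female", "female",
                 "male", "male", "female", "male"],
    "Survived": [0, 1, 1, 0, 0, 1, 1, 0],
})

# Rule: female -> 1 (survived), male -> 0 (did not survive).
predicted = (train_data["Sex"] == "female").astype(int)

# Accuracy is the fraction of rows where the prediction matches the truth.
accuracy = (predicted == train_data["Survived"]).mean()
print(accuracy)
```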
Poll:
What is the accuracy if we only use Pclass for predicting survival?
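One way to approach this is a majority rule per class: predict survival for a class only if more than half of its passengers survived. The sketch below uses made-up rows, so its result is not the poll answer; running the same steps on the real training data gives the real figure.

```python
import pandas as pd

# Hypothetical sample rows (values are made up).
train_data = pd.DataFrame({
    "Pclass":   [1, 1, 1, 2, 2, 2, 3, 3, 3, 3],
    "Survived": [1, 1, 0, 1, 0, 0, 0, 0, 1, 0],
})

# Survival rate within each passenger class.
survival_rate = train_data.groupby("Pclass")["Survived"].mean()

# Majority rule: predict 1 for classes where most passengers survived.
class_rule = (survival_rate > 0.5).astype(int)

# Apply the rule to every row and measure how often it is right.
predicted = train_data["Pclass"].map(class_rule)
accuracy = (predicted == train_data["Survived"]).mean()
print(accuracy)
```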
Decision Trees
Decision Trees & Random Forest
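A short sketch of training both model types with scikit-learn. It uses synthetic data from `make_classification` as a stand-in for the Titanic features, and the hyperparameters (`max_depth=3`, `n_estimators=100`) are illustrative assumptions, not the workshop's settings.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (assumption): 300 rows, 6 numeric features, binary label.
X, y = make_classification(
    n_samples=300, n_features=6, n_informative=3, random_state=0
)
X_train, X_valid, y_train, y_valid = train_test_split(
    X, y, test_size=0.25, random_state=0
)

# A single decision tree: one sequence of yes/no questions about the features.
tree = DecisionTreeClassifier(max_depth=3, random_state=0).fit(X_train, y_train)

# A random forest: many trees trained on random subsets, combined by voting.
forest = RandomForestClassifier(n_estimators=100, random_state=0).fit(X_train, y_train)

tree_accuracy = tree.score(X_valid, y_valid)
forest_accuracy = forest.score(X_valid, y_valid)
print(tree_accuracy, forest_accuracy)
```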
Poll:
Is a Random Forest with 150 estimators and a max_depth of 5 better or worse?
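A question like this can be checked by cross-validating both settings on the same data. In the sketch below, the baseline forest (scikit-learn defaults) and the synthetic data are assumptions, since the slide does not say what the 150-estimator model is compared against; only `n_estimators=150` and `max_depth=5` come from the poll.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

# Synthetic stand-in data (assumption), not the Titanic features.
X, y = make_classification(
    n_samples=300, n_features=6, n_informative=3, random_state=0
)

# Hypothetical baseline: scikit-learn's default forest settings.
baseline = RandomForestClassifier(random_state=0)

# The setting from the poll.
candidate = RandomForestClassifier(n_estimators=150, max_depth=5, random_state=0)

# 5-fold cross-validation gives five accuracy scores per model.
baseline_scores = cross_val_score(baseline, X, y, cv=5)
candidate_scores = cross_val_score(candidate, X, y, cv=5)

print("baseline :", baseline_scores.mean())
print("candidate:", candidate_scores.mean())
```

Whichever mean is higher on the real data answers the poll for that dataset; the outcome on synthetic data says nothing about the Titanic result.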
Thank You For Listening! :)