FOR INSTRUCTOR PURPOSES ONLY
INSTRUCTOR NOTES
FOR INSTRUCTOR PURPOSES ONLY
MATERIALS
FOR INSTRUCTOR PURPOSES ONLY
PRE-WORK
Insert Instructor Name
Title, Company
INTRODUCTION TO REGRESSION ANALYSIS
LEARNING OBJECTIVES
INTRODUCTION TO REGRESSION ANALYSIS
INTRODUCTION TO REGRESSION ANALYSIS
PRE-WORK
PRE-WORK REVIEW
OPENING
INTRODUCTION TO REGRESSION ANALYSIS
WHERE ARE WE IN THE DATA SCIENCE WORKFLOW?
INTRODUCTION
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
SIMPLE LINEAR REGRESSION
DEMO
REGRESSING AND NORMAL DISTRIBUTIONS
DEMO: REGRESSING AND NORMAL DISTRIBUTIONS
GUIDED PRACTICE
USING SEABORN TO GENERATE SIMPLE LINEAR MODEL PLOTS
EXERCISE
Two plots
DELIVERABLE
DIRECTIONS (15 minutes)
ACTIVITY: GENERATE SINGLE VARIABLE LINEAR MODEL PLOTS
INTRODUCTION
SIMPLE REGRESSION ANALYSIS IN SKLEARN
SIMPLE LINEAR REGRESSION ANALYSIS IN SKLEARN
CLASSES AND OBJECTS IN OBJECT ORIENTED PROGRAMMING
# generate an instance of an estimator class�estimator = base_models.AnySKLearnObject()�# fit your data�estimator.fit(X, y)�# score it with the default scoring method (recommended to use the metrics module in the future)�estimator.score(X, y)�# predict a new set of data�estimator.predict(new_X)�# transform a new X if changes were made to the original X while fitting�estimator.transform(new_X)
SIMPLE LINEAR REGRESSION ANALYSIS IN SKLEARN
DEMO
SIGNIFICANCE IS KEY
DEMO: SIGNIFICANCE IS KEY
GUIDED PRACTICE
USING THE LINEAR REGRESSION OBJECT
EXERCISE
Two new models
DELIVERABLE
DIRECTIONS (15 minutes)
ACTIVITY: USING THE LINEAR REGRESSION OBJECT
EXERCISE
X =�y =�loop = []�for boolean in loop:� print 'y-intercept:', boolean� lm = linear_model.LinearRegression(fit_intercept=boolean)� get_linear_model_metrics(X, y, lm)� print
Two new models
DELIVERABLE
DIRECTIONS (15 minutes)
ACTIVITY: USING THE LINEAR REGRESSION OBJECT
INDEPENDENT PRACTICE
BASE LINEAR REGRESSION CLASSES
EXERCISE
Note: We’ll cover these new regression techniques in a later class.
New models and evaluation metrics
DELIVERABLE
DIRECTIONS (20 minutes)
ACTIVITY: BASE LINEAR REGRESSION CLASSES
INTRODUCTION
MULTIPLE REGRESSION ANALYSIS
MULTIPLE REGRESSION ANALYSIS
BIKE DATA EXAMPLE
GUIDED PRACTICE
MULTICOLLINEARITY WITH DUMMY VARIABLES
EXERCISE
Two models’ output
DELIVERABLE
DIRECTIONS (15 minutes)
ACTIVITY: MULTICOLLINEARITY WITH DUMMY VARIABLES
EXERCISE
lm = linear_model.LinearRegression()�weather = pd.get_dummies(bike_data.weathersit)�get_linear_model_metrics(weather[[1, 2, 3, 4]], y, lm)�print�# drop the least significant, weather situation = 4�get_linear_model_metrics(weather[[1, 2, 3]], y, lm)
Two models’ output
DELIVERABLE
DIRECTIONS (15 minutes)
ACTIVITY: MULTICOLLINEARITY WITH DUMMY VARIABLES
GUIDED PRACTICE
COMBINING FEATURES INTO A BETTER MODEL
EXERCISE
DIRECTIONS (15 minutes)
ACTIVITY: COMBINING FEATURES INTO A BETTER MODEL
Visualization of correlations, new models
DELIVERABLE
EXERCISE
lm = linear_model.LinearRegression()�bikemodel_data = bike_data.join() # add in the three weather situations��cmap = sns.diverging_palette(220, 10, as_cmap=True)�correlations = # what are we getting the correlations of?�print correlations�print sns.heatmap(correlations, cmap=cmap)��columns_to_keep = [] #[which_variables?]�final_feature_set = bikemodel_data[columns_to_keep]��get_linear_model_metrics(final_feature_set, y, lm)
DIRECTIONS (15 minutes)
ACTIVITY: COMBINING FEATURES INTO A BETTER MODEL
Visualization of correlations, new models
DELIVERABLE
INDEPENDENT PRACTICE
BUILDING MODELS FOR OTHER Y VARIABLES
EXERCISE
A new model and evaluation metrics
DELIVERABLE
DIRECTIONS (25 minutes)
ACTIVITY: BUILDING MODELS FOR OTHER Y VARIABLES
BONUS
CONCLUSION
TOPIC REVIEW
CONCLUSION
WEEK 3 : LESSON 6
UPCOMING WORK
Week 4 : Lesson 8
UPCOMING WORK
Q & A
INTRODUCTION TO REGRESSION ANALYSIS
EXIT TICKET
INTRODUCTION TO REGRESSION ANALYSIS
DON’T FORGET TO FILL OUT YOUR EXIT TICKET!
THANKS!
INSTRUCTOR NAME