1 of 33

Classification Central A Classification Playground for Aspiring Data Scientists By Data Scientists

2 of 33

How many of you wish you had a site to learn about classification on an interactive basis?

3 of 33

How many of you wish you had a good explanation for each piece of code that’s happening in a model on an interactive basis?

4 of 33

Project Goal

Provide Aspiring Data Scientists with

1. A clear understanding of each step within the classification modeling process

2. A tutorial of how to classify their own dataset

5 of 33

Classification Techniques�Explained

Logistic Regression

KNN

Random Forest

Naïve Bayes

6 of 33

7 of 33

  1. Check for and Remove Null Values
  2. Summarize Your Data
  3. Replace the Classifier with 0 and 1
  4. Implement Test and Training Sets
  5. Determine the Accuracy Score
  6. Output a Confusion Matrix

8 of 33

�Logistic Regression Summary

9 of 33

�KNN Summary

10 of 33

�Random Forest Summary

11 of 33

Naïve Bayes Summary

12 of 33

13 of 33

14 of 33

Tutorial Steps

15 of 33

What is the most important step in any modeling?

16 of 33

Understanding your dataset!!!

17 of 33

How do you understand your data?

  • By performing Exploratory Data Analysis
  • Histogram
  • Box Plot
  • Outlier Analysis
  • Central Tendency
  • Dispersion

18 of 33

Determine Column Name Containing the Nulls, then Delete

19 of 33

20 of 33

Let’s Replace Categorical Values to Quantitative

21 of 33

Replace Cont’d

22 of 33

Test and Training Data

Part 1: Select independent variables

Part 2: Select a dependent variable

Part 3: Splits the dataset into test and training 

Part 4: Create a model

Part 5: Train the data

Part 6: Test the model

23 of 33

Accuracy: Can you spot the difference?

24 of 33

How accurate is the model?

25 of 33

Output Confusion Matrix

26 of 33

27 of 33

28 of 33

29 of 33

Data Report

30 of 33

31 of 33

Null Values

32 of 33

Visualizations

33 of 33

Classification Central Video Walk-Through:�

https://www.youtube.com/watch?v=QCucIBfCy84