How can we create data visualizations that represent the CS Pioneers Calendar?
Session Facilitator
Layla Quiñones
CS4ALL
6- 12 Implementation team
Warm Up
What comes to mind when you think of Data Science?
What is Data Science?
Data Science is a field of study that uses computer science concepts to help analyze, visualize and communicate data to the world.
Focuses on elevating the data narrative - the story that the data is telling us.
What are data visualizations?
Data visualizations are used to help illustrate relationships within the data. This provides evidence for the data narrative.
How do we visualize data?
Identify variables, observations and data types
What are some variable data types?
Quantitative (numerical values): The information is recorded as numbers and represents an objective measurement or a count. Temperature, weight, and a count of transactions are all quantitative data. Analysts also refer to this type as numerical data. Resource
What are some variable data types?
Qualitative (categorical values): The information represents characteristics that you do not measure with numbers. Instead, the observations fall within a countable number of groups. In fact, this type of variable can capture information that isn’t easily measured and can be subjective. Taste, eye color, race, and marital status are all types of qualitative variables.
What are data visualizations?
Bar charts should be used when comparing categorical and quantitative data.
Scatter plots should be used when you are exploring how one variable changes with respect to another.
Line plots should be used when exploring how two continuous values that are continuous respond to one another.
Pie charts should be used when you are comparing each part of a whole to each other.
Histograms should be used when using data that counts the number of times a specific data point appears in the data set. Is also used to analyze the distribution of data.
Box Plot should be used when analyzing the distribution of data
Why should we create visualizations using the CS Pioneers calendar?
What do each pioneer have in common?
Identify some categories/characteristics we can measure/observe for each pioneer.
Why should we create visualizations using the CS Pioneers calendar?
Look at the pioneers calendar data set model
Why should we create visualizations using the CS Pioneers calendar?
Does the CS Pioneers Calendar succeed in representing historically excluded groups?
Historically underrepresented groups in tech include:
Data Aquisition
How can we create a data set from the pioneers calendar?
ACTIVITY: Creating a Data Set
1. Copy the google sheet and add the CS pioneers using this pioneer calendar AND add one of your own
2. Submit your dataset in the following form
Data Exploration & Visualization
Why should we create visualizations using the CS Pioneers calendar?
Visualizations helps people understand and interpret relationships between variables.
For example we can identify relationships between:
Model: Data Exploration
Model Visualization
Ensure you execute the following steps to help you visualize data.
Checklist:
Model Visualization
Question: What races and genders are represented in the pioneers calendar?
Variables used: Race & Gender - both categorical
Visualization: Clustered bar chart or stacked bar chart
Model Visualization
Model Visualization
This visualization shows the relationship between race and gender in the CS Pioneers calendar data.
We can see that:
Model Visualization
This visualization offers the same insight as the previous except for the comparison of the totals for each race (stacked).
ACTIVITY: Generating Visualizations
1. Make a copy of the google sheet and generate your own data visualization.
2. Submit your samples in the following form
Thank You!
See you next time!