Modeling with Data for Beginners
WELCOME!
While you are waiting…
As you are getting settled, please take time to answer the modeling questions located at your table and the Mentimeter poll.
https://www.menti.com/alfnqjvoeuqj
Our Vision
THANK YOU to our partners and collaborators
It is the goal of the Data Science Academy and The Science House to encourage the use of data and development of data skills and literacy throughout K-12 education - to include the awareness of and ability to use mathematical modeling in everyday thinking.
Your Vision
(Anything you like! Be creative!)
Today’s Modeling with Data Challenge Question:
Is the Marvel Cinematic Universe representative of the population?
Math Modeling vs Word Problems
Math Modeling vs Word Problems
Modeling Problems:
Modeling problems do not provide all of the information and require students to use both mathematics and creativity.
Modeling with Data = Data Science
Math Modeling
Algorithms
Machine Learning & AI
Digital Humanities
Statistics
Computing
Coding
Geographic Information Systems (GIS)
Data Visualization
Asking personally or socially relevant data questions
Asking students questions that they care about or that are relevant to their lives and communities not only builds agency and drives engagement, but also teaches them how to start asking their own questions.
Is the Marvel Cinematic Universe representative of the population?
How can we understand students' cultures and interests?
Where to Find the Data
Authentic datasets can be found in a number of different places:
The Modeling Process
Math Modeling is an iterative process, but can be broken down into several basic steps:
Defining the Problem Statement
Task: At your tables, work as a collaborative group to brainstorm all of the ways to interpret the problem and possible variables to consider.
(Feel free to use any type of visual diagram or mind maps to brainstorm your ideas.)
Making Assumptions
Many modeling problems are too complex to solve outright.
Task: In your group, decide what considerations or variables can you fill in with assumptions?
Try to limit your problem to only 3 variables.
Defining Variables
Once you have defined the problem and thought out a list of assumptions, you should identify the most important aspects of the problem that can be measured. These are the variables.
Independent Variables = measurable inputs into the model
Dependent Variables = measurable outputs from the model
Model Parameters = Constants (unchanging parameters; possibly from some of the previous assumptions made)
Data Analysis
There are a number of platforms that allow you to easily analyze datasets: CODAP, DataClassroom, Tuva
Before looking at today’s dataset, first open the following 2 files:
MCU Character DB, mcu_box_office
Data Analysis - Tidy vs Clean Data
Characteristics of Tidy Data:
Cleaning Data:
Step 1: Remove duplicate, irrelevant, or unwanted observations
Step 2: Fix structural errors - naming conventions, typos, or incorrect capitalization
Step 3: Decide what to do about outliers
Step 4: Handle missing data
Data Analysis
Today we are working with the MCU_DB
We will bring our dataset into CODAP together.
After you familiarize yourself with CODAP, think about the variables in your model and start exploring!
Look for trends and potential relationships
Consider if you want or need to change your variables or assumptions (Modeling is and iterative process!)
Getting a Solution
Now that you have defined the problem, identified some measurable variables, and simplified the problem with assumptions, you have a basic initial mathematical model.
You will use this model to generate some preliminary answers to the problem.
There are a number of ways that you could calculate your answers - from using calculus or differential equations to using graphs - your mathematical toolbox will determine your next steps.
Getting a Solution
As you decide what approach to take in building your solution, the following considerations may be helpful:
Getting a Solution
Task: In your teams discuss and design a basic model that you think will help you answer our modeling question.
On the large note paper provided, layout your basic model.
Include:
Getting a Solution
Have students rotate and take a look at the other teams’ models and use the sticky notes to share their ideas about each model:
Model Assessment
Does My Answer Make Sense?
Encourage students to assess their model
If something looks off, first check the calculations or formulas. Then determine if changes to the assumptions or the math are needed.
Model Assessment: How Strong Is Our Model?
Analysis & Model Assessment
Is My Model Valid?
Have students change their parameters and note the outputs of their model. Record any thoughts, complications, etc.
Reporting the Results
Communication is a key aspect of data science and therefore students should be given opportunities to report out on their modeling process.
Reporting the Results
Key Ideas:
Data Pass - Data Visualization
Pass around each data visualization[.
During each round record the following:
Option 1
Round 1: One Understanding about the data
Round 2: One Question the data could answer
Round 3 : One Question the data creates
Data Pass - Data Visualization
Pass around each data visualization.
During each round record the following:
Option 2
Round 1: Write something that you
Round 2: Describe any trends that you may see.
Round 3: Is the style of graph the best for conveying this type of information? Why or Why not?
Data Visualization
Look back at your personal data card.
Look at you modeling! You’re a natural!