Tidying and Wrangling
Making data useful!
Zachary del Rosario (He/Him)
1
Workshop Schedule
Extract
Wrangle + Tidy
Friday
Saturday
Visualize
Model
Sunday
Monday
Tabula +
WebPlotDigitizer
Python + Jupyter
Concepts
Execution
Concepts
Execution
Concepts
Fin
Focus
Live
Take-Home
2
Tidying and Wrangling
Tidying: Reshaping to tidy data
Wrangling: Unit conversions, data types, invalid/missing values, etc.
Why are these important? (In-chat)
3
Survey Time!
4
Looking Forward
Visualizing Tidy Data!
5
Visualizing Tidy Data is Trivial
(
df_converted
>> ggplot(aes(“sigma_MPa”))
+ geom_density()
)
6
Visualizing Tidy Data is Trivial
(
df_converted
>> ggplot(aes(“sigma_MPa”))
+ geom_density()
+ theme_minimal()
+ labs(
x=”Critical Stress (MPa)”,
y=”Density (-)”
)
)
7
Tonight’s Exercise
8
Tonight’s Notebook: Programmatic Data Operations
03_data_assignment
9
End of Today
10