Social Science R Onboarding
For Data Carpentry Instructors
October-November 2018
Who is this onboarding for?
Please note: This onboarding is not a substitute for Instructor Training or any stage of the Instructor checkout process. If you are not a certified Instructor, you will need to complete Instructor Training and checkout before teaching at a centrally-organized Carpentry workshop.
What if I’m not an Instructor?
Why should I complete onboarding?
Please note: This onboarding is not a substitute for Instructor Training or any stage of the Instructor checkout process. If you are not a certified Instructor, you will need to complete Instructor Training and checkout before teaching at a centrally-organized Carpentry workshop.
This onboarding will not cover:
These materials are in alpha development and are not currently part of Data Carpentry core curricular offerings.
Questions / Discussion
Origins of Curriculum
Introduction to the dataset
Data description: https://datacarpentry.org/socialsci-workshop/data/
Tabular dataset from SAFI (Studying African Farmer-Led Irrigation) project. Surveys conducted by smartphone application with questions about households and agricultural practices.
Subset of larger study - 131 responses included.
Workshop structure
Lesson | Length |
Data Organization in Spreadsheets* | 2 to 3 hours |
Data Cleaning with OpenRefine* | 2 to 3 hours |
Data Analysis and visualisation with R | 8+ hours |
*Spreadsheets + OpenRefine lessons take longer than in Ecology versions
Dataset versions
Different versions of data file for the different applications:
Illustrate different problems with data organization and cleaning. Essential to use the correct version for each exercise.
Audience
Workshop overview
Questions / Discussion
Data Organization in Spreadsheets
Data Organization in Spreadsheets
Dataset formatting
Several columns include lists within a cell - separated by semicolon (;).
Dealt with explicitly in OpenRefine lesson.
Can cause problems on data import to LibreOffice.
Things to watch out for
Detailed in the Instructor Notes.
Don’t rush! Take the time to troubleshoot. Use helpers for one-on-one support.
Questions / Discussion
Data Cleaning with OpenRefine
Link to lesson: https://datacarpentry.org/openrefine-socialsci/
Data Cleaning with OpenRefine
Questions / Discussion
Data Analysis and visualisation with R
Uses the following R packages:
All installed through tidyverse.
Link to lesson: https://datacarpentry.org/r-socialsci/
Data Analysis and visualisation with R
Link to lesson: https://datacarpentry.org/r-socialsci/
Data Analysis and visualisation with R
Link to lesson: https://datacarpentry.org/r-socialsci/
Data Analysis and Visualisation with R
Link to lesson: https://datacarpentry.org/r-socialsci/
Data Analysis and Visualisation with R
Link to lesson: https://datacarpentry.org/r-socialsci/
Things to watch out for
Questions / Discussion
Preparing to Teach
What next?
How to get help?
Questions?