Data Analysis and Design
Data Jam
Justin Joque
Fall 2012
This presentation lis licensed under a Creative Commons Attribution NonCommercial Share Alike 3.0 license. Copyight 2012. Justin Joque.
Agenda
Needs Evaluation
3 Critical Components:
Analysis Tools
Remember: Focus on processes whenever possible
Data Design
Data Design - The Difficulty of Excel
Name | Pet |
John | Cat |
Alice | Dog |
Bob | Cat |
Name | Pet | Pet Name |
John | Cat | Whiskers |
Alice | Dog | Spot |
Bob | Cat,Dog | Mittens,Sparky |
Data Design - The Difficulty of Excel
Name | Pet |
John | Cat |
Alice | Dog |
Bob | Cat |
Name | Pet1 | Pet Name1 | Pet2 | Pet Name2 |
John | Cat | Whiskers | | |
Alice | Dog | Spot | | |
Bob | Cat | Mittens | Dog | Sparky |
Data Design - The Difficulty of Excel
Name | Pet |
John | Cat |
Alice | Dog |
Bob | Cat |
Name | Pet | Pet Name |
John | Cat | Whiskers |
Alice | Dog | Spot |
Bob | Dog | Sparky |
Bob | Cat | Mittens |
Data Design - Relational Databases
Name | ID |
John | 1 |
Alice | 2 |
Bob | 3 |
Person_ID | Pet | Pet Name |
1 | Cat | Whiskers |
2 | Dog | Spot |
3 | Dog | Sparky |
3 | Cat | Mittens |
Data Design - Flatness
Data Design - Uniqueness
Data Enterers
Likelihood of Error or Willful Abandonment of the Planned Data Entry Method
Data Design - Authorities
Case Study - HRWC
Site Survey Data Description: HRWC divided the county up into numbered bioreserves which were then combined (geographically) with parcels. They then use this information to try to contact property owners, get permission to survey the property, and then record the results.
Problem: They have two different database, one for the addresses and contact information and one for the survey information. The two databases do not interact very well.
Case Study - HRWC
Step 1: Try to figure out what is going on with the sample data.
Case Study - HRWC
Step 2: Try to think through how the tables could be rearranged and data entry could be better controlled.
Case Study - HRWC
Step 3: Try to think about what analyses could easily be mined from your new data structure.