Communicate Data Findings Project
Dataset Options
To complete this project, select one of the datasets in the table, or you can select your own dataset. For guidelines on choosing your own dataset, see below the table.
Dataset | Overview and Notes | Example Topics/Questions |
This data set includes information about individual rides made in a bike-sharing system covering the greater San Francisco Bay area.
| When are most trips taken in terms of time of day, day of the week, or month of the year? How long does the average trip take? Does the above depend on if a user is a subscriber or customer? | |
This dataset reports flights in the United States, including carriers, arrival and departure delays, and reasons for delays, from 1987 to 2008.
| Are there certain destination or arrival cities that are home to more delays or cancellations? What are the preferred times for flights to occur? Are there any changes over multiple years? | |
This data set contains 113,937 loans with 81 variables on each loan, including loan amount, borrower rate (or interest rate), current loan status, borrower income, and many others.
| What factors affect a loan’s outcome status? What affects the borrower’s APR or interest rate? Are there differences between loans depending on how large the original loan amount was? | |
Note: The unzipped PISA Data csv file is 2.75 GB. | PISA is a survey of students' skills and knowledge as they approach the end of compulsory education. It is not a conventional school test. Rather than examining how well students have learned the school curriculum, it looks at how well prepared they are for life beyond school. Around 510,000 students in 65 economies took part in the PISA 2012 assessment of reading, mathematics and science representing about 28 million 15-year-olds globally. Of those economies, 44 took part in an assessment of creative problem solving and 18 in an assessment of financial literacy.
| How does the choice of school play into academic performance? Are there differences in achievement based on gender, location, or student attitudes? Are there differences in achievement based on teacher practices and attitudes? Does there exist inequality in academic achievement? |
Or select your own dataset! | See below for guidelines on whether or not a dataset will be appropriate for use in this project. Remember that finding and cleaning your own data set could take significant time and effort! |
Your dataset should:
Here are some resources to help you find a dataset: