US-Specific | Education | Name | Link | Description
1 | 1 | Urban Institute's Education Data Portal/Explorer | https://educationdata.urban.org/data-explorer/ | School, district, and college-level data.
1 | 1 | Stanford Education Data Archive | https://edopportunity.org/get-the-data/ | School- and district-level test score data comparable across tests.
0 | 1 | American Economic Association Data Sources | https://www.aeaweb.org/resources/data | Various
1 | 1 | NCES Data Lab | https://nces.ed.gov/datalab/index.aspx | Various federal education data
1 | 1 | IPUMS | https://ipums.org/ | Census and survey data from around the world.
0 | 1 | Data Is Plural Structured Archive | https://docs.google.com/spreadsheets/d/1wZhPLMCHKJvwOkP4juclhjFgqIY8fQFMemwKL2c64vk/edit#gid=0 | Various
1 | 1 | California Department of Education Public Data Files | https://www.cde.ca.gov/ds/ | CA education data
1 | 1 | California Department of Education Student and School Data Files | https://www.cde.ca.gov/ds/sd/sd/ | Data files pertaining to student and school demographics that can be downloaded to your computer.
1 | 1 | California Department of Education Staff Data Files | https://www.cde.ca.gov/ds/sd/df/ | Data files pertaining to staff demographics that can be downloaded to your computer.
1 | 1 | School Finance Indicators Database | http://schoolfinancedata.org/ | "a collection of sophisticated school finance measures for assessing the adequacy and fairness of each state’s revenue, spending, and resource allocation, and for comparing these outcomes between states and over time"
0 | 0 | Tidy Tuesday | https://github.com/rfordatascience/tidytuesday | Weekly data project aimed at online R learning community
1 | 1 | American Community Survey | https://www.census.gov/programs-surveys/acs | A variety of data from the US Census Bureau
1 | 1 | DATA.GOV | https://www.data.gov/ | Clearinghouse for public data from the federal government
1 | 1 | National Household Education Surveys Program Data Products | https://nces.ed.gov/nhes/dataproducts.asp | Since its inception in 1991, the NHES has fielded topical survey modules about early childhood care and education, children's readiness for school, parents' perceptions of school safety and discipline, before- and after-school activities of school-age children, participation in adult and career education, parents' involvement in their children's education, school choice, homeschooling, and civic involvement. Since 2007, the NHES has focused on four main topics: young children's care and education before school, participation in adult training and education, parents' involvement in their children's education–including school choice–and homeschooling.
1 | 1 | Illinois State Board of Education Report Card Data Library | https://www.isbe.net/Pages/Illinois-State-Report-Card-Data.aspx | School- and district-level data in IL
1 | 1 | Illinois State Board of Education Expulsions, Suspensions, and Truants | https://www.isbe.net/Pages/Expulsions-Suspensions-and-Truants-by-District.aspx | District-level discipline data
1 | 1 | Common Core of Data | https://nces.ed.gov/ccd/ccddata.asp | School- and district-level nationwide data
1 | 1 | Evidence Project Data Explorer | https://www.evidence-project.org/find-research/data-explorer | This page includes interactive visualizations of data from COVID-19 K-12 studies.
1 | 1 | American State Administrators Project (ASAP) | https://asap.wisc.edu/ | "a decades-long survey of state agency leaders...a 50-state chronological portrait of state administrative leaders, what they think, and what their agencies do"
1 | 1 | National Education Research Database on Schools (NERD$) | https://edunomicslab.org/nerds/ | School-level finance data.
0 | 1 | OECD iLibrary Education at a Glance | https://www.oecd-ilibrary.org/education/data/education-at-a-glance_eag-data-en | "This database includes data on education learning outputs and outcome and resources, access to education and the learning environment."
1 | 1 | DC Enrollment Study Data Repository | https://github.com/betsyjwolf/DC-Enrollment-Study | Data from the DC Enrollment Study
1 | 1 | BPCNet Statistics and Data Hub | https://bpcnet.org/statistics/ | Combines public higher education data, especially focused on computer science and engineering attainment.
1 | 1 | American Educator Panel Survey Data Portal | https://www.rand.org/education-and-labor/projects/aep/data-portal.html | Free, de-identified survey data of educators from RAND.
1 | 1 | Civil Rights Data Collection | https://ocrdata.ed.gov/ | Wide-ranging academic, discipline, staffing, and financial data on US public schools.
1 | 1 | Integrated Postsecondary Education Data System (IPEDS) | https://nces.ed.gov/datalab/index.aspx | Data on institutions of higher education in the United States, including enrollment, financial aid, degree completion, etc.
1 | 1 | NCES Online Codebook | https://nces.ed.gov/OnlineCodebook | Tool to explore federal education data sets, including older data.
1 | 1 | College Scorecard | https://collegescorecard.ed.gov/data/ | Includes information on institutional characteristics, enrollment, student aid, costs, and student outcomes, as well as supporting data on student completion, debt and repayment, earnings, and more.
1 | 1 | EdBuild's School District Finance Data | http://data.edbuild.org/ | EdBuild’s master dataset of school district finance, student demographics, and community economic indicators for every school district in the United States.
1 | 0 | IPPSR Correlates of State Policy | http://ippsr.msu.edu/public-policy/correlates-state-policy | The Correlates of State Policy Project includes more than 900 variables, with observations across the 50 U.S. states and across time (1900–2016, approximately). These variables represent policy outputs or political, social, or economic factors that may influence policy differences.
1 | 0 | Federal Reserve Economic Data (FRED) | https://fred.stlouisfed.org/ | Primarily macroeconomic data.
1 | 0 | Ed-Data (California Education Data Partnership) | http://www.ed-data.org/ | Combines data on California schools, including tools for combining data elements.
0 | 0 | Gapminder | https://www.gapminder.org/data/ | "Most of our data are not good enough for detailed numeric analysis." Gapminder combines data from multiple sources into unique coherent time-series that can’t be found elsewhere.
0 | 0 | Our World in Data | https://ourworldindata.org/ | Various
1 | 0 | PEERS Data Hub | https://www.icpsr.umich.edu/web/pages/peersdatahub/ | STEM education-related data with "both restricted and public-use data files that can be used for quantitative and qualitative analysis. Search through the archives by topic, variable, or title."
1 | 0 | California Community Colleges Chancellor's Office MIS Data Mart | https://datamart.cccco.edu/ | "The data mart provides information about students, courses, student services, outcomes and faculty and staff."
1 | 0 | NYC Open Data | https://opendata.cityofnewyork.us/ | "Open Data is free public data published by New York City agencies and other partners."
0 | 0 | Kaggle Datasets | https://www.kaggle.com/datasets | Various
0 | 0 | WHO Global Health Observatory data repository | https://apps.who.int/gho/data/node.home | "WHO's gateway to health-related statistics for more than 1000 indicators for its 194 Member States. Data are organized to monitor progress towards the Sustainable Development Goals (SDGs), including health status indicators to monitor progress towards for the overall health goal, indicators to track equity in health indicators, and the indicators for the specific health and health-related targets of the SDGs."
0 | 0 | British Film Institute Industry data and insights | https://www.bfi.org.uk/industry-data-insights | "Read free research data and market intelligence on the UK film industry and other screen sectors."
1 | 0 | FiveThirtyEight Data | https://data.fivethirtyeight.com/ | "We’re sharing the data and code behind some of our articles and graphics."
0 | 0 | UNICEF Data | https://data.unicef.org/# | "Monitoring the situation of children and women"
0 | 0 | Harvard Dataverse | https://dataverse.harvard.edu/ | Various; "Harvard Dataverse is a repository for research data."
0 | 0 | Datahub.io | https://datahub.io/collections | Various; "high quality data and datasets organized by topic"; may require more data processing experience.
0 | 0 | Data in Brief | https://www.sciencedirect.com/journal/data-in-brief | Various; "Data in Brief is a multidisciplinary, open access, peer-reviewed journal, which publishes short, digestible articles that describe and provide access to research data."
1 | 0 | Measures of Effective Teaching Longitudinal Database | https://www.icpsr.umich.edu/web/pages/about/metldb.html | "This site enables users to apply for access to quantitative data and classroom videos created by the Measures of Effective Teaching (MET) project, funded by the Bill & Melinda Gates Foundation. Use of the MET Longitudinal Database is offered to approved researchers via a remote access system."
1 | 0 | Early Childhood Longitudinal Study (ECLS) | https://nces.ed.gov/ecls/index.asp | "The Early Childhood Longitudinal Study (ECLS) program provides important information about children's knowledge, skills, and development from birth through elementary school."
0 | 0 | Education (and other) Data Sources (Matthew Lenard) | https://www.dropbox.com/s/q28muxrxooqetcv/data_sources.pdf?dl=0 | Thorough and well-organized list of data sources.
0 | 0 | Programme for International Student Assessment | https://www.oecd.org/pisa/data/ | "The PISA database contains the full set of responses from individual students, school principals and parents."
0 | 0 | Base dos Dados | https://basedosdados.org/ | Various. (Portuguese)
1 | 0 | Child & Family Data Archive | https://www.childandfamilydataarchive.org/cfda/pages/cfda/data.html | Datasets related to early childhood.
1 | 0 | Ed Finance-Related Datasets | https://edunomicslab.org/ed-finance-related-datasets/ | From the Edunomics Lab: "a list of datasets commonly used in education research and practice focused primarily on finance-related data"
1 | 0 | National Council on Teacher Quality | https://www.nctq.org | Various teacher-related databases either available to download or available upon request.
1 | 0 | U.S. COVID-19 County Policy Database | https://doi.org/10.3886/E180482V1 | The objective of the U.S. COVID-19 County Policy (UCCP) Database is to systematically gather, characterize, and assess variation in U.S. county-level COVID-19-related policies. (paper link: https://bmcpublichealth.biomedcentral.com/articles/10.1186/s12889-022-14132-6)
0 | 0 | This is the way: Network perspective on targets for spatial ability development programmes | https://osf.io/d57uw/?view_only= | Published paper re: spatial and cognitive ability using data from Russian university students: https://bpspsychub.onlinelibrary.wiley.com/doi/10.1111/bjep.12524
1 | 1 | IHE Program-Level Debt and Earnings Outcomes | https://robertkelchen.com/2023/01/19/sharing-a-dataset-of-program-level-debt-and-earnings-outcomes/ | Assembled and shared by Robert Kelchen; "The resulting dataset covers 45,971 programs at 5,033 institutions with data on both student debt and earnings for those same cohorts."
0 | 0 | The Socioeconomic High-resolution Rural-Urban Geographic Platform for India (SHRUG) | https://www.devdatalab.org/shrug | "an open access repository currently comprising dozens of datasets covering India’s over 500,000 villages and 8000 towns over a span of 25 years, all linked together with a set of common geographic identifiers"
1 | 0 | Replication Data for: Do Male and Female Legislators Have Different Twitter Communication Styles? | https://dataverse.unc.edu/dataset.xhtml?persistentId=doi:10.15139/S3/MHAAZV | "a dataset with nearly 4 million tweets by all American state legislators, coded for topic and ideology through supervised learning"
1 | 0 | Federal Bureau of Investigation Crime Data Explorer | https://cde.ucr.cjis.gov/LATEST/webapp/#/pages/home | "The FBI's Crime Data Explorer (CDE) aims to provide transparency, create easier access, and expand awareness of criminal, and noncriminal, law enforcement data sharing; improve accountability for law enforcement; and provide a foundation to help shape public policy with the result of a safer nation. Use the CDE to discover available data through visualizations, download data in .csv format, and other large data files."
1 | 1 | Office of Special Education Programs (OSEP), US Department of Education | https://www2.ed.gov/about/offices/list/osers/osep/index.html | Data on the provision of special education services (e.g., SPED personnel).
1 | 0 | Youth Risk Behavior Surveillance System (YRBSS) (Centers for Disease Control & Prevention) | https://www.cdc.gov/healthyyouth/data/yrbs/index.htm | "The Youth Risk Behavior Surveillance System (YRBSS) is a set of surveys that track behaviors that can lead to poor health in students grades 9 through 12."
1 | 0 | Law Enforcement Management and Administrative Statistics (LEMAS) | https://bjs.ojp.gov/data-collection/law-enforcement-management-and-administrative-statistics-lemas | From the Bureau of Justice Statistics: "Conducted periodically since 1987, the LEMAS core collects data from over 3,000 general purpose, county, and local law enforcement agencies, including all those that employ 100 or more full-time sworn officers and a nationally representative sample of smaller agencies. Data are obtained on agency responsibilities, operating expenditures, job functions of sworn and civilian employees, officer salaries and special pay, demographic characteristics of officers, weapons and armor policies, education and training requirements, computers and information systems, vehicles, special units, and community policing activities."
1 | 0 | Myers Abortion Facility Database | https://osf.io/8dg7r/ | "Public data include a county-by-month panel of travel distances to the nearest U.S. abortion provider from January 2009 through December 2022. Researchers can apply for restricted data identifying abortion facilities."
1 | 0 | United States Census Bureau Data | https://data.census.gov/ | Explorer and table generator for US Census data.
1 | 0 | American Stories dataset | https://huggingface.co/datasets/dell-research-harvard/AmericanStories | "The American Stories dataset is a collection of full article texts extracted from historical U.S. newspaper images. It includes nearly 20 million scans from the public domain Chronicling America collection maintained by the Library of Congress. The dataset is designed to address the challenges posed by complex layouts and low OCR quality in existing newspaper datasets." See: https://arxiv.org/abs/2308.12477
1 | 0 | National Cancer Institute Surveillance, Epidemiology, and End Results (SEER) Data Sets | https://seer.cancer.gov/data-software/datasets.html | "The SEER research data include SEER incidence and population data associated by age, sex, race, year of diagnosis, and geographic areas (including SEER registry and county). SEER releases new research data every Spring based on the previous November’s submission of data. Use SEER data to address multiple topics; for example, you can: Examine stage at diagnosis by race/ethnicity. Calculate survival by stage at diagnosis, age at diagnosis, and tumor grade or size. Determine trends and incidence rates for various cancer sites over time."
1 | 1 | Florida PK-12 Public School Data Publications and Reports | https://www.fldoe.org/accountability/data-sys/edu-info-accountability-services/pk-12-public-school-data-pubs-reports/ | Public data files from the Florida Department of Education
0 | 0 | Find Economic Articles with Data | https://ejd.econ.mathematik.uni-ulm.de/ | "This is an R Shiny app to search for economic articles that have provided data and code for replication purposes. The main feature is a keyword search in the article's titles and abstracts. It returns a list with links to the articles on their journal websites and some estimates of the sizes of data files and relevant code files. By default only articles are included that have a data or code supplement."
0 | 0 | Mostly Harmless Econometrics Data Archive | https://economics.mit.edu/people/faculty/josh-angrist/mhe-data-archive | "Data and programs from Mostly Harmless Econometrics"
1 | 0 | Income Distributions and Dynamics in America: Data Center | https://www.minneapolisfed.org/institute/income-distributions-and-dynamics-in-america/data-center | The Income Distributions and Dynamics in America project seeks to foster new research and analysis on income disparities by providing statistics on income percentiles, shares, growth rates, persistence, and more for many U.S. demographic groups at national and state levels.
0 | 0 | Publicly Available Data in Sociology | Open Datasets | Via Miles Brickell
1 | 1 | National Longitudinal School Database | https://reachcentered.org/publications/national-longitudinal-school-database | The National Longitudinal School Database (NLSD) was created by the National Center for Research on Education Access and Choice (REACH) to allow researchers to examine various aspects of traditional public schools, charter schools, magnet schools, and private schools.
0 | 0 | Pew Research Center Datasets | https://www.pewresearch.org/download-datasets/ | "Pew Research Center makes its data available to the public for secondary analysis after a period of time."
1 | 0 | The Downballot Ultimate Data Guide | https://www.the-downballot.com/p/data | US elections data
1 | 0 | OpenElections Data Repository | https://github.com/openelections | "The goal of OpenElections is to create the first free, comprehensive, standardized, linked set of election results data for the United States."
1 | 0 | DataLumos | https://www.datalumos.org/datalumos/ | "DataLumos is an ICPSR archive for valuable government data resources. ICPSR has a long commitment to safekeeping and disseminating US government and other social science data. DataLumos accepts deposits of public data resources from the community and recommendations of public data resources that ICPSR itself might add to DataLumos."