Best Publicly Available Educational Data Set Prize Nominations (2026)
The International Educational Data Mining Society invites nominations for the 2026 Best Publicly Available Educational Data Set. Each year's winner receives a prize of $2000 and, potentially, free registration to present an award talk at the following year's International Conference on Educational Data Mining. Please see https://educationaldatamining.org/data-set-awards/ for more details.

We welcome proposals for this prize and invite all members of the community to nominate data sets they consider valuable contributions to our community. Self-nominations are allowed, as are nominations of data sets posted by other individuals.

Please fill out the details of the data set below.  If you're unable to answer all questions, please provide as much of the requested information as possible.  We will follow up with data set owners where we have further questions.

Questions about the nomination process or prize should be emailed directly to Anna Rafferty (arafferty@carleton.edu).
Your name:
Your email address:
Name of the data set you're nominating:
URL(s) for accessing the data set and any descriptions of it:
SIZE, COVERAGE, BREADTH. How many data records are included in this data set? What is the unit of analysis, and what are the student demographics and representativeness, if applicable?  Is there missing data? Any other relevant information that describes the characteristics and nature of the data is welcome.
SOURCE. What is the original source of this data set?  For example, was it recorded on a specific EdTech platform? Collected through use in classrooms or in some other setting? What person, group, or lab originally released it?
PUBLICATIONS.  Please list published papers describing and/or using this data set, or link to a website listing these papers.
ETHICS CONDITIONS.  Is the data subject to ethical concerns and was its collection reviewed and approved according to local ethics regulations (e.g., institutional review board in countries where applicable)?
LICENSE. Who owns this data, and what are the specific restrictions for using the data (e.g., legal agreements)? What license is the data shared under (e.g., CC BY 4.0)?

AVAILABILITY AND PERMANENCE. How can the data set be accessed? How is the availability of the data set expected to be guaranteed in perpetuity? Is it hosted on a well-recognised and reliable platform (e.g., HuggingFace, LearnSphere)?

(Optional) In a paragraph or so, tell us why you think this data set is a particularly valuable educational data set.
This form was created inside of Carleton College.
