A | B | C | D | E | F | G | H | I | |
---|---|---|---|---|---|---|---|---|---|
1 | Humanities Datasets in Context | ||||||||
2 | |||||||||
3 | |||||||||
4 | Literature | African American Literature (1853-1923) | From: Melanie Walsh’s list of Datasets | ||||||
5 | Data location | Format | Original Source | Notes | |||||
6 | African American Literature (1853-1923) | Zip of txt files | Amardeep Singh | Plain text of over 100 works by Black literary authors, including poetry and novels. Includes a spreadsheet of metadata for the works. | |||||
7 | Additional information | ||||||||
8 | Please follow the principles of the Colored Conventions Project when downloading and using the data | ||||||||
9 | |||||||||
10 | Colonial South Asian Literature (1850-1923) | From: Melanie Walsh’s list of Datasets | |||||||
11 | Data location | Format | Original Source | Notes | |||||
12 | Colonial South Asian Literature (1850-1923) | Zip of txt files | Amardeep Singh | Plain text files of British and South Asian writers in English or in translation. | |||||
13 | |||||||||
14 | TxtLab's Multilingual Novels (1771-1932) | From: Melanie Walsh’s list of Datasets | |||||||
15 | Data location | Format | Original Source | Notes | |||||
16 | Multilingual Novels Database | Zip of txt files | Andrew Piper, TxtLab | Plain text files of 450 novels in German, French, and English | |||||
17 | |||||||||
18 | Goodreads Ratings | From: Kaggle | |||||||
19 | Data location | Format | Original Source | Notes | |||||
20 | Goodreads data | CSV | Goodreads API and Kaggle user Soumik | CSV containing metadata about the book and the average rating on Goodreads | |||||
21 | |||||||||
22 | |||||||||
23 | Library Data | Shakespeare & Company: Lending Library Data | |||||||
24 | Data location | Format | Original Source | Notes | |||||
25 | Shakespeare & Co. Lending Library Data | CSV and JSON files | The Shakespeare and Company Project | CSV files of lending library borrows, purchases, memberships, and renewals | |||||
26 | |||||||||
27 | Seattle Public Library Circulation Data (2005-present) | From: Melanie Walsh’s list of Datasets | |||||||
28 | Data location | Format | Original Source | Notes | |||||
29 | Seattle Library checkouts by title 2005-present | exportable spreadsheet | City of Seattle | Data for all checkouts since 2005, filterable by interest | |||||
30 | |||||||||
31 | |||||||||
32 | Pop Culture | Game of Thrones Character Relationships | From: Melanie Walsh’s list of Datasets | ||||||
33 | Data location | Format | Original Source | Notes | |||||
34 | GoT Character data | CSV files | A. Beveridge and J. Shan | 107 characters and their interactions. Best suited for network analysis. | |||||
35 | |||||||||
36 | Bechdel Test for Movies | From: FiveThirtyEight | |||||||
37 | Data location | Format | Original Source | Notes | |||||
38 | Bechdel Movie Data | CSV | "The Dollars-and-Cents Case Against Hollywood's Exclusion of Women" | 1,615 films from 1990 to 2013 measured against the Bechdel Test. | |||||
39 | |||||||||
40 | Hollywood Film Dialogue by Character Gender and Age | From: Melanie Walsh’s list of Datasets | |||||||
41 | Data location | Format | Original Source | Notes | |||||
42 | Hollywood Film Dialogue | CSV | “Film Dialogue from 2,000 screenplays, Broken Down by Gender and Age” | Dialogue for 2,000 films (1925-2015). For more information, see the FAQ for the “Film Dialogue, By Gender” project. | |||||
43 | |||||||||
44 | 50 Years of Pop Music Lyrics | From: Sierra Eckert's list of datasets | |||||||
45 | Data location | Format | Original Source | Notes | |||||
46 | Billboard Year-End Hot 100 (1965-2015) | CSV | Kaylin Pavlik | 50 years of lyrics from Billboard's Top 100. Includes song, artist, year, lyrics, and source. | |||||
47 | |||||||||
48 | Graphic Novel Corpus | From: Hybrid Narrativity | |||||||
49 | Data location | Format | Original Source | Notes | |||||
50 | Graphic Novel Corpus | Zip of CSV files and PDF of database logic | About the Graphic Novel Corpus | 240 graphic novels. Files include author, title, and other metadata. Structured in multiple CSV files for creation of a database. Charts section shows possible analysis. | |||||
51 | |||||||||
52 | |||||||||
53 | Miscellaneous | Queer Politics at Princeton | From: QP@P | ||||||
54 | Data location | Format | Original Source | Notes | |||||
55 | Queer Politics at Princeton | CSV | Andrew Reynolds, Queer Politicians Data, QP@P Princeton | Dataset of out LGBTQI+ elected officials since 1976; includes geographic information (without latitude/longitude), political party, role, etc. | |||||
56 | |||||||||
57 | The Endangered Languages Project | From: Endangered Languages Project | |||||||
58 | Data location | Format | Original Source | Notes | |||||
59 | Endangered Languages Data | CSV | Alliance for Linguistic Diversity | Dataset includes vitality of a language, linguistic details, number of living speakers and geographic data. | |||||
60 | |||||||||
61 | Who Has Your Face? | From: Atlas of Surveillance | |||||||
62 | Data location | Format | Original Source | Notes | |||||
63 | Facial Recognition Data | CSV | Electronic Frontier Foundation Who Has Your Face? report | Dataset organized by state that details which agencies have access to which types of facial recognition data | |||||
64 | |||||||||
65 | Torn Apart/Separados | From: Mobilized Humanities | |||||||
66 | Data location | Format | Original Source | Notes | |||||
67 | Open Data Repository | CSV files | Torn Apart/Separados | Data culled from multiple sources to compile an atlas of ICE detention centers on the southern border | |||||
68 | |||||||||
69 | Nobel Prize Winners | From: Melanie Walsh’s list of Datasets | |||||||
70 | Data location | Format | Original Source | Notes | |||||
71 | Nobel Prize Winners data | CSV | The European Data Portal and the official Nobel Prize API | 957 Nobel Prize winners (1901-2017). Includes metadata about the winners and location data for mapping. | |||||
72 | |||||||||
73 | What's on the Menu? | From: Melanie Walsh’s list of Datasets | |||||||
74 | Data location | Format | Original Source | Notes | |||||
75 | What's on the Menu data | CSV | New York Public Library | Menu and dish data, including prices, collected so far from the 45,000 menus in their collection (1840-present) | |||||
76 | Additional information | ||||||||
77 | This project was the subject of Katie Rawson and Trevor Muñoz's article "Against Cleaning" |