ABCDEFGHI
1
Humanities Datasets in Context
2
3
4
LiteratureAfrican American Literature (1853-1923)From: Melanie Walsh’s list of Datasets
5
Data locationFormatOriginal SourceNotes
6
African American Literature (1853-1923)Zip of txt filesAmardeep SinghPlain text of over 100 novels by Black literary authors including poetry and novels. Spreadsheet of metadata for the works.
7
Additional information
8
Please follow the principle of the Colored Conventions Project when using and downloading the data
9
10
Colonial South Asian Literature (1850-1923)From: Melanie Walsh’s list of Datasets
11
Data locationFormatOriginal SourceNotes
12
Colonial South Asian Literature (1850-1923)Zip of txt filesAmardeep SinghPlain text files of British and South Asian writers in English or in translation.
13
14
TxtLab's Multilingual Novels (1771-1932)From: Melanie Walsh’s list of Datasets
15
Data locationFormatOriginal SourceNotes
16
Multilingual Novels DatabaseZip of txt filesAndrew Piper, TxtLabPlain text files of 450 novels in German, French, and English
17
18
Goodreads RatingsFrom: Kaggle
19
Data locationFormatOriginal SourceNotes
20
Goodreads dataCSVGoodreads API and Kaggle user SoumikCSV containing metadata about the book and the average rating on Goodreads
21
22
23
Library DataShakespeare & Company: Lending Library Data
24
Data locationFormatOriginal SourceNotes
25
Shakespeare & Co. Lending Library DataCSV and JSON filesThe Shakespeare and Company ProjectCSV files of lending library borrows, purchases. memberships and renewals
26
27
Seattle Public Library Ciculation Data (2005-present)From: Melanie Walsh’s list of Datasets
28
Data locationFormatOriginal SourceNotes
29
Seattle Library checkouts by title 2005-presentexportable spreadsheetCity of SeattleData for all checkouts since 2005, filterable by interest
30
31
32
Pop CultureGame of Thrones Character RelationshipsFrom: Melanie Walsh’s list of Datasets
33
Data locationFormatOriginal SourceNotes
34
GoT Character dataCSV filesA. Beveridge and J. Shan107 characters and their interactions. Best suited for Network Analysis.
35
36
Bechdel Test for MoviesFrom: FiveThirtyEight
37
Data locationFormatOriginal SourceNotes
38
Bechdel Movie DataCSV"The Dollars-and-Cents Case Against Hollywood's Exclusion of Women"1,615 films from 1990 to 2013 measured against the Bechdel Test.
39
40
Hollywood Film Dialogue by Character Gender and AgeFrom: Melanie Walsh’s list of Datasets
41
Data locationFormatOriginal SourceNotes
42
Hollywood Film DialogueCSV“Film Dialogue from 2,000 screenplays, Broken Down by Gender and Age”.Dialogue for 2000 films (1925-2015). For more info see FAQ for the “Film Dialogue, By Gender” Project.
43
44
50 Years of Pop Music LyricsFrom: Sierra Eckhert's list of datasets
45
Data locationFormatOriginal SourceNotes
46
Billboard Year-End Hot 100 (1965-2015)CSVKaylin Pavlik50 years of lyrics from Billboard's Top 100. Includes song, artist, year, lyrics, and source.
47
48
Graphic Novel CorpusFrom: Hybrid Narrativity
49
Data locationFormatOriginal SourceNotes
50
Graphic Novel CorpusZip of CSV files and PDF of database logicAbout the Graphic Novel Corpus240 graphic novels. Files include author, title, and other metadata. Structured in multiple CSV files for creation of a database. Charts section shows possible analysis.
51
52
53
MiscellaneousQueer Politics at PrincetonFrom: QP@P
54
Data locationFormatOriginal SourceNotes
55
Queer Politics at PrincetonCSVAndrew Reynolds, Queer Politicians Data, QP@P PrincetonDataset includes out LGBTQI+ elected officials since 1976, geographic information (w/o Lat/Long), political party, role, etc.
56
57
The Endangered Languages ProjectFrom: Endangered Languages Project
58
Data locationFormatOriginal SourceNotes
59
Endangered Languages DataCSVAlliance for Linguistic DiversityDataset includes vitality of a language, linguistic details, number of living speakers and geographic data.
60
61
Who Has Your Face?From: Atlas of Surveillance
62
Data locationFormatOriginal SourceNotes
63
Facial Recognition DataCSVElectric Frontier Foundation Who Has Your Face? reportDataset organized by state that details which agencies have access to which type of facial recognition data
64
65
Torn Apart/SeparadosFrom: Mobilized Humanities
66
Data locationFormatOriginal SourceNotes
67
Open Data RepositoryCSV filesTorn Apart/SeparadosData culled from multiple sources to compile an atlas of ICE detention centers on the southern border
68
69
Nobel Prize WinnersFrom: Melanie Walsh’s list of Datasets
70
Data locationFormatOriginal SourceNotes
71
Nobel Prize Winners dataCSVThe European Data Portal and the official Nobel Prize API957 Nobel Prize winners (1901-2017) includes metadata about the winners and location data for mapping
72
73
What's on the Menu?From: Melanie Walsh’s list of Datasets
74
Data locationFormatOriginal SourceNotes
75
What's on the Menu dataCSVNew York Public LibraryMenu and dish data including prices collected so far from the 45,000 menus in their colllection (1840-present)
76
Additional information
77
This project was the subject of Katie Rawson and Trevor Muñoz' article "Against Cleaning"