Archives Catalogues as Data
TASK: Turn 1990s archival descriptions in print inventories into machine-actionable, online accessible data
APPROACH: Collections-as-data
GROUP MEMBERS
Yael Netzer
Aaron Christianson
Elena Hamidy
Amalia Levi
Imme Klages
9 printed volumes
Original database ‘lost’
5 Volumes are digitized
Background
“Collections as Data”
https://collectionsasdata.github.io/
Human readable - Machine Actionable
Yerushah project https://www.yerusha-search.eu/viewer/index/
Description of institution and relevant Jewish collections
Notepad++ with regexp
GOALS
Continue working with all volumes (this is only one volume out of 9)
Networks: locations, persons, timeline…
Mapping: reconcile and link locations
Persons: reconcile and link [to other collections too]
Linking archives/records to digital catalogues.
Find interesting things through distant reading archival descriptions!
Notepad++ with regexp
Omeka-S
Exiles in “Quellen zur Geschichte der Juden in den Archiven der Neuen Bundesländer”
Types of Organisation of a Book (Institutionenregister)/ digital edition possibilities
Schocken
(reply to Sinai’s question)
Combining indexes to link persons to locations (Gephi-Graph)
LIMITATIONS OF AVAILABLE DATA
Lessons Learned
Always be suspicious towards resources
Catalogues as data, books as data: think about future process of creation
Paradata/Provenance should not be neglected or ignored
Modeling of knowledge is an iterative, community-based process
Hackathons are effective and fun