Web Archives as Big Data: Building Tools and Community to Support Access and Usability with Archives Unleashed
IFLA WLIC 2021 17-19 August 2021
Big Data
Big Data
data whose scale, diversity and complexity require new architectures, techniques, algorithms, and analytics to manage it and extract value and hidden meaning from it.
(noun).
(Noam Slonim et al, 2012)
Web Archiving
“Libraries have long been in the business of preserving documentary heritage”
- Ben White, 2012
Libraries & (W/ARC) Data
Libraries & (W/ARC) Data
Small sample of national, university/college, and public libraries conducting web archiving
Challenges with Web Archives
Despite the volume of data captured web archives have not become a dominant resource for researchers
Challenges with Web Archives
Solutions
with the Archives Unleashed Project
Tool Building Community Engagement Collaborative Partnerships
Est. 2017
1. Tools: Scalable and User-Friendly
Archives Unleashed Toolkit
Archives Unleashed Cloud
2. Learning Resources: Inspire Confidence & Use
Archives Unleashed Toolkit User Documentation: https://aut.docs.archivesunleashed.org
3. Build and Engage Community
Archives Unleashed Washington, DC. Datathon
Gelman Library, George Washington University, 2019. Photo by Samantha Fritz
Formalized collaboration with the Internet Archive’s Archive-It (2020-2023) to integrate services
Worked with scholars in several disciplines: digital humanities, social sciences, medicine, journalism, and political science
Proactively developed collaborative relationships with several stakeholder groups
4. Collaboration Expands visibility, Access, and Use
RESEARCH
COMMUNITIES
ACADEMIC
LIBRARIES +
ARCHIVE-IT
COLLABORATION
Collaborated with library institutions in North America to make scholarly derivatives openly available.
Conclusions
Sources
IFLA WLIC 2021 17-19 August 2021
Thank You!
Samantha Fritz, MLIS
Project Manager, Archives Unleashed
https://archivesunleashed.org/
Samantha.fritz@uwaterloo.ca @SamVFritz