Build: The Archives Research Compute Hub from Idea to Platform
Ian Milligan, Jefferson Bailey, Nick Ruest, Helge Holzmann, Samantha Fritz, Kody Willis
Web Archiving Conference, 2022
The Web
2
Citation:
1. Schroeder, R., & Brügger, N. (2017). Introduction: The web as history. In R. Schroeder & N. Brügger (Eds.), The Web as History: Using Web Archives to Understand the Past and the Present (pp. 1–20). UCL Press. https://doi.org/10.2307/j.ctt1mtz55k.6
The Challenge
available analytics tools, community infrastructure, and inaccessible web archival interfaces present high barriers for conducting research with web archives at scale.
3
Archives Unleashed I (2017-2020)
4
Archives Unleashed I (2017-2020)
5
Archives Unleashed II (2020-2023)
6
Merge Archives Unleashed with the Internet Archive Archive-It Platform to create an end-to-end solution to collect and study web archives.
Foster and support a research community of practice by offering opportunities to engage with web archive research.
ARCH (Archives Research Compute Hub)
Cohort Program
Project Priorities
Introducing the ARCH Platform
7
Introducing the ARCH Platform
Features:
8
Switching to Live Demo here
9
Building for Scalable Analysis
10
First Steps
11
Ideation: Identifying existing Archives Unleashed and Archive-It services – overlaps and differences?
Creation: A half-dozen paper drawings to an interactive prototype (using MockPlus) - sketching wireframe
Iteration: Showing teams storyboards, thinking about how to make for an intuitive and friendly workflow
User Experience Testing
12
seeks to understand the impressions, experience, and feelings a user expresses while interacting with a product prototype.
User Experience Testing
13
Connection and Integration
14
Continual Improvement
15
Lessons Learned
16
Reflecting on Lessons Learned
Lesson 1
If you build it, they won’t come. You need to actively work to create an environments where users feel comfortable.
Lesson 2
Work to meet your users. This doesn’t necessarily mean that you will make all of them happy, but it does mean you need to listen and be responsive through UX testing and outreach.
Lesson 3
Be ready for the unexpected! If there’s something that is 1 in a 1,000,000, you’ll run into it dozens of times in your WARCs. So be ready for error handling and continual improvement.
17
18
In partnership with the Internet Archive’s Archive-It, this work is primarily supported by the Andrew W. Mellon Foundation. Other financial and in-kind support has come from the Social Sciences and Humanities Research Council, Compute Canada, York University Libraries, Start Smart Labs, and the Faculty of Arts and David R. Cheriton School of Computer Science at the University of Waterloo.
Acknowledgements of Institutional Support
Thanks!
Any questions ?
Connect with out project team:
19