Photo is © Jennifer Peebles , used under a Creative Commons Attribution-NonCommercial license. https://goo.gl/tUDx1w
Pickling your Project: Workshop
Preservation is another phase in the lifecycle of your research:
outstanding stats: The LifeSpan of the average website
1 Mike Ashenfelder, “The Average Lifespan of a Webpage | The Signal: Digital Preservation,” webpage, (November 8, 2011), http://blogs.loc.gov/digitalpreservation/2011/11/the-average-lifespan-of-a-webpage/.
2 “A Look at Website Lifespans,” Bismarck Tribune, accessed November 17, 2015, http://bismarcktribune.com/news/columnists/keith-darnay/a-look-at-website-lifespans/article_1d879ae6-851a-11e3-8bd1-0019bb2963f4.html.
3 Joy Thomas, “Web Site Demise and Graduate Research: Persistence of Web Pages Cited in Social Work Theses,” Behavioral & Social Sciences Librarian 22, no. 2 (January 2004): 67–77, doi:10.1300/J103v22n02_04.
4 T. Agata et al., “Life Span of Web Pages: A Survey of 10 Million Pages Collected in 2001,” in 2014 IEEE/ACM Joint Conference on Digital Libraries (JCDL), 2014, 463–64, doi:10.1109/JCDL.2014.6970226.
5 Daniel Gomes and Miguel Costa, “The Importance of Web Archives for Humanities,” Journal of Humanities & Arts Computing: A Journal of Digital Humanities 8, no. 1 (March 2014): 106–23, doi:10.3366/ijhac.2014.0122.
6 Adrienne LaFrance, “Raiders of the Lost Web,” The Atlantic, October 14, 2015, http://www.theatlantic.com/technology/archive/2015/10/raiders-of-the-lost-web/409210/.
��
Depending upon your project
If a web-based project for your dissertation/thesis:
Let us know as early as possible, so we can review best practices, review webrecorder and start performing test crawls.
Some Best Practices
For Example:
The timeline does not appear in the archived version
http://wayback.archive-it.org/4739/20140722000114/http://mydigitalfootprint.org/
And
Embedded Media Does Not appear in Archived Version
Another example
http://dbpod.graciass.net/browse
Searches
Returns:
In the Archived Version
http://wayback.archive-it.org/5163/20150917122237/http://dbpod.graciass.net/browse
The same search returns “Not in archive” in archive-it version
Other Interactive Elements|Maps�http://nycfashionindex.com/
No interactivity in the archived Version
http://wayback.archive-it.org/5978/20150921204622/http://nycfashionindex.com/
http://dropoutsdropin.org/
http://wayback.archive-it.org/4739/20160511121130/http://dropoutsdropin.org/
http://wayback.archive-it.org/5484/20150403202249/http://inq13.gc.cuny.edu/videos/
Crawlers are not able to fully simulate a user interacting with a site via a browser because crawlers are able to read, but, generally, unable to execute many of the scripts embedded in a website.
The web is a…
Archive it support:
“ In general, the steps you took towards expanding/limiting the scope and using the Developer's Tools to pinpoint what was missing/had changed were spot-on. This is exactly what scoping is all about. The phrase we commonly invoke, "the web is a mess" is quite real.”
Again, Some Best Practices
Depending upon your project
If a web-based project for your dissertation/thesis:
Let us know as early as possible, so we can review best practices, review webrecorder and start performing test crawls.
We have you do it (DIY)
webrecorder.io
WebRecorder Resolves
Recommendations & Processes documented in A Libguide
We:
Demo
ablility to test
Your WARCs using a player
Why we Embraced
uploadED to CUNY’s institutional repo (Academic Works)
�In addition to providing links to archive-it in our catalog and institutional repository (Academic Works), we decided to upload the user generated WARC from webrecorder to our institutional repository for additional preservation. Note: According to Corey Davis “capturing websites in WARC format for playback and full-text search is only a part of what is needed for true digital preservation. WARC files backed up by the Internet Archive are susceptible to corruption.”1.
1 Davis, Corey. “Archiving the Web: A Case Study from the University of Victoria.” The Code4Lib Journal 26 (2014): n. pag. Code4Lib Journal. Web. 13 Nov. 2015.
If an application...
If an application (desktop or mobile) and not a website, you are welcome to provide a link to your GitHub repository, but please also include:
Please upload all of these files when you deposit to Academic Works.
More info
Thank you
Stephen Klein�Digital Systems Librarian�sklein@gc.cuny.edu