1 of 19

ALWAYS ON THE MOVE: TRANSIENT SOFTWARE AND DATA MIGRATIONS

2 of 19

HELLO!

I’m David Wilcox

I am the Fedora Product Manager with DuraSpace.

You can find me at dwilcox@duraspace.org and @d_wilcox

3 of 19

Our community

is part of an interconnected,

worldwide, scholarly ecosystem.

Source: The Digital Ecosystem in the Balanced Value Impact Model (illustrated by Alice Maggs); http://simon-tanner.blogspot.com/2014/09/do-you-understand-your-digital-ecosystem.html

4 of 19

DuraSpace open source projects

5 of 19

  • Stores, preserves, provides access to digital objects
  • Supports flexible and complex content models
  • Supports semantic relationships using RDF

Fedora is designed to interoperate with other applications and services.

6 of 19

FOCUS ON DURABILITY

  • Checksums are calculated and stored
  • An audit history can be maintained
  • Changes can be versioned

Fedora has features that support long-term digital preservation.

7 of 19

FEDORA FRONT-ENDS

Fedora is middleware

You can build a custom framework or join a community

8 of 19

TRANSIENT SOFTWARE

  • Software comes and goes
  • Even well-established software changes
  • Major changes often require migrations

Data migrations will almost certainly be necessary over time.

9 of 19

DATA MIGRATIONS

  • Data migrations are rarely easy
  • Quality and consistency are important
  • Use standards when possible

Adopting and applying standards consistently can make migrations much smoother.

10 of 19

IMPORT/EXPORT UTILITY

  • Transacts in serialized RDF
  • Metadata and files can be included
  • Optionally supports BagIt and Profiles

The import/export utility is a Java command line application

11 of 19

USE CASES AND REQUIREMENTS

  • Upgrading between major versions
  • Transferring between Fedora repositories
  • Exporting to external preservation systems

Other use cases are documented on the wiki

12 of 19

COMPONENT of ECOSYSTEM

Fedora

Binaries

HTTP- API

Import/Export

IIIF

Server

Web Site

Profiles

FS

Research

Data

DDP

Binaries

Triple

store

13 of 19

ACHIEVING INTEROPERABILITY

  • Other platforms could be supported
  • Working within context of RDA
  • Broad interoperability is the goal

A Research Data Alliance working group is recommending this approach.

14 of 19

THE VALUE OF COMMUNITY

  • Software is difficult to support and maintain
  • Even large institutions can’t go it alone
  • No single point of failure or control

Fedora is supported by a distributed, global community of stakeholders and contributors.

15 of 19

16 of 19

TRAINING WORKSHOPS AND CAMPS

17 of 19

LEARN MORE

18 of 19

THANKS!

Any questions?

You can find me at @d_wilcox & dwilcox@duraspace.org

19 of 19

CREDITS

Presentation template by SlidesCarnival