BagIt
for data managers
problem space
use cases:
while being able to:
also:
motivation
interoperability.
(the kind that does not require standards committees or endless email exchanges to achieve or only works on some operating systems or requires more than 30 minutes to understand.)
problem space
that's it.
(if you think you're missing something profound, you're not.)
"bag" as metaphor
think of a package sent through the mail:
1. it is wrapped (the bag structure)
2. it contains the item being delivered (the payload)
3. it has labels on the outside about the sender and receiver (the tags)
4. it includes a list of the items being delivered (the manifests)
structure of a bag
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
structure: base directory
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
structure: payload
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
structure: tags
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
structure of a bag: bagit.txt
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
BagIt-Version: 0.97� Tag-File-Character-Encoding: UTF-8
structure: payload manifest
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [optional additional tag files]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
49afbd86a1ca9f34b677a3f09655eae9 data/27613-h/images/q172.png�408ad21d50cef31da4df6d9ed81b01a7 data/27613-h/images/q172.txt
structure: tag manifest
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [tagmanifest-<algorithm>.txt]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
structure: bag-info.txt
<base directory>/� | bagit.txt� | manifest-<algorithm>.txt� | [bag-info.txt]� \--- data/� | [payload files]� \--- [optional tag directories]/� | [optional tag files]
Source-Organization: Spengler University�Organization-Address: 1400 Elm St., Cupertino, Ca, 95014�Contact-Name: Edna Janssen�Bagging-Date: 2008-01-15�External-Identifier: spengler_yoshimuri_001
conformance: complete
conformance: valid
best practices
tools
experience at Library of Congress
more info