1 of 21

Nick Krabbenhoeft, Assistant Director, Digital Preservation

Hilary Shiue, Digital Repository Coordinator

Digital Preservation Program, Digital Research, The New York Public Library

Preservica US User Group Meeting on August 24, 2022

The New York Public Library

Preservica Implementation

System Integrations and Data Model

The New York Public Library | 1

2 of 21

NYPL's Preservica Journey

2018-2021

  • Working with Accelerated Success
  • Learning the system
  • Learning what we can really do

2021-2022

Environmental Scans

Consultants

RFP Process

Demos

The New York Public Library | 2

3 of 21

NYPL's Context

3 Existing Descriptive Systems

  • III Sierra for catalog records
  • Archives Space for finding aids
  • Custom System for item-level descriptions

Focus on Preserving Physical Objects

  • Audio, video, film, and microfilm digitized for preservation not access

Multiple Access Strategies

  • digitalcollections.nypl.org
  • archives.nypl.org
  • Reading room access

The New York Public Library | 3

4 of 21

NYPL's Implementation Philosophy

Minimal description in Preservica

Store only identifiers to descriptions in other systems

Align digital object modeling with physical objects

Store files according to the objects their origin not their descriptions

Share data model across many types of objects

Use the same structure to simplify the logic to build new access methods

The New York Public Library | 4

5 of 21

Digital Object Modeling: Video

Digitization creates per-object packages of files

Video digitization (typically) produces

  • 1 preservation file (MKV/FFV1/FLAC)
  • 1 service file (MP4/H264/AAC)
  • Object Photographs (JPEG)
  • Digitization Metadata (JSON)

Object�SO

Metadata

SO

Contents

SO

SO: Structural Object (folder)

The New York Public Library | 5

6 of 21

Video

Media�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

Dig. MD IO

IO: Information Object (asset)

Dig. MD: Digitization Metadata

The New York Public Library | 6

7 of 21

Video

Media�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

Dig. MD IO

Representation: Preservation

Representation: Access

The New York Public Library | 7

8 of 21

Digital Object Modeling: Film

Use multiple representations when needed

Film digitization (typically) produces

  • 1 preservation file (MKV/FFV1/FLAC)
  • 1 mezzanine file (MOV/ProRes/PCM)
  • 1 service file (MP4/H264/AAC)
  • Object Photographs (JPEG)
  • Digitization Metadata (JSON)

The New York Public Library | 8

9 of 21

Film

Media�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

Dig. MD IO

Rep.: Pres. 1 (preservation)

Rep.: Pres. 2 (mezzanine)

Rep.: Acce. (service)

Rep.: Representation

Pres.: Preservation

Acce.: Access

The New York Public Library | 9

10 of 21

Digital Object Modeling: Audio

Use multiple content objects when needed

Audio digitization (typically) produces

  • 2 preservation file (FLAC/FLAC)
  • 2 edit file (FLAC/FLAC)
  • Object Photographs (JPEG)
  • Digitization Metadata (JSON)

The New York Public Library | 10

11 of 21

Audio

CO: Content Object

Media�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

Dig. MD IO

Rep: Pres 1

Rep: Access

Face 1 CO

Face 2 CO

Face 1 CO

Face 2 CO

Rep: Pres 2

Face 1 CO

Face 2 CO

The New York Public Library | 11

12 of 21

Minimal Description

3 Descriptive Systems, 3 Opportunities to Disagree

Timothy Leary Papers Finding Aid

A Digitized Item on Digital Collections Website

The New York Public Library | 12

13 of 21

Minimal Description

Store identifiers to each description

Each IO stores record identifiers from other systems like the barcode (catalog) and cms ID (collection management)

The New York Public Library | 13

14 of 21

Expanding the Model: Born-Digital

Handle born-digital disk contents like audio/video

Born-Digital Archival processing produces

  • 1 folder of content files (500+ file formats, so far)
  • Object Photographs (JPEG)
  • Tech Metadata Sidecars (XML/TXT/JSON)

The New York Public Library | 14

15 of 21

Born-Digital

File�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

DFXML�IO

Rep: Pres

Rep: Acce

File�IO

File�IO

Rep: Pres

Rep: Acce

Rep: Pres

Rep: Acce

FTK�IO

DFXML: Digital Forensics XML

FTK: Forensic Toolkit

The New York Public Library | 15

16 of 21

Expanding the Model: Timed Text

Audio content will need timed text

Transcripts and captions produced during:

  • Video digitization
  • NYPL-produced oral histories
  • NYPL-acquired oral history projects
  • (future) accessibility projects

The New York Public Library | 16

17 of 21

Timed Text

Media�IO

Photo�IO

Object�SO

Metadata

SO

Contents

SO

Dig. MD IO

Rep: Pres

Rep: Acce

Caption�IO

The New York Public Library | 17

18 of 21

Supporting Access

Staff Access (via Preservica UA)

Access in Reading rooms

Access on Public Web

The New York Public Library | 18

19 of 21

Supporting Access

+

Improvements desired for Universal Access

Playback support for media assets with multiple Content Object

The New York Public Library | 19

20 of 21

Thank You

Hilary Shiue, Digital Repository Coordinator

hilaryszuyinshiue@nypl.org

Nick Krabbenhoeft, Assistant Director, Digital Preservation

nickkrabbenhoeft@nypl.org

Digital Preservation Program

digpres@nypl.org

The New York Public Library | 20

21 of 21

Links

The New York Public Library | 21