1 of 42

Webinar 4: Digitization

in-house vs out-of-house

Southwest Region IMLS + NEH Cohorts

2 of 42

Digitization Planning May 26, 2021

Presenters: Kelli Hix & Marie Lascu��

#cawesome

In this session, participants learn about planning for digitization, with an emphasis on the differences between in and out of house digitization.

3 of 42

What has been covered so far:

  • Identifying, inspecting, and assessing film & magnetic media
  • Inventorying film & magnetic media
  • Metadata for creating inventory templates

4 of 42

This webinar will cover:

  • Planning for digitization

  • Determining whether to pursue
    • in-house digitization
    • out-of-house digitization with a vendor
    • a combination of both

  • Utilizing your inventory for
    • preservation planning
    • tracking preservation work

5 of 42

What is Digitization?

  • Analog signal (e.g. light, sound) → Digitally encoded format.

6 of 42

What is Digitization?

  • Analog waveform is approximated digitally.

7 of 42

Why Digitize?

A step toward preservation:

  • Many analog formats are obsolete, and need to be reformatted to non-obsolete formats to be usable.
  • Many analog formats are unstable and degrade over time.
  • Digital content can be copied exactly over and over again without “generation loss.”

For easier access:

  • It is easier to make copies of digitized content.
  • Digitized content can be shared electronically / online.

8 of 42

Digitization vs.

Digital Preservation

Digitization:

  • Reformatting an analog object into a digital representation.
  • Migrating a media carrier-based digital signal to a digital file form.

Digital Preservation:

  • Digitization is ONE step in the preservation process.
  • Multiple steps required to ensure preservation.
  • Digital files cannot survive through passive preservation.
  • Preservation is long-term.

9 of 42

Plan Before You Digitize!

Avoid common pitfalls:

  • Digitizing to a file format that you cannot play or open

  • Digitizing with a vendor that does not meet standards

  • Digitizing at low quality

  • Digitizing to optical media (not a preservation format)

10 of 42

Planning Foundations:

Create an Inventory

11 of 42

Planning Foundations: Develop Preservation Priorities

  • Which of our collections have the most important content to our organization?

  • What do we have that is unique to our collection?

  • What is the most unstable / endangered part of the collection?

12 of 42

Planning Foundations: Determine Digitization Strategy

In- House

Out of House

Combination

13 of 42

In house?

Or

Out of house?

  1. Financial Considerations

  • Collection Considerations

  • Human / Quality of Life Considerations

14 of 42

1

Financial

Considerations

What method is more cost effective?

  1. Labor costs
  2. Equipment costs
  3. Vendor costs

15 of 42

1

Financial

Considerations

Factors to include in cost estimate:

In-house:

  • 3 hours labor / 1 hour of content
  • Training cost
  • Equipment cost

Out of House:

  • Vendor estimate
  • 2-3 hours of admin and quality control labor per hour of content
  • Equipment / computers for performing quality control

16 of 42

2

Collection

Care Considerations

Advantages of Digitizing In-House:

  • No need to ship assets
  • You have control over quality
  • Sensitive materials / materials with access restrictions stay with your organization.
  • Improved knowledge about collection content for your organization

Advantages of Digitizing with a Vendor:

  • Specialized training and experience
  • Specialized equipment
  • May be able to handle mold, damage, and higher levels of decay, etc.
  • The vendor has built-in technical support

17 of 42

3

Human /

Quality of Life/

Community considerations

What is best for your staff and community?

In-house benefits:

  • Skill building
  • Job creation
  • Quality of life
  • Building Community
  • Building knowledge about the collection

Out of house benefits:

  • No need to purchase software, hardware, equipment
  • Potential for less technical skills hand-on

* The more technical knowledge you have, the better results *

18 of 42

So you decided to digitize in-house

Consider the space requirements

    • Soundproofing
    • Internet access
    • Power
    • Permanent set-up

Have backup equipment

Have an extra computer for file processing

Have a dedicated staff member

Have a network or advisor that can help with troubleshooting.

Consider sharing a digitization station or equipment - use existing community resources

19 of 42

So you decided to digitize out of house

  • Create a vendor agreement

  • Get an estimate

  • Plan staff roles

  • Test the workflow with a pilot project

  • Do not have assets shipped back until you have approved the files.

  • And so much more...

20 of 42

Digital Files

Project Planning includes:

    • Target formats for digitization

    • File naming conventions

    • File types and version control

    • Quality control considerations

  • Use your inventory for:

    • Estimating digital storage

    • Tracking digital preservation information

21 of 42

Target File Formats

Selection considerations

Questions to ask before choosing target file formats:

  • Can you store large digital files?

  • Can you back up digital media?

  • What are your long term goals for saving this media – both physical and digital?

  • What type of access do you want for your media?

  • How do you want to use your digital media?

22 of 42

Target File Formats

The ideal: Preservation-level file

a high quality digital transfer that would capture as much information as possible from the original source.

This is not the file used for any editing, screening, or other access-related projects.

In addition to the preservation file, you will want to create access copies - compressed or smaller formats - generated using your preservation file.

Selection considerations

23 of 42

Target File Formats: Film

35mm

File Format (image):

Digital Picture Exchange (DPX)

Codec: Uncompressed

Resolution: 4k

(4096 pixels)

Bit depth / Sample rate:

10 bit RGB

File Format (audio): BWAV

Codec: PCM

Bit depth / Sample rate: 24-bit, 48kHz

16mm, 8mm, Super 8

File Format (image): Digital Picture Exchange (DPX)

Codec: Uncompressed

Resolution: 2k

(2048 pixels)

Bit depth / Sample rate:

10 bit RGB

File Format (audio): BWAV

Codec: PCM

Bit depth / Sample rate: 24-bit, 48kHz

24 of 42

Target File Formats: Video

File Format:

QuickTime (MOV)

Codec (Image): Uncompressed 4:2:2

Bit depth / Sample rate:

(recommended) 10-bit

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

½” open reel, U-Matic, Betacam, Hi-8, VHS...

File Format:

Matroska (MKV)

Codec (Image): FFV1 version 3

Bit depth / Sample rate:

(recommended) 10-bit

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

...DV, MiniDV, DVCam...

25 of 42

Target File Formats: Video

File Format:

QuickTime (MOV)

Codec (Image): ProRes 422

Bit depth / Sample rate:

(recommended) 10-bit HQ

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

½” open reel, U-Matic, Betacam, Hi-8, VHS...

File Format:

Quicktime (MOV)

Codec (Image): DV-NTSC (DV-25)

Bit depth / Sample rate: 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

16-bit, 48kHz

DV, MiniDV, DVCam.

26 of 42

Target File Formats: Audio

File Format:

Broadcast Wave (BWF)

Codec: LPCM

Bit depth / Sample rate:

(recommended) 24-bit, 96kHz

(minimum) 24-bit, 48kHz

File Format:

Waveform (WAV)

Codec: LPCM

Bit depth / Sample rate:

(recommended) 24-bit, 96kHz

(minimum) 24-bit, 48kHz

¼” open reel, Compact audio cassette

27 of 42

File Naming

(This should sound familiar!)

Develop File Naming Conventions for digital film/video/audio files:

Establish conventions

  • Name files/folders in a consistent way
  • Keep file/folder names relatively short
  • Use unique IDs
  • Name in a way that makes files/folders easily searchable
  • Avoid characters like #$*&%/?:!@

28 of 42

Production Name (abbreviated)

Footage type sub folders

Unique ID applied by digital camera

EXAMPLE 2: Folder structure for digital production elements

EXAMPLE 1: File Naming Convention example

*Examples sourced from archiving.witness.org/archive-guide

Established convention: UniqueID_DateYYYYMMDD_CityName

Unique ID

Date

City Name

29 of 42

000001_Demo_2015-01-25_Raw_Preservation.mov

000001_Demo_2015-01-25_Raw_Mezzanine.mov

000001_Demo_2015-01-25_Raw_Access.mp4

000001_Demo_2015-01-25_Edit001.mov

000001_Demo_2015-01-25_Edit002.mp4

000001_Demo_2015-01-25_Raw_Transcript.doc

000001_001_Demo_2015-01-25_Raw_Preservation.mov

000001_003_Demo_2015-01-25_Raw_Mezzanine.mov

000001_004_Demo_2015-01-25_Raw_Access.mp4

000001_005_Demo_2015-01-25_Edit001.mov

000001_006_Demo_2015-01-25_Edit002.mp4

000001_009_Demo_2015-01-25_Raw_Transcript.doc

Unique ID_Title_DateYYYY-MM-DD_FootageType_FileType/Version

Unique ID(item)_UniqueID(file)_Title_DateYYYY-MM-DD_FootageType_FileType/Version

000001_001_Preservation.mov

000001_003_Mezzanine.mov

000001_004_Access.mp4

000001_005_Edit001.mov

000001_006_Edit002.mp4

000001_009_Transcript.doc

Unique ID(item)_UniqueID(file)_FileType/Version

EXAMPLE 1:

Assigning file types/versions

EXAMPLE 2:

Assigning file types/versions

EXAMPLE 3:

Assigning file types/versions

30 of 42

Quality Control

Making Sure Your New Digital Files Meet Your Specifications

Post-Digitization

Quality Control -

Procedures need to be established so that methods are consistent. QC must occur within a reasonable timeframe after files have been created (especially if created by outside vendor).

Responding to Vendor Errors -

Files from vendors must be reviewed immediately in case there are problems with the file. Errors should be clearly documented before communicating with the vendor. Make sure to know what the vendor’s policy is in dealing with errors (will they re-do for free, etc.).

31 of 42

Quality Control

Procedures to include

  • Is the filename correct?

  • Open/play the file - confirm the file works!

  • Spot check content -
    • Content correct in relation to inventory? (possible content was unknown!)

    • View/listen to file in at least 3 spots -

(beginning, middle, and end)

      • Audio in sync? other errors?
      • Video error from analog or transfer?

  • Confirm Technical Specifications
    • Correct file format?
    • Correct aspect ratio?
    • Audio and color information correct?
    • Correct additional embedded metadata?

  • How to deal with errors found...

32 of 42

Quality Control

Procedures to include

Responding to Vendor Errors:

Review files from a vendor immediately

(recommended window is 30 days)

  • Receiving staff should check for:
    • Correct number of files
    • Confirm requested technical specifications
    • Confirm requested additional metadata

Vendor errors to note are same as in-house work:

  • sync issues not present in the original asset
  • discrete audio channels that have been mixed
  • misidentified or improperly attributed file names
  • Etc. etc.

33 of 42

Estimating Digital Storage

Ideal: Enough digital storage available for files in advance, including for back-up

  • Plan in advance to avoid maxing out drives or having to halt digitization processes until new storage can be added

  • Digitizing in-house can allow for increasing storage incrementally

  • Material is sent in bulk to a vendor requires storage to be available as soon as vendor completes work

34 of 42

Estimating Digital Storage: Film

16mm, 8mm, Super 8

File Format (image): DPX

Codec: Uncompressed

Resolution: 2k

(2048 pixels)

Bit depth / Sample rate:

10 bit RGB

Storage estimate: ~765 GB/hr

File Format (audio): BWAV

Codec: PCM

Bit depth / Sample rate: 24-bit, 48kHz

Storage estimate: ~1 GB/hr

35mm

File Format (image): DPX

Codec: Uncompressed

Resolution: 4k

(4096 pixels)

Bit depth / Sample rate:

10 bit RGB

Storage estimate: ~4.3 TB/hr

File Format (audio): BWAV

Codec: PCM

Bit depth / Sample rate: 24-bit, 48kHz

Storage estimate: ~1 GB/hr

35 of 42

Estimating Digital Storage: Video

File Format:

QuickTime (MOV)

Codec (Image): Uncompressed 4:2:2

Bit depth / Sample rate:

(recommended) 10-bit

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

Storage estimates:

10-bit ~100 GB/hr

8-bit ~76 GB/hr

½” open reel, U-Matic, Betacam, Hi-8, VHS...

File Format:

Matroska (MKV)

Codec (Image): FFV1 version 3

Bit depth / Sample rate:

(recommended) 10-bit

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

Storage estimates:

10-bit ~40 GB/hr

8-bit ~40 GB/hr

36 of 42

Estimating Digital Storage: Video

File Format:

QuickTime (MOV)

Codec (Image): ProRes 422

Bit depth / Sample rate:

(recommended) 10-bit HQ

(minimum) 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

(recommended) 24-bit, 48kHz

(minimum) 16-bit, 48kHz

Storage estimates:

10-bit ~27 GB/hr

8-bit ~18 GB/hr

½” open reel, U-Matic, Betacam, Hi-8, VHS...

File Format:

Quicktime (MOV)

Codec (Image): DV-NTSC (DV-25)

Bit depth / Sample rate: 8-bit

Codec (audio): PCM

Bit depth / Sample rate:

16-bit, 48kHz

Storage estimate:

8-bit ~13 GB/hr

DV, MiniDV, DVCam

37 of 42

Estimating Digital Storage: Audio

File Format:

Broadcast Wave (BWF)

Codec: LPCM

Bit depth / Sample rate:

(recommended) 24-bit, 96kHz

(minimum) 24-bit, 48kHz

Storage estimate: ~2 GB/hr

File Format:

Waveform (WAV)

Codec: LPCM

Bit depth / Sample rate:

(recommended) 24-bit, 96kHz

(minimum) 24-bit, 48kHz

Storage estimate: ~1 GB/hr

¼” open reel, Compact audio cassette

38 of 42

Post- Digitization

Digital Storage Considerations

  • Did you begin the project with enough storage?

  • Is storage filling up faster than anticipated?

  • Can you make adjustments to the established workflow for future projects?

  • Do you have the ability to deal with sudden hard drive failures (i.e. purchase immediate replacements?)

  • It is impossible to anticipate every mistake.

  • It is important to be flexible, especially when dealing with ever changing technology.

39 of 42

Track this all in your inventory!

40 of 42

We Could Go On!!

41 of 42

Questions?

42 of 42

Thank you!