| A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z | AA | AB | AC | AD | AE | ||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | Field Label | # of Instances | # of Coll. | # of Val. | Collection Usage | Free text/CV/Date | Usage Notes | Issues | Property Recommendations | Value Recommendations | Linked Data Recommendations | Broader Questions | ||||||||||||||||||||
2 | title | Title | 81568 | 48 | 61239 | Used in every collection except for road, mma, adaccess | Free text | Messy, slight variations across items that have the same titles (generally advertising collections) | Require field. Set policies for type of collection (archival, doc_phot, etc). | |||||||||||||||||||||||
3 | title | Headline | 78738 | 9 | 37420 | Used in advertising collections | Free text | Same as Title | Keep as Advertising-specific property. | |||||||||||||||||||||||
4 | alternative | Alternative_Title | 746 | 8 | 655 | duc; frankespada; gamble; hasm; italianposters; protfam; quartets; russianposters | Free text | Used to capture English translation when Title is in a non-English language, except in Gamble - there used to capture Chinese title as well as 'Original Image Title' | 25 instances of value 'None' in Frank Espada collection. Some instances seem more like description than title (eg: "Photographs of the principal individuals involved: James. B. Duke"). Some values that are showing up in alternative are in title field in CONTENTdm, was mapping changed when METS was generated? | Set policies for when to use - translations, supplied, etc? Use collection-specific labels to elucidate use. | Remove all instances of 'None'. Move descriptive information to Description field. | |||||||||||||||||||||
5 | alternative | First_Line | 2097 | 1 | 1930 | hasm | Free text | Same as Title | Keep as legacy field. | |||||||||||||||||||||||
6 | alternative | Refrain | 1373 | 1 | 1306 | hasm | Free text | Same as Title | Keep as legacy field. | |||||||||||||||||||||||
7 | creator | Company | 89909 | 10 | 14155 | Used in advertising collections | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes general heading (eg, 'Museums'), sometimes appears to have been transcribed. Lots of brackets, question marks, etc | 4626 instances of some form of 'N/A'; 2261 instances of some form of 'unknown'. Lots of 'Various'. Lots of brackets, question marks, etc. Slight variations in terms that result in many different values for the same entity. | Keep as Advertising-specific property. | Cluster and edit authoritative (or authoritative-like) headings. Remove brackets and question marks. Remove all instances of 'N/A' and 'unknown'. Move 'Various' info to description field. | Reconcile against LCNAF. | ||||||||||||||||||||
8 | creator | Composer | 17638 | 3 | 6888 | Used in music collections (hasm; quartets; sheetmusicindex) | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Transcribed values with authoritative values. Question marks. | Merge with Creator. Use Composer as collection-specific label. | Cluster and edit authoritative (or authoritative-like) headings. Remove question marks. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
9 | creator | Creator | 19119 | 26 | 899 | Used in all collections | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Transcribed values mixed with authoritative values (heading for GF Handel alongside 'A Lady', lots of initials). Some could be authoritative but are not (Paganini). Lots of quotation marks, allcaps (mostly Mazzoni), brackets, N/A. | Set as controlled vocabulary field. | Move 26 instances of creator in hasm to contributor field. Cluster and edit authoritative (or authoritative-like) headings. Remove question marks. Remove transcribed values, or move to Description. Excluding Mazzoni will make this a lot easier. | Reconcile against LCNAF. | ||||||||||||||||||||
10 | creator | Interviewer_Name | 446 | 1 | 29 | behindtheveil | Controlled vocabulary | Inverted personal names. One corporate body (Fox Television). Very clean | Local controlled vocabulary - most likely not found outside of Duke? Except for Fox. | Remap to contributor. | Reconcile against LCNAF. | |||||||||||||||||||||
11 | creator | Lyricist | 6826 | 2 | 3009 | hasm; sheetmusicindex | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Transcribed values mixed with authoritative values (heading for Irving Berlin alongside 'Author of O sleep my love', lots of initials). Some could be authoritative but are not (Paganini). Multiple values in field, separated by ';'. | Keep as legacy field. | Split multi-valued cells. Cluster and edit authoritative (or authoritative-like) headings. Remove question marks. Remove transcribed values, or move to Description. | |||||||||||||||||||||
12 | contributor | Arranger | 1233 | 2 | 682 | hasm; sheetmusicindex | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Transcribed values mixed with authoritative values. | Keep as legacy field. | Cluster and edit authoritative (or authoritative-like) headings. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
13 | contributor | Artist | 731 | 3 | 415 | adaccess; hasm; sheetmusicindex | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | What is the difference between this field and Illustrator? Transcribed values mixed with authoritative values. Some inverted, some not. Lots of initials, question marks, illegible. Some could be authoritative but are not (Dashiell Hammett) | Keep as legacy field. | Cluster and edit authoritative (or authoritative-like) headings. Remove question marks, illegible. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
14 | contributor | Choreographer | 16 | 2 | 13 | hasm; sheetmusicindex | Free text | Uninverted names | Uninverted names | Keep as legacy field. | Invert names. | Reconcile against LCNAF. | ||||||||||||||||||||
15 | contributor | Contributor | 1861 | 9 | 105 | behindtheveil;bloomsbury; caribbeansea; duc; dukechapel; hasm; meyermarshall; rubenstein; wlmpc | Mix of free text/authoritative | Transcribed values mixed with authoritative values. Some names are uninverted. Multiple values in some fields, separated by ';'. | Make controlled vocabulary field. (Larger question - same controlled vocabulary as Creator? How do we want to manage named entities?) | Split multi-valued cells. Invert names. Cluster and edit authoritative headings. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | |||||||||||||||||||||
16 | contributor | Dedicatee | 2589 | 2 | 1612 | hasm; sheetmusicindex | Free text | Uninverted names. Transcribed from item. | Transcribed values. Uninverted names. Multiple values in some fields. | Keep as legacy field. | Split multi-valued cells. Invert names. Cluster and edit authoritative headings. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
17 | contributor | Engraver | 2295 | 2 | 410 | hasm; sheetmusicindex | Free text | Uninverted names. Transcribed from item. | Stuff like "L. Johnson (electrotyper)" and "O.". Lots of initials, parts of names. Uninverted. Multiple values in some fields. | Keep as legacy field. | Split multi-valued cells. Invert names. Cluster and edit authoritative headings. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
18 | contributor | Illustrator | 1749 | 3 | 472 | adaccess; hasm; sheetmusicindex | Mix of free text/authoritative | Mostly transcribed. | What is the difference between this field and Artist? Lots of initials, parts of names, question marks, illegibles. Messy. | Merge with Artist | Invert names. Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
19 | contributor | Lithographer | 1021 | 2 | 257 | hasm; sheetmusicindex | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Lots of initials, variations on the same name, non-name stuff (eg: "Targis (the illustration following the music)") | Keep as legacy field. | Invert names. Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
20 | contributor | Performer | 2513 | 2 | 1560 | hasm; sheetmusicindex | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Lots of junk. variations on the same name. Uninverted names. Some could be authoritative but are not (Al Jolson). | Keep as legacy field. | Invert names. Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
21 | contributor | Placement_Company | 55425 | 6 | 822 | advertising collections | Mix of free text/authoritative | Transcribed - doesn't look normalized | Variations on the same names. Question marks, lots of brackets, illegibles. | Keep as Advertising-specific property. | Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
22 | contributor | Producer | 148 | 1 | 108 | sheetmusicindex | Mix of free text/authoritative | Mostly uninverted, transcribed. | Variations on the same names. Multiple values in some fields, separated with ','. | Keep as legacy field. | Invert names. Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
23 | contributor | Sponsor | 97 | 1 | 56 | adaccess | Mix of free text/authoritative | Mostly corporate bodies, names don't look normalized | Mostly corporate names but a few have values like 'Industry' and 'Sugar'. | Keep as legacy field. | Cluster and edit. Clean up. | Reconcile against LCNAF. | ||||||||||||||||||||
24 | contributor | Staging | 28 | 1 | 21 | sheetmusicindex | Free text | Mostly uninverted personal names. | Uninverted names. Not authoritative values (eg., 'Bob Fosse') | Keep as legacy field. | Invert names. | Reconcile against LCNAF. | ||||||||||||||||||||
25 | publisher | Publisher | 20157 | 7 | 3282 | hasm; jwtnewsletters; museesdeshorreurs; russianposters; sheetmusicindex; songsheets; wlmpc | Mix of free text/authoritative | Sometimes authoritative value of proper name, sometimes transcribed. | Lots of junk. Brackets, question marks, variations in same names, uninverted names. | Set as controlled vocabulary field. | Split multi-value cells. Invert names. Cluster and edit. Remove transcribed values, or move to Description. | Reconcile against LCNAF. | ||||||||||||||||||||
26 | date | Date | 130636 | 43 | 12146 | most of them | Free text/date | No consistent input format. | Brackets, question marks, dashes, nonnumeric stuff, variations of N.D., | Set as date data type | Split multi-value cells. Remove brackets, reformat consistently. Attempt to normalize using EDTF formats. | |||||||||||||||||||||
27 | date | Interview_Date | 405 | 1 | 136 | behindtheveil | Date | Consistent yyyy-mm-dd | None | Merge with Date field. Use Interview_Date as collection-specific label | None | |||||||||||||||||||||
28 | date | Interviewee_Date_of_Birth | 376 | 1 | 371 | behindtheveil | Date | Consistent yyyy-mm-dd | None | Keep as legacy field. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | |||||||||||||||||||||
29 | date | Issue_Date | 1148 | 1 | 380 | zines | Free text/date | Mix of formats (eg. '1984', '2000 Spring', 'Apr-93') - used as companion to 'Date_Created field in zines collection | None as it is used with Date_Created field. | Out of scope - excluding zines metadata from remediation. | If we consider the zines collection out of scope, we don't need to address this field usage immediately. | |||||||||||||||||||||
30 | created | Date_Created | 1448 | 1 | 26 | zines | Date | Consistent yyyy | None | Out of scope - excluding zines metadata from remediation. | If we consider the zines collection out of scope, we don't need to address this field usage immediately. | |||||||||||||||||||||
31 | modified | Date_Modified | 410 | 1 | 6 | behindtheveil | Date | Consistent yyyy-mm-dd | What does this refer to? Is this technical metadata? | Remove. | See property recommendations | |||||||||||||||||||||
32 | temporal | Temporal_Coverage | 21356 | 11 | 560 | adaccess; dsp; duc; eaa; garymonroe; gedney; hasm; italianposters; kwilecki; mma; trumpet | Free text/date | Mostly yyyy or yyyy-mm or yyyy-mm-dd, but free text too | Usage of 'x', question marks, '/', 's', ('191x0/191x9', '('1932', 'Oct')-10', '1870s/1879'). Also weirdly, 753 instances of the value 'or'. Most weirdnesses seem to be in gedney. | Not clear what the purpose of this field is in relation to Date. Generate list of items for which the value in this field is different from the value in the Date field, and try to figure it out. | See property recommendations | |||||||||||||||||||||
33 | type | DCMI_Type | 38627 | 32 | 7 | most of them | CV | DCMI type vocabulary | Every item should have this field. Slight variations in terms (Moving Image vs. MovingImage and Still Image vs. StillImage). | Make this the only Type field, require it for all items. | Limit values to DCMI type vocabulary. Attempt to apply to all items. | Replace with URIs from DCMI namespace. | ||||||||||||||||||||
34 | type | Genre | 87740 | 28 | 27 | CV | Largely AAT, some DCMI type erroneously thrown in | Includes DCMI terms. Not confirmed that AAT was consistently used. | Merge with Format. | Move DCMI terms to Type. Use AAT as controlled vocabulary. | ||||||||||||||||||||||
35 | type | Type | 120044 | 37 | 216 | sheetmusicindex accounts for lots of the weirdness | CV | AAT, DCMI, maybe local as well? | Unclear what CVs are used. AAT, DCMI, maybe LCSH and local headings? Variations in capitalization, misspellings. Some cells contain multiple values separated by ' ; ' | Remap to Format. Use Type for DCMI type. | Move DCMI terms to Type. Use AAT as controlled vocabulary. Cluster and edit values, map local terms to AAT. | |||||||||||||||||||||
36 | format | Format | 77107 | 5 | 53 | dukechapel; eaa; hfc; mma; road | Free text/CV | Mostly free text but some AAT terms. Physdesc stuff. | Should be CV field but is free text. All sorts of weird stuff (eg.: 'half-size; b/w photocopy, cardstock cover'. | Limit to AAT terms. Map values to AAT. Remove physdesc stuff to Description if possible. | Reconcile against AAT LOD | |||||||||||||||||||||
37 | medium | Medium | 86776 | 9 | 53 | advertising collections | Local CV? | Mostly a list of local headings for advertising collections, plus some outliers. | Improper use of dc:medium. Messy, slight variations, misspellings. Weird stuff like 'Collier's' and 'Charity'. | Move most instances to Format. Use this field only to describe physical medium of original. | ||||||||||||||||||||||
38 | extent | Extent | 22438 | 29 | 1275 | Free text | Number of pages, scans, audiocassetes; slides; dimensions, | sheetmusicindex almost entirely values that consist only of numbers, with no unit of measurement indicated. Frank Espada collection uses only the value 'None'. | Add unit of measurement to sheetmusicindex values, or remove them. | |||||||||||||||||||||||
39 | subject | Duke_Opponent | 1508 | 2 | 132 | rubenstein (only 1 instance); sportsfilms | Local CV | Out of scope - excluding sportsfilms metadata from remediation. | Not authorized versions of names | Out of scope - excluding zines metadata from remediation. | Map values to authoritative versions of names. | Reconcile against LCNAF | Do we want Subject fields to be divided into Subject-Topic, Subject-Geographic, Subject-Name? | |||||||||||||||||||
40 | subject | Interviewee_Gender | 406 | 1 | 4 | behindtheveil | Used to record the gender of the interviewee | Really only two values: male, female. Two values are 'Male; Female' and 'Female; Male'. | Do we really need this information in it's own field? UPDATE: Per John Gartrell, this is useful for researchers. As moving this information into a subject field would involve losing meaning (eg, 'men' to describe the collection instead of 'male' to describe the interviewee), I think we should retain the field as a legacy, but revisit developing an metadata application profile for oral history materials at a later date. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | ||||||||||||||||||||||
41 | subject | Interviewee_Occupation | 728 | 1 | 145 | behindtheveil | Local CV | Used to record the occupation of the interviewee | Local terms when authoritative CV could be used instead | Either: move this information to Description field, or merge with Subject-Topic field | Move to Description field. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | ||||||||||||||||||||
42 | subject | Product | 86437 | 10 | 18986 | advertising collections | Free text/local CV | Used to record the product being advertised | Mix of corporate bodies and general products (Coca Cola, horse racing). Brackets and question marks. Use of 'unknown' and 'n/a'. LOTS of slight variations resulting in multiple values for same entity. | Leave as legacy field. | LOTS of cluster and edit. Remove bracket, question marks, 'n/a', unknown. Try to use authoritative headings when possible. | Reconcile against LCNAF | This is a huge and messy field. I would like to attempt to clean it up but if it turns out to be too complicated we can consider leaving it segregated. | |||||||||||||||||||
43 | subject | Race | 110 | 1 | 2 | blake | Local CV | Used to record the race of the person/people depicted | Not the most sensitive way to deal with racial characteristics | Get rid of it. | Maybe move this information to the Description field? | |||||||||||||||||||||
44 | subject | Series | 12545 | 14 | 80 | behindtheveil; dukechronicle; dukengineer; frankespada; gamble; garymonroe; hleewaters; jesseandrews; jwtnewsletters; kwilecki; meyermarshall; russianposters; wlmpc; womenstraveldiaries | Free text | Used to indicate the archival series from the source collection finding aid, sometimes created for the digital collection | Potentially inconsistently applied. Used differently across collections? (behindtheveil appears to be geographic area, dukechronicle usage indicates edition type, dukengineer is title at time of publication, wlmpc is subject category). In the context of archival collections, how useful is it to have it in the record? Sometimes it isn't exactly as expressed in the finding aid anyways (eg jesseandrews) | If we want a field for including series information as pulled from the finding aid, use this field but map it to IsPartOf. Examine other uses and remap to subject, alternative, spatial, as necessary. | Keep only uses that indicate location in archival collection (if desired), and use consistently going forward. Ensure that series information is the same as listed in finding aid. | Part of larger conversation about how/whether or not to represent archival context in metadata records. We probably need to consult with Rubenstein on this. Potentially Noah can be their proxy. | ||||||||||||||||||||
45 | subject | Subject | 204553 | 44 | 8851 | most of them | Free text/local CV | Sometimes LCSH/LCNAF, sometimes locally defined, sometimes no apparent attempt at regularizing input | 25 use LCSH/LCNAF (or names are formatted as such). Advertising collections have their own weird list of terms. Gamble, gedney, hasm, mma, sheetmusicindex are all over the place. | Make controlled vocabulary field. See broader questions | Limit to LCSH. | Reconcile against authorities | Need to decide how to treat subjects. Currently separated into vocabulary-specific fields in CONTENTdm, subjects are dumped into a single field when Will creates METS files. We need a way to indicate the authority being used (LCSH, LCNAF, TGM, etc) | |||||||||||||||||||
46 | subject | Subseries | 2320 | 2 | 37 | jwtnewsletters; kwilecki | Free text | Used to indicate the archival subseries from the source collection finding aid | This isn't really subject information. | Remap to IsPartOf. Potentially concatenate with Series information. | Ensure that series information is the same as listed in finding aid. | Part of larger conversation about how/whether or not to represent archival context in metadata records. We probably need to consult with Rubenstein on this. Potentially Noah can be their proxy. | ||||||||||||||||||||
47 | subject | Tag | 144 | 1 | 5 | protfam | Local CV | Short list of local headings | Merge with Subject. | Map values to authoritative versions of names, if possible. | ||||||||||||||||||||||
48 | description | Description | 90088 | 39 | 51861 | most of them | Free text | Used to add contextual information in a free form manner. | A couple of collections have instances of "None" (caribbeansea, jesseandrews). Some contain untranslated foreign languages, others contain translations of foreign language in other fields. Some are very messy (sheetmusic index). | I don't know that we need to prescribe use of this field? | ||||||||||||||||||||||
49 | description | Illustrated | 1626 | 1 | 2 | songsheets | Yes/No | Used to indicate presence of illustrations | Either merge with Description field and include information about Illustrations there or don't map field to Description and leave as legacy field (if it's important for users to have this information in its own field). | |||||||||||||||||||||||
50 | description | Instrumentation | 3018 | 1 | 76 | hasm | Local CV | Used to indicate instrumentation | Messy, multiple values per field. Includes initialisms for vocal ranges. | Either merge with Subject field and move information about vocal ranges to Description, or don't map field and leave as legacy field (if it's important for users to have this information in its own field). | Cluster and edit; map existing terms to the LC medium of performance thesaurus for music | Reconcile against LCSH, LCMPT | ||||||||||||||||||||
51 | description | Setting | 41286 | 6 | 5 | advertising collections | Local CV | Used to indicate setting of billboard (I think) | Leave as legacy field. | |||||||||||||||||||||||
52 | description | Time_of_Photo | 336 | 1 | 336 | gedney | One value | Used to indicate time of day photo was taken | Only one term - 'Night'. Is it necessary for this information to be in a separate field? | Move this information to the Description field. | ||||||||||||||||||||||
53 | description | Tone | 31876 | 5 | 3 | advertising collections | Local CV | Used to indicate whether photo is color or black and white | One erroneous instance. Otherwise uniformly applied. | Leave as Advertising specific property | Use AAT terms 'black-and-white photographs' and 'color photographs' | |||||||||||||||||||||
54 | abstract | Abstract | 3 | 1 | 3 | vica | Free text | Only used 3 times for one collection. Could use Description field instead? | ||||||||||||||||||||||||
55 | spatial | City_of_Publication | 9914 | 3 | 427 | hasm; quartets; sheetmusicindex | Local CV | Used to record city of publication | Not normalized values. Some brackets, question marks, instances of 's.l.' and 'n.p.', misspellings, lower/upper case differences. Same place name in different languages (Milan, Milano). Multiple values in some cells. | Combine with Country_of_Publication, then merge with Spatial_Coverage. | ||||||||||||||||||||||
56 | spatial | Country_of_Publication | 6859 | 1 | 148 | sheetmusicindex | Local CV/free text | Used to record country of publication | Mix of country codes and names. Typos, erroneous entries ('1856', 'Oliver Ditson'). Multiple values per cell. | Combine with City_of_Publication, then merge with Spatial_Coverage. | ||||||||||||||||||||||
57 | spatial | Interview_Location | 410 | 1 | 59 | behindtheveil | CV | Used to record city where interview took place | Keep as legacy field. | LCSH headings are consistently used. | ||||||||||||||||||||||
58 | spatial | Interview_State | 410 | 1 | 11 | behindtheveil | CV | Used to record state where interview took place. LCSH | Seems redundant since state information is part of LCSH city headings. But good for broader geographic browsing. There is a better way to do this. | Keep as legacy field. | LCSH headings are consistently used. | |||||||||||||||||||||
59 | spatial | Interviewee_Birthplace | 381 | 1 | 193 | behindtheveil | CV | Used to record interviewee's city or county of birth. LCSH | This is not about the resource, but about the interviewee. | Keep as legacy field. | LCSH headings are consistently used. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | ||||||||||||||||||||
60 | spatial | Interviewee_Residence | 410 | 1 | 65 | behindtheveil | CV | Used to record interviewee's residence. LCSH | This is not about the resource, but about the interviewee. | Keep as legacy field. | LCSH headings are consistently used. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | ||||||||||||||||||||
61 | spatial | Interviewee_State_of_Birth | 98 | 1 | 13 | behindtheveil | CV | Used to record the interviewee's state of birth. LCSH | Inconsistently applied. This is not about the resource, but about the interviewee. | Keep as legacy field. | LCSH headings are consistently used. | This is about the interviewee, not the resource. We need a way to model named entities that aren't digital objects in the repository. | ||||||||||||||||||||
62 | spatial | Region_of_Publication | 2645 | 1 | 13 | zines | Local CV | Regions of U.S. | Not authoritative values ('Non-U.S.', 'South U.S.') | Out of scope - excluding zines from remediation. | ||||||||||||||||||||||
63 | spatial | Site_Alignment | 1507 | 1 | 1507 | sportsfilms | Local CV | "Home" or "Neutral" | Out of scope - excluding sportsfilms from remediation. | |||||||||||||||||||||||
64 | spatial | Spatial_Coverage | 59069 | 30 | 2013 | CV/free text | Sometimes free text, sometimes authoritative values | Messy. Brackets, question marks, typos, misspellings, 'unknown'. Authoritative and nonauthoritative representations of the same places. Sometimes not specific enough (eg. "Patterson"). | Strongly recommend if applicable. Rename field (Location?) | Cluster and edit. Convert to LCSH terms. | Reconcile against LC LOD | |||||||||||||||||||||
65 | spatial | State_of_Publication | 3018 | 1 | 87 | hasm | Local CV/free text | Sometimes free text, sometimes authoritative values | Messy. Inconsistent representation of state names. Sometimes not a state. Multiple values per cell. | Merge with Spatial_Coverage | Split multi-valued cells. Convert to LCSH terms | |||||||||||||||||||||
66 | spatial | Venue | 954 | 1 | 87 | sportsfilms | Local CV | Location of game, sometimes name of tournament too | Inconsistent | Out of scope - excluding sportsfilms from remediation. | ||||||||||||||||||||||
67 | language | Language | 10702 | 16 | 11 | behindtheveil; broadsides; caribbeansea; dukechapel; dukechronicle; dukengineer; hleewaters; italianposters; jwtnewsletters; kwilecki; meyermarshall; museedeshorreurs; rubenstein; russianposters; wlmpc; womenstraveldiaries | CV | Language of item | None | Make required/strongly recommend if applicable. | Use MARC language codes; display human readable version | Reconcile against LC LOD | ||||||||||||||||||||
68 | audience | Audience | 7308 | 1 | 11 | adaccess | Local CV | Intended audience of advertising | Short but messy and illogical ('farmers' alongside 'magazine') list of terms. 'Consumers' used 7123 out of 7308 instances. The application of this field needs better definition. | Get rid of term as it is used now - current application doesn't make sense. | Use Library of Congress Demographic Group Terms? | Reconcile against LC LOD | ||||||||||||||||||||
69 | identifier | Box_Number | 9706 | 10 | 142 | caribbeansea; dukechapel; frankespada; gedney; jesseandrews; jwtnewsletters; kwilecki; meyermarshall; ronaldreis; rubenstein | Free text | Used to indicate box number in archival collection | Mostly just contains a number, unit type is indicated by field label. This creates a problem when sharing collection metadata. Also it's mapped to dc:identifier but not really an identifier. | Consider getting rid of it. Or, if it's useful to have this as part of the metadata for functionality (retrieval for patrons or for aggregating materials in the digital collection interface), map field to isPartOf and include unit type as part of value. | If we keep it, add unit type (Box) to value. Or - can we store an archivespace component ID? | Ideally we wouldn't store this information in the item level metadata record but would instead use the structured data from the finding aid to represent this information | ||||||||||||||||||||
70 | identifier | Call_Number | 44 | 1 | 44 | quartets | Free text | Local call number of item? | If you search these values in the Call Number field from the Rubenstein website they yield no results. | Seems like a reasonable field to have but the contents are incomprehensible. Leave as legacy field? | ||||||||||||||||||||||
71 | identifier | Endeca_Identifier | 41 | 2 | 40 | earlymss; rubenstein | Free text | Keep as is. | ||||||||||||||||||||||||
72 | identifier | Folder | 4942 | 2 | 189 | broadsides; caribbeansea | Free text | Used to indicate folder number or label in archival collection | Not really an identifier. Broadsides contains folder number, caribbeansea folder label. | Caribbeansea folder label information could go in a description field. Broadsides folder number info could go into an isPartOf field. | If we keep it, add unit type (Folder) to value. Or - can we store an archivespace component ID? | |||||||||||||||||||||
73 | identifier | Identifier | 3606 | 7 | 3199 | eaa; garymonroe; hasm; italianposters; museedeshorreurs; wlmpc; womenstraveldiaries | Free text | Generally used to indicate location in archival collection (Box 1 etc) | Not really an identifier. Inconsistently applied. eaa contains only numbers, no unit type. museedeshorreurs only contains 'none'. It's unclear what the values used in hasm represent. | Remove all instances of none. Move information about location in archival collection to isPartOf. Prepend with unit type if missing. | We need to have a comprehensive strategy for managing identifiers, as well as representing context for archival collection. | |||||||||||||||||||||
74 | identifier | Interview_Number | 410 | 1 | 410 | behindtheveil | Free text | Interview number | Not sure what the interview number refers to | Determine whether or not this is a meaningful identifier - in the online interface the 'Item Identifier' is the DPC ID (I think). Merge with Identifier; use Interview_Number as collection-specific label | ||||||||||||||||||||||
75 | identifier | Issue_Number | 2201 | 1 | 142 | zines | Free text | Issue number | Out of scope - excluding zines from remediation. | |||||||||||||||||||||||
76 | identifier | Negative_Number | 4072 | 1 | 3746 | gedney | Free text | Photographer's negative number (I think) | Keep as is. | Keep as is. | ||||||||||||||||||||||
77 | identifier | OCLC_Number | 44 | 1 | 44 | quartets | OCLC/Worldcat ID number | Why is this recorded for this collection but not others? Do other Tripod2 materials have Worldcat records? | ||||||||||||||||||||||||
78 | identifier | Print_Number | 1807 | 5 | 1691 | frankespada, hmp, jesseandrews, kwilecki; ronaldreis | Free text | Photographer's print number | Keep as is. | |||||||||||||||||||||||
79 | identifier | Roll_Number | 5033 | 1 | 708 | gamble | Free text | Photographer's roll number | Not really an identifier? Consists of just a number or number/letter combo. | Either remap to IsPartOf or merge with Identifier and use Roll_Number as collection-specific label. | ||||||||||||||||||||||
80 | identifier | Volume | 2468 | 2 | 90 | dukengineer; dukechronicle | Free text | Volume number | Not really an identifier | Remap to IsPartOf | ||||||||||||||||||||||
81 | hasversion | Digitized | 22586 | 1 | 20726 | road | Free text | Includes link to digitized version in OAAA | ||||||||||||||||||||||||
82 | ispartof | Is_Part_Of | 4730 | 4 | 43 | diap; esr; hasm; rubenstein | Free text | Is Part Of' displays in the interface. In diap, sometimes value is a collection guide title, but sometimes just a personal name, with no indication as to what this means. In hasm, values are date ranges with no indication as to what this means. In rubenstein, values are collection slugs - what is rubenstein? | Figure out a more discernable label. Use consistently as this is a required field for DPLA (their label is 'Collection'). | When value is a collection guide, include link to said guide. Figure out what the personal names in diap refer to, what date ranges in hasm refer to. | This relates to how Source/Series/Subseries/Box/Folder information is represented in metadata. | |||||||||||||||||||||
83 | provenance | Provenance | 5680 | 17 | 48 | Free text | Provenance of physical item/collection | 158 instances of 'None'. Not used in all collections for which we presumably have provenance information? | Remove instances of 'None'. Some normalization/cleanup. | |||||||||||||||||||||||
84 | source | Source | 994 | 3 | 4 | blake; hleewaters; hmp | Free text | Name of archival collection from which digital collection was derived | This appears to be the same usage as IsPartOf fields. | Could go either way but I think we should just use IsPartOf for the archival collection, as this is what DPLA MAP uses for Collection. | Move all instances to IsPartOf | This relates to how IsPartOf/Series/Subseries/Box/Folder information is represented in metadata. | ||||||||||||||||||||
85 | source | Publication | 9234 | 4 | 635 | adaccess; eaa; mma; protfam | Free text | Name of publication which included ad | Messy, multiple iterations of same entity. Mix of authoritative and transcribed values. | Merge with Source; use Publication as collection-specific label. | Cluster and edit. | Reconcile against authorities. | ||||||||||||||||||||
86 | rights | Rights | 12191 | 19 | 27 | Less than half, should be all of them. | Free text | Free text rights statement. | Inconsistently applied. 831 have value of 'None' (caribbeansea). | Leave as is | Leave as is, other than a little bit of tidying. Out of scope for this phase of the project. | Going forward, this is going to be a broader conversation probably involving Kevin Smith and other people in the library. | ||||||||||||||||||||
87 | ||||||||||||||||||||||||||||||||
88 | ||||||||||||||||||||||||||||||||
89 | ||||||||||||||||||||||||||||||||
90 | ||||||||||||||||||||||||||||||||
91 | ||||||||||||||||||||||||||||||||
92 | ||||||||||||||||||||||||||||||||
93 | ||||||||||||||||||||||||||||||||
94 | ||||||||||||||||||||||||||||||||
95 | ||||||||||||||||||||||||||||||||
96 | ||||||||||||||||||||||||||||||||
97 | ||||||||||||||||||||||||||||||||
98 | ||||||||||||||||||||||||||||||||
99 | ||||||||||||||||||||||||||||||||
100 |