What information does DigitalNZ collect?
This document describes the most common fields used in the Digital New Zealand harvesting system. The field names and schema are loosely based on Dublin Core but have been developed over the years to be more flexible and better cater for Digital New Zealand’s wide range of content and varying metadata levels.
Below each field title are three keywords. They indicate whether the field is required, whether it is searchable, and whether it can contain multiple values.
Requirement status: Compulsory, Recommended or Optional
Available through search: Searchable or Not searchable
Single or multi-valued: Single-value or Multi-value
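The keyword line under each field title can be read mechanically. The following is a minimal sketch, assuming the keywords are always separated by " . " as they are in this document; the FieldSpec class and parse_keywords function are hypothetical names for illustration and are not part of any DigitalNZ tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldSpec:
    """Hypothetical container for the three keywords under a field title."""
    requirement: str   # "Compulsory", "Recommended" or "Optional"
    searchable: bool
    multi_value: bool


def parse_keywords(line: str) -> FieldSpec:
    """Parse a line such as 'Compulsory . Searchable . Multi-value.'"""
    parts = [p.strip() for p in line.strip().rstrip(".").split(" . ")]
    if len(parts) != 3:
        raise ValueError(f"expected three keywords, got {parts!r}")
    requirement, search, arity = parts
    return FieldSpec(
        requirement=requirement,
        searchable=(search.lower() == "searchable"),
        multi_value=(arity.lower() == "multi-value"),
    )


spec = parse_keywords("Compulsory . Not searchable . Single-value.")
print(spec)
```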
dnz : category
Compulsory . Searchable . Multi-value.
Explanation:
This field is used for filtering, via tabs, on the digitalnz.org site. It is also used by the API and possibly by other hosted or custom search integrations.
Category | Definition | Usage notes |
Images | Image is a primary/major component of resource. | Items can also use other categories such as Newspapers (e.g. images from Papers Past), Articles (e.g. Kete), or Books (e.g. images from books at NZETC). |
Video | Video is a primary/major component of resource. | Items can also use other categories such as Articles (e.g. a video included in a Te Papa Article). |
Audio | Audio is a primary/major component of resource. | Items can also use other categories such as Articles (e.g. Te Ara). |
Books | Digitised books in various forms, including ephemeral booklets; or born digital books. | May be combined with Images, if the image exists as a significant component of an individual item. |
Newspapers | Digitised newspapers | May be combined with Images, if the image exists as a significant component of an individual item. |
Manuscripts | Digitised unpublished archival material, such as letters, scrapbooks, diaries etc | Usually discrete collections, so they don’t need additional categories. Use dc_subject to add useful descriptors if not already supplied by the content partner (e.g. letters, diaries). |
Data | Datasets or unprocessed information. May be raw data, downloadable sets of resources such as map tiles/layers or csv files. | Due to the diversity of this set there may be times where it is appropriate to combine categories, e.g. if map tiles are available in a usable image format, or if data has been sourced in bulk from a newspaper collection. |
Research papers | Individual published outputs of research | Usually discrete collections, so they don’t need additional categories. Do check for Video. |
Articles | Single items or groups (e.g. journals) of news or opinion, fact or general information. Includes blogs. Not for use for newspaper articles; use “Newspapers” instead. Examples include guides and reports. | May be combined with Audio, Video or Images if there is enough contextual information to warrant it. However, if the main focus of the item is the information and the image is only illustrative (i.e. secondary), do not apply the Images category. Do use the thumbnail if possible. |
Archives | For other types of unpublished archival material not covered by Manuscripts etc. | Examples include Archives NZ NZDF Personnel Files |
Other | For items that do not fit into existing categories. | Things like interactive web resources, slideshows and any other outlying item types. |
Sets | DigitalNZ Sets created by users. | Not strictly an item category, but needed. |
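Because dnz:category is multi-valued and drawn from the fixed list above, a harvested record's categories can be checked against that list. This is a minimal sketch; the invalid_categories helper is a hypothetical name, and exact, case-sensitive matching is an assumption.

```python
# Category names copied from the table above.
CATEGORIES = frozenset({
    "Images", "Video", "Audio", "Books", "Newspapers", "Manuscripts",
    "Data", "Research papers", "Articles", "Archives", "Other", "Sets",
})


def invalid_categories(values):
    """Return the values that are not in the category list (hypothetical helper)."""
    return [v for v in values if v not in CATEGORIES]


# An item may carry more than one category, e.g. an image from a newspaper.
print(invalid_categories(["Images", "Newspapers"]))
# A singular "Newspaper" is reported, since the category is "Newspapers".
print(invalid_categories(["Images", "Newspaper"]))
```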
dnz : collection
Compulsory . Searchable . Multi-value.
Explanation:
Name of the collection(s) and/or website that the item belongs to. This field can also contain sub-collections or groupings within the main collection. This field is searchable but does not actually get displayed directly on digitalnz.org. A separate (single value) field named 'display_collection' containing the primary top level collection is used for displaying alongside the record and in the collection filter drop downs.
Example(s):
Papers past
Evening post
Grey river argus
Te ao hou
dnz : content_partner
Compulsory . Searchable . Multi-value.
Explanation:
Name of the organisation(s), institution(s), or individual(s) making content available through DigitalNZ. Usually the name of the organisation that has agreed to the DigitalNZ Metadata Contribution Terms.
Example(s):
Archives New Zealand Te Rua Mahara o te Kāwanatanga
Alexander Turnbull Library
dc : description
Compulsory . Searchable . Multi-value.
Explanation:
Descriptive information about the record. Can be a summary, abstract, or any information about the record. In most cases this is limited to 350 characters, except where the provided descriptions are clear abstracts or summaries from trusted sources that are not too much longer than 350 characters.
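The 350-character limit could be applied along the following lines. This is only a sketch: cutting at a word boundary and ending with an ellipsis are assumptions (the ellipsis mirrors the first example for this field), the shorten_description name is hypothetical, and the exception for trusted abstracts is not modelled.

```python
def shorten_description(text: str, limit: int = 350) -> str:
    """Collapse whitespace and cut the text to at most `limit` characters."""
    text = " ".join(text.split())
    if len(text) <= limit:
        return text
    # Leave room for the ellipsis, then drop the last (possibly partial) word.
    cut = text[: limit - 1].rsplit(" ", 1)[0]
    return cut.rstrip(" ,;:") + "…"


long_text = "A revolutionary racing motorcycle. " * 20
short = shorten_description(long_text)
print(len(short), short[-20:])
```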
Example(s):
"The drawing is from Burton Silver's book, `Why Cats Paint: a theory of feline aesthetics'. Published by Heather Busch and Burton Silver, …"
"During the late 1980s, Kiwi inventor John Britten developed and built a revolutionary racing motorcycle. He pursu
dnz : landing_url
Compulsory . Not searchable . Single-value.
Explanation:
An HTTP URL resolving to a landing page for the content.
Example(s):
http://www.nzmuseums.co.nz/index.php?option=com_nstp&task=showDetail&objectContext=&recordIdSet=1075
http://digital.otago.ac.nz/results.php?arguement=s1794_00&focus=Record&submit=GO
dnz : thumbnail_url
Compulsory . Searchable . Single-value.
Explanation:
A URL resolving to a thumbnail of the content for display in the search results list. Preferably around 250px wide. If the image is wider than 255px or taller than 138px it will be shrunk on the DigitalNZ website.
Example(s):
http://ndhadeliver.natlib.govt.nz/delivery/DeliveryManagerServlet?dps_pid=IE506404&dps_func=thumbnail
http://www.aucklandcity.govt.nz/dbtw-wpd/HeritageImages/Images/AWNt/1939/AWNS_19390215_p044_i001_x.jpg
http://i.tinysrc.mobi/150/http://digital.otago.a
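The size limits above can be illustrated with a small calculation. This sketch assumes the website shrinks an oversized thumbnail to fit within 255px by 138px while keeping its aspect ratio; the source only states that oversized images are shrunk, so the exact behaviour and the display_size name are assumptions.

```python
def display_size(width: int, height: int, max_w: int = 255, max_h: int = 138):
    """Return the (width, height) a thumbnail would be shown at (assumed behaviour)."""
    if width <= max_w and height <= max_h:
        return width, height  # already within the limits, left untouched
    scale = min(max_w / width, max_h / height)
    return max(1, round(width * scale)), max(1, round(height * scale))


print(display_size(250, 100))
print(display_size(510, 138))
```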
dc : title
Compulsory . Searchable . Single-value.
Explanation:
A name given to the resource or content object being described. If the titles of a series of resources are all identical, extra information such as a date or identifier can be appended to each title so that the titles can be told apart.
Example(s):
US forces at Camp McKay store
[Looking east from Ponsonby over Auckland City to harbour,....]
The paleoaustral genus Protodolium Wilckens, 1922 (Mollusca: Gastropoda), and a new species from the Late Cretaceous of Chatham Islands, New Zealand
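Appending an identifier to repeated titles might look like the sketch below. The record layout (plain dictionaries with "title" and "identifier" keys), the bracketed suffix and the disambiguate_titles name are all assumptions made for illustration.

```python
from collections import Counter


def disambiguate_titles(records):
    """Append the identifier to any title shared by more than one record."""
    counts = Counter(r["title"] for r in records)
    titles = []
    for r in records:
        title = r["title"]
        if counts[title] > 1 and r.get("identifier"):
            title = f"{title} ({r['identifier']})"
        titles.append(title)
    return titles


records = [
    {"title": "Portrait", "identifier": "A1"},
    {"title": "Portrait", "identifier": "A2"},
    {"title": "Harbour view", "identifier": "B7"},
]
print(disambiguate_titles(records))
```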
dnz : usage
Compulsory . Searchable . Multi-value.
Explanation:
Human-understandable label defined by the DigitalNZ project team.
For example, CC-BY-NC would have: Share and Modify
No known copyright would have: Share, Modify and Use commercially
If the status of the content is not identified by the content provider, the value Unknown is used.
Example(s):
A combination of: All rights reserved, Share, Modify, Use commercially, Unknown
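The two mappings given above can be written as a lookup table. This is a sketch only: the table holds just the two cases stated in this document, the usage_for name is hypothetical, and returning Unknown for a status missing from the table is an assumption.

```python
# Only the two mappings stated in this document; a real table would be longer.
USAGE_LABELS = {
    "CC-BY-NC": ["Share", "Modify"],
    "No known copyright": ["Share", "Modify", "Use commercially"],
}


def usage_for(rights_status):
    """Return the dnz:usage labels for a rights status (hypothetical helper)."""
    if not rights_status:
        return ["Unknown"]  # status not identified by the content provider
    # Assumption: a status missing from the table is also treated as Unknown.
    return list(USAGE_LABELS.get(rights_status.strip(), ["Unknown"]))


print(usage_for("CC-BY-NC"))
print(usage_for(None))
```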
dc : creator
Recommended . Searchable . Multi-value.
Explanation:
The name of the person, organisation, institution, service etc who created the content (e.g. the photographer or artist or writer or author). Often this will be the original copyright holder. Where the content is born digital, this may be the creator of the digital file. Where content has been digitised, it is usually the creator of the analogue source.
Example(s):
John Pine Snadden
dc : date
Recommended . Searchable . Multi-value.
Explanation:
Stores date information to be used in the filters/facets. Because it is used in the filters, this information is transformed during harvesting to a computer-readable format. DNZ cannot currently deal with date ranges; the workaround for now is to take the first date of the range. The date must also be expressed down to the day, so it defaults to 1 January if only a year is specified in the original source.
Example(s):
2011-12-15T00:00:00.000Z
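The rules above (take the first date of a range, default to 1 January when only a year is given) can be sketched as follows. This is not the harvester's actual code: the normalise_date name is hypothetical, and the sketch assumes the source date is written year-first, such as "1936", "2011-12" or "1920-1930". Other date styles would need additional parsing.

```python
import re
from datetime import date
from typing import Optional

# A four-digit year, optionally followed by -month and -day.
_DATE = re.compile(r"(?<!\d)(\d{4})(?:-(\d{1,2})(?:-(\d{1,2}))?)?(?!\d)")


def normalise_date(raw: str) -> Optional[str]:
    """Return the first date found in `raw` in the dc:date format, or None."""
    match = _DATE.search(raw)
    if match is None:
        return None
    year = int(match.group(1))
    month = int(match.group(2) or 1)  # missing month defaults to January
    day = int(match.group(3) or 1)    # missing day defaults to the 1st
    try:
        date(year, month, day)        # reject impossible dates such as 30 February
    except ValueError:
        return None
    return f"{year:04d}-{month:02d}-{day:02d}T00:00:00.000Z"


print(normalise_date("2011-12-15"))
print(normalise_date("1920-1930"))
```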
dnz : display_date
Recommended . Not searchable . Single-value.
Explanation:
Stores date information in any format. This field does not have to be computer readable like dc:date so can store any text based date expression.
Example(s):
circa 1960's
1920's to 1930's
c1895
15 Jan 1992
dnz : large_thumbnail_url
Recommended . Searchable . Single-value.
Explanation:
A URL resolving to an image which will be resized to 520px wide and displayed on the DNZ details page. Large thumbnails are preferably less than 1MB in file size and no less than 500px wide. This image will always be resized and cached. Refer to DigitalNZ Metadata Contribution Terms (http://www.digitalnz.org/about/terms-of-use/metadata-contrbution-terms) for the terms that allow for this.
Example(s):
http://www.aucklandcity.govt.nz/dbtw-wpd/HeritageImages/Images/AWNf/1936/AWNS_19360304_p044_i001_b.jpg
http://digital.otago.ac.nz/images/hocken/full/a1023.jpg
dnz : locations
Recommended . Not searchable . Single-value.
Explanation:
This field contains geographical location information, including latitude/longitude coordinates, text-based location information (e.g. city, region or street), and details about where the location information comes from (e.g. "Location provided by Museum of New Zealand Te Papa Tongarewa").
Example(s):
<entry lat="-40.0733" lng="+176.8967" area="North Island, Hawkes Bay" specific_location="750 m South of Tuingara Point, Pourerere, Hawkes Bay" comment="Location provided by Museum of New Zealand Te Papa Tongarewa"/>
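An entry in this form can be read with a standard XML parser. The sketch below uses Python's xml.etree.ElementTree on the example above; the parse_location name and the returned dictionary layout are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

SAMPLE = (
    '<entry lat="-40.0733" lng="+176.8967" area="North Island, Hawkes Bay" '
    'specific_location="750 m South of Tuingara Point, Pourerere, Hawkes Bay" '
    'comment="Location provided by Museum of New Zealand Te Papa Tongarewa"/>'
)


def parse_location(xml_text: str) -> dict:
    """Turn one <entry/> element into a dictionary (hypothetical helper)."""
    entry = ET.fromstring(xml_text)
    lat, lng = entry.get("lat"), entry.get("lng")
    return {
        # Coordinates are optional; float() accepts the leading "+" sign.
        "lat": float(lat) if lat is not None else None,
        "lng": float(lng) if lng is not None else None,
        "area": entry.get("area"),
        "specific_location": entry.get("specific_location"),
        "comment": entry.get("comment"),
    }


location = parse_location(SAMPLE)
print(location["lat"], location["lng"], location["area"])
```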
dnz : placename
Recommended . Searchable . Multi-value.
Explanation:
Human-readable place name. Ideally from a controlled vocabulary such as the New Zealand Gazetteer of Official Placenames (http://www.linz.govt.nz/placenames/find-names/nz-gazetteer-official-names) or the Getty Thesaurus of Geographic Names.
Example(s):
Castlepoint, Wairarapa, New Zealand
dc : rights
Recommended . Searchable . Single-value.
Explanation:
Rights information. Can be a rights statement explaining the rights of the record or a link to a webpage with more detailed rights information.
Example(s):
Crown Copyright 2009 administered through the New Zealand Ministry for Culture and Heritage Te Manatu Taonga.
dnz : rights_url
Recommended . Not searchable . Multi-value.
Explanation:
An HTTP URL resolving to a rights statement or terms-of-use page for the resource.
Where records have a rights statement rather than a URL, this is recorded in dc:rights.
Example(s):
http://www.georgegrey.org.nz/TermsAndConditions.aspx#websitecontentandconditionsofreuse
dc : subject
Recommended . Searchable . Multi-value.
Explanation:
A repeatable field containing keywords about the record content. The harvester will typically split any list separated by commas (or a similar character) into separate elements. A harvester may add useful subjects to records to aid findability and grouping if those terms don’t already exist in the data.
Example(s):
Clothing & dress
LB Theory and practice of education
Caring for sick
World War I
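The splitting described above could be sketched like this. The exact set of separator characters is not given in this document, so treating comma, semicolon and vertical bar as separators is an assumption, as is dropping repeated terms; split_subjects is a hypothetical name.

```python
import re


def split_subjects(raw: str) -> list:
    """Split a delimited subject string into separate, de-duplicated terms."""
    seen = set()
    subjects = []
    for part in re.split(r"[,;|]", raw):
        term = part.strip()
        key = term.lower()
        if term and key not in seen:
            seen.add(key)
            subjects.append(term)
    return subjects


# The ampersand is not a separator, so "Clothing & dress" stays whole.
print(split_subjects("Clothing & dress, World War I; Caring for sick"))
```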
dc : type
Recommended . Not searchable . Multi-value.
Explanation:
Terms describing general categories, functions, genres, or aggregation levels for content. This field is one of the main factors used to determine the TYPE search filter (such as memorabilia, magazine, people etc).
Example(s):
MovingImage
StillImage
Thesis
dc : contributor
Optional . Searchable . Multi-value.
Explanation:
If the content comes from an aggregation service this field can optionally provide more context as to the original organisation that provided the content.
Example(s):
School of Biological Sciences
Jordan, Bill
dc : coverage
Optional . Not searchable . Multi-value.
Explanation:
The extent or scope of the content of the resource. Coverage can include spatial (geographical) location or temporal period (a period label, date, or date range).
This is not the primary field for location or date information within DNZ and is mainly used to match existing metadata field names.
DNZ's primary spatial/location fields are dnz:locations (which holds latitude and longitude coordinates if available) and dnz:placename (place names as text).
DNZ's primary date fields are dc:date (strictly formatted date) and dnz:display_date (more flexible text-based date information).
Example(s):
Auckland Region (N.Z.)
1980
dc : format
Optional . Searchable . Multi-value.
Explanation:
Contains information about the physical attributes of the item/record. Often used to match an existing field name from the content source.
Example(s):
Watercolour/pencil
112 x 112mm
dc : identifier
Optional . Searchable . Multi-value.
Explanation:
Holds information such as content provider record IDs, ISBNs, serial numbers etc.
Example(s):
KF5-803
dc : language
Optional . Searchable . Multi-value.
Explanation:
Indicates the languages of the resource. Harvested as a language code like "en".
Example(s):
en
bo
dnz : object
Optional . Not searchable . Multi-value.
Explanation:
This field stores metadata about any objects associated with the item (e.g. high-res images, PDFs, documents, etc). The following attributes can be stored with each object (where applicable/available):
url, name, type, size, and thumbnail.
Example(s):
<entry url="http://aut.researchgateway.ac.nz/bitstream/handle/10292/2563/Published_paper1.pdf" name="Integration of vibro-acoustography imaging modality with the traditional mammography" type="document/pdf" size="1.816Mb" />
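The attributes listed above can be pulled out of such an entry and checked for gaps. This is a minimal sketch using Python's xml.etree.ElementTree; the parse_object name and the idea of reporting missing attributes are assumptions for illustration.

```python
import xml.etree.ElementTree as ET

# The five attributes named in the explanation for this field.
OBJECT_ATTRIBUTES = ("url", "name", "type", "size", "thumbnail")


def parse_object(xml_text: str):
    """Return (attributes found, names of attributes not supplied)."""
    entry = ET.fromstring(xml_text)
    found = {k: entry.get(k) for k in OBJECT_ATTRIBUTES if entry.get(k) is not None}
    missing = [k for k in OBJECT_ATTRIBUTES if k not in found]
    return found, missing


sample = (
    '<entry url="http://aut.researchgateway.ac.nz/bitstream/handle/10292/2563/'
    'Published_paper1.pdf" name="Integration of vibro-acoustography imaging '
    'modality with the traditional mammography" type="document/pdf" '
    'size="1.816Mb" />'
)
found, missing = parse_object(sample)
print(found["type"], found["size"], missing)
```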
dnz : object_url
Optional . Searchable . Single-value.
Explanation:
A single HTTP URL resolving to the resource or content object itself, such as a PDF or image file (where multiple versions exist, use the highest-resolution version).
Example(s):
http://digital.otago.ac.nz/images/specialcollections/full/s1788_3.jpg
http://www.mfe.govt.nz/publications/organisms/royal-commission-gm/definition-genetic-modification.pdf
dc : publisher
Optional . Searchable . Multi-value.
Explanation:
The name of the publisher of the digital object or the original physical object in the case of a digitisation.
Example(s):
Taylor and Francis
dc : relation
Optional . Searchable . Multi-value.
Explanation:
Information about a related resource. It could contain a URL to a related resource or just be the title of a related collection.
Example(s):
K E Niven and Co :Commercial negatives
dc : source
Optional . Not searchable . Multi-value.
Explanation:
Information about where the resource has come from or a reference to another resource from which the content is derived, such as a book in the case of a digitised book.
Example(s):
Amer. fiction, 1774-1910 (microfilm, 1970-1978 ed.), v. 2 (1851-1875), reel R-19, no. 2144A
Archives footage preserved and made available by Archives New Zealand/Te Rua Mahara o te Kāwanatanga