1 | This document is discussed in more detail at: | http://code.google.com/p/ala-datamob/wiki/RelatedRecords |
---|---|---|
2 | scenarios: | A scenario can be thought of as 'a circumstance of the data / data management', i.e. it's a way someone is gathering and managing their collection data which results in a relationship being formed, or captured, between two (or more) records. |
3 | entities: | These are logical (, generic, abstract) 'things' that participate in the scenarios, i.e. the things between which relationships exist; in the scope of the document, all scenarios can be broken down to interactions between: 1. specimen or sighting 2. a multimedia collection item 3. a duplicate Entities are limited to being either the 'object' or 'subject' of a relationship. There are obviously other metadata associated with each entity, such as the collector, or the location, or the date, or the url, etc. - these are beyond the scope of the document and therefore remain unaddressed; e.g. the relationship between a specimen and its collector is clearly defined by the group of collector-related metadata in whichever spec the reader is concerned with... |
4 | concepts: | Concepts could be considered more completely as 'concepts pertinent to record relationships' ... the concepts are also a reduction of the scenarios to a more common base - but a step in a slightly different direction than that of 'scenario and entity'. These is show that there are key concepts already addressed in the standards, and there are these terms to communicate these facts to data consumers. The concepts in the scenarios are not limited to being either the 'object' or 'subject' of a relationship, whereas the entities are. |
5 | entity-scenario matrix: | These sections attempt to identify instances of common ground shared amongst the scenarios, i.e. a 'lowest common denominator', with the goal of relating each pattern to a prescribed usage within a particular standard. For each discrete combination of entity-pairs, there is a pattern - a matrix; another way of putting it would be 'entity combinations'. For some thoughts on how these patterns are identified, see Appendix: Searching for common patterns in the scenarios, at: http://code.google.com/p/ala-datamob/wiki/RelatedRecords#Appendix:_Searching_for_common_patterns_in_the_scenarios |
6 | xy-pairs: | |
7 | dwc-synop: | Once the patterns are identified, we can use them to give rigour to the standards analyses in these sections; i.e. for each pattern, determine if it is currently supported by the standards, or if we need to address it with a non-standard extension. From this point we can make solid recommendations about what needs to be done under certain circumstances. |
8 | hispid-synop: | |
9 |
1 | Scenario | Description |
---|---|---|
2 | 1 | Specimen record holds metadata on multiple preparations |
3 | 2 | Specimen record is a single preparation, multiple preparations implying a whole (no longer existing) specimen |
4 | 3 | One or more preparations relate to one or more specimens through a cross-reference table (note: this situation has not been encountered - if this is your data, give us a yell! thanks...) |
5 | 4 | Discrete specimen a constituent part (piece, relationship, derivation) of another (both records exist) |
6 | 5 | A 'collection item' being a generalisation of the more specific (specimen) in any of the above scenarios |
7 | 6 | Any collection item, from which one or more observations (sightings, occurrences) are evident - these observations may not necessarily have a determination (taxon) |
8 | 7 | Any collection item representing another in a many to one relationship; eg: * a photo of a drawer full of specimens * a photo of a flock of birds * a digitisation of a sound-reel, having many recordings, each recording a discrete site visit, each recording capturing one or more taxa |
9 | 8 | DNA and one (or more) sequence(s) of (barcodes), each sequence matching one or more taxa (next-gen sequencing - http://www.google.com.au/search?q=nextgen+sequencing) |
10 | 9 | vouchering of specimens amongst herbaria |
11 | 10 | donation of collections; movement of collections amongst institutions |
12 | 11 | digitisation of type specimens, which are then shared in lieu of a physical loan on the original, and are subsequently handled in the recipient system in the same manner as other preserved specimens (for operational convenience) |
13 | 12 | a herbarium sheet, which has multiple related specimens attached and databased, is photographed once in its entirety |
14 | 13 | duplication of specimens within/amonsgt collection management systems |
15 | 14 | duplication of sightings in aggregators' data supply chains, i.e. harvesting from a primary data-producer, as well as a down-stream aggregator |
16 | 15 | collaboration between institutions in the field; specimen(s) gathered from a common observation (biological individual, sighting, occurrence) made on a joint expedition |
17 | 16 | many multimedia items associated with a single specimen/sighting (e.g. a site photo, painting prep'd spec, ...) |
18 | 17 | deriving sightings from sightings (?) |
19 | 18 | a photo of a skin (being a preparation of a specimen collected in the field) |
20 | ... Please give us a shout to add more scenarios! |
1 | Entity | Specimen or sighting |
---|---|---|
2 | Examples | Specimen, sighting, preparation, occurrence, observation, DNA |
3 | Defined | as a discrete specimen -or- sighting record (HISPID); by available metadata, i.e. completeness of the record (DwC); by implication, through preparations metadata (HISPID, DwC); |
4 | Details | These entities share the following attributes: - a collector or observer; - a taxon, a determination; - (possibly) a location in time and space; |
5 | Entity | Multimedia collection item |
6 | Examples | Image, sound, video, painting, dna-sequence... : |
7 | Defined | as a discrete General/Unit record (DwC/HISPID); by implication, through url metadata (DwC); by nesting, in discrete sub-objects (HISPID); |
8 | Details | - usually where there is not enough information associated with the data being represented to treat it as a specimen/sighting, then we can treat it as supplementary (and assuming there is a 'primary'/'core' record somewhere); - a representation of specimen(s), or evidence of sighting(s); - (possibly) metadata of the item such as usage rights, or authorship; |
9 | Entity | Duplicate |
10 | Examples | - a deliberate duplication of data at the process level, e.g. multiple pieces of the whole being distributed to many org's, with no one org keeping the whole; - a necessary duplication, e.g. a digitisation of a type specimen which is shared in lieu of a loan on the physical specimen, - an inadvertent duplication, e.g. an aggregator ingesting two copies of the same record - one from the data-producer and one from another data-aggregator, - a redundant dupliction, e.g. a specimen being digitised, the original specimen being 'de-accessioned' but not 'un-databased' |
11 | Details | This is a specialised case applicable to both the specimen/sighting or multimedia entities. What constitutes a duplicate can vary across the domains (see Appendix: Relevance to different biodiversity domains) but the up shot is roughly the same - two 'dots on a map' in the aggregate view, where there should really only be one: |
1 | Concepts | ||||
---|---|---|---|---|---|
2 | Description | Scenarios | Entities | Existing DwC terms | Existing HISPID concepts |
3 | specimen or sighting | 1, 2, 3, 4, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, ... | S/S | General (G); Occurrence (O); | /Unit (U); /SpecimenUnit (SU); |
4 | preparation of a specimen | 1,2,3,4 | S/S | O.preparations | /U/SU/Preparations; |
5 | digitisation & multi media (images, sound...) | MCI | O.associatedMedia -or- a discrete General record | /U/MultiMediaObjects (MMO); /U/Gathering/SiteImages (SI); /U/SU/Marks/Mark/Images; | |
6 | all (generic) relationships | ResourceRelationship (RR) | /U/Associations | ||
7 | sequencing/barcoding of dna | S/S, MCI | O.associatedSequences -or- a discrete General record | /U/Sequences | |
8 | duplications | S/S, MCI | /U/SU/History/PreviousUnits; SU/DuplicatesDistributedTo; | ||
9 | field collaboration | S/S, MCI, D | G.datasetName | /U/Assemblages; /U/Gathering/Project; /U/G/Code | |
10 | collecting events (gathering, site visit) * | Event (E); Location (L) | /U/Gathering (G) | ||
11 | basis of record * | dcterms:type; G.basisOfRecord; | /U/RecordBasis; /U/KindOfUnit; | ||
12 | record identifier (specimen/sighting) * | S/S | O.occurrenceID; O.catalogNumber; G.institutionX; G.collectionX; | /U/UnitID; /U/UnitGUID; /U/SourceInstitutionID; /U/SourceID; | |
13 | additional metadata for multimedia coll.item id * | MCI | MMO/MultiMediaObject/ID; G/SI/SiteImage/ID; | ||
14 | * note: included for convenience; may also be 'involved' in defining the relationship(s) |
1 | pattern prefix | pa | pb | pc | ||
---|---|---|---|---|---|---|
2 | pattern suffix | relates to | entity | Specimen/sighting | Multimedia coll.item | Duplicate |
3 | 1 | one only | specimen/sighting | 2,4,9,10 | 6,11 | 14 |
4 | 2 | many | specimens/sightings | 2,4,9,10,15 | 6,7,8,12 | 14 |
5 | 3 | one only | multimedia coll.item | 16 | 5 | |
6 | 4 | many | multimedia coll.items | 16 | 5,7 | |
7 | 5 | one only | duplicate | 13,14 | ||
8 | 6 | many | duplicates | 13,14 | ||
9 |
1 | Pattern | Primary (core) | RelationType | Supplement (child) | Scenario(s) | Notes |
---|---|---|---|---|---|---|
2 | a1 | Specimen/sighting | relates to one only | specimen/sighting | 2,4,9,10 | |
3 | a2 | Specimen/sighting | relates to many | specimens/sightings | 2,4,9,10,15 | |
4 | a3 | Specimen/sighting | relates to one only | multimedia coll.item | 16 | |
5 | a4 | Specimen/sighting | relates to many | multimedia coll.items | 16 | |
6 | a5 | Specimen/sighting | relates to one only | duplicate | 13,14 | |
7 | a6 | Specimen/sighting | relates to many | duplicates | 13,14 | |
8 | b1 | Multimedia coll.item | relates to one only | specimen/sighting | 6,11 | |
9 | b2 | Multimedia coll.item | relates to many | specimens/sightings | 6,7,8,12 | |
10 | b3 | Multimedia coll.item | relates to one only | multimedia coll.item | 5 | |
11 | b4 | Multimedia coll.item | relates to many | multimedia coll.items | 5,7 | |
12 | b5 | Multimedia coll.item | relates to one only | duplicate | ||
13 | b6 | Multimedia coll.item | relates to many | duplicates | ||
14 | c1 | Duplicate | relates to one only | specimen/sighting | 14 | |
15 | c2 | Duplicate | relates to many | specimens/sightings | 14 | |
16 | c3 | Duplicate | relates to one only | multimedia coll.item | ||
17 | c4 | Duplicate | relates to many | multimedia coll.items | ||
18 | c5 | Duplicate | relates to one only | duplicate | ||
19 | c6 | Duplicate | relates to many | duplicates |
1 | DwC synopsis | ||||
---|---|---|---|---|---|
2 | Pattern | (STANDARD) simple dwc | (NON-STANDARD) all dwc, simple/archive/xml | (STANDARD) dwc-archive/dwc-xml | Description |
3 | a1 | O.associatedOccurrences; O.associatedTaxa | principal- / parent record | O.associatedOccurrences; O.associatedTaxa; ResourceRelationships; | Specimen/sighting relates to one only specimen/sighting |
4 | a2 | O.associatedOccurrences; O.associatedTaxa | (2 only) principal- + parent record | O.associatedOccurrences; O.associatedTaxa; ResourceRelationships; | Specimen/sighting relates to many specimens/sightings |
5 | a3 | O.associatedMedia; O.associatedSequences; | principal- / parent record | O.associatedMedia; ResourceRelationships; | Specimen/sighting relates to one only multimedia coll.item |
6 | a4 | O.associatedMedia; O.associatedSequences; | (2 only) principal- + parent record | O.associatedMedia; ResourceRelationships; | Specimen/sighting relates to many multimedia coll.items |
7 | a5 | ResourceRelationships; | Specimen/sighting relates to one only duplicate | ||
8 | a6 | ResourceRelationships; | Specimen/sighting relates to many duplicates | ||
9 | b1 | principal- / parent record | ResourceRelationships; | Multimedia coll.item relates to one only specimen/sighting | |
10 | b2 | (2 only) principal- + parent record | ResourceRelationships; | Multimedia coll.item relates to many specimens/sightings | |
11 | b3 | NOT O.associatedMedia | principal- / parent record | ResourceRelationships; NOT O.associatedMedia | Multimedia coll.item relates to one only multimedia coll.item |
12 | b4 | (2 only) principal- + parent record | ResourceRelationships; | Multimedia coll.item relates to many multimedia coll.items | |
13 | b5 | principal- / parent record | ResourceRelationships; | Multimedia coll.item relates to one only duplicate | |
14 | b6 | (2 only) principal- + parent record | ResourceRelationships; | Multimedia coll.item relates to many duplicates | |
15 | c1 | principal- / parent record | ResourceRelationships; | Duplicate relates to one only specimen/sighting | |
16 | c2 | (2 only) principal- + parent record | ResourceRelationships; | Duplicate relates to many specimens/sightings | |
17 | c3 | principal- / parent record | ResourceRelationships; | Duplicate relates to one only multimedia coll.item | |
18 | c4 | (2 only) principal- + parent record | ResourceRelationships; | Duplicate relates to many multimedia coll.items | |
19 | c5 | principal- / parent record | ResourceRelationships; | Duplicate relates to one only duplicate | |
20 | c6 | (2 only) principal- + parent record | ResourceRelationships; | Duplicate relates to many duplicates |
1 | HISPID synopsis | ||||
---|---|---|---|---|---|
2 | Pattern | (NON-STANDARD) hispid-light | (STANDARD) hispid-xml | (NON-STANDARD) hispid-xml | Description |
3 | a1 | /Unit/Assemblages; /Unit/Associations; /Unit/SpecimenUnit/Preparations; | Specimen/sighting relates to one only specimen/sighting | ||
4 | a2 | /Unit/Assemblages; /Unit/Associations; /Unit/SpecimenUnit/Preparations; | Specimen/sighting relates to many specimens/sightings | ||
5 | a3 | /Unit/MultiMediaObjects (/U/MMO); /Unit/Gathering/SiteImages (/U/G/SI); | Specimen/sighting relates to one only multimedia coll.item | ||
6 | a4 | /Unit/MultiMediaObjects; /Unit/Gathering/SiteImages; | Specimen/sighting relates to many multimedia coll.items | ||
7 | a5 | /U/SU/DuplicatesDistributedTo; /Unit/Associations; | Specimen/sighting relates to one only duplicate | ||
8 | a6 | /U/SU/DuplicatesDistributedTo; /Unit/Associations; | Specimen/sighting relates to many duplicates | ||
9 | b1 | /Unit/Associations; | /U/MMO/MultiMediaObject/UnitID; /U/MMO/MultiMediaObject/UnitGUID; /U/G/SI//SiteImage/UnitID; /U/G/SI//SiteImage/UnitGUID; | Multimedia coll.item relates to one only specimen/sighting | |
10 | b2 | /Unit/Associations; | Multimedia coll.item relates to many specimens/sightings | ||
11 | b3 | /Unit/Associations? or nested under MMO/SI? | Multimedia coll.item relates to one only multimedia coll.item | ||
12 | b4 | /Unit/Associations? or nested under MMO/SI? | Multimedia coll.item relates to many multimedia coll.items | ||
13 | b5 | Multimedia coll.item relates to one only duplicate | |||
14 | b6 | Multimedia coll.item relates to many duplicates | |||
15 | c1 | /U/SU/History/PreviousUnits; /U/SU/Acquisitions; /U/Associations; | Duplicate relates to one only specimen/sighting | ||
16 | c2 | /U/SU/History/PreviousUnits; /U/SU/Acquisitions; /U/Associations; | Duplicate relates to many specimens/sightings | ||
17 | c3 | /Unit/Associations? or nested under MMO/SI? | Duplicate relates to one only multimedia coll.item | ||
18 | c4 | /Unit/Associations? or nested under MMO/SI? | Duplicate relates to many multimedia coll.items | ||
19 | c5 | /U/Associations; /U/SU/Acquisitions; | Duplicate relates to one only duplicate | ||
20 | c6 | /U/Associations; /U/SU/Acquisitions; | Duplicate relates to many duplicates |