Arctos Collaborative Management and Shared Data Economies
Emily Braker, Arctos Education and Training Officer
NMNH Digitization Workshop, May 2023
>4.8 million records
52 Institutions
302 Collections
977K media
806K georeferenced localities
>39K citations|10,284 pubs
>37K GenBank links
53 institutions
302 collections
>4.8M records
Data Snapshot
Shared Data Environment
Controlled vocabularies and shared authorities
Highly structured and normalized data
Agents
Taxonomy
Geography
Media
Publications
Projects
Arctos Virtual Private Database Model
with Shared Tables and Vocabularies
Shared
Arctos Entity Relationship Diagram
All roads lead to agents!
Agents:
Collectors
Determiners
Preparators
Authors
Georeferences
Loan Contacts
Media Creators
Organizations
Etc.
Relationships
Attribute Determinations
Transactions
Publications
Identifications
Name Variants
Media
Records
Addresses & Biographical Remarks
Projects
Georeferences
& Event
Assertions
AGENT ACTIVITY
Agent Activity Summary
Agent Activity for Robert L. Rausch
Agents
Multiple
institutions
Familial, Academic, Professional
Collaborative Edits
Suggest and Merge Duplicates
Find inconsistencies
Data Quality Tools
Summarize and filter activity
?
Pathways:
Taxonomy
Clone External Classifications into Local Table
Shared Localities
Georeferencing
Places
Any overlapping
collecting locality
Locality resampled through time
1910
1960
2010
Examples:
Diverse taxa from same collecting event
= Efficiency, consistency, discoverability gains
Digitization/Extended Specimen Tools
Citations
5 institutions: DMNS, MSB, UAM, UCM, UMNH
Media
API
Projects
LINKED PROJECTS
BERINGIAN COEVOLUTION PROJECT (BCP)
AGENTS
DESCRIPTION
MEDIA
USAGE
PUBLICATIONS
CATALOG RECORD
PROJECTS
Contributed: The Robert and Virginia Rausch Helminthological Studies
Contributed by: Beringian coevolution project
parasite of MSB:Mamm:158238
RELATIONSHIPS
MEDIA
Locality
Agents
Citations
Parts & Attributes
Genetic sequences
Identifiers and External Linkages
MSB:Para:1252
Arostrilepis cooki
Collated Project data
(multi-collection, cross-institutional)
Secondary and tertiary Project products
Primary source data
PROJECT
PROJECT
PROJECT
Projects
Relationships
Museum of Vertebrate Zoology
MVZ:Mamm:225308
Tamias alpinus
Museum of Southwestern Biology
MSB:Para:27057
Rauschtineria eutamii
COLLECTING EVENT
Cirque Lake, Inyo Co,
CA USA
8 July 2010
MEDIA/FIELD NOTES
Cited in 3
PUBLICATIONS
3 DNA
SEQUENCES
DNA
ISOTOPE Analysis
Cited in 1 PUBLICATION
HOST
PARASITE
5 MVZ chipmunks from same locality
2 MSB parasites from same lot
Tertiary Extension
DATA PRODUCTS
Tertiary Extension
DATA PRODUCTS
CITES 300
MVZ mammal specimens
DATASETS citing DMNS, UMNH and other MSB specimens
1 PROJECT: Grinnell Resurvey (NSF-funded)
2 PhD DISSERTATIONS
Relationship DATA PUBLISHED to GloBI
Dryad
DATASET
CT MEDIA hosted on
MorphoSource
NCBI
RELATIONSHIP
Community
https://github.com/ArctosDB
Community
Decisions
Webinars, Video Tutorials
Community-curated Documentation
Collaborative Learning
Shared Workflows
Shared Resources
Shared Interns
Virtual Coffee and Office Hours
Peer Mentorship (incoming collections)
Wrap up
Collaborative Data Model:
CollaBEARative?
We put the BEAR
in collaborative!
Learn More:
Arctos Website: arctosdb.org
Arctos GitHub: https://github.com/ArctosDB
Contact: arctos-working-group-officers@googlegroups.com