1 of 22

A Brave New PID: The DMP-ID*

Open Community Session

Maria Praetzellis

September 22, 2021

*Title from DataCite blog post

2 of 22

Agenda

What’s in a DMP

DMP-ID

Networked DMPs & the DMPTool

3 of 22

California Digital Library (CDL)/UC3

CDL founded by the University of California in 1996

�University of California Curation Center (UC3) is CDL’s program concerned with maintaining, preserving, and adding value to digital research data throughout its lifecycle UC3 areas of focus:

  • Research data management
  • Data publication and data metrics
  • Persistent identifiers
  • Digital preservation
  • Data/software skills training

4 of 22

DMPTool

Free, open-source community supported tool

+60K users, 310 participating institutions

Create next-generation, machine-actionable DMPs

5 of 22

What’s in a DMP?

6 of 22

Standard Components of a DMP

  • Data Collection
    • What data will you collect or create?
  • Documentation and Metadata
    • What documentation and metadata will accompany the data?
  • Ethics and Legal Compliance
    • How will you manage any ethical issues?
  • Storage and Backup
    • How will the data be stored and backed up during the research?
  • Selection and Preservation
    • Which data are of long-term value and should be shared, and/or preserved?
  • Data Sharing
  • Responsibilities and Resources

7 of 22

Our Ultimate Goal

The principle goal of a machine actionable DMP is to support the creation and stewardship of FAIR data

  • Allow data and information about research to be communicated and shared across stakeholders
  • Facilitating
    • notifications and verification
    • real-time reporting
    • automated compliance
  • maDMPs should lessen the administrative burden on researchers and grant administrators.

  • Implementing Effective Data Practices: Stakeholder Recommendations for Collaborative Research Support.

https://doi.org/10.29242/report.effectivedatapractices2020

8 of 22

NSF EAGER research

Active DMPs Grant 2018-2021

  • Developinging and implementing the metadata structure and technical features needed to facilitate machine-actionability of DMPs
  • Testing the hypothesis that we can connect DMPs to PID graph

FAIR Island Grant 2021-2023

  • Building interoperability between pieces of critical research infrastructure -- Data Management Plans (DMPs), open data policy, DOIs, and publications.
  • Piloting and continuing to develop the technical infrastructure built in our prior NSF grant.

9 of 22

Networked DMPs and the PID Graph

10 of 22

Identifiers connect research activities

DMPTool supports PIDs within a DMP:

  • DMP-IDs
  • RORs for research organizations
  • Funder Registry IDs for funders
  • ORCIDs for DMP creators and collaborators
  • Registry of Research Data Repositories (re3data)
  • Licenses (spdx)
  • RDA Metadata Standards Directory

11 of 22

DataCite Metadata Schema 4.4

Addition of new values to the resourceTypeGeneral property: OutputsManagementPlan

https://support.datacite.org/docs/datacite-dmp-ids

https://blog.datacite.org/announcing-dmp-ids/

12 of 22

PIDs for DMPs

Generating identifiers for DMPs create an unbreakable link between a data plan to the project outputs and allows access to DataCite’s supporting services such as Event Data to facilitate connections via the PID Graph in support of FAIR data.

13 of 22

Leveraging the PID Graph

Having unique persistent identifiers for researchers and their outputs is crucial to connecting pieces of the research landscape together.

PIDs already have the potential to enable the connected research graph, but we’re not yet taking full advantage of their connecting powers.

We can now clearly link PIDs together via relations in their metadata to enable the discovery of connections at least two “hops” away

14 of 22

Thanks to Erin Robinson, Metadata Game Changers, for the use of this graphic.

15 of 22

Landing page for a DMP-ID

Also via DataCite Commons: https://commons.datacite.org/doi.org/10.48321/d1f88s

16 of 22

17 of 22

Using the new DMP-ID

18 of 22

DMP-ID & ORCID Integration

DMP IDs generated via the DMPTool are now automatically linked to the DMP creator’s ORCID record.

19 of 22

Support for DMPs as living documents:

Electronic Lab Notebooks

Researchers will be able to keep DMPs up to date in the ELN, RSpace.

When datasets are published in external repositories the dataset DOI is then connected to the original DMP via related identifiers.

Demo of feature in testing environment

20 of 22

NSF EAGER: The FAIR Island Project for Place-based Open Science

Maria Praetzellis, John Chodacki, Neil Davies, Erin Robinson, Matthew Buys, & Catherine Nancarrow. (2021). EAGER: The FAIR Island Project for Place-based Open Science. Zenodo. https://doi.org/10.5281/zenodo.5117892

Data

Policy

(IDEA: Tetiaroa)

Data Management Plans

(CDL:

DMPTool)

Integrate with Existing Infrastructure

(UCNRS:

RAMS)

Continuing to Analyze, Iterate, and Improve

Expand these practices to other place-based research sites

21 of 22

Where to Learn More

22 of 22

Thank you!