We would appreciate if you could answer the questions in the questionnaire and return your answers by e-mail to: email@example.com - by 1 February 2018 at latest.
Please note that it is not an anonymous questionnaire, as we would like you to leave your contact details (name and e-mail) in case we would like to send follow-up questions, and also to share our final report with you afterwards. Your answers will however only be kept in our files during this project, and any information included in our report will be completely anonymized before publishing. Don’t hesitate to contact us if you have any questions, see our contact details below.
The ENVRIplus project is highly engaged in these questions on standards and practices for data citation and persistent identifiers. One of the ENVRIplus work packages focuses on data identification and citation, namely Work Package 6 entitled “Research Infrastructure data identification and citation services”. Within this sub project we are now sending out a questionnaire to publishers, PID service providers and other organizations engaged in providing services to research infrastructures. The questionnaire is also a follow-up on the ENVRIplus workshop “Closing the gap: The need for tools to identify, track and cite environmental research data” that was held in Hamburg in October 2017. Further information on the ENVRIplus workshop is available at: https://drive.google.com/drive/folders/0B1dxrrXyiW2aV2FLejZOSnI5S3c
Fragment datasetIn research there may be needed to make citations to subsets of a dataset, which is fragment dataset, on a very granular level, in particular when there is a constant change and update of the dataset.
Data collection In some cases there might be a need to gather, collect several datasets into a data collection.
Dynamic dataDynamic data refers to datasets that may change over time, e.g. because new data has been added, updates or changes of data have been made.
Queries to data storesInstead of storing many duplicates of subsets of data it is possible to create specific queries in order to identify and obtain certain subsets of data. The queries may also be stored in a query store, and thus possible to re-run and re-use.
Archival Resource Keys (ARK)The Archival Resource Key (ARK) is a Uniform Resource Locator intended to serve as a long-term persistent identifier. The system of ARK was developed by the California Digital Library in 2003.
Digital Object Identifiers (DOI)A Digital Object Identifier (DOI) is a unique identifier linked to a specific object, which must be a clearly defined piece of intellectual property. The system of DOI was introduced by the International DOI Foundation in 1998, and it makes use of the Handle System.
HandlesThe Handle System was invented by the Corporation for National Research Initiative (CNRI), and it facilitates the assignment of unique global persistent identifiers to locate digital resources over time, in a manner that is independent of current or future storage locations. The Handle System is used by thousands of organizations to assign persistent identifiers, for example the DOI system uses the Handle protocol.
Life Science Identifiers (LSID)The Life Science Identifier system was introduced by the Object Management Group (OMP) in 2004 as a system to uniquely name life science entities. LSID are being used by all globally leading providers for biodiversity data to identify organism names.
Persistent URL (PURL)A persistent uniform resource locator (PURL) is a uniform resource locator (URL) (i.e., location-based uniform resource identifier or URI) that is used to redirect to the location of the requested web resource. The PURL system was developed by the Online Computer Library Center (OCLC) in 1995.
Uniform Resource Name (URN)URNs were intended to serve as persistent, location-independent identifiers, allowing the simple mapping of namespaces into a single URN namespace.
• Socha, Y.M. Out of cite, out of mind: The current state of practice policy, and technology for the citation of data. Data Science Journal. 2013; 12 September.
• Hellström et al. A system design for data identifier and citation services for environmental RIs projects to prepare an ENVRIPLUS strategy to negotiate with external organisations, Work Package 6 – inter-RI data identification and citation services. ENVRIPLUS; 2017.
• ENVRIPLUS. Presentations for the workshop “Closing the gap: The need for tools to identify, track and cite environmental research data”. [Internet]. 2017 [cited date 2018-01-10]. Available from: https://drive.google.com/drive/folders/0B1dxrrXyiW2aV2FLejZOSnI5S3c
• Rauber et al. Identification of Reproducible Subsets of Data Citation, Sharing and Re-Use. Bulletin of the IEEE Technical Committe on Digital Libraries. 2016; 12(1).
• Dodds, et al. Creating Value with Identifiers in an Open Data World. Open Data Institute, Thomson Reuters; 2016.
Other ENVRIplus Work Package 6 contacts: • Alex Vermeulen, Margareta Hellström, Lund University / ICOS Carbon Portal• Frank Toussaint, Stephan Kindermann, DKRZ (Deutsches Klimarechenzentrum)• Robert Huber, Markus Stocker, Universität Bremen