Dataset mapping
Schema.org Dataset | Description | Type | BioSchemas Dataset | Type | Requirement Level | Use Cases | Google Dataset | Citation Metadata | DATS.Dataset | HCLS | DataCite | omicsdi | Data repositories / Data catalogue | DATS.DataRepository
Overall aim: findability, but not necessarily structured queries.
Data repositories / Data catalogue levels: M, R, O, NA (Mandatory, Recommended, Optional, Not Applicable). Inheriting from Schema.org "DataCatalog".
name | A descriptive name of a dataset (e.g., “Snow depth in Northern Hemisphere”) | Text | 1. name | Text | MUST | to search on a title of dataset | name | title | title | dc:title | title | done | M | name
description | A short summary describing a dataset | Text | 2. description | Text | MUST | to enable discovery by indexing on free text description | description | NA | description | dc:description | description | done | M | description
url | Location of a page describing the dataset. | URL | 3. url | URL | MUST | enable direct access, resolution of dataset | url | dataset identifier | identifier | foaf:page | identifier | done | M | identifier
sameAs | Other URLs that can be used to access the dataset page. | URL | | | | | sameAs | NA | alternativeIdentifier | | RelatedIdentifier | done | O | alternativeIdentifier
version | The version number for this dataset. | Text, Number | version | Text, Number | SHOULD | | version | version | version | pav:version | version | done | R | version
keywords | Keywords summarizing the dataset. | Text | 4. keywords | Text | MUST | to enable discovery by indexing on free text description | keywords | NA | keywords | dcat:keyword | | done | R | scopes
variablesMeasured (pending) | What does the dataset measure? (e.g., temperature, pressure) [sic: variableMeasured on Google's dataset page] | Text, PropertyValue | variablesMeasured | Text | SHOULD | allow restriction to specific dimensions and variables specifically recorded in a dataset (e.g. get all climate datasets which monitor CO2 concentration, or datasets where metabolite concentration was recorded) | variablesMeasured | NA | dimensions | NA | NA | done | NA | NA
creator | The name of the dataset creator (person or organization). | Person, Organization | creator | Text | SHOULD | allow attribution and credit | creator.name | creator | creator | dc:creator | creator | done | R - Provider/Contact? | publisher
includedInDataCatalog | The catalog to which this dataset belongs. | DataCatalog | 5. includedInDataCatalog | DataCatalog | MUST | to search according to the repository of the datasets | includedInDataCatalog | data repository or archive | distribution.storedIn | | publisher | | NA |
distribution | Description of the location for download of the dataset and the file format for download | DataDownload | | | | | distribution | | distribution | dcat:distribution | | | R |
distribution.fileFormat | The file format of this distribution | Text | | | | datasets may be made available in various forms; providing information about the format enables establishing compatibility with tools | distribution.fileFormat | | distribution.formats | dc:format | | | R |
distribution.contentUrl | The link for the download. | URL | | | | | distribution.contentUrl | | distribution.access (broader than just URL) | dcat:downloadURL | | | R |
citation | A citation for a publication that describes the dataset (e.g., “J. Smith 'How I created an awesome dataset', Journal of Data Science, 1966”) | CreativeWork, Text | citation | Text | SHOULD | | citation | | primaryPublication / citations | cito:citeAsAuthority, rdfs:seeAlso | citation/isBasedOn | | R (not all databases have a citation) |
license | A license under which the dataset is distributed. | URL, Text | licence | Text, URL | SHOULD | | license | | licenses | dc:licence | | | M | licenses
additionalType? | | | | | | | | | types | | resourceTypeGeneral | | |
identifier (pending) | Any kind of identifier for any kind of thing | | 6. identifier | PropertyValue, Text, URL | MUST
Taxonomies (PROPOSAL)
R
measurementTechnique (pending) | A technique or technology used in a Dataset (or DataDownload, DataCatalog), corresponding to the method used for measuring the corresponding variable(s) (described using variableMeasured). | | measurementTechnique | | SHOULD | allow restriction to how a dataset has been generated, usually combined with the facet VariablesMeasured (e.g. get all climate datasets where metabolite concentration was recorded, if acquired using 'measurementTechnique: mass spectrometry')
Cell Type (PROPOSAL)
Diseases (PROPOSAL)
Softwares (PROPOSAL)
Instruments (PROPOSAL)
Sample Protocol (PROPOSAL)
dataType (PROPOSAL)
Data Protocol (PROPOSAL)
primaryPublication (PROPOSAL)
publications: Associated with the dataset (PROPOSAL)
acknowledges (PROPOSAL): Grant
M - BioDBcore ID (for databases) (PROPOSAL)
conformsTo (PROPOSAL), https://github.com/schemaorg/schemaorg/issues/1516
M - identifiers:accessPattern (PROPOSAL)
M - identifiers:idPattern (PROPOSAL)
O - identifiers: testId (PROPOSAL)
M - Release: DataCatalog.dateModified
R - Tools: webAPIs (PROPOSAL)
O - Tools: SoftwareApplication
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
Loading...
 
 
 