Dataverse Croissant Update
Philip Durbin
Croissant Task Force Meeting
2024-06-12
What is Dataverse?
Demo!
Status of activities
Design influences
Croissant jar 0.1.2
citeAs vs. citation (Related Publication)
"citeAs": "@data{FK2/VQTYHD_2024,author = {Durbin, Philip and IQSS},publisher = {Root},title = {Max Schema.org},year = {2024},url = {https://doi.org/10.5072/FK2/VQTYHD}}",
"citation": [{
"@type": "CreativeWork",
"name": "Tykhonov, V., & Durbin, P. (2024, March 20). Croissant ML standard in the context of Dataverse, EOSC and beyond. Zenodo. https://doi.org/10.5281/zenodo.10843668",
"@id": "https://doi.org/10.5281/zenodo.10843668",
"identifier": "https://doi.org/10.5281/zenodo.10843668",
"url": "https://doi.org/10.5281/zenodo.10843668" }],
Geographic Coverage and Time Period
"spatialCoverage": [
"Cambridge, MA, United States, Harvard Square"
],
"temporalCoverage": [
"2023-01-01/2023-12-31"
],
distribution and recordSet
"distribution": [ {
"@type": "cr:FileObject",
"@id": "data/stata13-auto.dta",
"name": "stata13-auto.dta",
"encodingFormat": "application/x-stata-13",
"md5": "7b1201ce6b469796837a835377338c5a",
"contentSize": "6443",
"description": "",
"contentUrl": "http://localhost:8080/api/access/datafile/6?format=original"
}],
"recordSet": [ {
"@type": "cr:RecordSet",
"field": [ {
"@type": "cr:Field",
"name": "make",
"description": "Make and Model",
"dataType": "sc:Text",
"source": {"@id": "11","fileObject": {"@id": "data/stata13-auto.dta"}}
},
Concern: large Croissant files
Concern: date formats
Differences from Kaggle
Differences from pyDataverse
Differences from Schema.org JSON-LD
Once the QA'ed Croissant jar has been released...
Thank you!