Dataverse, Journals, and Sensitive Data
Gustavo Durand
Dataverse Technical Lead / Architect
Data-PASS Pre-APSA Workshop - August 30, 2017
Dataverse
Dataverse
Overview
Dataverse Features - Data
Dataverse Features - Users
Dataverse Features - Workflows
Dataverse Features - Interoperability
Dataverse Technology
Glassfish Server 4.1
Java SE8
Java EE7
Storage: Postgres, Solr, File System / Swift / S3
Dataverse Development Process
(some) Collaborations
Dataverse Community
Dataverse Community
Community
Journals
Journals
Journals
https://dataverse.org/journals
Permissions / Roles
Robust Permission System:
Review Workflow
If you have a Contributor role in a Dataverse you can submit your dataset for review when you have finished uploading your files and filling in all of the relevant metadata fields.
Private URLs
Creating a Private URL for your dataset allows you to share your dataset (for viewing and downloading of files) before it is published to a wide group of individuals who may not have a user account on Dataverse. Anyone you send the Private URL to will not have to log into Dataverse to view the dataset.
Sensitive Data
Sensitive Data
Infrastructure
DataTags
A datatag is a set of security features and access requirements for file handling.
A datatags repository is one that stores and shares data files in accordance with a standardized and ordered level of security and access requirements.
DataTags Levels
DataTags
PolicyModels
PolicyModels is a system for creating models of policies, and can be used to perform interactive interviews which yield a concrete treatment that is both human readable and machine actionable.
DataTags
Differential Privacy
What is Differential Privacy?
Differential Privacy
Differential Privacy
Differential Privacy is a formal, mathematical conception of privacy preservation.
It guarantees that any reported result does not reveal information about any one single individual, regardless of auxiliary information.
PSI (Differential Privacy)
Private data Sharing Interface
PSI - Budgeteer
The budgeteer allows users to select which statistics they would like to calculate and are given estimates of how accurately each statistic can be computed. They can also redistribute their privacy budget according to which statistics they think are most valuable in their dataset.
PSI (Differential Privacy)
Thank you!
Please get in touch with us!
Google Group, Github, IRC, Twitter - dataverse.org/contact
Dataverse Community Meeting 2018
June 13, 14, 15 at Harvard University