Dec 9, 2019
Clinical data in dbGAP is stored in hundreds of files
For EACH consent group
Framingham heart study
Consent group1
Consent group 2
Consent group 3
Consent group 4
or
Investigator accesses FILES based on study and consent groups per study.
Then they need to decrypt the files and COMBINE them to run any analysis
On a dbGaP authorized project an investigator may have access to consents 1 and 2 �and on another dbGAP project they may have access to consents 2,3 and 4
Framingham heart study
Consent group1
Consent group 2
Consent group 3
Consent group 4
Via PIC-SURE API an Investigator accesses VARIABLES (and not FILES) based on study and consent groups per study.
Everything is ALREADY COMBINED to run any analysis
They can SEARCH and RETRIEVE across all data they are authorized
On a dbGaP authorized project an investigator may have access to consents 1 and 2 �and on another dbGAP project they may have access to consents 2,3 and 4
PIC-SURE API
R data frame or Python panda data frame
PIC-SURE API
Consent group1
Consent group 2
Consent group 3
Consent group 4
R data frame or Python panda data frame
Consent group1
Consent group 2
Consent group 3
Consent group 4
Consent group1
Consent group 2
Consent group 3
Consent group 4
Consent group1
Consent group 2
Consent group 3
Consent group 4
DCC 88 Harmonized variables with 17 studies (only 4 shown here)
Via PIC-SURE API an Investigator accesses VARIABLES (and not FILES) based on study and consent groups per study.
Everything is ALREADY COMBINED to run any analysis
They can SEARCH and RETRIEVE across all data they are authorized
On a dbGaP authorized project an investigator may have access to consents 1 and 2 �and on another dbGaP project they may have access to consents 2,3 and 4
platform
PIC-SURE User Interface
Fence
A
feasibility queries & cohort builder
To enable�C�Analysis
and
PIC-SURE API
Part 1) Phenotypic data preparation (before an investigator logs in)
Part 2) Phenotypic query in real time by an investigator across platforms
decrypt
TOPMED Data Coordinating Center�Harmonization process
platform
platform
Part 2) Phenotypic query in real time by an investigator across platforms
platform
PIC-SURE User Interface
Fence
and
PIC-SURE API
Part 1) Phenotypic data preparation (before an investigator logs in)
decrypt
TOPMED Data Coordinating Center�Harmonization process
platform
https://www.nhlbidatastage.org/
Then click on PIC-SURE logo bottom page
Access PIC-SURE?
https://github.com/hms-dbmi/PIC-SURE_API-BioDataCatalyst-examples