1 of 10

Dec 9, 2019

2 of 10

Clinical data in dbGaP is stored in hundreds of files

For EACH consent group

Framingham heart study

3 of 10

4 of 10

Consent group 1

Consent group 2

Consent group 3

Consent group 4

or

An investigator accesses FILES based on study and consent groups per study.

They then need to decrypt the files and COMBINE them to run any analysis.

On one dbGaP authorized project an investigator may have access to consents 1 and 2, and on another dbGaP project they may have access to consents 2, 3 and 4.

Framingham heart study
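The file-based workflow above can be sketched roughly as follows. This is a minimal illustration only: the project names, the consent-to-file mapping, and the CSV contents are all hypothetical stand-ins, and real dbGaP files would first need to be decrypted with the appropriate tooling.

```python
import csv
import io

# Hypothetical authorizations: project name -> consent groups it may read.
PROJECT_CONSENTS = {
    "project_a": {1, 2},
    "project_b": {2, 3, 4},
}

# Hypothetical stand-ins for already-decrypted files, one per consent group.
DECRYPTED_FILES = {
    1: "subject_id,age\nS1,54\nS2,61\n",
    2: "subject_id,age\nS3,47\n",
    3: "subject_id,age\nS4,70\n",
    4: "subject_id,age\nS5,58\nS6,66\n",
}


def combine_for_project(project):
    """Concatenate rows from every consent-group file the project may read."""
    rows = []
    for consent in sorted(PROJECT_CONSENTS[project]):
        reader = csv.DictReader(io.StringIO(DECRYPTED_FILES[consent]))
        for row in reader:
            row["consent_group"] = consent  # remember where each row came from
            rows.append(row)
    return rows


combined_a = combine_for_project("project_a")
combined_b = combine_for_project("project_b")
print(len(combined_a), len(combined_b))
```

The investigator has to repeat this combining step per project, because each project sees a different subset of consent groups.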

5 of 10

Consent group 1

Consent group 2

Consent group 3

Consent group 4

Via the PIC-SURE API, an investigator accesses VARIABLES (not FILES) based on study and consent groups per study.

Everything is ALREADY COMBINED to run any analysis.

They can SEARCH and RETRIEVE across all data they are authorized to access.

On one dbGaP authorized project an investigator may have access to consents 1 and 2, and on another dbGaP project they may have access to consents 2, 3 and 4.

PIC-SURE API

R data frame or Python pandas DataFrame
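To make the contrast with file access concrete, here is a toy sketch of variable-level search and retrieval. It is not the PIC-SURE client API: the in-memory store, the `search` and `retrieve` helpers, and all variable names and values are hypothetical, chosen only to show the idea of filtering already-combined variables by authorized consent group.

```python
# Toy in-memory store: variable name -> list of (subject_id, consent_group, value).
# All names and values are made up for illustration.
VARIABLES = {
    "age_at_exam": [("S1", 1, 54), ("S3", 2, 47), ("S4", 3, 70)],
    "systolic_bp": [("S1", 1, 128), ("S3", 2, 117), ("S5", 4, 135)],
    "smoking_status": [("S3", 2, "never"), ("S4", 3, "former")],
}


def search(term):
    """Return variable names containing the search term (case-insensitive)."""
    return sorted(name for name in VARIABLES if term.lower() in name.lower())


def retrieve(names, authorized_consents):
    """Build one row per subject, keeping only values from authorized consent groups."""
    table = {}
    for name in names:
        for subject, consent, value in VARIABLES[name]:
            if consent in authorized_consents:
                table.setdefault(subject, {"subject_id": subject})[name] = value
    return [table[s] for s in sorted(table)]


hits = search("bp")
rows = retrieve(["age_at_exam", "systolic_bp"], authorized_consents={1, 2})
print(hits)
print(rows)
```

The resulting list of row dictionaries has the shape one would load into a data frame; subjects in consent groups 3 and 4 never appear because the request is only authorized for groups 1 and 2.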

6 of 10

PIC-SURE API

[Diagram: four studies, each with consent groups 1 to 4, connected through the PIC-SURE API to an R data frame or Python pandas DataFrame]

88 variables harmonized by the DCC across 17 studies (only 4 shown here)

Via the PIC-SURE API, an investigator accesses VARIABLES (not FILES) based on study and consent groups per study.

Everything is ALREADY COMBINED to run any analysis.

They can SEARCH and RETRIEVE across all data they are authorized to access.

On one dbGaP authorized project an investigator may have access to consents 1 and 2, and on another dbGaP project they may have access to consents 2, 3 and 4.

7 of 10

platform

PIC-SURE User Interface

Fence

A

feasibility queries & cohort builder

To enable C Analysis

and

8 of 10

PIC-SURE API

Part 1) Phenotypic data preparation (before an investigator logs in)

Part 2) Phenotypic query in real time by an investigator across platforms

decrypt

TOPMed Data Coordinating Center harmonization process

platform

platform
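The two-part split on this slide can be illustrated with a small sketch: a preparation step that runs once, before any investigator logs in, and a query step that only reads the prepared index. The record layout, the study names, and the `prepare` and `query` functions are assumptions made for illustration and do not describe the actual PIC-SURE implementation.

```python
# Part 1 (before login): turn decrypted per-study records into one variable-keyed index.
def prepare(raw_records):
    index = {}
    for study, consent, subject, variable, value in raw_records:
        index.setdefault(variable, []).append(
            {"study": study, "consent": consent, "subject_id": subject, "value": value}
        )
    return index


# Part 2 (real time): answer a request using only the prebuilt index.
def query(index, variable, authorizations):
    """authorizations maps study -> set of consent groups the user may read."""
    return [
        entry
        for entry in index.get(variable, [])
        if entry["consent"] in authorizations.get(entry["study"], set())
    ]


# Hypothetical decrypted records: (study, consent group, subject, variable, value).
RAW = [
    ("study_x", 1, "X1", "bmi", 24.1),
    ("study_x", 2, "X2", "bmi", 29.8),
    ("study_y", 1, "Y1", "bmi", 31.0),
    ("study_y", 3, "Y2", "bmi", 22.5),
]

INDEX = prepare(RAW)
result = query(INDEX, "bmi", {"study_x": {1, 2}, "study_y": {3}})
print([entry["subject_id"] for entry in result])
```

Doing the decrypt-and-combine work ahead of time is what lets the query step stay a simple filter on study and consent group.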

9 of 10

Part 2) Phenotypic query in real time by an investigator across platforms

platform

PIC-SURE User Interface

Fence

and

PIC-SURE API

Part 1) Phenotypic data preparation (before an investigator logs in)

decrypt

TOPMed Data Coordinating Center harmonization process

platform

10 of 10

Access PIC-SURE?

https://www.nhlbidatastage.org/

Then click the PIC-SURE logo at the bottom of the page.

https://github.com/hms-dbmi/PIC-SURE_API-BioDataCatalyst-examples