STAGE Architecture Overview and User Narrative Breakout Prep
Proposed Activity
Brainstorming
Let’s brainstorm in this shared document
No wrong answers, let’s collect ideas.
Goal: how systems relate to each other (not the details inside your system).
Xenon
Project Management
User Management
Authentication & Authorization
System Monitoring
Usage Logging
Notification Service
Backup Service
Billing Management
Web Application
API
Task Execution API
Data/Metadata Service
Cloud Storage & Compute
Resource Manager
Core Platform Infrastructure
Data Infrastructure
Independent Core Services
Task Execution Infrastructure
Task Scheduler
Job Management Layer
Orchestration Layer
FAIR4CURES
DataSTAGE
Powered by Seven Bridges
PIC-SURE API
Part 1) Phenotypic data preparation (before an investigator logs in)
Part 2) Phenotypic query in real time by an investigator across platforms
decrypt
TOPMED Data Coordinating Center Harmonization process
platform
platform
Real time synonym search
Carbon
tranSMART
PIC-SURE User Interface
i2b2 Core
PIC-SURE Auth Micro-app
PIC-SURE API V2
Fractalis
PIC-SURE HPDS
User Interfaces
Backend Services
Datastores
AuthN / AuthZ
Monitoring
AWS Account
Data Flow
Auth Flow
ETL Client
A) feasibility queries & cohort builder
B) Exploration to generate hypotheses
C) Analysis
DataStore
Relational DB
Oracle + MySQL
HPDS
All logs ingested by Splunk
i2b2/tranSMART
platform
Carbon
DataStore
Relational DB
MySQL
HPDS
ETL Client i2b2/TM 18.1
Data Types
Clinical
Registries
Exome
Genome
i2b2/tranSMART
platform
Carbon
Ca+ Architecture
TOPMED WORKFLOWS
TERRA WORKSPACE
CROMWELL WORKFLOW ENGINE
JUPYTER INTERACTIVE ANALYSIS
WINDMILL DATA EXPLORER
U CHICAGO Indexd
ORIGINAL METADATA
Metadata harmonization across TOPMed datasets
Load by reference with GUIDs
ORIGINAL DATA FILES
DOCKSTORE TOOL REPOSITORY
Workflows via TRS
Metadata via BDBag
Data files via DOS
Workflows relevant to the TOPMed community
AUTH
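The "Load by reference with GUIDs" label in the diagram above can be illustrated with a minimal sketch. It assumes that a workspace stores only identifiers and that an index service maps each identifier to storage locations. The class names `IndexStub` and `Workspace`, the GUID string, and the bucket URL are all hypothetical; this is an in-memory stand-in, not the Indexd or Terra API.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class IndexStub:
    """Hypothetical in-memory stand-in for an index service (not the Indexd API)."""
    records: Dict[str, List[str]] = field(default_factory=dict)

    def register(self, guid: str, urls: List[str]) -> None:
        # Record where the data file for this GUID actually lives.
        self.records[guid] = list(urls)

    def resolve(self, guid: str) -> List[str]:
        # Look up storage locations for a GUID; raises KeyError if unknown.
        return self.records[guid]


@dataclass
class Workspace:
    """Hypothetical workspace that keeps GUIDs only; data files are never copied."""
    guids: List[str] = field(default_factory=list)

    def load_by_reference(self, guid: str) -> None:
        # Store the identifier, not the file contents.
        self.guids.append(guid)

    def locate_all(self, index: IndexStub) -> Dict[str, List[str]]:
        # Resolve every stored GUID to its locations at the time of use.
        return {guid: index.resolve(guid) for guid in self.guids}


# Illustrative values only; the GUID and URL are made up.
index = IndexStub()
index.register("guid-example-0001", ["s3://example-bucket/sample.cram"])

workspace = Workspace()
workspace.load_by_reference("guid-example-0001")
locations = workspace.locate_all(index)
```

In this sketch the workspace never holds file bytes, so moving a file only requires updating the index record, and every system that holds the GUID keeps working.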
Next Steps