PCGL Driver Project
�
A
4
G
H
Mélanie Courtot, David Bujold and Ma’n Zawati
2024-04-23
Dream team of Canadian genomicists
Team
Why a pan-Canadian framework for genome data?
Unify Canada’s human genome sequencing efforts to prevent redundant work across many existing National Genomic Data Generation Projects
Set out a federated data management system that leverages international standards and respects limitations on the jurisdictional and cultural movement of human genetic data
Why a pan-Canadian framework for genome data?
Unify Canada’s human genome sequencing efforts to prevent redundant work across many existing National Genomic Data Generation Projects
Set out a federated data management system that leverages international standards and respects limitations on the jurisdictional and cultural movement of human genetic data
$15,000,000
5 years
Impact of the PCGL
For access and analysis of genomes sequenced in Canada to further research
To provide means and opportunities to facilitate clinical trials
To address equity, diversity and inclusion by capturing the Canadian genomic “variome”
To provide a key mechanism to improve effectiveness of healthcare delivery
To position Canada as a key player in international genomic research endeavours
Structure of the
PCGL project
PCGL data flows
PCGL initial driving use cases
Led by CGEn, contains whole genome sequences (WGS) and healthcare data for 11,000 individuals, recruited into 15 Canadian clinical studies during the COVID-19 pandemic.
HostSeq
Including AllforOne, Care4Rare and Genomics4RD Initiatives. AllforOne is expected to sequence approximately 9,000 samples as part of clinical care for rare diseases.
Rare disease community
MOHCCN
Led by the Terry Fox Research Institute to accelerate the adoption of precision medicine by uniting cancer centre efforts across Canada, has plans to generate 15,000 WGS
Silent Genomes
The Silent Genomes project is studying genetic/genomic health care barriers for Indigenous peoples of Canada with initial data from 600 First Nations participants.
Involvement in GA4GH
Partial table of WS participation
Name | Engagement |
David Bujold |
|
Melanie Courtot |
|
Karen Cranston |
|
Yann Joly |
|
Champions
Partial table of WS participation
Name | Engagement |
Guillaume Bourque |
|
Michael Brudno |
|
Jon Eubank |
|
Daisie Huang |
|
Gordon Krieger |
|
Francis Nguyen |
|
Ma’n H. Zawati |
|
Implementation
PCGL Technical Components
Data Submission, Archival, Processing
Phenopackets as a way to submit clinical data to the PCGL
Experiments Metadata to properly characterize experiments that generate sequencing data
Variants submission and storage as VCF
Data deposition and retrieval using DRS
Aligned readsets stored as CRAM
Pipeline execution standardized using WES
Data Access
Implementation of Authentication / Authorization across the federation using Passports for:
Data Discovery & Download
Datasets annotated with DUO codes to characterize usage
Beacon v2
Streaming of deposited data using htsget / rnaget (once access has been granted)
Clinical data exportable in Phenopackets
Governance, Ethics, International & Commercial Partnerships
Deliverables:
Interoperability with other GA4GH Driver Projects
Counting on GA4GH standards to increase cross-initiatives connection
Thank you!
The Pan Canadian Genome Library gratefully
acknowledges the support of CIHR