1 of 44

Hae Kyung Im, PhD

Functional Genomics to Inform GWAS - QTL Analysis

April 20, 2022

2 of 44

3 of 44

4911 distinct loci associated with cancer traits are available as of 4/20/2022

4 of 44

The Perils of Polygenicity

4

Modified from https://twitter.com/DPosthu/status/1328662943739867137

Cancer Biology

5 of 44

Most GWAS catalog variants are non-coding

6 of 44

Altered Protein Levels Influence Disease Risk

6

Albert & Kruglyak 2015 NGReviews

7 of 44

Total mRNA and Splicing Affect Complex Traits

7

2016

8 of 44

  • 17,382 tissue samples
  • 54 tissues
  • 838 donors
  • mRNA-seq
  • WGS 30x

Goals

  • characterize genetic effects on the transcriptome across human tissues
  • link these regulatory mechanisms to trait and disease associations

The GTEx Consortium, 2020, Science 369:1318-1330

https://www.science.org/doi/10.1126/science.aaz1776

9 of 44

The GTEx Consortium, 2020, Science 369:1318-1330

10 of 44

Expression Quantitative Trait Loci

10

11 of 44

12 of 44

Expression and Splicing QTL as Function of Sample Size

12

13 of 44

Allelic Heterogeneity

13

The GTEx Consortium, 2020, Science 369:1318-1330

14 of 44

Trans-QTLs

14

The GTEx Consortium, 2020, Science 369:1318-1330

15 of 44

Sex-biased eQTL

15

The GTEx Consortium, 2020, Science 369:1318-1330

16 of 44

Population-biased eQTL

16

The GTEx Consortium, 2020, Science 369:1318-1330

17 of 44

Fine-mapping

18 of 44

Regional Plot (Locus Zoom)

18

19 of 44

Fine Mapping

  • Fine mapping: searching for the causal variant underlying an observed association
  • An association observed in GWAS often involves a large number of associated variants in LD. The causal variant is not obvious.
  • To “fine map” we need association results for ALL variants that could possibly be the causal variant, either by imputation or “fill-in” genotyping
  • For a given locus, we can build “credible sets” of SNPs that are 95% (or 99% or 99.9%) likely to contain the causal variant (based on Bayes Factors or Likelihoods) assuming a single causal variant (Maller et al, Nat Genet 2012)
  • or Posterior Probability of Causality (pip=posterior inclusion prob.)

19

20 of 44

Posterior Inclusion Probability: P(Causal given Data)

20

red circles: p values

bars: posterior prob. causal

yellow bar: causal SNPs

W. Chen et al, “Fine mapping causal variants with an approximate Bayesian method using marginal test statistics,” Genetics, 2015.

21 of 44

Example Top SNP Not Causal

21

SMA: strongest marginal association

SNP1: causal SNP

SNP2: causal SNP

https://rss.onlinelibrary.wiley.com/doi/10.1111/rssb.12388

22 of 44

Fine Mapping Methods

  • CAVIAR (Hormozdiari et al, Bioinformatics 2015)
  • Finemap (Benner et al, Bioinformatics 2016)
  • fastPAINTOR (Kichaev et al, Bioinformatics 2016)
  • DAP-G (Wen et al, AJHG 2016)
  • SUSIER (Wang et al, J. R. Stat. Soc. B, 2020)

22

23 of 44

Fine-mapping of cis-eQTLs in GTEx

23

The GTEx Consortium, 2020, Science 369:1318-1330

24 of 44

Fine-mapping EGR1 eQTL

24

The GTEx Consortium, 2020, Science 369:1318-1330

25 of 44

Enrichment of Functional Annotation for cis and trans QTLs

25

The GTEx Consortium, 2020, Science 369:1318-1330

26 of 44

Trans-eQTLs Enriched among cis-eQTLs, likely mediators

26

The GTEx Consortium, 2020, Science 369:1318-1330

27 of 44

GWAS-Associated SNPs are Enriched among cis-e/sQTLs

27

The GTEx Consortium, 2020, Science 369:1318-1330

Trans-eQTLs also enriched among trait-associated variants

28 of 44

Concordance of Mediated Effects of cis-eQTLs on GWAS Traits

28

The GTEx Consortium, 2020, Science 369:1318-1330

Given allelic heterogeneity, we tested whether the primary eQTL's mediated effect was similar to the secondary one

29 of 44

Trait Pleiotropy Associated with Allelic Heterogeneity and Cross-tissueness

29

The GTEx Consortium, 2020, Science 369:1318-1330

30 of 44

Tissue specificity of cis-QTLs

30

The GTEx Consortium, 2020, Science 369:1318-1330

31 of 44

ENCODE (Encyclopedia of DNA Elements)

  • Pilot phase started in 2003
  • Goal is to catalog all the functional elements in the human genome including
    • elements that act at the protein and RNA levels,
    • regulatory elements that control cells and circumstances in which a gene is active
  • Characterize functional elements of the Genome and Epigenome

31

32 of 44

ENCODE (Encyclopedia of DNA Elements)

32

This provides information on regions, not specific variants

33 of 44

Partitioning Heritability

34 of 44

Partitioning Heritability into Classes

34

Variance of each component is computed using

REML: Restricted Maximum Likelihood

GRM of each class is computed using only SNPs in the class

35 of 44

Partitioning Heritability by Chromosome

35

J. Yang,et al, “Genome partitioning of genetic variation for complex traits using common SNPs”, May 2011.

Height

BMI

36 of 44

Partitioning Heritability by Chromosome

36

vWF

QTi

Von Willebrand factor (vWF) is a blood glycoprotein involved in hemostasis

QT interval is a measure of the time between the start of the Q wave and the end of the T wave in the heart's electrical cycle.

37 of 44

MAF-stratified Heritability Estimates

37

J. Yang, A. Bakshi, Z. Zhu, G. Hemani, A. A. E. Vinkhuyzen, S. H. Lee, M. R. Robinson, J. R. B. Perry, I. M. Nolte, J. V. Van Vliet-Ostaptchouk, H. Snieder, T. Esko, L. Milani, R. Mägi, A. Metspalu, A. Hamsten, P. K. E. Magnusson, N. L. Pedersen, E. Ingelsson, N. Soranzo, M. C. Keller, N. R. Wray, M. E. Goddard, and P. M. Visscher, “Genetic variance estimation with imputed variants finds negligible missing heritability for human height and body mass index,” Nat Genet, vol. 47, no. 10, pp. 1114–1120, Aug. 2015.

38 of 44

Functional Partitioning of Heritability

38

Important to use imputed genotypes

A. Gusev et al, “Partitioning Heritability of Regulatory and Cell-Type-Specific Variants across 11 Common Diseases,” AJHG 2014.

39 of 44

Partitioning with LD Score Regression

39

GWAS results (effect size / standard error )^2 is Chi2 under Null of no association

40 of 44

How Do You Calculate Chi2 Statistic From Summary Results?

40

41 of 44

Stratified LD score regression

41

Finucane et al, “Partitioning heritability by functional annotation using genome-wide association summary statistics,” Nat Genet, Sep. 2015.

Combination of nine phenotypes with low phenotypic correlation and sample overlap

height, BMI, age at menarche, LDL levels, coronary artery disease, schizophrenia, educational attainment, smoking behavior, and rheumatoid arthritis

42 of 44

Gene Expression Traits: Functional Annotations Enrichment

42

43 of 44

Enrichment of Tissue Types

43

Finucane et al, “Partitioning heritability by functional annotation using genome-wide association summary statistics,” Nat Genet, Sep. 2015.

44 of 44

Examples of Partitioning by Functional Classes

44