Genome wide association studies
Saket Choudhary
Introduction to computational multi-omics
DH 607
Lecture 22 || Wednesday, 23rd October 2024
dfdf
What is a SNP?
SNP = Position in genome where some individuals have one nucleotide (say G) and other individuals havea different nucleotide (say T)
How to profile mutations?
dfdf
How SNP arrays work
Oligonucleotide hybridization analysis
dfdf
Genetic testing is becoming ‘accessible’
Most genetic testing kits use affymetrix or Illumina SNP arrays
dfdf
How SNP arrays work
Goal: Determine the SNP at the A/C locus of the given DNA fragment
Affymetrix:
IIlumina:
For both platforms, we require algorithms to convert raw signal into SNPs
SNP arrays
How to associate mutations with phenotypes?
Genome wide association studies
Test for association
Fits a linear model for every variant, where the x axis is genotype and the y axis is a phenotype
Manhattan Plot shows significance of each variant’s association with a phenotype
Expected vs observed p=values
Functional follow up of GWAS
To prioritize likely causal variants, statistical fine-mapping is applied to identify a set of variants that are likely to include the causal variant as well as the most likely causal variant
Functional follow up of GWAS
General idea: Perturb the loci and measure the phenotype
Polygenic risk score
Disease are often associated with multiple (poly) genes. How can we assign risk scores to individuals based on their SNP profile?
GWAS:The dark side and the bright side
dfdf
Questions