PhenoAge & PCage
Saket Choudhary
Computational Multi-omics of Ageing
DH 603
Lecture 07 || Wednesday, 2nd April 2025
PhenoAge
3
dfdf
PhenoAge: Predictor for ageing and lifespan
dfdf
From last class: GrimAge uses plasma protein and smoking years as surrogates
dfdf
PhenoAge Steps
https://www.cdc.gov/nchs/nhanes/index.html
What is NHANES?
https://www.cdc.gov/nchs/nhanes/index.html
dfdf
What is NHANES
dfdf
PhenoAge can predict mortality
dfdf
PhenoAge can also predict chronological age
PCAge
11
dfdf
PCAge
dfdf
Intraclass correlation coefficient as a measure of reproducibility
dfdf
CpG signal shows low reproducibility
dfdf
Default clocks lack reproducibility
dfdf
PCage uses PCA before running elastic-net regression
dfdf
Epigenetic clocks trained from principal components are highly reliable
dfdf
1D Principal Component Analysis (PCA)
Goal: Reduce the dimensionality of data by projecting the data into lower dimensional space such that the reconstruction error is minimized
dfdf
Minimizing reconstruction error = Maximizing variance
Original point
Projected
point
Reconstruction error
Variance
How can we determine the direction of maximal variance?
dfdf
What does covariance matrix tell you about the data?
Data aligned with axes and covariance is diagonal
Data oblique wrt axies and covariance is diagonal
Gaussian cloud
dfdf
Covariance matrix captures the general extent of data
Different distributions with same covariance matrix
dfdf
PCA rotates the axes to diagonalize the covariance matrix
Singular value decomposition
Geometry of Singular value decomposition
Singular value decomposition
What are singular values?
Principal Component Analysis - The recipe
dfdf
PCA: the optimization
PC1
PC2
28
Questions?