Machine Learning Chemistry
Final term paper presentation
Anshuman S. and Bhavay A.
1
MLC 8803
Geometry-enhanced molecular representations using X-attention
Scan the QR code for paper
The need to model molecules?
Problem Statement
Molecule
Feature descriptors
Or molecular fingerprints
Machine learning
Topology and Geometry relation
Motivation
Figure:
Atom-Bond Embedding
Bond Angle-Bond Embedding
Project deliverables
Major highlights of the Project
Atom-Bond graph
Bond-Bond angle graph
1
2
3
1,3
1,2
4
1,4
Property
Prediction
DL Model
Molecular representation
Work done
Methodology
Data (Graph) Preparation
SSL
X-attention
Downstream task
SSL: Contrastive learning
Step 2: Introduction to SSL
Input Graph
Node dropping
Edge perturbation
Edge attribute masking
Deep Mind
SSL: Contrastive learning
Step 3: Application
ZINC
dataset
Pre-trained Inner block (BAB)
Pre-trained Inner block (AB)
Training procedure
SSL Pre-training
Work by Pande group Stanford
Atom-Bond graph
Bond-Bond angle graph
Results
SSL Pre-training
Bond-Angle | Bond graph
Atom | Bond graph
Figure: Visualisation of the principle components of the embedding space
Figure: Training and Validation loss plots
InfoNCE Loss
Epochs
Validation
Training
X-attention
Structure-Property Multi-Modal foundation model
X-attention
Introduction
Atom-Bond graph embedding
Linear Layer
Fusion encoder
(Multi-modal)
Bond-Angle graph embedding
Combined Multi-
Modal embedding
2D property prediction
3D property prediction
X4
Downstream molecule
Pre-trained Inner blocks
Pre-trained Inner blocks
Application
X-attention
X-attention
Atom-Bond graph
Bond-Bond angle graph
Multi-modal
embedding
Step 6: Downstream predictions
Experiments
And further improvements
Conclusion
Publications | Tutorials
References