3-dimensional genome. PartII.�Chromatin features detection.
3D organisation of the chromatin revealed by Hi-C
Genome-wide Hi-C interaction map shows intrachromosomal (same as cis-) and interchromosomal (trans-) interactions.
2
Chromosome territories
3
Bonev et al. Nature Reviews 2016
How to deal with interchromosomal contacts?
4
Cis-to-trans ratio: m1/m2
mean (m1)
mean (m2)
There is no strong definition of cis-to-trans ratio. Variations of this one can be calculated in different ways: as average across all cis- divided by the average of all trans-contacts; as feature for genomic bin (ICF); as value for each pair of chromosome.
ICFj=mean(cis)/mean(trans) – for binj
binj
Intrachromosomal interactions: TADs
5
Bonev et al. Nature Reviews 2016
TADs detection
6
Какие ТАДы правильные?
TADs detection
7
8
Categories of TAD hierarchy callers
Xu, J., Xu, X., Huang, D. et al. Nat Commun 15, 4376 (2024). https://doi.org/10.1038/s41467-024-48593-7
9
10
Armatus allows the wide range of TADs sizes.
Matryoshka is a widely used Armatus-based tool for hierarchical TADs calling
HiCExplorer covers the most part of genome, avoiding strange gaps
Insulation score – a measure of local chromatin density
Convenient implementation: Cooltools
11
Directionality index
12
A – upstream
B - downstream
E=(A+B)/2
+HMM on the top -> TADs borders
Reciprocal insulation
13
TADs detection
The number and size of TADs depend on:
TADs are hierarchical -> the better the resolution, the smaller TADs we can obtain
14
15
Approach | Caller | Input format | Main language | Parameter |
Liner Score | Arrowhead | hic format | Shell, Awk, Java | 1 |
Armatus | dense matrix, sparse matrix, Rao format* | C++, Python | 1 | |
CaTCH | catch format** | C, R, Shell | 0 | |
HiTAD | cool format*** | Python | 1 | |
matryoshka | dense matrix, sparse matrix, Rao format | C++, Shell | 1 | |
OnTAD | dense matrix, hic format | C++ | 2 | |
Multi-CD | dense matrix | Matlab | NA | |
Clustering | IC-Finder | dense matrix, sparse matrix | Matlab | NA |
TADpole | dense matrix | R | 6 | |
BHi-Cect | Rao format | R | 0 | |
SpectralTAD | dense matrix, sparse matrix, hic format, cool format, Rao format | R | 3 | |
Network features | HBM | dense matrix | R | 5 |
spectral | mat format**** | Matlab | NA | |
3DNetMod | sparse matrix | Python | 18 | |
GRiNCH | sparse matrix | C, Python | 3 | |
Structural Entropy | deDoc | sparse matrix | Java | 0 |
SuperTAD | dense matrix, sparse matrix | C++ | 0 | |
Statistical Model | TADtree | dense matrix | Python | 6 |
GMAP | dense matrix, sparse matrix | R | 4 | |
PSYCHIC | dense matrix | Matlab, C | NA | |
HiCKey | dense matrix, sparse matrix, Rao format | C++ | 6 |
16
Intrachromosomal interactions: compartments
17
Bonev et al. Nature Reviews 2016
Compartment detection:
18
19
20
Saddle plot
PCA application to the matrix:
the sign of PC1 defines the compartment membership.
Saddle plot
Compartment strength
22
How to estimate difference in compartments?
More accurate method:
for each bin, the average normalized frequency of interactions with bins belonging to the same compartment was divided by the average normalized contacts with bins from the other compartment.
��
mean of ”reds” divided by mean of “greens”
Subcompartment calling
23
detection via cis-contacts
(in fact, TADs clusterization)
"resolution enhancement" using neural network approaches, such as autoencoder
requires high resolution
(~ 5 billion contacts for human data,
current usual dataset is nearly 10 time less)
Calder
24
The problem: compartments are not just TADs interactions. Moreover, in regions with high rate of extrusion TADs and compartmental structure oppose each other. Cis-approach can be not applicable in this case.
Compartmental and extrusion TADs
25
compartmental
extrusion
Intrachromosomal interactions: loops
Loop-callers: cooltools (python implementation of HICCUPS), MUSTACHE and others
Loop callers
27
Algorithms usually consider 2 features:
Another approaches (Mustache) rely on Gaussian smoothing. The algorithm removes noise and details and keeps only the most bright features like loops
+ Mustache, SIP, etc
Personal preference (probably, not the best): cooltools, chromosight
Significant contacts
Apart from loops, there is one more similar, but another type of contacts – significant interactions on Hi-C map.
These can be promoter-promoter, promoter-enhancer or polycomb interactions.
28
Loop
Significant contact
Tool for significant contacts detection: FitHiC2
FitHiC application example�after additional filtration on specific histone modification: fithic+H3K27me3
29
Without additional filtration result usually seems to be more random
Resolution
30
| human, mouse | fruit fly |
TADs | 10 kb for good quality, 40 kb for worse quality | 4-5 kb for good quality, 10 kb for worse quality |
Loops | ~10 kb | ~5 kb (loops are rare in fruit fly) |
Compartments | 100-250 kb, depends on quality and aims | 10-20 kb |
Subcompartments | 50 kb | ? |
Significant contacts: promoter-promoter, promoter-enhancer interactions | 2-5 kb for good quality, 10-20 kb for worse quality | 2-5 kb |
Significant contacts: polycomb interactions | 100 kb | 10 kb |
Today’s data resolution is often not enough for feature detection
Fires: frequently interacting regions
31
Scale of interactions: ±200kb for human genome
Firecaller:
from doi: 10.1016/j.csbj.2020.12.026
How to create beautiful average plots?
32
Bad and good quality example
33
Same resolution, 40 kb
Rabl: centromer and telomer interactions
34
An example of telomer-centromer interactions
in budding yeast
This is a model conformation for
centromer-centromer interactions
Why subcompartments can be detected from trans-interactions
35
Trans-interactions
reveal compartment
structure
Practise: cooltools
36