Introduction to graph theory and molecular networks
Sep 14th 2021
Some of these materials are from Introduction to Bioinformatics, BMI/CS 576.
Goals for today
Networks are ubiquitous
Internet
Image credit: Wikipedia, Wikimedia, The cellmap, Euroscientist, https://extensionaus.com.au/extension-practice/social-network-analysis/
Yeast genetic interaction network
Social network
Networks are powerful representations of complex systems
What is a network?
Vertex/Node
Edge
C
B
A
D
G
H
E
F
Notation
A few graph-theoretic concepts
Representing a graph
Undirected graphs
Adapted from Barabasi & Oltvai, Nature Reviews Genetics 2004
C
B
A
D
G
H
E
F
E= {(A,B),(A,C),(A,D),(A,E),(A,H),
(B,C),(B,G),
(E,F),
(F,G),
(G,H)}
V={A, B, C, D, E, F, G, H}
Adjacency matrix
C
Adapted from Barabasi & Oltvai, Nature Reviews Genetics 2004
B
A
D
G
H
E
0 | 1 | 1 | 1 | 1 | 0 | 0 | 1 |
1 | 0 | 1 | 0 | 0 | 0 | 1 | 0 |
1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
0 | 1 | 0 | 0 | 0 | 1 | 0 | 1 |
1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
F
A
B
C
D
E
F
G
H
A
B
C
D
E
F
G
H
V={A, B, C, D, E, F, G, H}
E= {(A,B),(A,C),(A,D),(A,E),(A,H),(B,C),(B,G),(E,A),(E,F),(F,G),(G,H)}
Adjacency list
C
B
A
D
G
H
E
F
A
B
C
D
E
H
B
A
C
G
C
A
B
D
A
E
A
F
F
G
E
G
B
F
H
H
A
G
Adjacency list versus matrix
Adjacency list
Adjacency matrix
Directed graphs
C
B
A
D
G
H
E
F
0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 |
1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
0 | 0 | 0 | 0 | 1 | 0 | 1 | 0 |
0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 |
1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
A
B
C
D
E
F
G
H
A
B
C
D
E
F
G
H
Weighted graphs
C
B
A
D
H
E
0 | 0 | 0 | 0.9 | 1.4 | 0 |
1.6 | 0 | 0 | 0 | 0 | 0 |
0.2 | 0.6 | 0 | 0 | 0 | 0 |
0 | 0 | 0 | 0 | 0 | 0 |
0 | 0 | 0 | 0 | 0 | 2 |
3.1 | 0 | 0 | 0 | 0 | 0 |
A
B
C
D
E
H
A
B
C
D
E
H
0.6
1.6
0.2
3.1
1.4
0.9
2
Node degree
Node degree
C
B
A
D
G
H
E
F
What is the out degree of E?
Paths and cycles
Paths and cycles
C
B
A
D
G
H
E
Paths from B to D
F
C
B
A
D
G
H
E
A cycle
F
cycle
Paths
Connected components
Connected components
H
C
B
A
D
G
H
E
F
Two connected components
A
B
C
D
E
F
G
Four strongly connected components
Connected components in an undirected graph
Connected components in a directed graph
Special types of graphs
Subgraph
Common graph traversal algorithms
Breadth-first search (BFS)
Breadth first search algorithm sketch
Breadth first search algorithm
Breadth first search example
Adapted from Introduction to Algorithms, 2nd Edition, Cormen, Leiserson, Rivest, Stein
∞
Q={s}
∞
∞
∞
∞
∞
∞
0
s
r
t
u
v
w
x
y
Before while loop
Breadth first search example
Adapted from Introduction to Algorithms, 2nd Edition, Cormen, Leiserson, Rivest, Stein
1
Q={w,r}
∞
1
∞
∞
∞
∞
0
s
r
t
u
v
w
x
y
Iteration 1
Breadth first search continued
Iteration 2
1
∞
1
2
2
∞
∞
0
s
r
t
u
v
w
x
y
Q={r,t,x}
Breadth first search continued
Q={t,x,v}
Iteration 3
1
2
1
2
2
∞
∞
0
s
r
t
u
v
w
x
y
Breadth first search continued
Q={x,v,u}
Iteration 4
1
2
1
2
2
∞
3
0
s
r
t
u
v
w
x
y
Breadth first search continued
Q={v,u,y}
Iteration 5
1
2
1
2
2
3
3
0
s
r
t
u
v
w
x
y
Breadth first search continued
Q={u,y}
Iteration 6
1
2
1
2
2
3
3
0
s
r
t
u
v
w
x
y
Breadth first search continued
Q={y}
Iteration 7
1
2
1
2
2
3
3
0
s
r
t
u
v
w
x
y
Breadth first search continued
Q={}
Iteration 8
1
2
1
2
2
3
3
0
s
r
t
u
v
w
x
y
Depth first search
Depth first search algorithm
Depth first search example
v
u
w
Iteration 1
x
y
z
1/
Start time
Node | 𝜋 |
u | NIL |
v | NIL |
w | NIL |
x | NIL |
y | NIL |
z | NIL |
Time=1
DFS-VISIT (u)
Depth first search example
x
y
z
1/
2/
v
u
w
Iteration 2
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | NIL |
y | NIL |
z | NIL |
Time=2
DFS-VISIT (v)
Depth first search example
x
y
z
1/
2/
v
u
w
3/
Iteration 3
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | NIL |
y | v |
z | NIL |
Time=3
DFS-VISIT (y)
Depth first search example
x
y
z
1/
2/
v
u
w
3/
Iteration 4
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | NIL |
Time=4
4/
DFS-VISIT (x)
Depth first search example
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | NIL |
Time=5
x
y
z
1/
2/
v
u
w
3/
Iteration 5
4/5
DFS-VISIT (x)
Start time
End time
Depth first search example
x
y
z
1/
2/
v
u
w
3/6
Iteration 6
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | NIL |
Time=6
4/5
DFS-VISIT (y)
Depth first search example
x
y
z
1/
2/7
v
u
w
3/6
Iteration 7
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | NIL |
Time=7
4/5
DFS-VISIT (v)
Depth first search example
x
y
z
1/8
2/7
v
u
w
3/6
Iteration 8
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | NIL |
Time=8
4/5
DFS-VISIT (u)
Depth first search example
x
y
z
1/8
2/7
v
u
w
3/6
Iteration 9
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | w |
Time=9
4/5
DFS-VISIT (w)
9/
Depth first search example
x
y
z
1/8
2/7
v
u
w
3/6
Iteration 10
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | w |
Time=10
4/5
DFS-VISIT (z)
9/
10/
Depth first search example
x
y
z
1/8
2/7
v
u
w
3/6
Iteration 11
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | w |
Time=11
4/5
DFS-VISIT (z)
9/
10/11
Depth first search example
x
y
z
1/8
2/7
v
u
w
3/6
Iteration 12
Node | 𝜋 |
u | NIL |
v | u |
w | NIL |
x | y |
y | v |
z | w |
Time=12
4/5
DFS-VISIT (w)
9/12
10/11
Take away points
Goals for today
Organization of biological information
Organism
Tissue
Gene
Chromosome
Cell
http://publications.nigms.nih.gov/thenewgenetics/chapter1.html;�Images: J Merkin et al. Science 2012;338:1593-1599, Wikipedia, Google, Shutterstock
Organs
Molecules of life
Deoxyribonucleic acid (DNA)
image from the DOE Human Genome Program
http://www.ornl.gov/hgmis
DNA is a double helical molecule
http://www.chemheritage.org/discover/online-resources/chemistry-in-history/themes/biomolecules/dna/watson-crick-wilkins-franklin.aspx
Watson and Crick
Maurice
Wilkins
Rosalind Franklin
Nucleotides
Phosphate
Base
Sugar
Hydroxy
DNA stores the blue print of an organism
Chromosomes
Image from www.genome.gov
Genes
…GTATGTCTAAGCCTGAATTCAGAACGGCTTC…
RNA: Ribonucleic acid
Proteins
Amino Acids
The central dogma of Molecular biology
DNA
RNA
Proteins
Transcription
Translation
The genetic code: specifies how mRNA is translated into protein
Genetic code is degenerate
Transcription
Translation
A video about transcription and translation
Metabolites
Goals for today
Graphs for representing molecular networks
Transcriptional regulatory networks
Regulatory network of E. coli.
153 TFs (green & light red), 1319 targets
Vargas and Santillan, 2008
A
B
Gene C
Transcription factors (TF)
C
A
B
DNA
Protein-protein interaction networks
Barabasi et al. 2003
Yeast protein interaction network
X
Y
X
Y
Signaling networks
Sachs et al., 2005, Science
Receptors
P
Q
A
TF
A
P
Q
Reactions associated with Galactose metabolism
Metabolic networks
Metabolites
N
M
a
b
c
O
Enzymes
d
O
M
N
KEGG
Genetic interaction networks
Dixon et al., 2009, Annu. Rev. Genet
Q
G
Genetic interaction: If the phenotype of a double mutant is significantly different than each mutant alone
Categorization of molecular networks
“Omic” tools measure cellular molecular components in a high-throughput manner
Uwe Sauer, Matthias Heinemann, Nicola Zamboni, Science 2007
References