Topological properties of graphs
Oct 16th 2025 (Updated on Oct 21st 2025)
These slides, excluding third-party material, are licensed under CC BY-NC 4.0 by Sushmita Roy and Anthony Gitter
Plan for the semester
When | What |
Week 2-Week 4 | Representing and learning networks from data |
Week 5-Week 7 | Deep learning in network biology |
Week 7-Week 8 | Graph analysis, topology, and modules |
Week 9-Week 10 | Graph alignment and comparison |
Week 10-Week 13 | Network-based integration and interpretation |
Week 14-Week 15 | Project presentations |
Plan for this section
Goals for today
Why should we care about network properties?
“Probably the most important discovery of network theory was the realization that despite the remarkable diversity of networks in nature, their architecture is governed by a few simple principles that are common to most networks of major scientific and technological interest” Barabasi and Oltvai, Nature Genetics Review 2004
Commonly measured network properties
Recall node degree
C
B
A
D
G
H
E
F
Average degree
Degree distribution
Different degree distributions
Poisson distribution
k
P(X=k)
Power law distribution
P(k)
k
Barabasi and Oltvai, Nature Genetics Review 2004
Erdös Renyi random graphs
Scale free networks
Poisson versus Scale free
Barabasi & Oltvai 2004, Nature Genetics Review
Yeast protein interaction network is believed to be scale free
Barabasi & Oltvai 2004, Nature Genetics Review
Degree of a node is correlated to functional importance of a node
Yeast PPI network degree distribution
5,905 nodes
86,286 edges
Origin of scale free networks
Barabasi & Oltvai 2004, Nature Genetics Review
Growth and preferential attachment in scale free networks
A new node (red) is more likely to connect to node 1
than 2
Barabasi & Oltvai 2004, Nature Genetics Review
Commonly measured network properties
Paths on a graph
C
B
A
D
G
H
E
Path: a set of connected edges from one node to another
Paths from B to D
F
There are two paths from B-D: B->A->D and B->G->H->A->D
Path lengths
Scale-free networks tend to be ultra-small
Yeast protein interaction network
Avg. path length: 2.6
Diameter: 6
Nodes: 5,905
Edges: 86,286
Are scale free networks ubiquitous?
Broido, A. D. & Clauset, A. Scale-free networks are rare. Nat Commun 10, 1017 (2019).
Top Hat Question
Commonly measured network properties
Network motifs
Milo Science 2002
Network motifs of size 3 in a directed network
Additional possibilities if we consider the sign of an edge
Milo Science 2002
Different types of feed-forward motifs
Coherent: sign of the direct path is the same as the indirect path
Incoherent: sign of the direct path and indirect path are not the same
Alon 2007, Nature Review Genetics
These appear much more in transcriptional networks than other FF motifs
Network motifs are associated with dynamics
A feed-forward motif with an AND gate can encode a sign sensitive delay
Alon 2007, Nature Review Genetics
Network motifs found in many complex networks
The occurrence of the feed-forward loop in both networks suggests a fundamental similarity in the design on these networks
Milo et al., Science 2002
Common structural motifs seen in the yeast regulatory network
Lee et.al. 2002, Mangan & Alon, 2003
Auto-regulation
Multi-component
Feed-forward loop
Single Input
Multi Input
Regulatory Chain
Feed-forward loops involved in speeding up in response of target gene
How to find network motifs?
Wernicke 2005: Pages 165-177 https://link.springer.com/content/pdf/10.1007/11557067.pdf
Subgraph isomorphism
https://en.wikipedia.org/wiki/Subgraph_isomorphism_problem
Algorithm for enumerating subgraphs: Edge sampling
N(V’): set of neighbors of all vertices in V’
Kashtan et al., Bioinformatics 2004
Generating a randomized network
Strategy to generate randomized networks (Switching method)
1
2
3
4
5
1
2
3
4
5
Select two edges connecting four vertices and swap the end points.
Repeat.
Milo, R., Kashtan, N., Itzkovitz, S., Newman, M. E. J. & Alon, U. On the uniform generation of random graphs with prescribed degree sequences. at http://arxiv.org/abs/cond-mat/0312028 (2004).
Strategy to generate randomized network (Matching method)
A
B
C
D
Adapted from Jure Leskovec slides on “Machine learning with graphs”
A
B
C
D
A
B
C
D
Nodes with spokes
Randomly pair up spokes
Resulting graph
Assessing the significance of a network motif
The motif (red dashed edges) occurs much more frequently in the real network than
in any randomized network
Commonly measured network properties
Resuming on Oct 21st, 2025
Modularity in networks
A modular network
Module 1
Module 2
Module 3
M. E. J. Newman, Modularity and community structure in networks, PNAS 2006
Different types of network modules
Albert-László Barabási, Natali Gulbahce, and Joseph Loscalzo, Nature Review Genetics 2011, Mitra et al., Nature Review Genetics 2014
Studying modularity in biological systems
Clustering coefficient
A
B
C
?
Clustering coefficient example
A
C
B
G
D
Q measure for modularity
K: number of groups
eij: Fraction of total edges that link nodes in group i to group j
Detecting community structure in networks. M.E. J Newman
Fraction of edges within group i
Fraction of edges without regard to community structure
Summary of topological properties of networks
References
Neural Subgraph Matching
Rex et al. Neural Subgraph Matching. at http://arxiv.org/abs/2007.03092 (2020).