Minimal Cycle Representatives in Persistent Homology using Linear Programming
Lori Ziegelmeier
Macalester College
AATRN Vietoris-Rips Seminar
September 30, 2022
Motivation
Data
(in the form of a topological space)
Cycle representative
(of a homology class)
Cycle representative
(of a different homology class)
A tighter representative
PLAN
Collaborators:
Paper
Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User’s Guide
https://arxiv.org/pdf/2105.07025.pdf
https://www.frontiersin.org/articles/10.3389/frai.2021.681117/full
Code
https://github.com/ TDAMinimalGeneratorResearch/minimal-generator
https://github.com/UDATG/optimal_reps
Related papers
I. Obayashi. "Volume-optimal cycle: Tightest representative cycle of a generator in persistent homology." SIAM Journal on Applied Algebra and Geometry 2.4 (2018): 508-534.
E. Escolar and Y. Hiraoka. "Optimal cycles for persistent homology via linear programming." Optimization in the Real World. Springer, Tokyo, 2016. 79-96.
H. Hang, C. Giusti, L. Ziegelmeier, and G. Henselman-Petrusek. “U-match factorization: sparse homological algebra, lazy cycle representatives, and dualities in persistent (co)homology.” 2021, https://arxiv.org/pdf/2108.08831.pdf
C. Giusti, C. Henselman-Petrusek, and L. Ziegelmeier. “The basis matching complex: a streamlined framework for persistent homological algebra,” forthcoming.
ExHACT PIs
Lu Li Connor Thompson Greg Henselman-Petrusek Chad Giusti
Background
motivation: finding loops in a topological space
What is homology?
Boundary
matrix
DX
Topological space
X
Since
And Homology
Zk := k-cycles
Bk := k-boundaries
choice of field
motivation: mimic the mind’s eye with a union of balls
what you see
what you tell your computer
What is persistent homology?
what you construe
radius = 0
radius = 0.5
radius = 1
What is persistent homology?
problem: topology depends on choice of radius
radius
nested
sequence
of spaces
idealized
model
idealized
model
“lifespan" of
cycle
appears
cycle
vanishes ( 0 = [A]∈ H1(X) )
1-boundaries
1-cycles
idealized
model
barcode
lifespan of
lifespan of
radius
barcode
lifespan of
lifespan of
nested
sequence
of spaces
more “robust”
motivation: matrix factorization
How to compute persistence?
Standard Algorithm
Cohomology Algorithm
V, invertible &
upper triangular
R, “reduced”, every nonzero column has lowest entry in distinct row
Aside: ExHACT allows users to compute each of these (and several other related) matrices
Check out:
H. Hang, C. Giusti, L. Ziegelmeier, and G. Henselman-Petrusek. “U-match factorization: sparse homological algebra,�lazy cycle representatives, and dualities in persistent (co)homology.” 2021, https://arxiv.org/pdf/2108.08831.pdf
Another Aside: Basis Matching Complex
C. Giusti, C. Henselman-Petrusek, and L. Ziegelmeier. “The basis matching complex: a streamlined framework for persistent homological algebra,” forthcoming.
Cycle Representatives
When do “good” cycle representatives matter?
lifespan of B
lifespan of A
Would you prefer A or B?
When do “good” cycle representatives matter?
3d reconstruction of cardiac tissue
Righthand column shows application of “tight” cycle representatives to counterbalance automated smoothing priors; blue signifies higher reconstruction error
Source: Wu, Pengxiang, et al. "Optimal topological cycles and their application in cardiac trabeculae restoration." International Conference on Information Processing in Medical Imaging. Springer, Cham, 2017.
Example:
Analyze chromatin interaction to study folding
Loops represent different types of chromatin interactions (pictured). Annotate particular loops by localizing a cycle.
Example:
Source: Emmett, Kevin, et al. “Multiscale Topology of Chromatin Folding." Proceedings of the 9th EAI International Conference on Bio-inspired Information and Communications Technologies. 2016.
What makes a cycle representative “good”?
simple closed curves - where possible, it’s convenient to represent a feature as a simple closed curve (connections to graph theory)
tightness - tight representatives help to localize holes in data
What makes a cycle representative “good”?
Example Filtration
A
B
D
E
F
G
C
AC
BC
A B C D E F G
A
B
D
E
F
G
C
A B C D E F G
AC
BC
A
B
D
E
F
G
C
AB
EF
FG
AC
BC
A B C D E F G
A
B
D
E
F
G
C
ABC
AC
BC
AB
EF
FG
A B C D E F G
A
B
D
E
F
G
C
AG
BD
ABC
AC
BC
AB
EF
FG
A B C D E F G
A
B
D
E
F
G
C
DE
ABC
AG
BD
AC
BC
AB
EF
FG
A B C D E F G
A
B
D
E
F
G
C
ABC
AG
BD
AC
BC
AB
EF
FG
DE
A B C D E F G
A
B
D
E
F
G
C
ABC
AG
BD
AC
BC
AB
EF
FG
DE
A B C D E F G
A
B
D
E
F
G
C
CD
ABC
AG
BD
AC
BC
AB
EF
FG
DE
A B C D E F G
A
B
D
E
F
G
C
BCD
ABC
AG
BD
AC
BC
AB
EF
FG
DE
CD
A B C D E F G
A
B
D
E
F
G
C
GC
ABC
AG
BD
BCD
AC
BC
AB
EF
FG
DE
CD
A B C D E F G
A
B
D
E
F
G
C
ACG
A B C D E F G
ABC
AG
BD
BCD
GC
AC
BC
AB
EF
FG
DE
CD
A
B
D
E
F
G
C
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
AC
BC
AB
EF
FG
DE
CD
CE
A
B
D
E
F
G
C
CED
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
AC
BC
AB
EF
FG
DE
CD
A
B
D
E
F
G
C
CF
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
CED
AC
BC
AB
EF
FG
DE
CD
A
B
D
E
F
G
C
CEF
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
CED
CF
AC
BC
AB
EF
FG
DE
CD
CFG
A
B
D
E
F
G
C
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
CED
CF
CEF
CFG
AC
BC
AB
EF
FG
DE
CD
A
B
D
E
F
G
C
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
CED
CF
CEF
CFG
AC
BC
AB
EF
FG
DE
CD
A B C D E F G
ABC
AG
BD
BCD
GC
ACG
CE
CED
CF
CEF
CFG
AC
BC
AB
EF
FG
DE
CD
dim = 0
dim = 1
dim = 2
Simplex-wise Filtration
Minimal Cycle Representatives
Example
POINT CLOUD
FILTRATION
BARCODE
REPRESENTATIVE
FOR A SINGLE BAR
Minimizing a Persistent Cycle Representative
Related papers
E. Escolar and Y. Hiraoka. "Optimal cycles for persistent homology via linear programming." Optimization in the Real World. Springer, Tokyo, 2016. 79-96.
I. Obayashi. "Volume-optimal cycle: Tightest representative cycle of a generator in persistent homology." SIAM Journal on Applied Algebra and Geometry 2.4 (2018): 508-534.
?
dimension-1
D
A
B
E
F
G
C
D
A
B
E
F
G
C
AG
BD
DE
CD
GC
CE
AC
BC
AB
EF
FG
1
-1
0
1
1
1
1
1
0
0
0
CF
-1
1
1
0
0
0
0
0
0
0
0
+
Minimize Number of Edges
Minimize Number of Edges
D
A
B
E
F
G
C
D
A
B
E
F
G
C
AG
BD
DE
CD
GC
CE
AC
BC
AB
EF
FG
1
-1
0
1
1
1
1
1
0
0
0
CF
-1
1
1
0
0
0
0
0
0
0
0
+
0
0
1
1
1
1
1
1
0
0
0
=
Minimize Number of Edges
D
A
B
E
F
G
C
D
A
B
E
F
G
C
AG
BD
DE
CD
GC
CE
AC
BC
AB
EF
FG
1
-1
0
1
1
1
1
1
0
0
0
CF
-1
1
1
0
0
0
0
0
0
0
0
+
0
0
1
1
1
1
1
1
0
0
0
=
D
A
B
F
G
C
D
A
B
E
F
G
C
D
A
B
E
F
G
C
Weighted Length of Edges
D
A
B
E
F
G
C
D
A
B
E
F
G
C
Number of Triangles (a cycle bounds)
ABC
AG
BD
DE
CD
BCD
GC
CE
CED
CF
CEF
CFG
AC
BC
AB
EF
FG
ACG
Number of Triangles (a cycle bounds)
ABC
AG
BD
DE
CD
BCD
GC
CE
CED
CF
CEF
AC
BC
AB
EF
FG
ACG
F
G
C
Number of Triangles (a cycle bounds)
ABC
AG
BD
DE
CD
BCD
GC
CE
CED
CF
AC
BC
AB
EF
FG
ACG
E
F
G
C
Number of Triangles (a cycle bounds)
ABC
AG
BD
DE
CD
BCD
GC
CE
CED
CF
AC
BC
AB
EF
FG
A
E
F
G
C
Number of Triangles (a cycle bounds)
ABC
AG
BD
DE
CD
BCD
GC
CE
CF
AC
BC
AB
EF
FG
A
D
E
F
G
C
Number of Triangles (a cycle bounds)
A
B
D
E
F
G
C
ABC
AG
BD
DE
CD
GC
CE
CF
AC
BC
AB
EF
FG
Number of Triangles (a cycle bounds)
A
B
D
E
F
G
C
AG
BD
DE
CD
GC
CE
CF
CEF
AC
BC
AB
EF
FG
ABC
BCD
CED
ACG
CFG
Number of Triangles (a cycle bounds)
A
B
D
E
F
G
C
AG
BD
DE
CD
GC
CE
CF
CEF
AC
BC
AB
EF
FG
ABC
BCD
CED
ACG
CFG
Number of Triangles (a cycle bounds)
A
B
D
E
F
G
C
A
B
D
E
F
G
C
6 Triangles
5 Triangles
Edge-loss
Triangle-loss
A
B
D
E
F
G
C
AG
BD
DE
CD
GC
CE
CF
CEF
AC
BC
AB
EF
FG
ABC
BCD
CED
ACG
CFG
Weighted Area of Triangles (a cycle bounds)
A Note About Linear Programming
Linear Programs are of the form:
Mixed Integer Programs are of the form:
Fewer constraints,
Convex optimization
More constraints,
Discrete problem
Summary
POINT CLOUD
FILTRATION
BARCODE
REPRESENTATIVE
FOR A SINGLE BAR
EIGHT OPTIMIZATION PROBLEMS
# edges
sum of edge
lengths
sum of (bounding) triangle areas
# bounding
triangles
Example
Solutions to Unweighted Problems Often Non-Unique
Unif loss functions use actual 0-norm
Optimized Persistent Homology Cycle Basis
Empirical Analysis of �Cycle Representatives
What happens when we optimize?
values in {-1, 0, 1}?)
DATA SETS
“Real world”
Random point cloud
Erdös-Rényi
Computation
https://github.com/TDAMinimalGeneratorResearch/minimal-generator
Computation time
Timing
Real-World
Synthetic
Take-away
Dimension of ambient Euclidean space
Take-away
Solver matters!!!
Timing
Real-World
Synthetic
Take-away
box plots
red line represents
ratio of 1
scatter plots
Average size
What’s normal?
Dimension of ambient Euclidean space
Most cycles start small
(So there’s little room for improvement)
Most random Euclidean point clouds have small PH cycle representatives.
Effectiveness of optimization
high compression ratio
low compression ratio
Effectiveness of optimization
compression ratio ~
Randomly generated point clouds
ceiling effect
compression
ratio
“Real world” data sets
“Real world” data sets
Length versus bounded area
Minimizing length does not necessarily minimize area — but it usually does
Take-away
cycle minimization
area minimization
ratio of compression (weighted length of edges)
ratio of compression
(# edges)
ratio of compression
(bounded area)
Length versus area
Surprisingly, methods designed to minimize length can reduce area better than methods designed to minimize area
Original
Optimized for # of edges
Optimized for total area of bounding triangles
Source of the discrepancy
Counting loops
in the support of a cycle
The only random space to generate multi-loop persistent cycle representatives in large numbers
Coefficients
Findings
Advantages of …
Take-away
Can drop the integral constraint (to save on computational costs) and still achieve integral solution!
SUMMARY FOR THE PRACTITIONER
Questions for the researcher: Why?
Questions?
Collaborators:
Lu Li Connor Thompson Greg Henselman-Petrusek Chad Giusti
Supported by NSF:CDS&E-MSS-1854703
Join WinCompTop:
Send blank email to
WinCompTop+subscribe@googlegroups.com
Paper
Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User’s Guide
https://www.frontiersin.org/articles/10.3389/frai.2021.681117/full
Code
https://github.com/ TDAMinimalGeneratorResearch/minimal-generator
https://github.com/UDATG/optimal_reps
Related papers
I. Obayashi. "Volume-optimal cycle: Tightest representative cycle of a generator in persistent homology." SIAM Journal on Applied Algebra and Geometry 2.4 (2018): 508-534.
E. Escolar and Y. Hiraoka. "Optimal cycles for persistent homology via linear programming." Optimization in the Real World. Springer, Tokyo, 2016. 79-96.
H. Hang, C. Giusti, L. Ziegelmeier, and G. Henselman-Petrusek. “U-match factorization: sparse homological algebra, lazy cycle representatives, and dualities in persistent (co)homology.” 2021, https://arxiv.org/pdf/2108.08831.pdf
C. Giusti, C. Henselman-Petrusek, and L. Ziegelmeier. “The basis matching complex: a streamlined framework for persistent homological algebra,” forthcoming.
Qingru Zhang