Theoretical Foundations for Artificial Intelligence (AI) Inspired from Understanding Biological Intelligence (BI)� �Detecting Phase Transitions & Quantifying the Degree of Emergence in Deep Learning
1
Paul Bogdan
Ming Hsieh Department of Electrical and Computer Engineering, University of Southern California
The 7th workshop on Neural Scaling Laws - Scaling, Transfer & Multilingual Models�Monday, July 22, 2024, Vienna, Courtyard Vienna Prater/Messe
1
Motivation & Grand Challenges
2
Complexity of Neuronal Networks
3
Yin et al. “Network science characteristics of brain-derived neuronal cultures deciphered from quantitative phase imaging data”, Nature Scientific Reports, 2020
New Mathematics for Complex Networks
4
Weighted Multifractal Graph Generator
5
WMG Inference vs. Network Complexity
6
Non-recoverable
Recoverable
R. Yang, F. Sala, and P. Bogdan, Nature Communications Physics, 2021
WMGG Can Mine Insect Brain Data (I)
7
Inferred WMG model with M=2, K=3
Data and brain image source: Xu, C. Shan, et al. "A connectome of the adult drosophila central brain.”(bioRxiv 2020) Dataset: https://www.janelia.org/project-team/flyem/hemibrain
WMGG Can Mine Insect Brain Data (II)
8
R. Yang, F. Sala, and P. Bogdan, Nature Communications Physics, 2021
Mining Cognition From Scarce Brain Nets
9
R. Yang, F. Sala, and P. Bogdan, Nature Communications Physics, 2021
Infer Generators of Brain Networks
10
R. Yang, F. Sala, and P. Bogdan, Nature Communications Physics, 2021
Motivation & Grand Challenges
11
Infusing Flocking Intelligence into ANNs
12
C. Yin et al., arXiv link: https://arxiv.org/pdf/2310.07885.pdf
DNNs vs. Leader-Follower Neural Networks
13
LFNNs Performance vs. Leadership Size
when leaders and followers are about 50/50
14
C. Yin et al., arXiv link: https://arxiv.org/pdf/2310.07885.pdf
BP-Free Leader-Follower Neural Network
15
C. Yin et al., arXiv link: https://arxiv.org/pdf/2310.07885.pdf
Leaders vs. Followers
16
C. Yin et al., arXiv link: https://arxiv.org/pdf/2310.07885.pdf
Motivation & Grand Challenges
17
Differential Geometry of Networks
18
Quantifies topological properties of graphs
Quantifies the volume growth
What is the Curvature of a DNN?
19
Geometric properties
Learning task
DNN’s Accuracy vs. FRC Entropy: Fashion-MNIST
20
M.R. Znaidi et al. "A unified approach of detecting phase transition in time-varying complex networks" Scientific Reports 13, no. 1, 2023
DNN’s Accuracy vs. FRC Entropy: CIFAR-10 Dataset
21
M.R. Znaidi et al. "A unified approach of detecting phase transition in time-varying complex networks." Scientific Reports 13, no. 1, 2023
Findings & Future Work
22
Correlation between Forman-Ricci curvature network entropy and training accuracy
Detect the learning phase transitions
Perform dropout based on Forman-Ricci curvature of each node
Motivation & Grand Challenges
23
Mono-Fractal Analysis
24
d = 1.719
| | | | |
| | | | |
| | | | |
| | | | |
| | | | |
| | |
| | |
| | |
A Simple Example
25
0.3
0.3
0.05
0.05
0.05
0.05
0.2
0.2
0.1
0.2
0.3
0.4
0.5
5
4
3
2
1
0.3
0.1
0.05
0.05
0.05
0.05
0.2
0.2
0.1
0.2
0.3
0.4
0.5
5
4
3
2
1
Node-based Multifractal Analysis (I)
26
q>0: Prioritize abundant patterns
q<0: Prioritize rare patterns
Multi-fractal scaling behavior
X. Xiao, H. Chen, and P. Bogdan, “Deciphering the generating rules and functionalities of complex networks”, Nature Scientific Reports, 2021
Node-based Multifractal Analysis (II)
27
𝜏(𝑞) vs 𝑞
Legendre transform
Multifractal spectrum
Emergence & Self-Organization in AI
28
arXiv link: https://arxiv.org/pdf/2402.09099.pdf
(Artificial) Neuronal Interaction Network (NIN)
29
arXiv link: https://arxiv.org/pdf/2402.09099.pdf
Multifractal Analysis of LLM Training
30
arXiv link: https://arxiv.org/pdf/2402.09099.pdf
Emergence in LLM Training
31
arXiv link: https://arxiv.org/pdf/2402.09099.pdf
Relevance
Predictability + New Information
CPS Group & Collaborators
32
Yuankun Xue
Valeriu Balaban
Gaurav Gupta
Jyotirmoy Deshmukh (USC)
Hana Koorehdavoudi
Edmond Jonckheere (USC)
Jayson Sia
Mohamed R. Znaidi
Panagiotis Kyriakis
Yao Xiao
Thank you!
More info at https://cps.usc.edu/
George Pappas (Upenn)
Ruochen Yang
Mingxi Cheng
Graduated PhD students
Sergio Pequito (Delft Univ.)
Qi Cao
Xiong Ye Xiao
Current PhD students
Justin Rhodes (UIUC)
Roozbeh Kiani (NYU)
James Boedicker (USC)
Radu Balan (UMD)
Brief list of collaborators:
Frederic Sala (UWisconsin)
Chenzhong Yin
Nicholas Kotov (Univ. of Michigan)
Emily A. Reed
Kien Nguyen
Heng Ping
Shixuan Li
Gengshuo Liu
Shukai Duan
Graph Theory-aware ML (GTML)
33
M. Cha, E.S.T. Emre, X. Xiao, J.-Y. Kim, P.B., S.J. VanEpps, A. Violi, N.A. Kotov, Unifying Structural Descriptors for Biological and Bioinspired Nanoscale Complexes, Nature Comp. Sci., 2022
BA Prediction via An Interpretable 3D-CNN
34
Chenzhong Yin, Mihai Udrescu, Gaurav Gupta, Mingxi Cheng, Andrei Lihu, Lucretia Udrescu, Paul Bogdan, David M. Mannino, and Stefan Mihaicuta. "Fractional dynamics foster deep learning of COPD stage prediction." Advanced Science (2023): 2203485. https://onlinelibrary.wiley.com/doi/full/10.1002/advs.202203485
C. Yin et al., "Anatomically interpretable deep learning of brain age captures domain-specific cognitive impairment" Proceedings of the National Academy of Sciences, 2023 https://www.pnas.org/doi/abs/10.1073/pnas.2214634120
Block-wise BP-Free (BWBPF) Network
35
Results on Cifar-10 and Tiny-ImageNet
36
A. Cheng, H. Ping, Z. Wang, X. Xiao, C Yin, S. Nazarian, M. Cheng, and P. Bogdan. "Unlocking Deep Learning: A BP-Free Approach for Parallel Block-Wise Training of Neural Networks" IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024 https://ieeexplore.ieee.org/document/10447377
WMG Can Mine 4D Nucleome Data
37
Yeast in quiescence
Exponentially growing yeast
Data source: M.T. Rutledge et al. The yeast genome undergoes significant topological reorganization in quiescence. Nucleic acids research, 2015
Chemical, Geometrical & Graph Descriptors
38
Graph Theory-aware ML (GTML)
39
M. Cha, E.S.T. Emre, X. Xiao, J.-Y. Kim, P.B., S.J. VanEpps, A. Violi, N.A. Kotov, Unifying Structural Descriptors for Biological and Bioinspired Nanoscale Complexes, Nature Comp. Sci., 2022
Prediction of Protein–Protein Complexes
40
Quantifying Phase Transitions
41
x=0.409
x=0.431
x=0.930
x=0.007
x=0.190
x=0.204
micelles 🡪 cylinders
cylinders 🡪 lamellae
Molecular Dynamics Descriptors
42
Graph Description of Complex Systems
43
GT Shades Light on Phases
44
Tracking Phase Transitions via GT Metrics
45
Structural Fluctuations & Phase Transitions
46
Colored by value of closeness centrality
Max - blue; Min - red
Dominant of micellar 🡪
mixture 🡪
dominant of hexagonal
Multifractality vs. Phase Transitions
47
Forman-Ricci Curvature (FRC) (I)
48
Forman-Ricci curvature (FRC) (II)
49
M.R. Znaidi,J. Sia, S. Ronquist, I. Rajapakse, E. Jonckheere & P. Bogdan. "A unified approach of detecting phase transition in time-varying complex networks." Scientific Reports 13, no. 1, 2023
Forman-Ricci curvature (FRC) (III)
9/47
Advantage: It is far simpler to evaluate on large networks than Ollivier-Ricci curvature!
Augmented Forman-Ricci curvature (FRC)
9/47
Advantage: It is far simpler to evaluate on large networks than Ollivier-Ricci curvature!
Worker Activity in LFNNs
52
C. Yin et al., arXiv link: https://arxiv.org/pdf/2310.07885.pdf
Differential Geometry of Networks (I)
53
[1] M. Weber et al. "Characterizing complex networks with Forman-Ricci curvature and associated geometric flows." Journal of Complex Networks 5.4 (2017)
[2] M. Weber et al. "Coarse geometry of evolving networks." Journal of Complex Networks 6.5 (2018)
Quantifies the volume growth
Differential Geometry of Networks (II)
54
Differential Geometry of Networks (III)
55
Quantifies topological properties of graphs