AI/ML Techniques Overview in Neutrino Physics
Patrick de Perio (Kavli IPMU)
Kazu Terao (SLAC)
FAIRS-Japan @ KMI
AI/ML Applications in Neutrino Physics
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 2
Data Reconstruction and Analysis
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 3
image height
image width
image depth
features
repeat
...
P(μ±)
P(e±)
P(π0)
P(γ)
2. Convolutions & down-samples
3. Fully connected neural network
“Softmax discriminators”
Many applications of Convolutional/Graph Neural Networks
Softmax P(γ)
νe CC0π
NC γ
NC π0
Distance to detector wall (cm)
e/μ identification
1. Pre-processing
Data Reconstruction and Analysis
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 4
Many applications of Convolutional/Graph Neural Networks
Spherical CNN and KamNet in KamLAND
E3NN: Euclidean Neural Nets and on-going application to LArTPC
Euclidean (3) equivariant neural network model for translation, rotation, and mirroring
Data Reconstruction and Analysis
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 5
Many applications of Convolutional/Graph Neural Networks
Surrogate Models
Neural surrogate models are used in many parts of simulation
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 6
Simulation - ML for Physics Modeling
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 7
Traditional physics simulator requires manual process to optimize against data using separate softwares (i.e. calibration, reconstruction). ML-based approaches can bring automation to this process and/or flexibility to learn and represent missing physics models from real data
Simulation - ML for Physics Modeling
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 8
Traditional physics simulator requires manual process to optimize against data using separate softwares (i.e. calibration, reconstruction). ML-based approaches can bring automation to this process and/or flexibility to learn and represent missing physics models from real data
Initial Guess
Optimization
Final Prediction
True Track
Trajectory
Predicted Track
Trajectory
Predicted Detector �Hits
Photon Trajectories
Model
Evaluation
Parameter 𝜃
Input 𝑥
Output
Objectives
Domain Adaptation (Fighting Data Shift) - DAT
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 9
Simulation = largely accurate but not perfect. Optimizing a model using simulation, then applying for real data can result in data shift. As a consequence, the model may underperform on data.
A
B
Force the model to only learn common features across both domains
How?
Add a task to classify 2 domains, and maximize its error while minimizing the task (label) error. This pressures the model to learn only features common in both domains
Domain Adaptation (Fighting Data Shift) - Contrastive Learning
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 10
Simulation = largely accurate but not perfect. Optimizing a model using simulation, then applying for real data can result in data shift. As a consequence, the model may underperform on data.
Augment data to make the model learn about common underlying (unchanged) features
Contrastive Learning
Image credit: Alexander W. (UCL), talk at NPML (2024)
Domain Adaptation (Fighting Data Shift)
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 11
Simulation = largely accurate but not perfect. Optimizing a model using simulation, then applying for real data can result in data shift. As a consequence, the model may underperform on data.
Image credit: Masked Autoencoder paper
Reconstruction
Track v.s. Shower pixel-level separation
Summary
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 12
Appendix
13
Simulation - Differentiable Physics Modeling
Implementation Highlights:
FAIRS-Japan, Dec. 3-5, 2024 AI/ML in Neutrino 14
Detector response showing accumulated charge in PMTs (1M photons)
Loss landscape when varying track position and opening angle White streamlines show computed gradient directions
O. Alterkait
Good practices
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 15
Preventing common mistakes.
ML model uncertainty
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 16
Problem
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 17
Can we trust a ML model?
Domain adaptation
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 18
A
B
Force the model to only learn common features across both domains
B
A
Force the a domain shift
Through meta-learning, contrastive learning, differentiable simulations, etc
¿Black box?
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 19
Example: understanding a trained model (DUNE CVN)
Dr. Saúl Alonso-Monsalve – ETH Zurich
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 20
electron neutrino (𝜈e)
original
occlusion map
muon neutrino (𝜈μ)
original
occlusion map
Removing the start of the electron shower reduces the 𝜈e score, as expected
The CVN finds the vertex a bit ambiguous, but it is using the end point of the muon to gain a handle on the event type.
Topic 2:
Electron vs. multi-𝛾 event classification
21
Maksimovic et al., J. Cosmol. Astropart. Phys. 051 (2021)
S. Fujita, S. Han, Y. Koshio
Topic 3:
Muon track reconstruction
22
https://arxiv.org/abs/2005.12872
S. Fujita, S. Han, Y. Koshio
Uses of machine learning in SK analyses
FAIRS-Japan, Dec. 3-5, 2024 Neutrino 23
S. Fujita, S. Han, Y. Koshio