Wrap up of caloSim project�-- and next plan?
AUC: 0.5 is perfect (fully confuse the classifier)
JSD: 0 is perfect ( BCE loss reaches max, sqrt[2] )
GAN model
Mixed model
Is there still chance for VAE model?
VQVAE: Nice presentation from Chase!
Recap of datasets
DS1: easy dataset, 368 dimensions. irregular geometry and similar to caloGAN dataset
DS2: medium dataset, 6480 dim., cubic (but [r,phi,z] cyclindrical coordinate)
DS3: hard dataset, 40500dim., cubic (but [r,phi,z] cyclindrical coordinate)
→ 3 sets of metrics used for the evaluation how “real” the generated image is
More details about VQVAE on DS1
More details about VQVAE on DS1
Image looks like very “real”
avg.
individual
(normed)
Metrics :
Two sets of metrics to measure “hwo good the generation”:
Chi2 Metric
Physics metric looks good (chi2 of two high-level variable)
sum of layer N
sum of all / condition (particle incident energy)
0 means perfect
More details about VQVAE on DS1
Some physics variable metric is not very good like
enrgy weighted avg(X)
enrgy weighted avg(Y)
enrgy weighted std(X)
enrgy weighted std(Y)
Classifier Metric
DS1:
DS2, DS3:
Flow model | AUC/JSD |
low level classifier | 0.739/0.131 |
high level classifier | 0.556/0.015 |
Our model | AUC/JSD |
low level classifier | 0.995/0.873 |
high level classifier | 0.947/0.579 |
Flow model | DS2 AUC/JSD | DS3 AUC/JSD |
low level classifier | 0.823/0.263 | 0.889/0.411 |
high level classifier | 0.860/0.329 | 0.931/0.524 |
More details about VQVAE on DS1
Classifier using fixed MLP architecture and trained for 50 epochs to discriminate the generated and truth image
→ 3 model used different inputs:
Low level: all pixel energy feed into
Low level normed: nomalizaed pixel energy
High level: use the previous mentioned physics variable only
More details about VQVAE on DS2&3
More details about VQVAE on DS2&3
AE performance
avg.
individual
More details about VQVAE on DS2&3
VQVAE performance
avg.
individual
More details about VQVAE on DS2&3
VQVAE performance
losses/metrics
encoded & quantized
code distribution
Another (similar) VAE model on ML4jets
Manifold?�~latent?
Encoder/decoder arch
We choose 3 layers with 500 units
Dimension of latent
We choose 16, from experiments
We choose 200, but seems can be reduced
Discussion
Is there still chance for VAE model?
More interesting things