HS3
High Energy Physics �Statistics Serialization Standard
Carsten Burgard
Tomas Dado, Jonas Eschle, Matthew Feickert, Cornelius Grunwald, Alexander Held, Robin Pelkner, Jonas Rembser, Oliver Schulz
1
Stating the problem
2
RooWorkspace
Stating the problem
3
3
?
?
?
?
?
RooWorkspace
Publications as a channel of communication
4
4
Simplified Models: Multivariate Gaussians
5
5
covariance matrix
correlations
uncertainties
central values
Publications as a channel of communication
6
6
A long journey
7
7
Massimo Corradi
It seems to me that there is a general consensus that what is really meaningful for an experiment is likelihood, and almost everybody would agree on the prescription that experiments should give their likelihood function for these kinds of results. Does everybody agree on this statement, to publish likelihoods?
Louis Lyons
Any disagreement? Carried unanimously. That’s actually quite an achievement for this Workshop.
this slide was stolen from Lukas Heinrich & Kyle Cranmer
pyhf and its influence on likelihood publishing
8
8
Paper combining full likelihoods published by ATLAS with simplified models published by CMS
Look out for this button here!
There is significant tooling available around pyhf & JSON:
Even though pyhf is not the right tool for every analysis, the documentation quality is excellent and it’s a great tool for simple & quick analyses!
pyhf: A success story
9
9
Purely descriptive, static model representations are extremely useful
The common problem(s)
10
10
What we want
11
11
Where to learn from
12
12
If we can have the features of a RooWorkspace
convertible to and from ROOT
in a format that resembles pyhf JSON
with similar quality documentation,
we’re basically there!
ROOT Workspaces
13
13
computational graph of a Higgs combination workspace
this graphic was stolen from Wouter Verkerke
ROOT Workspaces ⇔ JSON Files
14
14
JSON representation of an ATLAS measurement
mix of HiFa & non-HiFa components
PreprocessFunctions & more
“ModelConfig”
obs, asimov, toys, …
pdfs
param. ranges
functions
connecting data+model
Top-level components
15
15
HS3 statistical models
distributions
functions
data
likelihoods
domains
parameter�points
analyses
metadata
misc
this graphic was stolen from Robin Pelkner
Example: Distributions
16
16
definition in the standard
actual usage in JSON
More examples
17
17
All of these are suggestions, �feel free to suggest improvements!
This is not a pipe-dream!
18
18
It’s alive: Higgs discovery workspaces from ATLAS
19
19
analyses
20
20
you can always leave it to the user to come up with descriptions of what they want to do
likelihoods
21
21
data
22
22
distributions
23
23
Please take a look at the standard and find whether any of the distributions important to you is not yet included
Long term, we strive to include all reasonably frequently used pdf types �in the standard!
distributions
24
24
functions
25
25
parameter_points
domains
26
26
metadata
27
27
misc
28
28
anything that is nice-to-have but not standardizable enough to fit in metadata goes here!
Top-level components
29
29
HS3 statistical models
distributions
functions
data
likelihoods
domains
parameter�points
analyses
metadata
misc
this graphic was stolen from Robin Pelkner
Conclusions
30
30