Scope & Activities
Teng Jian Khoo (HU-Berlin/Innsbruck - ATLAS)
Paul Laycock (BNL - Belle II, DUNE)
Andrea Rizzi (INFN Pisa - CMS)
Outline
2
Working group page: https://hepsoftwarefoundation.org/workinggroups/dataanalysis.html
DAWG Goals
Aims:
Priorities:
3
Highlight event
Pre-CHEP ‘19 WLCG/HSF Workshop�Analysis Systems: From Future Facilities to Final Plots
“Brain-writing” exercises addressing:
Continued active engagement with WLCG critical
4
5
HL-LHC Computing Review
LHCC commissioned review by HSF: “Common Tools and Community Software”
Analysis highlights:
6
Development targets
7
Declarative analysis models
Analysis code quality
Reproducibility & preservation
Data formats for analysis
Growing use of ML
Trends
Topics
Targets
Analysis facility design
More data, higher precision
Efficient use of resources
Fluid research workforce
Analysis metadata
Specific questions
Standardised analysis formats a la CMS nano-AOD, ATLAS DAOD_PHYSLITE� -- Production models? Adaptability c.f. the “10% analyses”
Analysis interfaces, description, preservation�-- Is a Domain-Specific Language a practical solution? �-- Or declarative layers (high-level workflow, mid-level tasks, low-level cuts)?�-- How to store/access metadata uniformly and robustly?
Analysis & the grid�-- What do we need at computing facilities (GPU, fast network vs disk, …)?�-- Do we need specialised facilities for analysis? How will job distribution work?�-- How to improve validation & performance monitoring of user code?
8
Outlook
Analysis software should be an enabler, not an obstacle� -- Design such that good practices are the default
Build capabilities for growing sophistication without exploding costs� -- Need effective interfaces to ML, accelerators� -- Must provide equitable access to infrastructure
Close connections to software training & documentation� -- “Higher level” languages for analysis operations could help
Quis custodiet analysis metadata?�-- Do we need an event/body to steer? Key stakeholders?
9