WLCG Experiments Test Framework (ETF)
Marian Babik, CERN�HEPiX IPv6 WG meeting
Motivation
Current Service Availability Monitoring (SAM) structure
2
HEPiX IPv6 Meeting
Overview
3
Generic test middleware based on open source
Focuses on functional testing (atomic)
~ 150 sites, 1200 hosts monitored
~ 10 metrics/host
~ 1M metrics/day
High-level functional testing
Plugins conforming to Nagios standard
Configurable schedule for test execution
Checkmk dashboard to show results
HEPiX IPv6 Meeting
Architecture
ETF Core Framework
Plugins (probes/tests)
MQ Stream for publishing results
Job Submission Framework (JESS)
Worker Node Framework (WN-FM)
4
HEPiX IPv6 Meeting
Deployment and Operations
Experiment instances @CERN (IPv4-only/IPv6-only in QA, IPv4-only in PROD) �perfSONAR infrastructure monitoring @OSG
ETF now runs in containers and is integrated with gitlab CI
5
HEPiX IPv6 Meeting
Plugins/Tests
6
Plugins | Users/Experiments | Maintained by |
Job Submission | ||
CREAM, ARC, HTCONDOR-CE JESS** | LHCb, ALICE, ATLAS, CMS | ETF |
Worker Nodes | ||
ATLAS (3), CMS (11), LHCb (7) | ATLAS, CMS, LHCb | ATLAS, CMS, LHCb |
Storage | ||
GFAL2 (SRM, gsiftp, XRoot, HTTP) | ATLAS | ATLAS |
GFAL2 (SRM) | CMS | CMS |
XRoot** | CMS | CMS |
HTTPs/WebDAV** | HTTP TF* | HTTP TF* |
Network | ||
perfSONAR infrastructure** | WLCG Network Throughput WG | OSG, WLCG |
**Uses new library for writing plugins (python-nap) *Probe is still supported by GFAL2 team
HEPiX IPv6 Meeting
Summary
7
HEPiX IPv6 Meeting
Questions ?
Docs: https://etf.cern.ch/docs/latest/
Central instance: https://etf.cern.ch/etf/check_mk/
Instances (access requires IGTF/x509 cert loaded in the browser):
CMS production CMS QA IPv6 CMS QA Code: CMS gitlab
ATLAS production ATLAS QA IPv6 ATLAS QA Code: ATLAS gitlab
LHCb production LHCb QA IPv6 LHCb QA Code: LHCb gitlab
ALICE production ALICE QA Code: ALICE gitlab
pS production pS QA Code: pS gitlab�
ETF framework
ETF core containers ETF Job Submission (Jess)
ETF nagios plugins lib. NAP ETF rule-based configuration (ncgx)
ETF support channels: GGUS: Grid Monitoring or etf-support@cern.ch (SNOW) �
8
HEPiX IPv6 Meeting
WLCG Networks: Update on Monitoring and Analytics
perfSONAR Update
S. McKee1, B. Bockelman2, R. Gardner3, �I. Vukotic3, M. Babik4, D. Weitzel5, M. Zvada5, E. F. Hernandez6, �1 University of Michigan, 2 Morgridge Institute of Research, 3 University of Chicago, 4 CERN,�5 University of Nebraska, 6 UCSD
perfSONAR News
perfSONAR 4.2 was released (4.2.2 is the latest release)
�
10
HEPiX IPv6 Meeting
perfSONAR deployment
11
261 Active perfSONAR instances
- 207 production endpoints
- 173 running 4.2; 138 on 4.2.1 (latest)
- T1/T2 coverage
- Continuously testing over 5000 links
- Testing coordinated and managed from central place
- Dedicated latency and bandwidth nodes at each site
- Open platform - tests can be scheduled by anyone who participates in our network and runs perfSONAR
HEPiX IPv6 Meeting
Platform Overview
12
Collector �(NEW)
Store (long-term)
Store (short-term)
pS Monitoring
pS Configuration�
Tape
Experiments
MONIT-GRAFANA
pS Dashboard
HEPiX IPv6 Meeting
MONIT perfSONAR IPv6 dashboard
13
HEPiX IPv6 Meeting
MONIT perfSONAR IPv6 dashboard
14
HEPiX IPv6 Meeting
Network Analytics Activities
During the spring of 2019 we engaged a group of students to work on analysis and visualization of our network metrics
15
Prototype path display using network metrics from ES
The students have worked independently over summer 2019 learning about the data we have and the analytics platform itself
For Fall 2019, the goal was to clean up and annotate the path information, filtering out bad or incomplete traceroute measurements and then work on analyzing, organizing and displaying path information with corresponding network metrics like packet-loss, throughput or delay
HEPiX IPv6 Meeting
Collaboration with MEPhI on Network Visualization
The SAND project is collaborating with MEPHI (Moscow Engineering Physics Institute) on network path visualization
16
Containerized Version running at UC https://perfsonar.uc.ssl-hep.org/graph/viewer
Application being Updated
HEPiX IPv6 Meeting
Platform Use
17
HEPiX IPv6 Meeting
Plans
18
HEPiX IPv6 Meeting
Summary
19
HEPiX IPv6 Meeting
References
�
20
HEPiX IPv6 Meeting