Introduction to Landscape - navigating dashboards, interacting with Grafana, and finding relevant information, with a focus on batch job and related resource monitoring.
Landscape grew out of FIFE batch monitoring, but it also serves CMS users, where everything is similar, but different. What can FIFE and DUNE learn from CMS, and vice-versa?
Using Kibana for ad-hoc and deep data exploration and troubleshooting, and directly accessing data for nonstandard usage.
Introduction to types of data that can be collected, what can be done with it, and what best practices you should follow.
Discussion on the state and future of combined monitoring, understanding usage patterns and where the resource contention and bottlenecks are at any given time - compute resources, network, dCache, tape?
The Elastic Analysis Facility is the next-generation interactive computing facility for users, providing scalable on-demand resources though "cloud native" technologies like Kubernetes (OKD) and Ceph, with extensive Prometheus metrics and logs for observability.