Monitoring ESS Hardware metrics
In version 5.2.2, IBM Storage Scale introduced a new sensor, 'GPFSHardware', which collects metrics about fan rotation speed, power supply unit (PSU) power consumption, and the temperature of various enclosure components.
Background:
The IBM Storage Scale System 6000 (ISS 6000) is an implementation of the IBM Storage Scale software in hardware, optimised for the most demanding AI, HPC, analytics and hybrid cloud workloads.
It includes node-to-node communication through an internal private Ethernet network and a nontransparent bridge (NTB) for peer node diagnostics and control. A remote console through serial over LAN (SOL), using a Baseboard Management Controller (BMC) and the Intelligent Platform Management Interface (IPMI), is available for monitoring and controlling the ISS 6000 enclosure and for assisting with deployment and installation.
IPMI (Intelligent Platform Management Interface) is a standardised, message-based hardware management interface. At the core of IPMI is a hardware chip known as the Baseboard Management Controller (BMC) or Management Controller (MC). The BMC provides the interfaces needed to monitor hardware health indicators such as temperature, voltage and fan speed.
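To make the kind of data a BMC exposes more concrete, here is a minimal Python sketch that parses a pipe-delimited sensor table in the layout typically printed by the `ipmitool sensor` command. This is only an illustration: the source does not say which tool is used on the ISS 6000, this is not how the 'GPFSHardware' sensor collects its data, and the sample text, the sensor names and the helper `parse_sensor_table` are all hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional

# Hand-written sample in the usual `ipmitool sensor` column layout
# (name | value | unit | status | thresholds...). Illustrative only;
# it was NOT captured from an ISS 6000 enclosure.
SAMPLE = """\
CPU Temp         | 41.000     | degrees C  | ok    | 0.000 | 5.000 | 10.000 | 85.000 | 90.000 | 95.000
FAN1             | 5600.000   | RPM        | ok    | 300.000 | 500.000 | 700.000 | na | na | na
12V              | 12.126     | Volts      | ok    | 10.200 | 10.500 | 10.800 | 13.200 | 13.500 | 13.800
PSU2 Status      | na         | discrete   | na    | na | na | na | na | na | na
"""


@dataclass
class Reading:
    name: str
    value: Optional[float]  # None when the sensor reports no numeric value
    unit: str
    status: str


def parse_sensor_table(text: str) -> List[Reading]:
    """Parse pipe-delimited sensor lines into Reading objects (hypothetical helper)."""
    readings = []
    for line in text.splitlines():
        fields = [f.strip() for f in line.split("|")]
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        name, raw, unit, status = fields[:4]
        try:
            value = float(raw)
        except ValueError:
            value = None  # e.g. "na" or a discrete sensor state
        readings.append(Reading(name, value, unit, status))
    return readings


if __name__ == "__main__":
    for r in parse_sensor_table(SAMPLE):
        if r.value is not None:
            print(f"{r.name}: {r.value} {r.unit} ({r.status})")
```

Sensors without a numeric reading are kept with `value=None` rather than dropped, so a caller can still see that the sensor exists and inspect its status.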
Each IBM Storage Scale System cluster requires at least one Enterprise Management Server (EMS). The EMS, also referred to as the Utility Node, is the central management component of an IBM Storage Scale System. It is implemented as a RHEL KVM virtual machine and serves as the control hub for the Storage Scale System 6000. It also provides access to the hardware health and performance data that is captured from the hardware abstraction layer (HAL) and stored by the IBM Performance Monitoring tool.
You can use the Grafana software and the ESS hardware sample dashboard bundle to explore hardware metrics for key components directly in your web browser.
Visit the IBM Storage Scale Knowledge Center for more information about the latest product updates.