diff --git a/source/operations/monitoring/collect-minio-metrics-using-prometheus.rst b/source/operations/monitoring/collect-minio-metrics-using-prometheus.rst
index fb1988bb..44601b3d 100644
--- a/source/operations/monitoring/collect-minio-metrics-using-prometheus.rst
+++ b/source/operations/monitoring/collect-minio-metrics-using-prometheus.rst
@@ -196,8 +196,8 @@ The following query examples return metrics collected by Prometheus:
 .. code-block:: shell
    :class: copyable
-   minio_cluster_disk_online_total{job="minio-job"}[5m]
-   minio_cluster_disk_offline_total{job="minio-job"}[5m]
+   minio_cluster_drive_online_total{job="minio-job"}[5m]
+   minio_cluster_drive_offline_total{job="minio-job"}[5m]
    minio_bucket_usage_object_total{job="minio-job"}[5m]
@@ -229,7 +229,7 @@ You can modify or otherwise use these examples as guidance in building your own
          description: "Node(s) in cluster {{ $labels.instance }} offline for more than 5 minutes"
      - alert: DisksOffline
-       expr: avg_over_time(minio_cluster_disk_offline_total{job="minio-job"}[5m]) > 0
+       expr: avg_over_time(minio_cluster_drive_offline_total{job="minio-job"}[5m]) > 0
        for: 10m
        labels:
          severity: warn
diff --git a/source/operations/monitoring/healthcheck-probe.rst b/source/operations/monitoring/healthcheck-probe.rst
index 6713c411..60a8ebfe 100644
--- a/source/operations/monitoring/healthcheck-probe.rst
+++ b/source/operations/monitoring/healthcheck-probe.rst
@@ -69,7 +69,7 @@ metrics to detect potential issues or errors on the MinIO cluster:
 - ``minio_cluster_nodes_offline_total`` to alert if one or more MinIO nodes are offline.
-- ``minio_node_disk_free_bytes`` to alert if the cluster is running
+- ``minio_node_drive_free_bytes`` to alert if the cluster is running
   low on free drive space.
 Cluster Read Quorum
diff --git a/source/operations/monitoring/monitor-and-alert-using-influxdb.rst b/source/operations/monitoring/monitor-and-alert-using-influxdb.rst
index 81864415..ae0c2262 100644
--- a/source/operations/monitoring/monitor-and-alert-using-influxdb.rst
+++ b/source/operations/monitoring/monitor-and-alert-using-influxdb.rst
@@ -111,7 +111,7 @@ Configure InfluxDB to Collect and Alert using MinIO Metrics
    - Create a :guilabel:`Threshold Check` named ``MINIO_QUORUM_WARNING``.
-     Set the filter for the ``minio_cluster_disk_offline_total`` key.
+     Set the filter for the ``minio_cluster_drive_offline_total`` key.
      Set the :guilabel:`Thresholds` to :guilabel:`CRITICAL` when the value is one less than your configured :ref:`Erasure Code Parity ` setting.