Cluster Audit Report - 2026-02-14

Generated at: 2026-02-14 18:42

1. Node Resources

🟢 master-01 (worker)

  • Version: v1.34.3+k3s1
  • CPU: 6% (121m)
  • Mem: 66% (2595Mi)

🟢 worker-01 (worker)

  • Version: v1.34.3+k3s1
  • CPU: 18% (740m)
  • Mem: 87% (6912Mi)

🟢 worker-02 (worker)

  • Version: v1.34.3+k3s1
  • CPU: 17% (680m)
  • Mem: 23% (1836Mi)

🟢 worker-03 (worker)

  • Version: v1.34.3+k3s1
  • CPU: 16% (656m)
  • Mem: 75% (5966Mi)
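
The percentage and absolute memory figures above allow a rough estimate of each node's total memory and remaining headroom. The following is a minimal Python sketch, assuming each percentage is usage divided by the node's total (it may instead be relative to allocatable memory) and keeping in mind that the reported percentages are rounded. The function name `estimate_total_mi` is hypothetical and not part of the audit tooling.

```python
def estimate_total_mi(used_mi: int, percent: int) -> float:
    """Approximate total memory (Mi) implied by a usage reading and its percentage."""
    if percent <= 0:
        raise ValueError("percent must be positive")
    return used_mi * 100 / percent


# (used Mi, percent) pairs copied from the node list above.
readings = {
    "master-01": (2595, 66),
    "worker-01": (6912, 87),
    "worker-02": (1836, 23),
    "worker-03": (5966, 75),
}

for node, (used, pct) in readings.items():
    total = estimate_total_mi(used, pct)
    # Headroom is only as accurate as the rounded percentage allows.
    print(f"{node}: ~{total:.0f}Mi total, ~{total - used:.0f}Mi free")
```

Because the inputs are rounded to whole percents, the results should be read as estimates only, useful mainly for spotting which node has the least headroom.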

2. Workload Health

  • Deployments: All 57 Healthy
  • StatefulSets: All 6 Healthy

3. Abnormal Pods

🟡 engine-image-ei-ff1cedad-45wkf

  • Namespace: longhorn-system
  • Status: Running (Unknown)
  • Restarts: 50
  • Node: worker-01

🟡 ingest-worker-v2-b1-7c7bb587f7-rzcpf

  • Namespace: dev-2
  • Status: Running (Unknown)
  • Restarts: 19
  • Node: worker-03

🟡 longhorn-csi-plugin-cbdfg

  • Namespace: longhorn-system
  • Status: Running (Unknown)
  • Restarts: 0
  • Node: worker-01

🟡 longhorn-csi-plugin-cbdfg

  • Namespace: longhorn-system
  • Status: Running (Unknown)
  • Restarts: 44
  • Node: worker-01

🟡 master-disk-check

  • Namespace: kube-system
  • Status: Running (Unknown)
  • Restarts: 816
  • Node: master-01

🟡 monitor-kube-prometheus-st-operator-688f6b9597-mgcn8

  • Namespace: monitoring
  • Status: Running (Unknown)
  • Restarts: 49
  • Node: worker-01

🟡 monitor-kube-state-metrics-75b75b8c56-sdfd8

  • Namespace: monitoring
  • Status: Running (Unknown)
  • Restarts: 62
  • Node: worker-01

🟡 monitor-prometheus-node-exporter-76q4c

  • Namespace: monitoring
  • Status: Running (Unknown)
  • Restarts: 31
  • Node: worker-01

🔴 postgres-backup-29510100-xrthd

  • Namespace: infra
  • Status: Pending (Unknown)
  • Restarts: 0
  • Node: master-01

🟡 prometheus-monitor-kube-prometheus-st-prometheus-0

  • Namespace: monitoring
  • Status: Running (Unknown)
  • Restarts: 0
  • Node: worker-01

🟡 prometheus-monitor-kube-prometheus-st-prometheus-0

  • Namespace: monitoring
  • Status: Running (Unknown)
  • Restarts: 39
  • Node: worker-01

🟡 rocket-chat-account-cfd85cc46-bjqzr

  • Namespace: project-team-chat
  • Status: Running (Unknown)
  • Restarts: 8
  • Node: worker-03

🟡 rocket-chat-ddp-streamer-664656df6b-g6hct

  • Namespace: project-team-chat
  • Status: Running (Unknown)
  • Restarts: 113
  • Node: worker-01

🟡 rocket-chat-presence-86449566b6-jkkmx

  • Namespace: project-team-chat
  • Status: Running (Unknown)
  • Restarts: 8
  • Node: worker-03

🟡 rocket-chat-rocketchat-7c8bcbb868-5rj2t

  • Namespace: project-team-chat
  • Status: Running (Unknown)
  • Restarts: 68
  • Node: worker-01

🔴 w03-final

  • Namespace: default
  • Status: Failed (Unknown)
  • Restarts: 0
  • Node: worker-03
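
To see where the restarts concentrate, the entries above can be tallied per node. This is a minimal Python sketch that copies the node and restart values from the list; the helper name `restarts_by_node` is hypothetical, and pods listed twice are counted as separate entries on the assumption that each entry is a distinct container.

```python
from collections import defaultdict

# (pod, node, restarts) copied from the entries listed above.
ABNORMAL = [
    ("engine-image-ei-ff1cedad-45wkf", "worker-01", 50),
    ("ingest-worker-v2-b1-7c7bb587f7-rzcpf", "worker-03", 19),
    ("longhorn-csi-plugin-cbdfg", "worker-01", 0),
    ("longhorn-csi-plugin-cbdfg", "worker-01", 44),
    ("master-disk-check", "master-01", 816),
    ("monitor-kube-prometheus-st-operator-688f6b9597-mgcn8", "worker-01", 49),
    ("monitor-kube-state-metrics-75b75b8c56-sdfd8", "worker-01", 62),
    ("monitor-prometheus-node-exporter-76q4c", "worker-01", 31),
    ("postgres-backup-29510100-xrthd", "master-01", 0),
    ("prometheus-monitor-kube-prometheus-st-prometheus-0", "worker-01", 0),
    ("prometheus-monitor-kube-prometheus-st-prometheus-0", "worker-01", 39),
    ("rocket-chat-account-cfd85cc46-bjqzr", "worker-03", 8),
    ("rocket-chat-ddp-streamer-664656df6b-g6hct", "worker-01", 113),
    ("rocket-chat-presence-86449566b6-jkkmx", "worker-03", 8),
    ("rocket-chat-rocketchat-7c8bcbb868-5rj2t", "worker-01", 68),
    ("w03-final", "worker-03", 0),
]


def restarts_by_node(entries):
    """Return {node: {"entries": count, "restarts": total}} for the given rows."""
    totals = defaultdict(lambda: {"entries": 0, "restarts": 0})
    for _pod, node, restarts in entries:
        totals[node]["entries"] += 1
        totals[node]["restarts"] += restarts
    return dict(totals)


summary = restarts_by_node(ABNORMAL)
# Print nodes from most to fewest total restarts.
for node, stats in sorted(summary.items(), key=lambda kv: -kv[1]["restarts"]):
    print(f"{node}: {stats['entries']} entries, {stats['restarts']} restarts")
```

A tally like this only shows where restarts accumulate; it does not by itself establish a cause.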

4. Storage Health

PVC Top Usage

  • 40% used by firecrawl-redis-56b494c988-xbpqh (/data)
  • 1% used by nocodb-5ff5dd9484-gq2sd (/usr/app/data)
  • 2% used by postgres-f65fc7f79-mb5b5 (/var/lib/postgresql/data)
  • 3% used by nginx-proxy-manager-5dfcd6bcc8-pm5qp (/data)
  • 1% used by ghost-5b5f458c6d-cz4ns (/var/lib/ghost/content)
  • 5% used by ghost-mysql-5b9d4d6b68-8mq5m (/var/lib/mysql)

5. Recent Warnings

  • [2026-02-14T10:42:04Z] Pod/master-disk-check (kube-system): Back-off restarting failed container check in pod master-disk-check_kube-system(d441b30a-fa5a-4139-bca3-5da6f77602ec)
  • [2026-02-14T10:40:10Z] Pod/postgres-backup-29510100-xrthd (infra): Error: ImagePullBackOff
  • [2026-02-14T09:46:15Z] Pod/kalai-5bc87fb6d-zqhp8 (default): Back-off restarting failed container kalai in pod kalai-5bc87fb6d-zqhp8_default(6b33515e-8e38-4cf8-bfbc-d857cd161c2f)
  • [2026-02-14T09:41:30Z] Pod/kalai-6cd8d46866-5f2bx (default): Back-off restarting failed container kalai in pod kalai-6cd8d46866-5f2bx_default(e94f71fe-6970-4f91-b918-f4467ece3365)
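
Each warning entry follows the pattern `[timestamp] Kind/name (namespace): message`, so it can be split into fields for filtering or grouping. The following is a minimal Python sketch; the pattern is inferred only from the entries in this report, and `parse_event` and `EVENT_RE` are hypothetical names rather than part of any audit tool.

```python
import re
from datetime import datetime, timezone

# Pattern inferred from the entries in this report:
# [timestamp] Kind/name (namespace): message
EVENT_RE = re.compile(
    r"\[(?P<ts>[^\]]+)\]\s+"
    r"(?P<kind>\w+)/(?P<name>\S+)\s+"
    r"\((?P<namespace>[^)]+)\):\s+"
    r"(?P<message>.*)"
)


def parse_event(line: str):
    """Parse one warning entry into a dict, or return None if it does not match."""
    match = EVENT_RE.search(line)  # search() tolerates a leading bullet or spaces
    if match is None:
        return None
    record = match.groupdict()
    # The trailing "Z" marks UTC, so attach the UTC timezone explicitly.
    record["ts"] = datetime.strptime(
        record["ts"], "%Y-%m-%dT%H:%M:%SZ"
    ).replace(tzinfo=timezone.utc)
    return record


sample = (
    "[2026-02-14T10:40:10Z] Pod/postgres-backup-29510100-xrthd (infra): "
    "Error: ImagePullBackOff"
)
event = parse_event(sample)
print(event["namespace"], event["name"], event["message"])
```

The parser handles one entry per call; a run of several entries would need to be split into individual entries first.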