cluster-audit
Cluster Audit Report - 2026-02-14
Generated at: 2026-02-14 18:42
1. 节点资源概览 (Node Resources)
🟢 master-01 (worker)
- Version: v1.34.3+k3s1
- CPU: 6% (121m)
- Mem: 66% (2595Mi)
🟢 worker-01 (worker)
- Version: v1.34.3+k3s1
- CPU: 18% (740m)
- Mem: 87% (6912Mi)
🟢 worker-02 (worker)
- Version: v1.34.3+k3s1
- CPU: 17% (680m)
- Mem: 23% (1836Mi)
🟢 worker-03 (worker)
- Version: v1.34.3+k3s1
- CPU: 16% (656m)
- Mem: 75% (5966Mi)
2. 工作负载健康度 (Workload Health)
- ✅ Deployments: All 57 Healthy
- ✅ StatefulSets: All 6 Healthy
3. 异常 Pods (Abnormal Pods)
🟡 engine-image-ei-ff1cedad-45wkf
- Namespace:
longhorn-system - Status: Running (Unknown)
- Restarts: 50
- Node: worker-01
🟡 ingest-worker-v2-b1-7c7bb587f7-rzcpf
- Namespace:
dev-2 - Status: Running (Unknown)
- Restarts: 19
- Node: worker-03
🟡 longhorn-csi-plugin-cbdfg
- Namespace:
longhorn-system - Status: Running (Unknown)
- Restarts: 0
- Node: worker-01
🟡 longhorn-csi-plugin-cbdfg
- Namespace:
longhorn-system - Status: Running (Unknown)
- Restarts: 44
- Node: worker-01
🟡 master-disk-check
- Namespace:
kube-system - Status: Running (Unknown)
- Restarts: 816
- Node: master-01
🟡 monitor-kube-prometheus-st-operator-688f6b9597-mgcn8
- Namespace:
monitoring - Status: Running (Unknown)
- Restarts: 49
- Node: worker-01
🟡 monitor-kube-state-metrics-75b75b8c56-sdfd8
- Namespace:
monitoring - Status: Running (Unknown)
- Restarts: 62
- Node: worker-01
🟡 monitor-prometheus-node-exporter-76q4c
- Namespace:
monitoring - Status: Running (Unknown)
- Restarts: 31
- Node: worker-01
🔴 postgres-backup-29510100-xrthd
- Namespace:
infra - Status: Pending (Unknown)
- Restarts: 0
- Node: master-01
🟡 prometheus-monitor-kube-prometheus-st-prometheus-0
- Namespace:
monitoring - Status: Running (Unknown)
- Restarts: 0
- Node: worker-01
🟡 prometheus-monitor-kube-prometheus-st-prometheus-0
- Namespace:
monitoring - Status: Running (Unknown)
- Restarts: 39
- Node: worker-01
🟡 rocket-chat-account-cfd85cc46-bjqzr
- Namespace:
project-team-chat - Status: Running (Unknown)
- Restarts: 8
- Node: worker-03
🟡 rocket-chat-ddp-streamer-664656df6b-g6hct
- Namespace:
project-team-chat - Status: Running (Unknown)
- Restarts: 113
- Node: worker-01
🟡 rocket-chat-presence-86449566b6-jkkmx
- Namespace:
project-team-chat - Status: Running (Unknown)
- Restarts: 8
- Node: worker-03
🟡 rocket-chat-rocketchat-7c8bcbb868-5rj2t
- Namespace:
project-team-chat - Status: Running (Unknown)
- Restarts: 68
- Node: worker-01
🔴 w03-final
- Namespace:
default - Status: Failed (Unknown)
- Restarts: 0
- Node: worker-03
4. 存储健康度 (Storage Health)
PVC Top Usage
- 40% used by
firecrawl-redis-56b494c988-xbpqh(/data) - 1% used by
nocodb-5ff5dd9484-gq2sd(/usr/app/data) - 2% used by
postgres-f65fc7f79-mb5b5(/var/lib/postgresql/data) - 3% used by
nginx-proxy-manager-5dfcd6bcc8-pm5qp(/data) - 1% used by
ghost-5b5f458c6d-cz4ns(/var/lib/ghost/content) - 5% used by
ghost-mysql-5b9d4d6b68-8mq5m(/var/lib/mysql)
5. 近期警告事件 (Recent Warnings)
[2026-02-14T10:42:04Z] Pod/master-disk-check (kube-system): Back-off restarting failed container check in pod master-disk-check_kube-system(d441b30a-fa5a-4139-bca3-5da6f77602ec) [2026-02-14T10:40:10Z] Pod/postgres-backup-29510100-xrthd (infra): Error: ImagePullBackOff [2026-02-14T09:46:15Z] Pod/kalai-5bc87fb6d-zqhp8 (default): Back-off restarting failed container kalai in pod kalai-5bc87fb6d-zqhp8_default(6b33515e-8e38-4cf8-bfbc-d857cd161c2f) [2026-02-14T09:41:30Z] Pod/kalai-6cd8d46866-5f2bx (default): Back-off restarting failed container kalai in pod kalai-6cd8d46866-5f2bx_default(e94f71fe-6970-4f91-b918-f4467ece3365)