<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>监控 on 黄文卓 | DevOps Engineer</title><link>https://socake.github.io/tags/%E7%9B%91%E6%8E%A7/</link><description>Recent content in 监控 on 黄文卓 | DevOps Engineer</description><generator>Hugo -- gohugo.io</generator><language>zh-CN</language><managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor><webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster><copyright>© 2026 Wenzhuo Huang</copyright><lastBuildDate>Thu, 18 Dec 2025 11:20:00 +0800</lastBuildDate><atom:link href="https://socake.github.io/tags/%E7%9B%91%E6%8E%A7/index.xml" rel="self" type="application/rss+xml"/><item><title>Prometheus 进程监控：process-exporter 实战与告警配置</title><link>https://socake.github.io/posts/prometheus-process-monitoring/</link><pubDate>Thu, 18 Dec 2025 11:20:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/prometheus-process-monitoring/</guid><description>K8s 有完善的 Pod 监控体系，但裸机和 VM 上运行的进程如何监控？本文介绍 process-exporter 的部署与配置实践，覆盖进程组匹配、核心指标、告警规则设计及实际踩坑经验。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/prometheus-process-monitoring/featured.jpg"/></item><item><title>Linux 系统性能排查手册</title><link>https://socake.github.io/docs/linux/linux%E7%B3%BB%E7%BB%9F%E6%80%A7%E8%83%BD%E6%8E%92%E6%9F%A5/</link><pubDate>Tue, 09 Dec 2025 10:00:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/docs/linux/linux%E7%B3%BB%E7%BB%9F%E6%80%A7%E8%83%BD%E6%8E%92%E6%9F%A5/</guid><description>覆盖 top/htop/mpstat/vmstat/iostat/sar 等核心命令，结合 iowait/softirq/CPU 窃取等指标含义，提供完整排查流程和组合命令速查。</description></item><item><title>Prometheus + Grafana + Loki 可观测性体系建设</title><link>https://socake.github.io/docs/kubernetes/%E5%8F%AF%E8%A7%82%E6%B5%8B%E6%80%A7%E5%BB%BA%E8%AE%BE/</link><pubDate>Mon, 08 Dec 2025 15:00:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/docs/kubernetes/%E5%8F%AF%E8%A7%82%E6%B5%8B%E6%80%A7%E5%BB%BA%E8%AE%BE/</guid><description>记录在多套 K8s 集群上建立统一可观测性平台的实践经验，包含 Prometheus 采集配置、告警规则设计、Grafana Dashboard 组织方式，以及跨集群日志聚合的 Loki 部署方案。</description></item><item><title>ELK 集群监控：用 Prometheus + Grafana 监控 Elasticsearch 健康</title><link>https://socake.github.io/posts/elk-prometheus-monitoring/</link><pubDate>Wed, 08 Oct 2025 11:33:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/elk-prometheus-monitoring/</guid><description>Kibana 内置的 Stack Monitoring 免费功能有限，告警媒介也受商业授权约束。我们最终选择 Prometheus + Grafana 方案监控 ELK 集群，这篇文章记录完整的落地过程和踩坑。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/elk-prometheus-monitoring/featured.jpg"/></item><item><title>VictoriaMetrics：比 Prometheus 更省资源的监控存储方案</title><link>https://socake.github.io/posts/victoriametrics-prometheus/</link><pubDate>Mon, 28 Jul 2025 13:37:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/victoriametrics-prometheus/</guid><description>Prometheus 撑不住了？本文对比 VictoriaMetrics 与 Prometheus 的核心差异，介绍 remote_write 无缝迁移方案，以及 VM 在资源占用、压缩率、查询性能上的实际提升。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/victoriametrics-prometheus/featured.jpg"/></item><item><title>可观测性建设：从 Prometheus 采集到 Grafana 告警联动</title><link>https://socake.github.io/posts/prometheus-grafana/</link><pubDate>Fri, 06 Dec 2024 09:30:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/prometheus-grafana/</guid><description>可观测性不是装几个监控工具，而是让系统在出问题时能快速定位根因。这篇文章从采集架构到 PromQL 到告警路由，覆盖我们在生产环境中实际遇到的 cardinality 爆炸、告警噪音等问题。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/prometheus-grafana/featured.jpg"/></item><item><title>Python 对接 Prometheus：查询监控数据与告警状态自动化</title><link>https://socake.github.io/posts/python-prometheus-monitoring/</link><pubDate>Mon, 25 Nov 2024 11:44:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/python-prometheus-monitoring/</guid><description>用 Python 直接调 Prometheus HTTP API，实现服务存活巡检、可用率日报生成，最后接入钉钉每日自动推送集群健康摘要。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/python-prometheus-monitoring/featured.jpg"/></item></channel></rss>