<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>事故响应 on 黄文卓 | DevOps Engineer</title><link>https://socake.github.io/tags/%E4%BA%8B%E6%95%85%E5%93%8D%E5%BA%94/</link><description>Recent content in 事故响应 on 黄文卓 | DevOps Engineer</description><generator>Hugo -- gohugo.io</generator><language>zh-CN</language><managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor><webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster><copyright>© 2026 Wenzhuo Huang</copyright><lastBuildDate>Sat, 05 Jul 2025 09:30:00 +0800</lastBuildDate><atom:link href="https://socake.github.io/tags/%E4%BA%8B%E6%95%85%E5%93%8D%E5%BA%94/index.xml" rel="self" type="application/rss+xml"/><item><title>SRE 故障管理全生命周期：从响应到复盘</title><link>https://socake.github.io/posts/sre-incident-management/</link><pubDate>Sat, 05 Jul 2025 09:30:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/sre-incident-management/</guid><description>故障处理不只是技术问题，更是协作和信息流问题。这篇文章完整梳理了从故障触发到 Post-Mortem 归档的每个环节，包括 IC 角色的意义、15 分钟定界框架，以及如何让 Post-Mortem 真正推动改进而不是走过场。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/sre-incident-management/featured.jpg"/></item></channel></rss>