<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>结构化生成 on 黄文卓 | DevOps Engineer</title><link>https://socake.github.io/tags/%E7%BB%93%E6%9E%84%E5%8C%96%E7%94%9F%E6%88%90/</link><description>Recent content in 结构化生成 on 黄文卓 | DevOps Engineer</description><generator>Hugo -- gohugo.io</generator><language>zh-CN</language><managingEditor>17691281867@163.com (Wenzhuo Huang)</managingEditor><webMaster>17691281867@163.com (Wenzhuo Huang)</webMaster><copyright>© 2026 Wenzhuo Huang</copyright><lastBuildDate>Sat, 14 Mar 2026 16:45:00 +0800</lastBuildDate><atom:link href="https://socake.github.io/tags/%E7%BB%93%E6%9E%84%E5%8C%96%E7%94%9F%E6%88%90/index.xml" rel="self" type="application/rss+xml"/><item><title>SGLang 结构化生成实战：RadixAttention、约束解码与多轮对话优化</title><link>https://socake.github.io/posts/sglang-structured-generation/</link><pubDate>Sat, 14 Mar 2026 16:45:00 +0800</pubDate><author>17691281867@163.com (Wenzhuo Huang)</author><guid>https://socake.github.io/posts/sglang-structured-generation/</guid><description>SGLang 是被低估的 LLM 推理框架，RadixAttention 对多轮对话和 Agent 场景收益巨大。本文讲清 SGLang 的核心机制、前端 DSL、约束解码、部署方式和踩坑。</description><media:content xmlns:media="http://search.yahoo.com/mrss/" url="https://socake.github.io/posts/sglang-structured-generation/featured.jpg"/></item></channel></rss>