最佳实践提示工程

Claude Sonnet 5 提示编写指南

Claude Sonnet 5 的行为差异和提示编写模式，涵盖 effort 参数、自适应思考默认设置、工具使用，以及从 Claude Sonnet 4.6 迁移的相关内容。

本指南介绍 Claude Sonnet 5 特有的提示编写模式。有关该模型的功能和 API 变更，请参阅 Claude Sonnet 5 新特性。有关适用于所有当前 Claude 模型的技巧，请参阅提示编写最佳实践。

Claude Sonnet 5 在编码和智能体任务方面具有特别的优势。它在现有的 Claude Sonnet 4.6 提示上开箱即用，表现良好。本指南中的模式涵盖了最常需要调优的行为。

有关从 Claude Sonnet 4.6 迁移时的 API 参数变更（自适应思考默认开启、不再接受采样参数、移除手动扩展思考，以及新的分词器），请参阅迁移指南。

响应长度和详细程度

Claude Sonnet 5 会根据任务的复杂程度来校准响应长度，而不是默认采用固定的详细程度。这通常意味着对于简单的查询会给出较短的回答，而对于开放式分析则会给出较长的回答。

如果您的产品依赖于特定风格或详细程度的输出，您可能需要调整提示。例如，要降低详细程度，您可以添加：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

如果您发现特定类型的冗长表达（例如过度解释），可以在提示中添加额外的指令来避免这种情况。展示 Claude 如何以适当简洁程度进行沟通的正面示例，往往比告诉模型不要做什么的负面示例或指令更有效。

校准 effort 和思考深度

effort 参数允许您调整 Claude 的智能程度与令牌消耗之间的平衡，以能力换取更快的速度和更低的成本。在 Claude Sonnet 5 上，effort 默认为 high，与 Claude Sonnet 4.6 相同。对于最困难的编码和智能体任务，请将 effort 提升至 xhigh。尝试其他 effort 级别以进一步调整令牌使用量和智能程度：

max： 绝对最高能力，对令牌消耗没有任何限制。
xhigh： 超高 effort 是最困难的编码和智能体用例的推荐设置。
high： 默认值。此设置在大多数用例中平衡了令牌使用量和智能程度。
medium： 适用于需要减少令牌使用量同时牺牲部分智能程度的成本敏感型用例。
low： 保留用于简短、范围明确的任务以及对延迟敏感但对智能程度不敏感的工作负载。

作为迁移时的粗略跨模型对照：Claude Sonnet 5 在 medium 级别的智能程度与 Claude Sonnet 4.6 在 high 级别相当，Claude Sonnet 5 在 high 级别与 Claude Sonnet 4.6 在 max 级别相当。进行基准测试时，请根据观察到的思考长度而非 effort 名称进行匹配。

Claude Sonnet 5 严格遵守 effort 级别，尤其是在较低级别。在 low 和 medium 级别，模型会将其工作范围限定在所要求的内容上，而不会超额完成。这有利于降低延迟和成本，但在以 low effort 运行中等复杂度的任务时，存在一定的思考不足风险。

如果您在复杂问题上观察到推理过浅，请将 effort 提升至 high 或 xhigh，而不是通过提示来规避。如果您需要为了延迟而将 effort 保持在 low，请添加针对性的指导：

This task involves multi-step reasoning. Think carefully through the problem before responding.

在 Claude Sonnet 5 上，自适应思考（adaptive thinking）默认开启。不带 thinking 字段的请求会以自适应思考模式运行。这与 Claude Sonnet 4.6 不同，在 Claude Sonnet 4.6 中相同的请求会在不启用思考的情况下运行。要完全关闭思考，请传递 thinking: {type: "disabled"}。由于 max_tokens 是对总输出（思考加响应文本）的硬性限制，请重新审视那些在 Claude Sonnet 4.6 上未启用思考运行的工作负载的该参数设置。如果您之前在 Claude Sonnet 4.6 上关闭了思考，请尝试在 Claude Sonnet 5 上开启思考并使用较低的 effort 级别。

自适应思考的触发行为是可引导的。如果您发现模型发出思考块的频率超出您的预期（这在系统提示较大或较复杂时可能发生），请添加指导来引导它。与往常一样，请衡量任何提示更改对性能的影响。示例：

Thinking adds latency and should only be used when it will meaningfully improve answer quality, typically for problems that require multi-step reasoning. When in doubt, respond directly.

反之，如果您在 medium 级别运行困难的工作负载并发现思考不足，首要手段是提升 effort。如果您需要更精细的控制，请直接通过提示来实现。

Claude Sonnet 5 不支持手动扩展思考（thinking: {type: "enabled", budget_tokens: N}），使用时会返回 400 错误。该功能在 Claude Sonnet 4.6 上已被弃用，现已移除。请改用带有 effort 参数的自适应思考。

如果您在 high、xhigh 或 max effort 级别运行 Claude Sonnet 5，请在 max_tokens 中留出余量，以便模型有空间进行思考和工具调用。在长任务中，自适应思考可能会占用预算的很大一部分；如果预算紧张，您可能会看到响应几乎全是思考内容，随后是被截断的回答和 stop_reason: "max_tokens"。提高 max_tokens 或降至 medium effort 可以解决此问题。由于 Claude Sonnet 5 使用新的分词器，对相同文本会产生大约多 30% 的令牌，因此针对 Claude Sonnet 4.6 调优的 max_tokens 限制可能会截断等效的输出。

工具使用触发

Claude Sonnet 5 默认比 Claude Sonnet 4.6 更具智能体特性，会更主动地使用工具并运行自我验证循环。在禁用思考的情况下，模型不太可能主动使用工具或考虑搜索；如果您在关闭思考的情况下依赖工具调用，请在系统提示中添加明确的提示。effort 也是影响工具使用的一个杠杆：high 或 xhigh effort 设置在智能体搜索和编码中会显示出明显更多的工具使用。对于您希望增加工具使用的场景，您还可以调整提示，明确指示模型何时以及如何正确使用其工具。例如，如果您发现模型没有使用您的网络搜索工具，请清楚地描述它应该使用的原因和方式。

面向用户的进度更新

Claude Sonnet 5 在长时间的智能体执行过程中会向用户提供定期的、更高质量的更新。如果您添加了强制生成中间状态消息的脚手架（例如"每 3 次工具调用后，总结一下进度"），请尝试将其移除。如果您发现 Claude Sonnet 5 面向用户的更新在长度或内容上与您的用例不太匹配，请在提示中明确描述这些更新应该是什么样的，并提供示例。

更字面化的指令遵循

Claude Sonnet 5 会字面且明确地解读提示，尤其是在较低的 effort 级别下。它不会默默地将针对某一项的指令泛化到另一项，也不会推断您未提出的请求。这种字面化的好处是精确性，对于具有精心调优提示、结构化提取以及需要可预测行为的流水线等 API 用例，它通常表现更好。如果您需要 Claude 广泛应用某条指令，请明确说明范围（例如，"将此格式应用于每个部分，而不仅仅是第一个部分"）。

语气和写作风格

与任何新模型一样，长篇写作的文体风格可能会有所变化。如果您的产品依赖于特定的语气风格，请根据新的基准重新评估风格提示。

例如，如果您的产品语气更温暖或更具对话性，请添加：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

如果您之前依赖 temperature 来实现风格多样性，请注意在 Claude Sonnet 5 上将 temperature、top_p 或 top_k 设置为非默认值会返回 400 错误。此限制对于 Sonnet 级别的模型是新增的。迁移时请移除这些参数，改用系统提示指令来引导语气和多样性。

设计和前端默认风格

在开放式的前端和设计需求中，Claude Sonnet 5 可能会形成一致的默认视觉风格。默认的固有风格对某些需求可能看起来不错，但对于仪表板、开发工具、金融科技、医疗保健或企业应用可能会显得不合适。

通用指令（"不要使用那种颜色"、"让它简洁极简"）往往会使模型转向另一个固定的配色方案，而不是产生多样性。以下两种方法可靠有效：

1. 指定具体的替代方案。 模型会精确遵循明确的规范：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

2. 让模型在构建之前提出多个选项。 这可以打破默认风格并让用户拥有控制权。由于 Claude Sonnet 5 不接受 temperature 参数，此方法是在多次运行中产生有实质差异的设计方向的推荐方式。示例提示：

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface, plus a one-line rationale). Ask the user to pick one, then implement only that direction.

为了避免用户所称的"AI 糟粕"美学这类通用模式，您可以在系统提示中包含一条简短的指令。frontend-design 技能提供了更完整的处理方法，但以下片段与前述的多样性方法配合使用效果良好：

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

交互式编码产品

在单个用户轮次的自主异步编码智能体与多个用户轮次的交互式同步编码智能体之间，令牌使用量和行为可能有所不同。为了在编码产品中同时最大化性能和令牌效率，请使用 xhigh 或 high effort，添加自动模式等自主功能，并减少用户所需的人工交互次数。

在限制所需用户交互次数时，重要的是在第一个人类轮次中预先指定任务、意图和相关约束。预先提供明确、清晰且准确的任务描述有助于最大化自主性和智能程度，同时最小化用户轮次后的额外令牌使用量。相比之下，在多个用户轮次中逐步传达的模糊或不充分的提示往往会相对降低令牌效率，有时还会降低性能。

代码审查框架

如果您的代码审查框架是针对早期模型调优的，您最初可能会在 Claude Sonnet 5 上看到较低的召回率。这很可能是框架效应，而非能力退化。当审查提示中包含诸如"仅报告高严重性问题"、"保持保守"或"不要吹毛求疵"之类的内容时，Claude Sonnet 5 可能会比早期模型更忠实地遵循该指令：它可能会同样彻底地调查代码、识别出错误，但随后不报告它判断为低于您所述标准的发现。这可能表现为模型进行了相同深度的调查，但将较少的调查转化为报告的发现，尤其是在较低严重性的错误上。精确率通常会上升，但即使模型的底层错误发现能力有所提升，测量到的召回率也可能下降。

一些推荐的提示语言：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

即使没有实际的第二步，也可以使用此提示，但将置信度过滤从发现步骤中移出通常会有所帮助。如果您的框架有单独的验证、去重或排序阶段，请明确告诉模型它在发现阶段的任务是覆盖率而非过滤。

如果您确实希望模型在单次处理中进行自我过滤，请具体说明标准在哪里，而不是使用"重要"之类的定性术语：例如，"报告任何可能导致不正确行为、测试失败或误导性结果的错误；仅省略纯风格或命名偏好之类的琐碎问题。"

针对您的部分评估或测试用例迭代提示，以验证召回率或 F1 分数的提升。

计算机使用

Claude Sonnet 5 支持 computer_20251124 工具版本。计算机使用（computer use）功能适用于各种分辨率，最高分辨率为 2576px / 3.75MP。内部计算机使用测试表明，以 1080p 发送图像可以在性能和成本之间取得良好的平衡。

对于特别注重成本的工作负载，720p 或 1366×768 是成本较低且性能强劲的选项。请自行测试以找到适合您用例的理想设置；尝试不同的 effort 设置也有助于调整模型的行为。

Was this page helpful?

最佳实践提示工程

Claude Sonnet 5 提示编写指南

Claude Sonnet 5 的行为差异和提示编写模式，涵盖 effort 参数、自适应思考默认设置、工具使用，以及从 Claude Sonnet 4.6 迁移的相关内容。

有关从 Claude Sonnet 4.6 迁移时的 API 参数变更（自适应思考默认开启、不再接受采样参数、移除手动扩展思考，以及新的分词器），请参阅迁移指南。

响应长度和详细程度

如果您的产品依赖于特定风格或详细程度的输出，您可能需要调整提示。例如，要降低详细程度，您可以添加：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

校准 effort 和思考深度

max： 绝对最高能力，对令牌消耗没有任何限制。
xhigh： 超高 effort 是最困难的编码和智能体用例的推荐设置。
high： 默认值。此设置在大多数用例中平衡了令牌使用量和智能程度。
medium： 适用于需要减少令牌使用量同时牺牲部分智能程度的成本敏感型用例。
low： 保留用于简短、范围明确的任务以及对延迟敏感但对智能程度不敏感的工作负载。

This task involves multi-step reasoning. Think carefully through the problem before responding.

Thinking adds latency and should only be used when it will meaningfully improve answer quality, typically for problems that require multi-step reasoning. When in doubt, respond directly.

反之，如果您在 medium 级别运行困难的工作负载并发现思考不足，首要手段是提升 effort。如果您需要更精细的控制，请直接通过提示来实现。

工具使用触发

面向用户的进度更新

更字面化的指令遵循

语气和写作风格

与任何新模型一样，长篇写作的文体风格可能会有所变化。如果您的产品依赖于特定的语气风格，请根据新的基准重新评估风格提示。

例如，如果您的产品语气更温暖或更具对话性，请添加：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

设计和前端默认风格

通用指令（"不要使用那种颜色"、"让它简洁极简"）往往会使模型转向另一个固定的配色方案，而不是产生多样性。以下两种方法可靠有效：

1. 指定具体的替代方案。 模型会精确遵循明确的规范：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface, plus a one-line rationale). Ask the user to pick one, then implement only that direction.

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

交互式编码产品

代码审查框架

一些推荐的提示语言：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

针对您的部分评估或测试用例迭代提示，以验证召回率或 F1 分数的提升。

计算机使用

Was this page helpful?

响应长度和详细程度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面化的指令遵循

语气和写作风格

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和详细程度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面化的指令遵循

语气和写作风格

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和详细程度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面化的指令遵循

语气和写作风格

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和详细程度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面化的指令遵循

语气和写作风格

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用