最佳实践提示工程

Claude Opus 4.8 提示指南

Claude Opus 4.8 的行为差异和提示模式，涵盖冗长度、effort 校准、工具使用、子代理和前端默认风格。

本指南涵盖 Claude Opus 4.8 特有的提示模式。关于从 Claude Opus 4.8 迁移到最新 Opus 模型所涉及的 API 变更，请参阅从 Claude Opus 4.8 迁移到 Claude Opus 5。关于适用于所有当前 Claude 模型的技巧，请参阅提示最佳实践。

Claude Opus 4.8 在长周期代理工作、知识工作、视觉和记忆任务方面具有特别的优势。它在现有的 Claude Opus 4.7 提示上开箱即用表现良好。以下模式涵盖了最常需要调优的行为。

关于自 Claude Opus 4.7 以来的 API 参数变更（采样参数、effort 默认值、1M 上下文窗口默认值、对话中途系统消息以及拒绝停止详情），请参阅从 Claude Opus 4.7 迁移到 Claude Opus 5，其中涵盖了迁移到最新 Opus 模型过程中的相同变更；Claude Opus 4.8 具有相同的这些行为。

响应长度和冗长度

Claude Opus 4.8 会根据其判断的任务复杂程度来校准响应长度，而不是默认采用固定的冗长度。这通常意味着对简单查询给出更短的回答，而对开放式分析给出长得多的回答。

如果您的产品依赖于特定风格或冗长度的输出，您可能需要调优您的提示。例如，要降低冗长度，您可以添加：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

如果您发现特定类型的冗长表现（例如过度解释），您可以在提示中添加额外的指令来防止它们。展示 Claude 如何以适当简洁程度进行沟通的正面示例，往往比负面示例或告诉模型不要做什么的指令更有效。

校准 effort 和思考深度

effort 参数允许您在 Claude 的智能与令牌消耗之间进行调优，以能力换取更快的速度和更低的成本。对于编码和代理用例，请从 xhigh effort 级别开始；对于大多数对智能敏感的用例，请至少使用 high effort。尝试其他 effort 级别以进一步调优令牌使用量和智能：

max： Max effort 在某些用例中可以带来性能提升，但随着令牌使用量的增加可能出现收益递减。此设置有时也容易出现过度思考。对于智能要求高的任务，请测试 max effort。
xhigh： Extra high effort 是大多数编码和代理用例的最佳设置。
high： 此设置在令牌使用量和智能之间取得平衡。对于大多数对智能敏感的用例，请至少使用 high effort。
medium： 适合需要减少令牌使用量同时在智能上有所取舍的成本敏感用例。
low： 保留给简短、范围明确的任务以及对延迟敏感但对智能不敏感的工作负载。

Claude Opus 4.8 严格遵守 effort 级别，尤其是在低端。在 low 和 medium 下，模型会将其工作范围限定在所要求的内容上，而不会超出预期。这对延迟和成本有利，但在以 low effort 运行中等复杂度的任务时，存在一定的思考不足风险。

如果您在复杂问题上观察到浅层推理，请将 effort 提高到 high 或 xhigh，而不是通过提示来规避。如果您出于延迟考虑需要将 effort 保持在 low，请添加有针对性的指导：

This task involves multistep reasoning. Think carefully through the problem before responding.

对于此模型，effort 可能比以往任何 Opus 模型都更重要，因此在升级时请积极地进行实验。

在 Claude Opus 4.8 上，除非您显式设置 thinking: {type: "adaptive"}，否则思考功能是关闭的。自适应思考的触发行为是可引导的。如果您发现模型思考的频率超出您的期望（这在系统提示较大或较复杂时可能发生），请添加指导来引导它。与往常一样，请衡量任何提示更改对性能的影响。示例：

Thinking adds latency and should only be used when it will meaningfully improve answer quality — typically for problems that require multistep reasoning. When in doubt, respond directly.

相反，如果您在 medium 下运行困难的工作负载并发现思考不足，首要手段是提高 effort。如果您需要更精细的控制，请直接通过提示来实现。

如果您以 max 或 xhigh effort 运行 Claude Opus 4.8，请设置较大的最大输出令牌预算，以便模型有足够的空间在其子代理和工具调用中进行思考和行动。从 64k 令牌开始，然后在此基础上调优。

工具使用触发

Claude Opus 4.8 倾向于优先推理而非工具调用。在大多数情况下，这会产生更好的结果。然而，提高 effort 设置是增加工具使用程度的有效手段，尤其是在知识工作中。high 或 xhigh effort 设置在代理搜索和编码中会显示出明显更多的工具使用。对于您希望有更多工具使用的场景，您还可以调整提示，明确指示模型何时以及如何正确使用其工具。例如，如果您发现模型没有使用您的网络搜索工具，请清楚地描述它为什么以及应该如何使用。

面向用户的进度更新

Claude Opus 4.8 在长代理轨迹中会向用户提供更规律、更高质量的更新。如果您添加了强制中间状态消息的脚手架（"每 3 次工具调用后，总结进度"），请尝试将其移除。如果您发现 Claude Opus 4.8 面向用户的更新的长度或内容与您的用例校准不佳，请在提示中明确描述这些更新应该是什么样子，并提供示例。

更字面的指令遵循

Claude Opus 4.8 会字面且明确地解释提示，尤其是在较低的 effort 级别下。它不会悄悄地将一个项目的指令泛化到另一个项目，也不会推断您没有提出的请求。这种字面性的好处是精确性和更少的反复，并且对于具有精心调优的提示、结构化提取以及需要可预测行为的管道的 API 用例，它通常表现更好。如果您需要 Claude 广泛应用某条指令，请明确说明范围（例如，"将此格式应用于每个部分，而不仅仅是第一个部分"）。

语气和写作风格

与任何新模型一样，长篇写作的文风可能会发生变化。Claude Opus 4.8 倾向于直接、有主见的风格，极少使用迎合性措辞，并且节制使用表情符号。如果您的产品依赖于特定的语气，请针对新的基线重新评估风格提示。

例如，如果您的产品语气更温暖或更具对话性，请添加：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

控制子代理生成

Claude Opus 4.8 默认倾向于生成较少的子代理。然而，这种行为可以通过提示来引导；请为 Claude Opus 4.8 提供关于何时需要子代理的明确指导。以下是一个编码用例的简单示例：

Do not spawn a subagent for work you can complete directly in a single response (e.g. refactoring a function you can already see).

Spawn multiple subagents in the same turn when fanning out across items or reading multiple files.

设计和前端默认风格

Claude Opus 4.8 具有很强的设计直觉，并有一种一致的默认风格：温暖的奶油色/米白色背景（约 #F4F1EA）、衬线展示字体（Georgia、Fraunces、Playfair）、斜体词语强调，以及赤陶色/琥珀色点缀。这种风格适合编辑类、酒店类和作品集类需求，但对于仪表板、开发工具、金融科技、医疗保健或企业应用会显得不合适。这种默认风格会出现在幻灯片和 Web UI 中。

这种默认风格很顽固。泛泛的指令（"不要使用奶油色"、"做得干净简约"）往往会使模型转向另一种固定的配色方案，而不是产生多样性。有两种方法可以可靠地奏效：

1. 指定一个具体的替代方案。 模型会精确遵循明确的规格：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

2. 让模型在构建之前提出多个选项。 这会打破默认风格并让用户掌控。如果您之前依赖 temperature 来获得设计多样性，请使用这种方法；它会在多次运行中产生有意义的不同方向。示例提示：

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface — one-line rationale). Ask the user to pick one, then implement only that direction.

此外，与以前的模型相比，Claude Opus 4.8 需要更少的前端设计提示来避免用户所称的"AI 垃圾"美学的通用模式。对于早期模型，Anthropic 建议在 frontend-design 技能中使用较长的提示片段。然而，Claude Opus 4.8 只需更精简的提示指导即可生成独特、有创意的前端。以下提示片段与前面关于多样性的提示建议配合使用效果很好：

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

交互式编码产品

Claude Opus 4.8 的令牌使用量和行为在单个用户轮次的自主异步编码代理与多个用户轮次的交互式同步编码代理之间可能有所不同。具体来说，它在交互式场景中倾向于使用更多令牌，主要是因为它在用户轮次之后会进行更多推理。这可以在长时间的交互式编码会话中提高长周期连贯性、指令遵循和编码能力，但也会带来更多的令牌使用量。为了在编码产品中同时最大化性能和令牌效率，请使用 xhigh 或 high effort，添加自动模式等自主功能，并减少用户所需的人工交互次数。

当然，在限制所需用户交互次数时，重要的是在第一个人类轮次中预先指定任务、意图和相关约束。预先提供明确、清晰且准确的任务描述，有助于最大化自主性和智能，同时最小化用户轮次之后的额外令牌使用量。由于 Claude Opus 4.8 比以前的模型更自主，这种使用模式有助于最大化性能。相比之下，通过多个用户轮次逐步传达的模糊或规格不足的提示，往往会相对降低令牌效率，有时还会降低性能。

代码审查框架

Claude Opus 4.8 在发现错误方面明显优于以前的模型，并且在内部评估中具有更高的召回率和精确率。然而，如果您的代码审查框架是针对早期模型调优的，您最初可能会看到较低的召回率。这很可能是框架效应，而不是能力退化。当审查提示中包含"仅报告高严重性问题"、"保持保守"或"不要吹毛求疵"之类的内容时，Claude Opus 4.8 可能会比早期模型更忠实地遵循该指令：它可能会同样彻底地调查代码、识别错误，然后不报告它判断为低于您所述标准的发现。这可能表现为模型进行了相同深度的调查，但将更少的调查转化为报告的发现，尤其是在较低严重性的错误上。精确率通常会上升，但即使模型的底层错误发现能力已经提高，测得的召回率也可能下降。

一些推荐的提示语言：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

即使没有实际的第二步，也可以使用此提示，但将置信度过滤从发现步骤中移出通常会有所帮助。如果您的框架有单独的验证、去重或排序阶段，请明确告诉模型它在发现阶段的工作是覆盖率而不是过滤。

如果您确实希望模型在单次处理中自我过滤，请具体说明标准在哪里，而不是使用"重要"之类的定性术语：例如，"报告任何可能导致不正确行为、测试失败或误导性结果的错误；仅省略纯粹的风格或命名偏好之类的小问题。"

针对您的评估或测试用例的子集迭代提示，以验证召回率或 F1 分数的提升。

计算机使用

计算机使用能力适用于各种分辨率，最高分辨率为 2576px / 3.75MP。内部计算机使用测试表明，以 1080p 发送图像可以在性能和成本之间取得良好的平衡。

对于特别注重成本的工作负载，720p 或 1366×768 是性能强劲的低成本选项。请进行您自己的测试，以找到适合您用例的理想设置；尝试不同的 effort 设置也有助于调优模型的行为。

Was this page helpful?

最佳实践提示工程

Claude Opus 4.8 提示指南

Claude Opus 4.8 的行为差异和提示模式，涵盖冗长度、effort 校准、工具使用、子代理和前端默认风格。

响应长度和冗长度

如果您的产品依赖于特定风格或冗长度的输出，您可能需要调优您的提示。例如，要降低冗长度，您可以添加：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

校准 effort 和思考深度

max： Max effort 在某些用例中可以带来性能提升，但随着令牌使用量的增加可能出现收益递减。此设置有时也容易出现过度思考。对于智能要求高的任务，请测试 max effort。
xhigh： Extra high effort 是大多数编码和代理用例的最佳设置。
high： 此设置在令牌使用量和智能之间取得平衡。对于大多数对智能敏感的用例，请至少使用 high effort。
medium： 适合需要减少令牌使用量同时在智能上有所取舍的成本敏感用例。
low： 保留给简短、范围明确的任务以及对延迟敏感但对智能不敏感的工作负载。

This task involves multistep reasoning. Think carefully through the problem before responding.

对于此模型，effort 可能比以往任何 Opus 模型都更重要，因此在升级时请积极地进行实验。

Thinking adds latency and should only be used when it will meaningfully improve answer quality — typically for problems that require multistep reasoning. When in doubt, respond directly.

相反，如果您在 medium 下运行困难的工作负载并发现思考不足，首要手段是提高 effort。如果您需要更精细的控制，请直接通过提示来实现。

工具使用触发

面向用户的进度更新

更字面的指令遵循

语气和写作风格

例如，如果您的产品语气更温暖或更具对话性，请添加：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

控制子代理生成

Do not spawn a subagent for work you can complete directly in a single response (e.g. refactoring a function you can already see).

Spawn multiple subagents in the same turn when fanning out across items or reading multiple files.

设计和前端默认风格

1. 指定一个具体的替代方案。 模型会精确遵循明确的规格：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface — one-line rationale). Ask the user to pick one, then implement only that direction.

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

交互式编码产品

代码审查框架

一些推荐的提示语言：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

针对您的评估或测试用例的子集迭代提示，以验证召回率或 F1 分数的提升。

计算机使用

计算机使用能力适用于各种分辨率，最高分辨率为 2576px / 3.75MP。内部计算机使用测试表明，以 1080p 发送图像可以在性能和成本之间取得良好的平衡。

Was this page helpful?

响应长度和冗长度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面的指令遵循

语气和写作风格

控制子代理生成

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和冗长度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面的指令遵循

语气和写作风格

控制子代理生成

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和冗长度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面的指令遵循

语气和写作风格

控制子代理生成

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用

响应长度和冗长度

校准 effort 和思考深度

工具使用触发

面向用户的进度更新

更字面的指令遵循

语气和写作风格

控制子代理生成

设计和前端默认风格

交互式编码产品

代码审查框架

计算机使用