最佳實踐提示工程

為 Claude Opus 4.8 撰寫提示

Claude Opus 4.8 的行為差異與提示模式，涵蓋冗長度、effort 校準、工具使用、子代理程式與前端預設值。

本指南涵蓋 Claude Opus 4.8 特有的提示模式。關於從 Claude Opus 4.8 遷移至最新 Opus 模型所涉及的 API 變更，請參閱從 Claude Opus 4.8 遷移至 Claude Opus 5。關於適用於所有目前 Claude 模型的技巧，請參閱提示最佳實踐。

Claude Opus 4.8 在長時程代理式工作、知識工作、視覺與記憶任務方面具有特別的優勢。它在現有的 Claude Opus 4.7 提示上開箱即用的表現良好。以下模式涵蓋最常需要調整的行為。

關於自 Claude Opus 4.7 以來的 API 參數變更（取樣參數、effort 預設值、1M 上下文視窗預設值、對話中途的系統訊息，以及拒絕停止的詳細資訊），請參閱從 Claude Opus 4.7 遷移至 Claude Opus 5，該文件涵蓋了邁向最新 Opus 模型過程中的相同變更；Claude Opus 4.8 具有相同的這些行為。

回應長度與冗長度

Claude Opus 4.8 會根據它判斷任務的複雜程度來校準回應長度，而不是預設為固定的冗長度。這通常意味著在簡單查詢上給出較短的答案，而在開放式分析上給出長得多的答案。

如果您的產品依賴於特定風格或冗長度的輸出，您可能需要調整您的提示。舉例來說，若要降低冗長度，您可以加入：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

如果您看到特定類型的冗長度範例（例如過度解釋），您可以在提示中加入額外的指示來防止它們。展示 Claude 如何以適當簡潔程度進行溝通的正面範例，往往比負面範例或告訴模型不要做什麼的指示更有效。

校準 effort 與思考深度

effort 參數讓您可以調整 Claude 的智慧與 token 花費之間的平衡，以能力換取更快的速度和更低的成本。對於程式編寫和代理式使用案例，請從 xhigh effort 等級開始，對於大多數對智慧敏感的使用案例，請至少使用 high effort。嘗試其他 effort 等級以進一步調整 token 使用量和智慧：

max： Max effort 在某些使用案例中可以帶來效能提升，但隨著 token 使用量增加，可能會出現報酬遞減的情況。此設定有時也可能容易過度思考。請針對需要高智慧的任務測試 max effort。
xhigh： Extra high effort 是大多數程式編寫和代理式使用案例的最佳設定。
high： 此設定在 token 使用量和智慧之間取得平衡。對於大多數對智慧敏感的使用案例，請至少使用 high effort。
medium： 適合需要減少 token 使用量並以智慧作為代價的成本敏感使用案例。
low： 保留給簡短、範圍明確的任務，以及對延遲敏感但對智慧不敏感的工作負載。

Claude Opus 4.8 會嚴格遵守 effort 等級，尤其是在低端。在 low 和 medium 時，模型會將其工作範圍限定在被要求的內容，而不是超出預期。這對延遲和成本有利，但在以 low effort 執行中等複雜度的任務時，存在思考不足的風險。

如果您在複雜問題上觀察到淺層推理，請將 effort 提高到 high 或 xhigh，而不是透過提示來繞過它。如果您因延遲需求而需要將 effort 保持在 low，請加入有針對性的指引：

This task involves multistep reasoning. Think carefully through the problem before responding.

對於此模型而言，effort 可能比任何先前的 Opus 都更重要，因此在升級時請積極地進行實驗。

在 Claude Opus 4.8 上，除非您明確設定 thinking: {type: "adaptive"}，否則思考功能是關閉的。自適應思考的觸發行為是可引導的。如果您發現模型思考的頻率比您希望的更高（這在使用大型或複雜的系統提示時可能發生），請加入指引來引導它。一如既往，請衡量任何提示變更對效能的影響。範例：

Thinking adds latency and should only be used when it will meaningfully improve answer quality — typically for problems that require multistep reasoning. When in doubt, respond directly.

相反地，如果您在 medium 下執行困難的工作負載並看到思考不足的情況，第一個調整手段是提高 effort。如果您需要更精細的控制，請直接透過提示來達成。

如果您以 max 或 xhigh effort 執行 Claude Opus 4.8，請設定較大的最大輸出 token 預算，讓模型有空間在其子代理程式和工具呼叫之間進行思考和行動。從 64k token 開始，並從那裡進行調整。

工具使用觸發

Claude Opus 4.8 傾向於偏好推理而非工具呼叫。這在大多數情況下會產生更好的結果。然而，提高 effort 設定是增加工具使用程度的有用手段，尤其是在知識工作中。high 或 xhigh effort 設定在代理式搜尋和程式編寫中顯示出明顯更多的工具使用。對於您希望有更多工具使用的情境，您也可以調整您的提示，明確指示模型何時以及如何正確使用其工具。例如，如果您發現模型沒有使用您的網路搜尋工具，請清楚地描述它為什麼以及應該如何使用。

面向使用者的進度更新

Claude Opus 4.8 在長時間的代理式執行過程中，會向使用者提供更規律、更高品質的更新。如果您已加入強制產生中間狀態訊息的鷹架（「每 3 次工具呼叫後，總結進度」），請嘗試移除它。如果您發現 Claude Opus 4.8 面向使用者的更新的長度或內容與您的使用案例校準得不夠好，請在提示中明確描述這些更新應該是什麼樣子，並提供範例。

更字面的指令遵循

Claude Opus 4.8 會以字面且明確的方式解讀提示，尤其是在較低的 effort 等級下。它不會默默地將一個項目的指令推廣到另一個項目，也不會推斷您沒有提出的請求。這種字面主義的好處是精確性和更少的反覆，而且對於具有精心調整的提示、結構化擷取，以及您希望行為可預測的管線等 API 使用案例，它通常表現得更好。如果您需要 Claude 廣泛地套用某個指令，請明確說明範圍（例如，「將此格式套用到每個章節，而不僅僅是第一個」）。

語氣與寫作風格

與任何新模型一樣，長篇寫作的文體風格可能會有所改變。Claude Opus 4.8 傾向於直接、有主見的風格，極少使用認同式的措辭，並且節制地使用表情符號。如果您的產品依賴於特定的語調，請根據新的基準重新評估風格提示。

例如，如果您的產品語調更溫暖或更口語化，請加入：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

控制子代理程式的產生

Claude Opus 4.8 預設傾向於產生較少的子代理程式。然而，此行為可以透過提示來引導；請為 Claude Opus 4.8 提供關於何時需要子代理程式的明確指引。以下是一個程式編寫使用案例的簡單範例：

Do not spawn a subagent for work you can complete directly in a single response (e.g. refactoring a function you can already see).

Spawn multiple subagents in the same turn when fanning out across items or reading multiple files.

設計與前端預設值

Claude Opus 4.8 具有強烈的設計直覺，並有一致的預設風格：溫暖的奶油色/米白色背景（約 #F4F1EA）、襯線展示字體（Georgia、Fraunces、Playfair）、斜體字重點，以及赤陶色/琥珀色的強調色。這對於編輯、餐旅和作品集類的需求來說效果很好，但對於儀表板、開發工具、金融科技、醫療保健或企業應用程式來說會顯得不協調。此預設值會出現在簡報和網頁 UI 中。

此預設值相當頑固。泛泛的指示（「不要使用奶油色」、「讓它乾淨簡約」）往往會讓模型轉向另一種固定的調色盤，而不是產生多樣性。有兩種方法可以可靠地發揮作用：

1. 指定具體的替代方案。 模型會精確地遵循明確的規格：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

2. 讓模型在建構之前提出選項。 這會打破預設值並讓使用者擁有控制權。如果您先前依賴 temperature 來獲得設計多樣性，請使用此方法；它會在不同的執行中產生有意義的不同方向。範例提示：

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface — one-line rationale). Ask the user to pick one, then implement only that direction.

此外，與先前的模型相比，Claude Opus 4.8 需要較少的前端設計提示，即可避免使用者所稱的「AI slop」美學的泛用模式。對於較早的模型，Anthropic 建議在 frontend-design skill 中使用較長的提示片段。然而，Claude Opus 4.8 只需更精簡的提示指引，即可產生獨特、有創意的前端。以下提示片段與前述關於多樣性的提示建議搭配使用效果良好：

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

互動式程式編寫產品

Claude Opus 4.8 的 token 使用量和行為，在只有單一使用者回合的自主、非同步程式編寫代理程式，與具有多個使用者回合的互動式、同步程式編寫代理程式之間可能有所不同。具體來說，它在互動式環境中傾向於使用更多 token，主要是因為它在使用者回合之後會進行更多推理。這可以在長時間的互動式程式編寫工作階段中改善長時程的連貫性、指令遵循和程式編寫能力，但也伴隨著更多的 token 使用量。為了在程式編寫產品中同時最大化效能和 token 效率，請使用 xhigh 或 high effort，加入自動模式等自主功能，並減少使用者所需的人工互動次數。

當然，在限制所需的使用者互動次數時，重要的是在第一個人類回合中預先指定任務、意圖和相關限制。預先提供明確指定、清晰且準確的任務描述，有助於最大化自主性和智慧，同時最小化使用者回合之後的額外 token 使用量。由於 Claude Opus 4.8 比先前的模型更具自主性，這種使用模式有助於最大化效能。相反地，透過多個使用者回合逐步傳達的模糊或指定不足的提示，往往會相對降低 token 效率，有時也會降低效能。

程式碼審查框架

Claude Opus 4.8 在尋找錯誤方面明顯優於先前的模型，並且在內部評估中具有更高的召回率和精確度。然而，如果您的程式碼審查框架是針對較早的模型調整的，您最初可能會看到較低的召回率。這很可能是框架效應，而不是能力退化。當審查提示中包含「只回報高嚴重性問題」、「保持保守」或「不要吹毛求疵」之類的內容時，Claude Opus 4.8 可能會比較早的模型更忠實地遵循該指令：它可能會同樣徹底地調查程式碼、識別出錯誤，然後不回報它判斷低於您所述標準的發現。這可能表現為模型進行相同深度的調查，但將較少的調查轉化為回報的發現，尤其是在較低嚴重性的錯誤上。精確度通常會上升，但即使模型的底層錯誤尋找能力已經提升，測得的召回率仍可能下降。

一些建議的提示用語：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

此提示可以在沒有實際第二步驟的情況下使用，但將信心過濾從發現步驟中移出通常會有幫助。如果您的框架有獨立的驗證、去重或排名階段，請明確告訴模型，它在發現階段的工作是涵蓋範圍而不是過濾。

如果您確實希望模型在單一流程中自我過濾，請具體說明標準在哪裡，而不是使用「重要」之類的定性詞彙：例如，「回報任何可能導致不正確行為、測試失敗或誤導性結果的錯誤；只省略純粹的風格或命名偏好等小問題。」

針對您的評估或測試案例的子集反覆調整提示，以驗證召回率或 F1 分數的提升。

電腦使用

電腦使用能力適用於各種解析度，最高解析度為 2576px / 3.75MP。內部電腦使用測試顯示，以 1080p 傳送影像可在效能和成本之間取得良好的平衡。

對於特別注重成本的工作負載，720p 或 1366×768 是成本較低且效能強勁的選項。請進行您自己的測試，以找到適合您使用案例的理想設定；嘗試不同的 effort 設定也有助於調整模型的行為。

Was this page helpful?

最佳實踐提示工程

為 Claude Opus 4.8 撰寫提示

Claude Opus 4.8 的行為差異與提示模式，涵蓋冗長度、effort 校準、工具使用、子代理程式與前端預設值。

回應長度與冗長度

如果您的產品依賴於特定風格或冗長度的輸出，您可能需要調整您的提示。舉例來說，若要降低冗長度，您可以加入：

Provide concise, focused responses. Skip non-essential context, and keep examples minimal.

校準 effort 與思考深度

max： Max effort 在某些使用案例中可以帶來效能提升，但隨著 token 使用量增加，可能會出現報酬遞減的情況。此設定有時也可能容易過度思考。請針對需要高智慧的任務測試 max effort。
xhigh： Extra high effort 是大多數程式編寫和代理式使用案例的最佳設定。
high： 此設定在 token 使用量和智慧之間取得平衡。對於大多數對智慧敏感的使用案例，請至少使用 high effort。
medium： 適合需要減少 token 使用量並以智慧作為代價的成本敏感使用案例。
low： 保留給簡短、範圍明確的任務，以及對延遲敏感但對智慧不敏感的工作負載。

This task involves multistep reasoning. Think carefully through the problem before responding.

對於此模型而言，effort 可能比任何先前的 Opus 都更重要，因此在升級時請積極地進行實驗。

Thinking adds latency and should only be used when it will meaningfully improve answer quality — typically for problems that require multistep reasoning. When in doubt, respond directly.

工具使用觸發

面向使用者的進度更新

更字面的指令遵循

語氣與寫作風格

例如，如果您的產品語調更溫暖或更口語化，請加入：

Use a warm, collaborative tone. Acknowledge the user's framing before answering.

控制子代理程式的產生

Do not spawn a subagent for work you can complete directly in a single response (e.g. refactoring a function you can already see).

Spawn multiple subagents in the same turn when fanning out across items or reading multiple files.

設計與前端預設值

1. 指定具體的替代方案。 模型會精確地遵循明確的規格：

Design a desktop landing page for a supplement brand called AEFRM.

The visual direction should come from a cold monochrome atmosphere using pale silver-gray tones that gradually deepen into blue-gray and near-black, similar to a misted metallic surface.

The page should feel sharp and controlled, with a strong sense of structure and restraint.

Use this tonal system across the full page instead of introducing bright accent colors.

Use the uploaded image on the hero design in black and white.

The layout should be built with clear horizontal sections and a centered max-width container. Use 4px corner radius consistently across cards, buttons, inputs, and media frames. Margins should feel generous, with enough empty space around each section so the page breathes.

Typography should use a square, angular sans-serif with wider letter spacing than usual, especially in headings and navigation, so the text feels more engineered and less compressed. Headline text can be large and uppercase, while supporting copy remains short and sparse. The sub texts should be written with Alumni Sans SC in 4-6px like tiny little texts on corners bottom centre like that.

For the structure, start with a hero section containing a strong product statement, one short supporting paragraph, and a clean product placeholder or packshot frame. Below that, add a benefit grid with three or four blocks, then a formulation or ingredients section, and finally a cta.

Buttons should be flat and precise, with subtle hover changes using transition: all 160ms ease out where brightness and border contrast shift slightly rather than using dramatic motion.

Color palette should stay within this range:
#E9ECEC, #C9D2D4, #8C9A9E, #44545B, #11171B.

Before building, propose 4 distinct visual directions tailored to this brief (each as: bg hex / accent hex / typeface — one-line rationale). Ask the user to pick one, then implement only that direction.

<frontend_aesthetics>
NEVER use generic AI-generated aesthetics like overused font families (Inter, Roboto, Arial, system fonts), cliched color schemes (particularly purple gradients on white or dark backgrounds), predictable layouts and component patterns, and cookie-cutter design that lacks context-specific character. Use unique fonts, cohesive colors and themes, and animations for effects and micro-interactions.
</frontend_aesthetics>

互動式程式編寫產品

程式碼審查框架

一些建議的提示用語：

Report every issue you find, including ones you are uncertain about or consider low-severity. Do not filter for importance or confidence at this stage - a separate verification step will do that. Your goal here is coverage: it is better to surface a finding that later gets filtered out than to silently drop a real bug. For each finding, include your confidence level and an estimated severity so a downstream filter can rank them.

針對您的評估或測試案例的子集反覆調整提示，以驗證召回率或 F1 分數的提升。

電腦使用

電腦使用能力適用於各種解析度，最高解析度為 2576px / 3.75MP。內部電腦使用測試顯示，以 1080p 傳送影像可在效能和成本之間取得良好的平衡。

Was this page helpful?

回應長度與冗長度

校準 effort 與思考深度

工具使用觸發

面向使用者的進度更新

更字面的指令遵循

語氣與寫作風格

控制子代理程式的產生

設計與前端預設值

互動式程式編寫產品

程式碼審查框架

電腦使用

回應長度與冗長度

校準 effort 與思考深度

工具使用觸發

面向使用者的進度更新

更字面的指令遵循

語氣與寫作風格

控制子代理程式的產生

設計與前端預設值

互動式程式編寫產品

程式碼審查框架

電腦使用

回應長度與冗長度

校準 effort 與思考深度

工具使用觸發

面向使用者的進度更新

更字面的指令遵循

語氣與寫作風格

控制子代理程式的產生

設計與前端預設值

互動式程式編寫產品

程式碼審查框架

電腦使用

回應長度與冗長度

校準 effort 與思考深度

工具使用觸發

面向使用者的進度更新

更字面的指令遵循

語氣與寫作風格

控制子代理程式的產生

設計與前端預設值

互動式程式編寫產品

程式碼審查框架

電腦使用