模型與定價

定價

了解 Anthropic 的模型和功能定價結構

本頁提供 Anthropic 模型和功能的詳細定價資訊。所有價格均以美元計算。

如需最新定價資訊，請造訪 claude.com/pricing。

模型定價

下表顯示所有 Claude 模型在不同使用層級的定價：

Model	Base Input Tokens	5m Cache Writes	1h Cache Writes	Cache Hits & Refreshes	Output Tokens
Claude Opus 4.6	$5 / MTok	$6.25 / MTok	$10 / MTok	$0.50 / MTok	$25 / MTok
Claude Opus 4.5	$5 / MTok	$6.25 / MTok	$10 / MTok	$0.50 / MTok	$25 / MTok
Claude Opus 4.1	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Opus 4	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Sonnet 4.5	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Sonnet 4	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Sonnet 3.7 (deprecated)	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Haiku 4.5	$1 / MTok	$1.25 / MTok	$2 / MTok	$0.10 / MTok	$5 / MTok
Claude Haiku 3.5	$0.80 / MTok	$1 / MTok	$1.6 / MTok	$0.08 / MTok	$4 / MTok
Claude Opus 3 (deprecated)	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Haiku 3	$0.25 / MTok	$0.30 / MTok	$0.50 / MTok	$0.03 / MTok	$1.25 / MTok

MTok = 百萬 token。「Base Input Tokens」欄位顯示標準輸入定價，「Cache Writes」和「Cache Hits」專用於提示快取，「Output Tokens」顯示輸出定價。提示快取提供 5 分鐘（預設）和 1 小時的快取持續時間，以針對不同使用情境最佳化成本。

上表反映了以下提示快取定價乘數：

5 分鐘快取寫入 token 為基本輸入 token 價格的 1.25 倍
1 小時快取寫入 token 為基本輸入 token 價格的 2 倍
快取讀取 token 為基本輸入 token 價格的 0.1 倍

第三方平台定價

Claude 模型可在 AWS Bedrock、Google Vertex AI 和 Microsoft Foundry 上使用。如需官方定價，請造訪：

Claude 4.5 模型及更新版本的區域端點定價

從 Claude Sonnet 4.5 和 Haiku 4.5 開始，AWS Bedrock 和 Google Vertex AI 提供兩種端點類型：

全球端點：跨區域動態路由以實現最大可用性
區域端點：保證資料在特定地理區域內路由

區域端點相較全球端點有 10% 的溢價。Claude API（1P）預設為全球路由，不受此變更影響。 Claude API 僅提供全球路由（等同於其他供應商的全球端點方案和定價）。

適用範圍：此定價結構適用於 Claude Sonnet 4.5、Haiku 4.5 及所有未來模型。較早的模型（Claude Sonnet 4、Opus 4 及先前版本）維持其現有定價。

如需實作詳情和程式碼範例：

功能特定定價

資料駐留定價

對於 Claude Opus 4.6 及更新的模型，透過 inference_geo 參數指定僅限美國推論會對所有 token 定價類別（包括輸入 token、輸出 token、快取寫入和快取讀取）產生 1.1 倍的乘數。全球路由（預設）使用標準定價。

此僅適用於 Claude API（1P）。第三方平台有其自己的區域定價——詳情請參閱 AWS Bedrock、Google Vertex AI 和 Microsoft Foundry。較早的模型無論 inference_geo 設定如何，均維持其現有定價。

如需更多資訊，請參閱我們的資料駐留文件。

批次處理

Batch API 允許非同步處理大量請求，輸入和輸出 token 均享有 50% 折扣。

Model	Batch input	Batch output
Claude Opus 4.6	$2.50 / MTok	$12.50 / MTok
Claude Opus 4.5	$2.50 / MTok	$12.50 / MTok
Claude Opus 4.1	$7.50 / MTok	$37.50 / MTok
Claude Opus 4	$7.50 / MTok	$37.50 / MTok
Claude Sonnet 4.5	$1.50 / MTok	$7.50 / MTok
Claude Sonnet 4	$1.50 / MTok	$7.50 / MTok
Claude Sonnet 3.7 (deprecated)	$1.50 / MTok	$7.50 / MTok
Claude Haiku 4.5	$0.50 / MTok	$2.50 / MTok
Claude Haiku 3.5	$0.40 / MTok	$2 / MTok
Claude Opus 3 (deprecated)	$7.50 / MTok	$37.50 / MTok
Claude Haiku 3	$0.125 / MTok	$0.625 / MTok

如需更多關於批次處理的資訊，請參閱我們的批次處理文件。

長上下文定價

使用 Claude Opus 4.6、Sonnet 4.5 或 Sonnet 4 並啟用 1M token 上下文視窗時，超過 200K 輸入 token 的請求將自動按照高級長上下文費率收費：

1M token 上下文視窗目前為使用層級 4 的組織和具有自訂速率限制的組織提供 beta 版本。1M token 上下文視窗僅適用於 Claude Opus 4.6、Sonnet 4.5 和 Sonnet 4。

模型	≤ 200K 輸入 token	> 200K 輸入 token
Claude Opus 4.6	輸入：$5 / MTok	輸入：$10 / MTok
	輸出：$25 / MTok	輸出：$37.50 / MTok
Claude Sonnet 4.5 / 4	輸入：$3 / MTok	輸入：$6 / MTok
	輸出：$15 / MTok	輸出：$22.50 / MTok

長上下文定價與其他定價修飾符疊加：

Batch API 50% 折扣適用於長上下文定價
提示快取乘數在長上下文定價之上適用
資料駐留 1.1 倍乘數在長上下文定價之上適用

即使啟用了 beta 標誌，少於 200K 輸入 token 的請求仍按標準費率收費。如果您的請求超過 200K 輸入 token，所有 token 均按高級定價收費。

200K 閾值僅基於輸入 token（包括快取讀取/寫入）。輸出 token 數量不影響定價層級選擇，但當超過輸入閾值時，輸出 token 將按較高費率收費。

要檢查您的 API 請求是否按 1M 上下文視窗費率收費，請檢查 API 回應中的 usage 物件：

{
  "usage": {
    "input_tokens": 250000,
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 0,
    "output_tokens": 500
  }
}

透過加總以下項目計算總輸入 token：

input_tokens
cache_creation_input_tokens（如果使用提示快取）
cache_read_input_tokens（如果使用提示快取）

如果總數超過 200,000 token，則整個請求按 1M 上下文費率計費。

如需更多關於 usage 物件的資訊，請參閱 API 回應文件。

工具使用定價

Tool use requests are priced based on:

The total number of input tokens sent to the model (including in the tools parameter)
The number of output tokens generated
For server-side tools, additional usage-based pricing (e.g., web search charges per search performed)

Client-side tools are priced the same as any other Claude API request, while server-side tools may incur additional charges based on their specific usage.

The additional tokens from tool use come from:

The tools parameter in API requests (tool names, descriptions, and schemas)
tool_use content blocks in API requests and responses
tool_result content blocks in API requests

When you use tools, we also automatically include a special system prompt for the model which enables tool use. The number of tool use tokens required for each model are listed below (excluding the additional tokens listed above). Note that the table assumes at least 1 tool is provided. If no tools are provided, then a tool choice of none uses 0 additional system prompt tokens.

Model	Tool choice	Tool use system prompt token count
Claude Opus 4.6	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4.1	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 4	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 3.7 (deprecated)	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Haiku 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Haiku 3.5	`auto`, `none` `any`, `tool`	264 tokens 340 tokens
Claude Opus 3 (deprecated)	`auto`, `none` `any`, `tool`	530 tokens 281 tokens
Claude Sonnet 3	`auto`, `none` `any`, `tool`	159 tokens 235 tokens
Claude Haiku 3	`auto`, `none` `any`, `tool`	264 tokens 340 tokens

These token counts are added to your normal input and output tokens to calculate the total cost of a request.

如需目前各模型的價格，請參閱上方的模型定價部分。

如需更多關於工具使用實作和最佳實踐的資訊，請參閱我們的工具使用文件。

特定工具定價

Bash 工具

The bash tool adds 245 input tokens to your API calls.

Additional tokens are consumed by:

Command outputs (stdout/stderr)
Error messages
Large file contents

完整定價詳情請參閱工具使用定價。

程式碼執行工具

Code execution tool usage is tracked separately from token usage. Execution time has a minimum of 5 minutes. If files are included in the request, execution time is billed even if the tool is not used due to files being preloaded onto the container.

Each organization receives 1,550 free hours of usage with the code execution tool per month. Additional usage beyond the first 1,550 hours is billed at $0.05 per hour, per container.

文字編輯器工具

The text editor tool uses the same pricing structure as other tools used with Claude. It follows the standard input and output token pricing based on the Claude model you're using.

In addition to the base tokens, the following additional input tokens are needed for the text editor tool:

Tool	Additional input tokens
`text_editor_20250429` (Claude 4.x)	700 tokens
`text_editor_20250124` (Claude Sonnet 3.7 (deprecated))	700 tokens

完整定價詳情請參閱工具使用定價。

網頁搜尋工具

Web search usage is charged in addition to token usage:

"usage": {
  "input_tokens": 105,
  "output_tokens": 6039,
  "cache_read_input_tokens": 7123,
  "cache_creation_input_tokens": 7345,
  "server_tool_use": {
    "web_search_requests": 1
  }
}

Web search is available on the Claude API for $10 per 1,000 searches, plus standard token costs for search-generated content. Web search results retrieved throughout a conversation are counted as input tokens, in search iterations executed during a single turn and in subsequent conversation turns.

Each web search counts as one use, regardless of the number of results returned. If an error occurs during web search, the web search will not be billed.

網頁擷取工具

Web fetch usage has no additional charges beyond standard token costs:

"usage": {
  "input_tokens": 25039,
  "output_tokens": 931,
  "cache_read_input_tokens": 0,
  "cache_creation_input_tokens": 0,
  "server_tool_use": {
    "web_fetch_requests": 1
  }
}

The web fetch tool is available on the Claude API at no additional cost. You only pay standard token costs for the fetched content that becomes part of your conversation context.

To protect against inadvertently fetching large content that would consume excessive tokens, use the max_content_tokens parameter to set appropriate limits based on your use case and budget considerations.

Example token usage for typical content:

Average web page (10KB): ~2,500 tokens
Large documentation page (100KB): ~25,000 tokens
Research paper PDF (500KB): ~125,000 tokens

電腦使用工具

Computer use follows the standard tool use pricing. When using the computer use tool:

System prompt overhead: The computer use beta adds 466-499 tokens to the system prompt

Computer use tool token usage:

Model	Input tokens per tool definition
Claude 4.x models	735 tokens
Claude Sonnet 3.7 (deprecated)	735 tokens

Additional token consumption:

Screenshot images (see Vision pricing)
Tool execution results returned to Claude

If you're also using bash or text editor tools alongside computer use, those tools have their own token costs as documented in their respective pages.

代理使用案例定價範例

在使用 Claude 建構時，了解代理應用程式的定價至關重要。這些實際範例可以幫助您估算不同代理模式的成本。

客戶支援代理範例

建構客戶支援代理時，成本可能如下分解：

處理 10,000 張支援工單的計算範例：

每次對話平均約 3,700 token
使用 Claude Opus 4.6，輸入 $5/MTok，輸出 $25/MTok
總成本：每 10,000 張工單約 $37.00

如需此計算的詳細說明，請參閱我們的客戶支援代理指南。

一般代理工作流程定價

對於具有多個步驟的更複雜代理架構：

初始請求處理
- 典型輸入：500-1,000 token
- 處理成本：每次請求約 $0.003
記憶和上下文檢索
- 檢索的上下文：2,000-5,000 token
- 每次檢索成本：每次操作約 $0.015
行動規劃和執行
- 規劃 token：1,000-2,000
- 執行回饋：500-1,000
- 合計成本：每次行動約 $0.045

如需代理定價模式的完整指南，請參閱我們的代理使用案例指南。

成本最佳化策略

使用 Claude 建構代理時：

使用適當的模型：簡單任務選擇 Haiku，複雜推理選擇 Sonnet
實作提示快取：降低重複上下文的成本
批次操作：對非時間敏感的任務使用 Batch API
監控使用模式：追蹤 token 消耗以識別最佳化機會

對於高流量代理應用程式，請考慮聯繫我們的企業銷售團隊以獲取自訂定價方案。

其他定價考量

速率限制

速率限制因使用層級而異，影響您可以發出的請求數量：

層級 1：入門級使用，具有基本限制
層級 2：為成長中的應用程式提供增加的限制
層級 3：為成熟應用程式提供更高的限制
層級 4：最大標準限制
企業：可提供自訂限制

如需詳細的速率限制資訊，請參閱我們的速率限制文件。

如需更高的速率限制或自訂定價方案，請聯繫我們的銷售團隊。

批量折扣

高流量使用者可能可以獲得批量折扣。這些折扣按個案協商。

標準層級使用上述定價
企業客戶可以聯繫銷售以獲取自訂定價
可能提供學術和研究折扣

企業定價

對於有特定需求的企業客戶：

自訂速率限制
批量折扣
專屬支援
自訂條款

請透過 [email protected] 或 Claude 控制台聯繫我們的銷售團隊，討論企業定價選項。

帳單和付款

帳單根據實際使用量按月計算
付款以美元處理
提供信用卡和發票選項
可在 Claude 控制台中追蹤使用量

常見問題

Token 使用量如何計算？

Token 是模型處理的文字片段。粗略估計，1 個 token 大約等於英文中的 4 個字元或 0.75 個單字。確切數量因語言和內容類型而異。

是否有免費層級或試用？

新使用者會收到少量免費額度來測試 API。如需企業評估的延長試用資訊，請聯繫銷售。

折扣如何疊加？

Batch API 和提示快取折扣可以合併使用。例如，同時使用這兩項功能相比標準 API 呼叫可以顯著節省成本。

接受哪些付款方式？

我們接受標準帳戶的主要信用卡。企業客戶可以安排發票和其他付款方式。

如有其他定價問題，請聯繫 [email protected]。

Was this page helpful?

模型與定價

定價

了解 Anthropic 的模型和功能定價結構

本頁提供 Anthropic 模型和功能的詳細定價資訊。所有價格均以美元計算。

如需最新定價資訊，請造訪 claude.com/pricing。

模型定價

下表顯示所有 Claude 模型在不同使用層級的定價：

Model	Base Input Tokens	5m Cache Writes	1h Cache Writes	Cache Hits & Refreshes	Output Tokens
Claude Opus 4.6	$5 / MTok	$6.25 / MTok	$10 / MTok	$0.50 / MTok	$25 / MTok
Claude Opus 4.5	$5 / MTok	$6.25 / MTok	$10 / MTok	$0.50 / MTok	$25 / MTok
Claude Opus 4.1	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Opus 4	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Sonnet 4.5	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Sonnet 4	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Sonnet 3.7 (deprecated)	$3 / MTok	$3.75 / MTok	$6 / MTok	$0.30 / MTok	$15 / MTok
Claude Haiku 4.5	$1 / MTok	$1.25 / MTok	$2 / MTok	$0.10 / MTok	$5 / MTok
Claude Haiku 3.5	$0.80 / MTok	$1 / MTok	$1.6 / MTok	$0.08 / MTok	$4 / MTok
Claude Opus 3 (deprecated)	$15 / MTok	$18.75 / MTok	$30 / MTok	$1.50 / MTok	$75 / MTok
Claude Haiku 3	$0.25 / MTok	$0.30 / MTok	$0.50 / MTok	$0.03 / MTok	$1.25 / MTok

上表反映了以下提示快取定價乘數：

5 分鐘快取寫入 token 為基本輸入 token 價格的 1.25 倍
1 小時快取寫入 token 為基本輸入 token 價格的 2 倍
快取讀取 token 為基本輸入 token 價格的 0.1 倍

第三方平台定價

Claude 模型可在 AWS Bedrock、Google Vertex AI 和 Microsoft Foundry 上使用。如需官方定價，請造訪：

Claude 4.5 模型及更新版本的區域端點定價

從 Claude Sonnet 4.5 和 Haiku 4.5 開始，AWS Bedrock 和 Google Vertex AI 提供兩種端點類型：

全球端點：跨區域動態路由以實現最大可用性
區域端點：保證資料在特定地理區域內路由

適用範圍：此定價結構適用於 Claude Sonnet 4.5、Haiku 4.5 及所有未來模型。較早的模型（Claude Sonnet 4、Opus 4 及先前版本）維持其現有定價。

如需實作詳情和程式碼範例：

功能特定定價

資料駐留定價

如需更多資訊，請參閱我們的資料駐留文件。

批次處理

Batch API 允許非同步處理大量請求，輸入和輸出 token 均享有 50% 折扣。

Model	Batch input	Batch output
Claude Opus 4.6	$2.50 / MTok	$12.50 / MTok
Claude Opus 4.5	$2.50 / MTok	$12.50 / MTok
Claude Opus 4.1	$7.50 / MTok	$37.50 / MTok
Claude Opus 4	$7.50 / MTok	$37.50 / MTok
Claude Sonnet 4.5	$1.50 / MTok	$7.50 / MTok
Claude Sonnet 4	$1.50 / MTok	$7.50 / MTok
Claude Sonnet 3.7 (deprecated)	$1.50 / MTok	$7.50 / MTok
Claude Haiku 4.5	$0.50 / MTok	$2.50 / MTok
Claude Haiku 3.5	$0.40 / MTok	$2 / MTok
Claude Opus 3 (deprecated)	$7.50 / MTok	$37.50 / MTok
Claude Haiku 3	$0.125 / MTok	$0.625 / MTok

如需更多關於批次處理的資訊，請參閱我們的批次處理文件。

長上下文定價

使用 Claude Opus 4.6、Sonnet 4.5 或 Sonnet 4 並啟用 1M token 上下文視窗時，超過 200K 輸入 token 的請求將自動按照高級長上下文費率收費：

1M token 上下文視窗目前為使用層級 4 的組織和具有自訂速率限制的組織提供 beta 版本。1M token 上下文視窗僅適用於 Claude Opus 4.6、Sonnet 4.5 和 Sonnet 4。

模型	≤ 200K 輸入 token	> 200K 輸入 token
Claude Opus 4.6	輸入：$5 / MTok	輸入：$10 / MTok
	輸出：$25 / MTok	輸出：$37.50 / MTok
Claude Sonnet 4.5 / 4	輸入：$3 / MTok	輸入：$6 / MTok
	輸出：$15 / MTok	輸出：$22.50 / MTok

長上下文定價與其他定價修飾符疊加：

Batch API 50% 折扣適用於長上下文定價
提示快取乘數在長上下文定價之上適用
資料駐留 1.1 倍乘數在長上下文定價之上適用

即使啟用了 beta 標誌，少於 200K 輸入 token 的請求仍按標準費率收費。如果您的請求超過 200K 輸入 token，所有 token 均按高級定價收費。

200K 閾值僅基於輸入 token（包括快取讀取/寫入）。輸出 token 數量不影響定價層級選擇，但當超過輸入閾值時，輸出 token 將按較高費率收費。

要檢查您的 API 請求是否按 1M 上下文視窗費率收費，請檢查 API 回應中的 usage 物件：

{
  "usage": {
    "input_tokens": 250000,
    "cache_creation_input_tokens": 0,
    "cache_read_input_tokens": 0,
    "output_tokens": 500
  }
}

透過加總以下項目計算總輸入 token：

input_tokens
cache_creation_input_tokens（如果使用提示快取）
cache_read_input_tokens（如果使用提示快取）

如果總數超過 200,000 token，則整個請求按 1M 上下文費率計費。

如需更多關於 usage 物件的資訊，請參閱 API 回應文件。

工具使用定價

Tool use requests are priced based on:

The total number of input tokens sent to the model (including in the tools parameter)
The number of output tokens generated
For server-side tools, additional usage-based pricing (e.g., web search charges per search performed)

Client-side tools are priced the same as any other Claude API request, while server-side tools may incur additional charges based on their specific usage.

The additional tokens from tool use come from:

The tools parameter in API requests (tool names, descriptions, and schemas)
tool_use content blocks in API requests and responses
tool_result content blocks in API requests

Model	Tool choice	Tool use system prompt token count
Claude Opus 4.6	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4.1	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Opus 4	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 4	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Sonnet 3.7 (deprecated)	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Haiku 4.5	`auto`, `none` `any`, `tool`	346 tokens 313 tokens
Claude Haiku 3.5	`auto`, `none` `any`, `tool`	264 tokens 340 tokens
Claude Opus 3 (deprecated)	`auto`, `none` `any`, `tool`	530 tokens 281 tokens
Claude Sonnet 3	`auto`, `none` `any`, `tool`	159 tokens 235 tokens
Claude Haiku 3	`auto`, `none` `any`, `tool`	264 tokens 340 tokens

These token counts are added to your normal input and output tokens to calculate the total cost of a request.

如需目前各模型的價格，請參閱上方的模型定價部分。

如需更多關於工具使用實作和最佳實踐的資訊，請參閱我們的工具使用文件。

特定工具定價

Bash 工具

The bash tool adds 245 input tokens to your API calls.

Additional tokens are consumed by:

Command outputs (stdout/stderr)
Error messages
Large file contents

完整定價詳情請參閱工具使用定價。

程式碼執行工具

Each organization receives 1,550 free hours of usage with the code execution tool per month. Additional usage beyond the first 1,550 hours is billed at $0.05 per hour, per container.

文字編輯器工具

The text editor tool uses the same pricing structure as other tools used with Claude. It follows the standard input and output token pricing based on the Claude model you're using.

In addition to the base tokens, the following additional input tokens are needed for the text editor tool:

Tool	Additional input tokens
`text_editor_20250429` (Claude 4.x)	700 tokens
`text_editor_20250124` (Claude Sonnet 3.7 (deprecated))	700 tokens

完整定價詳情請參閱工具使用定價。

網頁搜尋工具

Web search usage is charged in addition to token usage:

"usage": {
  "input_tokens": 105,
  "output_tokens": 6039,
  "cache_read_input_tokens": 7123,
  "cache_creation_input_tokens": 7345,
  "server_tool_use": {
    "web_search_requests": 1
  }
}

Each web search counts as one use, regardless of the number of results returned. If an error occurs during web search, the web search will not be billed.

網頁擷取工具

Web fetch usage has no additional charges beyond standard token costs:

"usage": {
  "input_tokens": 25039,
  "output_tokens": 931,
  "cache_read_input_tokens": 0,
  "cache_creation_input_tokens": 0,
  "server_tool_use": {
    "web_fetch_requests": 1
  }
}

The web fetch tool is available on the Claude API at no additional cost. You only pay standard token costs for the fetched content that becomes part of your conversation context.

Example token usage for typical content:

Average web page (10KB): ~2,500 tokens
Large documentation page (100KB): ~25,000 tokens
Research paper PDF (500KB): ~125,000 tokens

電腦使用工具

Computer use follows the standard tool use pricing. When using the computer use tool:

System prompt overhead: The computer use beta adds 466-499 tokens to the system prompt

Computer use tool token usage:

Model	Input tokens per tool definition
Claude 4.x models	735 tokens
Claude Sonnet 3.7 (deprecated)	735 tokens

Additional token consumption:

Screenshot images (see Vision pricing)
Tool execution results returned to Claude

If you're also using bash or text editor tools alongside computer use, those tools have their own token costs as documented in their respective pages.

代理使用案例定價範例

在使用 Claude 建構時，了解代理應用程式的定價至關重要。這些實際範例可以幫助您估算不同代理模式的成本。

客戶支援代理範例

建構客戶支援代理時，成本可能如下分解：

處理 10,000 張支援工單的計算範例：

每次對話平均約 3,700 token
使用 Claude Opus 4.6，輸入 $5/MTok，輸出 $25/MTok
總成本：每 10,000 張工單約 $37.00

如需此計算的詳細說明，請參閱我們的客戶支援代理指南。

一般代理工作流程定價

對於具有多個步驟的更複雜代理架構：

初始請求處理
- 典型輸入：500-1,000 token
- 處理成本：每次請求約 $0.003
記憶和上下文檢索
- 檢索的上下文：2,000-5,000 token
- 每次檢索成本：每次操作約 $0.015
行動規劃和執行
- 規劃 token：1,000-2,000
- 執行回饋：500-1,000
- 合計成本：每次行動約 $0.045

如需代理定價模式的完整指南，請參閱我們的代理使用案例指南。

成本最佳化策略

使用 Claude 建構代理時：

使用適當的模型：簡單任務選擇 Haiku，複雜推理選擇 Sonnet
實作提示快取：降低重複上下文的成本
批次操作：對非時間敏感的任務使用 Batch API
監控使用模式：追蹤 token 消耗以識別最佳化機會

對於高流量代理應用程式，請考慮聯繫我們的企業銷售團隊以獲取自訂定價方案。

其他定價考量

速率限制

速率限制因使用層級而異，影響您可以發出的請求數量：

層級 1：入門級使用，具有基本限制
層級 2：為成長中的應用程式提供增加的限制
層級 3：為成熟應用程式提供更高的限制
層級 4：最大標準限制
企業：可提供自訂限制

如需詳細的速率限制資訊，請參閱我們的速率限制文件。

如需更高的速率限制或自訂定價方案，請聯繫我們的銷售團隊。

批量折扣

高流量使用者可能可以獲得批量折扣。這些折扣按個案協商。

標準層級使用上述定價
企業客戶可以聯繫銷售以獲取自訂定價
可能提供學術和研究折扣

企業定價

對於有特定需求的企業客戶：

自訂速率限制
批量折扣
專屬支援
自訂條款

請透過 [email protected] 或 Claude 控制台聯繫我們的銷售團隊，討論企業定價選項。

帳單和付款

帳單根據實際使用量按月計算
付款以美元處理
提供信用卡和發票選項
可在 Claude 控制台中追蹤使用量

常見問題

Token 使用量如何計算？

Token 是模型處理的文字片段。粗略估計，1 個 token 大約等於英文中的 4 個字元或 0.75 個單字。確切數量因語言和內容類型而異。

是否有免費層級或試用？

新使用者會收到少量免費額度來測試 API。如需企業評估的延長試用資訊，請聯繫銷售。

折扣如何疊加？

Batch API 和提示快取折扣可以合併使用。例如，同時使用這兩項功能相比標準 API 呼叫可以顯著節省成本。

接受哪些付款方式？

我們接受標準帳戶的主要信用卡。企業客戶可以安排發票和其他付款方式。

如有其他定價問題，請聯繫 [email protected]。

Was this page helpful?