第三方平台上的 Claude

Vertex AI 上的 Claude

Anthropic 的 Claude 模型現已透過 Vertex AI 正式提供使用。

Vertex API 存取 Claude 的方式與 Messages API 幾乎相同，並支援所有相同的選項，但有兩個關鍵差異：

在 Vertex 中，model 不會在請求主體中傳遞。而是在 Google Cloud 端點 URL 中指定。
在 Vertex 中，anthropic_version 在請求主體中傳遞（而非作為標頭），且必須設定為 vertex-2023-10-16 這個值。

Vertex 也受到 Anthropic 官方用戶端 SDK 的支援。本指南將引導您完成在 Python 或 TypeScript 中向 Vertex AI 上的 Claude 發送請求的過程。

請注意，本指南假設您已經擁有一個能夠使用 Vertex AI 的 GCP 專案。請參閱使用 Anthropic 的 Claude 3 模型以獲取有關所需設定的更多資訊，以及完整的操作指南。

安裝用於存取 Vertex AI 的 SDK

首先，安裝 Anthropic 適用於您所選語言的用戶端 SDK。

pip install -U google-cloud-aiplatform "anthropic[vertex]"

存取 Vertex AI

模型可用性

請注意，Anthropic 模型的可用性因地區而異。在 Vertex AI Model Garden 中搜尋「Claude」或前往使用 Claude 3 以獲取最新資訊。

API 模型 ID

模型	Vertex AI API 模型 ID
Claude Opus 4.6	claude-opus-4-6
Claude Sonnet 4.5	claude-sonnet-4-5@20250929
Claude Sonnet 4	claude-sonnet-4@20250514
Claude Sonnet 3.7 ⚠️	claude-3-7-sonnet@20250219
Claude Opus 4.5	claude-opus-4-5@20251101
Claude Opus 4.1	claude-opus-4-1@20250805
Claude Opus 4	claude-opus-4@20250514
Claude Haiku 4.5	claude-haiku-4-5@20251001
Claude Haiku 3.5 ⚠️	claude-3-5-haiku@20241022
Claude Haiku 3	claude-3-haiku@20240307

發送請求

在執行請求之前，您可能需要執行 gcloud auth application-default login 以向 GCP 進行身份驗證。

以下範例展示如何在 Vertex AI 上從 Claude 生成文字：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "global"

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

請參閱我們的用戶端 SDK 和官方 Vertex AI 文件以獲取更多詳細資訊。

活動記錄

Vertex 提供請求-回應記錄服務，允許客戶記錄與您的使用相關的提示和完成內容。

Anthropic 建議您至少以 30 天滾動方式記錄您的活動，以便了解您的活動並調查任何潛在的濫用行為。

啟用此服務不會讓 Google 或 Anthropic 存取您的內容。

功能支援

您可以在此處找到 Vertex 目前支援的所有功能。

全域端點與區域端點

從 Claude Sonnet 4.5 及所有未來模型開始，Google Vertex AI 提供兩種端點類型：

全域端點：動態路由以實現最大可用性
區域端點：保證資料透過特定地理區域路由

區域端點相較於全域端點有 10% 的價格溢價。

這僅適用於 Claude Sonnet 4.5 及未來的模型。較舊的模型（Claude Sonnet 4、Opus 4 及更早版本）維持其現有的定價結構。

何時使用各選項

全域端點（建議使用）：

提供最大的可用性和正常運行時間
動態將請求路由到具有可用容量的區域
無價格溢價
最適合資料駐留要求靈活的應用程式
僅支援隨用隨付流量（預配置吞吐量需要區域端點）

區域端點：

透過特定地理區域路由流量
資料駐留和合規要求所必需
支援隨用隨付和預配置吞吐量
10% 的價格溢價反映了專用區域容量的基礎設施成本

實作

使用全域端點（建議使用）：

在初始化用戶端時，將 region 參數設定為 "global"：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "global"

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

使用區域端點：

指定特定區域，例如 "us-east1" 或 "europe-west1"：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "us-east1"  # Specify a specific region

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

其他資源

Google Vertex AI 定價： cloud.google.com/vertex-ai/generative-ai/pricing
Claude 模型文件： Vertex AI 上的 Claude
Google 部落格文章： Claude 模型的全域端點
Anthropic 定價詳情： 定價文件

Was this page helpful?

第三方平台上的 Claude

Vertex AI 上的 Claude

Anthropic 的 Claude 模型現已透過 Vertex AI 正式提供使用。

Vertex API 存取 Claude 的方式與 Messages API 幾乎相同，並支援所有相同的選項，但有兩個關鍵差異：

在 Vertex 中，model 不會在請求主體中傳遞。而是在 Google Cloud 端點 URL 中指定。
在 Vertex 中，anthropic_version 在請求主體中傳遞（而非作為標頭），且必須設定為 vertex-2023-10-16 這個值。

Vertex 也受到 Anthropic 官方用戶端 SDK 的支援。本指南將引導您完成在 Python 或 TypeScript 中向 Vertex AI 上的 Claude 發送請求的過程。

安裝用於存取 Vertex AI 的 SDK

首先，安裝 Anthropic 適用於您所選語言的用戶端 SDK。

pip install -U google-cloud-aiplatform "anthropic[vertex]"

存取 Vertex AI

模型可用性

請注意，Anthropic 模型的可用性因地區而異。在 Vertex AI Model Garden 中搜尋「Claude」或前往使用 Claude 3 以獲取最新資訊。

API 模型 ID

模型	Vertex AI API 模型 ID
Claude Opus 4.6	claude-opus-4-6
Claude Sonnet 4.5	claude-sonnet-4-5@20250929
Claude Sonnet 4	claude-sonnet-4@20250514
Claude Sonnet 3.7 ⚠️	claude-3-7-sonnet@20250219
Claude Opus 4.5	claude-opus-4-5@20251101
Claude Opus 4.1	claude-opus-4-1@20250805
Claude Opus 4	claude-opus-4@20250514
Claude Haiku 4.5	claude-haiku-4-5@20251001
Claude Haiku 3.5 ⚠️	claude-3-5-haiku@20241022
Claude Haiku 3	claude-3-haiku@20240307

發送請求

在執行請求之前，您可能需要執行 gcloud auth application-default login 以向 GCP 進行身份驗證。

以下範例展示如何在 Vertex AI 上從 Claude 生成文字：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "global"

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

請參閱我們的用戶端 SDK 和官方 Vertex AI 文件以獲取更多詳細資訊。

活動記錄

Vertex 提供請求-回應記錄服務，允許客戶記錄與您的使用相關的提示和完成內容。

Anthropic 建議您至少以 30 天滾動方式記錄您的活動，以便了解您的活動並調查任何潛在的濫用行為。

啟用此服務不會讓 Google 或 Anthropic 存取您的內容。

功能支援

您可以在此處找到 Vertex 目前支援的所有功能。

全域端點與區域端點

從 Claude Sonnet 4.5 及所有未來模型開始，Google Vertex AI 提供兩種端點類型：

全域端點：動態路由以實現最大可用性
區域端點：保證資料透過特定地理區域路由

區域端點相較於全域端點有 10% 的價格溢價。

這僅適用於 Claude Sonnet 4.5 及未來的模型。較舊的模型（Claude Sonnet 4、Opus 4 及更早版本）維持其現有的定價結構。

何時使用各選項

全域端點（建議使用）：

提供最大的可用性和正常運行時間
動態將請求路由到具有可用容量的區域
無價格溢價
最適合資料駐留要求靈活的應用程式
僅支援隨用隨付流量（預配置吞吐量需要區域端點）

區域端點：

透過特定地理區域路由流量
資料駐留和合規要求所必需
支援隨用隨付和預配置吞吐量
10% 的價格溢價反映了專用區域容量的基礎設施成本

實作

使用全域端點（建議使用）：

在初始化用戶端時，將 region 參數設定為 "global"：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "global"

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

使用區域端點：

指定特定區域，例如 "us-east1" 或 "europe-west1"：

from anthropic import AnthropicVertex

project_id = "MY_PROJECT_ID"
region = "us-east1"  # Specify a specific region

client = AnthropicVertex(project_id=project_id, region=region)

message = client.messages.create(
    model="claude-opus-4-6",
    max_tokens=100,
    messages=[
        {
            "role": "user",
            "content": "Hey Claude!",
        }
    ],
)
print(message)

其他資源

Google Vertex AI 定價： cloud.google.com/vertex-ai/generative-ai/pricing
Claude 模型文件： Vertex AI 上的 Claude
Google 部落格文章： Claude 模型的全域端點
Anthropic 定價詳情： 定價文件

Was this page helpful?