功能

PDF 支援

使用 Claude 處理 PDF。從您的文件中擷取文字、分析圖表並理解視覺內容。

您現在可以向 Claude 詢問您提供的 PDF 中的任何文字、圖片、圖表和表格。一些範例使用案例：

分析財務報告並理解圖表/表格
從法律文件中擷取關鍵資訊
文件翻譯協助
將文件資訊轉換為結構化格式

開始之前

檢查 PDF 要求

Claude 可處理任何標準 PDF。但是，使用 PDF 支援時，您應確保您的請求大小符合以下要求：

要求	限制
最大請求大小	32MB
每次請求最大頁數	100
格式	標準 PDF（無密碼/加密）

請注意，這兩個限制都是針對整個請求負載，包括與 PDF 一起發送的任何其他內容。

由於 PDF 支援依賴 Claude 的視覺功能，因此它受到與其他視覺任務相同的限制和注意事項。

支援的平台和模型

PDF 支援目前透過直接 API 存取和 Google Vertex AI 提供。所有現行模型都支援 PDF 處理。

PDF 支援現已在 Amazon Bedrock 上提供，但有以下注意事項：

Amazon Bedrock PDF 支援

透過 Amazon Bedrock 的 Converse API 使用 PDF 支援時，有兩種不同的文件處理模式：

重要：要在 Converse API 中存取 Claude 的完整視覺 PDF 理解功能，您必須啟用引用。未啟用引用時，API 會退回到僅基本文字擷取。了解更多關於使用引用的資訊。

文件處理模式

Converse 文件聊天（原始模式 - 僅文字擷取）
- 提供 PDF 的基本文字擷取
- 無法分析 PDF 中的圖片、圖表或視覺版面
- 3 頁 PDF 大約使用 1,000 個 token
- 未啟用引用時自動使用
Claude PDF 聊天（新模式 - 完整視覺理解）
- 提供 PDF 的完整視覺分析
- 可以理解和分析圖表、圖形、圖片和視覺版面
- 將每頁作為文字和圖片進行處理，以實現全面理解
- 3 頁 PDF 大約使用 7,000 個 token
- 需要在 Converse API 中啟用引用

主要限制

Converse API：視覺 PDF 分析需要啟用引用。目前沒有不使用引用而使用視覺分析的選項（與 InvokeModel API 不同）。
InvokeModel API：提供對 PDF 處理的完全控制，無需強制引用。

常見問題

如果客戶反映使用 Converse API 時 Claude 無法看到 PDF 中的圖片或圖表，他們可能需要啟用引用標誌。沒有啟用引用時，Converse 會退回到僅基本文字擷取。

這是 Converse API 的已知限制，我們正在努力解決。對於需要不使用引用的視覺 PDF 分析的應用程式，請考慮改用 InvokeModel API。

對於非 PDF 檔案，如 .csv、.xlsx、.docx、.md 或 .txt 檔案，請參閱使用其他檔案格式。

使用 Claude 處理 PDF

發送您的第一個 PDF 請求

讓我們從使用 Messages API 的簡單範例開始。您可以透過三種方式向 Claude 提供 PDF：

作為線上託管 PDF 的 URL 參考
作為 document 內容區塊中的 base64 編碼 PDF
透過 Files API 的 file_id

選項 1：基於 URL 的 PDF 文件

最簡單的方法是直接從 URL 參考 PDF：

 curl https://api.anthropic.com/v1/messages \
   -H "content-type: application/json" \
   -H "x-api-key: $ANTHROPIC_API_KEY" \
   -H "anthropic-version: 2023-06-01" \
   -d '{
     "model": "claude-opus-4-6",
     "max_tokens": 1024,
     "messages": [{
         "role": "user",
         "content": [{
             "type": "document",
             "source": {
                 "type": "url",
                 "url": "https://assets.anthropic.com/m/1cd9d098ac3e6467/original/Claude-3-Model-Card-October-Addendum.pdf"
             }
         },
         {
             "type": "text",
             "text": "What are the key findings in this document?"
         }]
     }]
 }'

選項 2：Base64 編碼的 PDF 文件

如果您需要從本機系統發送 PDF 或 URL 不可用時：

# Method 1: Fetch and encode a remote PDF
curl -s "https://assets.anthropic.com/m/1cd9d098ac3e6467/original/Claude-3-Model-Card-October-Addendum.pdf" | base64 | tr -d '\n' > pdf_base64.txt

# Method 2: Encode a local PDF file
# base64 document.pdf | tr -d '\n' > pdf_base64.txt

# Create a JSON request file using the pdf_base64.txt content
jq -n --rawfile PDF_BASE64 pdf_base64.txt '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{
        "role": "user",
        "content": [{
            "type": "document",
            "source": {
                "type": "base64",
                "media_type": "application/pdf",
                "data": $PDF_BASE64
            }
        },
        {
            "type": "text",
            "text": "What are the key findings in this document?"
        }]
    }]
}' > request.json

# Send the API request using the JSON file
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

選項 3：Files API

對於您會重複使用的 PDF，或當您想避免編碼開銷時，請使用 Files API：

# First, upload your PDF to the Files API
curl -X POST https://api.anthropic.com/v1/files \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "anthropic-beta: files-api-2025-04-14" \
  -F "[email protected]"

# Then use the returned file_id in your message
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "anthropic-beta: files-api-2025-04-14" \
  -d '{
    "model": "claude-opus-4-6", 
    "max_tokens": 1024,
    "messages": [{
      "role": "user",
      "content": [{
        "type": "document",
        "source": {
          "type": "file",
          "file_id": "file_abc123"
        }
      },
      {
        "type": "text",
        "text": "What are the key findings in this document?"
      }]
    }]
  }'

PDF 支援的運作方式

當您向 Claude 發送 PDF 時，會發生以下步驟：

系統擷取文件的內容。
- 系統將文件的每一頁轉換為圖片。
- 從每頁擷取文字，並與每頁的圖片一起提供。
Claude 分析文字和圖片，以更好地理解文件。
- 文件以文字和圖片的組合形式提供進行分析。
- 這允許使用者詢問 PDF 視覺元素的見解，例如圖表、圖形和其他非文字內容。
Claude 回應，如相關則參考 PDF 的內容。
Claude 在回應時可以參考文字和視覺內容。您可以透過將 PDF 支援與以下功能整合來進一步提升效能：
- 提示快取：提升重複分析的效能。
- 批次處理：用於大量文件處理。
- 工具使用：從文件中擷取特定資訊作為工具輸入。

估算您的成本

PDF 檔案的 token 數量取決於從文件中擷取的總文字量以及頁數：

文字 token 成本：每頁通常使用 1,500-3,000 個 token，取決於內容密度。適用標準 API 定價，無額外 PDF 費用。
圖片 token 成本：由於每頁都會轉換為圖片，因此適用相同的基於圖片的成本計算。

您可以使用 token 計算來估算您特定 PDF 的成本。

最佳化 PDF 處理

提升效能

遵循以下最佳實踐以獲得最佳結果：

在請求中將 PDF 放在文字之前
使用標準字型
確保文字清晰易讀
將頁面旋轉至正確的直立方向
在提示中使用邏輯頁碼（來自 PDF 檢視器）
需要時將大型 PDF 分割成區塊
啟用提示快取以進行重複分析

擴展您的實作

對於大量處理，請考慮以下方法：

使用提示快取

快取 PDF 以提高重複查詢的效能：

# 使用 pdf_base64.txt 內容建立 JSON 請求檔案
jq -n --rawfile PDF_BASE64 pdf_base64.txt '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{
        "role": "user",
        "content": [{
            "type": "document",
            "source": {
                "type": "base64",
                "media_type": "application/pdf",
                "data": $PDF_BASE64
            },
            "cache_control": {
              "type": "ephemeral"
            }
        },
        {
            "type": "text",
            "text": "Which model has the highest human preference win rates across each use-case?"
        }]
    }]
}' > request.json

# 然後使用 JSON 檔案進行 API 呼叫
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

批次處理文件

使用 Message Batches API 進行大量工作流程：

# 使用 pdf_base64.txt 內容建立 JSON 請求檔案
jq -n --rawfile PDF_BASE64 pdf_base64.txt '
{
  "requests": [
      {
          "custom_id": "my-first-request",
          "params": {
              "model": "claude-opus-4-6",
              "max_tokens": 1024,
              "messages": [
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "document",
                            "source": {
                                "type": "base64",
                                "media_type": "application/pdf",
                                "data": $PDF_BASE64
                            }
                        },
                        {
                            "type": "text",
                            "text": "Which model has the highest human preference win rates across each use-case?"
                        }
                    ]
                }
              ]
          }
      },
      {
          "custom_id": "my-second-request",
          "params": {
              "model": "claude-opus-4-6",
              "max_tokens": 1024,
              "messages": [
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "document",
                            "source": {
                                "type": "base64",
                                "media_type": "application/pdf",
                                "data": $PDF_BASE64
                            }
                        },
                        {
                            "type": "text",
                            "text": "Extract 5 key insights from this document."
                        }
                    ]
                }
              ]
          }
      }
  ]
}
' > request.json

# 然後使用 JSON 檔案進行 API 呼叫
curl https://api.anthropic.com/v1/messages/batches \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

後續步驟

嘗試 PDF 範例

在我們的 cookbook 食譜中探索 PDF 處理的實用範例。

查看 API 參考文件

查看 PDF 支援的完整 API 文件。

Was this page helpful?

功能

PDF 支援

使用 Claude 處理 PDF。從您的文件中擷取文字、分析圖表並理解視覺內容。

您現在可以向 Claude 詢問您提供的 PDF 中的任何文字、圖片、圖表和表格。一些範例使用案例：

分析財務報告並理解圖表/表格
從法律文件中擷取關鍵資訊
文件翻譯協助
將文件資訊轉換為結構化格式

開始之前

檢查 PDF 要求

Claude 可處理任何標準 PDF。但是，使用 PDF 支援時，您應確保您的請求大小符合以下要求：

要求	限制
最大請求大小	32MB
每次請求最大頁數	100
格式	標準 PDF（無密碼/加密）

請注意，這兩個限制都是針對整個請求負載，包括與 PDF 一起發送的任何其他內容。

由於 PDF 支援依賴 Claude 的視覺功能，因此它受到與其他視覺任務相同的限制和注意事項。

支援的平台和模型

PDF 支援目前透過直接 API 存取和 Google Vertex AI 提供。所有現行模型都支援 PDF 處理。

PDF 支援現已在 Amazon Bedrock 上提供，但有以下注意事項：

Amazon Bedrock PDF 支援

透過 Amazon Bedrock 的 Converse API 使用 PDF 支援時，有兩種不同的文件處理模式：

文件處理模式

Converse 文件聊天（原始模式 - 僅文字擷取）
- 提供 PDF 的基本文字擷取
- 無法分析 PDF 中的圖片、圖表或視覺版面
- 3 頁 PDF 大約使用 1,000 個 token
- 未啟用引用時自動使用
Claude PDF 聊天（新模式 - 完整視覺理解）
- 提供 PDF 的完整視覺分析
- 可以理解和分析圖表、圖形、圖片和視覺版面
- 將每頁作為文字和圖片進行處理，以實現全面理解
- 3 頁 PDF 大約使用 7,000 個 token
- 需要在 Converse API 中啟用引用

主要限制

Converse API：視覺 PDF 分析需要啟用引用。目前沒有不使用引用而使用視覺分析的選項（與 InvokeModel API 不同）。
InvokeModel API：提供對 PDF 處理的完全控制，無需強制引用。

常見問題

如果客戶反映使用 Converse API 時 Claude 無法看到 PDF 中的圖片或圖表，他們可能需要啟用引用標誌。沒有啟用引用時，Converse 會退回到僅基本文字擷取。

這是 Converse API 的已知限制，我們正在努力解決。對於需要不使用引用的視覺 PDF 分析的應用程式，請考慮改用 InvokeModel API。

對於非 PDF 檔案，如 .csv、.xlsx、.docx、.md 或 .txt 檔案，請參閱使用其他檔案格式。

使用 Claude 處理 PDF

發送您的第一個 PDF 請求

讓我們從使用 Messages API 的簡單範例開始。您可以透過三種方式向 Claude 提供 PDF：

作為線上託管 PDF 的 URL 參考
作為 document 內容區塊中的 base64 編碼 PDF
透過 Files API 的 file_id

選項 1：基於 URL 的 PDF 文件

最簡單的方法是直接從 URL 參考 PDF：

 curl https://api.anthropic.com/v1/messages \
   -H "content-type: application/json" \
   -H "x-api-key: $ANTHROPIC_API_KEY" \
   -H "anthropic-version: 2023-06-01" \
   -d '{
     "model": "claude-opus-4-6",
     "max_tokens": 1024,
     "messages": [{
         "role": "user",
         "content": [{
             "type": "document",
             "source": {
                 "type": "url",
                 "url": "https://assets.anthropic.com/m/1cd9d098ac3e6467/original/Claude-3-Model-Card-October-Addendum.pdf"
             }
         },
         {
             "type": "text",
             "text": "What are the key findings in this document?"
         }]
     }]
 }'

選項 2：Base64 編碼的 PDF 文件

如果您需要從本機系統發送 PDF 或 URL 不可用時：

# Method 1: Fetch and encode a remote PDF
curl -s "https://assets.anthropic.com/m/1cd9d098ac3e6467/original/Claude-3-Model-Card-October-Addendum.pdf" | base64 | tr -d '\n' > pdf_base64.txt

# Method 2: Encode a local PDF file
# base64 document.pdf | tr -d '\n' > pdf_base64.txt

# Create a JSON request file using the pdf_base64.txt content
jq -n --rawfile PDF_BASE64 pdf_base64.txt '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{
        "role": "user",
        "content": [{
            "type": "document",
            "source": {
                "type": "base64",
                "media_type": "application/pdf",
                "data": $PDF_BASE64
            }
        },
        {
            "type": "text",
            "text": "What are the key findings in this document?"
        }]
    }]
}' > request.json

# Send the API request using the JSON file
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

選項 3：Files API

對於您會重複使用的 PDF，或當您想避免編碼開銷時，請使用 Files API：

# First, upload your PDF to the Files API
curl -X POST https://api.anthropic.com/v1/files \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "anthropic-beta: files-api-2025-04-14" \
  -F "[email protected]"

# Then use the returned file_id in your message
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -H "anthropic-beta: files-api-2025-04-14" \
  -d '{
    "model": "claude-opus-4-6", 
    "max_tokens": 1024,
    "messages": [{
      "role": "user",
      "content": [{
        "type": "document",
        "source": {
          "type": "file",
          "file_id": "file_abc123"
        }
      },
      {
        "type": "text",
        "text": "What are the key findings in this document?"
      }]
    }]
  }'

PDF 支援的運作方式

當您向 Claude 發送 PDF 時，會發生以下步驟：

系統擷取文件的內容。
- 系統將文件的每一頁轉換為圖片。
- 從每頁擷取文字，並與每頁的圖片一起提供。
Claude 分析文字和圖片，以更好地理解文件。
- 文件以文字和圖片的組合形式提供進行分析。
- 這允許使用者詢問 PDF 視覺元素的見解，例如圖表、圖形和其他非文字內容。
Claude 回應，如相關則參考 PDF 的內容。
Claude 在回應時可以參考文字和視覺內容。您可以透過將 PDF 支援與以下功能整合來進一步提升效能：
- 提示快取：提升重複分析的效能。
- 批次處理：用於大量文件處理。
- 工具使用：從文件中擷取特定資訊作為工具輸入。

估算您的成本

PDF 檔案的 token 數量取決於從文件中擷取的總文字量以及頁數：

文字 token 成本：每頁通常使用 1,500-3,000 個 token，取決於內容密度。適用標準 API 定價，無額外 PDF 費用。
圖片 token 成本：由於每頁都會轉換為圖片，因此適用相同的基於圖片的成本計算。

您可以使用 token 計算來估算您特定 PDF 的成本。

最佳化 PDF 處理

提升效能

遵循以下最佳實踐以獲得最佳結果：

在請求中將 PDF 放在文字之前
使用標準字型
確保文字清晰易讀
將頁面旋轉至正確的直立方向
在提示中使用邏輯頁碼（來自 PDF 檢視器）
需要時將大型 PDF 分割成區塊
啟用提示快取以進行重複分析

擴展您的實作

對於大量處理，請考慮以下方法：

使用提示快取

快取 PDF 以提高重複查詢的效能：

# 使用 pdf_base64.txt 內容建立 JSON 請求檔案
jq -n --rawfile PDF_BASE64 pdf_base64.txt '{
    "model": "claude-opus-4-6",
    "max_tokens": 1024,
    "messages": [{
        "role": "user",
        "content": [{
            "type": "document",
            "source": {
                "type": "base64",
                "media_type": "application/pdf",
                "data": $PDF_BASE64
            },
            "cache_control": {
              "type": "ephemeral"
            }
        },
        {
            "type": "text",
            "text": "Which model has the highest human preference win rates across each use-case?"
        }]
    }]
}' > request.json

# 然後使用 JSON 檔案進行 API 呼叫
curl https://api.anthropic.com/v1/messages \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

批次處理文件

使用 Message Batches API 進行大量工作流程：

# 使用 pdf_base64.txt 內容建立 JSON 請求檔案
jq -n --rawfile PDF_BASE64 pdf_base64.txt '
{
  "requests": [
      {
          "custom_id": "my-first-request",
          "params": {
              "model": "claude-opus-4-6",
              "max_tokens": 1024,
              "messages": [
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "document",
                            "source": {
                                "type": "base64",
                                "media_type": "application/pdf",
                                "data": $PDF_BASE64
                            }
                        },
                        {
                            "type": "text",
                            "text": "Which model has the highest human preference win rates across each use-case?"
                        }
                    ]
                }
              ]
          }
      },
      {
          "custom_id": "my-second-request",
          "params": {
              "model": "claude-opus-4-6",
              "max_tokens": 1024,
              "messages": [
                {
                    "role": "user",
                    "content": [
                        {
                            "type": "document",
                            "source": {
                                "type": "base64",
                                "media_type": "application/pdf",
                                "data": $PDF_BASE64
                            }
                        },
                        {
                            "type": "text",
                            "text": "Extract 5 key insights from this document."
                        }
                    ]
                }
              ]
          }
      }
  ]
}
' > request.json

# 然後使用 JSON 檔案進行 API 呼叫
curl https://api.anthropic.com/v1/messages/batches \
  -H "content-type: application/json" \
  -H "x-api-key: $ANTHROPIC_API_KEY" \
  -H "anthropic-version: 2023-06-01" \
  -d @request.json

後續步驟

嘗試 PDF 範例

在我們的 cookbook 食譜中探索 PDF 處理的實用範例。

查看 API 參考文件

查看 PDF 支援的完整 API 文件。

Was this page helpful?