    Batches

    Create a Message Batch
    BetaMessageBatch Beta.Messages.Batches.Create(BatchCreateParams parameters, CancellationToken cancellationToken = default)
    POST /v1/messages/batches
    Retrieve a Message Batch
    BetaMessageBatch Beta.Messages.Batches.Retrieve(BatchRetrieveParams parameters, CancellationToken cancellationToken = default)
    GET /v1/messages/batches/{message_batch_id}
    List Message Batches
    BatchListPageResponse Beta.Messages.Batches.List(BatchListParams? parameters, CancellationToken cancellationToken = default)
    GET /v1/messages/batches
    Cancel a Message Batch
    BetaMessageBatch Beta.Messages.Batches.Cancel(BatchCancelParams parameters, CancellationToken cancellationToken = default)
    POST /v1/messages/batches/{message_batch_id}/cancel
    Delete a Message Batch
    BetaDeletedMessageBatch Beta.Messages.Batches.Delete(BatchDeleteParams parameters, CancellationToken cancellationToken = default)
    DELETE /v1/messages/batches/{message_batch_id}
    Retrieve Message Batch results
    BetaMessageBatchIndividualResponse Beta.Messages.Batches.ResultsStreaming(BatchResultsParams parameters, CancellationToken cancellationToken = default)
    GET /v1/messages/batches/{message_batch_id}/results
    Models
    class BetaDeletedMessageBatch:
    required string ID

    ID of the Message Batch.

    JsonElement Type "message_batch_deleted"constant

    Deleted object type.

    For Message Batches, this is always "message_batch_deleted".

    class BetaMessageBatch:
    required string ID

    Unique object identifier.

    The format and length of IDs may change over time.

    required DateTimeOffset? ArchivedAt

    RFC 3339 datetime string representing the time at which the Message Batch was archived and its results became unavailable.

    required DateTimeOffset? CancelInitiatedAt

    RFC 3339 datetime string representing the time at which cancellation was initiated for the Message Batch. Specified only if cancellation was initiated.

    required DateTimeOffset CreatedAt

    RFC 3339 datetime string representing the time at which the Message Batch was created.

    required DateTimeOffset? EndedAt

    RFC 3339 datetime string representing the time at which processing for the Message Batch ended. Specified only once processing ends.

    Processing ends when every request in a Message Batch has either succeeded, errored, canceled, or expired.

    format: date-time
    required DateTimeOffset ExpiresAt

    RFC 3339 datetime string representing the time at which the Message Batch will expire and end processing, which is 24 hours after creation.
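
The 24-hour rule above can be sketched as a small helper. This is only an illustration: `BatchExpiry` and its members are hypothetical names, not part of the SDK, and in practice you would read `ExpiresAt` from the batch object rather than recompute it.

```csharp
using System;

// Hypothetical helper, not part of the SDK: illustrates the documented
// relationship ExpiresAt = CreatedAt + 24 hours.
public static class BatchExpiry
{
    public static readonly TimeSpan Lifetime = TimeSpan.FromHours(24);

    // Expected expiry time for a batch created at `createdAt`.
    public static DateTimeOffset ExpectedExpiresAt(DateTimeOffset createdAt) =>
        createdAt + Lifetime;

    // Time left before expiry, clamped at zero once the batch has expired.
    public static TimeSpan TimeRemaining(DateTimeOffset expiresAt, DateTimeOffset now)
    {
        TimeSpan remaining = expiresAt - now;
        return remaining > TimeSpan.Zero ? remaining : TimeSpan.Zero;
    }
}
```

Calling `BatchExpiry.TimeRemaining(batch.ExpiresAt, DateTimeOffset.UtcNow)` would then give a rough upper bound on how long to keep polling, assuming `batch` is a retrieved `BetaMessageBatch`.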

    required ProcessingStatus ProcessingStatus

    Processing status of the Message Batch.

    Accepts one of the following:
    "in_progress"InProgress
    "canceling"Canceling
    "ended"Ended
    required BetaMessageBatchRequestCounts RequestCounts

    Tallies requests within the Message Batch, categorized by their status.

    Requests start as processing and move to one of the other statuses only once processing of the entire batch ends. The sum of all values always matches the total number of requests in the batch.

    required Long Canceled

    Number of requests in the Message Batch that have been canceled.

    This is zero until processing of the entire Message Batch has ended.

    required Long Errored

    Number of requests in the Message Batch that encountered an error.

    This is zero until processing of the entire Message Batch has ended.

    required Long Expired

    Number of requests in the Message Batch that have expired.

    This is zero until processing of the entire Message Batch has ended.

    required Long Processing

    Number of requests in the Message Batch that are processing.

    required Long Succeeded

    Number of requests in the Message Batch that have completed successfully.

    This is zero until processing of the entire Message Batch has ended.
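
Because the five counts always sum to the number of requests in the batch, a total can be derived from them. The sketch below uses a hypothetical `RequestCountsSketch` record that mirrors the fields above; it is not the SDK's `BetaMessageBatchRequestCounts` type.

```csharp
// Hypothetical stand-in for the RequestCounts fields described above.
public sealed record RequestCountsSketch(
    long Canceled, long Errored, long Expired, long Processing, long Succeeded)
{
    // The five counts always sum to the number of requests in the batch.
    public long Total => Canceled + Errored + Expired + Processing + Succeeded;

    // Requests that ended without a successful result.
    public long Unsuccessful => Canceled + Errored + Expired;
}
```

Since every count except `Processing` is zero until the whole batch has ended, `Unsuccessful` is only meaningful once `ProcessingStatus` is "ended".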

    required string? ResultsUrl

    URL to a .jsonl file containing the results of the Message Batch requests. Specified only once processing ends.

    Results in the file are not guaranteed to be in the same order as requests. Use the custom_id field to match results to requests.
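
Since results can arrive in any order, one approach is to index them by `custom_id` after download. The following is a minimal sketch using only `System.Text.Json`; the `BatchResults` class is hypothetical, and the `result` and `type` JSON field names are assumed to be the snake_case wire forms of the `Result` and `Type` properties documented on this page.

```csharp
using System;
using System.Collections.Generic;
using System.Text.Json;

// Hypothetical helper, not part of the SDK.
public static class BatchResults
{
    // Reads .jsonl text (one JSON object per line) and maps each custom_id
    // to the "type" of its result, e.g. "succeeded" or "errored".
    public static Dictionary<string, string> IndexByCustomId(string jsonl)
    {
        var byId = new Dictionary<string, string>();
        foreach (string rawLine in jsonl.Split('\n'))
        {
            string line = rawLine.Trim();
            if (line.Length == 0) continue; // skip blank lines

            using JsonDocument doc = JsonDocument.Parse(line);
            JsonElement root = doc.RootElement;
            string customId = root.GetProperty("custom_id").GetString()
                ?? throw new JsonException("custom_id is null");
            string resultType = root.GetProperty("result").GetProperty("type").GetString()
                ?? throw new JsonException("result.type is null");
            byId[customId] = resultType;
        }
        return byId;
    }
}
```

Looking up `index["my-request-1"]` then returns the outcome for that request regardless of where its line appeared in the file.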

    JsonElement Type "message_batch"constant

    Object type.

    For Message Batches, this is always "message_batch".

    class BetaMessageBatchCanceledResult:
    JsonElement Type "canceled"constant
    class BetaMessageBatchErroredResult:
    required BetaErrorResponse Error
    required BetaError Error
    Accepts one of the following:
    class BetaInvalidRequestError:
    required string Message
    JsonElement Type "invalid_request_error"constant
    class BetaAuthenticationError:
    required string Message
    JsonElement Type "authentication_error"constant
    class BetaBillingError:
    required string Message
    JsonElement Type "billing_error"constant
    class BetaPermissionError:
    required string Message
    JsonElement Type "permission_error"constant
    class BetaNotFoundError:
    required string Message
    JsonElement Type "not_found_error"constant
    class BetaRateLimitError:
    required string Message
    JsonElement Type "rate_limit_error"constant
    class BetaGatewayTimeoutError:
    required string Message
    JsonElement Type "timeout_error"constant
    class BetaApiError:
    required string Message
    JsonElement Type "api_error"constant
    class BetaOverloadedError:
    required string Message
    JsonElement Type "overloaded_error"constant
    required string? RequestID
    JsonElement Type "error"constant
    JsonElement Type "errored"constant
    class BetaMessageBatchExpiredResult:
    JsonElement Type "expired"constant
    class BetaMessageBatchIndividualResponse:

    This is a single line in the response .jsonl file and does not represent the response as a whole.

    required string CustomID

    Developer-provided ID created for each request in a Message Batch. Useful for matching results to requests, as results may be given out of request order.

    Must be unique for each request within the Message Batch.

    required BetaMessageBatchResult Result

    Processing result for this request.

    Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.

    Accepts one of the following:
    class BetaMessageBatchSucceededResult:
    required BetaMessage Message
    required string ID

    Unique object identifier.

    The format and length of IDs may change over time.

    required BetaContainer? Container

    Information about the container used in the request (for the code execution tool)

    required string ID

    Identifier for the container used in this request

    required DateTimeOffset ExpiresAt

    The time at which the container will expire.

    required IReadOnlyList<BetaSkill>? Skills

    Skills loaded in the container

    required string SkillID

    Skill ID

    required Type Type

    Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)

    Accepts one of the following:
    "anthropic"Anthropic
    "custom"Custom
    required string Version

    Skill version or 'latest' for most recent version

    required IReadOnlyList<BetaContentBlock> Content

    Content generated by the model.

    This is an array of content blocks, each of which has a type that determines its shape.

    Example:

    [{"type": "text", "text": "Hi, I'm Claude."}]
    

    If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.

    For example, if the input messages were:

    [
      {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
      {"role": "assistant", "content": "The best answer is ("}
    ]
    

    Then the response content might be:

    [{"type": "text", "text": "B)"}]
    
    Accepts one of the following:
    class BetaTextBlock:
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
    class BetaThinkingBlock:
    required string Signature
    required string Thinking
    JsonElement Type "thinking"constant
    class BetaRedactedThinkingBlock:
    required string Data
    JsonElement Type "redacted_thinking"constant
    class BetaToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name
    JsonElement Type "tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaServerToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required Name Name
    Accepts one of the following:
    "web_search"WebSearch
    "web_fetch"WebFetch
    "code_execution"CodeExecution
    "bash_code_execution"BashCodeExecution
    "text_editor_code_execution"TextEditorCodeExecution
    "tool_search_tool_regex"ToolSearchToolRegex
    "tool_search_tool_bm25"ToolSearchToolBm25
    JsonElement Type "server_tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaWebSearchToolResultBlock:
    required BetaWebSearchToolResultBlockContent Content
    Accepts one of the following:
    class BetaWebSearchToolResultError:
    required BetaWebSearchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "max_uses_exceeded"MaxUsesExceeded
    "too_many_requests"TooManyRequests
    "query_too_long"QueryTooLong
    "request_too_large"RequestTooLarge
    JsonElement Type "web_search_tool_result_error"constant
    IReadOnlyList<BetaWebSearchResultBlock>
    required string EncryptedContent
    required string? PageAge
    required string Title
    JsonElement Type "web_search_result"constant
    required string Url
    required string ToolUseID
    JsonElement Type "web_search_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaWebFetchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaWebFetchToolResultErrorBlock:
    required BetaWebFetchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "url_too_long"UrlTooLong
    "url_not_allowed"UrlNotAllowed
    "url_not_accessible"UrlNotAccessible
    "unsupported_content_type"UnsupportedContentType
    "too_many_requests"TooManyRequests
    "max_uses_exceeded"MaxUsesExceeded
    "unavailable"Unavailable
    JsonElement Type "web_fetch_tool_result_error"constant
    class BetaWebFetchBlock:
    required BetaDocumentBlock Content
    required BetaCitationConfig? Citations

    Citation configuration for the document

    required Boolean Enabled
    required Source Source
    Accepts one of the following:
    class BetaBase64PdfSource:
    required string Data
    JsonElement MediaType "application/pdf"constant
    JsonElement Type "base64"constant
    class BetaPlainTextSource:
    required string Data
    JsonElement MediaType "text/plain"constant
    JsonElement Type "text"constant
    required string? Title

    The title of the document

    JsonElement Type "document"constant
    required string? RetrievedAt

    ISO 8601 timestamp when the content was retrieved

    JsonElement Type "web_fetch_result"constant
    required string Url

    Fetched content URL

    required string ToolUseID
    JsonElement Type "web_fetch_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaCodeExecutionToolResultBlock:
    required BetaCodeExecutionToolResultBlockContent Content

    Code execution result with encrypted stdout for PFC + web_search results.

    Accepts one of the following:
    class BetaCodeExecutionToolResultError:
    required BetaCodeExecutionToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    JsonElement Type "code_execution_tool_result_error"constant
    class BetaCodeExecutionResultBlock:
    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "code_execution_result"constant
    class BetaEncryptedCodeExecutionResultBlock:

    Code execution result with encrypted stdout for PFC + web_search results.

    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required string EncryptedStdout
    required Long ReturnCode
    required string Stderr
    JsonElement Type "encrypted_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "code_execution_tool_result"constant
    class BetaBashCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaBashCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "output_file_too_large"OutputFileTooLarge
    JsonElement Type "bash_code_execution_tool_result_error"constant
    class BetaBashCodeExecutionResultBlock:
    required IReadOnlyList<BetaBashCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "bash_code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "bash_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "bash_code_execution_tool_result"constant
    class BetaTextEditorCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaTextEditorCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "file_not_found"FileNotFound
    required string? ErrorMessage
    JsonElement Type "text_editor_code_execution_tool_result_error"constant
    class BetaTextEditorCodeExecutionViewResultBlock:
    required string Content
    required FileType FileType
    Accepts one of the following:
    "text"Text
    "image"Image
    "pdf"Pdf
    required Long? NumLines
    required Long? StartLine
    required Long? TotalLines
    JsonElement Type "text_editor_code_execution_view_result"constant
    class BetaTextEditorCodeExecutionCreateResultBlock:
    required Boolean IsFileUpdate
    JsonElement Type "text_editor_code_execution_create_result"constant
    class BetaTextEditorCodeExecutionStrReplaceResultBlock:
    required IReadOnlyList<string>? Lines
    required Long? NewLines
    required Long? NewStart
    required Long? OldLines
    required Long? OldStart
    JsonElement Type "text_editor_code_execution_str_replace_result"constant
    required string ToolUseID
    JsonElement Type "text_editor_code_execution_tool_result"constant
    class BetaToolSearchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaToolSearchToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    required string? ErrorMessage
    JsonElement Type "tool_search_tool_result_error"constant
    class BetaToolSearchToolSearchResultBlock:
    required IReadOnlyList<BetaToolReferenceBlock> ToolReferences
    required string ToolName
    JsonElement Type "tool_reference"constant
    JsonElement Type "tool_search_tool_search_result"constant
    required string ToolUseID
    JsonElement Type "tool_search_tool_result"constant
    class BetaMcpToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name

    The name of the MCP tool

    required string ServerName

    The name of the MCP server

    JsonElement Type "mcp_tool_use"constant
    class BetaMcpToolResultBlock:
    required Content Content
    Accepts one of the following:
    string
    IReadOnlyList<BetaTextBlock>
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
    required Boolean IsError
    required string ToolUseID
    JsonElement Type "mcp_tool_result"constant
    class BetaContainerUploadBlock:

    Response model for a file uploaded to the container.

    required string FileID
    JsonElement Type "container_upload"constant
    class BetaCompactionBlock:

    A compaction block returned when autocompact is triggered.

    When content is null, the compaction failed to produce a valid summary (e.g., malformed output from the model). Clients may round-trip compaction blocks with null content; the server treats them as no-ops.

    required string? Content

    Summary of compacted content, or null if compaction failed

    JsonElement Type "compaction"constant
    required BetaContextManagementResponse? ContextManagement

    Context management response.

    Information about context management strategies applied during the request.

    required IReadOnlyList<AppliedEdit> AppliedEdits

    List of context management edits that were applied.

    Accepts one of the following:
    class BetaClearToolUses20250919EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedToolUses

    Number of tool uses that were cleared.

    JsonElement Type "clear_tool_uses_20250919"constant

    The type of context management edit applied.

    class BetaClearThinking20251015EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedThinkingTurns

    Number of thinking turns that were cleared.

    JsonElement Type "clear_thinking_20251015"constant

    The type of context management edit applied.

    required Model Model

    The model that will complete your prompt.

    See models for additional details and options.

    Accepts one of the following:
    "claude-opus-4-6"ClaudeOpus4_6

    Most intelligent model for building agents and coding

    "claude-sonnet-4-6"ClaudeSonnet4_6

    Frontier intelligence at scale — built for coding, agents, and enterprise workflows

    "claude-opus-4-5-20251101"ClaudeOpus4_5_20251101

    Premium model combining maximum intelligence with practical performance

    "claude-opus-4-5"ClaudeOpus4_5

    Premium model combining maximum intelligence with practical performance

    "claude-3-7-sonnet-latest"Claude3_7SonnetLatest

    High-performance model with early extended thinking

    "claude-3-7-sonnet-20250219"Claude3_7Sonnet20250219

    High-performance model with early extended thinking

    "claude-3-5-haiku-latest"Claude3_5HaikuLatest

    Fastest and most compact model for near-instant responsiveness

    "claude-3-5-haiku-20241022"Claude3_5Haiku20241022

    Our fastest model

    "claude-haiku-4-5"ClaudeHaiku4_5

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-haiku-4-5-20251001"ClaudeHaiku4_5_20251001

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-sonnet-4-20250514"ClaudeSonnet4_20250514

    High-performance model with extended thinking

    "claude-sonnet-4-0"ClaudeSonnet4_0

    High-performance model with extended thinking

    "claude-4-sonnet-20250514"Claude4Sonnet20250514

    High-performance model with extended thinking

    "claude-sonnet-4-5"ClaudeSonnet4_5

    Our best model for real-world agents and coding

    "claude-sonnet-4-5-20250929"ClaudeSonnet4_5_20250929

    Our best model for real-world agents and coding

    "claude-opus-4-0"ClaudeOpus4_0

    Our most capable model

    "claude-opus-4-20250514"ClaudeOpus4_20250514

    Our most capable model

    "claude-4-opus-20250514"Claude4Opus20250514

    Our most capable model

    "claude-opus-4-1-20250805"ClaudeOpus4_1_20250805

    Our most capable model

    "claude-3-opus-latest"Claude3OpusLatest

    Excels at writing and complex tasks

    "claude-3-opus-20240229"Claude_3_Opus_20240229

    Excels at writing and complex tasks

    "claude-3-haiku-20240307"Claude_3_Haiku_20240307

    Our previous fastest and most cost-effective model

    JsonElement Role "assistant"constant

    Conversational role of the generated message.

    This will always be "assistant".

    required BetaStopReason? StopReason

    The reason that we stopped.

    This may be one of the following values:

    • "end_turn": the model reached a natural stopping point
    • "max_tokens": we exceeded the requested max_tokens or the model's maximum
    • "stop_sequence": one of your provided custom stop_sequences was generated
    • "tool_use": the model invoked one or more tools
    • "pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
    • "refusal": when streaming classifiers intervene to handle potential policy violations

    In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.

    Accepts one of the following:
    "end_turn"EndTurn
    "max_tokens"MaxTokens
    "stop_sequence"StopSequence
    "tool_use"ToolUse
    "pause_turn"PauseTurn
    "compaction"Compaction
    "refusal"Refusal
    "model_context_window_exceeded"ModelContextWindowExceeded
    required string? StopSequence

    Which custom stop sequence was generated, if any.

    This value will be a non-null string if one of your custom stop sequences was generated.
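
One way to act on the stop reason values described above is a simple switch over the wire string. This is a sketch only: the `StopReasons` class and the returned labels are hypothetical, and values whose meaning is not described above ("compaction", "model_context_window_exceeded") deliberately fall through to the default branch.

```csharp
// Hypothetical helper, not part of the SDK. The returned labels are
// illustrative names for a follow-up action, not API values.
public static class StopReasons
{
    public static string NextStep(string stopReason) => stopReason switch
    {
        "end_turn" => "done",                 // natural stopping point
        "stop_sequence" => "done",            // a custom stop sequence was generated
        "max_tokens" => "truncated",          // output hit a token limit
        "tool_use" => "run_tools",            // the model invoked one or more tools
        "pause_turn" => "resend_to_continue", // send the response back as-is
        "refusal" => "refused",
        _ => "unknown",                       // anything not described above
    };
}
```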

    JsonElement Type "message"constant

    Object type.

    For Messages, this is always "message".

    required BetaUsage Usage

    Billing and rate-limit usage.

    Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.

    Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.

    For example, output_tokens will be non-zero, even for an empty string response from Claude.

    The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
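
The total described above can be computed as follows. `UsageMath` is a hypothetical helper, and treating a null cache count as zero is an assumption made for this sketch, since the two cache fields are nullable in the Usage model.

```csharp
// Hypothetical helper, not part of the SDK.
public static class UsageMath
{
    // Total input tokens = input_tokens + cache_creation_input_tokens
    //                      + cache_read_input_tokens.
    // Assumption: a null cache count contributes zero tokens.
    public static long TotalInputTokens(
        long inputTokens, long? cacheCreationInputTokens, long? cacheReadInputTokens) =>
        inputTokens + (cacheCreationInputTokens ?? 0) + (cacheReadInputTokens ?? 0);
}
```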

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long? CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long? CacheReadInputTokens

    The number of input tokens read from the cache.

    required string? InferenceGeo

    The geographic region where inference was performed for this request.

    required Long InputTokens

    The number of input tokens which were used.

    required IReadOnlyList<BetaIterationsUsageItems>? Iterations

    Per-iteration token usage breakdown.

    Each entry represents one sampling iteration, with its own input/output token counts and cache statistics. This allows you to:

    • Determine which iterations exceeded long context thresholds (>=200k tokens)
    • Calculate the true context window size from the last iteration
    • Understand token accumulation across server-side tool use loops
    Accepts one of the following:
    class BetaMessageIterationUsage:

    Token usage for a sampling iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "message"constant

    Usage for a sampling iteration

    class BetaCompactionIterationUsage:

    Token usage for a compaction iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "compaction"constant

    Usage for a compaction iteration
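
As a rough sketch of the first use case listed for Iterations, the helper below flags iterations at or above the 200k-token threshold. It assumes the threshold applies to an iteration's total input tokens (input plus cache creation plus cache read), which this page does not state explicitly. `IterationUsageSketch` and `IterationChecks` are hypothetical names, not SDK types.

```csharp
using System.Collections.Generic;

// Hypothetical stand-in for the per-iteration usage fields described above.
public sealed record IterationUsageSketch(
    long InputTokens, long CacheCreationInputTokens, long CacheReadInputTokens, long OutputTokens);

public static class IterationChecks
{
    public const long LongContextThreshold = 200_000;

    // Returns the zero-based indices of iterations whose total input tokens
    // reach the threshold. Assumption: total input is the sum of the
    // uncached, cache-creation, and cache-read input token counts.
    public static List<int> LongContextIterations(IReadOnlyList<IterationUsageSketch> iterations)
    {
        var flagged = new List<int>();
        for (int i = 0; i < iterations.Count; i++)
        {
            IterationUsageSketch it = iterations[i];
            long totalInput = it.InputTokens + it.CacheCreationInputTokens + it.CacheReadInputTokens;
            if (totalInput >= LongContextThreshold) flagged.Add(i);
        }
        return flagged;
    }
}
```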

    required Long OutputTokens

    The number of output tokens which were used.

    required BetaServerToolUsage? ServerToolUse

    The number of server tool requests.

    required Long WebFetchRequests

    The number of web fetch tool requests.

    required Long WebSearchRequests

    The number of web search tool requests.

    required ServiceTier? ServiceTier

    Whether the request used the priority, standard, or batch tier.

    Accepts one of the following:
    "standard"Standard
    "priority"Priority
    "batch"Batch
    required Speed? Speed

    The inference speed mode used for this request.

    Accepts one of the following:
    "standard"Standard
    "fast"Fast
    JsonElement Type "succeeded"constant
    class BetaMessageBatchErroredResult:
    required BetaErrorResponse Error
    required BetaError Error
    Accepts one of the following:
    class BetaInvalidRequestError:
    required string Message
    JsonElement Type "invalid_request_error"constant
    class BetaAuthenticationError:
    required string Message
    JsonElement Type "authentication_error"constant
    class BetaBillingError:
    required string Message
    JsonElement Type "billing_error"constant
    class BetaPermissionError:
    required string Message
    JsonElement Type "permission_error"constant
    class BetaNotFoundError:
    required string Message
    JsonElement Type "not_found_error"constant
    class BetaRateLimitError:
    required string Message
    JsonElement Type "rate_limit_error"constant
    class BetaGatewayTimeoutError:
    required string Message
    JsonElement Type "timeout_error"constant
    class BetaApiError:
    required string Message
    JsonElement Type "api_error"constant
    class BetaOverloadedError:
    required string Message
    JsonElement Type "overloaded_error"constant
    required string? RequestID
    JsonElement Type "error"constant
    JsonElement Type "errored"constant
    class BetaMessageBatchCanceledResult:
    JsonElement Type "canceled"constant
    class BetaMessageBatchExpiredResult:
    JsonElement Type "expired"constant
    class BetaMessageBatchRequestCounts:
    required Long Canceled

    Number of requests in the Message Batch that have been canceled.

    This is zero until processing of the entire Message Batch has ended.

    required Long Errored

    Number of requests in the Message Batch that encountered an error.

    This is zero until processing of the entire Message Batch has ended.

    required Long Expired

    Number of requests in the Message Batch that have expired.

    This is zero until processing of the entire Message Batch has ended.

    required Long Processing

    Number of requests in the Message Batch that are processing.

    required Long Succeeded

    Number of requests in the Message Batch that have completed successfully.

    This is zero until processing of the entire Message Batch has ended.
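As a minimal sketch of how these counters might be read, the Python snippet below works on the decoded JSON of a batch's request counts rather than on the C# SDK types. The snake_case key names and the helper `summarize_request_counts` are assumptions made for illustration, as is the idea that every request falls into exactly one of the five buckets.

```python
def summarize_request_counts(counts: dict) -> dict:
    """Summarize a request-counts object decoded from JSON.

    Assumes each request is counted in exactly one bucket, so the
    five counters add up to the size of the batch.
    """
    keys = ("canceled", "errored", "expired", "processing", "succeeded")
    total = sum(counts[k] for k in keys)
    return {
        "total": total,
        "finished": total - counts["processing"],
        # The terminal counters stay at zero until the whole batch has
        # ended, so "processing == 0" is the signal checked here.
        "all_done": counts["processing"] == 0,
    }


# Hypothetical snapshot of a batch that is still running.
in_progress = {
    "canceled": 0,
    "errored": 0,
    "expired": 0,
    "processing": 100,
    "succeeded": 0,
}
summary = summarize_request_counts(in_progress)
```

Because the terminal counters are only populated once the batch has ended, a running batch reports all of its requests under processing, as in the snapshot above.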

    class BetaMessageBatchResult: A class that can be one of several variants.union

    Processing result for this request.

    Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.

    class BetaMessageBatchSucceededResult:
    required BetaMessage Message
    required string ID

    Unique object identifier.

    The format and length of IDs may change over time.

    required BetaContainer? Container

    Information about the container used in the request (for the code execution tool)

    required string ID

    Identifier for the container used in this request

    required DateTimeOffset ExpiresAt

    The time at which the container will expire.

    required IReadOnlyList<BetaSkill>? Skills

    Skills loaded in the container

    required string SkillID

    Skill ID

    required Type Type

    Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)

    Accepts one of the following:
    "anthropic"Anthropic
    "custom"Custom
    required string Version

    Skill version or 'latest' for most recent version

    required IReadOnlyList<BetaContentBlock> Content

    Content generated by the model.

    This is an array of content blocks, each of which has a type that determines its shape.

    Example:

    [{"type": "text", "text": "Hi, I'm Claude."}]
    

    If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.

    For example, if the input messages were:

    [
      {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
      {"role": "assistant", "content": "The best answer is ("}
    ]
    

    Then the response content might be:

    [{"type": "text", "text": "B)"}]
    
    Accepts one of the following:
    class BetaTextBlock:
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
    class BetaThinkingBlock:
    required string Signature
    required string Thinking
    JsonElement Type "thinking"constant
    class BetaRedactedThinkingBlock:
    required string Data
    JsonElement Type "redacted_thinking"constant
    class BetaToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name
    JsonElement Type "tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaServerToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required Name Name
    Accepts one of the following:
    "web_search"WebSearch
    "web_fetch"WebFetch
    "code_execution"CodeExecution
    "bash_code_execution"BashCodeExecution
    "text_editor_code_execution"TextEditorCodeExecution
    "tool_search_tool_regex"ToolSearchToolRegex
    "tool_search_tool_bm25"ToolSearchToolBm25
    JsonElement Type "server_tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaWebSearchToolResultBlock:
    required BetaWebSearchToolResultBlockContent Content
    Accepts one of the following:
    class BetaWebSearchToolResultError:
    required BetaWebSearchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "max_uses_exceeded"MaxUsesExceeded
    "too_many_requests"TooManyRequests
    "query_too_long"QueryTooLong
    "request_too_large"RequestTooLarge
    JsonElement Type "web_search_tool_result_error"constant
    IReadOnlyList<BetaWebSearchResultBlock>
    required string EncryptedContent
    required string? PageAge
    required string Title
    JsonElement Type "web_search_result"constant
    required string Url
    required string ToolUseID
    JsonElement Type "web_search_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaWebFetchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaWebFetchToolResultErrorBlock:
    required BetaWebFetchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "url_too_long"UrlTooLong
    "url_not_allowed"UrlNotAllowed
    "url_not_accessible"UrlNotAccessible
    "unsupported_content_type"UnsupportedContentType
    "too_many_requests"TooManyRequests
    "max_uses_exceeded"MaxUsesExceeded
    "unavailable"Unavailable
    JsonElement Type "web_fetch_tool_result_error"constant
    class BetaWebFetchBlock:
    required BetaDocumentBlock Content
    required BetaCitationConfig? Citations

    Citation configuration for the document

    required Boolean Enabled
    required Source Source
    Accepts one of the following:
    class BetaBase64PdfSource:
    required string Data
    JsonElement MediaType "application/pdf"constant
    JsonElement Type "base64"constant
    class BetaPlainTextSource:
    required string Data
    JsonElement MediaType "text/plain"constant
    JsonElement Type "text"constant
    required string? Title

    The title of the document

    JsonElement Type "document"constant
    required string? RetrievedAt

    ISO 8601 timestamp when the content was retrieved

    JsonElement Type "web_fetch_result"constant
    required string Url

    Fetched content URL

    required string ToolUseID
    JsonElement Type "web_fetch_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaCodeExecutionToolResultBlock:
    required BetaCodeExecutionToolResultBlockContent Content

    Code execution result with encrypted stdout for PFC + web_search results.

    Accepts one of the following:
    class BetaCodeExecutionToolResultError:
    required BetaCodeExecutionToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    JsonElement Type "code_execution_tool_result_error"constant
    class BetaCodeExecutionResultBlock:
    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "code_execution_result"constant
    class BetaEncryptedCodeExecutionResultBlock:

    Code execution result with encrypted stdout for PFC + web_search results.

    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required string EncryptedStdout
    required Long ReturnCode
    required string Stderr
    JsonElement Type "encrypted_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "code_execution_tool_result"constant
    class BetaBashCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaBashCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "output_file_too_large"OutputFileTooLarge
    JsonElement Type "bash_code_execution_tool_result_error"constant
    class BetaBashCodeExecutionResultBlock:
    required IReadOnlyList<BetaBashCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "bash_code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "bash_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "bash_code_execution_tool_result"constant
    class BetaTextEditorCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaTextEditorCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "file_not_found"FileNotFound
    required string? ErrorMessage
    JsonElement Type "text_editor_code_execution_tool_result_error"constant
    class BetaTextEditorCodeExecutionViewResultBlock:
    required string Content
    required FileType FileType
    Accepts one of the following:
    "text"Text
    "image"Image
    "pdf"Pdf
    required Long? NumLines
    required Long? StartLine
    required Long? TotalLines
    JsonElement Type "text_editor_code_execution_view_result"constant
    class BetaTextEditorCodeExecutionCreateResultBlock:
    required Boolean IsFileUpdate
    JsonElement Type "text_editor_code_execution_create_result"constant
    class BetaTextEditorCodeExecutionStrReplaceResultBlock:
    required IReadOnlyList<string>? Lines
    required Long? NewLines
    required Long? NewStart
    required Long? OldLines
    required Long? OldStart
    JsonElement Type "text_editor_code_execution_str_replace_result"constant
    required string ToolUseID
    JsonElement Type "text_editor_code_execution_tool_result"constant
    class BetaToolSearchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaToolSearchToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    required string? ErrorMessage
    JsonElement Type "tool_search_tool_result_error"constant
    class BetaToolSearchToolSearchResultBlock:
    required IReadOnlyList<BetaToolReferenceBlock> ToolReferences
    required string ToolName
    JsonElement Type "tool_reference"constant
    JsonElement Type "tool_search_tool_search_result"constant
    required string ToolUseID
    JsonElement Type "tool_search_tool_result"constant
    class BetaMcpToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name

    The name of the MCP tool

    required string ServerName

    The name of the MCP server

    JsonElement Type "mcp_tool_use"constant
    class BetaMcpToolResultBlock:
    required Content Content
    Accepts one of the following:
    string
    IReadOnlyList<BetaTextBlock>
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
    required Boolean IsError
    required string ToolUseID
    JsonElement Type "mcp_tool_result"constant
    class BetaContainerUploadBlock:

    Response model for a file uploaded to the container.

    required string FileID
    JsonElement Type "container_upload"constant
    class BetaCompactionBlock:

    A compaction block returned when autocompact is triggered.

    When content is null, it indicates the compaction failed to produce a valid summary (e.g., malformed output from the model). Clients may round-trip compaction blocks with null content; the server treats them as no-ops.

    required string? Content

    Summary of compacted content, or null if compaction failed

    JsonElement Type "compaction"constant
    required BetaContextManagementResponse? ContextManagement

    Context management response.

    Information about context management strategies applied during the request.

    required IReadOnlyList<AppliedEdit> AppliedEdits

    List of context management edits that were applied.

    Accepts one of the following:
    class BetaClearToolUses20250919EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedToolUses

    Number of tool uses that were cleared.

    JsonElement Type "clear_tool_uses_20250919"constant

    The type of context management edit applied.

    class BetaClearThinking20251015EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedThinkingTurns

    Number of thinking turns that were cleared.

    JsonElement Type "clear_thinking_20251015"constant

    The type of context management edit applied.

    required Model Model

    The model that will complete your prompt.

    See models for additional details and options.

    Accepts one of the following:
    "claude-opus-4-6"ClaudeOpus4_6

    Most intelligent model for building agents and coding

    "claude-sonnet-4-6"ClaudeSonnet4_6

    Frontier intelligence at scale — built for coding, agents, and enterprise workflows

    "claude-opus-4-5-20251101"ClaudeOpus4_5_20251101

    Premium model combining maximum intelligence with practical performance

    "claude-opus-4-5"ClaudeOpus4_5

    Premium model combining maximum intelligence with practical performance

    "claude-3-7-sonnet-latest"Claude3_7SonnetLatest

    High-performance model with early extended thinking

    "claude-3-7-sonnet-20250219"Claude3_7Sonnet20250219

    High-performance model with early extended thinking

    "claude-3-5-haiku-latest"Claude3_5HaikuLatest

    Fastest and most compact model for near-instant responsiveness

    "claude-3-5-haiku-20241022"Claude3_5Haiku20241022

    Our fastest model

    "claude-haiku-4-5"ClaudeHaiku4_5

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-haiku-4-5-20251001"ClaudeHaiku4_5_20251001

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-sonnet-4-20250514"ClaudeSonnet4_20250514

    High-performance model with extended thinking

    "claude-sonnet-4-0"ClaudeSonnet4_0

    High-performance model with extended thinking

    "claude-4-sonnet-20250514"Claude4Sonnet20250514

    High-performance model with extended thinking

    "claude-sonnet-4-5"ClaudeSonnet4_5

    Our best model for real-world agents and coding

    "claude-sonnet-4-5-20250929"ClaudeSonnet4_5_20250929

    Our best model for real-world agents and coding

    "claude-opus-4-0"ClaudeOpus4_0

    Our most capable model

    "claude-opus-4-20250514"ClaudeOpus4_20250514

    Our most capable model

    "claude-4-opus-20250514"Claude4Opus20250514

    Our most capable model

    "claude-opus-4-1-20250805"ClaudeOpus4_1_20250805

    Our most capable model

    "claude-3-opus-latest"Claude3OpusLatest

    Excels at writing and complex tasks

    "claude-3-opus-20240229"Claude_3_Opus_20240229

    Excels at writing and complex tasks

    "claude-3-haiku-20240307"Claude_3_Haiku_20240307

    Our previous fastest and most cost-effective model

    JsonElement Role "assistant"constant

    Conversational role of the generated message.

    This will always be "assistant".

    required BetaStopReason? StopReason

    The reason that we stopped.

    This may be one of the following values:

    • "end_turn": the model reached a natural stopping point
    • "max_tokens": we exceeded the requested max_tokens or the model's maximum
    • "stop_sequence": one of your provided custom stop_sequences was generated
    • "tool_use": the model invoked one or more tools
    • "pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
    • "refusal": when streaming classifiers intervene to handle potential policy violations

    In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
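One way a client might branch on these values is sketched below in Python, using the raw stop reason string from the JSON response rather than the C# enum. The function name `next_action` and the action labels it returns are hypothetical; only the stop reason strings come from this reference.

```python
from typing import Optional


def next_action(stop_reason: Optional[str]) -> str:
    """Map a stop reason to a hypothetical client-side action label."""
    if stop_reason is None:
        # Only expected in streaming mode, in the message_start event.
        return "await_more_events"
    if stop_reason in ("end_turn", "stop_sequence"):
        return "finished"
    if stop_reason == "max_tokens":
        return "output_truncated"
    if stop_reason == "tool_use":
        return "run_tools"
    if stop_reason == "pause_turn":
        # The response can be sent back as-is so the model continues.
        return "resend_response"
    if stop_reason == "refusal":
        return "refused"
    # Values without a description above are surfaced, not guessed at.
    return "unhandled:" + stop_reason


actions = [next_action(r) for r in ("end_turn", "pause_turn", None)]
```

Keeping a fallback branch means that a stop reason the client does not yet know about is reported instead of being silently treated as success.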

    Accepts one of the following:
    "end_turn"EndTurn
    "max_tokens"MaxTokens
    "stop_sequence"StopSequence
    "tool_use"ToolUse
    "pause_turn"PauseTurn
    "compaction"Compaction
    "refusal"Refusal
    "model_context_window_exceeded"ModelContextWindowExceeded
    required string? StopSequence

    Which custom stop sequence was generated, if any.

    This value will be a non-null string if one of your custom stop sequences was generated.

    JsonElement Type "message"constant

    Object type.

    For Messages, this is always "message".

    required BetaUsage Usage

    Billing and rate-limit usage.

    Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.

    Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.

    For example, output_tokens will be non-zero, even for an empty string response from Claude.

    Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
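The summation described above can be sketched in Python against the decoded JSON usage object. The helper name `total_input_tokens` and the sample numbers are illustrative only. The two cache fields are nullable in this reference, so the sketch treats a missing or null value as zero.

```python
def total_input_tokens(usage: dict) -> int:
    """Sum the three input-side counters of a usage object."""
    return (
        usage["input_tokens"]
        # Nullable fields: fall back to 0 when absent or null.
        + (usage.get("cache_creation_input_tokens") or 0)
        + (usage.get("cache_read_input_tokens") or 0)
    )


# Hypothetical usage object, as it might appear once decoded from JSON.
sample_usage = {
    "input_tokens": 12,
    "cache_creation_input_tokens": 100,
    "cache_read_input_tokens": None,
    "output_tokens": 5,
}
total = total_input_tokens(sample_usage)
```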

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long? CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long? CacheReadInputTokens

    The number of input tokens read from the cache.

    required string? InferenceGeo

    The geographic region where inference was performed for this request.

    required Long InputTokens

    The number of input tokens which were used.

    required IReadOnlyList<BetaIterationsUsageItems>? Iterations

    Per-iteration token usage breakdown.

    Each entry represents one sampling iteration, with its own input/output token counts and cache statistics. This allows you to:

    • Determine which iterations exceeded long context thresholds (>=200k tokens)
    • Calculate the true context window size from the last iteration
    • Understand token accumulation across server-side tool use loops
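The first two uses listed above might look roughly like the following Python sketch, which reads the decoded JSON iterations list. It assumes that an iteration's input size is the same three-way sum used for the top-level usage object, and it leaves output tokens out of the context size. Both choices, along with the helper names, are assumptions rather than documented behavior.

```python
from typing import Optional

LONG_CONTEXT_THRESHOLD = 200_000  # the ">=200k tokens" threshold above


def iteration_input_tokens(iteration: dict) -> int:
    """Input-side tokens for one iteration (assumed three-way sum)."""
    return (
        iteration["input_tokens"]
        + iteration["cache_creation_input_tokens"]
        + iteration["cache_read_input_tokens"]
    )


def long_context_iterations(iterations: Optional[list]) -> list:
    """Indices of iterations at or above the long context threshold."""
    return [
        index
        for index, iteration in enumerate(iterations or [])
        if iteration_input_tokens(iteration) >= LONG_CONTEXT_THRESHOLD
    ]


def final_context_size(iterations: Optional[list]) -> Optional[int]:
    """Input-side size of the last iteration, or None if there is none."""
    if not iterations:
        return None
    return iteration_input_tokens(iterations[-1])


# Hypothetical two-iteration response.
sample_iterations = [
    {"type": "message", "input_tokens": 150_000,
     "cache_creation_input_tokens": 0, "cache_read_input_tokens": 0,
     "output_tokens": 800},
    {"type": "message", "input_tokens": 5_000,
     "cache_creation_input_tokens": 0, "cache_read_input_tokens": 198_000,
     "output_tokens": 400},
]
flagged = long_context_iterations(sample_iterations)
context_size = final_context_size(sample_iterations)
```

The iterations list is nullable, so both helpers accept null and return an empty result instead of raising.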
    Accepts one of the following:
    class BetaMessageIterationUsage:

    Token usage for a sampling iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "message"constant

    Usage for a sampling iteration

    class BetaCompactionIterationUsage:

    Token usage for a compaction iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "compaction"constant

    Usage for a compaction iteration

    required Long OutputTokens

    The number of output tokens which were used.

    required BetaServerToolUsage? ServerToolUse

    The number of server tool requests.

    required Long WebFetchRequests

    The number of web fetch tool requests.

    required Long WebSearchRequests

    The number of web search tool requests.

    required ServiceTier? ServiceTier

    Whether the request used the priority, standard, or batch tier.

    Accepts one of the following:
    "standard"Standard
    "priority"Priority
    "batch"Batch
    required Speed? Speed

    The inference speed mode used for this request.

    Accepts one of the following:
    "standard"Standard
    "fast"Fast
    JsonElement Type "succeeded"constant
    class BetaMessageBatchErroredResult:
    required BetaErrorResponse Error
    required BetaError Error
    Accepts one of the following:
    class BetaInvalidRequestError:
    required string Message
    JsonElement Type "invalid_request_error"constant
    class BetaAuthenticationError:
    required string Message
    JsonElement Type "authentication_error"constant
    class BetaBillingError:
    required string Message
    JsonElement Type "billing_error"constant
    class BetaPermissionError:
    required string Message
    JsonElement Type "permission_error"constant
    class BetaNotFoundError:
    required string Message
    JsonElement Type "not_found_error"constant
    class BetaRateLimitError:
    required string Message
    JsonElement Type "rate_limit_error"constant
    class BetaGatewayTimeoutError:
    required string Message
    JsonElement Type "timeout_error"constant
    class BetaApiError:
    required string Message
    JsonElement Type "api_error"constant
    class BetaOverloadedError:
    required string Message
    JsonElement Type "overloaded_error"constant
    required string? RequestID
    JsonElement Type "error"constant
    JsonElement Type "errored"constant
    class BetaMessageBatchCanceledResult:
    JsonElement Type "canceled"constant
    class BetaMessageBatchExpiredResult:
    JsonElement Type "expired"constant
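Since the four result variants are distinguished by their type constant, a consumer can dispatch on that value. The Python sketch below does so on the decoded JSON of a single result. It assumes snake_case key names matching the properties above, with the error detail nested one level down as the two Error properties suggest. The helper `describe_result` is hypothetical.

```python
def describe_result(result: dict) -> str:
    """Return a one-line description of a batch result variant."""
    kind = result["type"]
    if kind == "succeeded":
        # Concatenate the text blocks of the returned message.
        texts = [
            block["text"]
            for block in result["message"]["content"]
            if block["type"] == "text"
        ]
        return "succeeded: " + "".join(texts)
    if kind == "errored":
        # Outer object is the error response; inner one is the error.
        detail = result["error"]["error"]
        return "errored: {}: {}".format(detail["type"], detail["message"])
    if kind in ("canceled", "expired"):
        # These variants carry no payload beyond their type.
        return kind
    return "unknown result type: " + kind


# Hypothetical results, shaped after the variants listed above.
ok = describe_result({
    "type": "succeeded",
    "message": {"content": [{"type": "text", "text": "Hi, I'm Claude."}]},
})
failed = describe_result({
    "type": "errored",
    "error": {
        "type": "error",
        "request_id": None,
        "error": {"type": "invalid_request_error", "message": "bad input"},
    },
})
```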
    class BetaMessageBatchSucceededResult:
    required BetaMessage Message
    required string ID

    Unique object identifier.

    The format and length of IDs may change over time.

    required BetaContainer? Container

    Information about the container used in the request (for the code execution tool)

    required string ID

    Identifier for the container used in this request

    required DateTimeOffset ExpiresAt

    The time at which the container will expire.

    required IReadOnlyList<BetaSkill>? Skills

    Skills loaded in the container

    required string SkillID

    Skill ID

    required Type Type

    Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)

    Accepts one of the following:
    "anthropic"Anthropic
    "custom"Custom
    required string Version

    Skill version or 'latest' for most recent version

    required IReadOnlyList<BetaContentBlock> Content

    Content generated by the model.

    This is an array of content blocks, each of which has a type that determines its shape.

    Example:

    [{"type": "text", "text": "Hi, I'm Claude."}]
    

    If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.

    For example, if the input messages were:

    [
      {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
      {"role": "assistant", "content": "The best answer is ("}
    ]
    

    Then the response content might be:

    [{"type": "text", "text": "B)"}]
    
    Accepts one of the following:
    class BetaTextBlock:
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
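Since the citation shape depends on the kind of document cited, client code usually branches on the citation's type. The following is a rough Python sketch over the JSON wire format. `describe_citation` is a hypothetical helper, the snake_case field names are assumed to mirror the property names listed above, and the sample values are illustrative only.

```python
def describe_citation(citation):
    """Return a short location string for one citation, keyed on its type."""
    kind = citation["type"]
    if kind == "char_location":
        return f"chars {citation['start_char_index']}-{citation['end_char_index']}"
    if kind == "page_location":
        return f"pages {citation['start_page_number']}-{citation['end_page_number']}"
    if kind in ("content_block_location", "search_result_location"):
        return f"blocks {citation['start_block_index']}-{citation['end_block_index']}"
    if kind == "web_search_result_location":
        return citation["url"]
    raise ValueError(f"unknown citation type: {kind}")


# Illustrative citation, not real API output.
sample = {
    "type": "page_location",
    "cited_text": "example",
    "document_index": 0,
    "document_title": None,
    "file_id": None,
    "start_page_number": 3,
    "end_page_number": 4,
}
print(describe_citation(sample))
```

The sketch reports the start and end values as given and makes no claim about whether the end index is inclusive.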
    class BetaThinkingBlock:
    required string Signature
    required string Thinking
    JsonElement Type "thinking"constant
    class BetaRedactedThinkingBlock:
    required string Data
    JsonElement Type "redacted_thinking"constant
    class BetaToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name
    JsonElement Type "tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaServerToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required Name Name
    Accepts one of the following:
    "web_search"WebSearch
    "web_fetch"WebFetch
    "code_execution"CodeExecution
    "bash_code_execution"BashCodeExecution
    "text_editor_code_execution"TextEditorCodeExecution
    "tool_search_tool_regex"ToolSearchToolRegex
    "tool_search_tool_bm25"ToolSearchToolBm25
    JsonElement Type "server_tool_use"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaWebSearchToolResultBlock:
    required BetaWebSearchToolResultBlockContent Content
    Accepts one of the following:
    class BetaWebSearchToolResultError:
    required BetaWebSearchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "max_uses_exceeded"MaxUsesExceeded
    "too_many_requests"TooManyRequests
    "query_too_long"QueryTooLong
    "request_too_large"RequestTooLarge
    JsonElement Type "web_search_tool_result_error"constant
    IReadOnlyList<BetaWebSearchResultBlock>
    required string EncryptedContent
    required string? PageAge
    required string Title
    JsonElement Type "web_search_result"constant
    required string Url
    required string ToolUseID
    JsonElement Type "web_search_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
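The content of a web search tool result is either a single error object or a list of result objects, so a client has to check which one it received. A minimal Python sketch over the JSON wire format follows. `web_search_urls` is a hypothetical helper, the snake_case field names are assumed from the property names above, and the sample blocks are illustrative.

```python
def web_search_urls(block):
    """Return result URLs, or raise if the block carries an error object."""
    content = block["content"]
    if isinstance(content, dict):
        # A single object here is the web_search_tool_result_error shape.
        raise RuntimeError(f"web search failed: {content['error_code']}")
    # Otherwise content is a list of web_search_result objects.
    return [result["url"] for result in content]


ok_block = {
    "type": "web_search_tool_result",
    "tool_use_id": "example-id",
    "content": [
        {"type": "web_search_result", "title": "Example", "url": "https://example.com",
         "encrypted_content": "...", "page_age": None},
    ],
}
print(web_search_urls(ok_block))
```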
    class BetaWebFetchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaWebFetchToolResultErrorBlock:
    required BetaWebFetchToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "url_too_long"UrlTooLong
    "url_not_allowed"UrlNotAllowed
    "url_not_accessible"UrlNotAccessible
    "unsupported_content_type"UnsupportedContentType
    "too_many_requests"TooManyRequests
    "max_uses_exceeded"MaxUsesExceeded
    "unavailable"Unavailable
    JsonElement Type "web_fetch_tool_result_error"constant
    class BetaWebFetchBlock:
    required BetaDocumentBlock Content
    required BetaCitationConfig? Citations

    Citation configuration for the document

    required Boolean Enabled
    required Source Source
    Accepts one of the following:
    class BetaBase64PdfSource:
    required string Data
    JsonElement MediaType "application/pdf"constant
    JsonElement Type "base64"constant
    class BetaPlainTextSource:
    required string Data
    JsonElement MediaType "text/plain"constant
    JsonElement Type "text"constant
    required string? Title

    The title of the document

    JsonElement Type "document"constant
    required string? RetrievedAt

    ISO 8601 timestamp when the content was retrieved

    JsonElement Type "web_fetch_result"constant
    required string Url

    Fetched content URL

    required string ToolUseID
    JsonElement Type "web_fetch_tool_result"constant
    Caller Caller

    Tool invocation directly from the model.

    Accepts one of the following:
    class BetaDirectCaller:

    Tool invocation directly from the model.

    JsonElement Type "direct"constant
    class BetaServerToolCaller:

    Tool invocation generated by a server-side tool.

    required string ToolID
    JsonElement Type "code_execution_20250825"constant
    class BetaServerToolCaller20260120:
    required string ToolID
    JsonElement Type "code_execution_20260120"constant
    class BetaCodeExecutionToolResultBlock:
    required BetaCodeExecutionToolResultBlockContent Content

    Code execution result with encrypted stdout for PFC + web_search results.

    Accepts one of the following:
    class BetaCodeExecutionToolResultError:
    required BetaCodeExecutionToolResultErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    JsonElement Type "code_execution_tool_result_error"constant
    class BetaCodeExecutionResultBlock:
    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "code_execution_result"constant
    class BetaEncryptedCodeExecutionResultBlock:

    Code execution result with encrypted stdout for PFC + web_search results.

    required IReadOnlyList<BetaCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "code_execution_output"constant
    required string EncryptedStdout
    required Long ReturnCode
    required string Stderr
    JsonElement Type "encrypted_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "code_execution_tool_result"constant
    class BetaBashCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaBashCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "output_file_too_large"OutputFileTooLarge
    JsonElement Type "bash_code_execution_tool_result_error"constant
    class BetaBashCodeExecutionResultBlock:
    required IReadOnlyList<BetaBashCodeExecutionOutputBlock> Content
    required string FileID
    JsonElement Type "bash_code_execution_output"constant
    required Long ReturnCode
    required string Stderr
    required string Stdout
    JsonElement Type "bash_code_execution_result"constant
    required string ToolUseID
    JsonElement Type "bash_code_execution_tool_result"constant
    class BetaTextEditorCodeExecutionToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaTextEditorCodeExecutionToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    "file_not_found"FileNotFound
    required string? ErrorMessage
    JsonElement Type "text_editor_code_execution_tool_result_error"constant
    class BetaTextEditorCodeExecutionViewResultBlock:
    required string Content
    required FileType FileType
    Accepts one of the following:
    "text"Text
    "image"Image
    "pdf"Pdf
    required Long? NumLines
    required Long? StartLine
    required Long? TotalLines
    JsonElement Type "text_editor_code_execution_view_result"constant
    class BetaTextEditorCodeExecutionCreateResultBlock:
    required Boolean IsFileUpdate
    JsonElement Type "text_editor_code_execution_create_result"constant
    class BetaTextEditorCodeExecutionStrReplaceResultBlock:
    required IReadOnlyList<string>? Lines
    required Long? NewLines
    required Long? NewStart
    required Long? OldLines
    required Long? OldStart
    JsonElement Type "text_editor_code_execution_str_replace_result"constant
    required string ToolUseID
    JsonElement Type "text_editor_code_execution_tool_result"constant
    class BetaToolSearchToolResultBlock:
    required Content Content
    Accepts one of the following:
    class BetaToolSearchToolResultError:
    required ErrorCode ErrorCode
    Accepts one of the following:
    "invalid_tool_input"InvalidToolInput
    "unavailable"Unavailable
    "too_many_requests"TooManyRequests
    "execution_time_exceeded"ExecutionTimeExceeded
    required string? ErrorMessage
    JsonElement Type "tool_search_tool_result_error"constant
    class BetaToolSearchToolSearchResultBlock:
    required IReadOnlyList<BetaToolReferenceBlock> ToolReferences
    required string ToolName
    JsonElement Type "tool_reference"constant
    JsonElement Type "tool_search_tool_search_result"constant
    required string ToolUseID
    JsonElement Type "tool_search_tool_result"constant
    class BetaMcpToolUseBlock:
    required string ID
    required IReadOnlyDictionary<string, JsonElement> Input
    required string Name

    The name of the MCP tool

    required string ServerName

    The name of the MCP server

    JsonElement Type "mcp_tool_use"constant
    class BetaMcpToolResultBlock:
    required Content Content
    Accepts one of the following:
    string
    IReadOnlyList<BetaTextBlock>
    required IReadOnlyList<BetaTextCitation>? Citations

    Citations supporting the text block.

    The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.

    Accepts one of the following:
    class BetaCitationCharLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndCharIndex
    required string? FileID
    required Long StartCharIndex
    JsonElement Type "char_location"constant
    class BetaCitationPageLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndPageNumber
    required string? FileID
    required Long StartPageNumber
    JsonElement Type "page_location"constant
    class BetaCitationContentBlockLocation:
    required string CitedText
    required Long DocumentIndex
    required string? DocumentTitle
    required Long EndBlockIndex
    required string? FileID
    required Long StartBlockIndex
    JsonElement Type "content_block_location"constant
    class BetaCitationsWebSearchResultLocation:
    required string CitedText
    required string EncryptedIndex
    required string? Title
    JsonElement Type "web_search_result_location"constant
    required string Url
    class BetaCitationSearchResultLocation:
    required string CitedText
    required Long EndBlockIndex
    required Long SearchResultIndex
    required string Source
    required Long StartBlockIndex
    required string? Title
    JsonElement Type "search_result_location"constant
    required string Text
    JsonElement Type "text"constant
    required Boolean IsError
    required string ToolUseID
    JsonElement Type "mcp_tool_result"constant
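An MCP tool result's content is either a plain string or a list of text blocks, so it can help to normalise both forms to one string. The Python sketch below works on the JSON wire format. `mcp_result_text` is a hypothetical helper, the snake_case field names are assumed from the property names above, and the sample values are illustrative.

```python
def mcp_result_text(block):
    """Return (is_error, text) for an mcp_tool_result block."""
    content = block["content"]
    if isinstance(content, str):
        text = content
    else:
        # A list of text blocks: join their text in order.
        text = "".join(part["text"] for part in content)
    return block["is_error"], text


as_string = {"type": "mcp_tool_result", "tool_use_id": "a", "is_error": False, "content": "42"}
as_blocks = {
    "type": "mcp_tool_result",
    "tool_use_id": "b",
    "is_error": True,
    "content": [{"type": "text", "text": "not ", "citations": None},
                {"type": "text", "text": "found", "citations": None}],
}
print(mcp_result_text(as_string))
print(mcp_result_text(as_blocks))
```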
    class BetaContainerUploadBlock:

    Response model for a file uploaded to the container.

    required string FileID
    JsonElement Type "container_upload"constant
    class BetaCompactionBlock:

    A compaction block returned when autocompact is triggered.

    When content is null, compaction failed to produce a valid summary (e.g., malformed output from the model). Clients may round-trip compaction blocks with null content; the server treats them as no-ops.

    required string? Content

    Summary of compacted content, or null if compaction failed

    JsonElement Type "compaction"constant
    required BetaContextManagementResponse? ContextManagement

    Context management response.

    Information about context management strategies applied during the request.

    required IReadOnlyList<AppliedEdit> AppliedEdits

    List of context management edits that were applied.

    Accepts one of the following:
    class BetaClearToolUses20250919EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedToolUses

    Number of tool uses that were cleared.

    JsonElement Type "clear_tool_uses_20250919"constant

    The type of context management edit applied.

    class BetaClearThinking20251015EditResponse:
    required Long ClearedInputTokens

    Number of input tokens cleared by this edit.

    required Long ClearedThinkingTurns

    Number of thinking turns that were cleared.

    JsonElement Type "clear_thinking_20251015"constant

    The type of context management edit applied.
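To see how much context the applied edits freed, the per-edit counters can be summed. This is a minimal Python sketch over the JSON wire format. `summarize_applied_edits` is a hypothetical helper, the snake_case field names are assumed from the property names above, and the numbers are illustrative.

```python
def summarize_applied_edits(context_management):
    """Total the cleared input tokens and count edits by type."""
    if context_management is None:  # the field is nullable
        return 0, {}
    total = 0
    by_type = {}
    for edit in context_management["applied_edits"]:
        total += edit["cleared_input_tokens"]
        by_type[edit["type"]] = by_type.get(edit["type"], 0) + 1
    return total, by_type


sample_cm = {
    "applied_edits": [
        {"type": "clear_tool_uses_20250919", "cleared_input_tokens": 1200, "cleared_tool_uses": 3},
        {"type": "clear_thinking_20251015", "cleared_input_tokens": 800, "cleared_thinking_turns": 2},
    ]
}
print(summarize_applied_edits(sample_cm))
```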

    required Model Model

    The model that will complete your prompt.

    See models for additional details and options.

    Accepts one of the following:
    "claude-opus-4-6"ClaudeOpus4_6

    Most intelligent model for building agents and coding

    "claude-sonnet-4-6"ClaudeSonnet4_6

    Frontier intelligence at scale — built for coding, agents, and enterprise workflows

    "claude-opus-4-5-20251101"ClaudeOpus4_5_20251101

    Premium model combining maximum intelligence with practical performance

    "claude-opus-4-5"ClaudeOpus4_5

    Premium model combining maximum intelligence with practical performance

    "claude-3-7-sonnet-latest"Claude3_7SonnetLatest

    High-performance model with early extended thinking

    "claude-3-7-sonnet-20250219"Claude3_7Sonnet20250219

    High-performance model with early extended thinking

    "claude-3-5-haiku-latest"Claude3_5HaikuLatest

    Fastest and most compact model for near-instant responsiveness

    "claude-3-5-haiku-20241022"Claude3_5Haiku20241022

    Our fastest model

    "claude-haiku-4-5"ClaudeHaiku4_5

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-haiku-4-5-20251001"ClaudeHaiku4_5_20251001

    Hybrid model, capable of near-instant responses and extended thinking

    "claude-sonnet-4-20250514"ClaudeSonnet4_20250514

    High-performance model with extended thinking

    "claude-sonnet-4-0"ClaudeSonnet4_0

    High-performance model with extended thinking

    "claude-4-sonnet-20250514"Claude4Sonnet20250514

    High-performance model with extended thinking

    "claude-sonnet-4-5"ClaudeSonnet4_5

    Our best model for real-world agents and coding

    "claude-sonnet-4-5-20250929"ClaudeSonnet4_5_20250929

    Our best model for real-world agents and coding

    "claude-opus-4-0"ClaudeOpus4_0

    Our most capable model

    "claude-opus-4-20250514"ClaudeOpus4_20250514

    Our most capable model

    "claude-4-opus-20250514"Claude4Opus20250514

    Our most capable model

    "claude-opus-4-1-20250805"ClaudeOpus4_1_20250805

    Our most capable model

    "claude-3-opus-latest"Claude3OpusLatest

    Excels at writing and complex tasks

    "claude-3-opus-20240229"Claude_3_Opus_20240229

    Excels at writing and complex tasks

    "claude-3-haiku-20240307"Claude_3_Haiku_20240307

    Our previous fastest and most cost-effective model

    JsonElement Role "assistant"constant

    Conversational role of the generated message.

    This will always be "assistant".

    required BetaStopReason? StopReason

    The reason that we stopped.

    This may be one of the following values:

    • "end_turn": the model reached a natural stopping point
    • "max_tokens": we exceeded the requested max_tokens or the model's maximum
    • "stop_sequence": one of your provided custom stop_sequences was generated
    • "tool_use": the model invoked one or more tools
    • "pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
    • "refusal": when streaming classifiers intervene to handle potential policy violations

    In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.

    Accepts one of the following:
    "end_turn"EndTurn
    "max_tokens"MaxTokens
    "stop_sequence"StopSequence
    "tool_use"ToolUse
    "pause_turn"PauseTurn
    "compaction"Compaction
    "refusal"Refusal
    "model_context_window_exceeded"ModelContextWindowExceeded
    required string? StopSequence

    Which custom stop sequence was generated, if any.

    This value will be a non-null string if one of your custom stop sequences was generated.
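A client loop typically branches on the stop reason to decide what to do next. The Python sketch below works on the JSON wire format. `next_action` and the action labels it returns are hypothetical, and the `stop_reason` and `stop_sequence` field names are assumed from the property names above. Values that this section lists without describing ("compaction", "model_context_window_exceeded") are left unhandled rather than guessed at.

```python
def next_action(message):
    """Map a message's stop reason to a hypothetical client action label."""
    reason = message["stop_reason"]
    if reason is None:
        return "wait"  # null only in the streaming message_start event
    if reason == "end_turn":
        return "done"
    if reason == "max_tokens":
        return "truncated"
    if reason == "stop_sequence":
        return f"stopped_on:{message['stop_sequence']}"
    if reason == "tool_use":
        return "run_tools"
    if reason == "pause_turn":
        return "resend_as_is"  # send the response back unchanged to continue
    if reason == "refusal":
        return "refused"
    return "unhandled"


print(next_action({"stop_reason": "stop_sequence", "stop_sequence": "END"}))
```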

    JsonElement Type "message"constant

    Object type.

    For Messages, this is always "message".

    required BetaUsage Usage

    Billing and rate-limit usage.

    Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.

    Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.

    For example, output_tokens will be non-zero, even for an empty string response from Claude.

    The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
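The total described above can be computed directly from a usage object. This is a small Python sketch over the JSON wire format. `total_input_tokens` is a hypothetical helper and the numbers are illustrative. The two cache counters are nullable, so the sketch treats null as zero.

```python
def total_input_tokens(usage):
    """Sum the three input-token counters, treating null cache counters as 0."""
    return (
        usage["input_tokens"]
        + (usage.get("cache_creation_input_tokens") or 0)
        + (usage.get("cache_read_input_tokens") or 0)
    )


sample_usage = {
    "input_tokens": 10,
    "cache_creation_input_tokens": 100,
    "cache_read_input_tokens": None,
    "output_tokens": 25,
}
print(total_input_tokens(sample_usage))  # prints: 110
```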

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long? CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long? CacheReadInputTokens

    The number of input tokens read from the cache.

    required string? InferenceGeo

    The geographic region where inference was performed for this request.

    required Long InputTokens

    The number of input tokens which were used.

    required IReadOnlyList<BetaIterationsUsageItems>? Iterations

    Per-iteration token usage breakdown.

    Each entry represents one sampling iteration, with its own input/output token counts and cache statistics. This allows you to:

    • Determine which iterations exceeded long context thresholds (>=200k tokens)
    • Calculate the true context window size from the last iteration
    • Understand token accumulation across server-side tool use loops
    Accepts one of the following:
    class BetaMessageIterationUsage:

    Token usage for a sampling iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "message"constant

    Usage for a sampling iteration

    class BetaCompactionIterationUsage:

    Token usage for a compaction iteration.

    required BetaCacheCreation? CacheCreation

    Breakdown of cached tokens by TTL

    required Long Ephemeral1hInputTokens

    The number of input tokens used to create the 1 hour cache entry.

    required Long Ephemeral5mInputTokens

    The number of input tokens used to create the 5 minute cache entry.

    required Long CacheCreationInputTokens

    The number of input tokens used to create the cache entry.

    required Long CacheReadInputTokens

    The number of input tokens read from the cache.

    required Long InputTokens

    The number of input tokens which were used.

    required Long OutputTokens

    The number of output tokens which were used.

    JsonElement Type "compaction"constant

    Usage for a compaction iteration
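The per-iteration breakdown can be used to flag long-context iterations and to estimate the final context size. The Python sketch below works on the JSON wire format and rests on two labelled assumptions: that the 200k threshold is compared against an iteration's total input tokens (uncached plus cache creation plus cache read), and that the context size is the last iteration's total input plus its output tokens. All function names are hypothetical and the numbers are illustrative.

```python
LONG_CONTEXT_THRESHOLD = 200_000  # the ">=200k tokens" threshold mentioned above


def iteration_input_total(iteration):
    """Total input tokens for one iteration (assumed to include cached tokens)."""
    return (
        iteration["input_tokens"]
        + iteration["cache_creation_input_tokens"]
        + iteration["cache_read_input_tokens"]
    )


def long_context_iterations(iterations):
    """Indices of iterations whose total input meets the threshold."""
    return [
        index
        for index, iteration in enumerate(iterations or [])
        if iteration_input_total(iteration) >= LONG_CONTEXT_THRESHOLD
    ]


def final_context_size(iterations):
    """Assumed context size: last iteration's input total plus its output tokens."""
    if not iterations:  # the field is nullable
        return None
    last = iterations[-1]
    return iteration_input_total(last) + last["output_tokens"]


sample_iterations = [
    {"type": "message", "input_tokens": 150_000, "cache_creation_input_tokens": 0,
     "cache_read_input_tokens": 60_000, "output_tokens": 500, "cache_creation": None},
    {"type": "compaction", "input_tokens": 1_000, "cache_creation_input_tokens": 0,
     "cache_read_input_tokens": 0, "output_tokens": 200, "cache_creation": None},
]
print(long_context_iterations(sample_iterations), final_context_size(sample_iterations))
```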

    required Long OutputTokens

    The number of output tokens which were used.

    required BetaServerToolUsage? ServerToolUse

    The number of server tool requests.

    required Long WebFetchRequests

    The number of web fetch tool requests.

    required Long WebSearchRequests

    The number of web search tool requests.

    required ServiceTier? ServiceTier

    Which service tier the request used: priority, standard, or batch.

    Accepts one of the following:
    "standard"Standard
    "priority"Priority
    "batch"Batch
    required Speed? Speed

    The inference speed mode used for this request.

    Accepts one of the following:
    "standard"Standard
    "fast"Fast
    JsonElement Type "succeeded"constant
