Batches
Cancel a Message Batch
Create a Message Batch
Delete a Message Batch
List Message Batches
Retrieve Message Batch results
Retrieve a Message Batch
ModelsExpand Collapse
class BetaDeletedMessageBatch:
ID of the Message Batch.
JsonValue; type "message_batch_deleted"constant"message_batch_deleted"constantDeleted object type.
For Message Batches, this is always "message_batch_deleted".
Deleted object type.
For Message Batches, this is always "message_batch_deleted".
class BetaMessageBatch:
Unique object identifier.
The format and length of IDs may change over time.
RFC 3339 datetime string representing the time at which the Message Batch was archived and its results became unavailable.
RFC 3339 datetime string representing the time at which cancellation was initiated for the Message Batch. Specified only if cancellation was initiated.
RFC 3339 datetime string representing the time at which the Message Batch was created.
RFC 3339 datetime string representing the time at which processing for the Message Batch ended. Specified only once processing ends.
Processing ends when every request in a Message Batch has either succeeded, errored, canceled, or expired.
RFC 3339 datetime string representing the time at which the Message Batch will expire and end processing, which is 24 hours after creation.
ProcessingStatus processingStatusProcessing status of the Message Batch.
Processing status of the Message Batch.
BetaMessageBatchRequestCounts requestCountsTallies requests within the Message Batch, categorized by their status.
Requests start as processing and move to one of the other statuses only once processing of the entire batch ends. The sum of all values always matches the total number of requests in the batch.
Tallies requests within the Message Batch, categorized by their status.
Requests start as processing and move to one of the other statuses only once processing of the entire batch ends. The sum of all values always matches the total number of requests in the batch.
Number of requests in the Message Batch that have been canceled.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that encountered an error.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that have expired.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that are processing.
Number of requests in the Message Batch that have completed successfully.
This is zero until processing of the entire Message Batch has ended.
URL to a .jsonl file containing the results of the Message Batch requests. Specified only once processing ends.
Results in the file are not guaranteed to be in the same order as requests. Use the custom_id field to match results to requests.
JsonValue; type "message_batch"constant"message_batch"constantObject type.
For Message Batches, this is always "message_batch".
Object type.
For Message Batches, this is always "message_batch".
class BetaMessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class BetaMessageBatchErroredResult:
BetaErrorResponse error
BetaError error
class BetaInvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class BetaAuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BetaBillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class BetaPermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class BetaNotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class BetaRateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class BetaGatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class BetaApiError:
JsonValue; type "api_error"constant"api_error"constant
class BetaOverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class BetaMessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
class BetaMessageBatchIndividualResponse:This is a single line in the response .jsonl file and does not represent the response as a whole.
This is a single line in the response .jsonl file and does not represent the response as a whole.
Developer-provided ID created for each request in a Message Batch. Useful for matching results to requests, as results may be given out of request order.
Must be unique for each request within the Message Batch.
BetaMessageBatchResult resultProcessing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
Processing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
class BetaMessageBatchSucceededResult:
BetaMessage message
Unique object identifier.
The format and length of IDs may change over time.
Information about the container used in the request (for the code execution tool)
Information about the container used in the request (for the code execution tool)
Identifier for the container used in this request
The time at which the container will expire.
Skills loaded in the container
Skills loaded in the container
Skill ID
Type typeType of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Skill version or 'latest' for most recent version
List<BetaContentBlock> contentContent generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
class BetaTextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class BetaThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class BetaRedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class BetaToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class BetaServerToolUseBlock:
Name name
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class BetaWebSearchToolResultBlock:
class BetaWebSearchToolResultError:
BetaWebSearchToolResultErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<BetaWebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
class BetaWebFetchToolResultBlock:
Content content
class BetaWebFetchToolResultErrorBlock:
BetaWebFetchToolResultErrorCode errorCode
JsonValue; type "web_fetch_tool_result_error"constant"web_fetch_tool_result_error"constant
class BetaWebFetchBlock:
BetaDocumentBlock content
Citation configuration for the document
Citation configuration for the document
Source source
class BetaBase64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class BetaPlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
The title of the document
JsonValue; type "document"constant"document"constant
ISO 8601 timestamp when the content was retrieved
JsonValue; type "web_fetch_result"constant"web_fetch_result"constant
Fetched content URL
JsonValue; type "web_fetch_tool_result"constant"web_fetch_tool_result"constant
class BetaCodeExecutionToolResultBlock:
class BetaCodeExecutionToolResultError:
BetaCodeExecutionToolResultErrorCode errorCode
JsonValue; type "code_execution_tool_result_error"constant"code_execution_tool_result_error"constant
class BetaCodeExecutionResultBlock:
List<BetaCodeExecutionOutputBlock> content
JsonValue; type "code_execution_output"constant"code_execution_output"constant
JsonValue; type "code_execution_result"constant"code_execution_result"constant
JsonValue; type "code_execution_tool_result"constant"code_execution_tool_result"constant
class BetaBashCodeExecutionToolResultBlock:
Content content
class BetaBashCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "bash_code_execution_tool_result_error"constant"bash_code_execution_tool_result_error"constant
class BetaBashCodeExecutionResultBlock:
List<BetaBashCodeExecutionOutputBlock> content
JsonValue; type "bash_code_execution_output"constant"bash_code_execution_output"constant
JsonValue; type "bash_code_execution_result"constant"bash_code_execution_result"constant
JsonValue; type "bash_code_execution_tool_result"constant"bash_code_execution_tool_result"constant
class BetaTextEditorCodeExecutionToolResultBlock:
Content content
class BetaTextEditorCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "text_editor_code_execution_tool_result_error"constant"text_editor_code_execution_tool_result_error"constant
class BetaTextEditorCodeExecutionViewResultBlock:
FileType fileType
JsonValue; type "text_editor_code_execution_view_result"constant"text_editor_code_execution_view_result"constant
class BetaTextEditorCodeExecutionCreateResultBlock:
JsonValue; type "text_editor_code_execution_create_result"constant"text_editor_code_execution_create_result"constant
class BetaTextEditorCodeExecutionStrReplaceResultBlock:
JsonValue; type "text_editor_code_execution_str_replace_result"constant"text_editor_code_execution_str_replace_result"constant
JsonValue; type "text_editor_code_execution_tool_result"constant"text_editor_code_execution_tool_result"constant
class BetaMcpToolUseBlock:
The name of the MCP tool
The name of the MCP server
JsonValue; type "mcp_tool_use"constant"mcp_tool_use"constant
class BetaMcpToolResultBlock:
Content content
List<BetaTextBlock>
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
JsonValue; type "mcp_tool_result"constant"mcp_tool_result"constant
class BetaContainerUploadBlock:Response model for a file uploaded to the container.
Response model for a file uploaded to the container.
JsonValue; type "container_upload"constant"container_upload"constant
Context management response.
Information about context management strategies applied during the request.
Context management response.
Information about context management strategies applied during the request.
List<AppliedEdit> appliedEditsList of context management edits that were applied.
List of context management edits that were applied.
class BetaClearToolUses20250919EditResponse:
Number of input tokens cleared by this edit.
Number of tool uses that were cleared.
JsonValue; type "clear_tool_uses_20250919"constant"clear_tool_uses_20250919"constantThe type of context management edit applied.
The type of context management edit applied.
class BetaClearThinking20251015EditResponse:
Number of input tokens cleared by this edit.
Number of thinking turns that were cleared.
JsonValue; type "clear_thinking_20251015"constant"clear_thinking_20251015"constantThe type of context management edit applied.
The type of context management edit applied.
Model modelThe model that will complete your prompt.
See models for additional details and options.
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous most fast and cost-effective
JsonValue; role "assistant"constant"assistant"constantConversational role of the generated message.
This will always be "assistant".
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point"max_tokens": we exceeded the requestedmax_tokensor the model's maximum"stop_sequence": one of your provided customstop_sequenceswas generated"tool_use": the model invoked one or more tools"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue."refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constantObject type.
For Messages, this is always "message".
Object type.
For Messages, this is always "message".
BetaUsage usageBilling and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of server tool requests.
The number of web fetch tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTierIf the request used the priority, standard, or batch tier.
If the request used the priority, standard, or batch tier.
JsonValue; type "succeeded"constant"succeeded"constant
class BetaMessageBatchErroredResult:
BetaErrorResponse error
BetaError error
class BetaInvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class BetaAuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BetaBillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class BetaPermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class BetaNotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class BetaRateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class BetaGatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class BetaApiError:
JsonValue; type "api_error"constant"api_error"constant
class BetaOverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class BetaMessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class BetaMessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
class BetaMessageBatchRequestCounts:
Number of requests in the Message Batch that have been canceled.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that encountered an error.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that have expired.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that are processing.
Number of requests in the Message Batch that have completed successfully.
This is zero until processing of the entire Message Batch has ended.
class BetaMessageBatchResult: A class that can be one of several variants.union Processing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
Processing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
class BetaMessageBatchSucceededResult:
BetaMessage message
Unique object identifier.
The format and length of IDs may change over time.
Information about the container used in the request (for the code execution tool)
Information about the container used in the request (for the code execution tool)
Identifier for the container used in this request
The time at which the container will expire.
Skills loaded in the container
Skills loaded in the container
Skill ID
Type typeType of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Skill version or 'latest' for most recent version
List<BetaContentBlock> contentContent generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
class BetaTextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class BetaThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class BetaRedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class BetaToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class BetaServerToolUseBlock:
Name name
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class BetaWebSearchToolResultBlock:
class BetaWebSearchToolResultError:
BetaWebSearchToolResultErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<BetaWebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
class BetaWebFetchToolResultBlock:
Content content
class BetaWebFetchToolResultErrorBlock:
BetaWebFetchToolResultErrorCode errorCode
JsonValue; type "web_fetch_tool_result_error"constant"web_fetch_tool_result_error"constant
class BetaWebFetchBlock:
BetaDocumentBlock content
Citation configuration for the document
Citation configuration for the document
Source source
class BetaBase64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class BetaPlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
The title of the document
JsonValue; type "document"constant"document"constant
ISO 8601 timestamp when the content was retrieved
JsonValue; type "web_fetch_result"constant"web_fetch_result"constant
Fetched content URL
JsonValue; type "web_fetch_tool_result"constant"web_fetch_tool_result"constant
class BetaCodeExecutionToolResultBlock:
class BetaCodeExecutionToolResultError:
BetaCodeExecutionToolResultErrorCode errorCode
JsonValue; type "code_execution_tool_result_error"constant"code_execution_tool_result_error"constant
class BetaCodeExecutionResultBlock:
List<BetaCodeExecutionOutputBlock> content
JsonValue; type "code_execution_output"constant"code_execution_output"constant
JsonValue; type "code_execution_result"constant"code_execution_result"constant
JsonValue; type "code_execution_tool_result"constant"code_execution_tool_result"constant
class BetaBashCodeExecutionToolResultBlock:
Content content
class BetaBashCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "bash_code_execution_tool_result_error"constant"bash_code_execution_tool_result_error"constant
class BetaBashCodeExecutionResultBlock:
List<BetaBashCodeExecutionOutputBlock> content
JsonValue; type "bash_code_execution_output"constant"bash_code_execution_output"constant
JsonValue; type "bash_code_execution_result"constant"bash_code_execution_result"constant
JsonValue; type "bash_code_execution_tool_result"constant"bash_code_execution_tool_result"constant
class BetaTextEditorCodeExecutionToolResultBlock:
Content content
class BetaTextEditorCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "text_editor_code_execution_tool_result_error"constant"text_editor_code_execution_tool_result_error"constant
class BetaTextEditorCodeExecutionViewResultBlock:
FileType fileType
JsonValue; type "text_editor_code_execution_view_result"constant"text_editor_code_execution_view_result"constant
class BetaTextEditorCodeExecutionCreateResultBlock:
JsonValue; type "text_editor_code_execution_create_result"constant"text_editor_code_execution_create_result"constant
class BetaTextEditorCodeExecutionStrReplaceResultBlock:
JsonValue; type "text_editor_code_execution_str_replace_result"constant"text_editor_code_execution_str_replace_result"constant
JsonValue; type "text_editor_code_execution_tool_result"constant"text_editor_code_execution_tool_result"constant
class BetaMcpToolUseBlock:
The name of the MCP tool
The name of the MCP server
JsonValue; type "mcp_tool_use"constant"mcp_tool_use"constant
class BetaMcpToolResultBlock:
Content content
List<BetaTextBlock>
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
JsonValue; type "mcp_tool_result"constant"mcp_tool_result"constant
class BetaContainerUploadBlock:Response model for a file uploaded to the container.
Response model for a file uploaded to the container.
JsonValue; type "container_upload"constant"container_upload"constant
Context management response.
Information about context management strategies applied during the request.
Context management response.
Information about context management strategies applied during the request.
List<AppliedEdit> appliedEditsList of context management edits that were applied.
List of context management edits that were applied.
class BetaClearToolUses20250919EditResponse:
Number of input tokens cleared by this edit.
Number of tool uses that were cleared.
JsonValue; type "clear_tool_uses_20250919"constant"clear_tool_uses_20250919"constantThe type of context management edit applied.
The type of context management edit applied.
class BetaClearThinking20251015EditResponse:
Number of input tokens cleared by this edit.
Number of thinking turns that were cleared.
JsonValue; type "clear_thinking_20251015"constant"clear_thinking_20251015"constantThe type of context management edit applied.
The type of context management edit applied.
Model modelThe model that will complete your prompt.
See models for additional details and options.
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous most fast and cost-effective
JsonValue; role "assistant"constant"assistant"constantConversational role of the generated message.
This will always be "assistant".
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point"max_tokens": we exceeded the requestedmax_tokensor the model's maximum"stop_sequence": one of your provided customstop_sequenceswas generated"tool_use": the model invoked one or more tools"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue."refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constantObject type.
For Messages, this is always "message".
Object type.
For Messages, this is always "message".
BetaUsage usageBilling and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of server tool requests.
The number of web fetch tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTierIf the request used the priority, standard, or batch tier.
If the request used the priority, standard, or batch tier.
JsonValue; type "succeeded"constant"succeeded"constant
class BetaMessageBatchErroredResult:
BetaErrorResponse error
BetaError error
class BetaInvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class BetaAuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BetaBillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class BetaPermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class BetaNotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class BetaRateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class BetaGatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class BetaApiError:
JsonValue; type "api_error"constant"api_error"constant
class BetaOverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class BetaMessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class BetaMessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
class BetaMessageBatchSucceededResult:
BetaMessage message
Unique object identifier.
The format and length of IDs may change over time.
Information about the container used in the request (for the code execution tool)
Information about the container used in the request (for the code execution tool)
Identifier for the container used in this request
The time at which the container will expire.
Skills loaded in the container
Skills loaded in the container
Skill ID
Type typeType of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Type of skill - either 'anthropic' (built-in) or 'custom' (user-defined)
Skill version or 'latest' for most recent version
List<BetaContentBlock> contentContent generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
class BetaTextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class BetaThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class BetaRedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class BetaToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class BetaServerToolUseBlock:
Name name
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class BetaWebSearchToolResultBlock:
class BetaWebSearchToolResultError:
BetaWebSearchToolResultErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<BetaWebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
class BetaWebFetchToolResultBlock:
Content content
class BetaWebFetchToolResultErrorBlock:
BetaWebFetchToolResultErrorCode errorCode
JsonValue; type "web_fetch_tool_result_error"constant"web_fetch_tool_result_error"constant
class BetaWebFetchBlock:
BetaDocumentBlock content
Citation configuration for the document
Citation configuration for the document
Source source
class BetaBase64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class BetaPlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
The title of the document
JsonValue; type "document"constant"document"constant
ISO 8601 timestamp when the content was retrieved
JsonValue; type "web_fetch_result"constant"web_fetch_result"constant
Fetched content URL
JsonValue; type "web_fetch_tool_result"constant"web_fetch_tool_result"constant
class BetaCodeExecutionToolResultBlock:
class BetaCodeExecutionToolResultError:
BetaCodeExecutionToolResultErrorCode errorCode
JsonValue; type "code_execution_tool_result_error"constant"code_execution_tool_result_error"constant
class BetaCodeExecutionResultBlock:
List<BetaCodeExecutionOutputBlock> content
JsonValue; type "code_execution_output"constant"code_execution_output"constant
JsonValue; type "code_execution_result"constant"code_execution_result"constant
JsonValue; type "code_execution_tool_result"constant"code_execution_tool_result"constant
class BetaBashCodeExecutionToolResultBlock:
Content content
class BetaBashCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "bash_code_execution_tool_result_error"constant"bash_code_execution_tool_result_error"constant
class BetaBashCodeExecutionResultBlock:
List<BetaBashCodeExecutionOutputBlock> content
JsonValue; type "bash_code_execution_output"constant"bash_code_execution_output"constant
JsonValue; type "bash_code_execution_result"constant"bash_code_execution_result"constant
JsonValue; type "bash_code_execution_tool_result"constant"bash_code_execution_tool_result"constant
class BetaTextEditorCodeExecutionToolResultBlock:
Content content
class BetaTextEditorCodeExecutionToolResultError:
ErrorCode errorCode
JsonValue; type "text_editor_code_execution_tool_result_error"constant"text_editor_code_execution_tool_result_error"constant
class BetaTextEditorCodeExecutionViewResultBlock:
FileType fileType
JsonValue; type "text_editor_code_execution_view_result"constant"text_editor_code_execution_view_result"constant
class BetaTextEditorCodeExecutionCreateResultBlock:
JsonValue; type "text_editor_code_execution_create_result"constant"text_editor_code_execution_create_result"constant
class BetaTextEditorCodeExecutionStrReplaceResultBlock:
JsonValue; type "text_editor_code_execution_str_replace_result"constant"text_editor_code_execution_str_replace_result"constant
JsonValue; type "text_editor_code_execution_tool_result"constant"text_editor_code_execution_tool_result"constant
class BetaMcpToolUseBlock:
The name of the MCP tool
The name of the MCP server
JsonValue; type "mcp_tool_use"constant"mcp_tool_use"constant
class BetaMcpToolResultBlock:
Content content
List<BetaTextBlock>
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class BetaCitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class BetaCitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class BetaCitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class BetaCitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class BetaCitationSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
JsonValue; type "mcp_tool_result"constant"mcp_tool_result"constant
class BetaContainerUploadBlock:Response model for a file uploaded to the container.
Response model for a file uploaded to the container.
JsonValue; type "container_upload"constant"container_upload"constant
Context management response.
Information about context management strategies applied during the request.
Context management response.
Information about context management strategies applied during the request.
List<AppliedEdit> appliedEditsList of context management edits that were applied.
List of context management edits that were applied.
class BetaClearToolUses20250919EditResponse:
Number of input tokens cleared by this edit.
Number of tool uses that were cleared.
JsonValue; type "clear_tool_uses_20250919"constant"clear_tool_uses_20250919"constantThe type of context management edit applied.
The type of context management edit applied.
class BetaClearThinking20251015EditResponse:
Number of input tokens cleared by this edit.
Number of thinking turns that were cleared.
JsonValue; type "clear_thinking_20251015"constant"clear_thinking_20251015"constantThe type of context management edit applied.
The type of context management edit applied.
Model modelThe model that will complete your prompt.
See models for additional details and options.
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous most fast and cost-effective
JsonValue; role "assistant"constant"assistant"constantConversational role of the generated message.
This will always be "assistant".
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
The reason that we stopped.
This may be one the following values:
"end_turn": the model reached a natural stopping point"max_tokens": we exceeded the requestedmax_tokensor the model's maximum"stop_sequence": one of your provided customstop_sequenceswas generated"tool_use": the model invoked one or more tools"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue."refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constantObject type.
For Messages, this is always "message".
Object type.
For Messages, this is always "message".
BetaUsage usageBilling and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of server tool requests.
The number of web fetch tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTierIf the request used the priority, standard, or batch tier.
If the request used the priority, standard, or batch tier.