Messages
Models
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class Base64PdfSource:
JsonValue; mediaType "application/pdf" (constant)
JsonValue; type "base64" (constant)
class CacheControlEphemeral:
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
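As a rough illustration of the two ttl values and the default described above, the sketch below maps the documented strings to java.time.Duration. The names CacheTtl and resolve are hypothetical and exist only for this example; they are not part of the SDK.

```java
import java.time.Duration;
import java.util.Optional;

// Hypothetical helper, not an SDK type: models the two documented ttl values.
enum CacheTtl {
    FIVE_MINUTES("5m", Duration.ofMinutes(5)),
    ONE_HOUR("1h", Duration.ofHours(1));

    final String wireValue;   // the string listed in the reference ("5m" or "1h")
    final Duration duration;  // the same value as a Duration

    CacheTtl(String wireValue, Duration duration) {
        this.wireValue = wireValue;
        this.duration = duration;
    }

    // An absent ttl falls back to the documented default of 5m.
    static CacheTtl resolve(Optional<CacheTtl> ttl) {
        return ttl.orElse(FIVE_MINUTES);
    }
}
```

Calling CacheTtl.resolve(Optional.empty()) yields FIVE_MINUTES, which mirrors the "Defaults to 5m" rule.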
class CacheCreation:
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
class CitationCharLocation:
JsonValue; type "char_location" (constant)
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationContentBlockLocation:
JsonValue; type "content_block_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationPageLocation:
JsonValue; type "page_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationsConfigParam:
class CitationsDelta:
Citation citation
class CitationCharLocation:
JsonValue; type "char_location" (constant)
class CitationPageLocation:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocation:
JsonValue; type "content_block_location" (constant)
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location" (constant)
class CitationsSearchResultLocation:
JsonValue; type "search_result_location" (constant)
JsonValue; type "citations_delta" (constant)
class CitationsSearchResultLocation:
JsonValue; type "search_result_location" (constant)
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location" (constant)
class ContentBlock: A class that can be one of several variants (union).
class TextBlock:
Citations supporting the text block.
The type of citation returned depends on the type of document being cited: citing a PDF results in page_location, plain text results in char_location, and a content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location" (constant)
class CitationPageLocation:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocation:
JsonValue; type "content_block_location" (constant)
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location" (constant)
class CitationsSearchResultLocation:
JsonValue; type "search_result_location" (constant)
JsonValue; type "text" (constant)
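The mapping from document type to citation type described for TextBlock can be sketched as a small lookup. DocumentKind and CitationTypes are hypothetical names introduced for this illustration; the SDK models document sources with its own classes.

```java
// Hypothetical enum for illustration only; not an SDK type.
enum DocumentKind { PDF, PLAIN_TEXT, CONTENT }

final class CitationTypes {
    private CitationTypes() {}

    // Returns the citation "type" string that the reference associates with each document kind.
    static String expectedCitationType(DocumentKind kind) {
        switch (kind) {
            case PDF:
                return "page_location";
            case PLAIN_TEXT:
                return "char_location";
            case CONTENT:
                return "content_block_location";
            default:
                throw new IllegalArgumentException("Unknown document kind: " + kind);
        }
    }
}
```

A caller could use such a lookup to decide which citation class to expect before inspecting a TextBlock's citations.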
class ThinkingBlock:
JsonValue; type "thinking" (constant)
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking" (constant)
class ToolUseBlock:
JsonValue; type "tool_use" (constant)
class ServerToolUseBlock:
JsonValue; name "web_search" (constant)
JsonValue; type "server_tool_use" (constant)
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error" (constant)
List<WebSearchResultBlock>
JsonValue; type "web_search_result" (constant)
JsonValue; type "web_search_tool_result" (constant)
class ContentBlockParam: A class that can be one of several variants (union).
Regular text content.
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf" (constant)
JsonValue; type "base64" (constant)
class PlainTextSource:
JsonValue; mediaType "text/plain" (constant)
JsonValue; type "text" (constant)
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content" (constant)
class UrlPdfSource:
JsonValue; type "url" (constant)
JsonValue; type "document" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
JsonValue; type "search_result" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ThinkingBlockParam:
JsonValue; type "thinking" (constant)
class RedactedThinkingBlockParam:
JsonValue; type "redacted_thinking" (constant)
class ToolUseBlockParam:
JsonValue; type "tool_use" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolResultBlockParam:
JsonValue; type "tool_result" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Optional<Content> content
List<Block>
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
JsonValue; type "search_result" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf" (constant)
JsonValue; type "base64" (constant)
class PlainTextSource:
JsonValue; mediaType "text/plain" (constant)
JsonValue; type "text" (constant)
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content" (constant)
class UrlPdfSource:
JsonValue; type "url" (constant)
JsonValue; type "document" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ServerToolUseBlockParam:
JsonValue; name "web_search" (constant)
JsonValue; type "server_tool_use" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class WebSearchToolResultBlockParam:
JsonValue; type "web_search_result" (constant)
class WebSearchToolRequestError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error" (constant)
JsonValue; type "web_search_tool_result" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content" (constant)
class ContentBlockSourceContent: A class that can be one of several variants (union).
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf" (constant)
JsonValue; type "base64" (constant)
class PlainTextSource:
JsonValue; mediaType "text/plain" (constant)
JsonValue; type "text" (constant)
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location" (constant)
class CitationPageLocationParam:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location" (constant)
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location" (constant)
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location" (constant)
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content" (constant)
class UrlPdfSource:
JsonValue; type "url" (constant)
JsonValue; type "document" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64" (constant)
class UrlImageSource:
JsonValue; type "url" (constant)
JsonValue; type "image" (constant)
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral" (constant)
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class InputJsonDelta:
JsonValue; type "input_json_delta" (constant)
class Message:
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> content
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
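To make the continuation behavior concrete, the sketch below rebuilds the example input messages with plain Java collections and joins the trailing assistant turn with the returned text. PrefillExample, inputMessages, and fullAssistantText are hypothetical names for this illustration; it does not call the SDK, and it assumes Java 9 or later for List.of and Map.of.

```java
import java.util.List;
import java.util.Map;

final class PrefillExample {
    private PrefillExample() {}

    // The example input messages; the trailing assistant turn acts as a prefill.
    static List<Map<String, String>> inputMessages() {
        return List.of(
            Map.of("role", "user",
                   "content", "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"),
            Map.of("role", "assistant",
                   "content", "The best answer is ("));
    }

    // If the input ended with an assistant turn, the response text continues it,
    // so the complete assistant text is the prefill followed by the response.
    static String fullAssistantText(List<Map<String, String>> messages, String responseText) {
        Map<String, String> last = messages.get(messages.size() - 1);
        if (!"assistant".equals(last.get("role"))) {
            return responseText;
        }
        return last.get("content") + responseText;
    }
}
```

With the example response text "B)", fullAssistantText(inputMessages(), "B)") gives "The best answer is (B)".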
class TextBlock:
Citations supporting the text block.
The type of citation returned depends on the type of document being cited: citing a PDF results in page_location, plain text results in char_location, and a content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location" (constant)
class CitationPageLocation:
JsonValue; type "page_location" (constant)
class CitationContentBlockLocation:
JsonValue; type "content_block_location" (constant)
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location" (constant)
class CitationsSearchResultLocation:
JsonValue; type "search_result_location" (constant)
JsonValue; type "text" (constant)
class ThinkingBlock:
JsonValue; type "thinking" (constant)
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking" (constant)
class ToolUseBlock:
JsonValue; type "tool_use" (constant)
class ServerToolUseBlock:
JsonValue; name "web_search" (constant)
JsonValue; type "server_tool_use" (constant)
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error" (constant)
List<WebSearchResultBlock>
JsonValue; type "web_search_result" (constant)
JsonValue; type "web_search_tool_result" (constant)
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous most fast and cost-effective
JsonValue; role "assistant" (constant)
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
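The stop reason values listed above can be routed to a follow-up action. The following is a minimal sketch in plain Java, not SDK code: the `NextStep` enum and its constants are hypothetical names chosen for illustration, and the stop reason is assumed to arrive as its raw string value.

```java
final class StopReasonSketch {
    /** Hypothetical follow-up actions; these names are illustrative and not part of any SDK. */
    enum NextStep {
        DONE,
        CONTINUE_OR_RAISE_LIMIT,
        CHECK_STOP_SEQUENCE,
        RUN_TOOLS,
        RESEND_RESPONSE,
        SURFACE_REFUSAL,
        KEEP_READING_STREAM
    }

    /** Maps a raw stop reason string (or null) to an illustrative next step. */
    static NextStep nextStep(String stopReason) {
        if (stopReason == null) {
            // Per the description above, null only occurs in the streaming message_start event.
            return NextStep.KEEP_READING_STREAM;
        }
        switch (stopReason) {
            case "end_turn":
                return NextStep.DONE;
            case "max_tokens":
                return NextStep.CONTINUE_OR_RAISE_LIMIT;
            case "stop_sequence":
                // The stop sequence field says which custom sequence was generated.
                return NextStep.CHECK_STOP_SEQUENCE;
            case "tool_use":
                return NextStep.RUN_TOOLS;
            case "pause_turn":
                // The response may be sent back as-is so the model can continue.
                return NextStep.RESEND_RESPONSE;
            case "refusal":
                return NextStep.SURFACE_REFUSAL;
            default:
                throw new IllegalArgumentException("Unknown stop_reason: " + stopReason);
        }
    }

    public static void main(String[] args) {
        System.out.println(nextStep("pause_turn")); // prints RESEND_RESPONSE
    }
}
```

Throwing on unknown values is a design choice for this sketch; a production client might prefer to log and fall through so that newly added stop reasons do not break it.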
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
If the request used the priority, standard, or batch tier.
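The usage description above states how the three input token counters combine into a request's total. A minimal sketch of that arithmetic follows, assuming the three counts have already been read from the usage object as plain `long` values; the class and method names are hypothetical, and the numbers in `main` are made up.

```java
final class UsageTotals {
    /**
     * Total input tokens = input_tokens + cache_creation_input_tokens + cache_read_input_tokens.
     * Math.addExact throws ArithmeticException on overflow instead of wrapping silently.
     */
    static long totalInputTokens(long inputTokens,
                                 long cacheCreationInputTokens,
                                 long cacheReadInputTokens) {
        return Math.addExact(
                Math.addExact(inputTokens, cacheCreationInputTokens),
                cacheReadInputTokens);
    }

    public static void main(String[] args) {
        // Made-up counts, for illustration only.
        System.out.println(totalInputTokens(12, 2048, 0)); // prints 2060
    }
}
```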
class MessageCountTokensTool: A union class that can be one of several variants.
class Tool:
InputSchema inputSchema
JSON schema for this tool's input.
This defines the shape of the input that your tool accepts and that the model will produce.
JsonValue; type "object"constant"object"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Description of what this tool does.
Tool descriptions should be as detailed as possible. The more information that the model has about what the tool is and how to use it, the better it will perform. You can use natural language descriptions to reinforce important aspects of the tool input JSON schema.
Optional<Type> type
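To make the Tool fields above concrete, here is a minimal sketch that assembles a tool definition as a plain nested map. It deliberately avoids SDK builder types. The wire-format keys `input_schema`, `properties`, and `required`, the tool name `get_weather`, and its single `location` parameter are assumptions made for illustration and do not appear in the listing above.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

final class ToolDefinitionSketch {
    /** Builds a hypothetical "get_weather" tool definition as a JSON-like map. */
    static Map<String, Object> weatherTool() {
        Map<String, Object> location = new LinkedHashMap<>();
        location.put("type", "string");
        location.put("description", "City and state, e.g. San Francisco, CA");

        Map<String, Object> properties = new LinkedHashMap<>();
        properties.put("location", location);

        // The input schema's type is the constant "object", as in the listing above.
        Map<String, Object> inputSchema = new LinkedHashMap<>();
        inputSchema.put("type", "object");
        inputSchema.put("properties", properties);
        inputSchema.put("required", List.of("location"));

        Map<String, Object> tool = new LinkedHashMap<>();
        // The name is how the model refers to the tool in tool_use blocks.
        tool.put("name", "get_weather");
        // A detailed description helps the model decide when and how to call the tool.
        tool.put("description",
                "Returns the current weather for a location. Use it when the user asks "
                        + "about present conditions; it does not provide forecasts.");
        tool.put("input_schema", inputSchema);
        return tool;
    }

    public static void main(String[] args) {
        System.out.println(weatherTool().keySet()); // prints [name, description, input_schema]
    }
}
```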
class ToolBash20250124:
JsonValue; name "bash"constant"bash"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "bash_20250124"constant"bash_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250124:
JsonValue; name "str_replace_editor"constant"str_replace_editor"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250124"constant"text_editor_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250429:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250429"constant"text_editor_20250429"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250728:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250728"constant"text_editor_20250728"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of characters to display when viewing a file. If not specified, defaults to displaying the full file.
class WebSearchTool20250305:
JsonValue; name "web_search"constant"web_search"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "web_search_20250305"constant"web_search_20250305"constant
If provided, only these domains will be included in results. Cannot be used alongside blocked_domains.
If provided, these domains will never appear in results. Cannot be used alongside allowed_domains.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of times the tool can be used in the API request.
Optional<UserLocation> userLocation
Parameters for the user's location. Used to provide more relevant search results.
JsonValue; type "approximate"constant"approximate"constant
The city of the user.
The region of the user.
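The web search tool above documents that allowed_domains and blocked_domains cannot be used together. A client-side check for that rule might look like the following sketch. The class and method names are hypothetical, and treating an empty list the same as an absent one is an assumption of this sketch, not documented behavior.

```java
import java.util.List;

final class WebSearchDomainFilter {
    /**
     * Rejects configurations that set both filters at once.
     * Assumption: a null or empty list counts as "not provided".
     */
    static void validate(List<String> allowedDomains, List<String> blockedDomains) {
        boolean hasAllowed = allowedDomains != null && !allowedDomains.isEmpty();
        boolean hasBlocked = blockedDomains != null && !blockedDomains.isEmpty();
        if (hasAllowed && hasBlocked) {
            throw new IllegalArgumentException(
                    "allowed_domains and blocked_domains cannot be used together");
        }
    }

    public static void main(String[] args) {
        // Only one filter is set, so this passes without throwing.
        validate(List.of("example.com"), null);
        System.out.println("ok");
    }
}
```

Validating before the request is sent only gives earlier feedback; the API remains the authority on what it accepts.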
class MessageDeltaUsage:
The cumulative number of input tokens used to create the cache entry.
The cumulative number of input tokens read from the cache.
The cumulative number of input tokens which were used.
The cumulative number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
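Because the MessageDeltaUsage counters above are described as cumulative, a streaming consumer would keep the most recent value rather than add successive values together. The sketch below illustrates this for output tokens only. The class and method names are hypothetical, and the numbers are made up.

```java
final class CumulativeUsageTracker {
    private long outputTokens;

    /** Records the cumulative output token count reported by one message_delta event. */
    void onMessageDelta(long cumulativeOutputTokens) {
        // Overwrite instead of adding: each event already reports the running total.
        this.outputTokens = cumulativeOutputTokens;
    }

    long outputTokens() {
        return outputTokens;
    }

    public static void main(String[] args) {
        CumulativeUsageTracker tracker = new CumulativeUsageTracker();
        tracker.onMessageDelta(5);
        tracker.onMessageDelta(12);
        tracker.onMessageDelta(30);
        System.out.println(tracker.outputTokens()); // prints 30, not 47
    }
}
```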
class MessageParam:
Content content
List<ContentBlockParam>
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class PlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content"constant"content"constant
class UrlPdfSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "document"constant"document"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "search_result"constant"search_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ThinkingBlockParam:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlockParam:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlockParam:
JsonValue; type "tool_use"constant"tool_use"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolResultBlockParam:
JsonValue; type "tool_result"constant"tool_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Optional<Content> content
List<Block>
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "search_result"constant"search_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class PlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content"constant"content"constant
class UrlPdfSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "document"constant"document"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ServerToolUseBlockParam:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class WebSearchToolResultBlockParam:
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchToolRequestError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Role role
class MessageTokensCount:
The total number of tokens across the provided list of messages, system prompt, and tools.
class Metadata:
An external identifier for the user who is associated with the request.
This should be a uuid, hash value, or other opaque identifier. Anthropic may use this id to help detect abuse. Do not include any identifying information such as name, email address, or phone number.
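One way to obtain an opaque identifier of the kind described above is to hash an internal account ID together with a secret salt. The sketch below is one possible approach under that assumption, not a prescribed method. The class name, the salt parameter, and the choice of SHA-256 are all illustrative, and the values in `main` are made up.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

final class OpaqueUserId {
    /**
     * Derives a stable, opaque hex string from an internal ID.
     * Assumption: secretSalt is an application-held secret, so the hash
     * cannot be reproduced by someone who only guesses internal IDs.
     */
    static String fromInternalId(String secretSalt, String internalId) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            digest.update(secretSalt.getBytes(StandardCharsets.UTF_8));
            digest.update((byte) 0); // separator so ("ab", "c") and ("a", "bc") hash differently
            byte[] hash = digest.digest(internalId.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(hash.length * 2);
            for (byte b : hash) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is expected to be available on standard Java runtimes.
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }

    public static void main(String[] args) {
        // Made-up salt and ID, for illustration only.
        System.out.println(fromInternalId("example-salt", "account-42").length()); // prints 64
    }
}
```

The input should be an internal ID rather than a name, email address, or phone number, in line with the guidance above.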
class PlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
class RawContentBlockDelta: A union class that can be one of several variants.
class TextDelta:
JsonValue; type "text_delta"constant"text_delta"constant
class InputJsonDelta:
JsonValue; type "input_json_delta"constant"input_json_delta"constant
class CitationsDelta:
Citation citation
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "citations_delta"constant"citations_delta"constant
class ThinkingDelta:
JsonValue; type "thinking_delta"constant"thinking_delta"constant
class SignatureDelta:
JsonValue; type "signature_delta"constant"signature_delta"constant
class RawContentBlockDeltaEvent:
RawContentBlockDelta delta
class TextDelta:
JsonValue; type "text_delta"constant"text_delta"constant
class InputJsonDelta:
JsonValue; type "input_json_delta"constant"input_json_delta"constant
class CitationsDelta:
Citation citation
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "citations_delta"constant"citations_delta"constant
class ThinkingDelta:
JsonValue; type "thinking_delta"constant"thinking_delta"constant
class SignatureDelta:
JsonValue; type "signature_delta"constant"signature_delta"constant
JsonValue; type "content_block_delta"constant"content_block_delta"constant
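The delta variants listed above arrive as a sequence of content_block_delta events that a client stitches back together. The following sketch shows one way to accumulate them for a single content block, keyed on the delta type string. It is written in plain Java without SDK types. The class name is hypothetical, and it assumes each delta's payload has already been extracted as a string (the text fragment, partial JSON fragment, thinking fragment, or signature).

```java
final class DeltaAccumulator {
    private final StringBuilder text = new StringBuilder();
    private final StringBuilder toolInputJson = new StringBuilder();
    private final StringBuilder thinking = new StringBuilder();
    private String signature;

    /** Applies one delta to this content block's accumulated state. */
    void accept(String deltaType, String payload) {
        switch (deltaType) {
            case "text_delta":
                text.append(payload);
                break;
            case "input_json_delta":
                // Fragments are only guaranteed to be valid JSON once the block has stopped.
                toolInputJson.append(payload);
                break;
            case "thinking_delta":
                thinking.append(payload);
                break;
            case "signature_delta":
                signature = payload;
                break;
            case "citations_delta":
                // Citation objects are structured; this sketch does not model them.
                break;
            default:
                throw new IllegalArgumentException("Unknown delta type: " + deltaType);
        }
    }

    String text() { return text.toString(); }
    String toolInputJson() { return toolInputJson.toString(); }
    String thinking() { return thinking.toString(); }
    String signature() { return signature; }

    public static void main(String[] args) {
        DeltaAccumulator block = new DeltaAccumulator();
        block.accept("text_delta", "Hel");
        block.accept("text_delta", "lo");
        System.out.println(block.text()); // prints Hello
    }
}
```

A full client would keep one accumulator per content block and parse the tool input JSON only after the matching content_block_stop event.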
class RawContentBlockStartEvent:
ContentBlock contentBlock
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
JsonValue; type "content_block_start"constant"content_block_start"constant
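The citation rule stated for text blocks above (PDF to page_location, plain text to char_location, content document to content_block_location) can be written as a small lookup. In this sketch the `DocumentKind` enum is a hypothetical stand-in for however a client tracks what kind of document it sent. It is not an SDK type.

```java
final class CitationTypeSketch {
    /** Hypothetical classification of the cited document; not part of any SDK. */
    enum DocumentKind { PDF, PLAIN_TEXT, CONTENT }

    /** Returns the citation type string expected for a document of the given kind. */
    static String expectedCitationType(DocumentKind kind) {
        switch (kind) {
            case PDF:
                return "page_location";
            case PLAIN_TEXT:
                return "char_location";
            case CONTENT:
                return "content_block_location";
            default:
                throw new IllegalArgumentException("Unhandled kind: " + kind);
        }
    }

    public static void main(String[] args) {
        System.out.println(expectedCitationType(DocumentKind.PDF)); // prints page_location
    }
}
```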
class RawContentBlockStopEvent:
JsonValue; type "content_block_stop"constant"content_block_stop"constant
class RawMessageDeltaEvent:
Delta delta
JsonValue; type "message_delta"constant"message_delta"constant
MessageDeltaUsage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
The cumulative number of input tokens used to create the cache entry.
The cumulative number of input tokens read from the cache.
The cumulative number of input tokens which were used.
The cumulative number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
class RawMessageStartEvent:
Message message
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> contentContent generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
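The prefill behaviour described above can be sketched in plain Java. This is only an illustration under stated assumptions: PrefillExample and its methods are hypothetical names, messages are modelled as simple maps rather than SDK types, and the reply text "B)" is the sample value from the text above, not a real API call.

```java
import java.util.List;
import java.util.Map;

// Hypothetical helper for illustration; not part of any SDK.
final class PrefillExample {
    // The request ends with an assistant turn, so the model continues from it.
    static List<Map<String, String>> buildMessages() {
        return List.of(
            Map.of("role", "user",
                   "content", "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"),
            Map.of("role", "assistant",
                   "content", "The best answer is ("));
    }

    // The response content holds only the continuation, so the complete
    // assistant turn is the prefill followed by the returned text.
    static String fullAssistantTurn(String prefill, String responseText) {
        return prefill + responseText;
    }

    public static void main(String[] args) {
        List<Map<String, String>> messages = buildMessages();
        String prefill = messages.get(messages.size() - 1).get("content");
        // "B)" stands in for the text of the first content block of a response.
        System.out.println(fullAssistantTurn(prefill, "B)"));
    }
}
```

Because the response does not repeat the prefilled text, callers that need the whole assistant turn have to join the two parts themselves.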
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous fastest and most cost-effective model
JsonValue; role "assistant"constant"assistant"constant
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
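One way a caller might branch on the values listed above is sketched below. The class name, method name, and returned action labels are hypothetical and chosen only for this illustration; the stop reason strings are the ones documented here.

```java
// Hypothetical dispatcher for illustration; the action labels are made up.
final class StopReasonExample {
    // Maps a stop_reason value to a suggested next step for the caller.
    static String nextAction(String stopReason) {
        if (stopReason == null) {
            // In streaming mode the value is null in the message_start event.
            return "wait";
        }
        switch (stopReason) {
            case "end_turn":
            case "stop_sequence":
                return "done";
            case "max_tokens":
                return "truncated";
            case "tool_use":
                return "run_tools";
            case "pause_turn":
                // The response can be sent back as-is to let the model continue.
                return "resend_response";
            case "refusal":
                return "handle_refusal";
            default:
                // Tolerate values this sketch does not know about.
                return "unknown";
        }
    }

    public static void main(String[] args) {
        System.out.println(nextAction("tool_use"));
        System.out.println(nextAction(null));
    }
}
```

Keeping a default branch means a value that is not in the list above does not crash the caller.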
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
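The sum described above can be written out as a small helper. This is a minimal sketch: UsageTotals is a hypothetical class, the parameters mirror the three usage fields named in the text, and the numbers in main are invented sample values.

```java
// Hypothetical helper for illustration; not an SDK type.
final class UsageTotals {
    // Total input tokens = input_tokens + cache_creation_input_tokens
    //                      + cache_read_input_tokens.
    static long totalInputTokens(long inputTokens,
                                 long cacheCreationInputTokens,
                                 long cacheReadInputTokens) {
        return inputTokens + cacheCreationInputTokens + cacheReadInputTokens;
    }

    public static void main(String[] args) {
        // Sample values only: 12 uncached, 2048 written to cache, 512 read from cache.
        System.out.println(totalInputTokens(12, 2048, 512)); // prints 2572
    }
}
```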
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
Which service tier the request used: priority, standard, or batch.
JsonValue; type "message_start"constant"message_start"constant
class RawMessageStopEvent:
JsonValue; type "message_stop"constant"message_stop"constant
class RawMessageStreamEvent: A class that can be one of several variants.
class RawMessageStartEvent:
Message message
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> content
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous fastest and most cost-effective model
JsonValue; role "assistant"constant"assistant"constant
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
Which service tier the request used: priority, standard, or batch.
JsonValue; type "message_start"constant"message_start"constant
class RawMessageDeltaEvent:
Delta delta
JsonValue; type "message_delta"constant"message_delta"constant
MessageDeltaUsage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
Total input tokens in a request is the summation of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
The cumulative number of input tokens used to create the cache entry.
The cumulative number of input tokens read from the cache.
The cumulative number of input tokens which were used.
The cumulative number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
class RawMessageStopEvent:
JsonValue; type "message_stop"constant"message_stop"constant
class RawContentBlockStartEvent:
ContentBlock contentBlock
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
JsonValue; type "content_block_start"constant"content_block_start"constant
class RawContentBlockDeltaEvent:
RawContentBlockDelta delta
class TextDelta:
JsonValue; type "text_delta"constant"text_delta"constant
class InputJsonDelta:
JsonValue; type "input_json_delta"constant"input_json_delta"constant
class CitationsDelta:
Citation citation
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "citations_delta"constant"citations_delta"constant
class ThinkingDelta:
JsonValue; type "thinking_delta"constant"thinking_delta"constant
class SignatureDelta:
JsonValue; type "signature_delta"constant"signature_delta"constant
JsonValue; type "content_block_delta"constant"content_block_delta"constant
class RawContentBlockStopEvent:
JsonValue; type "content_block_stop"constant"content_block_stop"constant
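To show how the event variants above fit together, here is a rough sketch that collects the text of a streamed response from content_block_delta events carrying a text_delta. It does not use the SDK classes: Event is a hypothetical, simplified stand-in holding only the fields this sketch needs, and the text field on the delta is an assumption of the sketch.

```java
import java.util.List;

// Hypothetical, simplified model of stream events; not the SDK types.
final class StreamAccumulator {
    static final class Event {
        final String type;      // e.g. "content_block_delta"
        final String deltaType; // e.g. "text_delta"; null for non-delta events
        final String text;      // text fragment; null unless deltaType is "text_delta"

        Event(String type, String deltaType, String text) {
            this.type = type;
            this.deltaType = deltaType;
            this.text = text;
        }
    }

    // Concatenates text fragments in arrival order and stops at message_stop.
    static String collectText(List<Event> events) {
        StringBuilder sb = new StringBuilder();
        for (Event e : events) {
            if ("message_stop".equals(e.type)) {
                break;
            }
            if ("content_block_delta".equals(e.type) && "text_delta".equals(e.deltaType)) {
                sb.append(e.text);
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<Event> events = List.of(
            new Event("message_start", null, null),
            new Event("content_block_start", null, null),
            new Event("content_block_delta", "text_delta", "Hi, "),
            new Event("content_block_delta", "text_delta", "I'm Claude."),
            new Event("content_block_stop", null, null),
            new Event("message_delta", null, null),
            new Event("message_stop", null, null));
        System.out.println(collectText(events));
    }
}
```

Other delta types (input_json_delta, citations_delta, thinking_delta, signature_delta) are ignored here; a fuller handler would route each one to its own accumulator.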
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class RedactedThinkingBlockParam:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "search_result"constant"search_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
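The ttl rule for cache control breakpoints (two allowed values, with 5m as the default) could be modelled as follows. CacheTtl and its parse method are hypothetical names for this sketch; only the values 5m and 1h and the default come from the text.

```java
import java.time.Duration;

// Hypothetical helper for illustration; not an SDK type.
final class CacheTtl {
    // Converts a ttl value to a Duration; a missing ttl falls back to 5 minutes.
    static Duration parse(String ttl) {
        if (ttl == null) {
            return Duration.ofMinutes(5); // documented default
        }
        switch (ttl) {
            case "5m":
                return Duration.ofMinutes(5);
            case "1h":
                return Duration.ofHours(1);
            default:
                throw new IllegalArgumentException("Unsupported ttl: " + ttl);
        }
    }

    public static void main(String[] args) {
        System.out.println(parse(null).toMinutes()); // prints 5
        System.out.println(parse("1h").toMinutes()); // prints 60
    }
}
```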
class ServerToolUsage:
The number of web search tool requests.
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class ServerToolUseBlockParam:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SignatureDelta:
JsonValue; type "signature_delta"constant"signature_delta"constant
enum StopReason:
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class TextCitation: A class that can be one of several variants.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
class TextCitationParam: A class that can be one of several variants.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class TextDelta:
JsonValue; type "text_delta"constant"text_delta"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class ThinkingBlockParam:
JsonValue; type "thinking"constant"thinking"constant
class ThinkingConfigDisabled:
JsonValue; type "disabled"constant"disabled"constant
class ThinkingConfigEnabled:
Determines how many tokens Claude can use for its internal reasoning process. Larger budgets can enable more thorough analysis for complex problems, improving response quality.
Must be ≥1024 and less than max_tokens.
See extended thinking for details.
JsonValue; type "enabled"constant"enabled"constant
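The budget constraint stated above (at least 1024 and strictly less than max_tokens) can be checked before sending a request. This is a minimal sketch; ThinkingBudget and isValid are hypothetical names, and the check covers only the two bounds given in the text.

```java
// Hypothetical validator for illustration; not an SDK type.
final class ThinkingBudget {
    static final int MIN_BUDGET_TOKENS = 1024;

    // True when the budget is >= 1024 and strictly less than maxTokens.
    static boolean isValid(int budgetTokens, int maxTokens) {
        return budgetTokens >= MIN_BUDGET_TOKENS && budgetTokens < maxTokens;
    }

    public static void main(String[] args) {
        System.out.println(isValid(1024, 2048)); // prints true
        System.out.println(isValid(2048, 2048)); // prints false
    }
}
```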
class ThinkingConfigParam: A class that can be one of several variants.
Configuration for enabling Claude's extended thinking.
When enabled, responses include thinking content blocks showing Claude's thinking process before the final answer. Requires a minimum budget of 1,024 tokens and counts towards your max_tokens limit.
See extended thinking for details.
class ThinkingConfigEnabled:
Determines how many tokens Claude can use for its internal reasoning process. Larger budgets can enable more thorough analysis for complex problems, improving response quality.
Must be ≥1024 and less than max_tokens.
See extended thinking for details.
JsonValue; type "enabled"constant"enabled"constant
class ThinkingConfigDisabled:
JsonValue; type "disabled"constant"disabled"constant
class ThinkingDelta:
JsonValue; type "thinking_delta"constant"thinking_delta"constant
class Tool:
InputSchema inputSchema
JSON schema for this tool's input.
This defines the shape of the input that your tool accepts and that the model will produce.
JsonValue; type "object"constant"object"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Description of what this tool does.
Tool descriptions should be as detailed as possible. The more information that the model has about what the tool is and how to use it, the better it will perform. You can use natural language descriptions to reinforce important aspects of the tool input JSON schema.
Optional<Type> type
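As an illustration of the pieces a tool definition carries (a name, a detailed description, and an input schema of type "object"), the sketch below builds one as plain nested maps. The get_weather tool, its location property, and the ToolDefinitionExample class are invented for this example, and the JSON key names used here are assumptions about the wire format rather than SDK builder calls.

```java
import java.util.List;
import java.util.Map;

// Hypothetical example tool; the names and schema are invented for illustration.
final class ToolDefinitionExample {
    static Map<String, Object> weatherTool() {
        // JSON schema describing the input the tool accepts.
        Map<String, Object> inputSchema = Map.of(
            "type", "object",
            "properties", Map.of(
                "location", Map.of(
                    "type", "string",
                    "description", "City and state, e.g. San Francisco, CA")),
            "required", List.of("location"));

        return Map.of(
            "name", "get_weather",
            // A detailed description helps the model decide when and how to call the tool.
            "description", "Get the current weather for a given location. "
                + "Use this when the user asks about present conditions in a named place.",
            "input_schema", inputSchema);
    }

    public static void main(String[] args) {
        System.out.println(weatherTool().get("name")); // prints get_weather
    }
}
```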
class ToolBash20250124:
JsonValue; name "bash"constant"bash"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "bash_20250124"constant"bash_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolChoice: A class that can be one of several variants.
How the model should use the provided tools. The model can use a specific tool, any available tool, decide by itself, or not use tools at all.
class ToolChoiceAuto:
The model will automatically decide whether to use tools.
JsonValue; type "auto"constant"auto"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output at most one tool use.
class ToolChoiceAny:
The model will use any available tools.
JsonValue; type "any"constant"any"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output exactly one tool use.
class ToolChoiceTool:
The model will use the specified tool with tool_choice.name.
The name of the tool to use.
JsonValue; type "tool"constant"tool"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output exactly one tool use.
class ToolChoiceNone:
The model will not be allowed to use tools.
JsonValue; type "none"constant"none"constant
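The four variants above differ only in their type value and in which extra fields they carry. A rough sketch, using plain maps in place of the SDK classes: ToolChoiceExample is a hypothetical helper, and the key disable_parallel_tool_use is assumed here as the wire name of the "disable parallel tool use" flag.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical builders for illustration; not the SDK classes.
final class ToolChoiceExample {
    private static Map<String, Object> base(String type) {
        Map<String, Object> choice = new LinkedHashMap<>();
        choice.put("type", type);
        return choice;
    }

    // The model decides whether to use tools; at most one tool use if the flag is true.
    static Map<String, Object> auto(boolean disableParallelToolUse) {
        Map<String, Object> choice = base("auto");
        choice.put("disable_parallel_tool_use", disableParallelToolUse);
        return choice;
    }

    // The model uses any available tool; exactly one tool use if the flag is true.
    static Map<String, Object> any(boolean disableParallelToolUse) {
        Map<String, Object> choice = base("any");
        choice.put("disable_parallel_tool_use", disableParallelToolUse);
        return choice;
    }

    // The model uses the named tool; exactly one tool use if the flag is true.
    static Map<String, Object> tool(String name, boolean disableParallelToolUse) {
        Map<String, Object> choice = base("tool");
        choice.put("name", name);
        choice.put("disable_parallel_tool_use", disableParallelToolUse);
        return choice;
    }

    // The model is not allowed to use tools.
    static Map<String, Object> none() {
        return base("none");
    }

    public static void main(String[] args) {
        System.out.println(tool("get_weather", true));
    }
}
```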
class ToolChoiceAny:
The model will use any available tools.
JsonValue; type "any"constant"any"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output exactly one tool use.
class ToolChoiceAuto:
The model will automatically decide whether to use tools.
JsonValue; type "auto"constant"auto"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output at most one tool use.
class ToolChoiceNone:The model will not be allowed to use tools.
The model will not be allowed to use tools.
JsonValue; type "none"constant"none"constant
class ToolChoiceTool:The model will use the specified tool with tool_choice.name.
The model will use the specified tool with tool_choice.name.
The name of the tool to use.
JsonValue; type "tool"constant"tool"constant
Whether to disable parallel tool use.
Defaults to false. If set to true, the model will output exactly one tool use.
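The four tool choice variants differ only in their type constant, an optional tool name, and the parallel-tool-use flag. The sketch below renders each variant as JSON text to make that shape concrete. `ToolChoiceSketch` is a hypothetical helper, not an SDK class, and the wire field name `disable_parallel_tool_use` is an assumption here, since the listing above only describes the field in prose.

```java
// Hypothetical helper (not an SDK class): renders tool_choice variants as JSON text.
final class ToolChoiceSketch {
    static String auto(boolean disableParallel) {
        return render("auto", null, disableParallel);
    }

    static String any(boolean disableParallel) {
        return render("any", null, disableParallel);
    }

    static String tool(String name, boolean disableParallel) {
        return render("tool", name, disableParallel);
    }

    // The "none" variant carries no parallel-tool-use flag.
    static String none() {
        return "{\"type\":\"none\"}";
    }

    // Note: this sketch does not escape special characters in the tool name.
    private static String render(String type, String name, boolean disableParallel) {
        StringBuilder json = new StringBuilder("{\"type\":\"").append(type).append('"');
        if (name != null) {
            json.append(",\"name\":\"").append(name).append('"');
        }
        if (disableParallel) {
            // Assumed wire name; omitted when false because false is the default.
            json.append(",\"disable_parallel_tool_use\":true");
        }
        return json.append('}').toString();
    }

    public static void main(String[] args) {
        System.out.println(tool("get_weather", true));
    }
}
```

Omitting the flag when it is false keeps the payload minimal, which is safe only because the listing states that false is the default.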
class ToolResultBlockParam:
JsonValue; type "tool_result"constant"tool_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Optional<Content> content
List<Block>
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class SearchResultBlockParam:
List<TextBlockParam> content
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "search_result"constant"search_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class DocumentBlockParam:
Source source
class Base64PdfSource:
JsonValue; mediaType "application/pdf"constant"application/pdf"constant
JsonValue; type "base64"constant"base64"constant
class PlainTextSource:
JsonValue; mediaType "text/plain"constant"text/plain"constant
JsonValue; type "text"constant"text"constant
class ContentBlockSource:
Content content
class TextBlockParam:
JsonValue; type "text"constant"text"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class CitationCharLocationParam:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocationParam:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocationParam:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationWebSearchResultLocationParam:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationSearchResultLocationParam:
JsonValue; type "search_result_location"constant"search_result_location"constant
class ImageBlockParam:
Source source
class Base64ImageSource:
MediaType mediaType
JsonValue; type "base64"constant"base64"constant
class UrlImageSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "image"constant"image"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
JsonValue; type "content"constant"content"constant
class UrlPdfSource:
JsonValue; type "url"constant"url"constant
JsonValue; type "document"constant"document"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250124:
JsonValue; name "str_replace_editor"constant"str_replace_editor"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250124"constant"text_editor_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250429:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250429"constant"text_editor_20250429"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250728:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250728"constant"text_editor_20250728"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of characters to display when viewing a file. If not specified, defaults to displaying the full file.
class ToolUnion: A class that can be one of several variants (union).
class Tool:
InputSchema inputSchema
JSON schema for this tool's input.
This defines the shape of the input that your tool accepts and that the model will produce.
JsonValue; type "object"constant"object"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Description of what this tool does.
Tool descriptions should be as detailed as possible. The more information that the model has about what the tool is and how to use it, the better it will perform. You can use natural language descriptions to reinforce important aspects of the tool input JSON schema.
Optional<Type> type
class ToolBash20250124:
JsonValue; name "bash"constant"bash"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "bash_20250124"constant"bash_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250124:
JsonValue; name "str_replace_editor"constant"str_replace_editor"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250124"constant"text_editor_20250124"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250429:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250429"constant"text_editor_20250429"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class ToolTextEditor20250728:
JsonValue; name "str_replace_based_edit_tool"constant"str_replace_based_edit_tool"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "text_editor_20250728"constant"text_editor_20250728"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of characters to display when viewing a file. If not specified, defaults to displaying the full file.
class WebSearchTool20250305:
JsonValue; name "web_search"constant"web_search"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "web_search_20250305"constant"web_search_20250305"constant
If provided, only these domains will be included in results. Cannot be used alongside blocked_domains.
If provided, these domains will never appear in results. Cannot be used alongside allowed_domains.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of times the tool can be used in the API request.
Optional<UserLocation> userLocation
Parameters for the user's location. Used to provide more relevant search results.
JsonValue; type "approximate"constant"approximate"constant
The city of the user.
The region of the user.
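Because allowed_domains and blocked_domains cannot be used together, a client may want to reject such a configuration before sending a request. The following is a minimal sketch of that check. `WebSearchDomainsSketch` and `validate` are hypothetical names, and treating `null` as "not provided" is an assumption of this example rather than documented behavior.

```java
import java.util.List;

// Hypothetical pre-flight check (not an SDK class) for the web search tool's domain filters.
final class WebSearchDomainsSketch {
    // Throws if both filters are provided; null means "not provided" in this sketch.
    static void validate(List<String> allowedDomains, List<String> blockedDomains) {
        if (allowedDomains != null && blockedDomains != null) {
            throw new IllegalArgumentException(
                "allowed_domains cannot be used alongside blocked_domains");
        }
    }

    public static void main(String[] args) {
        validate(List.of("example.com"), null); // accepted: only one filter is set
        System.out.println("ok");
    }
}
```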
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ToolUseBlockParam:
JsonValue; type "tool_use"constant"tool_use"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class UrlImageSource:
JsonValue; type "url"constant"url"constant
class UrlPdfSource:
JsonValue; type "url"constant"url"constant
class Usage:
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
If the request used the priority, standard, or batch tier.
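The total input tokens for a request are the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens. The sketch below shows that arithmetic with a small record. `UsageSketch.Usage` and its accessor names are hypothetical stand-ins for this example, not the SDK's own `Usage` class.

```java
// Hypothetical stand-in (not the SDK's Usage class) showing the total-input arithmetic.
final class UsageSketch {
    record Usage(long inputTokens,
                 long cacheCreationInputTokens,
                 long cacheReadInputTokens,
                 long outputTokens) {

        // Total input = uncached input + tokens written to the cache + tokens read from it.
        long totalInputTokens() {
            return inputTokens + cacheCreationInputTokens + cacheReadInputTokens;
        }
    }

    public static void main(String[] args) {
        Usage usage = new Usage(100, 2000, 300, 50);
        System.out.println(usage.totalInputTokens());
    }
}
```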
class WebSearchResultBlock:
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchResultBlockParam:
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchTool20250305:
JsonValue; name "web_search"constant"web_search"constant
Name of the tool.
This is how the tool will be called by the model and in tool_use blocks.
JsonValue; type "web_search_20250305"constant"web_search_20250305"constant
If provided, only these domains will be included in results. Cannot be used alongside blocked_domains.
If provided, these domains will never appear in results. Cannot be used alongside allowed_domains.
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
Maximum number of times the tool can be used in the API request.
Optional<UserLocation> userLocation
Parameters for the user's location. Used to provide more relevant search results.
JsonValue; type "approximate"constant"approximate"constant
The city of the user.
The region of the user.
class WebSearchToolRequestError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
class WebSearchToolResultBlockContent: A class that can be one of several variants (union).
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchToolResultBlockParam:
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchToolRequestError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Create a cache control breakpoint at this content block.
JsonValue; type "ephemeral"constant"ephemeral"constant
Optional<Ttl> ttl
The time-to-live for the cache control breakpoint.
This may be one of the following values:
5m: 5 minutes
1h: 1 hour
Defaults to 5m.
class WebSearchToolResultBlockParamContent: A class that can be one of several variants (union).
JsonValue; type "web_search_result"constant"web_search_result"constant
class WebSearchToolRequestError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
MessagesBatches
Cancel a Message Batch
Create a Message Batch
Delete a Message Batch
List Message Batches
Retrieve Message Batch results
Retrieve a Message Batch
Models
class DeletedMessageBatch:
ID of the Message Batch.
JsonValue; type "message_batch_deleted"constant"message_batch_deleted"constant
Deleted object type.
For Message Batches, this is always "message_batch_deleted".
class MessageBatch:
Unique object identifier.
The format and length of IDs may change over time.
RFC 3339 datetime string representing the time at which the Message Batch was archived and its results became unavailable.
RFC 3339 datetime string representing the time at which cancellation was initiated for the Message Batch. Specified only if cancellation was initiated.
RFC 3339 datetime string representing the time at which the Message Batch was created.
RFC 3339 datetime string representing the time at which processing for the Message Batch ended. Specified only once processing ends.
Processing ends when every request in a Message Batch has either succeeded, errored, canceled, or expired.
RFC 3339 datetime string representing the time at which the Message Batch will expire and end processing, which is 24 hours after creation.
ProcessingStatus processingStatus
Processing status of the Message Batch.
MessageBatchRequestCounts requestCounts
Tallies requests within the Message Batch, categorized by their status.
Requests start as processing and move to one of the other statuses only once processing of the entire batch ends. The sum of all values always matches the total number of requests in the batch.
Number of requests in the Message Batch that have been canceled.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that encountered an error.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that have expired.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that are processing.
Number of requests in the Message Batch that have completed successfully.
This is zero until processing of the entire Message Batch has ended.
URL to a .jsonl file containing the results of the Message Batch requests. Specified only once processing ends.
Results in the file are not guaranteed to be in the same order as requests. Use the custom_id field to match results to requests.
JsonValue; type "message_batch"constant"message_batch"constant
Object type.
For Message Batches, this is always "message_batch".
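The request counts of a Message Batch always sum to the total number of requests, and every status other than processing stays at zero until the whole batch has ended. The sketch below models that tally. `BatchCountsSketch.RequestCounts` is a hypothetical record written for this example, not the SDK's `MessageBatchRequestCounts` class. The `noneProcessing` check is an inference from the description above, so in practice the batch's processing status remains the authoritative signal.

```java
// Hypothetical stand-in (not the SDK class) for a Message Batch's request tallies.
final class BatchCountsSketch {
    record RequestCounts(int canceled, int errored, int expired, int processing, int succeeded) {

        // The sum of all status counts matches the number of requests in the batch.
        int total() {
            return canceled + errored + expired + processing + succeeded;
        }

        // Inference: requests leave "processing" only once the entire batch ends.
        boolean noneProcessing() {
            return processing == 0;
        }
    }

    public static void main(String[] args) {
        RequestCounts counts = new RequestCounts(1, 2, 0, 0, 7);
        System.out.println(counts.total() + " requests, none processing: " + counts.noneProcessing());
    }
}
```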
class MessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class MessageBatchErroredResult:
ErrorResponse error
ErrorObject error
class InvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class AuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class PermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class NotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class RateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class GatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class ApiErrorObject:
JsonValue; type "api_error"constant"api_error"constant
class OverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class MessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
class MessageBatchIndividualResponse:
This is a single line in the response .jsonl file and does not represent the response as a whole.
Developer-provided ID created for each request in a Message Batch. Useful for matching results to requests, as results may be given out of request order.
Must be unique for each request within the Message Batch.
MessageBatchResult result
Processing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
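Since results may arrive in a different order from the requests, one simple approach is to index the parsed results by their custom ID before looking anything up. The sketch below assumes the .jsonl lines have already been parsed. `BatchResultsSketch`, its `IndividualResponse` record, and `indexByCustomId` are hypothetical names for this example, and the result type is reduced to a plain string such as "succeeded" or "errored".

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical sketch (not SDK code): index batch results by custom ID.
final class BatchResultsSketch {
    // Simplified view of one parsed line from the results file.
    record IndividualResponse(String customId, String resultType) {}

    static Map<String, IndividualResponse> indexByCustomId(List<IndividualResponse> results) {
        Map<String, IndividualResponse> byId = new LinkedHashMap<>();
        for (IndividualResponse response : results) {
            // Custom IDs must be unique within a batch, so a repeat indicates bad input.
            if (byId.put(response.customId(), response) != null) {
                throw new IllegalStateException("Duplicate custom_id: " + response.customId());
            }
        }
        return byId;
    }

    public static void main(String[] args) {
        Map<String, IndividualResponse> byId = indexByCustomId(List.of(
            new IndividualResponse("req-2", "succeeded"),
            new IndividualResponse("req-1", "errored")));
        System.out.println(byId.get("req-1").resultType());
    }
}
```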
class MessageBatchSucceededResult:
Message message
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> content
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
{"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
{"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
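Because the response continues directly from the trailing assistant turn, the complete assistant text is the prefilled turn followed by the returned text. The sketch below joins the two using the values from the example above. `PrefillSketch` and `fullAssistantText` are hypothetical names for this example.

```java
// Hypothetical sketch (not SDK code): rebuild the full assistant text after a prefilled turn.
final class PrefillSketch {
    // The response picks up exactly where the prefilled assistant turn left off.
    static String fullAssistantText(String assistantPrefill, String responseText) {
        return assistantPrefill + responseText;
    }

    public static void main(String[] args) {
        System.out.println(fullAssistantText("The best answer is (", "B)"));
    }
}
```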
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
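The citation type for a text block depends on the kind of document being cited: a PDF yields page_location, plain text yields char_location, and a content document yields content_block_location. The sketch below captures that mapping as a lookup. `CitationTypeSketch` is a hypothetical helper, and the document kind labels "pdf", "plain_text", and "content" are illustrative names chosen for this example.

```java
// Hypothetical lookup (not SDK code): expected citation type per cited document kind.
final class CitationTypeSketch {
    // The documentKind labels are illustrative, not API constants.
    static String citationTypeFor(String documentKind) {
        switch (documentKind) {
            case "pdf":
                return "page_location";
            case "plain_text":
                return "char_location";
            case "content":
                return "content_block_location";
            default:
                throw new IllegalArgumentException("Unknown document kind: " + documentKind);
        }
    }

    public static void main(String[] args) {
        System.out.println(citationTypeFor("pdf"));
    }
}
```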
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous most fast and cost-effective
JsonValue; role "assistant"constant"assistant"constant
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
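A caller typically branches on the stop reason to decide what to do next. The sketch below shows one possible dispatch over the values listed above, including the null case that occurs in the streaming message_start event. `StopReasonSketch`, `nextStep`, and the returned action labels are hypothetical names chosen for this example; they are not part of the API.

```java
// Hypothetical dispatch (not SDK code): map a stop reason to a follow-up action label.
final class StopReasonSketch {
    static String nextStep(String stopReason) {
        if (stopReason == null) {
            // In streaming mode the value is null in the message_start event.
            return "keep_streaming";
        }
        switch (stopReason) {
            case "end_turn":
            case "stop_sequence":
                return "done";
            case "max_tokens":
                return "output_truncated";
            case "tool_use":
                return "run_tools";
            case "pause_turn":
                // The response can be sent back as-is to let the model continue.
                return "resend_to_continue";
            case "refusal":
                return "handle_refusal";
            default:
                // Unrecognized values are surfaced rather than silently treated as done.
                return "unknown";
        }
    }

    public static void main(String[] args) {
        System.out.println(nextStep("tool_use"));
    }
}
```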
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
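The input-token total described above can be written out as a one-line sum. This is a minimal sketch: the class UsageTotals and its method are hypothetical names, and the three parameters stand in for the usage fields input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.

```java
// Hypothetical helper (not an SDK class): adds up the three input-token counters from usage.
final class UsageTotals {
    private UsageTotals() {}

    /** Total input tokens = input_tokens + cache_creation_input_tokens + cache_read_input_tokens. */
    static long totalInputTokens(long inputTokens,
                                 long cacheCreationInputTokens,
                                 long cacheReadInputTokens) {
        return inputTokens + cacheCreationInputTokens + cacheReadInputTokens;
    }
}
```

For instance, UsageTotals.totalInputTokens(12, 2048, 512) returns 2572.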
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
Which service tier the request used: priority, standard, or batch.
JsonValue; type "succeeded"constant"succeeded"constant
class MessageBatchErroredResult:
ErrorResponse error
ErrorObject error
class InvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class AuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class PermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class NotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class RateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class GatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class ApiErrorObject:
JsonValue; type "api_error"constant"api_error"constant
class OverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class MessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class MessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
class MessageBatchRequestCounts:
Number of requests in the Message Batch that have been canceled.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that encountered an error.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that have expired.
This is zero until processing of the entire Message Batch has ended.
Number of requests in the Message Batch that are processing.
Number of requests in the Message Batch that have completed successfully.
This is zero until processing of the entire Message Batch has ended.
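Because every status counter except the processing count stays at zero until the whole batch has ended, the counters can be combined into a rough progress check. The sketch below is illustrative only: BatchProgress and both methods are hypothetical names, and treating "no requests still processing" as "finished" is an assumption of this example, not a documented rule.

```java
// Hypothetical helper (not an SDK class): works on the five MessageBatchRequestCounts values.
final class BatchProgress {
    private BatchProgress() {}

    /** Sums the five counters to get the number of requests in the batch. */
    static long totalRequests(long processing, long succeeded, long errored,
                              long canceled, long expired) {
        return processing + succeeded + errored + canceled + expired;
    }

    /** Assumption of this sketch: a batch with no requests still processing is treated as finished. */
    static boolean looksFinished(long processing) {
        return processing == 0;
    }
}
```

For instance, BatchProgress.totalRequests(3, 5, 1, 0, 1) returns 10, and BatchProgress.looksFinished(3) returns false.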
class MessageBatchResult: A class that can be one of several variants (union).
Processing result for this request.
Contains a Message output if processing was successful, an error response if processing failed, or the reason why processing was not attempted, such as cancellation or expiration.
class MessageBatchSucceededResult:
Message message
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> content
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
  {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
  {"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
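Since the response continues directly from a trailing assistant turn, a caller usually has to join the two pieces itself. The following is a minimal sketch under that assumption: PrefillJoin and its methods are hypothetical names, and the letter-parsing rule is specific to the multiple-choice example above.

```java
// Hypothetical helper (not an SDK class): recombines a prefilled assistant turn with the model's continuation.
final class PrefillJoin {
    private PrefillJoin() {}

    /** Joins the prefilled assistant text with the continuation returned in the response. */
    static String fullAssistantText(String prefill, String continuation) {
        return prefill + continuation;
    }

    /** Extracts the answer letter from a continuation such as "B)"; returns '?' if the shape does not match. */
    static char answerLetter(String continuation) {
        if (continuation.length() >= 2
                && Character.isUpperCase(continuation.charAt(0))
                && continuation.charAt(1) == ')') {
            return continuation.charAt(0);
        }
        return '?';
    }
}
```

With the example above, PrefillJoin.fullAssistantText("The best answer is (", "B)") returns "The best answer is (B)", and PrefillJoin.answerLetter("B)") returns 'B'.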
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and a content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous fastest and most cost-effective model
JsonValue; role "assistant"constant"assistant"constant
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
Which service tier the request used: priority, standard, or batch.
JsonValue; type "succeeded"constant"succeeded"constant
class MessageBatchErroredResult:
ErrorResponse error
ErrorObject error
class InvalidRequestError:
JsonValue; type "invalid_request_error"constant"invalid_request_error"constant
class AuthenticationError:
JsonValue; type "authentication_error"constant"authentication_error"constant
class BillingError:
JsonValue; type "billing_error"constant"billing_error"constant
class PermissionError:
JsonValue; type "permission_error"constant"permission_error"constant
class NotFoundError:
JsonValue; type "not_found_error"constant"not_found_error"constant
class RateLimitError:
JsonValue; type "rate_limit_error"constant"rate_limit_error"constant
class GatewayTimeoutError:
JsonValue; type "timeout_error"constant"timeout_error"constant
class ApiErrorObject:
JsonValue; type "api_error"constant"api_error"constant
class OverloadedError:
JsonValue; type "overloaded_error"constant"overloaded_error"constant
JsonValue; type "error"constant"error"constant
JsonValue; type "errored"constant"errored"constant
class MessageBatchCanceledResult:
JsonValue; type "canceled"constant"canceled"constant
class MessageBatchExpiredResult:
JsonValue; type "expired"constant"expired"constant
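The four result variants above are distinguished by their type constant. A caller could branch on that tag as sketched below; BatchResultSummary and describe are hypothetical names, and the returned descriptions simply paraphrase the union's documentation rather than reflect any SDK behavior.

```java
// Hypothetical helper (not an SDK class): summarizes a MessageBatchResult variant by its type tag.
final class BatchResultSummary {
    private BatchResultSummary() {}

    /** Describes what each result variant carries, based on its "type" constant. */
    static String describe(String resultType) {
        switch (resultType) {
            case "succeeded":
                return "message available";
            case "errored":
                return "error response available";
            case "canceled":
                return "not attempted: batch was canceled";
            case "expired":
                return "not attempted: batch expired";
            default:
                throw new IllegalArgumentException("unexpected result type: " + resultType);
        }
    }
}
```

Throwing on an unrecognized tag is a choice made for this sketch; a production caller might prefer to log and skip unknown variants.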
class MessageBatchSucceededResult:
Message message
Unique object identifier.
The format and length of IDs may change over time.
List<ContentBlock> content
Content generated by the model.
This is an array of content blocks, each of which has a type that determines its shape.
Example:
[{"type": "text", "text": "Hi, I'm Claude."}]
If the request input messages ended with an assistant turn, then the response content will continue directly from that last turn. You can use this to constrain the model's output.
For example, if the input messages were:
[
  {"role": "user", "content": "What's the Greek name for Sun? (A) Sol (B) Helios (C) Sun"},
  {"role": "assistant", "content": "The best answer is ("}
]
Then the response content might be:
[{"type": "text", "text": "B)"}]
class TextBlock:
Citations supporting the text block.
The type of citation returned will depend on the type of document being cited. Citing a PDF results in page_location, plain text results in char_location, and a content document results in content_block_location.
class CitationCharLocation:
JsonValue; type "char_location"constant"char_location"constant
class CitationPageLocation:
JsonValue; type "page_location"constant"page_location"constant
class CitationContentBlockLocation:
JsonValue; type "content_block_location"constant"content_block_location"constant
class CitationsWebSearchResultLocation:
JsonValue; type "web_search_result_location"constant"web_search_result_location"constant
class CitationsSearchResultLocation:
JsonValue; type "search_result_location"constant"search_result_location"constant
JsonValue; type "text"constant"text"constant
class ThinkingBlock:
JsonValue; type "thinking"constant"thinking"constant
class RedactedThinkingBlock:
JsonValue; type "redacted_thinking"constant"redacted_thinking"constant
class ToolUseBlock:
JsonValue; type "tool_use"constant"tool_use"constant
class ServerToolUseBlock:
JsonValue; name "web_search"constant"web_search"constant
JsonValue; type "server_tool_use"constant"server_tool_use"constant
class WebSearchToolResultBlock:
WebSearchToolResultBlockContent content
class WebSearchToolResultError:
ErrorCode errorCode
JsonValue; type "web_search_tool_result_error"constant"web_search_tool_result_error"constant
List<WebSearchResultBlock>
JsonValue; type "web_search_result"constant"web_search_result"constant
JsonValue; type "web_search_tool_result"constant"web_search_tool_result"constant
Model model
The model that will complete your prompt.
See models for additional details and options.
High-performance model with early extended thinking
High-performance model with early extended thinking
Fastest and most compact model for near-instant responsiveness
Our fastest model
Hybrid model, capable of near-instant responses and extended thinking
Hybrid model, capable of near-instant responses and extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
High-performance model with extended thinking
Our best model for real-world agents and coding
Our best model for real-world agents and coding
Our most capable model
Our most capable model
Our most capable model
Our most capable model
Excels at writing and complex tasks
Excels at writing and complex tasks
Our previous fastest and most cost-effective model
JsonValue; role "assistant"constant"assistant"constant
Conversational role of the generated message.
This will always be "assistant".
The reason that we stopped.
This may be one of the following values:
"end_turn": the model reached a natural stopping point
"max_tokens": we exceeded the requested max_tokens or the model's maximum
"stop_sequence": one of your provided custom stop_sequences was generated
"tool_use": the model invoked one or more tools
"pause_turn": we paused a long-running turn. You may provide the response back as-is in a subsequent request to let the model continue.
"refusal": when streaming classifiers intervene to handle potential policy violations
In non-streaming mode this value is always non-null. In streaming mode, it is null in the message_start event and non-null otherwise.
Which custom stop sequence was generated, if any.
This value will be a non-null string if one of your custom stop sequences was generated.
JsonValue; type "message"constant"message"constant
Object type.
For Messages, this is always "message".
Usage usage
Billing and rate-limit usage.
Anthropic's API bills and rate-limits by token counts, as tokens represent the underlying cost to our systems.
Under the hood, the API transforms requests into a format suitable for the model. The model's output then goes through a parsing stage before becoming an API response. As a result, the token counts in usage will not match one-to-one with the exact visible content of an API request or response.
For example, output_tokens will be non-zero, even for an empty string response from Claude.
The total number of input tokens in a request is the sum of input_tokens, cache_creation_input_tokens, and cache_read_input_tokens.
Breakdown of cached tokens by TTL
The number of input tokens used to create the 1 hour cache entry.
The number of input tokens used to create the 5 minute cache entry.
The number of input tokens used to create the cache entry.
The number of input tokens read from the cache.
The number of input tokens which were used.
The number of output tokens which were used.
The number of server tool requests.
The number of web search tool requests.
Optional<ServiceTier> serviceTier
Which service tier the request used: priority, standard, or batch.