Classification

When to use Claude for classification

When should you consider using an LLM instead of a traditional ML approach for your classification tasks? Here are some key indicators:

Rule-based classes: Use Claude when classes are defined by conditions rather than examples, as it can understand underlying rules.

Evolving classes: Claude adapts well to new or changing domains with emerging classes and shifting boundaries.

Unstructured inputs: Claude can handle large volumes of unstructured text inputs of varying lengths.

Limited labeled examples: With few-shot learning capabilities, Claude learns accurately from limited labeled training data.

Reasoning Requirements: Claude excels at classification tasks requiring semantic understanding, context, and higher-level reasoning.

Implement Claude for classification

The three key model decision factors are: intelligence, latency, and price.

For classification, a smaller model like Claude Haiku 3 is typically ideal due to its speed and efficiency. Though, for classification tasks where specialized knowledge or complex reasoning is required, Sonnet or Opus may be a better choice. Learn more about how Opus, Sonnet, and Haiku compare in the models overview.

Use evaluations to gauge whether a Claude model is performing well enough to launch into production.

1. Build a strong input prompt

While Claude offers high-level baseline performance out of the box, a strong input prompt helps get the best results.

For a generic classifier that you can adapt to your specific use case, copy the starter prompt below:

2. Develop your test cases

To run your classification evaluation, you will need test cases to run it on. Take a look at the guide to developing test cases.

3. Run your eval

Evaluation metrics

Some success metrics to consider evaluating Claude’s performance on a classification task include:

Criteria	Description
Accuracy	The model's output exactly matches the golden answer or correctly classifies the input according to the task's requirements. This is typically calculated as (Number of correct predictions) / (Overall number of predictions).
F1 Score	The model's output optimally balances precision and recall.
Consistency	The model's output is consistent with its predictions for similar inputs or follows a logical pattern.
Structure	The model's output follows the expected format or structure, making it easy to parse and interpret. For example, many classifiers are expected to output JSON format.
Speed	The model provides a response within the acceptable time limit or latency threshold for the task.
Bias and Fairness	If classifying data about people, is it important that the model does not demonstrate any biases based on gender, ethnicity, or other characteristics that would lead to its misclassification.

Classification

Classification

When to use Claude for classification

Establish your classification use case

Implement Claude for classification

1. Build a strong input prompt

2. Develop your test cases

3. Run your eval

Evaluation metrics

Deploy your classifier

When to use Claude for classification

Establish your classification use case

Tech & IT

Customer Service

Healthcare

Finance

Implement Claude for classification

1. Build a strong input prompt

Starter prompt

2. Develop your test cases

3. Run your eval

Evaluation metrics

Deploy your classifier

Legal