logo

Claude 3.5 Sonnet

Translation
Question answering
Text generation
Zero-shot classification
Summarization
Conversational
Text classification

Claude 3.5 Sonnet sets new industry benchmarks for graduate-level reasoning (GPQA), undergraduate-level knowledge (MMLU), and coding proficiency (HumanEval). It shows marked improvement in grasping nuance, humor, and complex instructions, and is exceptional at writing high-quality content with a natural, relatable tone. Claude 3.5 Sonnet operates at twice the speed of Claude 3 Opus. This performance boost, combined with cost-effective pricing, makes Claude 3.5 Sonnet ideal for complex tasks such as context-sensitive customer support and orchestrating multi-step workflows.


Intended Use

  • Task automation: plan and execute complex actions across APIs and databases, interactive coding

  • R&D: research review, brainstorming and hypothesis generation, drug discovery

  • Strategy: advanced analysis of charts & graphs, financials and market trends, forecasting


Performance

  • Advanced Coding ability: In an internal evaluation by Anthropic, Claude 3.5 Sonnet solved 64% of problems, outperforming Claude 3 Opus which solved 38%. 

  • Multilingual Capabilities: Claude 3.5 Sonnet offers improved fluency in non-English languages such as Spanish and Japanese, enabling use cases like translation services and global content creation.

  • Vision and Image Processing: This model can process and analyze visual input, extracting insights from documents, processing web UI, generating image catalog metadata, and more.

  • Steerability and Ease of Use: Claude 3.5 Sonnet is designed to be easy to steer and better at following directions, giving you more control over model behavior and more predictable, higher-quality outputs.


Limitations

Here are some of the limitations we are aware of:

  • Medical images: Claude 3.5 is not suitable for interpreting specialized medical images like CT scans and shouldn't be used for medical advice.

  • Non-English: Claude 3.5 may not perform optimally when handling images with text of non-Latin alphabets, such as Japanese or Korean.

  • Big text: Users should enlarge text within the image to improve readability for Claude 3.5, but avoid cropping important details.

  • Rotation: Claude 3.5 may misinterpret rotated / upside-down text or images.

  • Visual elements: Claude 3.5 may struggle to understand graphs or text where colors or styles like solid, dashed, or dotted lines vary.

  • Spatial reasoning: Claude 3.5 struggles with tasks requiring precise spatial localization, such as identifying chess positions.

  • Hallucinations: the model can provide factually inaccurate information.

  • Image shape: Claude 3.5 struggles with panoramic and fisheye images.

  • Metadata and resizing: Claude 3.5 doesn't process original file names or metadata, and images are resized before analysis, affecting their original dimensions.

  • Counting: Claude 3.5 may give approximate counts for objects in images.

  • CAPTCHAS: For safety reasons, Claude 3.5 has a system to block the submission of CAPTCHAs.


Citation

https://docs.anthropic.com/claude/docs/models-overview