Anthropic just dropped Claude 3.5 Sonnet with better vision and a sense of humor
Claude 3.5 Sonnet is the latest artificial intelligence model from Anthropic, one of the leading AI labs in the world. The company promises it is faster than its predecessor, has a better understanding of humor and can even read your handwriting.
Claude 3 Opus was already impressive, a model I dubbed the "most human-like" of any of the AI chatbots. I had a quick play with 3.5 Sonnet and it does seem more natural, with a better understanding of sarcasm. Claude is also listed as the best alternative to ChatGPT in my guide to chatbots.
This is the first release of the new generation of Claude 3.5 models, to be followed by updates to the tiny Haiku and the massive Opus. For now, Claude 3.5 Sonnet outperforms Claude 3 Opus on a number of evaluations despite being faster and cheaper.
Anthropic is also launching Artifacts on the Claude website. This is a new feature that will let Claude run a code snippet or even display a website in a sidebar. More modalities and a memory feature are coming soon.
The updated model is already available to free and paid users on the Claude website and iOS app. Pro subscribers will have higher rate limits similar to ChatGPT.
What is new in Claude 3.5 Sonnet?
Claude comes in three tiers of model, similar to Google Gemini. The smallest is Haiku, which is very fast and designed for quick summary tasks. Sonnet is the mid-tier model for most use cases and is available for free. Opus is the massive model, on par with GPT-4.
During benchmark tests, Claude 3.5 Sonnet outperformed Opus and GPT-4o on graduate-level reasoning questions and undergraduate-level knowledge.
The new model comfortably outperformed all leading models on multilingual math. On math problem solving it beat Opus but wasn't as good as GPT-4o.
One of the biggest upgrades was to its vision capabilities, meaning its ability to analyze and understand an image. That could be a graph, chart, photograph or handwriting.
Anthropic says it can accurately transcribe text from even imperfect images, so that's CAPTCHA made redundant then.
"Sonnet now surpasses Claude 3 Opus across all standard vision benchmarks," Anthropic explained. "Improvements are most noticeable in tasks requiring visual reasoning, like interpreting charts, graphs, or transcribing text from imperfect images."
The launch of Artifacts
One of the more surprising updates was the launch of Artifacts. According to Anthropic this is a new feature that expands how users interact with Claude.
If a user asks Claude to generate a piece of code, a text document or even a design for a website, it will show off its work. Currently you get the code and have to implement it yourself to see if it works, but with Artifacts the result will appear in the sidebar. This could include generating vector graphics and even entire games.
The output appears in a dynamic workspace area and can be edited and built on in real time, "integrating AI-generated content into projects and workflows."
Currently a preview, the new feature moves Claude from AI chatbot to productivity tool, following similar moves by OpenAI with ChatGPT.
"In the near future, teams — and eventually entire organizations — will be able to securely centralize their knowledge, documents, and ongoing work in one shared space, with Claude serving as an on-demand teammate," Anthropic says.