Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic released on Monday its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. This latest model also powers a new coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in a more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it eschews this mode and instead focuses on speed. Other models offer their own versions of “thinking” mode, but typically the user has to select that feature for harder problems;  Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans—Free, Pro, Team, and Enterprise–but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers as an API for the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

созданный 2mo | 24 февр. 2025 г., 20:20:05


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

‘Thank you for your attention to this matter!’: Trump’s favorite sign-off has become a viral meme

Thanksgiving may not arrive until November, but you wouldn’t know it from perusing Donald Trump’s social media feeds. He’s been giving thanks quite a lot lately. “

23 апр. 2025 г., 14:50:08 | Fast company - tech
Microsoft says these are the AI terms you need to know

Microsoft released its annual Work Trend Index report on Tuesday, which argued that 2025 is the year that companies stop simply experimenting with AI and start building it into key missions.

23 апр. 2025 г., 14:50:07 | Fast company - tech
Microsoft thinks AI colleagues are coming soon

Artificial intelligence has rapidly started finding its place in the workplace, but this year will be remembered as the moment when companies pushed past simply experimenting with AI and started b

23 апр. 2025 г., 14:50:06 | Fast company - tech
José Andrés on AI, crisis tech, and rethinking the food system

As the founder of World Central Kitchen, renowned chef and humanitarian José Andrés has truly mastered the art of leading through crisis. Andrés shares insights from his new book, Change the R

23 апр. 2025 г., 14:50:04 | Fast company - tech
20 years ago, this simple video rewired the way we share our lives online

The elephant enclosure at your local zoo is an interesting place to be. But until 20 years ago, it was somewhere you’d encounter in person—with reverence and intimacy. A video

23 апр. 2025 г., 12:30:06 | Fast company - tech
‘It is absolutely crushing to my business’: Trump’s tariffs hit Kickstarter campaigns hard

Curt Covert would love for people to play his latest board game—but with sky-high tariffs, he’s not sure anyone ever will.

“At 54%, I had a plan,” Covert tells Fast Company, ref

23 апр. 2025 г., 12:30:05 | Fast company - tech