Anthropic’s new Claude AI model can decide between speed and deep thinking

Anthropic on Monday released its Claude 3.7 Sonnet model, which it says returns results faster and can show the user the “chain of thought” it follows to reach an answer. The new model also powers a coding tool called Claude Code that can perform some development tasks autonomously.

Claude 3.7 Sonnet offers an “extended thinking” mode that engages in more detailed “chain of thought” reasoning but takes longer to generate a response. For simpler questions it skips this mode and focuses on speed. Other models offer their own versions of a “thinking” mode, but typically the user has to select that feature for harder problems; Anthropic says Claude 3.7 Sonnet is the first publicly available model with the capability to choose the best mode based on the user’s question. If Grok 3 and DeepSeek-R1 are stick shifts, then Anthropic’s new model is an automatic.

“Just as humans use a single brain for both quick responses and deep reflection, we believe reasoning should be an integrated capability of frontier models rather than a separate model entirely,” Anthropic says in a blog post.

Claude 3.7 Sonnet outperforms other “thinking” models in some important benchmark tests. On SWE-bench, which evaluates AI models’ ability to solve real-world software issues, the model beat OpenAI’s o1 and o3-mini and DeepSeek-R1 by a comfortable margin. It was the same story on TAU-bench, which tests AI agents on complex real-world tasks with user and tool interactions. However, OpenAI’s o1 model still edges out Claude 3.7 Sonnet in math problem solving, visual reasoning, multilingual Q&A, and graduate-level reasoning benchmarks.

Anthropic describes the Claude Code tool as an active collaborator that can search and read code, edit files, write and run tests, and commit and push code to GitHub. The company says the tool has already become “indispensable” for its own coders, completing tasks in a single pass that would normally take 45 minutes or more of manual work. 

Claude 3.7 Sonnet is now available on all Claude subscription plans (Free, Pro, Team, and Enterprise), but the extended thinking mode isn’t available to users of the free tier. Claude 3.7 Sonnet is also available to developers through the API at the same price as earlier Claude models.

https://www.fastcompany.com/91283751/anthropic-new-claude-3-7-sonnet-ai-chain-of-thought?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 24 Feb 2025, 20:20:05

