Anthropic gives its AI models limited ability to control your computer

Anthropic is giving its new Claude 3.5 Sonnet model the ability to control a user’s computer and access the internet. The move marks a major step in generative AI models’ capabilities—and raises questions about AI companies’ ability to properly mitigate the risks of more autonomous AI.

According to a series of example videos from Anthropic posted Tuesday on X, Claude users might now ask the AI to follow the steps needed to create a personal website. In another example, a user asks Claude to help with the logistics of a trip to watch the sunrise from the Golden Gate bridge. The user describes what they want the model to do by giving it text prompts.

AI companies have been stressing a desire to push large language models to become more “agentic” and autonomous. Doing so means extending the ability of the AI to control not only its own functions but also external devices. 

“Instead of making specific tools to help Claude complete individual tasks, we’re teaching it general computer skills—allowing it to use a wide range of standard tools and software programs designed for people,” Anthropic said in a statement on X.

The new computer control capabilities are being rolled out to developers through an API, as a public beta. Anthropic says it wants to collect feedback on the performance and usefulness of the new capabilities. 

The company acknowledged that Claude 3.5 Sonnet’s current ability to use computers isn’t perfect and will make some mistakes (especially when it comes to scrolling and dragging), but the company expects this to rapidly improve in the coming months.

With greater power comes greater responsibility. Anthropic has some explicit instructions on how to mitigate the risk of giving an AI control over a computer. In the user guide, the company advises avoiding giving Claude access to sensitive data such as user passwords, and to limit the number of websites the AI can access. 

Its fourth point under minimizing risks states: “Ask a human to confirm decisions that may result in meaningful real-world consequences as well as any tasks requiring affirmative consent, such as accepting cookies, executing financial transactions, or agreeing to terms of service.”

Anthropic has taken a first cautious step into more autonomous AI. But the ability to manage some basic tasks on a PC will expand to greater and larger tasks and a wide array of devices, including phones and even home appliances. As this control extends, the extent of the risk increases, too. Autonomous AI could deliver a lot of convenience, but may have the ability to do lots of harm.

Expect other AI companies to begin rolling out similar functionality in the near future as part of a general move toward more agentic AI. 

https://www.fastcompany.com/91214520/anthropic-gives-its-ai-models-limited-ability-to-control-your-computer?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

созданный 4mo | 22 окт. 2024 г., 22:20:11


Войдите, чтобы добавить комментарий

Другие сообщения в этой группе

Elon Musk’s DOGE is draining the life from the once-vaunted U.S. Digital Service

The United States Digital Service (USDS), the storied group of Silicon Valley types brought together by Obama to bring government services into the 21st century, will likely never be the same afte

25 февр. 2025 г., 21:40:06 | Fast company - tech
21 federal workers resign from DOGE, refusing to ‘dismantle critical public services’

More than 20 civil service employees resigned Tuesday from billionaire Trump adviser Elon Musk’s 

25 февр. 2025 г., 21:40:06 | Fast company - tech
Nvidia stock struggles before its first post-DeepSeek earnings: 5 things to watch

It’s no exaggeration to say that Nvidia (Nasdaq:NVDA), to many people, is the most important stock on Wall Street these days. Last year, the com

25 февр. 2025 г., 21:40:05 | Fast company - tech
How Factory is turning AI into ‘a junior developer in a box’

Many things remain uncertain about AI’s future impact on our lives. One that isn’t in doubt is that more and more of the world’s software will be written, at least in part, by software. A

25 февр. 2025 г., 19:20:08 | Fast company - tech
Why Donald Trump and Elon Musk probably aren’t breaking up any time soon

On Monday morning, anonymous hackers played a video on screens throughout the Department of Housing and Urban Development HQ in Washington, D.C. The AI-generated video jankily portrayed President

25 февр. 2025 г., 19:20:08 | Fast company - tech
Chengdu’s Snow Village faces backlash for creating a fake winter wonder

There’s a new entrant in the scam hall of fame.

The Chengdu Snow Village—a newly opened destination in the suburban Chengdu, Sichuan province—advertised a picturesque snow landscap

25 февр. 2025 г., 19:20:07 | Fast company - tech
How LinkedIn became luxury fashion’s newest runway

As Fashion Week takes over New York, London, and Milan, designers aren’t just showcasing their collections on the runway—they’re taking over

25 февр. 2025 г., 17:10:06 | Fast company - tech