OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems. 

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding. 

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says. 

Developers and researchers can access the models within ChatGPT and via an application programming interface. 

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creată 6mo | 12 sept. 2024, 20:30:04


Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

How this sex-forward gay cruising site finally launched an Apple-approved iOS app

As an app designed to facilitate gay hookups, popular site Sniffies has had a limitation since it started in 2018—it was only accessible via web browser. Until Monday, when the map-based cruising

6 mar. 2025, 21:20:06 | Fast company - tech
Why weird JD Vance memes have taken over the internet

Ironically enough, a divisive moment in the Oval Office last weekend seems to have brought the entire internet together. When Ukrainian President Volodymyr Zelenskyy  visited the White House

6 mar. 2025, 21:20:05 | Fast company - tech
TSMC’s $100 billion U.S. commitment could calm Taiwan tensions

Welcome to AI DecodedFast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week 

6 mar. 2025, 19:10:05 | Fast company - tech
Gig Companies are backing Trump’s Labor Secretary nominee. Here’s what that means for workers

The trade association representing America’s largest gig companies is backing President Trump’s nominees to lead the Department of Labor—an endorsement that could shape the future of worker classi

6 mar. 2025, 19:10:03 | Fast company - tech
The Trump administration just cut Defense Department grants that research terrorism and drug trafficking

Researchers in a highly regarded Department of Defense program called the Minerva Research Initiative recently received word that grants already awarded

6 mar. 2025, 14:30:02 | Fast company - tech
YouTube is doubling down on ‘bedtime’ reminders. Do they work?

Teenage YouTube users across the world will now get automatic reminders to go to bed and take a break from their screens. 

YouTube

6 mar. 2025, 12:10:06 | Fast company - tech
How Audiomack became an unlikely Spotify competitor

Kendrick Lamar. Drake. Lady Gaga. The charts of music streaming services pretty much all look the same these days, with familiar names dominating the top spots—except on up-and-coming Spotify comp

6 mar. 2025, 12:10:05 | Fast company - tech