OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems.

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding.

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says.

Developers and researchers can access the models within ChatGPT and via an application programming interface.

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Vytvořeno 6mo | 12. 9. 2024 20:30:04

Chcete-li přidat komentář, přihlaste se

Ostatní příspěvky v této skupině

Back from Extinction: How Colossal Is Charting a New Frontier in Genomics

Featuring Ben Lamm, Founder and CEO, Colossal Biosciences and Joe Manganiello, Actor, Producer. Moderated by Kc Ifeanyi, Executive Director of Ed

10. 3. 2025 1:50:05 | Fast company - tech

iPad shoppers beware: One of the new models is not like the others

This week, Apple updated half of its iPad lineup.

After updating the iPad Pro and iPad mini in 2024, the company has just unveiled a third-generation iPad Air and an eleventh-generation

8. 3. 2025 12:40:08 | Fast company - tech

This secret site lets you try DeepSeek on a trustworthy U.S. server

We need to talk about AI. Have you noticed it often just isn’t—well, very intelligent?

Already, we’ve lived through years of AI hype. We’ve watched companies pitch AI as a great

8. 3. 2025 12:40:07 | Fast company - tech

YouTube is cracking down on gambling content. Here’s what’s changing

YouTube is taking steps to crack down on gambling content.

On Tuesday, the platform announced a new policy t

7. 3. 2025 22:50:03 | Fast company - tech

‘Crypto president’ Trump signs executive order to create Bitcoin reserve

President Donald Trump signed an

7. 3. 2025 18:20:03 | Fast company - tech

Deepfake scammers are hijacking TikTok’s wellness craze to sell dubious health products

By now, most people know not to trust everything they see on TikTok. But scams on the platform are becoming increasingly sophisticated, thanks to

7. 3. 2025 18:20:03 | Fast company - tech

SpaceX’s Starship explodes again, with wreckage seen from Florida

Nearly two months after an explosion sent flaming debris raining down on the Turks and Caicos

7. 3. 2025 15:50:05 | Fast company - tech

Tomas_r2