OpenAI’s new o1 models push AI to PhD-level intelligence

On Thursday, OpenAI introduced OpenAI o1, a new series of large language models that the company says are designed for solving difficult problems and working through complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes,” OpenAI says in a press release. The company says the models perform similarly to PhD students when working on physics, chemistry, and biology problems.

The o1 models scored 83% on a qualifying exam for the International Mathematical Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding. 

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop-down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says.

Developers and researchers can access the models within ChatGPT and via an application programming interface. 

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 8mo ago | 12 Sep 2024, 20:30:04

