OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems.

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding.

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says.

Developers and researchers can access the models within ChatGPT and via an application programming interface.

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 7mo | Sep 12, 2024, 8:30:04 PM

Other posts in this group

Google’s Gemini 2.5 Pro could be the most important AI model so far this year

Google released its new Gemini 2.5 Pro Experimental AI model late last month,

Apr 3, 2025, 10:10:02 PM | Fast company - tech

TikTok Notes is shutting down as Lemon8 steps in

TikTok is shutting down TikTok Notes—wait, you didn’t even know it existed? Well, that explains a lot.

TikTok Notes, the platform’s short-lived attempt to take on Instagram (just as Inst

Apr 3, 2025, 7:40:05 PM | Fast company - tech

Women dominate online influencing. So why are they paid less?

Influencing has a major pay gap, and it’s not what you might expect.

A new report from Collabstr, based on over 15,0

Apr 3, 2025, 7:40:04 PM | Fast company - tech

An OpenAI ‘open’ model shows how much the company—and AI—has changed in two years

Welcome to AI Decoded, Fast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week

Apr 3, 2025, 5:20:11 PM | Fast company - tech

How Elon Musk’s political gambit could tarnish his legacy at Tesla

Tech leaders often brand themselves as “disruptors”—and few fit that label more snugly than Elon Musk. In the three months since joining Donald Trump in the White House following Trump’s election,

Apr 3, 2025, 5:20:10 PM | Fast company - tech

Amazon and OnlyFans founder place bids on TikTok ahead of imminent deadline for a U.S. buyer

As the weekend deadline for TikTok to find a buyer approaches, bidders for the short-video so

Apr 3, 2025, 3:10:03 PM | Fast company - tech

Visa unveils a trio of new tools to make the payments process easier

At Visa’s ETA Transact event on April 3, the payments giant introduced three new products designed to simplify and secure payment acceptance. These innovations—Authorize.net 2.0, Unified Checkout,

Apr 3, 2025, 12:40:06 PM | Fast company - tech

Tomas_r2