OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems. 

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding. 

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says. 

Developers and researchers can access the models within ChatGPT and via an application programming interface. 

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Creată 8mo | 12 sept. 2024, 20:30:04


Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

Amazon’s Grubhub deal is delivering big results

Amazon and Grubhub are entering the second year of a five-year commercial agreement that gives Amazon Prime members access to the food delivery platform’s subscription program at no extra co

22 mai 2025, 19:10:05 | Fast company - tech
Are OpenAI and Jony Ive headed for an iPhone moment?

Welcome to AI DecodedFast Company’s weekly newsletter that breaks down the most important news in the world of AI. You can sign up to receive this newsletter every week 

22 mai 2025, 19:10:04 | Fast company - tech
Crypto investors saw Trump as their champion. Now they’re not so sure

It seems like a triumph for a cryptocurrency industry that has long sought mainstream acceptance: Top investors in one of

22 mai 2025, 16:40:11 | Fast company - tech
Roku is doing more than ever, but focus is still its secret ingredient

It’s easy to forget how big a splash the first Roku box made when it debuted on May 20, 2008. At launch, the device wo

22 mai 2025, 12:10:07 | Fast company - tech
Forget return-to-office. Hybrid now means human plus AI

For the past few years, “hybrid work” has meant splitting time between home and office. And for the most part, people like it—flexibil

22 mai 2025, 12:10:07 | Fast company - tech
Trump’s 4,000 meme-coins-per-plate crypto dinner is an American embarrassment

On Thursday, President Donald Trump will sit down for an intimate evening at his Northern Virginia golf club with 220 of his favorite people in the world: a group of cryptocurrency speculators who

22 mai 2025, 12:10:06 | Fast company - tech
Why (and how) DoorDash and Uber Eats are getting into the restaurant reservations game

A decade ago, the easiest way in the front door at a

22 mai 2025, 09:50:02 | Fast company - tech