OpenAI’s new o1 models push AI to PhD-level intelligence

OpenAI introduced on Thursday OpenAI o1, a new series of large language models the company says are designed for solving difficult problems and working though complex tasks.

The models were trained to take longer to perform tasks than other AI models, thinking through problems in ways a human might. They can “refine their thinking process, try different strategies, and recognize their mistakes, OpenAI says in a press release. The models perform similarly to PhD students when working on physics, chemistry, and biology problems. 

The o1 models scored 83% on a qualifying exam for the International Mathematics Olympiad, OpenAI says, while its earlier GPT-4o model correctly solved only 13% of problems.

OpenAI provided some specific use case examples. The o1 models could be used by healthcare researchers to annotate cell sequencing data, by physicists to generate complicated mathematical formulas needed for quantum optics, and by developers to build and execute multi-step workflows. They also perform well in math and coding. 

Within OpenAI the o1 models were first codenamed “Q*” (pronounced “Q-star”), then “Strawberry.”

OpenAI says it’s taking a slow and cautious approach to releasing the new models. It’s releasing a couple of “early previews” of two of the models in the series. People with ChatGPT Plus or Teams accounts can access “o1-preview” by choosing it in a drop down menu within the chatbot. They can also choose “o1-mini,” which is faster and good at STEM questions, OpenAI says. 

Developers and researchers can access the models within ChatGPT and via an application programming interface. 

OpenAI says the new models won’t initially be able to access the internet. Users won’t be able to upload images or files to the models. OpenAI says it’s beefed up the safety features around the models, and has informed federal authorities about the more capable models.

https://www.fastcompany.com/91189817/openais-new-o1-models-push-ai-to-phd-level-intelligence?partner=rss&utm_source=rss&utm_medium=feed&utm_campaign=rss+fastcompany&utm_content=rss

Created 6mo | Sep 12, 2024, 8:30:04 PM


Login to add comment

Other posts in this group

Niantic unveils Quest app to explore 3D images from around the world

Meta Quest 3 users will now be able to explore detailed 3D scans of sculptures, rock formations, plant life, and other interesting objects from around the world.

The 3D images, which users can vi

Feb 26, 2025, 8:50:03 PM | Fast company - tech
At 10, USB-C still hasn’t lived up to its full potential

Slightly under 10 years ago, when I reviewed a new Apple MacBook, I devoted a surprising percentage of my wordage to its port.

Feb 26, 2025, 1:50:06 PM | Fast company - tech
Venus Williams backs the walking app WeWard

WeWard, an app that offers real-world rewards for walking, announced Wednesday it’s signed tennis champ Venus Williams as an investor and ambassador

Feb 26, 2025, 1:50:05 PM | Fast company - tech
Netflix is building a global audience by empowering Arab creatives

When Netflix reality show Dubai Bling debuted in 2022, it became a global sensation, garnering viewers across 51 countries. And it’s kept up the momentum: The show’s recently

Feb 26, 2025, 11:40:03 AM | Fast company - tech
Microsoft’s Majorana 1 widened the quantum field. But are we any closer to a eureka moment?

Quantum researchers are in a race for qubits, and Microsoft is in the thick of the competition.

Microsoft has spent the last 20 years pursuing a topological approach to quantum developme

Feb 26, 2025, 11:40:02 AM | Fast company - tech