OpenAI's new o3 and o4-mini models are all about 'thinking with images'

A mere two days after announcing GPT-4.1, OpenAI is releasing not one but two new models. The company today announced the public availability of o3 and o4-mini. OpenAI says the former is its most advanced reasoning model yet, showing "strong performance" in coding, math and science tasks. As for o4-mini, OpenAI is billing it as a lower-cost alternative that still delivers "impressive results" across those same fields.

More notably, both models offer novel capabilities not found in OpenAI's past systems. For the first time, the company's reasoning models can use and combine all of the tools available in ChatGPT, including web browsing and image generation. The company says this capability allows o3 and o4-mini to solve challenging, multi-step problems more effectively, and "take real steps toward acting independently."

At the same time, o3 and o4-mini can not just see images, but also interpret and "think" about them in a way that significantly extends their visual processing capabilities. For instance, you can upload images of whiteboards, diagrams or sketches — even poor quality ones — and the new models will understand them. They can also adjust the images as part of how they reason.  

"The combined power of state-of-the-art reasoning with full tool access translates into significantly stronger performance across academic benchmarks and real-world tasks, setting a new standard in both intelligence and usefulness," says OpenAI. 

Separately, OpenAI is releasing a new coding agent (à la Claude Code) named Codex CLI. It's designed to give developers a minimal interface they can use to link OpenAI's models with their local code. Out of the box, it works with o3 and o4-mini, with support for GPT-4.1 on the way. 

Today's announcement comes after OpenAI CEO Sam Altman said the company was changing course on the roadmap he detailed in February. At the time, Altman indicated OpenAI would not release o3, which the company first previewed late last year, as a standalone product. However, at the start of April, he announced a "change of plans," noting OpenAI was moving forward with the release of o3 and o4-mini.  

"There are a bunch of reasons for this, but the most exciting one is that we are going to be able to make GPT-5 much better than we originally thought," he wrote on X. "We also found it harder than we thought it was going to be to smoothly integrate everything. And we want to make sure we have enough capacity to support what we expect to be unprecedented demand."

That means the streamlining Altman promised in February will likely need to wait until at least the release of GPT-5, which he said would arrive sometime in the next "few months." 

In the meantime, ChatGPT Plus, Pro and Team users can begin using o3 and o4-mini starting today. Sometime in the next few weeks, OpenAI will bring online o3-pro, an even more powerful version of its flagship reasoning model, and make it available to Pro subscribers. For the time being, those users can continue to use o1-pro. 

This article originally appeared on Engadget at https://www.engadget.com/ai/openais-new-o3-and-o4-mini-models-are-all-about-thinking-with-images-170043465.html?src=rss
Created 4d ago | 16 Apr 2025, 18:40:16

