Now you can generate images directly from ChatGPT and Sora

OpenAI just announced that all users will soon be able to generate images directly inside ChatGPT. The feature is rolling out to ChatGPT Plus, Pro, Team and, most importantly, Free users. This will be the default image generation tool in GPT-4o, so there will be no need to open DALL-E whenever you want to whip up a picture of a cat in space eating lasagna or whatever. The feature’s also coming to Sora.

The company says that the platform will "generate high-quality images based on your prompt, conversation and uploaded files." To the latter point, it’ll be able to transform pre-existing images based on prompts. OpenAI is also boasting about significant improvements in text rendering and contextual understanding.

These new tools are intended for both personal and professional use. As such, OpenAI gives a number of examples as to where this type of image generation could come in handy. These include the creation of infographics, social media promotional graphics and images with plenty of text, as seen below. 

A generated image with plenty of text.
OpenAI

This being a modern image generation tool, it can also handle high-end visuals. The company says it offers a "strong capability for photorealism, including light, shadow, and texture accuracy." The ability to understand context could also be useful, as OpenAI says the tool could be used to create a "poster of birds found in Central Park" or a "visualization of an art history era discussed previously in the conversation."

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN

Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx

— OpenAI (@OpenAI) May 13, 2024

The image generator is built on GPT-4o, an AI model that was first released last year. The "o" stands for "omni", which is a reference to the model’s multimodal capabilities. This is what enables many of the aforementioned features, like being able to iterate on uploaded files. Today’s news looks like another step on the long road toward the "one AI to rule them all" functionality that Sam Altman teased a few weeks back.

This article originally appeared on Engadget at https://www.engadget.com/ai/now-you-can-generate-images-directly-from-chatgpt-and-sora-180047905.html?src=rss
Created: March 25, 2025, 18:10:21

