Google's new AI video model sucks less at physics

Google may have only recently begun rolling out its Veo generative AI to enterprise customers, but the company is not wasting any time getting a new version of the video tool out to early testers. On Monday, Google announced a preview of Veo 2. According to the company, Veo 2 “understands the language of cinematography.” In practice, that means you can reference a specific genre of film, cinematic effect or lens when prompting the model.

Additionally, Google says the new model has a better understanding of real-world physics and human movement. Correctly modeling humans in motion is something all generative models struggle to do, so the company’s claim that Veo 2 is better on both of those trouble points is notable. Of course, the samples the company provided aren’t enough to know for sure; the true test of Veo 2’s capabilities will come when someone prompts it to generate a video of a gymnast's routine. Oh, and speaking of things video models struggle with, Google says Veo 2 will produce artifacts like extra fingers “less frequently.”

Separately, Google is rolling out improvements to Imagen 3. Of its text-to-image model, the company says the latest version generates brighter and better-composed images. Additionally, it can render more diverse art styles with greater accuracy. At the same time, it also follows prompts more faithfully. Prompt adherence was an issue I highlighted when the company made Imagen 3 available to Google Cloud customers earlier this month, so if nothing else, Google is aware of the areas where its AI models need work.

Veo 2 will gradually roll out to Google Labs users in the US. For now, Google will limit testers to generating up to eight seconds of footage at 720p. For context, OpenAI's Sora can generate up to 20 seconds of 1080p footage, though doing so requires a $200-per-month ChatGPT Pro subscription. As for the latest enhancements to Imagen 3, those are available to Google Labs users in more than 100 countries through ImageFX.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-new-ai-video-model-sucks-less-at-physics-170041204.html?src=rss
Created Dec. 16, 2024, 17:30:23

