Google's new AI video model sucks less at physics

Google may have only recently begun rolling out its Veo generative AI to enterprise customers, but the company is not wasting any time getting a new version of the video tool out to early testers. On Monday, Google announced a preview of Veo 2. According to the company, Veo 2 “understands the language of cinematography.” In practice, that means you can reference a specific genre of film, cinematic effect or lens when prompting the model.

Additionally, Google says the new model has a better understanding of real-world physics and human movement. Correctly modeling humans in motion is something all generative models struggle to do, so the company’s claim that Veo 2 is better on both of those trouble points is notable. Of course, the samples the company provided aren’t enough to know for sure; the true test of Veo 2’s capabilities will come when someone prompts it to generate a video of a gymnast's routine. Oh, and speaking of things video models struggle with, Google says Veo 2 will produce artifacts like extra fingers “less frequently.”

Separately, Google is rolling out improvements to Imagen 3. Of its text-to-image model, the company says the latest version generates brighter and better-composed images. Additionally, it can render more diverse art styles with greater accuracy. It also follows prompts more faithfully. Prompt adherence was an issue I highlighted when the company made Imagen 3 available to Google Cloud customers earlier this month, so if nothing else, Google is aware of the areas where its AI models need work.

Veo 2 will gradually roll out to Google Labs users in the US. For now, Google will limit testers to generating up to eight seconds of footage at 720p. For context, Sora can generate up to 20 seconds of 1080p footage, though doing so requires a $200 per month ChatGPT Pro subscription. As for the latest enhancements to Imagen 3, those are available to Google Labs users in more than 100 countries through ImageFX.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-new-ai-video-model-sucks-less-at-physics-170041204.html?src=rss
Created Dec 16, 2024, 5:30:23 PM

