Google's generative AI video model is available in private preview

Google has begun rolling out private access to its Veo and Imagen 3 generative AI models. Starting today, customers of the company’s Vertex AI Google Cloud package can begin using Veo to generate videos from text prompts and images. Then, as of next week, Google will make Imagen 3, its latest text-to-image framework, available to those same users.

With Veo’s rollout, Google says it’s the first hyperscale cloud provider to offer an image-to-video model. To that point, OpenAI’s Sora model is still only available to select artists, academics and researchers — though that could change quickly with the company teasing 12 days of product demos starting December 5.  

Example footage of Google's Veo video model.

Of Veo, Google says the model creates 1080p footage “that’s consistent and coherent” and can run “beyond a minute.” The tool is also capable of working with both text prompts and images. In the latter case, it’s possible to use either AI-generated or human-made pictures as the starting point for a video.

Looking at the sample footage Google shared, it’s evident Veo, like all AI models, can struggle with cause and effect. For example, in the clip of the roasting marshmallows, the treats don’t yellow and char as they’re exposed to the heat of a campfire flame. Artifacting is also an issue, as is apparent if you look closely at the hands in the concert footage.

Example outputs from Google's Imagen tool
Google

As for Imagen 3, Google says the model generates “the most realistic and highest quality images from simple text prompts, surpassing previous versions of Imagen in detail, lighting, and artifact reduction.” Here again, however, you don’t have to look too closely to see Google has more work to do. 

In the first example of a group of friends sitting on the trunk of a car, the original prompt includes mention of “flash photography,” but the subjects are clearly backlit. One could argue that a flash was used to create intense backlighting, but if the idea behind the prompt was to create something representative of flash photography from the 1960s, this image isn’t it.

Still, Google is keen to get more of its enterprise customers using generative AI. Citing its own research, the tech giant says among companies using generative AI in production, 86 percent report an increase in revenue. However, a recent Appen survey found return on investment from AI projects fell by 4.6 percentage points from 2023 to 2024.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-generative-ai-video-model-is-available-in-private-preview-160055983.html?src=rss https://www.engadget.com/ai/googles-generative-ai-video-model-is-available-in-private-preview-160055983.html?src=rss
Created 5mo | Dec 4, 2024, 6:10:25 PM


Login to add comment

Other posts in this group

How to watch LlamaCon 2025, Meta's first generative AI developer conference

After a couple years of having its open-source Llama AI model be just a part of its Connect conferences, Meta is breaking things out and hosting an entirely generative AI-focused developer conferen

Apr 25, 2025, 10:50:14 PM | Engadget
Boox's new Go 7 E Ink tablets support handwriting with a $46 stylus

Boox, a company that makes E Ink gear ranging from

Apr 25, 2025, 8:30:23 PM | Engadget
“It feels alive”: The Legend of Ochi director on the power of puppets

The Legend of Ochi feels like a film that shouldn't exist today. It's an original story, not an adaptation of an already popular book or comic. It's filled with complex puppetry and practi

Apr 25, 2025, 8:30:22 PM | Engadget
Infinity Nikki is coming to Steam and getting a co-op mode

The fashion-forward adventure Infinity Nikki is finally coming to Steam on April 29, compl

Apr 25, 2025, 8:30:21 PM | Engadget