Google’s new AI tool Whisk uses images as prompts

Google has yet another AI tool to add to the pile. Whisk is a Google Labs image generator that lets you use an existing image as your prompt. But its output only captures your starter image’s “essence” rather than recreating it with new details. So, it’s better for brainstorming and rapid-fire visualizations than edits of the source image.

The company describes Whisk as “a new type of creative tool.” The input screen starts with a bare-bones interface with inputs for style and subject. This simple introductory interface only lets you choose from three predefined styles: sticker, enamel pin and plushie. I suspect Google found those three allowed for the kind of rough-outline outputs the experimental tool is most ideal for in its current form.

As you can see in the image above, it produced a solid image of a Wilford Brimley plushie. (Google’s terms forbid pictures of celebrities, but Wilford slipped through the gates, Quaker Oats in tow, without alerting the guards.)

Whisk also includes a more advanced editor (found by clicking “Start from scratch” from the main screen). In this mode, you can use text or a source image in three categories: subject, scene and style. There’s also an input bar to add more text for finishing touches. However, in its current form, the advanced controls didn’t produce results that looked anything like my queries.

For example, check out my attempt to generate the late Mr. Brimley in a lightbox scene in the style of a walrus plushie image I found online:

Screenshot of an AI generation tool producing images a man who looks a bit like Wilford Brimley.
Google / Screenshot by Will Shanklin for Engadget

Whisk spit out what looks like a vaguely Wilford Brimley-esque actor eating oatmeal inside a lightbox frame. As far as I can tell, that dude is not a plushie. So, it’s clear why Google recommends using the tool more for “rapid visual exploration” and less for production-ready content.

Google acknowledges that Whisk will only draw from “a few key characteristics” of your source image. “For example, the generated subject might have a different height, weight, hairstyle or skin tone,” the company warns.

To understand why, look no further than Google’s description of how Whisk works under the hood. It uses the Gemini language model to write a detailed caption of the source image you upload. It then feeds that description into the Imagen 3 image generator. So, the result is an image based on Gemini’s words about your image — not the source image itself.

Whisk is only available in the US, at least for now. You can try it at the project’s Google Labs site.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-new-ai-tool-whisk-uses-images-as-prompts-210105371.html?src=rss https://www.engadget.com/ai/googles-new-ai-tool-whisk-uses-images-as-prompts-210105371.html?src=rss
Creată 1mo | 16 dec. 2024, 22:10:17


Autentifică-te pentru a adăuga comentarii

Alte posturi din acest grup

A convincing dummy iPhone SE 4 suggests the return of the notch

Calling all iPhone lovers: we might just have a full look at Apple's iPhone SE 4 on our hands. X (formerly Twitter) user Majin Bu shared what Bu claims is

27 ian. 2025, 13:30:13 | Engadget
How to choose the best TV for gaming right now

Most of the time, the best TVs for gaming are the best TVs you can buy, period. That said, there are a few key features to prioritize when picking out a big screen for your PlayStation 5 or Xbox Se

27 ian. 2025, 08:50:28 | Engadget
I think I found the most wholesome game in the Playdate Catalog

I didn’t set out to play jump rope STAR! when I picked up my

26 ian. 2025, 23:40:13 | Engadget
NASA and ESA share a breathtaking Hubble image of the Tarantula Nebula’s outer edge

The Hubble Space Telescope is still trucking along more than 30 years after its launch, observing the universe and sending home images for us to marvel at. This week,

26 ian. 2025, 21:20:15 | Engadget
The 1989 point and click horror game Last Half of Darkness has been remade for 2025

An obscure horror game from the late ‘80s that gained a cult following by way of shareware is coming back from the grave.

26 ian. 2025, 19:10:03 | Engadget
Engadget review recap: All eyes on NVIDIA and Samsung

I don't know if you can believe it, but we're fast approaching the end of January. And I want to kick off the first review recap of 2025 by acknowledging how busy it's already been.

26 ian. 2025, 16:40:13 | Engadget