Google’s new AI tool Whisk uses images as prompts

Google has yet another AI tool to add to the pile. Whisk is a Google Labs image generator that lets you use an existing image as your prompt. But its output only captures your starter image’s “essence” rather than recreating it with new details. So, it’s better for brainstorming and rapid-fire visualizations than edits of the source image.

The company describes Whisk as “a new type of creative tool.” The input screen starts with a bare-bones interface with inputs for style and subject. This simple introductory interface only lets you choose from three predefined styles: sticker, enamel pin and plushie. I suspect Google found those three allowed for the kind of rough-outline outputs the experimental tool is most ideal for in its current form.

As you can see in the image above, it produced a solid image of a Wilford Brimley plushie. (Google’s terms forbid pictures of celebrities, but Wilford slipped through the gates, Quaker Oats in tow, without alerting the guards.)

Whisk also includes a more advanced editor (found by clicking “Start from scratch” from the main screen). In this mode, you can use text or a source image in three categories: subject, scene and style. There’s also an input bar to add more text for finishing touches. However, in its current form, the advanced controls didn’t produce results that looked anything like my queries.

For example, check out my attempt to generate the late Mr. Brimley in a lightbox scene in the style of a walrus plushie image I found online:

Screenshot of an AI generation tool producing images a man who looks a bit like Wilford Brimley.
Google / Screenshot by Will Shanklin for Engadget

Whisk spit out what looks like a vaguely Wilford Brimley-esque actor eating oatmeal inside a lightbox frame. As far as I can tell, that dude is not a plushie. So, it’s clear why Google recommends using the tool more for “rapid visual exploration” and less for production-ready content.

Google acknowledges that Whisk will only draw from “a few key characteristics” of your source image. “For example, the generated subject might have a different height, weight, hairstyle or skin tone,” the company warns.

To understand why, look no further than Google’s description of how Whisk works under the hood. It uses the Gemini language model to write a detailed caption of the source image you upload. It then feeds that description into the Imagen 3 image generator. So, the result is an image based on Gemini’s words about your image — not the source image itself.

Whisk is only available in the US, at least for now. You can try it at the project’s Google Labs site.

This article originally appeared on Engadget at https://www.engadget.com/ai/googles-new-ai-tool-whisk-uses-images-as-prompts-210105371.html?src=rss https://www.engadget.com/ai/googles-new-ai-tool-whisk-uses-images-as-prompts-210105371.html?src=rss
Created 1mo | Dec 16, 2024, 10:10:17 PM


Login to add comment

Other posts in this group

Comcast unveils ultra-low lag Internet connection

Comcast has announced new technology for ultra-low lag Internet on its Xfinity service. According to the company's release, users of select products and software from its partners will experience l

Jan 29, 2025, 4:30:26 PM | Engadget
Pick up a four-pack of Apple AirTags while on sale for $70

Do you constantly lose all of your stuff? No shame, but now might be a great time to invest in a few tracking devices. Luckily, a four-pack of Apple AirTags is on sale right now for

Jan 29, 2025, 4:30:24 PM | Engadget
NordVPN’s NordWhisper protocol can get around VPN blockers

NordVPN is known for developing its own VPN protocol, Nord

Jan 29, 2025, 4:30:23 PM | Engadget
Apple enables support for T-Mobile and Starlink satellite network on the iPhone

The latest update Apple rolled out for the iPhone allows T-Mobile customers — a select few, for now — to be able to send text messages even in locations where they have no coverage.

Jan 29, 2025, 2:20:07 PM | Engadget
China's DeepSeek AI hit by information request from Italy's data protection watchdog

China's DeepSeek AI has already caught the eye of a data protection watchdog, shortly after it went viral and became the

Jan 29, 2025, 2:20:06 PM | Engadget
Get more than $400 off one of our favorite Alienware gaming monitors

Looking to upgrade your gaming rig? Dell is selling one of its most popular Alienware gaming monitors

Jan 28, 2025, 10:10:10 PM | Engadget