DeepMind's latest AI model can help robots fold origami and close Ziploc bags

Since its debut at the end of last year, Gemini 2.0 has gone on to power a handful of Google products, including a new AI Mode chatbot. Now Google DeepMind is using that same technology for something altogether more interesting. On Wednesday, the AI lab announced two new Gemini-based models it says will "lay the foundation for a new generation of helpful robots."

The first, Gemini Robotics, was designed by DeepMind to facilitate direct control of robots. According to the company, AI systems for robots need to excel at three qualities: generality, interactivity and dexterity.

Generality involves a robot's flexibility to adapt to novel situations, including ones not covered by its training. Interactivity, meanwhile, encapsulates a robot's ability to respond to people and the environment. Finally, there's dexterity, which is mostly self-explanatory: a lot of tasks humans can complete without a second thought involve fine motor skills that are difficult for robots to master.

"While our previous work demonstrated progress in these areas, Gemini Robotics represents a substantial step in performance on all three axes, getting us closer to truly general purpose robots," says DeepMind.

For instance, with Gemini Robotics powering it, DeepMind's ALOHA 2 robot is able to fold origami and close a Ziploc bag. The two-armed robot also understands instructions given to it in natural, everyday language. As you can see from the video Google shared, it can even complete tasks despite encountering roadblocks, such as when the researcher moves the Tupperware container he just asked the robot to place the fruit in.

Google is partnering with Apptronik, the company behind the Apollo bipedal robot, to build the next generation of humanoid robots. At the same time, DeepMind is releasing Gemini Robotics-ER (short for embodied reasoning). Of this second model, the company says it will enable roboticists to run their own programs using Gemini's advanced reasoning abilities. DeepMind is giving "trusted testers," including one-time Google subsidiary Boston Dynamics, access to the system.

This article originally appeared on Engadget at https://www.engadget.com/ai/deepminds-latest-ai-model-can-help-robots-fold-origami-and-close-ziploc-bags-151455249.html?src=rss
Created Mar 12, 2025, 5:20:13 PM

