Google DeepMind today announced Gemini Robotics to bring Gemini and “AI into the physical world,” with new models able to “perform a wider range of real-world tasks than ever before.”

In order for AI to be useful and helpful to people in the physical realm, they have to demonstrate “embodied” reasoning — the humanlike ability to comprehend and react to the world around us— as well as safely take action to get things done.

The aim is to build general purpose robots, with CEO Sundar Pichai adding how Google has “always thought of robotics as a helpful testing ground for translating AI advances into the physical world.”

“Gemini Robotics” is a vision-language-action (VLA) model built on Gemini 2.0 “with the addition of physical actions as a new output modality for the purpose of directly controlling robots.” 

Going in, Google has “three principal qualities” for robotic AI models:

Advertisement – scroll for more content

window.adSlotsConfig = window.adSlotsConfig || [];

adSlotsConfig.push( {
slotID: ‘/1049447/Outbrain’,
slotName: ‘div-gpt-ad-outbrain-ad-664922’,
sizes: [300, 250],
slotPosition: ‘mid_article’
} );

Generality: “able to adapt to different situations”

Interactivity: “understand and respond quickly to instructions or changes in their environment”

Dexterity: “can do the kinds of things people generally can do with their hands and fingers, like carefully manipulate objects.”

Google also announced the Gemini Robotics-ER (“embodied reasoning”) vision-language model with enhanced spatial “understanding of the world in ways necessary for robotics, focusing especially on spatial reasoning, and allows roboticists to connect it with their existing low level controllers.” 

 For example, when shown a coffee mug, the model can intuit an appropriate two-finger grasp for picking it up by the handle and a safe trajectory for approaching it.

These models run on various robot form factors (including bi-arm and humanoid robots), with trusted testers like Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools.

FTC: We use income earning auto affiliate links. More.

You’re reading 9to5Google — experts who break news about Google and its surrounding ecosystem, day after day. Be sure to check out our homepage for all the latest news, and follow 9to5Google on Twitter, Facebook, and LinkedIn to stay in the loop. Don’t know where to start? Check out our exclusive stories, reviews, how-tos, and subscribe to our YouTube channel