
Google DeepMind today announced Gemini Robotics to bring Gemini and “AI into the physical world,” with new models able to “perform a wider range of real-world tasks than ever before.”
In order for AI to be useful and helpful to people in the physical realm, they have to demonstrate “embodied” reasoning — the humanlike ability to comprehend and react to the world around us— as well as safely take action to get things done.
The aim is to build general purpose robots, with CEO Sundar Pichai adding how Google has “always thought of robotics as a helpful testing ground for translating AI advances into the physical world.”
“Gemini Robotics” is a vision-language-action (VLA) model built on Gemini 2.0 “with the addition of physical actions as a new output modality for the purpose of directly controlling robots.”
Going in, Google has “three principal qualities” for robotic AI models:
window.adSlotsConfig = window.adSlotsConfig || [];
adSlotsConfig.push( {
slotID: ‘/1049447/Outbrain’,
slotName: ‘div-gpt-ad-outbrain-ad-664922’,
sizes: [300, 250],
slotPosition: ‘mid_article’
} );
Generality: “able to adapt to different situations”
Interactivity: “understand and respond quickly to instructions or changes in their environment”
Dexterity: “can do the kinds of things people generally can do with their hands and fingers, like carefully manipulate objects.”
Google also announced the Gemini Robotics-ER (“embodied reasoning”) vision-language model with enhanced spatial “understanding of the world in ways necessary for robotics, focusing especially on spatial reasoning, and allows roboticists to connect it with their existing low level controllers.”
For example, when shown a coffee mug, the model can intuit an appropriate two-finger grasp for picking it up by the handle and a safe trajectory for approaching it.
These models run on various robot form factors (including bi-arm and humanoid robots), with trusted testers like Agile Robots, Agility Robots, Boston Dynamics, and Enchanted Tools.
FTC: We use income earning auto affiliate links. More.
<hr>
<p><strong>🚨 Disclaimer(Because Lawyers Exist):</strong> This article was scraped, gathered, and possibly abducted from <a href=”[source_url]” target=”_blank”>[source_url]</a>.
Any hot takes, controversial opinions, or mind-blowing insights belong to them, not us.
So if you disagree, kindly direct your complaints to the source—or scream into the void, whichever works.</p>
<p><strong>🤖 AI Shenanigans:</strong> Some parts of this article were optimized, polished, and possibly rewritten by **our AI overlord** to make it more readable, engaging, and SEO-friendly.
So, if it sounds smarter than usual, thank the machine. If it sounds weird… well, also blame the machine.</p>
<p><strong>💸 Affiliate Hustle:</strong> This post may contain affiliate links (Amazon, BestBuy, or some other capitalist empires).
If you buy something through these links, we might make a few bucks—at no extra cost to you!
Consider it a **”digital high-five”** for bringing you this awesome content. <a href=”https://your-affiliate-link.com”>Check out our recommended deals here.</a></p>
<p>🔥 Stay informed, stay entertained, and don’t sue us. Haxx! 🎉</p>