Gemini Robotics
Our advanced Gemini 2.0-based models designed for the next generation of helpful robots
Gemini Robotics brings Gemini’s capacity for multimodal reasoning and world understanding into the physical world - allowing robots of any shape and size to perform a wide range of real-world tasks.
Key capabilities
Gemini models are capable of responding to text, images, audio, and video. Gemini Robotics adds the ability to reason about physical spaces – allowing robots to take action in the real world.
-
Generality
Uses Gemini's world understanding to generalize to novel situations, including dealing with new objects, diverse instructions, and new environments.
-
Interactivity
Understands and responds to everyday commands, and reacts to sudden changes in instructions – or its surroundings. Then carries on without further input.
-
Dexterity
Enables robots to tackle complex tasks requiring fine motor skills and precise manipulation – like folding origami, packing a lunch box, or preparing a salad.
-
Multiple embodiments
Adapts to a diverse array of robot forms, from bi-arm robotic platforms like ALOHA 2 to humanoid robots like Apptronik’s Apollo.
Hands-on
Watch how Gemini Robotics performs different tasks.
Gemini Robotics model family
Models created to support embodied AI capabilities.
Collaborations
We’re partnering with Apptronik to build the next generation of humanoid robots. We’re also working with a select number of trusted testers to guide the future of Gemini Robotics-ER.
Experience Gemini Robotics
If you're interested in testing our models, please share a few details to join the waitlist.