Gemini Robotics
Our advanced Gemini 2.0-based models designed for the next generation of helpful robots
Gemini Robotics brings Gemini’s capacity for multimodal reasoning and world understanding into the physical world, allowing robots of any shape and size to perform a wide range of real-world tasks.
Capabilities
Gemini models are capable of responding to text, images, audio, and video. Gemini Robotics adds the ability to reason about physical spaces – allowing robots to take action in the real world.
- Generality
  Uses Gemini's world understanding to generalize to novel situations, including dealing with new objects, diverse instructions, and new environments.
- Interactivity
  Understands and responds to everyday commands, and reacts to sudden changes in instructions or its surroundings, then carries on without further input.
- Dexterity
  Enables robots to tackle complex tasks requiring fine motor skills and precise manipulation, like folding origami, packing a lunch box, or preparing a salad.
- Multiple embodiments
  Adapts to a diverse array of robot forms, from bi-arm robotic platforms like ALOHA 2 to humanoid robots like Apptronik’s Apollo.
Hands-on
Watch how Gemini Robotics performs different tasks.
Models and tools
Models and tools created to support embodied AI capabilities.
Gemini Robotics SDK
Helping developers easily adapt our Gemini Robotics On-Device model to new tasks and environments.
Collaborations
We’re partnering with Apptronik to build the next generation of humanoid robots. We’re also working with a select number of trusted testers to guide the future of Gemini Robotics-ER.
Experience Gemini Robotics
If you're interested in testing our models, please share a few details to join the waitlist.