Jump to Content

Gemini Robotics

Our advanced Gemini 2.0-based models designed for the next generation of helpful robots

Gemini Robotics brings Gemini’s capacity for multimodal reasoning and world understanding into the physical world - allowing robots of any shape and size to perform a wide range of real-world tasks.

Key capabilities

Gemini models are capable of responding to text, images, audio, and video. Gemini Robotics adds the ability to reason about physical spaces – allowing robots to take action in the real world.

  • Generality

    Uses Gemini's world understanding to generalize to novel situations, including dealing with new objects, diverse instructions, and new environments.

  • Interactivity

    Understands and responds to everyday commands, and reacts to sudden changes in instructions – or its surroundings. Then carries on without further input.

  • Dexterity

    Enables robots to tackle complex tasks requiring fine motor skills and precise manipulation – like folding origami, packing a lunch box, or preparing a salad.

  • Multiple embodiments

    Adapts to a diverse array of robot forms, from bi-arm robotic platforms like ALOHA 2 to humanoid robots like Apptronik’s Apollo.

Hands-on

Watch how Gemini Robotics performs different tasks.

Responsibly advancing AI and robotics

To ensure Gemini Robotics benefits humanity, we’ve taken a comprehensive approach to safety, from practical safeguards to collaborations with experts, policymakers, and our Responsibility and Safety Council.

Learn more

Collaborations

We’re partnering with Apptronik to build the next generation of humanoid robots. We’re also working with a select number of trusted testers to guide the future of Gemini Robotics-ER.

Experience
Gemini Robotics

If you're interested in testing our models, please share a few details to join the waitlist.

Get the latest updates

Sign up for news on the latest innovations from Google DeepMind.