Gemini Robotics-ER 1.6

Our advanced embodied reasoning model—designed to help robots reason about the physical world with unprecedented precision, plan complex tasks, and make logical decisions.

Our Gemini-based multimodal model gives robots advanced world understanding.

Capabilities

Gemini Robotics-ER 1.6 specializes in core robotics capabilities like spatial logic, task planning, and success detection.

It acts as a high-level brain: it breaks down complex tasks, reasons through intermediate steps, and decides when to retry a step or progress to the next one.
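The retry-or-progress behaviour described above can be pictured as a small control loop. The following is a minimal sketch, not the model's actual interface: `run_plan`, `execute_step`, `check_success`, and `max_attempts` are hypothetical names introduced for illustration. In a real system, `check_success` would be the place where a success-detection query to the model goes, and `execute_step` would hand the step to the robot's low-level controller.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class StepResult:
    step: str        # natural-language description of the step
    attempts: int    # how many times the step was executed
    succeeded: bool  # final verdict from the success check


def run_plan(
    steps: List[str],
    execute_step: Callable[[str], None],   # hypothetical: sends the step to the robot
    check_success: Callable[[str], bool],  # hypothetical: asks "did this step succeed?"
    max_attempts: int = 3,
) -> List[StepResult]:
    """Run steps in order, retrying each one until it succeeds or attempts run out."""
    results: List[StepResult] = []
    for step in steps:
        attempts = 0
        succeeded = False
        while attempts < max_attempts and not succeeded:
            attempts += 1
            execute_step(step)
            succeeded = check_success(step)
        results.append(StepResult(step, attempts, succeeded))
        if not succeeded:
            # Do not progress past a step that never succeeded.
            break
    return results
```

The loop only progresses to the next step after a positive success check, and it stops the whole plan once a step exhausts its attempts, so later steps never run on top of a failed one.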

Orchestration

Orchestrates robot activities, like a high-level brain. Excels at planning and making logical decisions within a physical environment. Interacts in natural language, estimates progress, and can natively call tools, such as Google Search, to look up information.

Advanced spatial logic

Uses precision pointing for spatial identification, motion reasoning, and safe object handling under strict physical constraints.
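To act on a pointing result, downstream code typically has to map the returned points onto the camera image. The sketch below assumes, and this page does not state it, that the model replies with a JSON list of objects, each holding a `label` and a `point` given as `[y, x]` normalized to a 0-1000 range. The function name `points_to_pixels` is hypothetical, and the response format should be checked against the model's API documentation before relying on it.

```python
import json
from typing import List, Tuple


def points_to_pixels(
    response_text: str, width: int, height: int
) -> List[Tuple[str, int, int]]:
    """Convert assumed [y, x] points in 0-1000 to (label, x_px, y_px) tuples."""
    pixels: List[Tuple[str, int, int]] = []
    for item in json.loads(response_text):
        y_norm, x_norm = item["point"]  # assumed order: y first, then x
        # Scale to the image size and clamp to the last valid pixel index.
        x_px = min(int(x_norm / 1000 * width), width - 1)
        y_px = min(int(y_norm / 1000 * height), height - 1)
        pixels.append((item["label"], x_px, y_px))
    return pixels


# Example with a hand-written response in the assumed format.
example = '[{"point": [250, 500], "label": "mug"}]'
print(points_to_pixels(example, width=640, height=480))  # [('mug', 320, 120)]
```

Keeping the conversion in one place makes it easy to adjust if the actual coordinate order or range differs from the assumption made here.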

Visual & multi-view reasoning

Understands relationships across multiple camera streams to detect task success, and combines agentic vision with code execution to read complex industrial instruments.