Jump to Content

Introducing

Project Astra

A universal AI agent that is helpful in everyday life

Building on our Gemini models, Project Astra explores the future of AI assistants that can process multimodal information, understand the context you’re in, and respond naturally in conversation.

Building on our Gemini models, Project Astra explores the future of AI assistants that can process multimodal information, understand the context you’re in, and respond naturally in conversation.

This demo shows two continuous takes: one with the prototype running on a Google Pixel phone and another on a prototype glasses device.

Putting Project Astra to the test

Explore more of Project Astra’s capabilities.

All demos were taken in one continuous take, in real-time, on a Google Pixel phone or a prototype glasses device.

Explaining physics drawings

Explaining parts of a race car

Solving maths problems

Recognizing drawings of landmarks

Memorizing a sequence of objects

Interpreting drawings from literature

Under the hood

To be truly useful, an agent needs to understand and respond to the complex and dynamic world just like people do — and take in and remember what it sees and hears to understand context and take action. It also needs to be proactive, teachable and personal, so users can talk to it naturally and without lag or delay.

Continuously encoding video frames

Combining the video and speech input into a timeline of events

Caching information for efficient recall

While we’ve made incredible progress developing AI systems that can understand multimodal information, getting response time down to something conversational is a difficult engineering challenge.

Over the past few years, we've been working to improve how our models perceive, reason and converse to make the pace and quality of interaction feel more natural.

By leveraging our leading speech models, we also enhanced how our AI agents sound giving them a wider range of intonations. These agents can better understand the context they’re being used in, and respond quickly in conversation.

What’s next

With technology like Project Astra, it’s easy to envision a future where people could have an expert AI assistant by their side, through a phone or glasses. And some of these capabilities are coming to Google products, like the Gemini app and web experience, later this year.

Project Astra

A universal AI agent that is helpful in everyday life

Watch the Google I/O keynote

Get the latest updates

Sign up for news on the latest innovations from Google DeepMind.

Explore our other teams and product areas