Company Gemini breaks new ground: a faster model, longer context and AI agents
Company Our next-generation model: Gemini 1.5
Company The next chapter of our Gemini era

The most general and capable AI models we've ever built.

Our most flexible models yet

Each Gemini model is built for its own set of use cases, making a versatile model family that runs efficiently on everything from data centers to on-device.

Project Astra

Project Astra explores the future of AI assistants. Building on our Gemini models, we’ve developed AI agents that can quickly process multimodal information, reason about the context you’re in, and respond to questions at a conversational pace, making interactions feel much more natural.

The demo shows two continuous takes: one with the prototype running on a Google Pixel phone and another on a prototype glasses device.

Natively multimodal

Gemini models are built from the ground up for multimodality, seamlessly combining and understanding text, code, images, audio, and video.

The following content is a descriptive representation of a visual demonstration of Gemini's functionality:

Gemini models can generate code based on different kinds of inputs.

Gemini: I see a murmuration of starlings, so I coded a flocking simulation.

Gemini models can generate text and images, combined.

Could Gemini show me ideas for what to make?

Gemini: How about an octopus with blue and pink tentacles?

Gemini models can understand and perform tasks involving several different written languages.

Could Gemini explain what this means?

Gemini: I see the time signature is 6/8. This means there are 6 eighth notes in each measure. The dynamic marking is piano, which means to play softly. Andante grazioso means to play at a graceful walking pace.

Longer context

Gemini 1.5 Pro and 1.5 Flash both have a default context window of up to one million tokens, the longest context window of any large-scale foundation model. They achieve near-perfect recall on long-context retrieval tasks across modalities, unlocking the ability to process long documents, thousands of lines of code, hours of audio, video, and more. For 1.5 Pro, developers and enterprise customers can also sign up to try a two-million-token context window.
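To make the one-million-token figure more concrete, here is a minimal sketch that estimates whether a text is likely to fit in the context window. It is not part of any Gemini tooling: the four-characters-per-token ratio is a rough assumption rather than the model's real tokenizer, and the function names and the `reserve` output budget are hypothetical.

```python
# ASSUMPTION: roughly 4 characters per token. This is a crude heuristic,
# not the actual Gemini tokenizer; real counts vary by language and content.
ASSUMED_CHARS_PER_TOKEN = 4

# Default context window stated in the text above.
CONTEXT_WINDOW_TOKENS = 1_000_000


def estimate_tokens(text: str) -> int:
    """Estimate the token count from the character length (rounded up)."""
    return -(-len(text) // ASSUMED_CHARS_PER_TOKEN)


def fits_in_context(
    text: str,
    window: int = CONTEXT_WINDOW_TOKENS,
    reserve: int = 8_192,  # hypothetical budget kept free for the reply
) -> bool:
    """Return True if the estimated prompt plus the reserve fits the window."""
    return estimate_tokens(text) + reserve <= window


if __name__ == "__main__":
    document = "word " * 600_000  # about 3 million characters
    print(estimate_tokens(document), fits_in_context(document))
```

For anything beyond a quick estimate, an exact token count from the API itself is the safer basis for a decision.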

Research

Relentless innovation

Our research team is continually exploring new ideas at the frontier of AI, building innovative products that show consistent progress on a range of benchmarks. Our newest model is Gemini 1.5 Flash.

| Capability | Benchmark | Description | Gemini 1.0 Pro | Gemini 1.0 Ultra | Gemini 1.5 Pro (Feb 2024) | Gemini 1.5 Flash |
|---|---|---|---|---|---|---|
| General | MMLU | Representation of questions in 57 subjects (incl. STEM, humanities, and others) | 71.8% | 83.7% | 81.9% | 78.9% |
| Code | Natural2Code | Python code generation. Held-out, HumanEval-like dataset, not leaked on the web | 69.6% | 74.9% | 77.7% | 77.2% |
| Math | MATH | Challenging math problems (incl. algebra, geometry, pre-calculus, and others) | 32.6% | 53.2% | 58.5% | 54.9% |
| Reasoning | GPQA (main) | Challenging dataset of questions written by domain experts in biology, physics, and chemistry | 27.9% | 35.7% | 41.5% | 39.5% |
| Reasoning | Big-Bench Hard | Diverse set of challenging tasks requiring multi-step reasoning | 75.0% | 83.6% | 84.0% | 85.5% |
| Multilingual | WMT23 | Language translation | 71.7 | 74.4 | 75.2 | 74.1 |
| Image | MMMU | Multi-discipline college-level reasoning problems | 47.9% | 59.4% | 58.5% | 56.1% |
| Image | MathVista | Mathematical reasoning in visual contexts | 45.2% | 53.0% | 52.1% | 54.3% |
| Audio | FLEURS (55 languages) | Automatic speech recognition (based on word error rate, lower is better) | 6.4 | 6.0 | 6.6 | 9.8 |
| Video | EgoSchema | Video question answering | 55.7% | 61.5% | 63.2% | 63.5% |
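The FLEURS results above are based on word error rate, where lower is better. As an illustration of that metric only, the sketch below computes the standard word-level edit distance (substitutions, insertions, and deletions) divided by the number of reference words. It is not the benchmark's evaluation code: the function name and the whitespace tokenization are assumptions, and real evaluations usually normalize text first. The benchmark figures are presumably this ratio expressed as a percentage.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words processed
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # deleting all i reference words matches an empty hypothesis
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1
            insertion = curr[j - 1] + 1
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One dropped word out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```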

Technical reports

February 2024: "Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context"
December 2023: "Gemini 1.0: A family of highly capable multimodal models"

For developers

Build with Gemini

Integrate Gemini models into your applications with Google AI Studio and Google Cloud Vertex AI.
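As a rough idea of what an integration can look like, the sketch below builds a text-only request for the Gemini API's REST `generateContent` method using only the Python standard library. Treat the details as assumptions to check against the current API reference: the `v1beta` base URL, the `gemini-1.5-flash` model identifier, and the `x-goog-api-key` header reflect the API as I understand it, and the helper names `build_generate_request` and `send_request` are hypothetical.

```python
import json
import urllib.request

# ASSUMPTION: base URL and API version; verify against the current API reference.
API_BASE = "https://generativelanguage.googleapis.com/v1beta"


def build_generate_request(prompt: str, model: str = "gemini-1.5-flash"):
    """Return the (url, json_body) pair for a single text-only prompt."""
    url = f"{API_BASE}/models/{model}:generateContent"
    body = {"contents": [{"parts": [{"text": prompt}]}]}
    return url, json.dumps(body)


def send_request(url: str, body: str, api_key: str) -> dict:
    """POST the request and return the parsed JSON response (needs network)."""
    request = urllib.request.Request(
        url,
        data=body.encode("utf-8"),
        headers={"Content-Type": "application/json", "x-goog-api-key": api_key},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


if __name__ == "__main__":
    url, body = build_generate_request("How do I best care for this plant?")
    print(url)
    print(body)  # call send_request(url, body, api_key) with a real key to run it
```

Keeping request construction separate from sending makes the payload easy to inspect and test without a network connection or an API key. Official client libraries are likely a better fit for production use.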

Get started

Example prompts for the Gemini API in Google AI Studio.

Research Assistant: Understand the key attributes of a research paper’s methodology.
Plant care: How do I best care for this plant?
Which shape comes next? Given a series of shapes, guess which shape comes next.

Responsibility at the core

Our models undergo extensive ethics and safety tests, including adversarial testing for bias and toxicity.

Hands-on

Serving billions of Google users

Gemini models are embedded in a range of Google experiences.

Gemini app: Chat to supercharge your ideas
Workspace: Boost productivity and creativity with Gemini in Gmail, Docs, Sheets, and more
Ads: Gemini models are powering products to help businesses deliver growth and performance
Pixel: Pixel 8 Pro is the first smartphone with Gemini Nano, Google’s most efficient AI model built for on-device tasks
Cloud: Innovate faster with enterprise-ready AI, enhanced by Gemini models
Android: Boost your productivity with Gemini in Android Studio