Imagine almost anything, then create it. Generate detailed, playful, realistic, or whimsical images, or anything in between.

The Gemini Image model uses deep language understanding to capture the nuance of your prompts — bridging the gap between what you say and what you envision.

Capabilities

Multimodal understanding

Upload images and share text instructions with Gemini to create complex and detailed images.

Conversational inputs

Use everyday language while creating images, and keep the conversation going to refine what the model generates.

Real-world knowledge

Generate images that follow real-world logic, thanks to Gemini’s advanced reasoning capabilities.


Model family

Gemini Image models are natively multimodal and respond effectively and efficiently to even the most detailed prompts.

3 Pro 🍌

Create and edit images with Nano Banana Pro for studio-quality precision and control.

