ShieldGemma 2
A suite of safety content classifier models built on Gemma 2 and designed to detect harmful content in AI models’ text inputs and outputs
Download ShieldGemma 2
Instruction-tuned models for evaluating the safety of text and images against pre-defined safety policies. Helps evaluate and prevent generative AI applications from violating safety policies.
Watch
Model versions
-
ShieldGemma 1
Built on Gemma 2 and available in 2B, 9B, and 27B parameter sizes.
-
ShieldGemma 2
A 4B parameter image safety model built on Gemma 3.
Capabilities
-
Content safety evaluation
Evaluate the safety of prompt input and output responses against a set of defined safety policies.
-
Tuneable, open models
ShieldGemma models are provided with open weights and can be fine-tuned for your specific use case.
Watch