Gemma Scope

A set of interpretability tools built to help researchers understand the inner workings of Gemma models.

Examine individual model layers to help address critical concerns, including hallucinations, biases, and manipulation.

Sparse autoencoders (SAEs) act as microscopes to inspect layer-specific representations and help pinpoint the source of issues.

With Gemma Scope 2, researchers can use transcoders to analyze complex, multi-step behaviors, from diagnosing jailbreaks and refusal mechanisms to verifying the faithfulness of chain-of-thought reasoning.

Gemma Scope 2

Gemma Scope 2 includes SAEs and transcoders trained on every layer of Gemma 3. State-of-the-art Matryoshka training helps SAEs detect more useful concepts and resolve flaws, while skip-transcoders and cross-layer transcoders make it easier to decipher multi-step computations and algorithms throughout the model.
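As a rough illustration of the skip-transcoder idea, the sketch below predicts an MLP layer's output from its input through two paths: a sparse feature bottleneck and a plain linear skip connection. This is a minimal toy under stated assumptions, not the Gemma Scope 2 implementation: the function name `skip_transcoder`, the helper `random_matrix`, the ReLU activation, the dimensions, and the random weights are all hypothetical stand-ins for trained parameters.

```python
import random


def matvec(matrix, vec):
    """Multiply a row-major matrix (a list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def random_matrix(rows, cols, rng):
    """Hypothetical helper: random weights standing in for trained ones."""
    return [[rng.gauss(0, 1) for _ in range(cols)] for _ in range(rows)]


def skip_transcoder(mlp_input, w_enc, b_enc, w_dec, b_dec, w_skip):
    """Approximate an MLP's output from its input (toy formulation).

    Sparse path: encode the input into non-negative features, then decode.
    Skip path:   a linear map applied directly to the input.
    """
    pre = [p + b for p, b in zip(matvec(w_enc, mlp_input), b_enc)]
    features = [max(0.0, p) for p in pre]  # ReLU is an assumption here
    sparse_path = matvec(w_dec, features)
    skip_path = matvec(w_skip, mlp_input)
    predicted = [s + k + b for s, k, b in zip(sparse_path, skip_path, b_dec)]
    return features, predicted


# Toy dimensions: 3-dimensional MLP input/output, 8 transcoder features.
rng = random.Random(0)
d_model, d_features = 3, 8
features, predicted = skip_transcoder(
    mlp_input=[0.2, -0.7, 1.1],
    w_enc=random_matrix(d_features, d_model, rng),
    b_enc=[-1.0] * d_features,  # negative bias keeps many features at zero
    w_dec=random_matrix(d_model, d_features, rng),
    b_dec=[0.0] * d_model,
    w_skip=random_matrix(d_model, d_model, rng),
)
print("active features:", [i for i, f in enumerate(features) if f > 0])
```

Splitting the prediction this way lets the skip path absorb the roughly linear part of the layer, so the sparse features only need to account for what remains.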

Explore Gemma Scope 2

Discover, analyze, and steer Gemma 3 features in an interactive demo on Neuronpedia.

Gemma Scope

Gemma Scope enables layer-level analysis of how Gemma 2 models behave. SAEs zoom in on dense, compressed activations and expand them into larger, sparser, more interpretable forms.
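The expansion described above can be sketched in a few lines. The example below is a toy, assuming a plain ReLU encoder and randomly initialised weights; the class name `ToySAE`, the dimensions, and the bias values are hypothetical and do not reflect the released Gemma Scope SAEs or their training.

```python
import random


def matvec(matrix, vec):
    """Multiply a row-major matrix (a list of rows) by a vector."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


class ToySAE:
    """Toy sparse autoencoder: d_model dense inputs -> d_sae features."""

    def __init__(self, d_model, d_sae, seed=0):
        rng = random.Random(seed)
        # Random weights stand in for trained parameters (assumption).
        self.w_enc = [[rng.gauss(0, 1) for _ in range(d_model)]
                      for _ in range(d_sae)]
        self.b_enc = [-1.0] * d_sae  # negative bias pushes features to zero
        self.w_dec = [[rng.gauss(0, 1) for _ in range(d_sae)]
                      for _ in range(d_model)]
        self.b_dec = [0.0] * d_model

    def encode(self, activation):
        """Expand a dense activation into a wider, non-negative vector."""
        pre = [p + b for p, b in zip(matvec(self.w_enc, activation), self.b_enc)]
        return [max(0.0, p) for p in pre]

    def decode(self, features):
        """Map the features back to the original activation space."""
        return [r + b for r, b in zip(matvec(self.w_dec, features), self.b_dec)]


sae = ToySAE(d_model=4, d_sae=16)
activation = [0.5, -1.2, 0.3, 0.8]       # stand-in for a layer activation
features = sae.encode(activation)        # 16 values, many of them zero
reconstruction = sae.decode(features)    # back to 4 values
active = [i for i, f in enumerate(features) if f > 0]
print(f"{len(active)} of {len(features)} features active:", active)
```

With untrained weights the reconstruction is not meaningful; in a trained SAE the decoder is optimised so that the few active features reconstruct the original activation closely, which is what makes each feature a candidate for interpretation.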

Explore Gemma Scope

Discover, analyze, and steer Gemma 2 features in an interactive demo on Neuronpedia.
