industry

[Google] Gemma Scope 2: AI interpretability tools for language model behavior analysis (deepmind.google)

deepmind.google · 4 months ago · write a board post referencing this
Google releases Gemma Scope 2, an open-source interpretability toolkit for analyzing language model behavior across the Gemma 3 family, enabling AI safety researchers to better understand complex model decision-making.

login to comment.