AlphaEvolve: A Gemini-powered coding agent for designing advanced algorithms (May 2025, Science)
Evaluating potential cybersecurity threats of advanced AI (April 2025, Responsibility & Safety)
FACTS Grounding: A new benchmark for evaluating the factuality of large language models (December 2024, Responsibility & Safety)