-
Research
Data, Architecture, or Losses: What Contributes Most to Multimodal Transformer Success?
In this work, we examine what aspects of multimodal transformers – attention, losses, and pretraining data – are important in their success at multimodal pretraining. We find that Multimodal...
-
Research
MuZero: Mastering Go, chess, shogi and Atari without rules
In 2016, we introduced AlphaGo, the first artificial intelligence (AI) program to defeat humans at the ancient game of Go. Two years later, its successor - AlphaZero - learned from scratch to...
-
Research
Imitating Interactive Intelligence
We first create a simulated environment, the Playroom, in which virtual robots can engage in a variety of interesting interactions by moving around, manipulating objects, and speaking to each...
-
Research
Using JAX to accelerate our research
DeepMind engineers accelerate our research by building tools, scaling up algorithms, and creating challenging virtual and physical worlds for training and testing artificial intelligence (AI)...
-
Research
Using Unity to Help Solve Intelligence
We present our use of Unity, a widely recognised and comprehensive game engine, to create more diverse, complex, virtual simulations. We describe the concepts and components developed to simplify...
-
Research
Fast reinforcement learning through the composition of behaviours
Imagine if you had to learn how to chop, peel and stir all over again every time you wanted to learn a new recipe. In many machine learning systems, agents often have to learn entirely from...
-
Research
Computational predictions of protein structures associated with COVID-19
The scientific community has galvanised in response to the recent COVID-19 outbreak, building on decades of basic research characterising this virus family. Labs at the forefront of the outbreak...
-
Research
RL Unplugged: Benchmarks for Offline Reinforcement Learning
We propose a benchmark called RL Unplugged to evaluate and compare offline RL methods. RL Unplugged includes data from a diverse range of domains including games (e.g., Atari benchmark) and...
-
Research
dm_control: Software and Tasks for Continuous Control
The dm_control software package is a collection of Python libraries and task suites for reinforcement learning agents in an articulated-body simulation. A MuJoCo wrapper provides convenient...
-
Research
Acme: A new framework for distributed reinforcement learning
Acme is a framework for building readable, efficient, research-oriented RL algorithms. At its core Acme is designed to enable simple descriptions of RL agents that can be run at various scales of...
-
Research
Using AI to predict retinal disease progression
Vision loss among the elderly is a major healthcare issue: about one in three people have some vision-reducing disease by the age of 65. Age-related macular degeneration (AMD) is the most common...
-
Research
Simple Sensor Intentions for Exploration
In this paper we focus on a setting in which goal tasks are defined via simple sparse rewards, and exploration is facilitated via agent-internal auxiliary tasks. We introduce the idea of simple...