Research
Publications
-
- Date
- Title
- Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces
- Authors
- Zhou Fan, Xinran Han, Zi Wang
- Venue
- Transactions on Machine Learning Research (TMLR)
-
- Date
- Title
- Fractal Patterns May Unravel the Intelligence in Next-Token Prediction
- Authors
- Ibrahim Alabdulmohsin, Vinh Q. Tran, Mostafa Dehghani
- Venue
- arXiv
-
- Date
- Title
- Robust agents learn causal world models
- Authors
- Jonathan Richens, Tom Everitt
- Venue
- ICLR 2024
-
- Date
- Title
- Exploration at Scale using Epistemic Neural Networks
- Authors
- Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy
- Venue
- arXiv
-
- Date
- Title
- Learning Universal Predictors
- Authors
- Jordi Grau-Moya *, Tim Genewein *, Marcus Hutter *, Laurent Orseau *, Grégoire Déletang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison and Joel Veness
- Venue
- arXiv
-
- Date
- Title
- Neural Population Learning beyond Symmetric Zero-Sum Games
- Authors
- Siqi Liu, Luke Marris, Marc Lanctot, Georgios Piliouras, Joel Leibo, Nicolas Heess
- Venue
- AAMAS 2024
-
- Date
- Title
- Asynchronous Local-SGD Training forLanguage Modeling
- Authors
- Bo Liu*, Arthur Douillard, Rachita Chhaparia, Jiajun Shen, Andrei Rusu, Arthur Szlam, Marc'aurelio Ranzato, Satyen Kale
- Venue
- arXiv
-
- Date
- Title
- E3x: E(3)-Equivariant Deep Learning Made Easy
- Authors
- Oliver Unke, Hartmut Maennel
- Venue
- arXiv
-
- Date
- Title
- Directly Fine-Tuning Diffusion Models on Differentiable Rewards
- Authors
- Kevin Clark*, Paul Vicol*, Kevin Swersky, David J. Fleet
- Venue
- ICLR 2024
-
- Date
- Title
- GATS: Gather-Attend-Scatter
- Authors
- Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez
- Venue
- arXiv
-
- Date
- Title
- NfgTransformer: Equivariant Representation Learning for Normal-form Games
- Authors
- Siqi Liu, Luke Marris, Ian Gemp, Georgios Piliouras, Nicolas Heess
- Venue
- ICLR 2024
-
- Date
- Title
- Generative Adversarial Equilibrium Solvers
- Authors
- Denizalp Goktas, David C. Parkes, Ian Gemp, Luke Marris, Georgios Piliouras, Romuald Elie, Guy Lever, Andrea Tacchetti
- Venue
- ICLR 2024
-
- Date
- Title
- Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization
- Authors
- Ian Gemp, Luke Marris, Georgios Piliouras
- Venue
- ICLR 2024
-
- Date
- Title
- On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
- Authors
- Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos, mfgeist , Olivier Bachem
- Venue
- ICLR 2024
-
- Date
- Title
- Learning Planning-compatible Cognitive Maps with Transformers in PartiallyObserved Environments
- Authors
- Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel Lázaro-Gredilla
- Venue
- arXiv
-
- Date
- Title
- Distributional reinforcement learning in prefrontal cortex
- Authors
- Timothy Muller*, James Butler*, Sebastijan Veselic*, Bruno Miranda*, Timothy Behrens*, Zeb Kurth-Nelson, Steve Kennerley*
- Venue
- Nature Neuroscience
-
- Date
- Title
- AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
- Authors
- Alex Irpan, Keerthana Gopalakrishnan, Sergey Levine, Ted Xiao, Peng Xu, Ryan Julian, Sean Kirmani, Debidatta Dwibedi, Karol Hausman, Dorsa Sadigh, ichter , yaolug , Stefan Welker, Pannag Sanketi, Kanishka Rao, Edward Lee, Fei Xia, Isabel Leal, Pierre Sermanet, Nikhil Joshi, Zhuo Xu, Quan Vuong, Michael Ahn, chelseaf , Montse Gonzalez Arenas, Steve Xu, Sharath Maddineni
- Venue
- arXiv
-
- Date
- Title
- GenCast: learning skillful ensemble forecasting of medium-range weather
- Authors
- Ilan Price, Matthew Willson, Alvaro Sanchez, Peter Battaglia, Remi Lam, Ferran Alet, Jacklynn Stott, Timo Ewalds, Shakir Mohamed
- Venue
- arXiv
-
- Date
- Title
- Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
- Authors
- Saurabh Saxena, Junhwa Hur, Charles Herrmann, Deqing Sun, David Fleet
- Venue
- arXiv
-
- Date
- Title
- Equivariant MuZero
- Authors
- Andreea Deac*, Theophane Weber, George Papamakarios
- Venue
- Transactions on Machine Learning Research (TMLR)