Research
Publications
-
- Date
- Title
- On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes
- Authors
- Rishabh Agarwal, Nino Vieillard, Yongchao Zhou, Piotr Stanczyk, Sabela Ramos, mfgeist , Olivier Bachem
- Venue
- ICLR 2024
-
- Date
- Title
- GATS: Gather-Attend-Scatter
- Authors
- Konrad Zolna, Serkan Cabi, Yutian Chen, Eric Lau, Claudio Fantacci, Jurgis Pasukonis, Jost Tobias Springenberg, Sergio Gomez
- Venue
- arXiv
-
- Date
- Title
- Directly Fine-Tuning Diffusion Models on Differentiable Rewards
- Authors
- Kevin Clark*, Paul Vicol*, Kevin Swersky, David J. Fleet
- Venue
- ICLR 2024
-
- Date
- Title
- NfgTransformer: Equivariant Representation Learning for Normal-form Games
- Authors
- Siqi Liu, Luke Marris, Ian Gemp, Georgios Piliouras, Nicolas Heess
- Venue
- ICLR 2024
-
- Date
- Title
- Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization
- Authors
- Ian Gemp, Luke Marris, Georgios Piliouras
- Venue
- ICLR 2024
-
- Date
- Title
- Learning Planning-compatible Cognitive Maps with Transformers in PartiallyObserved Environments
- Authors
- Antoine Dedieu, Wolfgang Lehrach, Guangyao Zhou, Dileep George, Miguel Lázaro-Gredilla
- Venue
- arXiv
-
- Date
- Title
- Distributional reinforcement learning in prefrontal cortex
- Authors
- Timothy Muller*, James Butler*, Sebastijan Veselic*, Bruno Miranda*, Timothy Behrens*, Zeb Kurth-Nelson, Steve Kennerley*
- Venue
- Nature Neuroscience
-
- Date
- Title
- AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents
- Authors
- Alex Irpan, Keerthana Gopalakrishnan, Sergey Levine, Ted Xiao, Peng Xu, Ryan Julian, Sean Kirmani, Debidatta Dwibedi, Karol Hausman, Dorsa Sadigh, ichter , yaolug , Stefan Welker, Pannag Sanketi, Kanishka Rao, Edward Lee, Fei Xia, Isabel Leal, Pierre Sermanet, Nikhil Joshi, Zhuo Xu, Quan Vuong, Michael Ahn, chelseaf , Montse Gonzalez Arenas, Steve Xu, Sharath Maddineni
- Venue
- arXiv
-
- Date
- Title
- GenCast: learning skillful ensemble forecasting of medium-range weather
- Authors
- Ilan Price, Matthew Willson, Alvaro Sanchez, Peter Battaglia, Remi Lam, Ferran Alet, Jacklynn Stott, Timo Ewalds, Shakir Mohamed
- Venue
- arXiv
-
- Date
- Title
- Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
- Authors
- Saurabh Saxena, Junhwa Hur, Charles Herrmann, Deqing Sun, David Fleet
- Venue
- arXiv
-
- Date
- Title
- Equivariant MuZero
- Authors
- Andreea Deac*, Theophane Weber, George Papamakarios
- Venue
- Transactions on Machine Learning Research (TMLR)
-
- Date
- Title
- Challenges with unsupervised LLM knowledge discovery
- Authors
- Sebastian Farquhar, Vikrant Varma, Zachary Kenton, Johannes Gasteiger, Vladimir Mikulik, Rohin Shah
- Venue
- arXiv
-
- Date
- Title
- Learning Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy
- Authors
- Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron Courville, Marc G. Bellemare, Sergei Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore
- Venue
- NeurIPS Workshop 2023
-
- Date
- Title
- Meta-in-context learning in large language models
- Authors
- Julian Coda-Forno, Marcel Binz, Zeynep Akata, Matthew Botvinick, Jane X. Wang, Eric Schulz
- Venue
- NeurIPS 2023
-
- Date
- Title
- A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames
- Authors
- Nelly Papalampidi, Skanda Koppula, Shreya Pathak, Justin Chiu, Viorica Patraucean, Joe Heyward, Jiajun Shen, Antoine Miech, Andrew Zisserman, Aida Nematzadeh
- Venue
- CVPR 2024
-
- Date
- Title
- Schema-learning and rebinding as mechanisms of in-context learning and emergence
- Authors
- Sivaramakrishnan Swaminathan, Antoine Dedieu, Rajkumar Vasudeva Raju, Murray Shanahan, Miguel Lázaro-Gredilla, Dileep George
- Venue
- NeurIPS 2023
-
- Date
- Title
- Online RL in Linearly $q^\pi$-Realizable MDPs Is as Easy as in Linear MDPs If You Learn What to Ignore
- Authors
- Gellért Weisz, Andras Gyorgy, Csaba Szepesvari
- Venue
- NeurIPS 2023
-
- Date
- Title
- A Definition of Continual Reinforcement Learning
- Authors
- David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh
- Venue
- NeurIPS 2023
-
- Date
- Title
- Rethinking the Role of Token Retrieval in Multi-Vector Retrieval
- Authors
- Jinhyuk Lee, Zhuyun Dai, Sai Meher Karthik Duddu, Tao Lei, Iftekhar Naim, Ming-Wei Chang, Vincent Zhao
- Venue
- NeurIPS 2023
-
- Date
- Title
- Feature Likelihood Divergence: Evaluating the Generalization of Generative Models Using Samples
- Authors
- Marco Jiralerspong, Joey Bose, Ian Gemp, Chongli Qin, Yoram Bachrach, Gauthier Gidel
- Venue
- NeurIPS 2023