Research
Publications
-
- Date
- Title
- A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
- Authors
- Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John Canny, Ian Fischer
- Venue
- arXiv
-
- Date
- Title
- Experts Don't Cheat: Learning What You Don't Know by Predicting Pairs
- Authors
- Daniel D. Johnson, Daniel Tarlow, David Duvenaud, Chris J. Maddison
- Venue
- arXiv
-
- Date
- Title
- Premise Order Matters in Reasoning with Large Language Models
- Authors
- Xinyun Chen, Ryan A. Chi, Xuezhi Wang, Denny Zhou
- Venue
- arXiv
-
- Date
- Title
- A Distributional Analogue to the Successor Representation
- Authors
- Harley Wiltzer*, Jesse Farebrother, Arthur Gretton, Yunhao Tang, Andre Barreto, Will Dabney, Marc Bellemare*, Mark Rowland
- Venue
- arXiv
-
- Date
- Title
- Near-Minimax-Optimal Distributional RL with a Generative Model
- Authors
- Mark Rowland, Kevin Li, Remi Munos, Clare Lyle, Yunhao Tang, Will Dabney
- Venue
- arXiv
-
- Date
- Title
- PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
- Authors
- Soroush Nasiriany*, Fei Xia*, Wenhao Yu, Ted Xiao*, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess , Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee , Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter*
- Venue
- arXiv
-
- Date
- Title
- Chain-of-Table: Evolves Tables in the LLM Reasoning Chain for Table Understanding
- Authors
- Zilong Wang*, Chen-Yu Lee, chunliang , Hao Zhang, Julian Eisenschlos, Lesly Miculicich, Tomas Pfister, Vincent Perot, Yasuhisa Fujii, Zifeng Wang, Jingbo Shang*
- Venue
- ICLR 2024
-
- Date
- Title
- Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
- Authors
- Nicolas Nguyen, Imad Aouali, András György, Claire Vernade
- Venue
- arXiv
-
- Date
- Title
- Memory Consolidation Enables Long-Context Video Understanding
- Authors
- Ivana Balazevic, Jimmy Shi, Nelly Papalampidi, Rahma Chaabouni, Skanda Koppula, Olivier Henaff
- Venue
- arXiv
-
- Date
- Title
- Large Language Models Self-Discover Reasoning Structures
- Authors
- Pei Zhou*, Jay Pujara*, Xiang Ren*, Swaroop Mishra, Steven Zheng, Denny Zhou, Heng-Tze Cheng, Quoc Le, Ed Chi, Xinyun Chen
- Venue
- arXiv
-
- Date
- Title
- States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
- Authors
- Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls
- Venue
- arXiv
-
- Date
- Title
- Transfer Learning for Bayesian Optimization on Heterogeneous Search Spaces
- Authors
- Zhou Fan, Xinran Han, Zi Wang
- Venue
- Transactions on Machine Learning Research (TMLR)
-
- Date
- Title
- Fractal Patterns May Unravel the Intelligence in Next-Token Prediction
- Authors
- Ibrahim Alabdulmohsin, Vinh Q. Tran, Mostafa Dehghani
- Venue
- arXiv
-
- Date
- Title
- Exploration at Scale using Epistemic Neural Networks
- Authors
- Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy
- Venue
- arXiv
-
- Date
- Title
- Robust agents learn causal world models
- Authors
- Jonathan Richens, Tom Everitt
- Venue
- ICLR 2024
-
- Date
- Title
- Learning Universal Predictors
- Authors
- Jordi Grau-Moya *, Tim Genewein *, Marcus Hutter *, Laurent Orseau *, Grégoire Déletang, Elliot Catt, Anian Ruoss, Li Kevin Wenliang, Christopher Mattern, Matthew Aitchison and Joel Veness
- Venue
- arXiv
-
- Date
- Title
- Neural Population Learning beyond Symmetric Zero-Sum Games
- Authors
- Siqi Liu, Luke Marris, Marc Lanctot, Georgios Piliouras, Joel Leibo, Nicolas Heess
- Venue
- AAMAS 2024
-
- Date
- Title
- Asynchronous Local-SGD Training forLanguage Modeling
- Authors
- Bo Liu*, Arthur Douillard, Rachita Chhaparia, Jiajun Shen, Andrei Rusu, Arthur Szlam, Marc'aurelio Ranzato, Satyen Kale
- Venue
- arXiv
-
- Date
- Title
- E3x: E(3)-Equivariant Deep Learning Made Easy
- Authors
- Oliver Unke, Hartmut Maennel
- Venue
- arXiv
-
- Date
- Title
- Approximating Nash Equilibria in Normal-Form Games via Stochastic Optimization
- Authors
- Ian Gemp, Luke Marris, Georgios Piliouras
- Venue
- ICLR 2024