Research
Publications
-
- Date
- Title
- Learning a Fourier Transform for Linear Relative Positional Encodings in Transformers
- Authors
- Krzysztof Marcin Choromanski, Shanda Li, Valerii Likhosherstov, Kumar Avinava Dubey, Shengjie Luo, Di He, Yiming Yang, Tamas Sarlos, Thomas Weingarten, Adrian Weller
- Venue
- AISTATS 2024
-
- Date
- Title
- On Limitations of the Transformer Architecture
- Authors
- Binghui Peng*, Srini Narayanan, Christos Papadimitriou*
- Venue
- arXiv
-
- Date
- Title
- AlphaTensor for Optimizing Quantum Computations
- Authors
- Francisco Ruiz, Tuomas Laakkonen*, Johannes Bausch, Matej Balog, Mohammadamin Barekatain*, Francisco Heras, Alexander Novikov, Nathan Fitzpatrick*, Bernardino Romera Paredes, John van de Wetering*, Alhussein Fawzi, Konstantinos Meichanetzidis*, Pushmeet Kohli
- Venue
- arXiv
-
- Date
- Title
- Genie: Generative Interactive Environments
- Authors
- Jake Bruce, Michael Dennis, Ashley Edwards, Jack Parker-Holder, Yuge Shi, Edward Hughes, Matthew Lai, Aditi Mavalankar, Richie Steigerwald, Chris Apps, Yusuf Aytar, Sarah Bechtle, Feryal Behbahani, Stephanie Chan, Nicolas Heess, Lucy Gonzalez, Simon Osindero, Sherjil Ozair, Scott Reed, Jingwei Zhang, Konrad Zolna, Jeff Clune, Nando de Freitas, Satinder Singh, Tim Rocktäschel
- Venue
- arXiv
-
- Date
- Title
- When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method
- Authors
- Biao Zhang, Zhongtao Liu, Colin Cherry, Orhan Firat
- Venue
- ICLR 2024
-
- Date
- Title
- OmniPred: Language Models as Universal Regressors
- Authors
- Xingyou Song, Oscar Li, Chansoo Lee, Bangding (Jeffrey) Yang, Daiyi Peng, Sagi Perel, Yutian Chen
- Venue
- arXiv
-
- Date
- Title
- The Next 700 ML-Enabled Compiler Optimizations
- Authors
- S. VenkataKeerthy*, Siddharth Jain*, Umesh Kalvakuntla*, Gorantla Pranav Sai*, Albert Cohen, Eugene Brevdo, Mircea Trofin*, Ramakrishna Upadrasta*
- Venue
- ACM SIGPLAN 2024 International Conference on Compiler Construction
-
- Date
- Title
- Simulacra as Conscious Exotica
- Authors
- Murray Shanahan
- Venue
- arXiv
-
- Date
- Title
- Learning to Learn Faster from Human Feedback with Language Model Predictive Control
- Authors
- Ken Caluwaerts, Ben Jyenis, Jasmine Hsu, Andy Zeng, Wenhao Yu, Nik Stewart, Jacky Liang, Fei Xia, Peng Xu, Jie Tan, Brian Ichter, Erik Frey, Carolina Parada, Dorsa Sadigh, Tingnan Zhang, Ted Xiao, Zhuo Xu, Nikhil Joshi, Kuang-Huei Lee, Chase Kew, Ken Oslund, Sean Kirmani, Dushyant Rao, Quan Vuong, Keerthana Gopalakrishnan, Marissa Giustina, Jonathan Tompson, Assaf Hurwitz Michaely, Baruch Tabanpour, Maria Bauza, Edward Lee, Maria Attarian, Leonard Hasenclever, Alex Bewley, Jan Humplik, Nimrod Gileadi, Joss Moore, Leila Takayama, Allen Ren, Adil Dostmohamed, Chuyuan Kelly Fu, Ayzaan Wahid, Matt Bennice, Vincent Zhuang, Nicolas Heess, Izhak Shafran, Vincent Vanhoucke, Maja Mataric, Montse Gonzalez Arenas, Ying Xu, Kanishka Rao
- Venue
- arXiv
-
- Date
- Title
- A Human-Inspired Reading Agent with Gist Memory of Very Long Contexts
- Authors
- Kuang-Huei Lee, Xinyun Chen, Hiroki Furuta, John Canny, Ian Fischer
- Venue
- arXiv
-
- Date
- Title
- Experts Don't Cheat: Learning What You Don't Know by Predicting Pairs
- Authors
- Daniel D. Johnson, Daniel Tarlow, David Duvenaud, Chris J. Maddison
- Venue
- arXiv
-
- Date
- Title
- Premise Order Matters in Reasoning with Large Language Models
- Authors
- Xinyun Chen, Ryan A. Chi, Xuezhi Wang, Denny Zhou
- Venue
- arXiv
-
- Date
- Title
- A Distributional Analogue to the Successor Representation
- Authors
- Harley Wiltzer*, Jesse Farebrother, Arthur Gretton, Yunhao Tang, Andre Barreto, Will Dabney, Marc Bellemare*, Mark Rowland
- Venue
- arXiv
-
- Date
- Title
- Near-Minimax-Optimal Distributional RL with a Generative Model
- Authors
- Mark Rowland, Kevin Li, Remi Munos, Clare Lyle, Yunhao Tang, Will Dabney
- Venue
- arXiv
-
- Date
- Title
- PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
- Authors
- Soroush Nasiriany*, Fei Xia*, Wenhao Yu, Ted Xiao*, Jacky Liang, Ishita Dasgupta, Annie Xie, Danny Driess, Ayzaan Wahid, Zhuo Xu, Quan Vuong, Tingnan Zhang, Tsang-Wei Edward Lee, Kuang-Huei Lee, Peng Xu, Sean Kirmani, Yuke Zhu, Andy Zeng, Karol Hausman, Nicolas Heess, Chelsea Finn, Sergey Levine, Brian Ichter*
- Venue
- arXiv
-
- Date
- Title
- Chain-of-Table: Evolves Tables in the LLM Reasoning Chain for Table Understanding
- Authors
- Zilong Wang*, Chen-Yu Lee, Chun-Liang Li, Hao Zhang, Julian Eisenschlos, Lesly Miculicich, Tomas Pfister, Vincent Perot, Yasuhisa Fujii, Zifeng Wang, Jingbo Shang*
- Venue
- ICLR 2024
-
- Date
- Title
- Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
- Authors
- Nicolas Nguyen, Imad Aouali, András György, Claire Vernade
- Venue
- arXiv
-
- Date
- Title
- Memory Consolidation Enables Long-Context Video Understanding
- Authors
- Ivana Balazevic, Jimmy Shi, Nelly Papalampidi, Rahma Chaabouni, Skanda Koppula, Olivier Henaff
- Venue
- arXiv
-
- Date
- Title
- States as Strings as Strategies: Steering Language Models with Game-Theoretic Solvers
- Authors
- Ian Gemp, Yoram Bachrach, Marc Lanctot, Roma Patel, Vibhavari Dasagi, Luke Marris, Georgios Piliouras, Siqi Liu, Karl Tuyls
- Venue
- arXiv
-
- Date
- Title
- Large Language Models Self-Discover Reasoning Structures
- Authors
- Pei Zhou*, Jay Pujara*, Xiang Ren*, Swaroop Mishra, Steven Zheng, Denny Zhou, Heng-Tze Cheng, Quoc Le, Ed Chi, Xinyun Chen
- Venue
- arXiv