Research
Publications
-
- Date
- Title
- Learning from One Continuous Video Stream
- Authors
- Venue
-
- Date
- Title
- Long-form factuality in large language models
- Authors
- Venue
-
- Date
- Title
- Few-Shot Recalibration of Language Models
- Authors
- Venue
-
- Date
- Title
- Evaluating Frontier Models for Dangerous Capabilities
- Authors
- Venue
-
- Date
- Title
- DiPaCo: Distributed Path Composition
- Authors
- Venue
-
- Date
- Title
- Model-free Posterior Sampling via Learning Rate Randomization
- Authors
- Venue
-
- Date
- Title
- Robust Exploration via Clustering-based Density Estimation
- Authors
- Venue
-
- Date
- Title
- Prosody for Intuitive Robotic Interface Design: It's Not What You Said, It's How You Said It
- Authors
- Venue
-
- Date
- Title
- Understanding Learning from Human Preferences
- Authors
- Venue
-
- Date
- Title
- Demonstration-Regularized RL
- Authors
- Venue
-
- Date
- Title
- AtP*: Efficient and scalable methods for localizing LLM behaviour to components
- Authors
- Venue
-
- Date
- Title
- How aligned are different alignment metrics?
- Authors
- Venue
-
- Date
- Title
- Approximating the Core of Cooperative Games
- Authors
- Venue
-
- Date
- Title
- Towards Practical Reinforcement Learning for Tokamak Magnetic Control
- Authors
- Venue
-
- Date
- Title
- Self-supervised video pretraining yields strong image representations
- Authors
- Venue
-
- Date
- Title
- Bad Students Make Great Teachers: Active Learning Accelerates Large Scale Visual Understanding
- Authors
- Venue
-
- Date
- Title
- Set Learning for Accurate and Calibrated Models
- Authors
- Venue
-
- Date
- Title
- A density estimation perspective on learning from pairwise human preferences
- Authors
- Venue
-
- Date
- Title
- Intriguing Properties of Generative Classifiers
- Authors
- Venue
-
- Date
- Title
- Frozen Feature Augmentation
- Authors
- Venue