Publications and Preprints
$^\star$ equal contribution
2026
Aligning Language Models from User Interactions [pdf]
Thomas Kleine Buening, Jonas Hübotter, Barna Pásztor, Giorgia Ramponi, Andreas Krause
Reinforcement Learning via Self-Distillation [pdf]
Jonas Hübotter, Frederike Lübeck, Lejs Behric, Anton Baumann, Marco Bagatella, Daniel Marta, Ido Hakimi, Idan Shenfeld, Thomas Kleine Buening, Carlos Guestrin, Andreas Krause
MAVRL: Learning Reward Functions from Multiple Feedback Types with Amortized Variational Inference [pdf]
Raphaël Baur, Yannick Metz, Maria Gkoulta, Menna El-Assady, Giorgia Ramponi, Thomas Kleine Buening
Stackelberg Learning from Human Feedback: Preference Optimization as a Sequential Game [pdf]
Barna Pásztor, Thomas Kleine Buening, Andreas Krause
ICLR 2026
Causal Imitation Learning under Expert-Observable and Expert-Unobservable Confounding [pdf]
Daqian Shao, Thomas Kleine Buening, Marta Kwiatkowska
ICLR 2026
2025
Strategyproof Reinforcement Learning from Human Feedback [pdf]
Thomas Kleine Buening, Jiarui Gan, Debmalya Mandal, Marta Kwiatkowska
NeurIPS 2025
Data Source Adaptive Online Learning under Heteroscedastic Noise [pdf]
Amith Anan, Aadirupa Saha, Thomas Kleine Buening, Haipeng Luo
OPT 2025: 17th Annual Workshop on Optimization for Machine Learning
A Minimax Approach to Ad Hoc Teamwork [pdf]
Victor Villin, Thomas Kleine Buening, Christos Dimitrakakis
AAMAS 2025
2024
Strategic Linear Contextual Bandits [pdf]
Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu
NeurIPS 2024
Environment Design for Inverse Reinforcement Learning [pdf]
Thomas Kleine Buening$^\star$, Victor Villin$^\star$, Christos Dimitrakakis
ICML 2024, Oral Presentation (1.5%)
Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation [pdf]
Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu
ICLR 2024, Spotlight Presentation (5%)
2023
Minimax-Bayes Reinforcement Learning [pdf]
Thomas Kleine Buening$^\star$, Christos Dimitrakakis$^\star$, Hannes Eriksson$^\star$, Divya Grover$^\star$, Emilio Jorge$^\star$
AISTATS 2023
An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits [pdf]
Thomas Kleine Buening, Aadirupa Saha
AISTATS 2023
2022
Interactive Inverse Reinforcement Learning for Cooperative Games [pdf]
Thomas Kleine Buening, Anne-Marie George, Christos Dimitrakakis
ICML 2022, Best Paper Award at the Cooperative AI Workshop (NeurIPS 2021)
On Meritocracy in Optimal Set Selection [pdf]
Thomas Kleine Buening, Meirav Segal, Debabrota Basu, Anne-Marie George, Christos Dimitrakakis
EEAMO 2022, Best Student Paper Award
