I’m a Postdoctoral Research Associate at The Alan Turing Institute and an Associate Member of the Department of Computer Science at the University of Oxford, where I’m mentored by Marta Kwiatkowska and Lukasz Szpruch. From 2021 to 2024, I completed my PhD at the University of Oslo supervised by Christos Dimitrakakis, and previously studied Mathematics (BSc, MSc).
My research interests are in reinforcement learning and related areas, including reward learning, preference learning, and the intersection of machine learning with game theory and mechanism design.
News
- 04/2025: The 2nd edition of last year’s workshop on Models of Human Feedback for AI Alignment is taking place on July 18th at ICML 2025! The submission deadline is May 25th!
- 03/2025: We released three new preprints on Strategyproof RLHF, Causal Imitation Learning and Multi-Agent Cooperative RL.
- 11/2024: I’m visiting Haifeng Xu’s group at the University of Chicago. I’ll give a talk on Strategic Interactive Decision-Making on December 5th at the CS Department.
- 07/2024: We’re organizing the ICML 2024 Workshop on Models of Human Feedback for AI Alignment. Update 09/2024: Recordings are now available here.
Selected Publications
Strategyproof Reinforcement Learning from Human Feedback [pdf]
Thomas Kleine Buening, Jiarui Gan, Debmalya Mandal, Marta Kwiatkowska
working paper
Strategic Linear Contextual Bandits [pdf]
Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu
NeurIPS 2024
Environment Design for Inverse Reinforcement Learning [pdf]
Thomas Kleine Buening$^\star$, Victor Villin$^\star$, Christos Dimitrakakis
ICML 2024, Oral Presentation
You can reach me at tbuening@turing.ac.uk or thomas.kleinebuening@cs.ox.ac.uk.