I’m a Postdoctoral Research Associate at The Alan Turing Institute and an Associate Member of the Department of Computer Science at the University of Oxford, where I’m mentored by Marta Kwiatkowska and Lukasz Szpruch. From 2021 to 2024, I completed my PhD at the University of Oslo supervised by Christos Dimitrakakis, and previously studied Mathematics (BSc, MSc).

My research interests are in reinforcement learning and related areas, including reward learning, preference learning, and the intersection of ML with game theory and mechanism design.

News

Selected Publications

  • Strategyproof Reinforcement Learning from Human Feedback [pdf]
    Thomas Kleine Buening, Jiarui Gan, Debmalya Mandal, Marta Kwiatkowska
    preprint

  • Strategic Linear Contextual Bandits [pdf]
    Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu
    NeurIPS 2024

  • Environment Design for Inverse Reinforcement Learning [pdf]
    Thomas Kleine Buening$^\star$, Victor Villin$^\star$, Christos Dimitrakakis
    ICML 2024, Oral Presentation

  • Bandits Meet Mechanism Design to Combat Clickbait in Online Recommendation [pdf]
    Thomas Kleine Buening, Aadirupa Saha, Christos Dimitrakakis, Haifeng Xu
    ICLR 2024, Spotlight Presentation

You can reach me at tbuening@turing.ac.uk or thomas.kleinebuening@cs.ox.ac.uk.