I am a PhD student at the Max Planck Institute for Intelligent Systems, working with Moritz Hardt and Bernhard Schoelkopf.
My research focuses broadly on language models.
Selected works:
Training on the Test Task Confounds Evaluation and Emergence, ICLR 2025 (Oral).
Blog posts:
How to Fix the LMArena? Release All Data
If you're interested in working with me as a master's student or intern, feel free to reach out via email.