Yushi Yang

DPhil Student

About

Yushi is a third-year DPhil student in Social Data Science at the OII, where she is a member of the Reasoning with Machines Lab (OxRML) under the supervision of Prof. Adam Mahdi. Her research focuses on post-training mechanisms for aligning language model agents, in particular how reinforcement learning and preference tuning methods align model behaviours with human values. Her work spans LLM post-training, agents, and AI alignment, and has been published at venues including EMNLP and ICML.

Alongside her doctoral research, Yushi joined Meta Superintelligence Labs as a Research Scientist Intern. She has previously worked as an AI Engineer at Reply, a Data Analyst at UNDP, and an AI Researcher at the University of Oxford and Imperial College London.

Yushi holds an MSc in Artificial Intelligence from Imperial College London, an MSc in Statistical Science from the University of Oxford, and a BSc in Mathematics and Statistics from the University of Warwick.

Research Interests

LLM Post-training; Agents; AI alignment; Reinforcement learning

Positions at the OII

  • DPhil Student, September 2023 - present
