I’m a third-year Ph.D. student at McGill University and Mila — Quebec AI Institute, advised by Prof. Yichuan Ding.
Prior to McGill, I graduated from Tsinghua University, advised by Prof. Xianyuan Zhan, where I worked on offline reinforcement learning.
My research currently focuses on two directions:
Alignment of large language models. RLHF, preference modeling, and off-policy distillation (OPD) for post-training.
Marketing research with LLMs. LLM-powered methods for consumer research, behavioral simulation, personalization, and decision support.
Email: li.jiang3 [at] mail [dot] mcgill [dot] ca