Wenjie Qiu (邱文杰)
Email: qiuwj@lamda.nju.edu.cn
Laboratory: Room A201, Shaoyifu Building, Nanjing University Xianlin Campus
I am currently a graduate student in the School of Artificial Intelligence at
Nanjing University and a member of the LAMDA Group, led
by Professor Zhi-Hua Zhou.
I received my B.Eng. degree in Automation from the School of Management and Engineering, Nanjing University, in June 2023. In September 2023,
I was admitted to pursue an M.Sc. degree at Nanjing University under the supervision of Associate Professor
Zongzhang Zhang.
My research interests include reinforcement learning and its applications. Recently, I have been interested in:
Offline Reinforcement Learning
Reinforcement Learning from Human Feedback
Wenjie Qiu*, Yi-Chen Li*, Xuqin Zhang, Tianyi Zhang, Yihang Zhang, Zongzhang Zhang, Yang Yu. Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference. [PDF]
Yi-Chen Li*, Fuxiang Zhang*, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu, Bo An. Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation. In: Proceedings of the 13th International Conference on Learning Representations (ICLR 2025), Singapore, 2025. [PDF] [Code]
Xinyu Zhang*, Wenjie Qiu*, Yi-Chen Li*, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu. Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics. In: Proceedings of the 41st International Conference on Machine Learning (ICML 2024), Vienna, Austria, 2024. [PDF] [Code]
Huawei Scholarship, 2024
Outstanding Graduate of Nanjing University, 2023
National Scholarship, 2022
Email:
qiuwj {AT} lamda.nju.edu.cn
Laboratory:
Room A201, Shaoyifu Building, Xianlin Campus of Nanjing University
Address:
National Key Laboratory for Novel Software Technology, Nanjing University, Xianlin Campus Mailbox 603,
163 Xianlin Avenue, Qixia District, Nanjing 210023, China
(南京市栖霞区仙林大道163号, 南京大学仙林校区603信箱, 软件新技术国家重点实验室, 210023.)