![]() |
王 傲 然
Ao-Ran Wang (A.-R. Wang) Ph.D. student, LAMDA Group School of Artificial Intelligence National Key Laboratory for Novel Software Technology Nanjing University, Nanjing 210023, China Email: wangar@lamda.nju.edu.cn Laboratory: Room A201, Shaoyifu Building, Nanjing University Xianlin Campus Supervisor: Professor Zong-Zhang Zhang |
Conference Paper:
Aoran Wang, Lei Ou, Yang Yu, Zongzhang Zhang. Reward Model Evaluation via Automatically-Ranked Policy Alignment. In: Proceedings of the 40th AAAI Conference on Artificial Intelligence (AAAI'26) Main Track, 2026. (Oral)