Currently I am a first year graduate student of School of Artificial Intelligence in
Nanjing University and a member of
LAMDA Group ( LAMDA Publications )
, led by Professor Zhi-Hua Zhou.
EditSupervisor
Professor
Yang Yu.
Edit
Research Interests
I am interested in Machine Learning, mainly focus on the following subfields:
Reinforcement Learning : especially on Model-Based RL;
Large Models : towards more general and multimodal agentic tasks in real world through large language model-centered structure and reinforcement learning;
Edit
Publications
-
Controlling Large Language Model with Latent Actions
Chengxing Jia, Ziniu Li, Pengyuan Wang, Yi-Chen Li, Zhenyu Hou, Yuxiao Dong, Yang Yu
In: Proceedings of the 42th International Conference on Machine Learning (ICML'25), 2025.
-
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation
Chengxing Jia*, Pengyuan Wang*, Ziniu Li*, Yi-Chen Li, Zhilong Zhang, Nan Tang, Yang Yu
arXiv preprint arXiv:2405.17039.
-
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
Yi-Chen Li*, Fuxiang Zhang*, Wenjie Qiu, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu, Bo An
In: Proceedings of the 13th International Conference on Learning Representations (ICLR'25), 2025.
-
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning
Haoxin Lin, Yu-Yan Xu, Yihao Sun, Zhilong Zhang, Yi-Chen Li, Chengxing Jia, Junyin Ye, Jiaji Zhang, Yang Yu
In: Proceedings of the 13th International Conference on Learning Representations (ICLR'25), 2025.
-
Offline transition modeling via contrastive energy learning
Ruifeng Chen*, Chengxing Jia*, Zefang Huang, Tian-Shuo Liu, Xu-Hui Liu, Yang Yu
In: Proceedings of the 41th International Conference on Machine Learning (ICML'24), 2024.
-
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
Xinyu Zhang*, Wenjie Qiu*, Yi-Chen Li*, Lei Yuan, Chengxing Jia, Zongzhang Zhang, Yang Yu
In: Proceedings of the 41th International Conference on Machine Learning (ICML'24), 2024.
-
Policy rehearsing: Training generalizable policies for reinforcement learning
Chengxing Jia, Chen-Xiao Gao, Hao Yin, Fuxiang Zhang, Xiong-Hui Chen, Tian Xu, Lei Yuan, Zongzhang Zhang, Zhi-Hua Zhou, Yang Yu
In: Proceedings of the 12th International Conference on Learning Representations (ICLR'24), 2024.
-
Disentangling policy from offline task representation learning via adversarial data augmentation
Chengxing Jia*, Fuxiang Zhang*, Yi-Chen Li, Chen-Xiao Gao, Xu-Hui Liu, Lei Yuan, Zongzhang Zhang, Yang Yu
In: Proceedings of the 23rd International Conference on Autonomous Agents and Multiagnet Systems (AAMAS'24), 2024.
-
Model gradient: unified model and policy learning in model-based reinforcement learning
Chengxing Jia*, Fuxiang Zhang*, Tian Xu, Jing-Cheng Pang, Zongzhang Zhang, Yang Yu
Frontiers of Computer Science, 2024
-
Model-Bellman inconsistency for model-based offline reinforcement learning
Yihao Sun*, Jiaji Zhang*, Chengxing Jia, Haoxin Lin, Junyin Ye, Yang Yu
In: Proceedings of the 40th International Conference on Machine Learning (ICML'23), 2023.
-
Discovering generalizable multi-agent coordination skills from multi-task offline data
Fuxiang Zhang*, Chengxing Jia*, Yi-Chen Li, Lei Yuan, Yang Yu, Zongzhang Zhang
In: Proceedings of the 11th International Conference on Learning Representations (ICLR'23), 2023.
-
Fast Teammate Adaptation in the Presence of Sudden Policy Change
Ziqian Zhang*, Lei Yuan*, Lihe Li, Ke Xue, Chengxing Jia, Cong Guan, Chao Qian, Yang Yu
Uncertainty in Artificial Intelligence (UIA'23), 2023.
-
Coordination Scheme Probing for Generalizable Multi-Agent Reinforcement Learning
Hao Ding*, Chengxing Jia*, Zongzhang Zhang*, Cong Guan, Feng Chen, Lei Yuan, Yang Yu (TNNLS'24), 2024.
EditAwards & Honors
Postgraduate Scholarship of Nanjing University, 2023
Postgraduate Scholarship of Nanjing University, 2022
Postgraduate Scholarship of Nanjing University, 2021
Postgraduate Scholarship of Nanjing University, 2020
Contact:
National Key Laboratory for Novel Software Technology, Nanjing
University, Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia
District, Nanjing 210023, China
(南京市栖霞区仙林大道163号,南京大学仙林校区603信箱,软件新技术国家重点实验室,210023)