![]() |
万盛华 |
I received my B.Sc. degree of GIS from Nanjing University, in June 2021. In the same year, I was admitted to study for a Ph.D. degree in Nanjing University without entrance examination in the LAMDA Group led by professor Zhi-Hua Zhou, under the supervision of Prof. De-Chuan Zhan.
My research interest includes Reinforcement Learning and its real-world applications, and mainly focus on sim2real problems:
Visual Imitation Learning (Cross-modal, Distractors, Viewpoint, etc.)
Representations in RL
![]() |
Existing visual RL algorithms mostly rely on a single observation from a well-designed fixed camera that requires human knowledge. Recent studies learn from different viewpoints with multiple fixed cameras, but this incurs high computation and storage costs and does not guarantee the coverage of the optimal viewpoint. To address these issues, we propose the View-conditional Markov Decision Process (VMDP) assumption and develop a new method, the MOdel-based SEnsor controlleR (MOSER), based on VMDP. MOSER jointly learns a view-conditional world model (VWM) to simulate the environment, a sensory policy to control the camera, and a motor policy to complete tasks. We design intrinsic rewards from the VWM without additional modules to guide the sensory policy to adjust the camera parameters. |
![]() |
Existing Model-based imitation learning algorithms are highly deceptive by task-irrelevant information, especially moving distractors in videos. To tackle this problem, we propose a new algorithm - named Separated Model-based Adversarial Imitation Learning (SeMAIL) - decoupling the environment dynamics into two parts by task-relevant dependency, which is determined by agent actions, and training separately. |
![]() |
We designed a general multitemporal framework to extract urban green cover using multi-training, a novel semi-supervised learning method for land cover classification on multitemporal remote sensing images. |
Presidential Special Scholarship for first year Ph.D. Student in Nanjing University, 2021.
Outstanding Graduate of Nanjing University, 2021.
Winner of the Ping An Insurance Data Mining Competition, 2021.
2-nd place in ZhongAn Cup Insurance Data Mining Competition, 2020.
Introduction to Machine Learning. (For undergraduate students, Spring, 2022)
Email:
wansh [at] lamda.nju.edu.cn
Office:
Yifu Building, Xianlin Campus of Nanjing University
Address:
National Key Laboratory for Novel Software Technology, Nanjing University, Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China
(南京市栖霞区仙林大道163号, 南京大学仙林校区603信箱, 软件新技术国家重点实验室, 210023.)