![]() |
![]()
M.Sc. Student, LAMDA Group |
I received my B.Sc. degree from School of Artificial Intelligence, Nanjing University, in June 2022. In the same year, I was admitted to study for a M.Sc. degree in Nanjing University in the LAMDA Group led by professor Zhi-Hua Zhou, under the supervision of Prof. De-Chuan Zhan.
My research interests primarily include reinforcement learning and multimodal large models, focusing on the following areas:
Imitation Learning
Unsupervised Reinforcement Learning
Interpretability of Multimodal Large Models
|
We propose a bi-level optimization framework named Separation-assisted eXplorer (SeeX). In the inner optimization, SeeX trains a separated world model to extract exogenous and endogenous information, minimizing uncertainty to ensure task relevance. In the outer optimization, it learns a policy on imaginary trajectories generated within the endogenous state space to maximize task-relevant uncertainty. |
![]() |
We propose a method for analyzing external ballistic measurement errors caused by various conditions, which result in unobservable random errors and latent systematic errors. Using Intrinsic Mode Function (IMF) energy inflection points, this method categorizes IMFs into high-frequency errors, mixed information, and useful information, effectively compensating for systematic errors and improving positioning accuracy. |
|
We focus on the problem of Learning from Noisy Demonstrations (LND), where the imitator is required to learn from data with noise that often occurs during the processes of data collection or transmission. We propose LTI-Mimic, which designs two discriminators to distinguish the noise level and expertise level of data, facilitating a feature encoder to learn task-related but domain-agnostic representations. |
|
We introduce active sensoring in the visual IL setting and propose a model-based SENSory imitatOR (SENSOR) to automatically change the agent's perspective to match the expert's. SENSOR jointly learns a world model to capture the dynamics of latent states, a sensor policy to control the camera, and a motor policy to control the agent. |
2023, Academic scholarship of Nanjing University
2020, First Prize in East China Region, 2020 National University Bio-network Design Competition
2019, Third Prize, The Ninth Nanjing University Career Planning Competition for College Students
Email: huangkc@lamda.nju.edu.cn
Office: Room A304, Shaoyifu Building, Xianlin Campus of Nanjing University
Address: National Key Laboratory for Novel Software Technology, Nanjing University,
Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China
(南京市栖霞区仙林大道163号, 南京大学仙林校区603信箱, 软件新技术国家重点实验室, 210023.)