![]() |
杨思航 |
Currently I am a second year graduate M.Sc. student of School of Artificial Intelligence in Nanjing University and a member of LAMDA Group, led by professor Zhi-Hua Zhou.
I received my B.Sc. degree from School of Artificial Intelligence, Nanjing University in June 2023. In September 2023, I was admitted to pursue a M.Sc. degree in Nanjing University, under the supervision of Professor Yang Yu without entrance examination.
My research interest is Reinforcement Learning. Especially, I am interested in
Reinforcement Learning System
Hierarchical Reinforcement Learning
Jing-Cheng Pang*, Si-Hang Yang*, Kaiyuan Li, Jiaji Zhang, Xiong-Hui Chen, Nan Tang and Yang Yu. Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts. In: NeurIPS, 2024. [Paper]
Xiong-Hui Chen, Junyin Ye, Hang Zhao, Yi-Chen Li, Xu-Hui Liu, Haoran Shi, Yu-Yan Xu, Zhihao Ye, Si-Hang Yang, Yang Yu, Kai Xu, Zongzhang Zhang, Anqi Huang. Deep Demonstration Tracing: Learning Generalizable Imitator for Runtime One-Shot Imitation. In: ICML, 2024. [Paper]
Jing-Cheng Pang*, Xin-yu Yang*, Si-Hang Yang, Xiong-Hui Chen and Yang Yu. Natural Language-conditioned Reinforcement Learning with Inside-out Task Language Development and Translation. In: NeurIPS, 2023. [Paper]
Jing-Cheng Pang*, Si-Hang Yang*, Xiong-Hui Chen, Xin-yu Yang, Yang Yu, Mas Ma, Zi-qi Guo, Howard Yang and Bill Huang. Object-Oriented Option Framework for Robotics Manipulation in Clutter. In: IROS, 2023. [Paper]
Jing-Cheng Pang*, Heng-Bo Fan*, Pengyuan Wang*, Jia-Hao Xiao*, Nan Tang, Si-Hang Yang, Chengxing Jia, Sheng-Jun Huang and Yang Yu. Empowering Language Models with Active Inquiry for Deeper Understanding. In: CoRR abs/2402.03719, 2024. [Paper]
* Equal contribution.
Email:
yangsh {AT} lamda.nju.edu.cn, CharlieBrown-v1 {AT} outlook.com
Laboratory:
A201, Shaoyifu Building, Nanjing University Xianlin Campus
Address:
National Key Laboratory for Novel Software Technology, Nanjing University, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China
(南京市栖霞区仙林大道163号, 软件新技术国家重点实验室, 210023.)