Yang Yu @ NJUCS

ImageChinese name(中文简历)
Yang Yu (Y. Yu)
Can be pronounced as "young you"
Ph.D., Professor
School of Artificial Intelligence
National Key Laboratory for Novel Software Technology
Nanjing University

Office: 311, Computer Science Building, Xianlin Campus
email: ,
Image Image

I received my Ph.D. degree in Computer Science from Nanjing University in 2011 (supervisor Prof. Zhi-Hua Zhou), and then joined the LAMDA Group (LAMDA Publications), in the Department of Computer Science and Technology of Nanjing University as an Assistant Researcher from 2011, and as an Associate Professor from 2014. I joined the School of Artificial Intelligence of Nanjing University as a Professor from 2019.

My research interest is in machine learning, a sub-field of artificial intelligence. Currently, I am working on reinforcement learning in various aspects, including optimization, representation, transfer, etc. More information please see my CV. (Detailed CV | CV in PDF)


Recent Update

StarCraft II We published the first paper of reinforcement learning on the full length game of StarCraft II.   Virtual Taobao A Virtual Taobao environment is released for the research of recommendation system and reinforcement learning.
Neuron & Logic Our NeurIPS'19 paper connects neural perception and logic reasoning through abductive learning. It is now open sourced   Talk I gave an Early Career Spotlight talk on Toward Sample Efficient Reinforcement Learning in IJCAI 2018.
ZOOpt A Python package for derivative free optimization. Release 0.2.   AWRL We will have the 4th Asian Workshop on Reinforcement Learning



A quick-learned policy beats level 3 bot in Starcraft II

Currently, I am mainly focusing on reinforcement learning. Reinforcement learning searches for a policy of near-optimal decisions, by learning from environment interactions autonomously. Despite the fantastic future, reinforcement learning is still in early infancy. Its potential has not been fully released in many situations. Our team is trying in various aspects to improve reinforcement learning, including theoretical foundation, optimization, model structure, experience reuse, abstraction, model building, etc., heading toward sample-efficient methods for large-scale physical-world applications.

Full publication list >>>




Selected Work

Z.-H. Zhou, Y. Yu, C. Qian. Evoluionary
Learning: Advances in Theories and
Algorithms. Berlin: Springer, 2019.

(My Goolge Scholar Citations)






National Key Laboratory for Novel Software Technology, Nanjing University, Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China
(In Chinese:) 南京市栖霞区仙林大道163号,南京大学仙林校区603信箱,软件新技术国家重点实验室,210023。