I received my B.Sc. and M.Sc. degrees from Nanjing University, China, in 2010 and 2013, respectively. In Sep. 2013, I became a Ph.D. student in the LAMDA Group, Nanjing University, under the supervision of
Prof. Zhi-Hua Zhou.
In Jan. 2015, I left the Ph.D. program and joined the search algorithm team at Alibaba.
For the latest information, please visit
http://daqings.net.
Research Interests
My research interests include machine learning and data mining.
Publications
- Jing-Cheng Shi, Yang Yu, Qing Da, Shi-Yong Chen, An-Xiang Zeng. Virtual-Taobao: Virtualizing real-world online retail environment for reinforcement learning. CoRR abs/1805.10000.
- Yusen Zhan, Qing Da, Fei Xiao, An-xiang Zeng, Yang Yu. Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection. CoRR abs/1803.00693.
- Hua-Lin Hei, Chun-Xiang Pan, Qing Da, An-Xiang Zeng. Speeding up the Metabolism in E-commerce by Reinforcement Mechanism Design. In: Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML PKDD'18), Dublin, Ireland, 2018. PDF
- Shi-Yong Chen, Yang Yu, Qing Da, Jun Tan, Hai-Kuan Huang, and Hai-Hong Tang. Stabilizing reinforcement learning in dynamic environment with application to online recommendation. In: Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'18) (Research Track), London, UK, 2018. PDF
- Yujing Hu, Qing Da, Anxiang Zeng, Yang Yu, Yinghui Xu. Reinforcement Learning to Rank in E-Commerce Search Engine: Formalization, Analysis, and Application. In: Proceedings of the 24th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD'18) (Applied Track), London, UK, 2018. PDF
- Yang Yu, Shi-Yong Chen, Qing Da, Zhi-Hua Zhou. Reusable reinforcement learning via shallow trails. IEEE Transactions on Neural Networks and Learning Systems, 2018, 29(6): 2204-2215. PDF
- Yang Yu, Peng-Fei Hou, Qing Da, and Yu Qian. Boosting nonparametric policies. In: Proceedings of the 2016 International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'16), Singapore, 2016, pp.477-484. PDF Code
- Yang Yu and Qing Da. PolicyBoost: Functional policy gradient with ranking-based reward objective. In: Proceedings of AAAI Workshop on AI and Robotics (AIRob'14), Québec City, Canada, 2014. PDF Code
- Qing Da, Yang Yu, and Zhi-Hua Zhou. Learning with Augmented Class by Exploiting Unlabeled Data. In: Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI'14), Québec City, Canada, 2014. PDF Code
- Qing Da, Yang Yu, and Zhi-Hua Zhou. Napping for Functional Representation of Policy. In: Proceedings of the 2014 International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14), Paris, France, 2014. PDF Code
- Qing Da, Yang Yu, and Zhi-Hua Zhou. Self-Practice Imitation Learning from Weak Policy. In: Proceedings of the 2nd IAPR International Workshop on Partially Supervised Learning (PSL'13), Nanjing, China, 2013, pp.9-20. PDF
Teaching Assistant
Awards & Honors
- National Graduate Scholarship, 2012
- First prize in the Internet contest for Cloud & Mobile computing (image search track), 2012
- Grand Prize Winner of the PAKDD 2012 Data Mining Competition (Open Category) (with Nan Li, Chao Qian, Shao-Yuan Li, Yue Zhu, and Zhi-Hua Zhou), 2012
- Outstanding project (one of six nationwide) of the China Innovation Program for Students (sponsored by Sun), 2010 (project page)
- Computer World Scholarship, 2009
- First prize in China Undergraduate Mathematical Contest in Modeling (CUMCM), 2008
Correspondence
Email:
daq@lamda.nju.edu.cn,
csdaqing@gmail.com
Office: Room 912, Computer Science Building, Xianlin Campus of Nanjing University
Address: Qing Da, National Key Laboratory for Novel Software Technology, Nanjing University, Xianlin Campus Mailbox 603, 163 Xianlin Avenue, Qixia District, Nanjing 210023, China