Page History: Reinforcement learning

Compare Page Revisions

« Older Revision - Back to Page History - Newer Revision »

Page Revision: 2013/10/26 18:53

Conference Papers

Qing Da, Yang Yu, and Zhi-Hua Zhou. Self-Practice Imitation Learning from Weak Policy. In: Proceedings of the 2nd IAPR International Workshop on Partially Supervised Learning, Nanjing, China, 2013, pp.9-20.

Wang-Zhou Dai, Yang Yu, and Zhi-Hua Zhou. Lifted-rollout for approximate policy iteration of Markov decision process. In: Proceedings of the International Workshop on Learning and Data Mining for Robotics (LEMIR'11)''', in conjunction with ICDM'11, Vancouver, Canada, 2011.