Page History: Reinforcement learning
Page Revision: 2013/10/26 18:53
Conference Papers
Qing Da, Yang Yu, and Zhi-Hua Zhou. Self-Practice Imitation Learning from Weak Policy. In: Proceedings of the 2nd IAPR International Workshop on Partially Supervised Learning, Nanjing, China, 2013, pp. 9-20.
Wang-Zhou Dai, Yang Yu, and Zhi-Hua Zhou. Lifted-rollout for approximate policy iteration of Markov decision process. In: Proceedings of the International Workshop on Learning and Data Mining for Robotics (LEMIR'11), in conjunction with ICDM'11, Vancouver, Canada, 2011.