Page History: Reinforcement learning
Compare Page Revisions
8
7
6
5
4
3
2
1
0
Current
8
7
6
5
4
3
2
1
0
« Older Revision
-
Back to Page History
-
Newer Revision »
Page Revision: 2014/04/08 15:18
(
Back to main page
)
Conference Papers
Qing Da,
Yang Yu
, and Zhi-Hua Zhou.
Napping for Functional Representation of Policy
. In:
Proceedings of the 2014 International Conference on Autonomous Agents and Multi-Agent Systems (AAMAS'14)
, Paris, France, 2014. (
PDF
)
Qing Da,
Yang Yu
, and Zhi-Hua Zhou.
Self-Practice Imitation Learning from Weak Policy
. In:
Proceedings of the 2nd IAPR International Workshop on Partially Supervised Learning (PSL'13)
, Nanjing, China, 2013, pp.9-20.
Wang-Zhou Dai,
Yang Yu
, and Zhi-Hua Zhou.
Lifted-rollout for approximate policy iteration of Markov decision process
. In:
Proceedings of the International Workshop on Learning and Data Mining for Robotics (LEMIR'11)
, in conjunction with ICDM'11, Vancouver, Canada, 2011.
The end