Image
Image
Image
Image
Image
Image
Image
Image
Image
Image



Search
»

AGPO

Description: The package is the code for Anomaly Guided Policy Learning from Imperfect Demonstrations in AAMAS 22'. This code is developed based on OpenAI baseline[1]. We also support environments from DeepMind Control Suite (dm-control)[2].

References:
[1] Prafulla Dhariwal, Christopher Hesse, Oleg Klimov, Alex Nichol, Matthias Plappert, AlecRadford, John Schulman, Szymon Sidor, Yuhuai Wu, and Peter Zhokhov. 2017. OpenAIBaselines.https://github.com/openai/baselines. [2] Yuval Tassa, Yotam Doron, Alistair Muldal, Tom Erez, Yazhe Li, Diego de LasCasas, David Budden, Abbas Abdolmaleki, Josh Merel, Andrew Lefrancq, et al.2018. Deepmind control suite.arXiv preprint arXiv:1801.00690(2018).

ATTN: This package is free for academic usage. You can run it at your own risk. For other purposes, please contact .

ATTN2: This package was developed by and . For any problem concerning the code, please feel free to contact us.

Requirement: See installation instruction from [1] and [2]

Download: [code] (100.00MB)
  Name Size

Image
PoweredBy © LAMDA, 2022