1 Hit in 2.4 sec

School of hard knocks: Curriculum analysis for Pommerman with a fixed computational budget [article]

Omkar Shelke, Hardik Meisheri, Harshad Khadilkar
2021 arXiv   pre-print
In this paper, we focus on developing a curriculum for learning a robust and promising policy in a constrained computational budget of 100,000 games, starting from a fixed base policy (which is itself  ...  We test this hypothesis and show that within constrained computational budgets, it is in fact better to "learn in the school of hard knocks", i.e., against all available opponent policies nearly from the  ...  Conclusion In a hybrid cooperative/adversarial multi-agent game such as Pommerman, curriculum learning is a popular way of accelerating training.  ... 
arXiv:2102.11762v2 fatcat:il54s3qbmjdzlk6yos3kva22bq