Filters








89,308 Hits in 3.0 sec

Deep Model-Based Reinforcement Learning for High-Dimensional Problems, a Survey [article]

Aske Plaat, Walter Kosters, Mike Preuss
2020 arXiv   pre-print
Model-based reinforcement learning creates an explicit model of the environment dynamics to reduce the need for environment samples.  ...  In recent years, many model-based methods have been introduced to address this challenge. In this paper, we survey the contemporary model-based landscape.  ...  ACKNOWLEDGMENTS We thank the members of the Leiden Reinforcement Learning Group, and especially Thomas Moerland and Mike Huisman, for many discussions and insights.  ... 
arXiv:2008.05598v2 fatcat:5xmwmemv5bfinkw57avf5ghhxq

A survey of benchmarks for reinforcement learning algorithms

Belinda Stapelberg, Katherine Mary Malan
2020 South African Computer Journal  
This paper provides an overview of different contributions to reinforcement learning benchmarking and discusses how they can assist researchers to address the challenges facing reinforcement learning.  ...  Reinforcement learning has recently experienced increased prominence in the machine learning community.  ...  and model-based reinforcement learning approaches There are different aspects of RL systems that can be learnt.  ... 
doi:10.18489/sacj.v32i2.746 fatcat:66fd47ejfbg6jcfykelm42klr4

Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning [article]

Anusha Nagabandi, Gregory Kahn, Ronald S. Fearing, Sergey Levine
2017 arXiv   pre-print
In this work, we demonstrate that medium-sized neural network models can in fact be combined with model predictive control (MPC) to achieve excellent sample complexity in a model-based reinforcement learning  ...  Model-free deep reinforcement learning algorithms have been shown to be capable of learning a wide range of robotic skills, but typically require a very large number of samples to achieve good performance  ...  MODEL-BASED DEEP REINFORCEMENT LEARNING We now present our model-based deep reinforcement learning algorithm. We detail our learned dynamics function f θ (s t , a t ) in Sec.  ... 
arXiv:1708.02596v2 fatcat:zfjugkxaabcm7mjc7qw4oboxse

Overcoming Model Bias for Robust Offline Deep Reinforcement Learning [article]

Phillip Swazinna, Steffen Udluft, Thomas Runkler
2021 arXiv   pre-print
To improve robustness and stability of the learning process, we use dynamics models to assess policy performance instead of value functions, resulting in MOOSE (MOdel-based Offline policy Search with Ensembles  ...  State-of-the-art reinforcement learning algorithms mostly rely on being allowed to directly interact with their environment to collect millions of observations.  ...  Acknowledgements The project this paper is based on was supported with funds from the German Federal Ministry of Education and Research under project number 01 IS 18049 A.  ... 
arXiv:2008.05533v4 fatcat:r2dw73ki7jdklarbxtsjvxdd2e

A survey of benchmarking frameworks for reinforcement learning [article]

Belinda Stapelberg, Katherine M. Malan
2020 arXiv   pre-print
This paper provides an overview of different contributions to reinforcement learning benchmarking and discusses how they can assist researchers to address the challenges facing reinforcement learning.  ...  The survey aims to bring attention to the wide range of reinforcement learning benchmarking tasks available and to encourage research to take place in a standardised manner.  ...  Model-free and model-based reinforcement learning approaches There are different aspects of RL systems that can be learnt.  ... 
arXiv:2011.13577v1 fatcat:uxjtrzl3erb4hk2xcbi6s6zqyq

CURL: Contrastive Unsupervised Representations for Reinforcement Learning [article]

Aravind Srinivas, Michael Laskin, Pieter Abbeel
2020 arXiv   pre-print
We present CURL: Contrastive Unsupervised Representations for Reinforcement Learning.  ...  CURL outperforms prior pixel-based methods, both model-based and model-free, on complex tasks in the DeepMind Control Suite and Atari Games showing 1.9x and 1.2x performance gains at the 100K environment  ...  We benchmark extensively against both model-based and model-free algorithms in our experiments.  ... 
arXiv:2004.04136v4 fatcat:fek5n6xsn5f23efn2anivekvde

Baconian: A Unified Open-source Framework for Model-Based Reinforcement Learning [article]

Linsen Dong, Guanyu Gao, Xinyi Zhang, Liangyu Chen, Yonggang Wen
2021 arXiv   pre-print
Model-Based Reinforcement Learning (MBRL) is one category of Reinforcement Learning (RL) algorithms which can improve sampling efficiency by modeling and approximating system dynamics.  ...  INTRODUCTION Model-Based Reinforcement Learning (MBRL) is proposed to reduce sample complexity introduced by model-free Deep Reinforcement Learning (DRL) algorithms [12] .  ...  Since many model-based algorithms are built upon model-free algorithms, we also implement some popular model-free algorithms including DQN [7] , DDPG [11] , and PPO [14] in Baconian.  ... 
arXiv:1904.10762v4 fatcat:cs2i4eubzfbbfmdudwttrxdh4u

Google Research Football: A Novel Reinforcement Learning Environment

Karol Kurach, Anton Raichuk, Piotr Stańczyk, Michał Zając, Olivier Bachem, Lasse Espeholt, Carlos Riquelme, Damien Vincent, Marcin Michalski, Olivier Bousquet, Sylvain Gelly
2020 PROCEEDINGS OF THE THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE AND THE TWENTY-EIGHTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE  
We introduce the Google Research Football Environment, a new reinforcement learning environment where agents are trained to play football in an advanced, physics-based 3D simulator.  ...  Recent progress in the field of reinforcement learning has been accelerated by virtual learning environments such as video games, where novel algorithms and ideas can be quickly tested in a safe and reproducible  ...  We expect that these benchmark tasks will be useful for investigating current scientific challenges in reinforcement learning such as sampleefficiency, sparse rewards, or model-based approaches.  ... 
doi:10.1609/aaai.v34i04.5878 fatcat:b2tov7kieza3fajhsxavfni5qm

Bellman: A Toolbox for Model-Based Reinforcement Learning in TensorFlow [article]

John McLeod, Hrvoje Stojic, Vincent Adam, Dongho Kim, Jordi Grau-Moya, Peter Vrancx, Felix Leibfried
2021 arXiv   pre-print
In the past decade, model-free reinforcement learning (RL) has provided solutions to challenging domains such as robotics.  ...  benchmarks that share characteristics with real-world problems.  ...  Benchmarking model-based reinforcement learning. arXiv, 2019. URL https: //github.com/WilsonWangTHU/mbbl. Y Wu, E Mansimov, S Liao, R Grosse, and J Ba.  ... 
arXiv:2103.14407v2 fatcat:c7mzfbm7w5g4vapexxzj4p66li

High-Accuracy Model-Based Reinforcement Learning, a Survey [article]

Aske Plaat and Walter Kosters and Mike Preuss
2021 arXiv   pre-print
To reduce the number of environment samples, model-based reinforcement learning creates an explicit model of the environment dynamics.  ...  Deep reinforcement learning has shown remarkable success in the past few years.  ...  Acknowledgments We thank the members of the Leiden Reinforcement Learning Group, and especially Thomas Moerland and Mike Huisman, for many discussions and insights.  ... 
arXiv:2107.08241v1 fatcat:tma6xb2uy5fybjfhmzasfx2cta

A2CM: a new multi-agent algorithm

Gabor Paczolay, Istvan Harmati
2021 ACTA IMEKO  
One subcategory of reinforcement learning is multi-agent reinforcement learning, in which multiple agents are present in the world.  ...  <p class="Abstract">Reinforcement learning is currently one of the most researched fields of artificial intelligence.  ...  Apart from the aforementioned model-free reinforcement learning methods, there is also model-based reinforcement learning.  ... 
doi:10.21014/acta_imeko.v10i3.1023 fatcat:fiasvepa3rc7np4vhfwljp5bfq

HOList: An Environment for Machine Learning of Higher-Order Theorem Proving [article]

Kshitij Bansal, Sarah M. Loos, Markus N. Rabe, Christian Szegedy, and Stewart Wilcox
2019 arXiv   pre-print
We provide an open-source framework based on the HOL Light theorem prover that can be used as a reinforcement learning environment.  ...  We also present a deep reinforcement learning driven automated theorem prover, DeepHOL, with strong initial results on this benchmark.  ...  This paper provides a benchmark and reinforcement learning environment for theorem proving.  ... 
arXiv:1904.03241v3 fatcat:ih4fizuonrbvzk2oyu4pekhftu

Google Research Football: A Novel Reinforcement Learning Environment [article]

Karol Kurach, Anton Raichuk, Piotr Stańczyk, Michał Zając, Olivier Bachem, Lasse Espeholt, Carlos Riquelme, Damien Vincent, Marcin Michalski, Olivier Bousquet, Sylvain Gelly
2020 arXiv   pre-print
We introduce the Google Research Football Environment, a new reinforcement learning environment where agents are trained to play football in an advanced, physics-based 3D simulator.  ...  Recent progress in the field of reinforcement learning has been accelerated by virtual learning environments such as video games, where novel algorithms and ideas can be quickly tested in a safe and reproducible  ...  We expect that these benchmark tasks will be useful for investigating current scientific challenges in reinforcement learning such as sampleefficiency, sparse rewards, or model-based approaches.  ... 
arXiv:1907.11180v2 fatcat:3iwdl6e5cnfnvnpupwy6fz552y

ReLeTA: Reinforcement Learning for Thermal-Aware Task Allocation on Multicore [article]

Shi-Gui Yang, Yuan-Yuan Wang, Di Liu, Xu Jiang, Hui Fang, Yu Yang, Mingxiong Zhao
2019 arXiv   pre-print
In this paper, we propose ReLeTA: Reinforcement Learning based Task Allocation for temperature minimization.  ...  We design a new reward function and use a new state model to facilitate optimization of reinforcement learning algorithm.  ...  Reinforcement Learning Environment Agent State Reward Action S t S t R t R t A t A t Fig. 1: Reinforcement learning Different from the widely used supervised learning algorithms which learn a prediction  ... 
arXiv:1912.00189v1 fatcat:3em4azgvmzhrxpbexojffmvyie

safe-control-gym: a Unified Benchmark Suite for Safe Learning-based Control and Reinforcement Learning [article]

Zhaocong Yuan, Adam W. Hall, Siqi Zhou, Lukas Brunke, Melissa Greeff, Jacopo Panerati, Angela P. Schoellig
2022 arXiv   pre-print
Here, we propose a new open-source benchmark suite, called safe-control-gym, supporting both model-based and data-based control techniques.  ...  control, and reinforcement learning.  ...  reinforcement learning [4] .  ... 
arXiv:2109.06325v3 fatcat:q7dpbgtog5aqlgocuom2kdbwvq
« Previous Showing results 1 — 15 out of 89,308 results