243 Hits in 4.6 sec

A Survey and Critique of Multiagent Deep Reinforcement Learning [article]

Pablo Hernandez-Leal, Bilal Kartal, Matthew E. Taylor
<span title="2019-06-22">2019</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
The primary goal of this article is to provide a clear overview of current multiagent deep reinforcement learning (MDRL) literature.  ...  Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has led to a dramatic increase in the number of applications and methods.  ...  and three anonymous reviewers whose comments and suggestions increased the quality of this work. The title of this work makes a clear reference to previous seminal MAL works [2, 354].  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1810.05587v2">arXiv:1810.05587v2</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/h4ei5zx2xfa7xocktlefjrvef4">fatcat:h4ei5zx2xfa7xocktlefjrvef4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200928115939/https://arxiv.org/pdf/1810.05587v2.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Multi-UAV Conflict Resolution with Graph Convolutional Reinforcement Learning

Ralvi Isufaj, Marsel Omeri, Miquel Angel Piera
<span title="2022-01-09">2022</span> <i title="MDPI AG"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/smrngspzhzce7dy6ofycrfxbim" style="color: black;">Applied Sciences</a> </i> &nbsp;
In this paper, we model multi-UAV conflict resolution as a multiagent reinforcement learning problem.  ...  The model is evaluated in scenarios with 3 and 4 present agents. Results show that agents are able to successfully solve the multi-UAV conflicts through a cooperative strategy.  ...  A survey and critique of multiagent deep reinforcement learning. Auton. Agents Multi-Agent Syst. 2019, 33, 750–797. [CrossRef] 34. Bellman, R.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.3390/app12020610">doi:10.3390/app12020610</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zv74eywngfh53h2chyxk4y4vke">fatcat:zv74eywngfh53h2chyxk4y4vke</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220111162405/https://mdpi-res.com/d_attachment/applsci/applsci-12-00610/article_deploy/applsci-12-00610-v2.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

A Survey on Transfer Learning for Multiagent Reinforcement Learning Systems

Felipe Leno Da Silva, Anna Helena Reali Costa
<span title="2019-03-11">2019</span> <i title="AI Access Foundation"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/4ax4efcwajcgvidb6hcg6mwx4a" style="color: black;">The Journal of Artificial Intelligence Research</a> </i> &nbsp;
This survey provides a unifying view of the literature on knowledge reuse in multiagent RL.  ...  Multiagent Reinforcement Learning (RL) solves complex tasks that require coordination with other agents through autonomous exploration of the environment.  ...  Taylor for the collaboration in previous versions of this survey.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1613/jair.1.11396">doi:10.1613/jair.1.11396</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/mn4gw6oh5zgszl6l53fgesei5i">fatcat:mn4gw6oh5zgszl6l53fgesei5i</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20190429063414/https://www.jair.org/index.php/jair/article/download/11396/26482" title="fulltext PDF download">Web Archive [PDF]</a>

Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion

Siyuan Ding, Shengxiang Li, Guangyi Liu, Ou Li, Ke Ke, Yijie Bai, Weiye Chen, Giuseppe Quero
<span title="2021-12-08">2021</span> <i title="Hindawi Limited"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/zlknqk4ahbcsthbafxw55emm7a" style="color: black;">Journal of Sensors</a> </i> &nbsp;
To overcome these problems, in this paper, we propose a model-free and fully decentralized actor-critic multiagent reinforcement learning algorithm based on message diffusion.  ...  The exponential explosion of joint actions and massive data collection are two main challenges in multiagent reinforcement learning algorithms with centralized training.  ...  Taylor, “A very condensed survey and critique of multiagent deep reinforcement  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1155/2021/8739206">doi:10.1155/2021/8739206</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/z2kvi3ym7ndjdgwqea5i4lpbai">fatcat:z2kvi3ym7ndjdgwqea5i4lpbai</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211211064940/https://downloads.hindawi.com/journals/js/2021/8739206.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Advancing Automation in Digital Forensic Investigations Using Machine Learning Forensics [chapter]

Salman Iqbal, Soltan Abed Alharbi
<span title="2019-12-17">2019</span> <i title="IntechOpen"> Digital Forensic Science [Working Title] </i> &nbsp;
As a result of this transformation, we are becoming the soft target of various types of cybercrimes.  ...  Laptops, tablets, smartphones and wearable devices are the major source of this digital data transformation and are becoming the core part of our daily life.  ...  We present the latest surveys in this field and give critique comparisons of these approaches.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.5772/intechopen.90233">doi:10.5772/intechopen.90233</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/c2jl6pwvrndrjlqy7x6ixoe3zu">fatcat:c2jl6pwvrndrjlqy7x6ixoe3zu</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200108015958/https://api.intechopen.com/chapter/pdf-download/70281.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

2020 Index IEEE Transactions on Systems, Man, and Cybernetics: Systems Vol. 50

<span title="">2020</span> <i title="Institute of Electrical and Electronics Engineers (IEEE)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/whrz22ivnbhwbbeoexygwc4ybq" style="color: black;">IEEE Transactions on Systems, Man &amp; Cybernetics. Systems</a> </i> &nbsp;
The primary entry includes the coauthors' names, the title of the paper or other item, and its location, specified by the publication abbreviation, year, month, and inclusive pagination.  ...  The Subject Index contains entries describing the item under all appropriate subject headings, plus the first author's name, the publication abbreviation, month, and year, and inclusive pages.  ...  ., +, TSMC June 2020 2220-2230 cations: A Survey of Trends and Techniques.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1109/tsmc.2021.3054492">doi:10.1109/tsmc.2021.3054492</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/zartzom6xvdpbbnkcw7xnsbeqy">fatcat:zartzom6xvdpbbnkcw7xnsbeqy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210130184714/https://ieeexplore.ieee.org/ielx7/6221021/9261964/09336755.pdf?tp=&amp;arnumber=9336755&amp;isnumber=9261964&amp;ref=" title="fulltext PDF download">Web Archive [PDF]</a>

Deep Reinforcement Learning Versus Evolution Strategies: A Comparative Survey [article]

Amjad Yousef Majid, Serge Saaybi, Tomas van Rietbergen, Vincent Francois-Lavet, R Venkatesha Prasad, Chris Verhoeven
<span title="2021-09-28">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Deep Reinforcement Learning (DRL) and Evolution Strategies (ESs) have surpassed human-level control in many sequential decision-making problems, yet many open challenges still exist.  ...  Finally, to have an indication about how they compare in real-world applications, a survey of the literature for the set of applications they support is provided.  ...  Fig. 1: The structure of the survey. Fig. 2: Iteration loops of (Deep) Reinforcement Learning and Evolutionary Strategies.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2110.01411v1">arXiv:2110.01411v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/nw47ududyndyljlh4nx2gm73jq">fatcat:nw47ududyndyljlh4nx2gm73jq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20211007045212/https://arxiv.org/pdf/2110.01411v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Towards Resilient Artificial Intelligence: Survey and Research Issues [article]

Oliver Eigner, Sebastian Eresheim, Peter Kieseberg, Lukas Daniel Klausner, Martin Pirker, Torsten Priebe, Simon Tjoa, Fiammetta Marulli, Francesco Mercaldo
<span title="2021-09-18">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Considering the particular nature of AI, and machine learning (ML) in particular, this paper provides an overview of the emerging field of resilient AI and presents research issues the authors identify  ...  Their resilience against attacks and other environmental influences needs to be ensured just like for other IT assets.  ...  Figure 1: Survey dimensions and identified research issues (Resilience Testing and Monitoring; Deep Learning; Reinforcement Learning; Model Evaluation and Operations; Data Preparation and Management)  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2109.08904v1">arXiv:2109.08904v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vadq2vohljhxpbcokklir4buee">fatcat:vadq2vohljhxpbcokklir4buee</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210923115549/https://arxiv.org/ftp/arxiv/papers/2109/2109.08904.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Reports of the AAAI 2014 Conference Workshops

Stefano V. Albrecht, André M. S. Barreto, Darius Braziunas, David L. Buckeridge, Heriberto Cuayáhuitl, Nina Dethlefs, Markus Endres, Amir-massoud Farahmand, Mark Fox, Lutz Frommberger, Sam Ganzfried, Yolanda Gil (+22 others)
<span title="2015-03-25">2015</span> <i title="Association for the Advancement of Artificial Intelligence (AAAI)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/27wksbinzzhjfow2wuy6m2iefm" style="color: black;">The AI Magazine</a> </i> &nbsp;
The AAAI-14 workshop program included fifteen workshops covering a wide range of topics in artificial intelligence.  ...  and Imperfect Information; Discovery Informatics; Incentives and Trust in Electronic Communities; Intelligent Cinematography and Editing; Machine Learning for Interactive Systems: Bridging the Gap between  ...  Buckeridge, and John S. Brownstein served as cochairs of this workshop. The papers of the symposium were published as AAAI Press Technical Report WS-14-14.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1609/aimag.v36i1.2575">doi:10.1609/aimag.v36i1.2575</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vcuegeptjzdpfh4bgbi2ewjcg4">fatcat:vcuegeptjzdpfh4bgbi2ewjcg4</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20180721181005/https://aaai.org/ojs/index.php/aimagazine/article/download/2575/2474" title="fulltext PDF download">Web Archive [PDF]</a>

A Conceptual Framework for Externally-influenced Agents: An Assisted Reinforcement Learning Review [article]

Adam Bignold, Francisco Cruz, Matthew E. Taylor, Tim Brys, Richard Dazeley, Peter Vamplew, Cameron Foale
<span title="2020-07-03">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
A long-term goal of reinforcement learning agents is to be able to perform tasks in complex real-world scenarios.  ...  In this work, we propose a conceptual framework and taxonomy for assisted reinforcement learning, aimed at fostering such collaboration by classifying and comparing various methods that use external information  ...  Acknowledgments This work has been partially supported by the Australian Government Research Training Program (RTP) and the RTP Fee-Offset Scholarship through Federation University Australia.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2007.01544v1">arXiv:2007.01544v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/iepfl62fyfhudghvjjqhuunjqq">fatcat:iepfl62fyfhudghvjjqhuunjqq</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200722002104/https://arxiv.org/pdf/2007.01544v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Deep Reinforcement Learning [article]

Yuxi Li
<span title="2018-10-15">2018</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
We start with background of artificial intelligence, machine learning, deep learning, and reinforcement learning (RL), with resources.  ...  We discuss deep reinforcement learning in an overview style. We draw a big picture, filled with details.  ...  The authors propose policy-space response oracle (PSRO), and its approximation, deep cognitive hierarchies (DCH), to compute best responses to a mixture of policies using deep RL, and to compute new meta-strategy  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/1810.06339v1">arXiv:1810.06339v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/kp7atz5pdbeqta352e6b3nmuhy">fatcat:kp7atz5pdbeqta352e6b3nmuhy</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200823034914/https://arxiv.org/pdf/1810.06339v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Reinforcement learning based recommender systems: A survey [article]

M. Mehdi Afsar, Trafford Crump, Behrouz Far
<span title="2021-01-15">2021</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
In this paper, a survey on reinforcement learning based recommender systems (RLRSs) is presented.  ...  Accordingly, it can be formulated as a Markov decision process (MDP) and reinforcement learning (RL) methods can be employed to solve it.  ...  learning, and deep reinforcement learning.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2101.06286v1">arXiv:2101.06286v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/e234kqjtujazpdvt2wjek4et4i">fatcat:e234kqjtujazpdvt2wjek4et4i</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20210121032319/https://arxiv.org/pdf/2101.06286v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

Human-in-the-Loop Methods for Data-Driven and Reinforcement Learning Systems [article]

Vinicius G. Goecks
<span title="2020-08-30">2020</span> <i > arXiv </i> &nbsp; <span class="release-stage" >pre-print</span>
Recent successes combine reinforcement learning algorithms and deep neural networks, despite reinforcement learning not being widely applied to robotics and real world scenarios.  ...  Results presented in this work show that the reward signal that is learned based upon human interaction accelerates the rate of learning of reinforcement learning algorithms and that learning from a combination  ...  Agents and MultiAgent Systems.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener" href="https://arxiv.org/abs/2008.13221v1">arXiv:2008.13221v1</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/aofoenmwcvckvagbttrkskevty">fatcat:aofoenmwcvckvagbttrkskevty</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20200903024331/https://arxiv.org/pdf/2008.13221v1.pdf" title="fulltext PDF download">Web Archive [PDF]</a>

AAAI News

Carol Hamilton
<span title="">2003</span> <i title="Association for the Advancement of Artificial Intelligence (AAAI)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/27wksbinzzhjfow2wuy6m2iefm" style="color: black;">The AI Magazine</a> </i> &nbsp;
of planning and learning algorithms, and multiagent robot teams for  ...  learning and stochastic planning, to dialogue agents, and to the theory  ...  Bob had deep interests in both the technical and entrepreneurial aspects of AI.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1609/aimag.v24i2.1711">doi:10.1609/aimag.v24i2.1711</a> <a target="_blank" rel="external noopener" href="https://dblp.org/rec/journals/aim/Hamilton03a.html">dblp:journals/aim/Hamilton03a</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/vsylfwsqijgtdkpmw7c2wfzdxm">fatcat:vsylfwsqijgtdkpmw7c2wfzdxm</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20151022034702/http://www.aaai.org/ojs/index.php/aimagazine/article/download/1711/1609" title="fulltext PDF download">Web Archive [PDF]</a>

The Association for the Advancement of Artificial Intelligence 2020 Workshop Program

Grace Bang, Guy Barash, Ryan Bea, Jacques Cali, Mauricio Castillo-Effen, Xin Chen, Niyati Chhaya, Rachel Cummings, Rohan Dhoopar, Sebastijan Dumanci, Huáscar Espinoza, Eitan Farchi (+29 others)
<span title="2020-12-28">2020</span> <i title="Association for the Advancement of Artificial Intelligence (AAAI)"> <a target="_blank" rel="noopener" href="https://fatcat.wiki/container/27wksbinzzhjfow2wuy6m2iefm" style="color: black;">The AI Magazine</a> </i> &nbsp;
The Association for the Advancement of Artificial Intelligence 2020 Workshop Program included twenty-three workshops covering a wide range of topics in artificial intelligence.  ...  This report contains the required reports, which were submitted by most, but not all, of the workshop chairs.  ...  The concrete techniques ranged from symbolic logical reasoning to neural network-based deep learning and reinforcement learning.  ... 
<span class="external-identifiers"> <a target="_blank" rel="external noopener noreferrer" href="https://doi.org/10.1609/aimag.v41i4.7398">doi:10.1609/aimag.v41i4.7398</a> <a target="_blank" rel="external noopener" href="https://fatcat.wiki/release/r6bw77vy4zgmrbgyuvsjs5knta">fatcat:r6bw77vy4zgmrbgyuvsjs5knta</a> </span>
<a target="_blank" rel="noopener" href="https://web.archive.org/web/20220309104254/https://ojs.aaai.org/index.php/aimagazine/article/download/7398/14939" title="fulltext PDF download">Web Archive [PDF]</a>
Showing results 1 &mdash; 15 out of 243 results