Filters








61 Hits in 6.7 sec

Micro-Objective Learning : Accelerating Deep Reinforcement Learning through the Discovery of Continuous Subgoals [article]

Sungtae Lee, Sang-Woo Lee, Jinyoung Choi, Dong-Hyun Kwak and Byoung-Tak Zhang
2017 arXiv   pre-print
However, discovering subgoals online is too expensive to be used to learn options in large state spaces. We propose Micro-objective learning (MOL) to solve this problem.  ...  Recently, reinforcement learning has been successfully applied to the logical game of Go, various Atari games, and even a 3D game, Labyrinth, though it continues to have problems in sparse reward settings  ...  Acknowledgements The authors would like to thank Heidi Lynn Tessmer for discussion and helpful comments.  ... 
arXiv:1703.03933v1 fatcat:murlzmc4fvfgljpzozhu5wyi34

Vision-Based Robot Navigation through Combining Unsupervised Learning and Hierarchical Reinforcement Learning

Xiaomao Zhou, Tao Bai, Yanbin Gao, Yuntao Han
2019 Sensors  
Finally, goal-directed navigation is performed using reinforcement learning in continuous state spaces which are represented by the population activities of place cells.  ...  In particular, considering that the topological map provides a natural hierarchical representation of the environment, hierarchical reinforcement learning (HRL) is used to exploit this hierarchy to accelerate  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/s19071576 fatcat:x6tfzxa2ifc75hw3usg5x7dhb4

Deep Reinforcement Learning [article]

Yuxi Li
2018 arXiv   pre-print
We start with background of artificial intelligence, machine learning, deep learning, and reinforcement learning (RL), with resources.  ...  We discuss deep reinforcement learning in an overview style. We draw a big picture, filled with details.  ...  The authors propose policy-space response oracle (PSRO), and its approximation, deep cognitive hierarchies (DCH), to compute best responses to a mixture of policies using deep RL, and to compute new meta-strategy  ... 
arXiv:1810.06339v1 fatcat:kp7atz5pdbeqta352e6b3nmuhy

Towards an integration of deep learning and neuroscience [article]

Adam Marblestone, Greg Wayne, Konrad Kording
2016 arXiv   pre-print
Such a heterogeneously optimized system, enabled by a series of interacting cost functions, serves to make learning data-efficient and precisely targeted to the needs of the organism.  ...  Here we think about the brain in terms of these ideas.  ...  We thank Miles Brundage for an excellent Twitter feed of deep learning papers.  ... 
arXiv:1606.03813v1 fatcat:tmmholydqbcplbc5ihg76yip6e

Toward an Integration of Deep Learning and Neuroscience

Adam H. Marblestone, Greg Wayne, Konrad P. Kording
2016 Frontiers in Computational Neuroscience  
Such a heterogeneously optimized system, enabled by a series of interacting cost functions, serves to make learning data-efficient and precisely targeted to the needs of the organism.  ...  In support of these hypotheses, we argue that a range of implementations of credit assignment through multiple layers of neurons are compatible with our current knowledge of neural circuitry, and that  ...  AUTHOR CONTRIBUTION All authors contributed ideas and co-wrote the paper. ACKNOWLEDGMENTS We thank Ken Hayworth for key discussions that led to this paper. We thank Ed Boyden, Chris Eliasmith, Gary  ... 
doi:10.3389/fncom.2016.00094 pmid:27683554 pmcid:PMC5021692 fatcat:yikwc4h5yvfj7gwzlimtw5n6ai

Towards an integration of deep learning and neuroscience [article]

Adam Henry Marblestone, Greg Wayne, Konrad P Kording
2016 bioRxiv   pre-print
Such a heterogeneously optimized system, enabled by a series of interacting cost functions, serves to make learning data-efficient and precisely targeted to the needs of the organism.  ...  Here we think about the brain in terms of these ideas.  ...  The machine-learning field has recently been tackling the question of credit assignment in deep reinforcement learning.  ... 
doi:10.1101/058545 fatcat:4ryejpe2tnf7dgoaqhoastoiya

Transition scale-spaces: A computational theory for the discretized entorhinal cortex [article]

Nicolai Waniek
2019 bioRxiv   pre-print
The resultant data structure is a scale-space that learns approximate transitions and has an optimal scale-increment of √2 between subsequent scales.  ...  In addition, the scale-space can be used to find short-cuts, shown in a simulated Morris water maze experiment. Finally, the results provoke a novel understanding of Theta Phase Precession (TPP).  ...  Finally, thanks to Maike Krause for significant contributions to the structure of the manuscript.  ... 
doi:10.1101/543801 fatcat:wvs5mf2scrahdjewy7narafa4y

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This copy of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of Philosophy  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4399748 fatcat:63ggmnviczg6vlnqugbnrexsgy

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This version of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4598227 fatcat:hm2ksetmsvf37adjjefmmbakvq

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This version of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4591029 fatcat:zn2hvfyupvdwlnvsscdgswayci

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This copy of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of Philosophy  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4413249 fatcat:35qbhenysfhvza2roihx52afuy

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This copy of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of Philosophy  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4429792 fatcat:qs6yuapx4vdbdmwna7ix7nnwty

Advances in Electron Microscopy with Deep Learning

Jeffrey Ede
2020 Zenodo  
This doctoral thesis covers some of my advances in electron microscopy with deep learning.  ...  This copy of my thesis is typeset for online dissemination to improve readability, whereas the thesis submitted to the University of Warwick in support of my application for the degree of Doctor of Philosophy  ...  In addition, part of the text in section 1.2 is adapted from our earlier work with permission 201 under a Creative Commons Attribution 4.0 73 license.  ... 
doi:10.5281/zenodo.4415407 fatcat:6dejwzzpmfegnfuktrld6zgpiq

Papers Presented At The 28Th Annual Meeting Of The Psychonomic Society The Seattle Sheraton Hotel & Towers, Seattle, Washington November 6, 7, 8, 1987

1987 Bulletin of the Psychonomic Society  
Perception of self-target distance while walking (CEs) was a negatively accelerated function, and the time delay doubled the rate of negative acceleration.  ...  subgoals and methods.  ...  The present study traces the strategies of undergraduates in hypothesis testing and rule discovery tasks.  ... 
doi:10.3758/bf03330365 fatcat:kzkxlxwi7ngsfok2lydumy4vey

From Homo Sapiens to Robo Sapiens: The Evolution of Intelligence

Anat Ringel Raveh, Boaz Tamir
2018 Information  
The last step took place several years ago with the increased progress in deep neural network technology.  ...  In this paper, we present a review of recent developments in artificial intelligence (AI) towards the possibility of an artificial intelligence equal that of human intelligence.  ...  Conflicts of Interest: The authors declare no conflict of interest.  ... 
doi:10.3390/info10010002 fatcat:vd2zzph52naddi76y2xqcdfmtu
« Previous Showing results 1 — 15 out of 61 results