11,334 Hits in 2.6 sec

Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control

Nathaniel D Daw, Yael Niv, Peter Dayan
2005 Nature Neuroscience  
Here, we consider dual-action choice systems from a normative perspective, using the computational theory of reinforcement learning.  ...  We identify a key trade-off pitting computational simplicity against the flexible and statistically efficient use of experience.  ...  By contrast, we suggest that the prefrontal circuit subserves a modelbased reinforcement learning method.  ... 
doi:10.1038/nn1560 pmid:16286932 fatcat:gleux5kgzvdwjk4jg44qzxlfoy

Evidence for long-term spatial memory in a parid

Timothy C. Roth, Lara D. LaDage, Vladimir V. Pravosudov
2011 Animal Cognition  
We speculate that this ability may potentially be useful in relocating caches if reinforced by repeated visits.  ...  Future studies are necessary to test whether our results were speciWcally due to multiple reinforcements of the food-containing location and whether parids may have similar memory longevity during food-caching  ...  This research was supervised by the University of Nevada, Reno, IACUC (protocol #A05/06-35) and followed all federal and local guidelines for the use of animals in research.  ... 
doi:10.1007/s10071-011-0440-3 pmid:21773746 fatcat:h453efttvrakzbd4gabcngvv7a

Page 1937 of Psychological Abstracts Vol. 84, Issue 5 [page]

1997 Psychological Abstracts  
Learning using faces of animals of another unfamiliar breed was also signifi- cantly better than for symbols but was significantly worse than that seen using faces of a familiar breed.  ...  —Investigated the speed with which sheep learn to dis- criminate between familiar and unfamiliar faces, as opposed to symbols, to gain a food reward using a Y-maze apparatus.  ... 

Model and Machine Learning based Caching and Routing Algorithms for Cache-enabled Networks [article]

Adita Kulkarni, Anand Seetharam
2020 arXiv   pre-print
We then discuss the applicability of multiple machine learning models, specifically reinforcement learning, deep learning, transfer learning and probabilistic graphical models for the caching and routing  ...  In this paper, we compare and contrast model-based and machine learning approaches for designing caching and routing strategies to improve cache network performance (e.g., delay, hit rate).  ...  We discuss the potential benefits of multiple different classes of machine learning algorithms, in particular reinforcement learning, deep learning, deep reinforcement learning, transfer learning, and  ... 
arXiv:2004.06787v1 fatcat:tp4f2xhpg5g63fomgmx5esiqmy

Evaluation of the hypothesis that phasic dopamine constitutes a cached-value signal

Melissa J. Sharpe, Geoffrey Schoenbaum
2018 Neurobiology of Learning and Memory  
reinforcement learning signal.  ...  The phasic dopamine error signal is currently argued to be synonymous with the prediction error in Sutton and Barto (1987 Barto ( , 1998 model-free reinforcement learning algorithm (Schultz et al., 1997  ...  value learning, as described in model free reinforcement learning algorithms (Sutton & Barto, 1981 , 1987 .  ... 
doi:10.1016/j.nlm.2017.12.002 pmid:29269085 pmcid:PMC6136434 fatcat:btizu5lptzblpck3yucifwykmm

Interest Forwarding in Named Data Networking Using Reinforcement Learning

Olumide Akinwande
2018 Sensors  
We propose a novel adaptive forwarding strategy using reinforcement learning with the random neural network (NDNFS-RLRNN), which leverages the routing information and actively seeks new delivery paths  ...  from the nearest caches by flooding requests.  ...  NDN Forwarding with Reinforcement Learning Using the RNN We present NDNFS-RLRNN that uses reinforcement learning (RL) with the RNN for the strategy module per prefix in the NDN architecture, which operates  ... 
doi:10.3390/s18103354 fatcat:3c63dbozpvfinar3xd6xuzy36y

A DQN-Based Cache Strategy for Mobile Edge Networks

Siyuan Sun, Junhua Zhou, Jiuxing Wen, Yifei Wei, Xiaojun Wang
2022 Computers Materials & Continua  
deep reinforcement learning algorithm.  ...  The deep neural network (DNN) and Q-learning algorithm are combined to design a deep reinforcement learning framework named as the deep-Q neural network (DQN), in which the DNN is adopted to represent  ...  [27, 28] suggested a joint optimization for edge resource allocating problem, the proposed strategy uses the model-free actor-critic reinforcement learning to solve the joint optimization problems of  ... 
doi:10.32604/cmc.2022.020471 fatcat:5ht42r5pmvg3hggqrms5hoo5eu

Are animals stuck in time?

William A. Roberts
2002 Psychological bulletin  
A classic variable in early theories of learning was the delay of reinforcement.  ...  Pavlov suggested that withholding the US (food) led to the growth of an inhibitory process that blocked the CR.  ... 
doi:10.1037/0033-2909.128.3.473 pmid:12002698 fatcat:3gz4z3oaarhp5gmg6kc24zz2om

Are animals stuck in time?

William A. Roberts
2002 Psychological bulletin  
A classic variable in early theories of learning was the delay of reinforcement.  ...  Pavlov suggested that withholding the US (food) led to the growth of an inhibitory process that blocked the CR.  ... 
doi:10.1037//0033-2909.128.3.473 fatcat:bok3x4tdvzbzxmuwk5koxn4e4m

Deep Reinforcement Learning for Adaptive Caching in Hierarchical Content Delivery Networks [article]

Alireza Sadeghi, Gang Wang, Georgios B. Giannakis
2019 arXiv   pre-print
To model the two-way interactive influence between caching decisions at the parent and leaf nodes, a reinforcement learning framework is put forth.  ...  To handle the large continuous state space, a scalable deep reinforcement learning approach is pursued.  ...  This two-time scale caching task was cast in this paper using a reinforcement learning approach.  ... 
arXiv:1902.10301v2 fatcat:vqjy7jgolzdkjhufhmvc5mer6q

Enhanced dorsolateral striatal activity in drug use: The role of outcome in stimulus–response associations

Noam Schneck, Paul Vezina
2012 Behavioural Brain Research  
We describe how caching of reward value and Pavlovian to instrumental transfer can provide mechanisms for past and current reward values to regulate the performance of S-R habits.  ...  This effect has been interpreted in light of evidence that this brain region supports the generation of habitual stimulus-response (S-R) based behaviors to propose the idea that prolonged drug use leads  ...  These results suggest that the memory of the low value food pellets learned during exposure in the feeding cage was able to influence responding during the subsequent extinction test.  ... 
doi:10.1016/j.bbr.2012.07.042 pmid:22884607 pmcid:PMC3448372 fatcat:qreakrqcw5cb3dlovo5qgghkzq

Instrumental Learning [chapter]

Wendy A. Williams
2017 Encyclopedia of Animal Cognition and Behavior  
Many broad mechanisms have been suggested including the use of olfaction, a biological compass, or the use of landmarks or beacons.  ...  However, when a reinforcer from one system is used to reinforce a behavior from an incompatible system, learning is inhibited.  ... 
doi:10.1007/978-3-319-47829-6_1114-1 fatcat:r7zsh4zwsjeevf3yjbyx3dvruy

Q-Selector-Based Prefetching Method for DRAM/NVM Hybrid Main Memory System

Jeong-Geun Kim, Shin-Dug Kim, Su-Kyung Yoon
2020 Electronics  
The Q-selector-based prefetching method is based on the Q-learning method, one of the reinforcement learning algorithms, which determines a near-optimal prefetcher for an application's current running  ...  For this, our model analyzes real-time performance status to set the criteria for the Q-learning method.  ...  The basic elements of reinforcement learning method are: In this paper, we suggest a DRAM-cache-based PCM hybrid main memory system with an intelligent prefetcher based on machine learning, which adopts  ... 
doi:10.3390/electronics9122158 fatcat:tsrauw6it5ayxjozoste4ovz6a

Page 4539 of Psychological Abstracts Vol. 79, Issue 11 [page]

1992 Psychological Abstracts  
—Describes a female chimpanzee that could learn the use of personal pronouns in a conditional matching-to-sample task.  ...  Without putting their methodology or their results into question, analysis of the reinforcement conditions they used indicates that what was ac- tually examined was not transitive inference and that this  ... 

The impact of orbitofrontal dysfunction on cocaine addiction

Federica Lucantonio, Thomas A Stalnaker, Yavin Shaham, Yael Niv, Geoffrey Schoenbaum
2012 Nature Neuroscience  
This evidence suggests that cocaine-induced changes in orbitofrontal cortex disrupt the representation of states and transition functions that form the basis of flexible and adaptive 'model-based' behavioral  ...  A set of reinforcement learning methods that use prediction errors to estimate and store scalar cue or action values from experience.  ...  A set of reinforcement learning methods in which an internal model of the environment is learned and used to evaluate available actions or cues on the basis of their potential outcomes.  ... 
doi:10.1038/nn.3014 pmid:22267164 pmcid:PMC3701259 fatcat:orpo4vqfzjeinagliwsufmfnta
« Previous Showing results 1 — 15 out of 11,334 results