A Complete Serial Compound Temporal Difference Simulator for Compound stimuli, Configural cues and Context representation
- 702 Downloads
Temporal Difference (TD) (Sutton and Barto, 19871) is a real-time error correction model in which learning is computed according to the difference between successive predictions and a discount factor that decays exponentially, reflecting the fact that predictors closer to a reinforcer (the unconditioned stimulus, US) are based on more recent information and thus more accurate. In addition, an eligibility trace modulates the extent to which the stimulus predictive value is susceptible of changing on any given time-step.
The way stimuli are represented affects significantly how learning is implemented in TD. The Complete Serial Compound representation (CSC) (Moore et al., 19982) has become standard in studies of dopamine function (Schultz 20103) and is central in investigating reward-based models of schizophrenia (Smith et al., 20064). This CSC representation is at the core of the TD Simulator that we briefly describe in this item. The simulator has been built upon the graphical...