Subjects
The subjects were 2 male rhesus monkeys (monkeys LP and GM, 7.8 and 5.6 kg, respectively). Body weight was measured once every 2 weeks on average throughout the experimental period. All experimental procedures were carried out in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals in the USA and were approved by the Animal Care and Use Committee of the National Institute of Radiological Sciences.
Measurement of blood osmolality
Blood samples (~0.5 mL/sample) were drawn from the saphenous vein via a venous catheter using an auto-blood sampling system (DR-II, Eicom Co.; in 7 sessions) or manually (in 3 sessions). The samples were stored at 4 °C for up to 3 h. After blood collection, blood osmolality was measured by using a freezing point method (Advance 3250, Advanced Instruments Inc.) on whole blood samples of 250 μL. The measurement error was ~2 mOsm/kgH2O. Since there was no significant difference in the osmolality level between serum and whole blood samples (0 ± 2.7 mOsm/kgH2O; difference ± SD; n = 9), we used whole blood samples to reduce the total sampling volume. The total amount of blood sampled never exceeded 4 mL in a day.
Behavioral task and testing procedure
We used the reward-size task (Fig. 1) (Minamimoto et al. 2009). A monkey initiated a trial by touching the bar in the chair; 100 ms later a visual cue (13° on a side), which will be described below, was presented at the center of the monitor. After 500 ms, a red target (0.5° on a side) also appeared at the center of the monitor. After a variable interval of 500, 750, 1,000, 1,250, or 1,500 ms, the target turned green, indicating that the monkey could release the bar to earn a liquid reward. If the monkey responded within 200–1,000 ms, the target turned blue, indicating that the trial had been completed correctly. In correct trials, a reward of 1, 2, 4, or 8 drops of water (1 drop = ~0.1 mL) was delivered immediately after the blue signal. Each reward size was selected randomly with equal probability. The visual cue presented at the beginning of the trial indicated the number of drops for the reward. An inter-trial interval (ITI) of 1 s was enforced before the next trial began. If the monkey released the bar before the green target appeared or within 200 ms after the green target appeared or failed to respond within 1 s after the green target appeared, we regarded the trial as an “error trial”; all visual stimuli disappeared, the trial was terminated immediately, and, after the 1-s ITI, the trial was repeated. In this task, our behavioral measurement for the motivational value of outcome was the proportion of error trials. Since the monkeys were able to perform the task correctly in nearly every trial when the reward size was not assigned, an error trial is regarded as a trial in which the monkeys are not sufficiently motivated to correctly release the bar (Minamimoto et al. 2009). Before each testing session, the monkeys were subject to ~22 h of water restriction in their home cage. Each testing session continued for 120 min. Before the end of the session, the monkey received a sufficient volume of water (~300–500 mL) and stopped working spontaneously (usually at ~100 min). If the monkeys were allowed free access to water immediately after the session (i.e., after all behavioral experiments, see below), they still drank water, indicating that they were not completely satiated for water at the end of the session. During the behavioral testing session, blood samples were taken every 15 min (total of 8 samples; from 0 to 105 min). After all behavioral experiments, the monkeys were allowed free access to water. To measure the baseline osmolality level, we collected blood samples on 3 consecutive days at more than 2 months following the end of the experiment. In order to assess the natural fluctuations in blood osmolality, a blood sample was taken every 30 min (total of 5 samples; from 0 to 120 min) while a water-restricted monkey sat on a chair without any behavioral testing or access to water (3 sessions).
Data analysis and model fitting
All data and statistical analyses were performed using the “R” statistical computing environment (R Development Core Team 2004). To assess the relationship between blood osmolality and cumulative reward, we performed multiple linear regression analysis. The osmolality level (O
SM) was fitted by a liner regression model:
$$ O_{\text{SM}} = b_{0} + b_{\text{cum}} R_{\text{cum}} + b_{\text{sub}} S_{\text{ub}} , $$
where R
cum is the cumulative reward (mL), S
ub is the factor of subjects (0 and 1 for Monkeys LP or GM, respectively), b
cum and b
sub are the regression coefficient for cumulative reward and subject, respectively, and b
0 is the intercept.
To assess the relationship between blood osmolality and task performance, we calculated the error rate for each drop-size condition within a 20-min time window around the blood sampling period (−12.5~7.5 min at each sample) (cf. Fig. 2). Each sample period contained 50 ± 23 and 54 ± 18 trials, in Monkeys LP and GM, respectively (mean ± SD).
We previously demonstrated that the error rate in the reward-size task has an inverse relationship with reward size: that is, E = 1/aR, where R is the reward size, a is a constant parameter, and E is the error rate (%) of the monkeys in trials with reward size R (Minamimoto et al. 2009). Here, the motivational value of the expected outcome R′ is inferred as being discounted as reward accumulation: \( R^{\prime } = R \cdot F_{S} (S) = R\cdot\frac{{1 + e^{{ - (s - s_{0} )/\sigma }} }}{{1 + e^{{s_{0} /\sigma }} }} \), where F
S
(S) is the devaluation function of the normalized accumulated reward, S, S
0 is the inflection point of the sigmoid, and σ quantifies the width of the sigmoid around S
0. The normalized accumulated reward, S, which ranged from 0 (at the beginning of the session) to 1 (at the end of the session), was defined as the ratio between the amount of total reward delivered up to time t, R
cum(t), and the total amount of reward, R
cum max, delivered in the entire session: \( S = \frac{{R_{\text{cum}} (t)}}{{R_{{{\text{cum }}\max }} }} \). Using the heuristic devaluation function, the inverse model was modified as: \( E = \frac{1}{{aR \cdot F_{S} \left( S \right)}} \).
In this study, we tried to model the motivational value as being discounted as a function of the blood osmolality shift: \( R^{\prime } = R \cdot F_{\text{OSM}} (S) = R \cdot \frac{{O_{\text{SM}} (S) - \rho }}{{O_{\max } - \rho }} \), where F
OSM(S) is the osmolality devaluation function, which originated from the average blood osmolality change along with the reward consumption, O
SM(S), O
max is the maximum value of O
SM(S), and ρ is a free parameter corresponding to the threshold of the osmolality level, where the motivational value would be 0. The average blood osmolality change, O
SM(S), was obtained individually as follows: Data for the changes in blood osmolality along with reward accumulation were linearly interpolated, and they were then averaged across sessions along with the cumulative reward from 0 to the smallest R
cum max among all sessions (thick lines in Fig. 2). It was then normalized by the smallest R
cum max. Using the osmolality devaluation function, the inverse model was modified as: \( E = \frac{1}{{aR \cdot F_{\text{OSM}} (S)}} \).
To examine the changes in the error rate along with water accumulation, the behavioral data from each session were divided into consecutive 9-quantiles with respect to the value of S; the error rate was evaluated in every 2 consecutive 9-quantiles, and thus, we obtained the error rate for 8 sub-sessions. These were then averaged across sessions for each sub-session. We fitted the models to these error data (4 reward size × 8 sub-sessions) with the least square minimization procedure described earlier (Minamimoto et al. 2009). The coefficient of determination (R
2) was reported as a measure of goodness of fit. Since these 2 models have a different number of free parameters, we used the Bayesian information criterion (BIC; BIC = − 2 × log-likelihood + klogN, where k is the number of free parameters and N is the number of data points) to compare the goodness of fit in each model.