Prepulse inhibition (PPI) of the acoustic startle reflex is an operational measure of sensorimotor gating. It refers to the reduction of the magnitude of the startle response when a weak prepulse precedes the startle-eliciting pulse stimulus (Graham, 1975; Hoffman & Searle, 1965). Perception of the prepulse stimulus is believed to trigger an automatic pre-attentive gating mechanism that inhibits or ‘gates’ the processing of the subsequent pulse stimulus in order to protect the processing of the initial prepulse stimulus from distraction and interference by the pulse stimulus (Graham, 1992). While the expression of PPI is typically measured within sub-second time windows and does not require learning, it is nonetheless modulated by activities in brain regions that are also implicated in higher cognitive processes (Swerdlow, Geyer, & Braff, 2001).

PPI is simple to measure compared with traditional neuropsychological tests (Geyer, & Swerdlow, 2001). Importantly, it enjoys good test-retest reliability in both humans and animals (e.g., Abel, Waikar, Pedro, Hemsley, & Geyer, 1998; Feifel, 1999; Ludewig, Geyer, Etzenberger, & Vollenweider, 2002; Schwarzkopf, McCoy, Smith, Boutros, 1993). Hence, there is considerable interest in exploring the potential of PPI as a marker or a predictor of other overt cognitive and behavioral traits amongst both diseased and healthy individuals (Geyer, 2006a, b). A number of studies have investigated a potential link between PPI magnitude and cognitive performance in the general population. In healthy humans, there are reports that PPI positively correlated with strategy formation, planning efficiency, execution speed, and working memory performance (Bitsios & Giakoumaki, 2005; Bitsios, Giakoumaki, Theou, & Frangou, 2006; Csomor et al., 2008a; Giakoumaki, Bitsios, & Frangou, 2006; Holstein et al., 2011). A positive relationship between PPI and cognitive performance is also suggested by the evaluation of clinical populations including patients with schizophrenia in which PPI deficiency often co-exists with other cognitive impairments (Hagan & Jones, 2005; Young, Powell, Risbrough, Marston, & Geyer, 2009). Given that cognitive deficiency in schizophrenia is now an accepted drug target independent of the psychotic symptoms (Laughren & Levin, 2006, 2011), the relationship between PPI and cognition is of interest with respect to drug discovery for the treatment of cognitive symptoms in schizophrenia and related disorders (Geyer, 2006a, b).

PPI is highly translational since it is measured in very similar fashions in humans and rodents. The PPI paradigm therefore has been widely used to study the neurochemical and genetic basis of sensorimotor gating in animal models (Geyer, McIlwain, & Paylor, 2002; Swerdlow et al., 2001). However, only a few studies have investigated the relationship between PPI expression and cognitive performance in normal animals. In our first attempt, we have recently examined in a homogeneous cohort of adult wild-type C57BL/6 mice the presence of a potential link between PPI magnitude and performance in standard tests of learning and memory including spatial reference memory, spatial working memory, and contextual fear conditioning (Singer et al., 2013). Apart from a highly specific positive correlation linking working memory performance with PPI induced by weak (but not strong) prepulse stimuli, no significant association between PPI and cognitive performance was discovered.

Here, to further explore the relationship between PPI and cognition and to test the suggestion that intrinsically low-PPI may be associated with poorer cognitive functions, we segregated a homogeneous cohort of 100 male adult C57BL/6 mice into two subgroups of subjects with low or high basal PPI levels. Cognitive performance of the two subgroups was then compared on spatial reference memory in the Morris water maze (including a challenge with new learning under high proactive interference) and spatial recognition memory in the Y-maze. In addition, spontaneous motor activity in the open field and anxiety-like behavior in the elevated plus maze were measured to control for confounding non-cognitive effects. The present segregation approach complements our previous study that solely relied on correlative analysis of individual differences to identify potential links between PPI expression and hippocampus-dependent spatial learning and memory function (Singer et al., 2013).

Our decision to adopt the current approach is to overcome an inherent limitation of the correlative approach in the study by Singer et al. (2013) and an earlier one by Bitanihirwe, Dubroqua, Singer, Feldon, & Yee (2011). Namely, the limited sample size in each case arguably may limit the statistical power to detect correlative links especially when statistical corrections (for Type-I errors) may be deemed necessary to correct the family-wise Type-I error rate with the inclusions of multiple variables. In other words, a substantial increase in the number of subjects would be needed to provide robust statistical power, which could have severely limited our ability to perform multiple labor-intensive and time-consuming behavioral tests. By contrast, our current approach has enabled us to test our hypothesis based on a simple a priori comparison between two groups with a demonstrably robust difference in baseline PPI (an effect size η p 2 = 0.70) as our independent variable. It has simplified interpretation as well as afforded us superior statistical power in answering whether a substantial baseline PPI difference entailed divergence in cognitive performance in specific tests. It should complement the correlative approach in the search of association between traits.

Methods

Subjects

A cohort of 100 12-week-old male C57BL/6NCrl mice were used. They were bred in the Laboratory of Behavioural Neurobiology (Swiss Federal Institute of Technology Zurich) from C57BL/6NCrl (strain code 027) breeding pairs originating from Charles River (Germany). They were born within one week of each other, weaned and sexed on postnatal day 21. All mice were housed with ad libitum access to food and water, in a temperature- and humidity-controlled (22±1 °C, 55±5 %) vivarium maintained under a reversed 12/12-h light–dark cycle (lights on 1900–0700 h). All behavioral evaluations were conducted during the dark phase of the cycle. All procedures were conducted in accordance with the National Institutes of Health Guide for Care and Use of Laboratory Animals (1996) and had been approved by the Cantonal Veterinarian’s Office of Zurich.

All animals were naive at the beginning of the PPI screening. Subsequently, 23 “high-PPI” and 23 “low-PPI” subjects were identified (see Results section ) and evaluated a week later in four behavioral tests in the order described below. Successive tests were separated by 2−3 days of recovery.

Prepulse inhibition (PPI) of the acoustic startle reflex

The apparatus and procedures have been fully described elsewhere (Yee, Chang, & Feldon, 2004a). In brief, four startle chambers for mice (SR-LAB, San Diego Instruments, San Diego, CA, USA) were used. During a PPI testing session, the animals were presented with a series of acoustic (white noise) stimuli comprising a mixture of four different types of trials: Pulse-alone trials, prepulse-alone trials, prepulse-plus-pulse trials, and no-stimulus trials in which no discrete stimulus other than the constant background noise (65 dBA) was presented. The intensity of the pulse stimulus was 120 dBA while prepulses of the following intensities were employed: 69, 73, 77, 81, and 85 dBA, corresponding to 4, 8, 12, 16, and 20 decibel units above background, respectively. The duration of the pulse was 40 ms, while a prepulse lasted 20 ms. The stimulus onset asynchrony between the prepulse and pulse stimuli in prepulse-plus-pulse trials was 100 ms.

The PPI test session consisted of three blocks of trials. In the first block, the animals were exposed to six consecutive pulse-alone trials in order to habituate and stabilize their startle response. PPI was measured in the middle block in which 12 series of discrete test trials were presented. Each series consisted of one trial of each of the following trial types: startle-alone, prepulse-plus-pulse of each of the five levels of prepulse, prepulse-alone of each of the five levels of prepulse, and no stimulus (background alone). The 12 discrete trials within each series were presented in a pseudorandom order, with a variable inter-trial interval averaging 15 s (ranging from 10 to 20 s). The session was concluded with a final block of six consecutive pulse-alone trials similar to that presented in the first block. Whole body acceleration was measured by a stabilimeter on each and every trial within a 65-ms response window (from the onset of the pulse in pulse-alone and prepulse-plus-pulse trials, and the onset of the prepulse on prepulse-alone trials, respectively). This output was referred to as reactivity score (in arbitrary units) and constituted the raw data from which a number of dependent measures were derived, as detailed below:

Startle reaction

The mean startle reactivity across the 12 pulse-alone trials presented in the middle block was separately analyzed in order to evaluate the animals’ startle reactivity.

Startle habituation

To examine whether the startle reactivity habituated within the period of testing, the mean startle reactivity in the first and last block of pulse alone-trials was subjected to a separate analysis.

Prepulse-elicited reactivity

A comparison between prepulse-alone trials at each level of intensity and no-stimulus trials allowed the assessment of the direct reaction to the prepulse stimulus. Prepulse-elicited reactivity on each prepulse level was normalized against baseline reactivity on the no-stimulus trial by a difference score as performed by Yee et al. (2004a). In addition, the reaction to the prepulse was approximated by the linear component derived from the prepulse reactivity curve including no-stimulus trials (referred to as “prepulse steepness index”).

Baseline reactivity

Reactivity on no-stimulus trials was used as a measure of baseline reactivity in the absence of any discrete stimulus other than the background noise.

PPI

The reactivity scores obtained on the 12 pulse-alone and prepulse-plus-pulse trials presented in the middle block were utilized to measure PPI. PPI was expressed as percent reduction of the startle reaction (%PPI) by the formula %PPI = [(pulse-alone – prepulse-plus-pulse) / pulse-alone × 100 %] as well as by the absolute reduction of the startle reaction as a function of increasing prepulse intensity (see Yee et al., 2004a). In the latter approach of PPI indexation, the PPI magnitude was indexed by the linear rate of which the startle reaction induced by the pulse stimulus is reduced by increasingly more intense prepulse stimuli. This method of indexing the magnitude of PPI is individually approximated by the linear component (namely, the fitted slope) of the startle reactivity curve as a function of increasing prepulse intensity (Yee, Chang, Pietropaolo, & Feldon, 2005). This index is referred to as the “PPI steepness index” here.

Anxiety-related behavior in the elevated plus maze

Spontaneous anxiety-related behavior and unconditioned fear were evaluated by the elevated plus maze test. The apparatus and test procedures have been described in full detail elsewhere (Yee et al., 2004b). The test lasted 5 min during which the animal was allowed to explore freely undisturbed in the maze. Two anxiety-related measures were calculated: (i) percentage of time spent in the open arms [time in open arms / time in open and enclosed arms × 100 %], and (ii) percentage of entries made into the open arms [entries into open arms / entries into open and enclosed arms × 100 %]. In addition, spontaneous locomotor activity was measured by the cumulative distance travelled (in m) in the entire maze. All raw data were generated by the Ethovision tracking software (Version 3.1, Noldus Technology, Wageningen, The Netherlands).

Locomotor activity in the open field

The apparatus consisted of four identical open-field arenas constructed from plastic laminated wood. Each arena measured 40 × 40 cm in surface area and was surrounded on all sides by 35-cm high walls. The experiment was conducted in a quiet testing room under diffused dim lighting (20 lux). The animals were tested in squads of four (one per arena). The allocation to the four arenas was counterbalanced between the two groups. A trial began by gently placing the animals in the center of the appropriate open field. They were then allowed to explore undisturbed in the open field for 1 h. Afterwards, they were returned to the home cage and the arenas cleansed with diluted alcohol (10 %) and dried prior to the next squad. Locomotor activity was indexed by the distance travelled (in m) expressed in successive 5-min bins. These data were calculated by the Ethovision (Version 3.1, Noldus Technology) tracking software.

Spatial recognition memory in the Y-maze

The Y-maze used to assess spatial recognition memory was identical to the one described by Pietropaolo et al. (2009). It could be placed in two distinct testing rooms (Rooms 1 and 2) each enriched with a unique set of visual extra-maze cues. Prior to the experiment, the animals were assigned two arms (start arm and familiar arm) to which they were exposed during the first phase of the test (sample phase). The remaining third arm constituted the novel arm and it was only accessible in the second phase (test phase). Allocation of arms (start, familiar, and novel) was counterbalanced within each experimental group. The maze floor was covered by a thin layer of sawdust to encourage exploration. The sawdust was never re-used between animals; and the sawdust was also replaced with fresh one between the two phases of the test to avoid any renmant of olfactory cues.

The sample phase began by releasing the animal from the distal end of the start arm facing the center of the maze. The animal was then given 5 min to freely explore the start and familiar arms. Timing started when the animal first entered the sample arm as detected by the Ethovision tracking software. In the subsequent test phase, the animal was again placed in the start arm and was allowed to explore all three arms for 3 min. Timing began upon detection of the animal in the central area of the maze by the Ethovision tracking software.

Two delay intervals (2 min and 3 h) were examined on two consecutive days. On day 1, the sample and test phases were separated by a delay of 2 min, conducted in Room 1. On day 2, a 3-h delay was imposed and the test was performed in Room 2. During the delay interval, the animals were either kept in waiting cages in the testing room (2-min delay) or were returned to the home cage (3-h delay).

Spatial novelty preference in the test phase was indexed by a discrimination ratio [time in novel arm / (time in novel + familiar arm)] as suggested by Bannerman et al. (2008). All data were generated by the Ethovision (Version 3.1, Noldus Technology) tracking software.

Spatial reference memory and new learning in the water maze

A detailed description of the apparatus and procedure has been provided elsewhere (Singer, Boison, Möhler, Feldon, & Yee, 2009a). On the first day, the animals were pre-trained using a visually “cued” platform in order to familiarize them to swimming in the pool and climbing onto the platform. Each animal underwent three trials. The animals were allowed a maximum of 60 s to locate the platform on a given trial. If an animal failed to do so, it was guided to the platform by the experimenter and an escape latency of 60 s was scored on that trial. Across the three “cued” trials, the platform remained in a constant position, 25 cm from the center of the maze in the middle of one of four quadrants (NE, SE, SW, and NW). Assignment of the cued platform location was counterbalanced within in each group. A unique non-repeating sequence of release points (varied among N, E, S, and W) was randomly generated for each animal. The inter-trial interval (ITI) was 40 s, of which the animal spent the first 20 s on the platform and the remaining 20 s in a waiting cage positioned next to the water maze.

Reference memory acquisition training commenced the next day. The platform was moved 90° clockwise relative to its position in the cued trials. A new non-repeating sequence of starting points (N, E, S, and W) was generated daily for each animal. Four trials were performed daily, but they were separated into two sessions 3 h apart. The animals were returned to the home cage between the two sessions. Each session thus comprised two trials, separated by an ITI of 40 s. Acquisition training continued for 3 days for a total of 12 trials. On day 4, the animals were subjected to a single probe trial in the first session when the escape platform was removed and the animals allowed to swim freely in the maze for 45 s after being randomly released from one of the four starting points. Next, new learning commenced in the second session on day 4 when the platform was moved to a new position – in the middle of the quadrant opposite to where it was positioned during acquisition training. Finally, new learning continued for another two trials on day 5.

Performance during initial acquisition and new learning was measured by the latency (in sec) and path-length (in m) to reach the platform. Search accuracy during the probe test was indexed by percentage of time spent in the target quadrant, and percentage of path-length made inside the target quadrant.

Statistical analysis

All data were analyzed by parametric analysis of variance (ANOVA) with the between-subject factor basal PPI (high vs. low). Additional within-subject factors were included according to the nature of the considered dependent variables. To assist interpretation of the statistical outcomes, post-hoc analyses were performed based on the associated error terms taken from the overall ANOVA whenever appropriate. To better conform to the normality and variance homogeneity assumptions of parametric ANOVA, a natural logarithmic transformation (hereafter referred to as “ln-transformed”) (see Csomor et al., 2008b) was applied to the reactivity data obtained in the PPI experiment except for %PPI which was calculated based on the non-transformed data. All statistical analyses were carried out using SPSS for Windows (release 19, SPSS, Chicago, IL, USA) implemented on a PC running the Windows 7 operating system.

Results

Segregation of subjects with high and low-PPI

Based on the initial screening of the cohort of C57BL/6 male mice (n = 100), two subsets differing in baseline magnitude of PPI were identified. First, the cohort was trimmed with respect to four variables: (i) average baseline reactivity score (ln-transformed) obtained across “no-stimulus” trials, (ii) average startle reactivity (ln-transformed) obtained in “pulse-alone” trials, (iii) average percentage PPI across all prepulse-plus-pulse trials (as defined in the Procedures), and (iv) the alternative index of PPI − the “PPI steepness index” − which measured the steepness of the downward-sloping reactivity curve from pulse-alone trials to prepulse-plus-pulse trials with increasing prepulse intensity. Because our initial sample size was sufficiently large, we decided a priori that it is prudent to exclude subjects with the three highest and lowest scores in each of the four variables. It could minimize potential confounding effects of outlying subjects that lay at both extreme ends of the distribution. The trimming was to approximate the convention that the extreme 5 % of the general population (in effect 6 % here, with the top and bottom three extremes dropped) might be considered as outside the “norm” of the general population. Indeed, 15 out of the 22 mice excluded had at least one variable scoring beyond or very close to the ±1.96 × SD cut offs from the respective grand mean as expected from a normal Gaussian distribution.

Next, the remaining 78 subjects were then split according to the median of the two complementary variables (iii) and (iv) that index PPI. We did not rely on the common measure of %PPI alone because of fear that it can be distorted by baseline startle reactivity (see Csomor et al., 2008b). To this end, only animals receiving the same categorization for both variables were included in the final segregation, which yielded 23 “high-PPI” and 23 “low-PPI” subjects. It implied that 32 mice received inconsistent allocations between the two variables. The disagreement reflected partly the opposing influence of baseline startle on the two variables, as lower baseline startle tended to increase %PPI but decrease the PPI steepness index. Thus, exclusion of subjects with inconsistent allocation ensured that our segregation was robust. The inclusion of the two complementary PPI indexes further ensured that the two groups were highly comparable in terms of startle baseline recorded on pulse-alone trials (Fig. 1) without any explicit attempt to match the baseline startle between the two subgroups. A few subjects received mixed allocation because they scored close to the median of the variables, and their exclusion should increase the effect size of the group difference in baseline PPI. The descriptive and inferential statistics comparing the “high-PPI” and “low-PPI” subjects are as follows.

Fig. 1
figure 1

Comparison of the high-PPI (n = 23) and low-PPI (n = 23) groups. (A) Reactivity scores (ln-transformed) obtained on pulse-alone and prepulse-plus-pulse trials. (B) Linear component derived from the data set shown in A. (C) %PPI expressed as function of increasing prepulse intensity and as overall mean in the bar-plot on the right. (D) Prepulse-elicited reactivity was indexed by a difference score (as deviation from no stimulus trials) which is also shown as a function of increasing prepulse intensity and as overall mean. Prepulse intensity is expressed as decibel units above the constant background (BG) of 65 dBA. “+0” refers to pulse-alone trials in which no prepulse was present. All values refer to mean ± SE. * denotes a significant group difference (p < .05) based on the respective ANOVA or according to subsequent post hoc comparisons

The expression of PPI

A 2 × 6 (Basal PPI grouping × Prepulse intensity) ANOVA of the ln-transformed reactivity scores obtained in pulse-alone and prepulse-plus-pulse trials yielded a significant main effect of prepulse intensity [F(5, 220) = 310.66, p < .001, η p 2 = .88]. The prepulse stimulus was very effective in inhibiting the startle reaction elicited by the succeeding pulse stimulus, and the inhibition is proportional to the intensity of the prepulse (Fig. 1A). However, the rate of startle reduction as a function of prepulse intensity was clearly more pronounced in the high-PPI group. The difference led to the emergence of a significant group × prepulse intensity interaction [F(5, 220) = 17.57, p < .001]. Polynomial contrast analysis further revealed that the group × prepulse intensity interaction was the strongest in its linear component [F(1, 44) = 41.84, p < .001, η p 2 = .49] (Fig. 1B). Post-hoc pairwise comparisons indicated that the startle reaction to the pulse stimulus was significantly lower in the high-PPI group when compared with the low-PPI group from prepulse intensity >8 dBA above background. Furthermore, this group difference grew with increasing prepulse intensity with the inhibition of the startle reaction (i.e., PPI) approaching asymptote earlier, and at a lower level of inhibition, in the low-PPI group compared with the high-PPI group.

Parallel analysis of %PPI by a 2 × 5 (Basal PPI grouping × Prepulse intensity) ANOVA essentially yielded a similar pattern of results (Fig. 1C). The general dependency of the PPI magnitude on the prepulse intensity (indexed by %PPI) was again demonstrated by the significant effect of prepulse intensity [F(4, 176) = 87.91, p < .001, η p 2 = .67]. Furthermore, %PPI was consistently weaker in the low PPI group regardless of prepulse intensity yielding a significant overall group difference [F(1, 44) = 101.34, p < .001, η p 2 = .70] while the group by prepulse intensity interaction was far from statistical significance (p > .34). As expected, the percent transformation conferred additional power to the detection of group difference in PPI expression by comparison with the critical effect size (the linear component of the group × prepulse intensity interaction was η p 2 = .49) obtained in the previous analysis.

Reactivity obtained on no-stimulus and prepulse-alone trials

Apart from the expected divergence on PPI, the two groups also differed in terms of baseline reactivity obtained on no-stimulus trials and the direct reaction elicited by the prepulse stimulus (Fig. 1D). Baseline spontaneous reactivity was significantly higher in low-PPI than the high-PPI group [F(1, 44) = 24.58, p < .001, η p 2 = .36] as indicated by an one-way ANOVA of the mean reactivity (ln-transformed) recorded on no-stimulus trials (high-PPI = 2.0, low-PPI = 2.5, standard error (SE) =.1). To control for this difference in spontaneous reactivity, prepulse-elicited reactivity was indexed by a difference score that reflects the deviation from no-stimulus trials (Fig. 1D). This measure showed a clear increase with stronger prepulse intensity, and this increase appeared progressively weaker in the low-PPI relative to the high-PPI group. Consistent with these observations, a 2 × 5 (Basal PPI grouping × Prepulse intensity) ANOVA of the difference score revealed a significant main effect of group [F(1, 44) = 9.04, p < .005, η p 2 = .17], prepulse intensity [F(4, 176) = 28.26, p < .001, ηp 2 = .39], and their interaction [F(4, 176) = 13.56, p < .001, η p 2 = .24]. Additional polynomial contrast analysis showed that the interaction reached its highest significance level in the linear component [F(1, 44) = 51.98, p < .001, ηp 2 = .54].

Lack of difference in startle habituation

To examine whether the startle reaction habituated within the period of testing the first and last block of pulse-alone trials were subjected to a 2 × 2 (Basal PPI grouping × Blocks) ANOVA. It showed that the startle reaction did not significantly differ across blocks (p > .3), implying that the animals’ startle reaction was stable during the period of PPI assessment. The mean startle reaction (ln-transformed) on the two blocks was as follows: first block: high-PPI = 4.23, low-PPI = 4.29, SE = .14; last block: high-PPI = 4.20, low-PPI = 4.47, SE = .15.

The lack of clear startle habituation here was expected as we have repeatedly shown that C57BL/6 mice did not show reliable stable habituation (e.g., Pietropaolo, Singer, Feldon, & Yee, 2008; Singer, Feldon, & Yee, 2009b; Singer & Yee, 2012; Yee et al., 2004a, 2005) in our laboratory. Singer et al. (2013) reported that while some subjects (C57BL/6 mice) showed habituation, a similar number showed sensitization. However, it should be noted that habituation is typically observed in other mouse strains and rodent species as well as humans.

Comparable expression of baseline anxiety or locomotor activity

The probability to enter and spend time in the open arms in the elevated plus maze provided an index of anxiety, and this expression was highly comparable between the two groups. Analysis of percentage open arm time (high-PPI = 26.0 %, low-PPI = 18.6 %, SE = 3.9) and percentage open arm entries (high-PPI=31.4 %, low-PPI=29.7 %, SE = 3.3) did not reveal any significant difference. Locomotor activity as measured by the cumulative distance travelled (in m) was also comparable between the two groups (high-PPI = 11.20 m, low-PPI = 11.13 m, SE = .5). The impression of a lack of difference in anxiety agreed with the open field test that had examined activity and spatial exploration over a much longer period of time. As shown in Fig. 2, the temporal profile of locomotor activity was nearly indistinguishable between the high-PPI and low-PPI groups, including the expression of locomotor habituation over time. A 2 × 12 (Basal PPI grouping × bins) ANOVA of the distance travelled per 5-min bin revealed only a significant main effect of bins [F(11, 484)= 80.66, p < .001].

Fig. 2
figure 2

Spontaneous locomotor activity in the open field expressed in 5-min bins. All values refer to the mean ± SE

Comparable expression of spatial recognition memory in the Y-maze

Data of one high-PPI subject were lost since it had escaped from the Y-maze during test.

Test phase

Irrespective of groups, the animals exhibited a delay-dependent preference for the novel arm (Fig. 3), which led to a significant main effect of delays [F (1, 43) = 5.81, p < .05] in a 2 × 2 (Basal PPI grouping × Delays) ANOVA of the discrimination ratio. One-sample t-tests based on the pooled data of the two groups indicated that the preference was significantly above chance (i.e., discrimination ratio > .5) on the 2-min delay condition [t(44) = 6.28, p < .001]. The novelty preference was clearly weaker on the 3-h delay condition while performance still maintained at above chance level [t(44) = 2.02, p = .0497].

Fig. 3
figure 3

Spatial familiarity judgment in the Y-maze. Novelty preference is indexed by a discrimination ratio based on time spent exploring the novel and familiar arms, namely, [novel / (novel + familiar)]. A ratio > .5 indicates a preference towards the novel over the familiar arm. * denotes a significant delay effect (p < .05) emerged in a 2 × 2 (Basal PPI grouping × Delays) ANOVA of the discrimination ratio. All values refer to the mean ± SE

Additional analysis of the absolute exploration times by a 2 × 2 × 2 (Basal PPI grouping × Delays × Arms) ANOVA revealed a similar pattern of results. Irrespective of group, the animals spent more time in the novel arm than in the familiar arm [F(1, 43) = 44.59, p < .001], and the magnitude of this bias was dependent on the delay as indicated by a significant Delays × Arms interaction [F(1, 43) = 6.70, p < .05]. Similar to the sample phase, the level of exploration (familiar and novel arms) generally decreased across delays [F(1, 43) = 5.08, p < .05] but did not differ between groups (F < 1).

Sample phase

The amount of time spent in the familiar arm generally decreased across the two delay conditions (Table 1), yielding a significant main effect of delays [F (1, 43) = 7.68, p < .01] in a 2 × 2 (Basal PPI grouping × Delays) ANOVA of the time spent in the familiar arm. The exploration time appeared somewhat longer in the high–PPI group than the low–PPI group but this numerical difference, in the range of 6–10.7 %, was far from statistical significance.

Table 1 Exploration times (in s) in the familiar and novel arms during the sample and test phases are shown separately for the two delay conditions (2 min and 3 h). All values refer to mean ± SE

Comparable performance throughout the water maze test

Cued test

All animals rapidly acquired the escape response onto the visually cued platform across the three trials and performance on the cued test was highly comparable between groups (Fig. 4). Separate 2 × 3 (Basal PPI grouping × Trials) ANOVAs of escape latency and path-length yielded only a significant main effect of trials [latency: F(2, 88) = 19.27, p < .001; path-length: F(2, 88) = 28.94, p < .001]. The average swim speed was highly comparable between groups (high-PPI = .16, low-PPI = .17, SE = .004 ms-1).

Fig. 4
figure 4

Water maze cued test performance. Performance was measured by escape latency (A) and path-length (B). Both groups learned to locate and escape onto the visually cued escape platform across the three trials. All values refer to the mean ± SE

Reference memory

There was a gradual decrease in escape latency and path-length as training progressed, which was similarly observed in the two groups (Fig. 5A and B). A significant main effect of sessions [latency: F(5, 220) = 6.32, p < .001, path-length F(5, 220) = 8.61, p < .001] was revealed by separate 2 × 6 × 2 (Basal PPI grouping × Sessions × Trials) ANOVAs. It was accompanied by a significant main effect trials [latency: F(1, 44) = 10.25, p < .001, path-length F(1, 44) = 14.90, p < .001], further suggesting that performance improved across the two trials within a session. Neither the main effect of group nor any of its interactions attained statistical significance. Analysis of swim speed again failed to reveal any significant group difference. The average swim speed (in ms-1) was nearly identical to that obtained in the cued test: high-PPI = .16, low-PPI = .17, SE = .003.

Fig. 5
figure 5

Spatial reference memory and new learning. The two groups showed similar escape performance on the initial acquisition phase and learned the novel platform location in the new phase at a comparable rate in terms of latency (A) as well as path–length (B). Likewise, search accuracy during the probe test did not differ between the two groups. Both groups preferentially searched in the target quadrant as expressed by percentage time (C) and percentage path-length (D) per quadrant. * denotes that the preference for the target quadrant was significantly above chance (i.e., > 25 % as indicated by the dashed line) according to one-sample t-tests (p < .05). All values refer to mean ± SE

New learning

As expected, the change in the platform location produced a general reduction in performance on the first session of new learning as evidenced by an increase in both latency (Fig. 5a) and path-length (Fig. 5B) to a level comparable to the initial training session. However, the animals quickly adapted their search strategy and reached within two sessions a level of performance comparable to that seen at the end of the acquisition phase. The rapid new learning was evidenced by the significant main effect of sessions in a 2 × 2 × 2 (Basal PPI grouping × Sessions × Trials) ANOVA of latency [F(1, 44) = 52.79, p < .001] or path-length: [F(1, 44) = 52.34, p < .001]. As seen before, performance generally improved from the first to the second trial within a session [latency: F(1, 44) = 7.10, p < .05; path-length: F(1, 44) = 8.03, p < .01]. This pattern of results was similarly observed in both groups. Likewise, the two groups swam at a comparable speed. The average swim speed (in ms-1) of each group was: high-PPI = .16, low-PPI = .17, SE = .006.

Probe test

Search accuracy as indexed by the percentage of time (Fig. 5C) and percentage of path-length (Fig. 5D) spent inside the target quadrant did not differ between groups. There was an overall search bias for the target quadrant yielding a significant main effect of quadrants in two separate 2 × 4 (Basal PPI grouping × Quadrants) ANOVAs of percentage time [F(3, 132) = 31.85, p < .001] and percentage path-length [F(3, 132) = 32.33, p < .001] per quadrant. Supplementary one-sample t-tests confirmed that the preference for the target quadrant (in percentage time and percentage path-length) was significantly above chance level (i.e. > 25 %) in each group (p’s < .05). We also restricted the analysis to the first 15 s in order to examine whether the two groups potentially differed in beginning of the probe test and again did not find any statistical evidence for a significant group difference (data not shown).

Discussion

Despite the substantial difference in the magnitude of PPI expression between low and high PPI subjects that measured an effect size of 0.7 (η p 2) in terms of %PPI, the two groups remained highly comparable in all measures of spatial cognition in the present study. These negative findings in the memory tests were paralleled by the absence of any difference between low and high PPI animals in the open field and the elevated plus maze test of anxiety. In addition, as we have reported elsewhere, the low-PPI compared with the high-PPI subjects evaluated here were marked by significant neurochemical differences. Post-mortem HPLC analysis showed that the levels of dopamine and its metabolites in the nucleus accumbens and medial prefrontal cortex of the low-PPI subjects were significantly lower than in the high-PPI subjects (Peleg-Raibstein et al, 2013). This neurochemical difference might underlie the stronger development of behavioral sensitization to amphetamine observed in the low-PPI subjects (Peleg-Raibstein et al, 2013).

In the Morris water maze, low and high PPI subjects acquired the location of the escape platform at a similar pace and showed comparable long-term retention of the newly acquired reference memory as indicated by the nearly identical search preference for the target quadrant in the probe test conducted 24 h after the last day of acquisition. The absence of an effect here is consistent with the negative result of a previous study which failed to detect any significant correlation between individual PPI levels and reference memory performance within a homogeneous cohort of 23 naïve C57BL/6 mice (Singer et al., 2013). Even when the animals were further challenged to acquire a second position of the escape platform under the negative influence of proactive interference from the previously acquired platform (i.e., new learning following the initial acquisition), performance was still highly comparable between low and high PPI subjects. These findings strengthen the impression that reference memory performance within the normal population of adult male C57BL/6 mice cannot be predicted by baseline PPI levels.

Likewise, the Y-maze test also did not reveal any difference between the high- and low-PPI subjects. Both groups showed a strong and comparable preference for the novel over the familiar arm when the demand on memory retention was low (2-min delay); and their performance remained very similar when the overall preference for the novel arm was weakened by extending the retention period to 3 h.

The lack of difference between high- and low-PPI subjects on these two well-known hippocampus-dependent tests of spatial learning (e.g., Bannerman et al., 2004; Sanderson et al., 2007) is consistent with our previous study, which showed that individual difference in PPI expression also failed to predict the expression of contextual fear acquired through Pavlovian conditioning (Singer et al., 2013), a test that similarly depends on the hippocampus (e.g., Phillips & LeDoux, 1992; Seldon, Everitt, Jarrard, & Robbins, 1991). Hence, individual difference in spatial memory is largely orthogonal to variation in PPI expression among healthy male C57BL/6 mice.

Against the above conclusion, our recent demonstration of a positive correlation between PPI and spatial working memory performance in C57BL/6 mice (Singer et al., 2013) may seem to be an exceptional finding. However, it should be noted that the reported statistical link was only evident when the memory load was relatively low because the test only required the animal to remember one specific location on a given trial. Furthermore, the statistical link was restricted to PPI obtained with the weakest prepulses. It suggests that PPI is unlikely to reliably predict the retention capacity of the working memory buffer in mice. In keeping with this interpretation, no evidence for a potential link between basal PPI and memory capacity has been forthcoming in healthy humans, even though the assessment and conception of working memory do differ somewhat between humans and rodents (Baddeley, 2012; Dudchenko, 2004; Honig, 1978; Olton, Becker, & Handelmann, 1979). Instead, poor PPI in human volunteers has been repeatedly shown to predict poor strategy formation and slow execution time in spatial working memory tests (Bitsios & Giakoumaki, 2005; Bitsios et al., 2006; Csomor et al., 2008a ; Giakoumaki et al., 2006; Holstein et al., 2011).

In line with the emphasis on executive functioning based on the human data, we have previously identified a significant positive correlation between PPI and sustained attention or vigilance in a visual two-choice discrimination task in a different cohort of C57BL/6 mice (Bitanihirwe et al., 2011). A potential link between PPI and attention is further supported by several reports showing that attentional processes may critically modify the expression of PPI (Dawson, Hazlett, Filion, Nuechterlein, & Schell, 1993; Scholes & Martin-Iverson, 2009; Scholes & Martin-Iverson, 2010). Hence, attention and executive functions could be important mediating factors that underlie the co-existence of PPI deficiency and memory deficiency in specific diseases such as schizophrenia (Geyer, 2006b).

Indeed, the extent to which basal PPI levels may predict cognitive performance in schizophrenia patients remains unclear and may not be a reliable tool to aid clinical diagnosis (Young, Powell, Risbrough, Marston, & Geyer, 2009). One notable study failed to detect any correlation between PPI and cognitive performance in a large cohort of schizophrenia patients tested across a comprehensive battery of cognitive tests (Swerdlow et al., 2006). Furthermore, cognitive deficits are not always associated with low PPI. Thus, patients with Alzheimer’s disease who show memory deficits do not necessarily exhibit poor PPI levels compared with healthy controls (Hejl, Glenthøj, Mackeprang, Hemmingsenm, & Waldemar, 2004; Ueki, Goto, Sato, Iso, & Morita, 2006). Likewise, PPI does not seem to decrease with normal aging while cognitive performance shows a clear age-dependent decline in the general population (Ellwanger, Geyer, & Braff, 2003) leading to the suggestion that age-related cognitive dysfunction is orthogonal to PPI (Young et al., 2009).

Hence, it would seem premature to rely on individual differences in PPI to predict, qualitatively or quantitatively, memory performance in healthy C57BL/6 mice whilst its correlative links with other attentional and executive functions may merit more serious consideration. Moreover, it remains to be ascertained whether our findings that have been gathered in this specific inbred mouse strain could be generalized to other strains or species. Indeed, the individual differences observed here and in our previous studies using in-bred C57BL/6 mice likely reflect the influences of environmental and/or developmental factors rather than genetic variations. In order to better capture the situations in the normal human population, it is imperative to extend our investigation to outbred strains (as well as other inbred strains) in combination with gene mapping technology (e.g., Flint and Mott, 2008) to undercover potential gene-environment interaction.