Abstract
Rationale
The basolateral amygdala (BLA) and medial geniculate nucleus of the thalamus (MGN) have both been shown to be necessary for the formation of associative learning. While the role that the BLA plays in this process has long been emphasized, the MGN has been less well-studied and surrounded by debate regarding whether the relay of sensory information is active or passive.
Objectives
We seek to understand the role the MGN has within the thalamoamgydala circuit in the formation of associative learning.
Methods
Here, we use optogenetics and in vivo electrophysiological recordings to dissect the MGN-BLA circuit and explore the specific subpopulations for evidence of learning and synthesis of information that could impact downstream BLA encoding. We employ various machine learning techniques to investigate function within neural subpopulations. We introduce a novel method to investigate tonic changes across trial-by-trial structure, which offers an alternative approach to traditional trial-averaging techniques.
Results
We find that the MGN appears to encode arousal but not valence, unlike the BLA which encodes for both. We find that the MGN and the BLA appear to react differently to expected and unexpected outcomes; the BLA biased responses toward reward prediction error and the MGN focused on anticipated punishment. We uncover evidence of tonic changes by visualizing changes across trials during inter-trial intervals (baseline epochs) for a subset of cells.
Conclusion
We conclude that the MGN-BLA projector population acts as both filter and transferer of information by relaying information about the salience of cues to the amygdala, but these signals are not valence-specified.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Preface
In tribute to Nadia Chaudhri, and her discoveries regarding how contexts can modulate the representation of cues in the amygdala (Sciascia et al. 2015), and other mesolimbic circuit components (Chaudhri et al. 2009, 2008; Valyear et al. 2020), here we explore what information processing precedes entry into the basolateral amygdala (BLA) from the thalamus and develop visualization approaches for exploring across-trial temporal dynamics in addition to within-trial population activity.
Introduction
Many similarities exist between the medial geniculate nucleus of the thalamus (MGN) and the basolateral amygdala (BLA) in how each region encodes information (Weinberger 2011). Not only are both regions necessary for the formation of cued fear learning, but they both exhibit plasticity following learning, and encode information of multiple sensory modalities (Edeline et al. 1990; Wepsic 1966). Numerous studies have focused on the BLA for associative learning (Nishijo et al. 1988; Weinberger 2011), and for fear and reward learning (McKernan and Shinnick-Gallagher 1997; Namburi et al. 2015; Rogan et al. 1997).
However, few studies have explicitly tested associative learning in the MGN (Bordi and LeDoux 1994; Cruikshank et al. 1992; Edeline et al. 1990), due to the long-held assumption that the MGN only passively contributes to learning by relaying crude tone and somatosensory information (LeDoux 2000; Orsini and Maren 2012). However, there is evidence that the MGN likely plays a dynamic role in the formation of associative learning (Edeline et al. 1990; Parsons et al. 2006), fear conditioning (Ferrara 2015; Nabavi et al. 2014), and fear memories (Antunes and Moita 2010; Han et al. 2008). It has been argued that the MGN is the “root” of the auditory associative learning circuit, playing an active role and complementing the BLA in the formation of cued fear memories (Weinberger 2011).
The current work explores mechanisms underlying the formation of associative learning for both reward and punishment-associated outcomes by evaluating the similarities and differences between the MGN and the BLA neural population responses. We use a variety of approaches, such as hierarchical clustering and neural trajectories, to identify categorical differences in how these regions process discrimination learning. To do so, we dissect the neural circuit connectivity by exploring direct and indirect connections between the two regions. We use traditional experimenter-defined a priori methods to categorize arousal and valence encoding patterns in each region. We further employ an algorithmic hierarchical clustering approach to uncover unbiased similarities and differences between how and what these regions encode. This work employs in vivo single-unit electrophysiology and circuit-specific optogenetics; the former allows us to characterize what information the MGN transmits to the BLA during Pavlovian learning and the latter provides a method to identify which cells in the MGN project to the BLA.
Materials and methods
Experimental design
Animal care and subject details
Adult wild-type male C57BL/6 J mice aged 8 weeks upon arrival were used (Jackson Laboratory; RRID: IMSR-JAX:000,664). All experimental subjects were maintained in reverse light-cycle cubicles with ad libitum food and water until behavioral experiments began. Animals were housed four to one cage until electrode implantation, after which they were housed singly. All animal handling procedures were in accordance with those put forth by the National Institute of Health (NIH) and were approved by the Massachusetts Institute of Technology’s Institutional Animal Care and Use Committee (IACUC).
Stereotaxic surgery procedures
For all subjects, surgery was performed under aseptic conditions using a small animal stereotax (David Kopf Instruments, Tujunga, CA, USA) and body temperature was maintained using a heating pad. Anesthesia was induced using a 5% mixture of isoflurane and oxygen. Following induction, this mixture was reduced to 2–2.5% and was maintained throughout the duration of the procedure (0.5 L/min oxygen flow rate). Once subjects were adequately anesthetized, ophthalmic ointment was applied to the subject’s eyes, hair was removed from the incision site using hair clippers, and the area was scrubbed using 70% alcohol and betadine 3 times each in alternation, with an injection of 2% lidocaine under the skin for local anesthetic. All measurements for viral injections, electrode implants, or optrode implants were made with Bregma as the origin. Following surgery, subjects were placed in clean cages with water-softened mouse chow to recover. Cages were placed either under a heating lamp or on a heating pad to aid in recovery.
Viral surgery
To record from the medial geniculate nucleus (MGN) neurons in a circuit-specific manner, a combination viral approach was employed. Following the general surgery procedures detailed above, an incision was made to provide access to the skull. After cleaning the skull, craniotomies were made above the basolateral amygdala (BLA) and the MGN. In order to selectively express channelrhodopsin-2 (ChR2) in BLA-projecting cells in the MGN, an anterogradely traveling adeno-associated virus serotype 5 (AAV5) coding for ChR2 fused with an enhanced yellow fluorescent protein (eYFP) in a double floxed inverted open reading frame (DIO) under the control of elongated factor-1α promotor (250 nL of AAV5-EF1α-DIO-ChR2-eYFP) was injected into the MGN (stereotaxic coordinates from Bregma, in mm: − 3.0 anteroposterior, 1.75 mediolateral, − 3.9 dorsoventral). Concurrently, the retrogradely traveling canine adenovirus-2 coding for cre-recombinase (400 nL CAV2-cre) was injected into the BLA (stereotaxic coordinates from Bregma, in mm: − 1.6 anteroposterior, 3.35 mediolateral, − 4.9 dorsoventral). Injections were carried out using a 10 μL microsyringe (33 g beveled needle) driven at a rate of 0.05 μL/min by a microsyringe pump and controller (Micro4; WPI, Sarasota, FL, USA). Viral material was allowed to penetrate tissue prior to needle extraction (10–15 min/injection site). Separate needles were used for each virus to avoid contamination and were flushed thoroughly with sterile water following the surgical procedure. AAV5 viral aliquots were obtained from the University of North Carolina Vector Core (Chapel Hill, NC). CAV2 viral aliquots were obtained from the Institut de Génétique Moléculaire de Montpellier, France.
Electrode surgery
Implantation of electrode and optrode arrays took place 2–3 months following viral surgery to allow for robust viral expression. Following the general surgery procedures detailed above, an incision was made to provide access to the skull, and the skull was cleaned and scraped to provide a stable surface for anchoring microelectrode arrays. A total of 8 craniotomies were made to accommodate the implantation of three anchoring skull screws, one microelectrode wire bundle, one optrode (comprised of a wire bundle and optical fiber), a ground wire, and passive headbar anchoring. The headbar (1in × 1/8in × 1/8in aluminum square stock) was attached using cement (Adhesive Dental Cement C&B Metabond; Parkell, Edgewood, NY, USA). Cement dried prior to implantation of skull screws (15–20 min). Following skull screw implantation, electrode and optrode arrays were lowered into the BLA (− 1.6 mm anteroposterior, 3.35 mm mediolateral, − 4.9 mm dorsoventral) and the MGN (− 3.0 mm anteroposterior, 1.75 mm mediolateral, − 3.9 mm dorsoventral), respectively. The multiarray ground wire was placed between the skull and brain surface in a craniotomy contralateral to electrode and optrode implantation sites. Cement was then applied to the screws, arrays, headbar, and ground wire, and allowed to dry for a minimum of 45 min or until hard to the touch. A protective layer of dental cement was then applied to secure all array and headbar elements. A nylon suture was used to close the incision around the implanted device. Prior to removal from anesthesia, subjects received subcutaneous injections of warm saline (1 mL) and meloxicam (5 mg/kg). Following surgery, subjects were allowed to fully recover from anesthesia prior to being returned to the animal housing facility.
In vivo electrophysiological recording
Electrode construction
Multielectrode arrays used for recording single-unit activity were custom-designed and built by the experimenter (CL). Arrays consisted of two multichannel single-wire probes and were constructed using Omnetics microconnector plugs (Omnetics Connector Corp., Minneapolis, MN) as the main structural component. The optrode array used for recording in the MGN was connected directly to the plug using a plastic spacer. This array was comprised of 17 single wires (22 μm stablohm wire, polyamide insulated; California Fine Wire, Grover Beach, CA) connected to a 300 μm optical fiber using polyamide microtubing to align the wires. The second array was floating and constructed such that it could be moved independently from the connector and optrode array. This electrode array also consisted of 17 wires of the same material, and wires were aligned using a length of syringe needle cut to suit (2–3 mm in length). The wires in both arrays were connected to the contacts of the Omnetics connector by hand, and wire insulation was stripped by hand using forceps. Each connection was then painted with conductive adhesive to ensure a good connection (Silver Print; MG Chemicals, Burlington ON). Both cyanoacrylate glue and epoxy were used to secure components. Both arrays were cut to length by hand using fine serrated tungsten scissors (Fine Science Tools, Foster City, CA), with optrode wire bundle extended beyond the tip of the optical fiber (500–800 μm). All wires were gold plated to decrease impedance to 150–250 kΩ using a 50/50 gold plating solution comprised of gold solution (Neuralynx, Bozeman, MT) and 1 μM polyethylene glycol. Following gold plating, a low-impedance bare silver wire (California Fine Wire, Grover Beach, CA) was soldered to the final pin on the connector, and the connection was coated in dental cement.
In vivo single-unit electrophysiological recordings
Neural activity was recorded using an Open Ephys acquisition board (Siegle et al. 2017, J Neural Eng, PMID: 28169219) in conjunction with 32-channel Intan headstages (Intan Technologies, Los Angeles, CA). These data were recorded at a sampling rate of 30 kHz, using a band pass filter to collect signals between 1 and 7000 Hz. Data processing was carried out using a combination of commercially available as well as custom-written suite of software. Initial waveform detection and creation of files for cluster sorting was carried out using a custom algorithm in MATLAB (MATLAB; MathWorks, Natick, MA). Identified spikes were thresholded using a 6-sigma criterion to reduce the probability of including spatially attenuated multiunit signals and traces were aligned to the depolarization peak. Thresholded and aligned traces were then exported in the .plx format to be imported into Offline Sorter (Plexon Inc., Dallas TX), which was used for cluster sorting the spike data using principal component analysis (PCA). Behavioral events were recorded simultaneously with neural data using analog inputs to the Open Ephys acquisition board with the same sampling rate as neural data and were in some cases multiplexed to reduce the number of recording channels required.
Optogenetic stimulation
Optogenetic stimulation for phototagging experiments was carried out using a 473 nm Diode-Pumped Solid-State (DPSS) laser (OEM Laser Systems, Draper, UT) connected to a patch cable using FC/PC connections (Doric, Quebec, Canada). Light power for phototagging was calibrated to ~ 22 mW at the beginning of each recording session using a Thorlabs light power meter designed for use with lasers (Thorlabs; Newton, NJ).
Photoidentification protocol
To identify BLA-projecting cells in the MGN as well as those cells in the BLA receiving either direct or indirect input from the MGN, we expressed ChR2 in the MGN → BLA projections using the circuit-specific viral approach detailed above. Following the conclusion of behavioral training, but within the same neural recording session, light was delivered into the MGN using a laser connected to the optical fiber implanted 500–800 μm above the recording site (473 nm, 22 mW). Light was delivered as a series of pulses of varying length and frequencies as follows: 1 s pulses, 5 ms 1 Hz pulses, 5 ms 10 Hz pulses, and 5 ms 20 Hz pulses. A custom MATLAB script was used to identify phototagged cells in the MGN using the following criteria: signed rank test, p < 0.001, and z-score > 3.5 within 10 ms of light onset for 5 ms 1 Hz laser pulses. The 10 ms photoresponse latency threshold was used since no cells exhibiting recurrent excitatory connections were observed during ex vivo whole-cell patching experiments. Network cells in the BLA were identified using the following criteria: rank-sum test, p < 0.001, within 50 ms of laser onset 5 ms 1 Hz pulses.
Electrolytic lesioning
Electrolytic lesions were used to verify the placements of microelectrode arrays. To carry this out, subjects were placed under anesthesia using the same procedure as that for stereotaxic surgery. Lesions were created by passing current through single wires in the multiarray. In the MGN, lesions were made on the channels identified to have phototagged cells. In the BLA, between 4 and 6 channels were chosen randomly for lesioning. Lesions consisted of passing current through each channel for 10 s. Subjects were sacrificed 15–30 min following lesions to allow for gliosis to occur.
Ex vivo electrophysiological recording
Ex vivo determination of photoresponse latency (PRL) threshold
Viral surgeries for all subjects were conducted on a cage-by-cage basis; therefore, one subject from each cage was selected for collecting PRL data, while the other 3 subjects were used in behavioral recording experiments. Patching experiments to determine the PRL threshold were conducted concurrently with behavioral training to ensure that PRL data were collected from subjects that had undergone a viral incubation period of similar length to behavioral recording animals. This period was approximately 3 months for all subjects. For this, subjects were deeply anesthetized using sodium pentobarbital (200 mg/kg; IP injection), and transcardially perfused using 20 mL of ice-cold artificial cerebrospinal fluid (aCSF) modified as follows to preserve brain sections: (composition in mM) NaCl 87, KCl 2.5, NaH2PO4*H20 1.3, MgCl2*6H2O 7, NaHCO3 25, sucrose 75, ascorbate 5, CaCl2*2H2O 0.5 in ddH20 (osmolarity 322–326 mOsm, pH 7.20–7.30, saturated with carbogen gas 95% oxygen, 5% carbon dioxide). The subject’s head was removed, and the brain rapidly extracted. The brain was mounted on the vibratome stage using adhesive and placed in the slicing chamber submerged in partially frozen modified aCSF (Leica VT1000S, Leica Microsystems, Germany). Coronal sections measuring 300um were taken along the extent of the MGN (AP: ~ 2.5–3.5 mm posterior to bregma). Brain slices were placed in a recovery chamber at 32 °C containing carbogen-saturated aCSF (composition in mM: NaCl 126, KCl 2.5, NaH2PO4*H20 1.25, MgCl2*6H2O 1, NaHCO3 26, glucose 10, CaCl2*H2O 2.4 in ddH20; osmolarity 298–301 mOsm; pH 7.28–7.32) for at least 1 h prior to initiation of recording. Following recovery, slices were transferred to the electrophysiological recording chamber.
Whole-cell patch-clamp recordings
Slices were restrained in the recording chamber using a platinum and nylon “harp” and were continuously perfused with oxygenated aCSF (2 mL/min; 30–32 °C). Slices were imaged through a × 40 water-immersion objective using an IR-DIC enabled upright microscope (Scientifica, Uckfield, UK) equipped with a QImaging Retiga EXi camera (Q Imaging, Canada). Whole-cell patch-clamp recordings were taken using borosilicate glass capillaries (World Precision Instruments, Hertfordshire, UK), pulled on a P-97 puller (Sutter Instrument, CA, USA). When filled within an internal solution, the resulting recording pipettes had resistance values of 4–6 MΩ (internal solution composition in mM: potassium gluconate 125, NaCl 10, HEPES 20, MgATP 3, and 0.1% neurobiotin, in ddH20; osmolarity 287 mOsm; pH 7.3). Whole-cell patch-clamp recordings were made from ChR2-eYFP expressing MGN cells using pClamp 10.4 software (Molecular Devices, CA, USA). Analog signals were amplified using a Multiclamp 700B amplifier, filtered at 3 kHz, and digitized at 10 kHz with a Digidata 1550 (Molecular Devices, CA, USA). ChR2-eYFP expressing BLA-projecting cells in the MGN were located by illuminating the slice briefly with 470 nm light (pE-100; CoolLed, Andover, UK). In order to confirm ChR2 expression in the recorded cell, a sustained 1 s pulse of blue light was delivered while recording from the cell in voltage-clamp mode. If sustained inward current was observed, the cell was determined to express ChR2-eYFP. Cells were then exposed to 5 ms pulses of blue light (10 pulses delivered at 1 Hz, every 60 s) while recording in current-clamp mode to observe light-evoked spiking behavior. Data were analyzed post hoc using Clampfit 1.4 software (Molecular Devices, Sunnyvale, CA). Latency to spike was determined for each cell by taking the mean time from pulse onset to spike (peak) for 30 responses to blue light pulses. Because no spiking response was observed in non-expressing neighboring cells, the PRL was set at 10 ms (Fig. 2h).
Histology and storage
Following the conclusion of recording experiments subjects were anesthetized using 90 mg/kg sodium pentobarbital and perfused transcardially with 20 mL of ice-cold lactated Ringer’s solution, followed by 20 mL ice-cold paraformaldehyde (4%; PFA) in phosphate-buffered saline (PBS). Brains were extracted and placed in 4% PFA for 24 h. The tissue was then equilibrated in a cryo-protectant solution (30% sucrose in PBS, w/v). Coronal slices measuring 60um were taken from the tissue using a sliding microtome (HM430; Thermo Fisher Scientific, Waltham, MA), and stored in PBS at 4 °C.
Immunohistochemistry
Sectioned tissue was incubated in 1x PBS solution containing 3% Donkey serum in 0.03% Triton for 1 h at room temperature. Sections were then washed using 1xPBS for 10 min, followed by incubation in a solution containing a DNA-specific fluorescent probe (DAPI; 4',6-diamidino-2-phenylindole; 1:50,000 in PBS). Sections were then washed in 1x PBS 4 times, 10 min each. Sections were then mounted on slides for imaging using PVA-DABCO (Sigma-Aldrich; St. Louis, MO).
Confocal or epifluorescence imaging
Stained tissue slices were imaged using either a confocal laser scanning microscope (Olympus FV1000), or an epifluorescence microscope (Keyence BZ-X). Images were taken using a 10 × objective lens. Following imaging, the images were evaluated to determine the location of viral expression as seen via fluorescence from the ChR2-eYFP. Recording sites were located using electrolytic lesion locations, which were identified using auto fluorescence produced by gliosis. In some cases, lesions or electrode extraction created holes in the tissues at the recording sites making probe location readily identifiable. Data from animals with wires outside the regions of interest were excluded from the analysis. If lesions were correctly located in only one region, then only data from that region were used for that animal. Following imaging, slides were stored in slide boxes at room temperature in case additional imaging or validation was necessary.
Pavlovian behavioral training paradigms
All subjects were placed on a food regulation schedule prior to the initiation of Pavlovian training (4 g/day standard mouse chow, ad libitum access to water). For this Pavlovian paradigm, a highly palatable Ensure nutrient drink was the rewarding unconditioned stimulus (US), and a mildly aversive airpuff to the subject’s face (~ 25 psi, 75 ms) acted as the punishing US. On the first day of training, subjects were placed in the experimental apparatus, using head fixation via a small aluminum headbar held in place by a modified stereotaxic holder. The initial training session was used to expose subjects to reward-related stimuli only while the infrared (IR) beam break used for lick detection was manually calibrated. In cases where subjects demonstrated reluctance in lick responding to presentations of the rewarding US, the experimenter would deliver US rewards directly into the subjects’ mouth. Following equipment calibration, subjects underwent Acquisition training before proceeding to Discrimination trials. Acquisition training consisted of interspersed conditioned stimuli (CS) and unconditioned stimuli (US) presentation of reward-association trials and free reward trials to encourage licking. Reward association trials consisted of a presentation of a 4 s pure tone (8.5 kHz and 14 kHz, counterbalanced). This was followed by a variable (jitter) delay of 1–1.5 s whereupon US delivery took place. CSs were separated by a 15 ± 4 s inter-trial interval (ITI). The number of trials presented in each training session was dictated by the experimenter, who monitored the subject’s performance to infer motivation level. During the first acquisition session, if subjects exhibited a low rate of responding, the experimenter in some cases would also present free-reward trials during the session (these trials are not included in the later analysis). Behavioral criteria determining subject performance are described below in the “Statistical methods” section. Subjects meeting acquisition criteria begin discrimination training the following day. Discrimination sessions were comprised of 75 reward trials and 35 punishment trials (4 s, 65 dB, 20 s ± 4 s ITI) and lasted 1–1.5 h in length. The trial structure included jitter in US presentation timing following CS onset but before US delivery: in this case between 1 s and 1.5 s. If subjects did not perform correct anticipatory lick responses to > 50% or anticipatory licking trials, learning was deemed unsuccessful.
Statistical methods
All computation was performed in MATLAB, unless explicitly stated otherwise.
Determining parameters for statistical comparison of ITI vs. anticipatory licking
To determine how many trials should be evaluated for statistical comparison between licking behavior during ITI periods and licking during anticipatory period, bootstrapping was performed on pilot behavioral data (n = 6 mice). The number of trials necessary to find a significant result for the effect size of licking differences observed was calculated by finding the probability of identifying a significant result for 1 through 50 trial presentations. Significance was defined as having Student’s t test with a p-value < 0.05. Each condition was iterated 1000 times by randomly sampling the data. Under these conditions, we could detect a significant difference present in the data 93% of the time with 15 trials. By 20 trial presentations, the detection probability would be 97%.
Statistical determination of learning behavior for experimental data
To measure behavioral performance during acquisition, the licking behavior during the anticipatory period (the time between CS onset and US delivery) was statistically compared to licking behavior during a period equal in length during the ITI period preceding each CS. If the Student’s t test returned a significant result (p < 0.01) and the raw number of licks during the anticipatory period was greater on average, the animal was determined to have successfully learned the CS-US association. This test was performed twice for each session using the first 20 and last 20 trials within the session. The number of trials (20) was determined using the bootstrapping method described above. Animals meeting the criteria moved into discrimination training in the next session. Discrimination sessions randomly presented CSs consisting of Ensure reward (CS-E) and airpuff punishment (CS-A) trials. To measure behavioral performance during discrimination sessions, a statistical comparison of the difference scores for licking behavior during ITI and anticipatory periods was made; difference scores allowed licking behavior to be compared across protocols as the baseline licking behavior varied between the two protocols. The difference scores were calculated by subtracting the raw number of licks during the anticipatory period from the raw number of licks during a period of equal length taken from the preceding ITI period for each trial. A Student’s t test was used to compare the difference scores for CS-E and CS-A trials. This test was carried out three times for each session, using the first 20 trials, the last 20 trials, and a set of 20 randomly selected trials. If the test returned a result of p < 0.01 and the raw number of licks for the rewarding CS-E trials was greater than the raw number of licks for the punishing CS-A trials on any of the three tests, the subject was determined to have successfully discriminated between the reward and punishment associated CSs.
Statistical determination of differences of task responsiveness and multimodal encoding within MGN and BLA
To determine cell responsiveness, a Wilcoxon signed rank-sum test was applied using an experimental window of 0–100 ms from stimulus onset and baseline windows 3 s from the preceding ITI period; the cell was deemed responsive to stimulus if resulting p < 0.01. To determine whether the cell was excited to or inhibited by cue, we computed the z-score response for baseline and experimental windows; the cell was deemed excited if the average z-score response was greater than 0 (inhibited if less than 0). To determine the proportional differences between categories (CS, US, both, none), chi-squared tests were applied; where data violated the assumptions of the chi-squared test due to low n (n < 5), Fisher’s exact test was applied. To identify the higher proportion of cells responding to experimenter-defined categories, the Kruskal–Wallis test was used. If results were significant, Wilcoxon’s rank-sum post hoc tests were performed pairwise, and Bonferroni’s correction was applied to identify which category was significantly higher.
Calculation of neural trajectories for time-averaged data
To explore the neuronal dynamics during reward and punishment stimuli, peri-stimulus time histograms (PSTHs) for each neuronal unit were computed at 50 ms bin width. PSTHs were separated according to stimulus (CS-E or CS-A) and cell type (MGN/BLA). We performed principal component analysis (PCA) to find the representative features for each cell type. Data were smoothed using a Gaussian-weighted moving average within a smoothing window of 25 previous bins. The first two PCs captured 77% variance in the MGN cells and 46% in the BLA.
Also, the dynamics depending on whether the behavior was correct or incorrect for each stimulus were explored, yielding 4 conditions for each cell type. The four conditions are defined as hit (correct licking behavior during CS-E); miss (not licking during CS-E); false alarm (licking during CS-A); and correct rejection (not licking during CS-A), where licking is measured during the anticipatory period, e.g., after the cue and prior to stimulus delivery. Dimensionality reduction was performed as above within a smoothing window consisting of 10 previous bins. These trajectories were projected on 3D space using the first three principal components, PC1, PC2, and PC3. To determine the mean length of trajectories, a leave-one-out approach was used; the length was computed k times, where k is the number of subjects, then averaged across each group. To compute the distance between trajectories, the same leave-one-out approach was taken, this time computing the distance between any two trajectories and averaging across the groups.
Functional hierarchical clustering algorithm for phasic responses of trial-averaged data
Prior to clustering, data with NaN values were removed (n = 1). The trial-averaged response (a.k.a. PSTH) was computed using 100 ms bin widths. Normalization consisted of z-scores, which were calculated using the mean and standard deviation during the baseline period (− 2 s to cue onset) individually for each neuron. Finally, the data were smoothed along the time dimension using a Gaussian-weighted moving average within a smoothing window of 10 prior bins. To compare reward and punishment trials, data were concatenated to calculate universal clusters, allowing for comparisons between and within CS-E and CS-A protocols. A hierarchical cluster tree was generated using Ward’s method, which uses inner squared distance to determine hierarchy using a Euclidean distance metric. A cutoff threshold value of 0.23 of the max value in the set was selected and used to determine clusters. Clusters containing less than 3 cells were discarded (for Ensure/airpuff: 7 cells did not meet cluster criteria; for hit/miss/false alarm/correct rejection: 4 cells did not meet cluster criteria). These clusters were then divided according to the different brain regions of interest, MGN non-phototagged, MGN → BLA projections, out-of-network BLA, and in-network BLA. Heatmaps plotted for each region represent the smoothed z-score input data; clusters for each region are color-coded based on the original cluster tree. An identical approach was taken to preprocess hit/miss/false alarm/correct rejection data, concatenating each category appropriately by cell type; the same clustering parameters were applied.
Statistical analysis to measure clusters within each neural region
To determine if one cluster in the MGN contained significantly more cells than other clusters, a leave-one-out approach was taken; the number of cells in each cluster was computed k times, each time leaving out one subject. A Kruskal–Wallis ANOVA was applied to the leave-one-out data to determine if there was a difference across all the clusters. If the ANOVA result was statistically significant, post hoc Wilcoxon’s rank-sum tests were applied to the following subset of pairs: the cluster with the highest mean number of cells was compared to other clusters. To correct for multiple tests, the Bonferroni correction was applied by adjusting the significance level (alpha = 0.05), dividing by the number of pairs being tested. This same approach was used for BLA data.
Functional hierarchical clustering algorithm for tonic changes to baseline periods
To identify tonic changes occurring during baseline or ITI periods, the following approach was taken: for each neuron and stimulus condition, three periods of interest were (1) an ITI baseline from − 3 to − 2 s prior to cue onset, (2) a pre-cue region from − 1 s prior to cue onset to cue onset, and (3) a response window from cue onset to 1 s past cue onset. For each time window, the number of spikes per trial was summed. This yielded (1) baseline spike count per trial, (2) pre-cue spike count, and (3) the response period spike for each neuron and stimulus condition. To normalize, the mean and standard deviations of the baseline period were subtracted from the trial-averaged spike counts for both pre-cue and post-cue and for both CS-E and CS-A. Data were then smoothed over trials using a Gaussian-weighted moving average with a smoothing factor of 0.85. The resulting four z-scored datasets were concatenated prior to clustering. Hierarchical clustering was performed using Ward’s method on a Euclidean distance metric and the threshold set at 20% of the max value. Clusters containing less than 3 cells were discarded (for trial-based clusters: 5 cells did not meet cluster criteria). To compute whether a change to the firing rate occurred during the pre-cue period, data for each cluster were plotted as spike count by trial number. A regression line was fit to the spike count data. If the confidence intervals around the slope of the regression line did not contain 0, then a change to the baseline firing rate was determined to have occurred from the first trial to the last during the pre-cue period.
Results
Experimental paradigm
To achieve robust discriminative learning, we employed a two-stage Pavlovian paradigm that allowed mice to acquire a reward association (acquisition phase) before introducing punishing stimuli (discrimination phase) to ensure robust and consistent learning across animals. Discrimination sessions randomly presented conditioned stimuli (CS) consisting of Ensure reward (CS-E) and airpuff punishment (CS-A) trials, pairing each with the appropriate tonal unconditioned stimuli (4 s, 65 dB, 20 s ± 4 s ITI). US delivery was randomly jittered during an interval of 1 to 1.5 s after CS onset to provide adequate time for subjects to exhibit reliable anticipatory licking throughout the first second of tone onset. The anticipatory licking period was defined as the period following CS onset and preceding US onset (yellow region in Fig. 1a). The correct behavioral response to CS-E was anticipatory licking while that of CS-A was lick omission. Discrimination learning was considered successful when subjects demonstrated a statistically significant difference between the licking scores across reward and punishment trials, as well as success in the anticipatory licking control during a single session (Fig. 1b). Lick behavior was recorded using an infrared (IR) beam break that was calibrated to each subject during Acquisition phase training (Fig. 1c). Data (N = 12) were bootstrapped to determine that 20 trial presentations were sufficient to correctly determine significance in behavioral data (Fig. 1d).
Optogenetic stimulation allows identification of four distinct neural populations of interest: MGN non-phototagged, MGN → BLA, out-of-network BLA, and in-network BLA
To test the hypothesis that medial geniculate nucleus of the thalamus (MGN) neurons that project to basolateral amygdala (BLA) exhibit differences from MGN neurons during learning, we employed an optogenetic technique to identify specific subpopulations in the MGN and the BLA (Lima et al. 2009). We used a dual virus recombination approach to selectively express Channelrhodopsin-2 (ChR2) fused to a reporter, an enhanced yellow fluorescent protein (eYFP) in a single circuit: MGN neurons that project to the BLA (Fig. 2a). By injecting an anterogradely traveling virus carrying a cre-dependent ChR2 into the upstream brain region (MGN), and a retrogradely traveling virus expressing cre-recombinase in the downstream region (BLA), only cells containing both viruses will express ChR2-eYFP, and thereby become light-sensitive and fluorescently labeled through cre-mediated specificity (Beyeler et al. 2016). Because the retrogradely traveling viruses are chosen for their propensity to enter neurons through axon terminals, viral recombination preferentially occurs in cells terminating in the downstream injection site (Fenno et al. 2011; Tye and Deisseroth 2012). These cells can then be identified as expressing ChR2-eYFP by recording the light-evoked spiking, a method known as phototagging. Furthermore, this technique can be combined with electrophysiological recording in multiple brain regions to identify other downstream neurons that are in the same connected network. Thus, by shining light into the upstream brain region and evoking responses there, we can also capture downstream responses; we then identify cells projecting to a downstream region by defining a short-latency response window to capture those cells firing in response to the light (Fig. 2a, b).
Thus, the proposed circuit model represents the connectivity structure between the MGN and the BLA; it accounts for both the MGN neurons directly connected with the BLA identified via phototagging (MGN → BLA) as well as those polysynaptic connections for which only the BLA projections (in-network) are identifiable (Fig. 2b). Representative images of the lesion and viral expression showed that neurons achieved robust expression of ChR2 in the regions of interest (Fig. 2c). In vivo neurons express ChR2 fire in response to blue light while neighboring neurons do not; thus, we can selectively stimulate cells in the MGN and evoke spikes in the downstream cells receiving input via the monosynaptic or polysynaptic connections (Allsop et al. 2018; Beyeler et al. 2018, 2016; Burgos-Robles et al. 2017; Nieh et al. 2015; Senn et al. 2014) (Fig. 2d).
To validate our approach phototagging MGN → BLA projectors during single unit recordings, a photoresponse latency threshold was determined based on ex vivo whole-cell patch-clamp recordings (N = 3 mice; n = 10 non-expressing neighbors; n = 3 ChR2-eYFP expressing cells). Robust inward currents were observed in ChR2-expressing MGN cells during voltage-clamp recordings (Fig. 2f), as well as short latency evoked spikes while recording in current-clamp mode (Fig. 2g). Contrastingly, of the non-expressing neighboring cells recorded, no short latency light-evoked responses were observed (Fig. 2e–g). This suggests the BLA-projecting subpopulation of the MGN does not have robust local recurrent connections. Having successfully validated the identification of the MGN → BLA projections via optogenetic stimulation, we recorded in vivo neural activity simultaneously in the MGN and the BLA using multiarray electrodes during behavioral tasks (Supplemental Fig. 1).
MGN neurons predominantly code for arousal while BLA neurons code for both arousal and valence
To gain understanding about the order of processes implemented across the thalamoamgydala circuit, we examined neural responses of MGN and BLA neurons during the Pavlovian discrimination task described above. To compare across subjects, data were normalized using z-score transformation and plotted on two different timescales (Fig. 3a, b). The responses to CS-E were notably different between brain regions. All MGN cells exhibited a strong excitatory response to the reward cue. In contrast, BLA cells showed an excitatory phasic response of short duration (~ 150 ms) then appeared to cease firing for the duration of the cue (Fig. 3a). Because we were primarily interested in the learned association, these analyses are limited to the first second of the cue, prior to delivery of the US. No significant differences were found between the average response during the first second of the MGN phototagged and non-phototagged cells to either stimulus (CS-E: Wilcoxon’s rank-sum test, p = 0.89; CS-A: Wilcoxon’s rank-sum test, p = 0.75). Similarly, among BLA neurons, no significant differences were found between the in-network and out-of-network BLA neurons in response to either stimulus (CS-E: Wilcoxon’s rank-sum test, p = 0.54; CS-A: Wilcoxon’s rank-sum test, p = 0.30).
We found that although the MGN → BLA subpopulation had a trend toward higher levels of task responsiveness to both CS and US compared to the non-phototagged MGN population, this difference was not significant at the 5% level (Χ2 test, p < 0.07; Fig. 3c). However, significantly more BLA in-network cells responded to both CS and US compared to the out-of-network population (Fisher’s exact test, ***p < 0.005).
To further characterize the roles the MGN and BLA play in the formation of associative learning, we analyzed the overall encoding strength in both regions with respect to both reward- and punishment-associated cues. To provide a categorical way of characterizing the entire population, neuronal response profiles were defined a priori as inhibitory or excitatory in response to reward and punishment CSs (Fig. 3d). Of the cue-responsive neurons in the MGN, significantly more cells are excited to both CSs (E + , A + category) than almost all other categories (Fig. 3d inset: Kruskal–Wallis’ test, ****p = 6.1602e − 07, E + A + post hoc comparisons to E + A − ****p < 0.0001, E + n.s., E − A + ****p < 0.0001, E − A − ***p < 0.003, E − ****p < 0.0002, A + ****p < 0.0001, A − ***p < 0.0001). However, no significant differences were found between categories in the BLA, after correcting for multiple comparisons (Fig. 3d inset: Kruskal–Wallis’ test *p = 0.027; E − A − post hoc comparisons to E + A + , n.s., p = 0.04; E + A − , n.s., p = 0.03; E + , n.s., p = 0.1; E–A + , n.s., p = 0.02; E − , n.s., p = 0.07; A + , n.s., p = 0.02; A − , n.s., p = 0.02). Moreover, there is a significantly higher excitatory response profile in the MGN to both cues compared to the BLA (Fisher’s exact test, E + A + *** p = 0.0036). In contrast, proportionally more BLA neurons have an inhibitory response to both reward and punishment than MGN neurons (Fisher’s exact test, E − A − ****p = 0.0001).
Notably, the highest proportion of phototagged MGN cells across the population are contained in two response categories (E + , A + and E − , A −) that do not discriminate between valence; they are excited to both CSs or inhibited to both. This suggests information relayed to the amygdala from the auditory thalamus is largely salience or arousal rather than assigning valence. Taken together, these data are consistent with the notion that the thalamus is not assigning valence to the information, merely relaying its salience to the amygdala where valence is assigned.
To further dissect neural responses within our model of circuit connectivity, we explored the breakdown of responses across the subpopulations to each experimenter-defined a priori category (Fig. 3d, e). Despite the small sample size within categories, we observed the following points of interest: there is no difference in the proportion across the non-phototagged and phototagged MGN neurons in either the E + , A + nor in E − , A − categories (E + A + : Χ2 test, n.s.; E − A − : Fisher’s exact test, n.s.). However, in the BLA, there is a significantly smaller proportion of in-network cells in the E − A − category compared to the out-of-network population (Fisher’s exact test, *p = 0.048). It is also interesting that the E + A + category only appears in in-network cells and is absent in the out-of-network population. Despite the low number of neurons, it appears that the in-network BLA population is richly representing both salience and valence.
Hierarchical clustering of neural responses to reward and punishment highlights greater variety of response profiles in MGN compared to BLA
To explore the dynamic neural response to stimuli across brain regions, we plotted neural trajectories, which show how the response evolves through time in reduced dimensional space (Fig. 4a, b). The responses to each stimulus in the MGN population start off close together, then travel in separate paths following cue onset. While trajectories in the MGN have a roughly circular shape, CS-A returns to nearly the same starting point while CS-E does not complete the circle, ending farther away from the baseline starting point. This indicates a sustained difference from baseline firing in response to reward in the MGN. Here, the CS-A trajectory is longer than the CS-E trajectory indicative of a more dynamic response to the punishment cue (Wilcoxon’s rank-sum test, ***p = 0.0029). The average distance between the two trajectories was measured over time; the distance, averaged across 1 s baseline (light purple), is significantly higher from the distance averaged 1 s after cue onset (dark purple) indicating that there are divergent dynamics in the MGN that occur following the cue (Fig. 4a inset: Wilcoxon’s rank-sum test, ****p = 5.3e − 19).
In the BLA, again we find the total length of the punishment trajectory is significantly longer than reward, mirroring a more complex dynamic response in the BLA to punishment (Wilcoxon’s rank-sum test, **p = 0.0152). The distance between trajectories increases following cue onset and there is a pronounced difference in the average distance over a 1 s baseline window compared to the 1 s response window following cue (Fig. 4b inset, Wilcoxon’s rank-sum, ****p = 5.3650e − 19).
We next performed hierarchical clustering to investigate neuronal responses without a priori determination of response type groups. This exploratory technique can reveal underlying structures that yield clues as to how these data are organized; the resulting clusters are evaluated for similarities and differences within and between experimental groups and brain regions. These resulting categories offer an alternative grouping than the experimenter-determined categories in the previous section. Hierarchical clustering performed on 241 neurons aligned at cue onset over a 6 s time interval for each protocol, yielded eight functionally distinct clusters (Fig. 4c, d).
When broken down by cell type and subpopulation (Fig. 4e–h), only five of eight clusters were common to both brain regions (#2 orange, #5 red, #6 yellow, #7 green, #8 pink). The remaining three clusters (#1 purple, #3 blue, and #4 light blue) represent strong, excitatory responses; although these occur only in the MGN, interestingly, all three are present in both phototagged and non-phototagged populations (Fig. 4e, g). Cluster #7 (green) is the most prevalent in the MGN (Kruskal–Wallis’ test, ****p = 4.3344e − 17, Wilcoxon’s rank-sum post hoc tests comparing #7 cluster to #1 (purple) ***p = 0.00021; #2 (orange) p = 0.00033; #3 (dark blue) **p = 0.00031; #4 (light blue) **p = 0.00029; #5 (red) **p = 0.00031; #6 (yellow) **p = 0.00028; #8 (pink) p = 0.00032). In the BLA, cluster #8 (pink) is the most prevalent response profile (Kruskal–Wallis’ test, ****p = 2.2658e − 17, Wilcoxon’s rank-sum post hoc tests comparing cluster #8 (pink) to #2 (orange) **p = 0.00018; #5 (red) **p = 0.00022; #6 (yellow) **p = 0.00012; #7 (green) **p = 0.00025), showing a modest excitatory response to both CSs. From this analysis, we see a variety of responses specifically represented in the MGN phototagged neurons that are providing information directly to the BLA; yet the in-network BLA does not exhibit evidence of similar response profiles.
MGN neurons as a population display more diversity in response profile compared with BLA neurons
To investigate the neural dynamics during specific signal detection responses, we examined the normalized neural activity in a reduced dimensional space using principal component analysis to visualize the neural trajectories (Fig. 5a, b). Four conditions were defined using the licking behavioral responses to protocol stimuli, yielding the following combinations: hit (licking during CS-E), miss (not licking during CS-E), false alarm (licking during CS-A), and correct rejection (not licking during CS-A). Neural trajectories within the MGN are well organized and smoothly varying according to condition, with false alarm, miss, and hit tracing similar patterns with slightly different lengths. The correct rejection population response travels along a different plane and is longer than the other three (Kruskal–Wallis’ test ****p < 2e-07; Wilcoxon’s rank-sum post hoc tests: CR to hit, ***p = 4.7768e − 04; CR to miss, ***p = 3.6585e-05; CR to FA, ***p = 6.0058e − 05). Correct rejection is farthest from the hit trajectory compared to the distance to miss or to false alarm (Kruskal–Wallis’ test, p = 9.11e − 05: Wilcoxon’s rank-sum post hoc tests hit-to-CR compared to hit-to-miss p = 0.0058; hit-to-FA, p = 0.0086; CR-to-miss, n.s.; CR-to-FA, n.s.; miss-to-FA, p = 0.0013). In the BLA, trajectory lengths are not significantly different when corrected for multiple comparisons (Kruskal–Wallis’ test, *p = 0.02; Wilcoxon’s rank-sum tests, FA to Hit, p = 0.0931, FA to CR, p = 0.026, FA to miss, p = 0.026). The distance between false alarm and hit categories was greater than other pairings (Kruskal–Wallis’ test, p = 1.11e − 05; Wilcoxon’s rank-sum post hoc tests compared hit-to-FA distances to: hit-to-CR, n.s.; hit-to-miss, n.s.; CR-to-miss, p = 0.001; miss-to-FA, p = 0.0047). Qualitatively, the MGN appears to respond differently to a correct rejection compared with the other categories. However, the BLA appears to differentiate false alarms from the other three conditions, given that false alarm is the furthest away from the others. Although this false alarm trajectory was not significantly longer in the BLA, the dynamics are clearly different.
Performing hierarchical clustering on this set yields nine functional clusters (Fig. 5c, d). The MGN overall represents all nine clusters (Fig. 5e), of which cluster #7 (purple) is most prevalent (Kruskal–Wallis’ test, p = 7.29e − 19; Wilcoxon’s rank-sum post hoc tests compare #7 (purple) to #1 (light green) p = 0.00033; #2 (dark blue) p = 0.00032; #3 (hot pink) p = 0.00027; #4 (yellow) p = 0.00026; #5 (moss green) p = 0.00021; #6 (orange) p = 0.00033; #8 (light blue) p = 0.00021; #9 (red) p = 3.3e − 5). However, only six of nine clusters are present in the phototagged MGN subpopulation (Fig. 5g). In the BLA only four clusters are present (Fig. 5f), of which cluster #9 (red) is the most prevalent (Kruskal–Wallis’ test p = 3.76e − 19; Wilcoxon’s rank-sum post hoc tests compare #9 (red) to #6 (orange) p = 1.7e − 05; #7 (purple) p = 2.02e − 05; #8 (blue) p = 1.7e − 05). In both in-network and out-of-network populations, the same four clusters are present in both populations (Fig. 5f, h). It is interesting to note that this pattern of cluster breakdown follows a similar pattern to the previous section (Fig. 4g, h). There, we observed more variety in response profiles in the MGN with some overlap between MGN → BLA clusters. However, there was far less representation across clusters in both in-network and out-of-network BLA.
Between-trial hierarchical clustering reveals that a subset of neurons exhibit tonic changes during learning
Switching between tonic and burst firing patterns allows the thalamus to relay information to the cortex for efficient processing (Murray Sherman 2001). We wondered whether there were differences in tonic and burst firing patterns in the thalamoamygdala circuit as well. We explored the inter-trial interval (ITI) for tonic changes during learning in concert with the strong phasic response following cue onset. To explore trends in the population response across the two brain regions, we examined the spike activity of all simultaneously recorded neurons. We computed the average z-score response for each raster, maintaining the trial structure by aligning each trial to the CS. The MGN population has robust phasic responses to each CS-E resulting in higher magnitude overall during the 1 s response window (Fig. 6a). In contrast, the BLA population exhibits mildly inhibitory responses to the reward cue (Fig. 6b). In response to the punishment cue, the MGN exhibits a strong excitatory response immediately following the cue whereas the BLA exhibits an initial inhibitory response (Fig. 6c, d). There is no evidence of tonic changes across the population for either stimulus, e.g., as might be seen by a steady change across baseline or response windows.
During associative learning tasks, trial-to-trial changes during the response window are expected and are often taken as evidence of learning. However, the pre-cue (baseline) periods are typically not expected to exhibit trial-by-trial changes; we expect the firing to be relatively static during pre-cue periods across trials. To test for subtle changes to the tonic firing rate during the pre-CS periods, we denoted a baseline period as 1 s period before cue onset and an experimental response period of 1 s response following cue onset for every trial. Rather than averaging across trials, we instead summed the number of spikes across trials within each window, yielding the spike count per trial across each time interval (Fig. 7a). This allowed us to compare the number of spikes for each trial before and after the cue to identify trends. We note that this approach is unlike the one taken in Figs. 4 and 5, where clustering used standard peri-stimulus time histograms for each neuron. There, the approach used trial-averaged histograms making within-trial comparisons on a seconds timescale. Here, the spike count per trial during baseline window and response windows are used, which allows us to compare across epochs, making between-trial comparisons of a 1-s window across the session on a minutes timescale. We performed hierarchical clustering on the spike count per trial over all neurons and identified 13 distinct clusters (Fig. 7b, c). We used linear regression to identify if a trend was present during the ITI in either the reward or punishment protocol. We found a significant change across epochs in half of the clusters we identified.
Summary of results
In brief, we have seen that while MGN cells are significantly excited to either the reward or punishment stimulus compared to other a priori categories, BLA cells have no significant difference between categories (Fig. 3d). We compared two a priori categories (E + A + , E − A −) between MGN subpopulations and found no significant differences in either case; however, the BLA does show a difference between in-network and out-of-network subpopulations for E − A − (Fig. 3e). We adopted an unsupervised clustering approach to explore responses to CS-E and CS-A and found that a variety of response profiles were represented in the MGN but fewer of these response profiles were represented in the BLA (Fig. 4e, f). We used the same clustering approach on data that explored CS-E and CS-A sorted by correct or incorrect behavioral responses into hit, miss, correct rejection, false alarm categories and found a similar trend; the MGN represented all clusters while the BLA contained fewer than half the number of clusters (Fig. 5e, f). Moreover, the neural trajectories for these four categories presented different dynamics in the two regions. In the MGN, correct rejection was the longest trajectory and was furthest from hit; however, in the BLA, there was no significant difference between trajectory lengths and, instead, the greatest distance was between hit and false alarm (Fig. 5a, b). Finally, we also uncovered evidence of slower, tonic changes during baseline periods using a novel, albeit elementary, method to dissect trial-by-trial changes (Fig. 7).
In conclusion, we have identified strong evidence that robust signals sent from the MGN to the BLA largely carry valence-independent signals that refer to the “absolute value” or salience of the stimulus, while the lower amplitude signals found in the BLA reveal some valence coding populations. In summary, filtering occurs at the level of transmission between the MGN and BLA that enables this transformation, which we speculate may include axo-axonal connections, heavy local inhibition, and summation of other inputs, dendritic nonlinearities, or shunting. Furthermore, we show that in both the BLA and MGN, we see tonic changes in addition to changes in phasic responses to discrete cues that may reflect changes in internal state or contextual modulation.
Discussion
There is a rich history of studying Pavlovian fear conditioning in the amygdala to better understand the cellular mechanisms underlying learning. Due to the simple experimental design, robust results, and repeatability, this paradigm has been widely studied at the anatomical (LeDoux 1986; Yaniv et al. 2001) and circuit levels (Romanski and LeDoux, 1992), and in reference to synaptic pathways (Duvarci and Pare 2014; Maren 2005). There are two ways that sensory information enters the amygdala: from thalamic input or via cortical input pathways. Cortical input is generally considered to be slower and more precise, whereas the thalamus represents the rapid relay of sensory information (Maren and Quirk 2004). For associative conditioning to occur, there must be NMDAR-dependent long-term potentiation in the amygdala (Clem and Huganir 2010; Janak and Tye 2015; McKernan and Shinnick-Gallagher 1997; Rodrigues et al. 2001; Rogan et al. 1997; Rumpel et al. 2005; Tye et al. 2008). However, given that these processes also occur in the thalamus, theoretically, the thalamus could be synthesizing and filtering this information before passing it to the amygdala, and acting as more than a simple relay (Jones 1991; Lee et al. 2010; Murray Sherman 2001; Sherman 2007; Weinberger 2011).
We have uncovered strong evidence in support of the notion that the thalamus is indeed more than a simply relay system as suggested by others. First, the MGN is filtering salient information to the BLA and transforming other valence-enriched signals in the MGN. Therefore, we postulate that the thalamoamygdala circuit follows a model described by the two-factor theory of emotion (Aron et al. 2005; Schachter and Singer 1962; Tye 2018), where recognition of salience in the thalamus is followed by assignment of valence downstream in the amygdala. Second, the MGN and the BLA process expected outcomes differently, as evidenced by correct vs. incorrect behavioral response. Dynamic changes in firing rates accompany anticipatory punishment in the MGN, whereas in the BLA, these greater dynamics instead follow unexpected punishments. Third, we have implemented a simple method to identify tonic changes that occur during inter-trial intervals, dynamics which previously have not been studied in detail.
MGN filtering salient information directly to BLA but routing valence-assigned signals via indirect pathways to BLA
In this study, an optogenetic technique known as phototagging allowed us to identify four populations: (1) cells that project from the MGN to the BLA (MGN → BLA), (2) non-specific MGN cells, (3) cells in the BLA that fire in response to light-evoked activation of the MGN within a short latency (in-network), and (4) those that do not (out-of-network BLA). In our analyses, we frequently perform computational techniques using a combined BLA and MGN data set to allow for direct comparisons across these distinct subpopulations; hierarchical clustering, for example, groups all neurons with similar response profiles together and allows explicit identification of the different populations from which they originated. First, we found that the BLA has proportionally more in-network neurons that respond to both CS and US than the out-of-network populations; however, there is no proportional difference for MGN → BLA and non-specific cells in the MGN (Fig. 3c). Second, we defined explicit, a priori response categories and observed how the neurons in each brain region responded. We found evidence that all MGN cells overwhelmingly exhibited a valence-independent excitatory response profile. By contrast, the BLA did not show a statistically significant difference in proportion across the distinct categories (Fig. 3d). Within the preferred category, we did not find notable differences in the proportion of MGN → BLA or non-specific MGN cells. However, the BLA did show significant differences between in-network and out-of-network populations across the largest categories (Fig. 3e). While it is hardly surprising that the MGN cells respond to salience, it is interesting to note the MGN phototagged cells exhibited less functional diversity than non-phototagged cells. Certain responses that differentiate positive and negative valence actually disappear in the MGN → BLA population (E + A − , E − , and A −) while others are richly represented. Somehow, the large amplitude, sharp transient responses in MGN cells are not present in the BLA — either through presynaptic modulation, dendritic filtering, or local inhibition within the BLA, these robust signals are either muted or absent in the BLA (Fig. 3e). Why they exist in MGN→BLA neurons but not in BLA neurons receiving input from MGN→BLA neurons is unclear. Perhaps they send collaterals to the cortex or other regions where the sharp amplitude signals are propagated, while axo-axonal, dendritic, or somatic inhibition may contribute to the most robust signals being filtered out before arriving within the BLA.
Another interesting note is that the individual a priori categories are represented in MGN → BLA and in-network BLA subpopulations to different degrees. This is a surprising result, as these two populations directly share information (Fig. 3e). We see evidence of the A-category in in-network BLA yet the same category is absent entirely in the MGN → BLA subpopulation. One could interpret this by noting the importance of the polysynaptic connections that make up the in-network population; information is propagating into the BLA but not, apparently, via direct MGN → BLA connections. The thalamo-cortico-amygdala pathway, which plays a strong role in memory and fear learning (Ferrara et al. 2017), could be collateralizing signals. Of the two signaling stream pathways in the thalamocortical system, the higher order pathway does appear to be collateralized; we speculate here that by sending the signal to multiple brain regions, one region is connecting with interneurons that cancels out the direct signals coming from the thalamus (Wolff et al. 2021), resulting in different response profiles in evidence in the BLA. Alternatively, input sent along direct projections into the BLA could undergo dendritic filtering, which exists to intentionally dampen strong signals coming into the amygdala and is likely to preserve signals with the highest salience.
Our general interpretation of these data is that the BLA represents a diverse response that incorporates both arousal and valence while the MGN primarily represents arousal. This result follows one of two models highlighted in a recent review article (Tye 2018), specifically, the two-factor theory of emotion. Within the framework of valence processing, the two-factory theory outlines the following order of operations for two distinct neural processes: first, the brain must recognize the salience of a stimulus as the overall magnitude of the intensity of the response (|n|) before assigning a positive or negative valence to the stimulus (+ n or − n). Here, we see an indication of pure salience occurring upstream in the MGN, where information is largely independent of valence, and valence is assigned downstream in the BLA. To our knowledge, this is the first time this model has been validated through anatomical structure and function. Moreover, although diverse response categories are present in the MGN (Figs. 3e, 4e–h, 5e–h), there is not a one-to-one match across MGN → BLA and in-network populations indicating that the MGN is routing information via alternate, indirect pathways that ultimately terminate in the BLA.
MGN and BLA respond differently to expected and unexpected outcomes
We investigated the response of the MGN and the BLA to account for different representations when behavior matched or contrasted with the expected outcome of the tone (hit, miss, correct rejection, false alarm) by plotting the neural trajectories in reduced dimensional space. In the MGN, there was a significantly longer trajectory associated with correct rejection and significantly greater distances between hit and correct rejection trajectories than most other pairings. This indicates more dynamics for these two conditions and fewer dynamic differences during miss or false alarm (Fig. 5a). By contrast, in the BLA, although trajectories are of similar length, their relative distances have shifted such that hit and false alarm are further apart whereas miss and correct rejection are less dynamic. Why would the MGN appear to represent hit and correct rejection with different dynamics compared to the other responses, while the BLA minimizes the role of correct rejection and instead emphasize false alarm with hit?
One interpretation of these findings is that error signals are more pronounced in the BLA, while the expectation of punishment is amplified in the MGN. The MGN projects to multiple layers within the cortex (Lee 2015) and the thalamocortical pathway was found to be the principal fear memory pathway of an intact brain (Boatman et al. 2006). Thus, it follows that anticipatory knowledge of a coming punishment is amplified in the MGN as this signal is likely sent to cortical regions. However, since false alarm is paired with an unexpected punishment, an incorrectly identified signal would receive greater emphasis in the BLA. In contrast to the classic reward prediction error signals seen in some VTA dopamine neurons (Schultz et al. 1997), a subset of BLA neurons have actually been shown to exhibit a phasic excitation in response to unexpected reward omission that is correlated to frustrative nonreward. In this study, frustration was operationally defined as an increase in vigor or intensity of responding following an unexpected reward omission in an operant conditioning task (Tye et al. 2010).
Evidence of both phasic and tonic changes
Strong phasic responses within an experimental response window are the basis for investigating neural responses to stimuli. Frequently, researchers average across multiple trials to form a detailed response profile in time. Although working with trial-averaged responses has a long and successful practice, averaging obscures slow-scale changes that could emerge during the learning process. In this paper, we introduced a novel method for investigating slow-scale, tonic changes that accompany learning. Instead of averaging across trials, we drill down into the trial-by-trial changes by clustering spike counts. We identified subsets of neurons that experience slow but observable changes to their firing rates across the trials. More surprisingly, we uncovered changes during baseline periods, which are traditionally assumed to be static. Tonic firing more often consists of a steady firing at a specific frequency so changes to this background state that accompany learning are an unexplored area. It is our opinion that investigating trial-by-trial changes is a potentially rich area of study, which we intend to study further with increased specificity and granularity.
Summary and future directions
In the current study, we chose to focus on direct projections between the MGN and BLA, which have a specific response profile, namely excitatory long-range responses that propagate downstream to other brain regions. However, the thalamus, and specifically the MGN, has a diverse representation of other types of cells such as interneurons that might have firing rates orders of magnitude higher than projectors. Another important future direction will be to observe the activity of MGN interneurons or other locally projecting neurons which could have distinct activity patterns involved in the filtering and processing of information distinct from MGN → BLA cells.
We have demonstrated that the role of the MGN in relaying sensory information is not a static one. It appears to aid the learning process by filtering incoming sensory information to route saliency information directly to the BLA and other information, such as error signaling, indirectly to the BLA. While additional work will be necessary to evaluate the validity of our findings, our work supports the two-factor theory of emotion with MGN routing arousal information to the BLA and suggests the exploration of computations in the MGN that shape learning, as opposed to treating MGN as a sensory relay.
Data Availability
The datasets generated during and analysed during the current study are available from the corresponding author on reasonable request. Matlab codes for performing hierarchical clustering are available at https://github.com/Tyelab/HAC.
References
Allsop SA, Wichmann R, Mills F, Burgos-Robles A, Chang C-J, Felix-Ortiz AC, Vienne A, Beyeler A, Izadmehr EM, Glober G, Cum MI, Stergiadou J, Anandalingam KK, Farris K, Namburi P, Leppla CA, Weddington JC, Nieh EH, Smith AC, Ba D, Brown EN, Tye KM (2018) Corticoamygdala transfer of socially derived information gates observational learning. Cell 173:1329-1342.e18. https://doi.org/10.1016/j.cell.2018.04.004
Antunes R, Moita MA (2010) Discriminative auditory fear learning requires both tuned and nontuned auditory pathways to the amygdala. J Neurosci 30:9782–9787. https://doi.org/10.1523/JNEUROSCI.1037-10.2010
Aron A, Fisher H, Mashek DJ, Strong G, Li H, Brown LL (2005) Reward, motivation, and emotion systems associated with early-stage intense romantic love. J Neurophysiol 94:327–337. https://doi.org/10.1152/jn.00838.2004
Beyeler A, Namburi P, Glober GF, Simonnet C, Calhoon GG, Conyers GF, Luck R, Wildes CP, Tye KM (2016) Divergent routing of positive and negative information from the amygdala during memory retrieval. Neuron 90:348–361. https://doi.org/10.1016/j.neuron.2016.03.004
Beyeler A, Chang C-J, Silvestre M, Lévêque C, Namburi P, Wildes CP, Tye KM (2018) Organization of valence-encoding and projection-defined neurons in the basolateral amygdala. Cell Rep 22:905–918. https://doi.org/10.1016/j.celrep.2017.12.097
Boatman JA, Kim JJ (2006) A thalamo‐cortico‐amygdala pathway mediates auditory fear conditioning in the intact brain. European Journal of Neuroscience 24(3):894–900. https://doi.org/10.1111/j.1460-9568.2006.04965.x
Bordi F, LeDoux JE (1994) Response properties of single units in areas of rat auditory thalamus that project to the amygdala. II. Cells receiving convergent auditory and somatosensory inputs and cells antidromically activated by amygdala stimulation. Exp Brain Res 98:275–286. https://doi.org/10.1007/BF00228415
Burgos-Robles A, Kimchi EY, Izadmehr EM, Porzenheim MJ, Ramos-Guasp WA, Nieh EH, Felix-Ortiz AC, Namburi P, Leppla CA, Presbrey KN, Anandalingam KK, Pagan-Rivera PA, Anahtar M, Beyeler A, Tye KM (2017) Amygdala inputs to prefrontal cortex guide behavior amid conflicting cues of reward and punishment. Nat Neurosci 20:824–835. https://doi.org/10.1038/nn.4553
Chaudhri N, Sahuque LL, Janak PH (2008) Context-induced relapse of conditioned behavioral responding to ethanol cues in rats. Biol Psychiat 64:203–210. https://doi.org/10.1016/j.biopsych.2008.03.007
Chaudhri N, Sahuque LL, Schairer WW, Janak PH (2009) Separable roles of the nucleus accumbens core and shell in context-and cue-induced alcohol-seeking. Neuropsychopharmacology 35:783–791
Clem RL, Huganir RL (2010) Calcium-permeable AMPA receptor dynamics mediate fear memory erasure. Science 330:1108–1112. https://doi.org/10.1126/science.1195298
Cruikshank SJ, Edeline JM, Weinberger NM (1992) Stimulation at a site of auditory-somatosensory convergence in the medial geniculate nucleus is an effective unconditioned stimulus for fear conditioning. Behav Neurosci 106:471–483. https://doi.org/10.1037//0735-7044.106.3.471
Duvarci S, Pare D (2014) Amygdala microcircuits controlling learned fear. Neuron 82:966–980. https://doi.org/10.1016/j.neuron.2014.04.042
Edeline J-M, Neuenschwander-El Massioui N, Dutrieux G (1990) Discriminative long-term retention of rapidly induced multiunit changes in the hippocampus, medial geniculate and auditory cortex. Behav Brain Res 39:145–155. https://doi.org/10.1016/0166-4328(90)90101-J
Fenno L, Yizhar O, Deisseroth K (2011) The development and application of optogenetics. Annu Rev Neurosci 34:389–412. https://doi.org/10.1146/annurev-neuro-061010-113817
Ferrara NC (2015) Neural mechanisms supporting differential auditory fear conditioning. Theses and Dissertations 1048 https://dc.uwm.edu/etd/1048
Ferrara NC, Cullen PK, Pullins SP, Rotondo EK, Helmstetter FJ (2017) Input from the medial geniculate nucleus modulates amygdala encoding of fear memory discrimination. Learning & Memory 24(9):414–421
Han J-H, Yiu AP, Cole CJ, Hsiang H-L, Neve RL, Josselyn SA (2008) Increasing CREB in the auditory thalamus enhances memory and generalization of auditory conditioned fear. Learn Mem 15:443–453. https://doi.org/10.1101/lm.993608
Janak PH, Tye KM (2015) From circuits to behaviour in the amygdala. Nature 517:284–292. https://doi.org/10.1038/nature14188
Jones EG (1991) Chapter 3 The anatomy of sensory relay functions in the thalamus In: Holstege G (ed) Progress in Brain Research, Role of The Forebrain in Sensation and Behavior. Elsevier. pp 29–52. https://doi.org/10.1016/S0079-6123(08)63046-0
LeDoux JE (1986) Sensory systems and emotion: a model of affective processing. Integr Psychiatry 4:237–243
LeDoux JE (2000) Emotion circuits in the brain. Annu Rev Neurosci 23:155–184. https://doi.org/10.1146/annurev.neuro.23.1.155
Lee CC (2015) Exploring functions for the non-lemniscal auditory thalamus. Frontiers in Neural Circuits. https://doi.org/10.3389/fncir.2015.00069
Lee S-C, Cruikshank SJ, Connors BW (2010) Electrical and chemical synapses between relay neurons in developing thalamus. J Physiol 588:2403–2415. https://doi.org/10.1113/jphysiol.2010.187096
Lima SQ, Hromádka T, Znamenskiy P, Zador AM (2009) PINP: a new method of tagging neuronal populations for identification during in vivo electrophysiological recording. PLoS One 4:1–10. https://doi.org/10.1371/journal.pone.0006099
Maren S (2005) Synaptic mechanisms of associative memory in the amygdala. Neuron 47:783–786. https://doi.org/10.1016/j.neuron.2005.08.009
Maren S, Quirk GJ (2004) Neuronal signalling of fear memory. Nat Rev Neurosci 5:844–852. https://doi.org/10.1038/nrn1535
McKernan MG, Shinnick-Gallagher P (1997) Fear conditioning induces a lasting potentiation of synaptic currents in vitro. Nature 390:607–611. https://doi.org/10.1038/37605
Murray Sherman S (2001) Chapter 4 Thalamic relay functions Progress in brain research, vision: from neurons to cognition. Elsevier. pp 51–69. https://doi.org/10.1016/S0079-6123(01)34005-0
Nabavi S, Fox R, Proulx CD, Lin JY, Tsien RY, Malinow R (2014) Engineering a memory with LTD and LTP. Nature 511:348–352. https://doi.org/10.1038/nature13294
Namburi P, Beyeler A, Yorozu S, Calhoon GG, Halbert SA, Wichmann R, Holden SS, Mertens KL, Anahtar M, Felix-Ortiz AC, Wickersham IR, Gray JM, Tye KM (2015) A circuit mechanism for differentiating positive and negative associations. Nature 520:675–678. https://doi.org/10.1038/nature14366
Nieh EH, Matthews GA, Allsop SA, Presbrey KN, Leppla CA, Wichmann R, Neve R, Wildes CP, Tye KM (2015) Decoding neural circuits that control compulsive sucrose seeking. Cell 160:528–541. https://doi.org/10.1016/j.cell.2015.01.003
Nishijo H, Ono T, Nishino H (1988) Single neuron responses in amygdala of alert monkey during complex sensory stimulation with affective significance. J Neurosci 8:3570–3583. https://doi.org/10.1523/JNEUROSCI.08-10-03570.1988
Orsini CA, Maren S (2012) Neural and cellular mechanisms of fear and extinction memory formation. Neurosci Biobehav Rev, Memory Formation 36:1773–1802. https://doi.org/10.1016/j.neubiorev.2011.12.014
Parsons RG, Riedner BA, Gafford GM, Helmstetter FJ (2006) The formation of auditory fear memory requires the synthesis of protein and mRNA in the auditory thalamus. Neuroscience 141:1163–1170. https://doi.org/10.1016/j.neuroscience.2006.04.078
Rodrigues SM, Schafe GE, LeDoux JE (2001) Intra-amygdala blockade of the NR2B subunit of the NMDA receptor disrupts the acquisition but not the expression of fear conditioning. J Neurosci 21:6889–6896
Rogan MT, Stäubli UV, LeDoux JE (1997) Fear conditioning induces associative long-term potentiation in the amygdala. Nature 390:604–607. https://doi.org/10.1038/37601
Romanski LM, LeDoux JE (1992) Equipotentiality of thalamo-amygdala and thalamo-cortico-amygdala circuits in auditory fear conditioning. J Neurosci 12:4501–4509
Rumpel S, LeDoux J, Zador A, Malinow R (2005) Postsynaptic receptor trafficking underlying a form of associative learning. Science 308:83–88. https://doi.org/10.1126/science.1103944
Schachter S, Singer JE (1962) Cognitive, social, and physiological determinants of emotional state. Psychol Rev 69:379–399
Schultz W, Dayan P, Montague PR (1997) A neural substrate of prediction and reward. Science 275:1593–1599. https://doi.org/10.1126/science.275.5306.1593
Sciascia JM, Reese RM, Janak PH, Chaudhri N (2015) Alcohol-seeking triggered by discrete Pavlovian cues is invigorated by alcohol contexts and mediated by glutamate signaling in the basolateral amygdala. Neuropsychopharmacol 40:2801–2812. https://doi.org/10.1038/npp.2015.130
Senn V, Wolff SBE, Herry C, Grenier F, Ehrlich I, Gründemann J, Fadok JP, Müller C, Letzkus JJ, Lüthi A (2014) Long-range connectivity defines behavioral specificity of amygdala neurons. Neuron 81:428–437. https://doi.org/10.1016/j.neuron.2013.11.006
Sherman SM (2007) The thalamus is more than just a relay. Curr Opin Neurobiol, Sensory Systems 17:417–422. https://doi.org/10.1016/j.conb.2007.07.003
Siegle JH, López AC, Patel YA, Abramov K, Ohayon S, Voigts J (2017) Open Ephys: an open-source, plugin-based platform for multichannel electrophysiology. Journal of Neural Engineering 14(4):045003. https://doi.org/10.1088/1741-2552/aa5eea
Tye KM (2018) Neural circuit motifs in valence processing. Neuron 100:436–452. https://doi.org/10.1016/j.neuron.2018.10.001
Tye KM, Deisseroth K (2012) Optogenetic investigation of neural circuits underlying brain disease in animal models. Nat Rev Neurosci 13:251–266. https://doi.org/10.1038/nrn3171
Tye KM, Stuber GD, De Ridder B, Bonci A, Janak PH (2008) Rapid strengthening of thalamo-amygdala synapses mediates cue–reward learning. Nature 453:1253–1257
Tye KM, Cone JJ, Schairer WW, Janak PH (2010) Amygdala neural encoding of the absence of reward during extinction. J Neurosci 30:116–125. https://doi.org/10.1523/JNEUROSCI.4240-09.2010
Valyear MD, Glovaci I, Zaari A, Lahlou S, Trujillo-Pisanty I, Andrew Chapman C, Chaudhri N (2020) Dissociable mesolimbic dopamine circuits control responding triggered by alcohol-predictive discrete cues and contexts. Nat Commun 11:1–14
Weinberger NM (2011) The medial geniculate, not the amygdala, as the root of auditory fear conditioning. Hear Res 274:61–74. https://doi.org/10.1016/j.heares.2010.03.093
Wepsic JG (1966) Multimodal sensory activation of cells in the magnocellular medial geniculate nucleus. Exp Neurol 15:299–318. https://doi.org/10.1016/0014-4886(66)90053-7
Wolff M, Morceau S, Folkard R, Martin-Cortecero J, Groh A (2021) A thalamic bridge from sensory perception to cognition. Neurosci Biobehav Rev 120:222–235. https://doi.org/10.1016/j.neubiorev.2020.11.013
Yaniv D, Schafe GE, LeDoux JE, Richter-Levin G (2001) A gradient of plasticity in the amygdala revealed by cortical and subcortical stimulation, in vivo. Neuroscience 106:613–620. https://doi.org/10.1016/s0306-4522(01)00312-8
Acknowledgements
KT is an HHMI Investigator and the Wylie Vale Professor at the Salk Institute for Biological Studies, and this work was supported by funding from Salk, HHMI, Clayton Foundation, Kavli Foundation, Dolby Family Fund, R01-MH115920 (NIMH), R37-MH102441 (NIMH), and Pioneer Award DP1-AT009925 (NCCIH).
Author information
Authors and Affiliations
Contributions
CL and KT conceptualized the project and designed the experiments. CL performed preprocessing components of data analysis for the behavioral and in vivo electrophysiology data. LK performed statistical analyses and hierarchical clustering analyses. LK and KB analyzed neural trajectories. LK conceptualized and implemented hierarchical clustering of spike count data and implemented state-space analysis. GM, GG, MJ, YF, and CL contributed to data collection. JO, EN, CW, RW, AB, and EK contributed technical expertise in establishing hardware or software used for data collection. LK, CL, and KT wrote the manuscript; CL, GM, and LK plotted the figures; all authors contributed to revising the manuscript. KT supervised all stages of the work.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article belongs to a Special Issue on Conditioned Determinants of Reward Seeking
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Leppla, C.A., Keyes, L.R., Glober, G. et al. Thalamus sends information about arousal but not valence to the amygdala. Psychopharmacology 240, 477–499 (2023). https://doi.org/10.1007/s00213-022-06284-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00213-022-06284-5