Kinetic features dictate sensorimotor alignment in the superior colliculus

González-Rueda, Ana; Jensen, Kristopher; Noormandipour, Mohammadreza; de Malmazet, Daniel; Wilson, Jonathan; Ciabatti, Ernesto; Kim, Jisoo; Williams, Elena; Poort, Jasper; Hennequin, Guillaume; Tripodi, Marco

doi:10.1038/s41586-024-07619-2


Article
Open access
Published: 03 July 2024

Volume 631, pages 378–385, (2024)

Abstract

The execution of goal-oriented behaviours requires a spatially coherent alignment between sensory and motor maps. The current model for sensorimotor transformation in the superior colliculus relies on the topographic mapping of static spatial receptive fields onto movement endpoints^1,2,3,4,5,6. Here, to experimentally assess the validity of this canonical static model of alignment, we dissected the visuo-motor network in the superior colliculus and performed in vivo intracellular and extracellular recordings across layers, in restrained and unrestrained conditions, to assess both the motor and the visual tuning of individual motor and premotor neurons. We found that collicular motor units have poorly defined visual static spatial receptive fields and respond instead to kinetic visual features, revealing the existence of a direct alignment in vectorial space between sensory and movement vectors, rather than between spatial receptive fields and movement endpoints as canonically hypothesized. We show that a neural network built according to these kinetic alignment principles is ideally placed to sustain ethological behaviours such as the rapid interception of moving and static targets. These findings reveal a novel dimension of the sensorimotor alignment process. By extending the alignment from the static to the kinetic domain this work provides a novel conceptual framework for understanding the nature of sensorimotor convergence and its relevance in guiding goal-directed behaviours.


Main

Goal-oriented behaviours rely on the integration of relevant sensory signals and their transformation into appropriate motor actions. To that end, premotor centres such as the superior colliculus (SC) need to embed connectivity matrices that ensure a meaningful alignment of spatial and motor maps^{7,8,9,10,11,12,13}. The layered and modular structure of the SC, with its distinct sensory and motor maps^{9,14,15,16,17,18,19,20}, makes it a tractable system for the study of the connectivity rules underpinning sensorimotor convergence. It provides a conceptual blueprint for understanding the circuit and computational logic that underlies spatiomotor alignment.

To unambiguously assess the functional alignment between sensory and motor maps, one should systematically measure both motor and sensory responses, focusing particularly on the sensory tuning of well-defined motor units. However, traditionally, sensory and motor tuning within the SC have been studied either independently^16,21,22,23, thus precluding the identification of convergent signals, or in animals trained to orient to sensory stimuli^3,4,24,25,26, which impedes the unambiguous dissociation of visual responses from movement preparation signals. Moreover, these earlier studies have primarily focused on characterizing the static and spatially restricted receptive fields of visuo-motor units, while neglecting responses to kinetic visual features, which are more prominent in the intermediate and deep layers of the SC, also known as the motor layers^21,22. This has led to the currently accepted view that sensorimotor integration in the SC results from the systematic mapping of static sensory features, such as spatial visual receptive fields, and movement vector endpoints. Here we experimentally assessed the validity of this canonical model, first by dissecting the collicular microcircuit responsible for the transfer of visual features from the retina to genetically defined collicular motor units in vitro. Then, we characterized the responses to visual stimuli in collicular premotor units in vivo using whole-cell recordings, tetrode recordings and 2-photon Ca²⁺ imaging. Finally, we characterized the conjunctive visual and motor tuning properties of identified individual collicular motor units. Contrary to the currently held view, we found that motor units in the SC have poor or no static spatial receptive fields (ssRFs) and are instead primarily tuned to kinetic visual features. Specifically, collicular motor units are preferentially tuned to visual flow of the opposite direction to the head movement that they decode. 
We show that a network model built on these kinetic alignment principles is ideally placed to support key ethological functions of the SC, such as prey capture, by facilitating the rapid interception of both moving and static targets in an energetically favourable manner.

In summary, these findings challenge the traditional view of the mapping of static sensory fields onto movement endpoints as the guiding principle of sensorimotor convergence and favour a kinetic model based on the alignment in vectorial space between sensory flow and movement vectors. Although in this study we focused exclusively on visuo-motor transformations in the mouse SC, the kinetic model presented here is potentially compatible with the mapping of any sensory modality onto movement vectors and might represent a general principle guiding sensorimotor convergence in other brain areas and in other species.

Dual retino-premotor pathway in the SC

The electrophysiological and morphological properties of neurons across different layers of the SC have been widely studied^{9,27,28,29,30,31}. However, the specific connectivity between retino-recipient neurons in the superficial layers and motor neurons in the intermediate layers remains unknown, mainly owing to the difficulty of unambiguously identifying collicular motor units. Our recent identification of genetically defined collicular motor neurons that express the transcription factor PITX2 and drive specific 3D head rotations⁹ provides a useful entry point to characterize the logic of convergence of retinal input onto the collicular premotor network. Thus, we first assessed the location of neurons presynaptic to Pitx2^ON cells (premotor collicular (Pitx2^ON-PRE) neurons) within the SC using rabies virus-based synaptic tracing (Fig. 1a,b and Extended Data Fig. 1a). We found that Pitx2^ON-PRE neurons were mostly located within the intermediate layers of the colliculus near to Pitx2^ON neurons, with a smaller fraction of about 12% of Pitx2^ON-PRE neurons being located in the superficial visual layers (Fig. 1b,c). To assess how visual information is transmitted onto motor neurons, we then used a combination of viral tracing tools to light-activate retinal inputs while recording collicular Pitx2^ON and Pitx2^ON-PRE neurons (Fig. 1d and Extended Data Fig. 1c). Retinal inputs to the SC were specifically targeted by intraocular injections of AAV2-CAG-ChR2. This resulted in localized expression of ChR2 in retinal axons restricted to the superficial layers of the SC (Fig. 1e). We targeted Pitx2^ON-PRE neurons in different collicular layers for whole-cell recording in acute brain slices and assessed the prevalence and strength of their retinal input (Fig. 1f–g). The reversal potential of monosynaptic currents following light activation indicates an exclusively excitatory retinal drive onto Pitx2^ON-PRE (Extended Data Fig. 1d). 
Pitx2^ON-PRE neurons were pooled according to their distribution across layers within three main domains: (1) stratum zonale (SZ) and stratum griseum superficiale (SGS), hereafter referred to as SZ/SGS or superficial layers; (2) the stratum opticum (SO) and upper stratum griseum intermediale (uSGI), hereafter referred to as SO/uSGI or upper intermediate layers; and (3) lower SGI (lSGI) and stratum griseum profundum (SGP), hereafter referred to as lSGI/SGP or lower intermediate and deep layers. All Pitx2^ON-PRE neurons recorded in the superficial layers received direct monosynaptic retinal input (approximately 5 ms latency). By contrast, 12.5% of neurons recorded in the upper intermediate layers received monosynaptic retinal input and 37.5% received polysynaptic input. Among the neurons recorded in the lower intermediate and deeper layers, 10% showed a direct monosynaptic activation and 30% showed a polysynaptic activation (Fig. 1g).

**Fig. 1: Two functionally distinct premotor populations channel direct and indirect retinal input onto collicular motor neurons.**

We also identified the presence of prominent tonic inhibition on Pitx2^ON-PRE neurons by applying antagonists of type A and type B GABA (γ-aminobutyric acid) receptors (GABA_A and GABA_B receptors, respectively) during recordings. Specifically, intermediate Pitx2^ON-PRE neurons receiving disynaptic retinal input were strongly disinhibited in the presence of the GABAergic receptor antagonists (Fig. 1h). This was observed as an increase in the likelihood of eliciting spikes, the number of spikes elicited (Fig. 1i) and the duration of the depolarization, which lasted up to 20 s following a 5-ms retinal stimulation (Extended Data Fig. 1j). Notably, Pitx2^ON-PRE disinhibition led to a delayed but strong activation of Pitx2^ON neurons following retinal activation (Fig. 1h,i). Monosynaptic SZ Pitx2^ON-PRE neurons showed a significantly reduced response to disinhibition compared with deeper premotor and motor neurons (Fig. 1i). Although we found that half of the Pitx2^ON-PRE neurons displayed retinal activation, this is likely to be an underestimation owing to the inability to target all retinal ganglion cells and the existence of slower response kinetics (Extended Data Fig. 1e–j).

Overall, we found that prominent retinal input reaches collicular motor neurons via two major classes of premotor neurons that mediate visuo-motor transformations. One class of premotor neurons (22% of total premotor neurons) receives direct retinal input and is located predominantly in superficial layers. The other (28% of total premotor neurons) receives disynaptic or polysynaptic retinal input, is located in the intermediate and deep collicular layers and is modulated by a tonic inhibitory gate (Fig. 1j).

Motor layers respond to kinetic visual features

Neurons in the superficial layers of the SC display strong responses to drifting gratings and have receptive fields consistent with a retinotopic map^13,18. To test whether collicular premotor neurons inherit any of those visual features, we first performed in vivo whole-cell recordings across visual and motor layers of the SC to assess both the subthreshold and suprathreshold tuning of collicular neurons in response to moving gratings or flashing squares (Fig. 2a and Methods).

**Fig. 2: Premotor neurons and neurons in motor layers respond preferentially to visual flow.**

As expected, a high proportion of neurons in superficial layers displayed prominent responses to moving gratings and well-defined spatially confined static receptive fields (ssRFs) as measured with localized flashing black or white squares (65% and 54% of neurons, respectively; Fig. 2b,d,f and Extended Data Fig. 2a,e,f). Some neurons also had morphological and physiological properties consistent with narrow-field neurons, which have been proposed to be direction-selective (DS) or orientation-selective (OS), involved in orienting behaviours³², and to project vertically to intermediate and deep layers of the SC²⁷ (Extended Data Fig. 2a and Supplementary Tables 1 and 2). Indeed, we found that Pitx2^ON-PRE neurons partially overlap with a genetically defined population of narrow-field neurons (Extended Data Fig. 1n–p). Similarly to previous studies in primary visual cortex (V1)³³, intracellular recordings in superficial layers of the SC revealed broad subthreshold tuning to all directions with a preferred tuning to the selective orientation or direction, and more sharply tuned spiking activity (Fig. 2b and Extended Data Fig. 2b).

The proportion of neurons with ssRFs was lower in intermediate and deep motor layers of the SC (11% of neurons; Fig. 2e,h). However, 56% of neurons in motor layers were still tuned to moving gratings, 20% of which displayed subthreshold tuning only (Fig. 2c,h). The greater latency and sharp subthreshold tuning in intermediate and deep layers could reflect synaptic input from the putative presynaptic narrow-field neurons recorded in the superficial layers (Fig. 2b,f and Extended Data Fig. 1p). The lack of suprathreshold tuning in those neurons is compatible with the presence of tonic inhibitory currents on the collicular premotor network (Fig. 1h), which might be further enhanced by anaesthesia. Notably, both the selectivity index and the maximum response were significantly higher for moving than for static gratings (Fig. 2g and Extended Data Fig. 2c,d).
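To make the notion of a selectivity index concrete, the sketch below computes one widely used definition, the vector-sum (global) direction and orientation selectivity indices, from responses to gratings drifting in evenly spaced directions. This is an assumption for illustration only: the index actually used in the study is defined in its Methods and may differ, and the function names and example responses are hypothetical.

```python
import cmath
import math


def direction_selectivity(directions_deg, responses):
    """Vector-sum direction selectivity index (0 to 1) and preferred direction (deg).

    Assumes non-negative responses (for example, baseline-subtracted firing rates).
    """
    total = sum(responses)
    if total <= 0:
        return 0.0, float("nan")
    # Each response is a vector pointing along its stimulus direction.
    vec = sum(r * cmath.exp(1j * math.radians(d))
              for d, r in zip(directions_deg, responses))
    return abs(vec) / total, math.degrees(cmath.phase(vec)) % 360.0


def orientation_selectivity(directions_deg, responses):
    """Vector-sum orientation selectivity index (0 to 1).

    Angles are doubled so that opposite directions count as the same orientation.
    """
    total = sum(responses)
    if total <= 0:
        return 0.0
    vec = sum(r * cmath.exp(2j * math.radians(d))
              for d, r in zip(directions_deg, responses))
    return abs(vec) / total


# Hypothetical responses to eight drift directions (0, 45, ..., 315 degrees).
directions = [0, 45, 90, 135, 180, 225, 270, 315]
ds_cell = [0, 0, 1, 0, 0, 0, 0, 0]   # responds to one direction only
os_cell = [1, 0, 0, 0, 1, 0, 0, 0]   # responds equally to opposite directions
dsi, pref = direction_selectivity(directions, ds_cell)
```

With this definition a purely direction-selective unit scores close to 1 on the direction index, whereas a unit responding equally to two opposite directions scores close to 0 on the direction index and close to 1 on the orientation index.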

To confirm whether kinetic features were also preferentially transferred across the SC during wakefulness, we next assessed the visual tuning of SC neurons using chronic tetrode recordings of awake head-fixed mice, while concurrently monitoring eye movements and running bouts on a wheel equipped with an accelerometer (Fig. 2i–n and Extended Data Figs. 3a–o and 4a–j). The overall proportion of OS and DS neurons decreased only slightly across layers (from 47% in superficial layers to 38% in lSGI and deep layers; Fig. 2n and Methods) but visually tuned neurons in the superficial layers were more sharply tuned (Fig. 2k) and exhibited a 1.9-fold increased firing rate at their preferred direction compared with visually tuned neurons of the lower intermediate and deep layers (Fig. 2l and Extended Data Fig. 3l). We found a bias towards DS neurons (Extended Data Fig. 3h,i), and although direction selectivity was modulated by running, this did not affect the preferred tuning of those neurons²¹ (Extended Data Fig. 4d–g). Of note, a much smaller fraction of collicular neurons presented ssRFs in response to flashing squares; this fraction decreased sharply across layers, from 19% in superficial layers to 9% in lower intermediate and deep layers (Fig. 2n and Extended Data Fig. 3o).

We also assessed the responses of neurons to full-field changes in luminance, directionally moving spots mimicking a moving target, and localized Gabor patches drifting in eight different directions (Extended Data Fig. 5 and Methods) as examples of ethologically relevant full-field static and spatially localized kinetic visual stimuli, respectively. We found that changes in luminance elicited strong responses across the colliculus, with 30% of neurons in the superficial layers and 18% in the lower intermediate and deep layers responding to either positive or negative changes in luminance (Extended Data Fig. 5l–n). SC neurons were also tuned to target-mimicking spots moving towards and away from the centre of gaze. Although the spots only crossed 18% of the screen throughout the full stimulation protocol, 21% of all collicular neurons recorded showed a directional response to moving spots. Whereas 75% of these responses in superficial layers reflected the activation of the neuron’s spatial receptive field, this decreased to 33% in lower intermediate and deep layers (Extended Data Fig. 5j). Similarly, 71% of the responses to drifting Gabor patches in superficial layers corresponded to the activation of spatial receptive fields, whereas this proportion decreased to 24% in lower intermediate and deep layers, where DS and OS neurons were more prevalent (Extended Data Fig. 5g–i), further strengthening the notion that kinetic features of the visual scene are preferentially transmitted to the motor layers of the SC. Notably, we found that the preferred tuning direction was conserved across stimulation paradigms, meaning that neurons tuned to a particular direction of gratings were also tuned to moving spots and drifting Gabor patches of the same direction (average preferred gratings direction − spots direction = 30°, average preferred gratings direction − Gabor direction = 27°; Extended Data Fig. 5k).
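One plausible way to summarize such a comparison of preferred directions across stimulation paradigms is the mean absolute angular difference, which takes the wrap-around at 360° into account. The sketch below is an assumption about how a figure of this kind could be computed, not the study's own procedure, and the preferred directions listed are hypothetical.

```python
def angular_distance_deg(a, b):
    """Smallest absolute difference between two directions, in degrees (0 to 180)."""
    d = abs(a - b) % 360.0
    return min(d, 360.0 - d)


def mean_angular_distance(pairs):
    """Mean absolute angular difference over (direction_a, direction_b) pairs."""
    return sum(angular_distance_deg(a, b) for a, b in pairs) / len(pairs)


# Hypothetical preferred directions (gratings, moving spots) for three neurons.
example_pairs = [(350.0, 10.0), (90.0, 120.0), (180.0, 140.0)]
mean_diff = mean_angular_distance(example_pairs)
```

Note that for two unrelated sets of preferred directions this measure would average about 90°, so values near 30° indicate substantial agreement between paradigms.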

To confirm whether premotor neurons encode visual kinetic features, we leveraged two newly developed tools—intersectional gene expression³⁴ and self-inactivating rabies (SiR)^35,36,37—that enabled us to selectively record from genetically defined Pitx2^ON-PRE premotor neurons. We performed an initial injection with a mix of an AAV with a FLEX-dependent conditionally expressed TVA receptor together with either an AAV containing Flp-dependent GCamp6f, or an AAV with Cre^OFF/Flp^ON-ChR2 in Pitx2-cre::tdTomato mice, followed by a second injection with a SiR expressing the Flp recombinase (Fig. 2o). We recorded Pitx2^ON-PRE neurons using either optetrodes, enabling us to also record the same neurons in freely moving mice, or under a 2-photon microscope, yielding higher numbers of recordings to assess their visual tuning properties in superficial and upper intermediate layers. We found the same preservation of tuning to kinetic features across layers of the SC as previously found in wild-type mice (Fig. 2p–r and Extended Data Fig. 6). Together, these data show that premotor neurons in lower intermediate and deep layers of the SC receive preferential input reporting kinetic features of the visual scene, such as externally generated motion flow. This is surprising given that visual-motor integration in the SC has generally been hypothesized to rely on the alignment of static sensory and motor features, such as the stimulus spatial receptive field and movement endpoints^1,2,3,4,5,6. These findings suggest the need to reconsider the computational and circuit strategies underpinning the visuo-motor alignment and the visuo-motor transformation by taking into account the kinetic domain.

Kinetic anti-alignment of visuo-motor neurons

To characterize the degree of visuo-motor alignment, we next identified and recorded motor units tuned to 3D head rotations in awake freely moving mice. Head-movement-related neurons were identified on the basis of their firing activity while the mouse was freely foraging for droplets of condensed milk in a white open-field arena with minimal external visual information in both dark and light conditions^9,38. Motor units, consistently tuned to specific head rotations in all light and dark trials, were—as a population—tuned to the expected head-movement vector encoded in the area of the SC used for recordings⁹, which was also consistent across trials (Fig. 3a–c and Extended Data Fig. 7a–d). As expected, motor units were primarily located within the deep layers of the SC (representing 7% of neurons in putative SZ/SGS, 16% of neurons in SO/uSGI and 25% of neurons in lSGI; Fig. 3e). We also identified neurons that were tuned only in light or only in darkness that were preferentially located in more superficial layers (Fig. 3e and Extended Data Fig. 7a–k); however, we focused on well-defined motor units to avoid the confounding effect of purely visually driven responses. To confirm the absence of visually driven responses in motor units during foraging, we enforced self-generated visual flow in the open arena by covering the walls with a pattern of either horizontal or vertical bars. As expected, motor tuning was conserved in all tested conditions (Extended Data Fig. 8a–c).

**Fig. 3: Collicular motor units respond preferentially to externally generated visual flow of the opposite direction to their motion vector tuning.**

Next, we assessed the visual response properties of the same units under head restraint. We measured their responses to static and moving sinusoidal gratings as well as their receptive field responses (Fig. 3d, Extended Data Fig. 3 and Methods). Surprisingly, around half of all motor-tuned neurons across all layers were also tuned to moving gratings (60% in putative SZ/SGS, 46% in SO/uSGI and 55% in lSGI; Fig. 3e), preferentially displaying direction selectivity, and none had a detectable ssRF. Notably, we found that none of the visuo-motor neurons recorded were tuned to eye movements in head-restrained conditions (Extended Data Fig. 4h–j).

To determine whether a coherent alignment exists between the head motion vector that these neurons decode and their preferred direction tuning to drifting gratings, we built a geometrical model of the head and eye of the mouse that enabled us to infer the gaze path resulting from any given movement of the head. Assuming a resting position of the mouse’s head in front of the screen at the time of spike (Extended Data Fig. 9e), equivalent to that under head restraint, we translated the time course of yaw, pitch and roll angles of head rotation decoded from each single unit into an equivalent time course of gaze location on the screen (Fig. 3f). We then estimated the average direction of this gaze path which could be directly compared to the direction of a drifting grating (Fig. 3g). We subtracted the decoded gaze motion direction (in degrees) from the preferred grating direction of each visuo-motor unit. The resulting angle would be 0° if head movement and preferred gratings directions were perfectly aligned, and 180° if they were anti-aligned (Fig. 3h). The recorded motor units exhibited visual tuning largely anti-aligned to their motor tuning (190°; Rayleigh P = 0.03; Fig. 3h). This empirical distribution of sensorimotor alignment was indistinguishable from a random distribution with equal s.d. centred at 180°. However, it differed significantly from control distributions centred at 0°, 90° or 270° (Watson–Williams F-statistic 0.24 at 180° versus 43.72 at 0° for a critical value of 3.85; Extended Data Fig. 9a–d). The same anti-alignment was found in genetically defined Pitx2^ON motor neurons (Extended Data Fig. 6a–c) and for neurons tuned to both head rotations and moving spots (Extended Data Fig. 9j). We also found an alignment between gaze direction and preferred Gabor patch location for neurons tuned to head rotations and Gabor patches (Extended Data Fig. 9k). 
However, more than 90% of these units maintained their motor tuning only in light conditions but lost their head-rotation tuning in darkness, implying that, although these are not canonical motor units, they are reminiscent of visuo-motor units found in primates³⁹.
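The alignment analysis described above can be sketched as follows, under simplifying assumptions: the gaze path is taken as a list of hypothetical screen coordinates, its direction is approximated by the net displacement from first to last point, and the Rayleigh P value uses a standard closed-form approximation. The full geometrical head and eye model and the Watson–Williams comparison are not reproduced here, and all example values are invented for illustration.

```python
import cmath
import math


def path_direction_deg(path):
    """Direction (deg) of the net displacement along a gaze path of (x, y) points."""
    (x0, y0), (x1, y1) = path[0], path[-1]
    return math.degrees(math.atan2(y1 - y0, x1 - x0)) % 360.0


def alignment_angles(pref_grating_deg, gaze_dir_deg):
    """Preferred grating direction minus decoded gaze direction, wrapped to 0-360.

    0 means aligned visual and motor tuning; 180 means anti-aligned.
    """
    return [(g - m) % 360.0 for g, m in zip(pref_grating_deg, gaze_dir_deg)]


def circular_mean_deg(angles_deg):
    """Circular mean of a list of angles, in degrees (0 to 360)."""
    vec = sum(cmath.exp(1j * math.radians(a)) for a in angles_deg)
    return math.degrees(cmath.phase(vec)) % 360.0


def rayleigh_p(angles_deg):
    """Approximate P value of the Rayleigh test for non-uniformity of angles."""
    n = len(angles_deg)
    resultant = abs(sum(cmath.exp(1j * math.radians(a)) for a in angles_deg))
    return math.exp(math.sqrt(1 + 4 * n + 4 * (n * n - resultant ** 2)) - (1 + 2 * n))


# Hypothetical units: preferred grating directions and decoded gaze directions.
gratings = [180.0, 200.0, 170.0, 190.0]
gaze = [0.0, 10.0, 350.0, 5.0]
diffs = alignment_angles(gratings, gaze)
mean_alignment = circular_mean_deg(diffs)
p_value = rayleigh_p(diffs)
```

In this toy data set the differences cluster around 180°, so the circular mean falls near anti-alignment and the Rayleigh approximation returns a small P value; angles spread evenly around the circle would instead give a P value close to 1.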

In freely moving conditions mice can display both gaze-shifting saccades and compensatory eye movements, aligned and anti-aligned to head rotations, respectively^40,41, with the former being predominant during goal-oriented behaviours. Although all visuo-motor units were recorded during goal-oriented foraging, we also considered the effect of purely compensatory eye movements during head rotations by implementing our geometrical model of the mouse gaze with head–eye rotations measured during a foraging task (Extended Data Fig. 9f–h). Incorporating these compensatory eye movements in the model estimating the average gaze direction did not change the results (Extended Data Fig. 9i).

Overall, these data point to a systematic congruence in kinetic space between visual and motor features such that the movement vector tuning of motor collicular units is anti-aligned to their tuning to externally generated visual flow. These findings suggest that the connectivity of the visuo-motor system in the SC is constrained within coherent visuo-motor direction columns^9,18, a hypothesis that is also supported by our assessment of the visual tuning preferences of the premotor network, whereby neurons presynaptic to Pitx2 neurons tend to cluster in direction coherent columns (Extended Data Fig. 6l,m). Given the lateralization of both visual and motor responses in the SC, the uncovered alignment would favour the rapid interception of targets moving towards the centre of the visual field, suggesting a potential ethological advantage for such an arrangement in prey capture-like scenarios.

The kinetic model supports target interception

To assess quantitatively how these connectivity premises can be exploited to direct spatially tuned actions towards moving and static targets, we built a three-layer network model of the left SC composed of an ssRF layer, a DS layer and a motor layer. The connectivity between these layers was strictly top-down and constrained by the experimentally observed connectivity, visual tuning and sensorimotor alignment (Fig. 4a). We also incorporated in the model the experimental observation that OS and DS neurons in the superficial layer of the SC are arranged in a centripetal pattern^18,19,42, a finding also confirmed in our recordings (Fig. 4a and Extended Data Fig. 10a–c). We used this three-layer network to systematically compare how a direct retino-premotor pathway (‘kinetic pathway’, orange; Fig. 1j and Fig. 4a), which relies exclusively on kinetic information about the visual target, and an indirect retino-premotor pathway (‘static pathway’, blue; Fig. 1j and Fig. 4a), which relies on ssRF activation, would each enable interception of moving and static targets.

**Fig. 4: The direct kinetic visuo-motor pathway model supports energy-efficient and fast target interception.**

In the model that engages the kinetic pathway, a moving target activates neurons in the DS layer that are tuned to the radial component of target motion. In turn, these DS neurons excite those motor units with opposite (anti-aligned) preferred movement vector. This operating regime supports fast interception of moving targets if the target motion includes a radial component towards the agent. When quantifying the accuracy of this model by the Euclidean distance to the target at the end of each simulated trial, we found it to be comparable to that of a canonical static model that first engages ssRF neurons (Fig. 4d,g,i).

Of note, visuo-motor anti-alignment was the only alignment that supported target interception, and it was required whichever of the two pathways was engaged. When we consistently shifted the alignment between the DS layer and the motor layer, the agent failed to reach the target (Fig. 4h).

The model is also compatible with reaching static targets. In this scenario, the location of the visual stimulus would drive the appropriate neurons in the ssRF layer. The ssRF neurons then feed into the DS layer according to the topographic constraints dictated by concentricity, and the DS neurons activate the motor neurons on the basis of the kinetic alignment constraints described above (Fig. 4e,f).

Because our network is constrained by the concentricity of the encoded visual features, we found that the kinetic pathway prioritizes the interception of targets with the highest chances of success by ignoring targets that are moving away from the agent. This feature of the model improved energy efficiency as measured by the total amount of movement produced over all attempted interceptions (Fig. 4j,k). Although our results indicate a preferential bias towards fast, energy-efficient target interception, the model is also consistent with tracking and reaching of eccentric targets if the retinotopic layer can produce enough drive, in agreement with our finding of a strong inhibitory gate primarily impinging on this pathway (Fig. 1h–j) and affecting direction and orientation selectivity (Extended Data Fig. 3p–s).

Overall, these simulations show that a collicular network whose sensorimotor connectivity is constrained by the type of kinetic alignment that we have uncovered experimentally here is able to sustain key ethological functions that are normally associated with collicular activity, such as prey capture.

Discussion

Understanding the circuitry and computational logic underpinning the transformation of sensory signals into motor commands is essential to understand how animals interact with the environment. The lower intermediate and deep layers of the SC are involved in the translation of sensory signals into motor commands for the control of eye and head movements and provide a well-defined system to characterize the general logic of sensorimotor transformations. However, until now, it has remained unclear what type of sensory stimuli are conveyed to collicular motor units, through what circuitry, and what type of spatial alignment exists, if any, between the encoded sensory responses and the decoded movement vectors. Understanding these aspects—and particularly the issue of the geometrical alignment between sensory and motor signals in well-defined motor units—is key to understanding how spatio-temporal information, as conveyed by the sensory system, is translated into appropriate motor commands.

Here, we reveal a disynaptic collicular pathway that relays visual information from the eye onto collicular motor units. We show that motor-tuned collicular units respond primarily to kinetic visual features. Specifically, we have uncovered the existence of a counterintuitive spatial visuo-motor alignment whereby motor-tuned collicular units preferentially receive visual flow input of the opposite direction to the movement vector that they decode. Although our superficial layer recordings support the existence of a saliency map in retinotopic space^43,44, we also uncovered a previously overlooked feature-biased saliency-to-action map conversion in deeper layers of the SC. Essentially, the logic governing sensorimotor transformation in the SC is not driven primarily by the spatial mapping of visual receptive fields onto movement vector endpoints, as commonly hypothesized^1,2,3,4,5,6; instead, it is driven by the alignment of externally generated visual flow and movement direction.

The broadly accepted model that we challenge here, in which static spatial visual receptive fields are mapped onto movement vector endpoints, seems to originate primarily from two lines of evidence: first, from the observation that the map of visual receptive fields in the superficial SC is broadly aligned with the movement vector map of the intermediate layers; and second, from the apparently narrow receptive field of visuo-motor cells as observed in classical primate studies^3,45,46. However, a topographic alignment of superficial receptive field and deeper movement vector maps in the SC is not in itself an indication of direct connectivity and, although prominent in superficial layers, localized visual receptive field responses tend to disappear deeper in the SC²², weakening the support for direct flow of information from ssRF responsive neurons to collicular motor units. Importantly, the topographic nature of the visual receptive field maps is mirrored in the topographic alignment of DS and OS neurons in the superficial SC^17,19,42, which we propose ultimately mediates the sensorimotor transformation process. Indeed, in our network model, the concentricity of direction-selective responses is sufficient to mediate target-directed movements within the proposed kinetic alignment framework. The narrow receptive field of putative visuo-motor cells previously described in primates has been characterized in trained animals during target-directed tasks. Therefore, it is difficult to decouple the sensory response to the cue from the underlying motor programme^47,48. Indeed, during the execution of spontaneous saccades (that is, not in target-driven tasks), the motor tuning preferences of bona fide visuo-motor cells are only maintained in light trials, casting doubt on their true motor nature³⁹. 
Instead, even in primates, units that exhibit motor tuning in both dark and light trials (which one would regard as pure motor units) do not show obvious spatial receptive field responses³⁹.

Our study provides experimental support to an alternative kinetic alignment model, whereby DS visual neurons in the superficial and upper intermediate layers of the SC selectively converge onto collicular motor units responsible for driving movement in the direction opposite to the incoming sensory flow. We show that a network model built on this logic is able to perform spatially accurate visually guided actions such as visual grasping towards static and moving targets. The SC has been implicated in other ethologically relevant behaviours such as escape from predators^12,49,50. For simplicity, our model of the left SC is built in homogeneously distributed columns spanning the whole SC; however, there are input–output differences in different areas of the SC⁵¹, which have also been associated with different behaviours. Here we have recorded in the lateral-temporal area, which preferentially encodes visual information of the lower visual field, linked to predation, and where genetically defined goal-oriented motor units are enriched^9,49,52. Moreover, we have only tested motor tuning during goal-oriented foraging. Although the alignment that we describe could also potentially account for orienting towards shelter⁵³ or for the preferred direction of escape depending on the angle of approach of a predator, this remains to be tested. Furthermore, we propose that the kinetic anti-alignment that we have uncovered in afoveate mice could also expand to other systems and be the substrate of other ethologically relevant goal-oriented behaviours observed in the animal kingdom, such as the preferential directional flight of flies towards incoming odours⁵⁴ or the bias towards upstream-orienting movements in swimming and flying animals nearing a final goal destination⁵⁵.

The proposed kinetic alignment framework, which also recognizes the existence of a localized receptive field map in superficial layers and predicts a receptive field-to-DS-to-motor alignment, would fit well not only with early studies on sensorimotor transformation in the SC but also with two recent discoveries about the organization of visual and motor responses in the SC: namely, the existence of orientation and direction columns in the superficial layer of the SC and their concentric organization^17,18,19,22,42, and the discovery of topographically arranged motor columns in the intermediate layers of the SC⁹. The topographic motor organization shown by our model matches that previously found for head movements in mice⁹ and saccadic movements in monkeys¹³. In that regard, it is possible that the alignment of these two modular systems might provide the anatomical substrate for the selective sensorimotor convergence that we have unravelled here.

Methods

Mice

Seven- to fourteen-week-old C57BL/6 wild-type, Pitx2-cre::Tau-LSL-Flp0-INLA (Pitx2-Flp, derived from Pitx2-cre and Tau-LSL-Flp0-INLA mice, provided by J. Martin and S. Arber, respectively), Pitx2-cre::Rosa-LSL-tdTomato (derived from Pitx2-cre and Rosa-LSL-tdTomato mice; 007914, The Jackson Laboratory) or Vgat-cre (016961, The Jackson Laboratory) mice were used for this study. Mice of both sexes were used for anatomical experiments and only males were used for behavioural experiments. All animal procedures were conducted in accordance with the UK Animals (Scientific procedures) Act 1986 and European Community Council Directive on Animal Care under project license PPL PCDD85C8A and approved by The Animal Welfare and Ethical Review Body (AWERB) committee of the MRC Laboratory of Molecular Biology. Animals undergoing surgical implantation were individually housed to prevent damage to implants. Lighting was set to a reversed light:dark cycle, with simulated dawn and dusk at 19:00 and 07:00, respectively. Temperature was controlled at 19–23 °C and humidity at 45–65%. For open-field recordings, mice were placed on a restricted diet sufficient to maintain 85% of their free-feeding weight. When possible, analyses were blinded to data collection. While sample sizes were not determined using statistical methods, we selected sample sizes for each experiment based on variance observed in similar studies^9,38 and on practical experimental considerations.

Surgery

Mice were anaesthetized with isoflurane. Upon cessation of reflexes, the top of the head of the mouse was shaved, the mouse was placed on a stereotaxic frame and the skin was opened with a scalpel in a single clean vertical cut.

Viral injections in the brain were done using a nanoject (Scientific Laboratory Supplies) equipped with a pulled borosilicate glass capillary (1.5 outer diameter × 0.86 internal diameter × 100 mm, Harvard Apparatus). Up to a maximum of 300 nl of virus were injected at 3 heights within the SC (coordinates: 3.80 AP, 1 ML, 1.2, 1.5 and 1.8 DV) at a rate of 5 nl every 5 s. For retrograde tracing, AAV(1)-CMV-FRTed-TVAmCherry-2A-Gly (500 nl, titre: 3.9 × 10¹² genomic copies per ml) was injected at day 0, followed by injection of virus DG-rabies-GFP(EnvA) (500 nl, titre: 4.3 × 10⁸ infectious units per ml) at day 21 through the same craniotomy. Mice were perfused one week after injection of rabies virus. Brain tissue was processed as described in ‘Histology’. For selective long-term labelling of Pitx2^ON-PRE neurons in the SC, we performed an initial injection with a mix of AAV(2)-hSyn1-FLEX-nucHA-2A-TVA-2A-G(N2c) (500 nl, titre: 1.5 × 10¹² genomic copies per ml) and either AAV(1)-pAAV-nEF-Cre^OFF/Flp^ON-ChR2(ET/TC)-EYFP (500 nl, Addgene #137141, titre: 4.3 × 10¹² genomic copies per ml) or AAV(1)-Ef1a-fDIO-GCaMP6f (500 nl, Addgene #128315, titre: 5 × 10¹² genomic copies per ml). Following 3 weeks of expression, a second injection with a self-inactivating rabies virus^35,36,37 SiR-N2c-Flp (EnvA) (500 nl, titre: 1 × 10⁷ genomic copies per ml) was performed before implanting an optetrode or a cranial window for chronic Ca²⁺ imaging (see below). In a subset of experiments AAV(9)-EF1a-double floxed-hChR2(H134R)-mCherry-WPRE-HGHpA (800 nl, Addgene #20297, titre: 5 × 10¹² genomic copies per ml) was injected in the SC followed by optetrode implant. For Ca²⁺ experiments injections were performed at 3 different locations surrounding the area to be imaged, namely: 3.2, 3.7 and 4.2 mm posterior from Bregma, 0.8, 1.5 and 0.8 mm lateral of the midline and 1.3, 1.5 and 1.2 mm ventral to the brain surface. 
For a subset of experiments, the right eye of the mouse was injected with an AAV(2)-CAG-ChR2(H248R)-mCherry (1.5 μl, titre: 3.9 × 10¹² genomic copies per ml). A drop of 1% tropicamide and another of 2.5% phenylephrine hydrochloride were applied on the eye before injecting up to 3 μl of the virus with a Hamilton syringe. Following viral injection, a drop of 0.5% proxymetacaine and another of 0.5% chloramphenicol were applied to the injected eye. Acute brain slices were prepared 4 weeks post injection.

For all in vivo recordings, the skin covering the left hemisphere was removed, a craniotomy covering 3.5–3.8 mm AP and 0.8–1.2 mm ML was performed and a custom-made head plate was cemented (Super-Bond C & B; Prestige Dental) around the craniotomy. Care was taken to avoid bleeding or drying of the meninges and brain tissue. For whole-cell recordings in anaesthetized mice, saline (0.9% NaCl) was superfused constantly with a 2 ml min⁻¹ laminar flow using a peristaltic pump and body temperature was maintained at 36 °C using a low-noise heating pad (FHC, Termobit).

For tetrode recordings, mice were implanted with moveable 17-μm-diameter platinum-iridium (H-ML insulated) microelectrodes (California Fine Wire), configured as four tetrodes and carried by 16-channel microdrives (Axona) and with a custom-made head plate. Tetrodes were platinum electroplated to an impedance of 100–250 kΩ using a Kohlrausch:gelatin (9:1, 0.5% gelatin) solution. Electrodes were implanted at the surface of the SC: at coordinates 3.8–4.2 mm posterior from Bregma, 1.25 mm lateral of the midline and 1.2 mm ventral to the brain surface. All mice were given at least one week to recover before recording and food deprivation. The same protocol was used for optetrode surgery. For in vivo chronic selective Ca²⁺ imaging of Pitx2^ON-PRE neurons in the SC, mice underwent an initial viral surgical injection with an AAV-TVA and an AAV-GCaMP6f as described above and were injected with Dexafort at 2 μg g⁻¹ the day prior to surgery. A head post and a cranial window over the SC were implanted as previously described¹⁷. In brief, following isoflurane anaesthesia, Vetergesic was injected subcutaneously at 0.1 mg kg⁻¹ and a metal head post was affixed to the skull with Crown & Bridge Metabond. Epivicaine was splashed over the skull, and a 3-mm-diameter craniotomy was performed on the left hemisphere, centred on the rostral SC. The surface of the SC was exposed through removal of the overlying cortex and the SiR was then injected as described above. A 3-mm cannular window was then fixed on top of the colliculus using dental cement (Crown & Bridge Metabond).

Histology

Once tetrodes were estimated to have passed beyond the SC, mice were anaesthetized with Euthatal (0.2 ml) and transcardially perfused with 4% formaldehyde in phosphate buffered saline (PBS). Brains were stored in the fixative and then in 30% w/v sucrose solution for 24–48 h in order to cryoprotect the tissue. Brains were subsequently embedded in O.C.T. (VWR), frozen to ~−20 °C and cut in 30 μm coronal sections using a CM1950 cryostat (Leica). Nissl staining was used to determine tetrode depth as previously described³⁸.

For immunohistochemistry (IHC) experiments, 40 μm cryo-sections were cut. Free-floating sections were rinsed in PBS and incubated in blocking solution (1% donkey serum and 0.3% Triton X-100 in PBS) containing primary antibodies for 24 h at 4 °C. Sections were washed with PBS four times at room temperature and incubated for 24 h at 4 °C in blocking solution with secondary antibodies. Immunolabelled sections were washed four times with PBS at room temperature and mounted on glass slides (SuperFrost Plus, Thermo Scientific) using DAPI Fluoromount-G (SouthernBiotech). Biocytin-filled neurons were manually traced and aligned across sections to obtain the final reconstruction. Primary antibodies used in this study were: chicken anti-GFP (Aves Labs, GFP-1020, 1:2,000) and rabbit anti-RFP (Rockland, 600-401-379, 1:2,000). Secondary antibodies used were Alexa Fluor 488 donkey anti-chicken (Jackson ImmunoResearch, 703-545-155, 1:1,000), Cy3 donkey anti-rabbit (Jackson ImmunoResearch, 711-165-152, 1:1,000) and Alexa Fluor 488-conjugated streptavidin (Invitrogen, 1:2,000). Images were acquired using a Zeiss 780 confocal microscope with a 20×/0.8 NA air lens (Carl Zeiss).

Retinas were dissected from eyecups and incubated in 4% formaldehyde for 24 h. Following several washes in PBS, full retinas were mounted for imaging.

Electrophysiology

Whole-cell

For whole-cell recordings, coronal slices (350 μm) containing the SC were prepared using a vibrating microtome (7000smz-2, Campden Instruments) in ice-cold sucrose-based cutting solution oxygenated with carbogen gas (95% O₂, 5% CO₂) and with the following composition (in mM): KCl 3, NaH₂PO₄ 1.25, MgSO₄ 2, MgCl₂ 1, CaCl₂ 1, NaHCO₃ 26.4, glucose 10, sucrose 206, ascorbic acid 0.40, kynurenic acid 1. Slices were incubated at 37 °C for 30 min in a submerged-style holding chamber with oxygenated artificial cerebrospinal fluid (aCSF; in mM: NaCl 126, KCl 3, NaH₂PO₄ 1.25, MgSO₄ 2, CaCl₂ 2, NaHCO₃ 26.4, glucose 10) with an osmolarity adjusted to 280–300 mOsm l⁻¹ and stored thereafter in the same holding chamber at room temperature for at least a further 30 min. Slices were individually transferred to the recording chamber and were superfused with oxygenated aCSF at room temperature at a flow-rate of approximately 2 ml min⁻¹. To block GABAergic receptors, CGP (52431, 10 μM, Tocris Bioscience) and gabazine (SR95531, 10 μM, Tocris Bioscience) were diluted into the superfusate.

Whole-cell current-clamp recordings were obtained from collicular neurons using 5–8 MΩ pipettes pulled from borosilicate glass capillaries (1.5 mm outer diameter × 0.86 mm inner diameter). Pipettes were filled with artificial intracellular solution containing (in mM): potassium gluconate 150, HEPES 10, NaCl 4, magnesium ATP 4, sodium GTP 0.3 and EGTA 0.2, 0.4% biocytin; adjusted to pH 7.2 and osmolarity 270–290 mOsm l⁻¹. Data were recorded using an Axon Multiclamp 700B amplifier (Molecular Devices) and signals were low-pass filtered at 2 kHz and acquired at 5 kHz using a digitizer (Axon Digidata 1550A, Molecular Devices) on a PC running pClamp. Light-evoked responses were elicited using a 450–490 nm LED light (pE-300 coolLED system, Scientifica) through a 40× water immersion objective (0.8 NA).

Whole-cell in vivo recordings were performed under isoflurane anaesthesia as previously described⁵⁶. In brief, the recording pipette was placed perpendicularly to the brain surface and a reference silver pellet electrode (A-M Systems) was placed in the saline bath covering the craniotomy. High positive pressure (>500 mbar) was applied before lowering the pipette to the surface of the brain. A 5-ms-long square pulse of voltage of 4 to 8 mV at 100 Hz was delivered via the recording electrode. The pipette was quickly advanced ~1.2 mm to reach the surface of the SC. The pressure was lowered to 40–60 mbar and the pipette was advanced in 2-μm steps. Cell contact produced a small reduction (around 10%) in resistance of the pipette, which could be seen as a proportional decrease in the size of the step of current in the oscilloscope.

Tetrodes

Single-unit recording was carried out using a multi-channel DacqUSB recording system (Axona) as previously described³⁸. The microdrive and head stage were attached to the pre-amplifier via a lightweight cable. Signals were amplified 12,000–20,000 times and bandpass filtered between 500 Hz and 7 kHz. Recording thresholds were set to 70% above baseline activity levels, and data from spikes above the threshold from all channels were collected across a period spanning 200 µs preceding and 800 µs following the peak amplitude of a spike. The activity of channels from any given tetrode was referenced against the activity of a single channel from another tetrode to increase the signal-to-noise ratio. Tetrodes were advanced ventrally into the brain by ~50–75 μm after each recording session. The inertial sensor was attached to the head stage on the head of the mice using Mill-Max connectors. The signal from the sensor was passed through a lightweight cable to an Arduino that processed the signal and computed the direction cosine matrix algorithm. The control Arduino was connected to the DacqUSB system using the system’s digital input–output port. A custom BASIC script was written in DacqUSB to synchronize the start of single-unit recording with the key-press initiation of inertial sensor recording (controlled using the Processing software sketchbook; https://processing.org) or visual stimulation. During head-restrained recordings, a TTL pulse was sent to an infrared LED to align eye tracking and electrophysiology.

Optetrodes

For simultaneous single-unit recording and optogenetic stimulation, a single optic fibre (core = 100 μm, NA = 0.22; Doric Lenses) was inserted between each bundle of tetrodes during production as previously described⁹. Light was delivered using a 473 nm laser diode module (Cobolt 06-MLD, Cobolt) coupled to a 100-μm multimode fibre (NA = 0.22) through a Schäfter + Kirchhoff fibre coupler (Cobolt). The laser power employed in all stimulation experiments was 3–5 mW at the tip of the fibre. For optotagging, mice were probed in an open-field arena. Following 10 min of acclimatization, bursts of 30× 5-ms-long pulses at 30 Hz were delivered with a 9-s rest period in between bursts for a total of 900 stimulations over 5 min. Blue light-activated units were defined on the basis of the latency of the response to a pulse of light within a time window⁵⁷ of 5 ms. In a subset of experiments, Vgat-cre mice expressing ChR2 were recorded in head-restrained conditions to assess the change in tuning to moving gratings following light stimulation of collicular Vgat^ON neurons. The same implants and setup were used in those experiments, although continuous 10-ms-long pulses at 30 Hz were delivered during visual stimulation in light-ON trials instead.

Ca²⁺ imaging

Mice were imaged from one week after surgery with a two-photon microscope (Bergamo II, Thorlabs), equipped with a 16× 0.8 NA objective (Nikon). Mice were recorded awake and head-fixed on a custom-made floating platform. tdTomato-positive Pitx2^ON neurons and SiR-infected cells were excited with a Ti:Sapphire laser at 1,030 nm and 920 nm, respectively, with a power of around 20 mW (Mai Tai DeepSee, Spectra Physics). Red and green emitted fluorescence were collected through 607 ± 35 nm and 525 ± 25 nm filters, respectively (Brightline). For imaging of neuronal responses, recordings consisted of either multiple planes at different depths imaged quasi-simultaneously using a piezo device, or a single plane. The pixel resolution was kept at around 0.8 µm per pixel, while the number of pixels, field of view and imaging rate were adjusted to cover the labelled cells. On average this resulted in imaging frame rates of 30 to 60 Hz and pixel dwell times of 0.2 to 0.08 µs.

Two-photon recordings were then registered and ROIs were determined manually and extracted using CaImAn⁵⁸ (Flatiron Institute) in Python. Changes in fluorescence relative to baseline (ΔF/F) were computed and used for further analysis.
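As an illustration of the ΔF/F step, the following minimal Python sketch computes (F − F0)/F0 for a single ROI trace. The function name `delta_f_over_f` and the choice of F0 as the median over a caller-specified baseline window are assumptions made for this example; the text above does not state how the baseline was defined.

```python
from statistics import median


def delta_f_over_f(trace, baseline_frames):
    """Return (F - F0) / F0 for one ROI fluorescence trace.

    F0 is the median fluorescence over `baseline_frames`
    (an assumed baseline definition, used here for illustration only).
    """
    f0 = median(trace[i] for i in baseline_frames)
    if f0 == 0:
        raise ValueError("baseline fluorescence is zero")
    return [(f - f0) / f0 for f in trace]


# Synthetic trace: four baseline frames followed by a transient.
trace = [100.0, 102.0, 98.0, 100.0, 150.0, 200.0, 120.0]
dff = delta_f_over_f(trace, range(4))
print(dff)
```

In this toy trace the baseline median is 100, so the peak frame corresponds to a ΔF/F of 1 (a 100% increase over baseline).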

Behaviour

Training procedure

Before recording, mice were acclimatized to being handled for two days and to carrying the head stage and being head-fixed for three more days. For all head-fixed recordings, mice were positioned standing over a wheel. During the acclimation phase on the wheel, mice were able to run freely at all times but the duration of restraint was gradually increased from 5–10 min on the first day to up to 30 min on the third day.

Visual stimulation

Visual stimuli were generated using a customized version of the Python PsychoPy toolbox, presented on an LCD monitor (Dell P2414H; mean luminance 35–45 cd m⁻²) positioned 20 cm from the right eye of the mouse, spanning 31° down, 42° up, 45° nasal and 59° temporal. The screen was gamma-corrected and refreshed at 60 Hz. Electrophysiological recordings were aligned to the visual stimulus using a photodiode placed at the bottom right corner of the stimulus monitor and covered so as not to elicit a visual response in the mouse. The signals from the photodiode and the accelerometer on the wheel were time stamped and recorded using an Arduino.

Static receptive field position was estimated by 750-ms-long flashes of uniform black or white 9 cm² squares on a grey background. The screen was divided in 165 locations covering 73° by 104° (corresponding to 30 × 53 cm). To assess the receptive field the full protocol was repeated three times.

Full screen sinusoidal gratings of 12 different directions (30° steps) were used to determine direction and orientation selectivity. Each grating (spatial frequency: 0.08 cycles per degree) would first remain static for 1 s, then move at 2.83 Hz for 2 s and stop for a further 1 s before changing direction. Gratings were displayed three times in a semi-randomized manner. This protocol was repeated two more times for a total of nine presentations per stimulus over the three trials.

For a subset of mice, two extra visual stimulation paradigms were used: a directional moving spot and moving Gabor patches. A small black spot (1.3 cm diameter corresponding to ~3.7°) moving at 30 cm s⁻¹ on a grey background was used in order to assess the direction selectivity of neurons to a stimulus mimicking a moving prey. The stimulation paradigm consisted of a small black spot moving towards the centre of the screen (roughly aligned to the centre of the visual field) for 1 s, then staying static in the centre of the screen for 0.5 s before retracting in the opposite direction at the same speed as the approach. Eight starting points were used: all four corners of the screen and all midpoints between two corners of the screen. Each starting location was presented three times per trial in a semi-randomized manner. Similar to the other visual stimulation paradigms, we recorded three trials for a total of nine presentations per movement direction.

In order to assess the spatial receptive field of neurons to moving stimuli, we used moving Gabor patches at 24 different locations (6 along the x axis and 4 along y axis) moving in 8 different directions (45° steps) for 1.5 s (spatial frequency: 0.08 cycles per degree and 2.83 Hz temporal frequency). All directions and locations were randomly presented 3 times for a total of 576 presentations corresponding to 3 times 8 directions at 24 locations.

Before all visual stimulation trials, ‘spontaneous’ firing rates were estimated over a black screen (15 s) followed by a grey screen of the same average luminance as the grating presentation (15 s). After each trial the mouse was shown first the grey screen (15 s) and then a black screen (15 s). Average activity during grey screen presentation was used as baseline and the transitions were used to compare responses to luminance.

For in vivo whole-cell recordings, visual presentation was done using a Dell E176FP LCD screen. Full screen gratings (12 directions) were presented a total of at least 3 and up to 9 times. Flashing squares (covering 100 locations) were presented 0 to 3 times, depending on the length and stability of recording.

Open-field foraging

Single units were recorded as mice foraged for droplets of 30% diluted condensed milk on a white Perspex arena (50 × 50 cm) to limit variability on visual input. Recording sessions consisted of 4× 5-min foraging trials, with the first and last occurring in light conditions and the second and third occurring in complete darkness. During dark trials all other sources of light within the experimental room such as computer screens were switched off or covered with red screens. For a subset of mice, open-field recordings were also performed in an open-field arena covered with either vertical or horizontal 3-cm-wide black and white stripes. Those recordings were performed in light conditions to enforce orientation-specific self-generated visual flow.

Eye tracking

For eye tracking in head-restrained conditions we used a camera (DMK 21BU04.H, The Imaging Source) with a zoom lens (MVL7000, ThorLabs) focused on the right eye. The eye was illuminated with an infrared LED lamp (LIU850A, ThorLabs) and an infrared filter was used on the camera (FEL0750, ThorLabs; with adapters SM2A53, SM2A6 and SM1L03, ThorLabs). When fully zoomed and placed ~20 cm from the mouse, this setup provided ~73 pixels per mm. The video was acquired using DacqUSB and synchronized to the electrophysiological recording using a small flashing infrared LED linked to the bottom edge of the camera.

In order to measure eye movements in freely moving mice we used a custom head-mounted eye and head tracking system, as previously described^40,59. In brief, we used a commercially available camera module (1937, Adafruit; infrared filter removed). A custom 3D printed camera holder with a 21 G cannula (Coopers Needle Works) was used to hold the camera, IR LEDs (VSMB2943GX01, Vishay) and a 7.0 mm × 9.3 mm IR mirror (Calflex-X NIR-Blocking Filter, Optics Balzers). A connector (852-10-00810-001101, Preci-Dip) was used to attach the camera holder to the head plate of the mice. Mice were head-fixed, and the mirror’s position was adjusted until the eye was in the centre of the eye camera. Epoxy (Araldite Rapid, Araldite) was used to fix the mirror position. A single-board computer (Raspberry Pi 3 model B, Raspberry Pi Foundation) recorded camera data at 30 Hz, capturing images of 1,296 × 972 pixels per frame for eye camera. The head roll, pitch, and yaw were estimated using an inertial motion unit including an accelerometer, gyroscope and magnetometer using previously described methods³⁸ and open source Arduino code (https://github.com/razor-AHRS) using an Arduino Mega 2560 rev 3.

Quantification and statistical analysis

Whole-cell in vitro and in vivo electrophysiology

For optogenetic stimulation in acute brain slices, light was adjusted to elicit ~5 mV postsynaptic potentials following 5-ms-long pulses. In order to calculate the latency of light-evoked responses a linear fit was made between time points corresponding to 25–30% and 70–75% of peak amplitude of the excitatory postsynaptic potential (EPSP), or of the first slope in the case of polysynaptic EPSPs. The latency was measured as the time elapsed between light onset and the point of crossing between the linear fit on the EPSP slope and the resting membrane potential level. To determine the latency of response to visual stimuli recorded in whole-cell mode in vivo we first calculated the differential of the membrane potential during baseline (grey screen) to assess the s.d. of baseline presynaptic activity. For each repetition at the preferred direction or orientation of tuning we determined the time of the first event with a slope greater than 3 × s.d. of baseline presynaptic activity and of at least 0.5 mV ms⁻¹. Recordings with a s.d. of baseline presynaptic activity >1.5 mV were discarded.
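The latency measurement for light-evoked EPSPs can be sketched as follows. This is an illustrative Python reading of the procedure, not the analysis code used in the study: the function name `epsp_latency` is hypothetical, and the sketch fits a least-squares line through all rising-phase samples between 25% and 75% of peak amplitude (an assumed simplification of the 25–30% and 70–75% anchor points), then extrapolates that line back to the resting potential.

```python
import math


def epsp_latency(t, v, light_onset, rest):
    """Latency (ms) from light onset to the point where a line fitted to the
    EPSP rising phase crosses the resting membrane potential."""
    peak_i = max(range(len(v)), key=lambda i: v[i])
    amp = v[peak_i] - rest
    lo, hi = rest + 0.25 * amp, rest + 0.75 * amp
    # Rising-phase samples between 25% and 75% of peak amplitude.
    idx = [i for i in range(peak_i + 1)
           if t[i] >= light_onset and lo <= v[i] <= hi]
    if len(idx) < 2:
        raise ValueError("not enough samples on the rising phase")
    n = len(idx)
    mt = sum(t[i] for i in idx) / n
    mv = sum(v[i] for i in idx) / n
    slope = (sum((t[i] - mt) * (v[i] - mv) for i in idx)
             / sum((t[i] - mt) ** 2 for i in idx))
    t_cross = mt + (rest - mv) / slope  # where the fitted line reaches rest
    return t_cross - light_onset


# Synthetic EPSP: rest at -60 mV, light at 1 ms, linear rise from 3 to 5 ms.
rest, onset = -60.0, 1.0
t = [i / 10 for i in range(100)]  # ms, 0.1 ms steps
v = [rest if ti < 3 else
     rest + 2.5 * (ti - 3) if ti <= 5 else
     rest + 5 * math.exp(-(ti - 5) / 10) for ti in t]
latency = epsp_latency(t, v, onset, rest)
print(round(latency, 3))
```

Because the synthetic rising phase is exactly linear and starts 2 ms after light onset, the extrapolated latency recovers that delay.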

Spike sorting for tetrode recordings

The electrophysiological data were spike sorted using Tint cluster cutting software (Axona). Cluster cutting was carried out by hand as clusters were generally well separated. Clusters were included in analysis if they were stable across all trials throughout the day and did not belong to clusters identified in previous recording days.

3D head-rotation recording

To determine the precise head rotation of the mouse during tetrode recordings we employed a sensor (50 Hz sampling frequency) equipped with accelerometers, gyroscopes, and magnetometers as previously described³⁸. In brief, the sensor outputs were fed to a direction cosine matrix algorithm to provide measurements of head orientation expressed in Euler angles with respect to the Earth reference frame (yaw, pitch and roll). The rotation matrix is:

$${R}_{xyz}=\left(\begin{array}{ccc}\cos \theta \cos \psi & \cos \theta \sin \psi & -\sin \theta \\ \sin \phi \sin \theta \cos \psi -\cos \phi \sin \psi & \sin \phi \sin \theta \sin \psi +\cos \phi \cos \psi & \sin \phi \cos \theta \\ \cos \phi \sin \theta \cos \psi +\sin \phi \sin \psi & \cos \phi \sin \theta \sin \psi -\sin \phi \cos \psi & \cos \phi \cos \theta \end{array}\right)$$

(1)

From which one can extract the Euler angles as:

$$\begin{array}{c}\phi =\operatorname{atan2}({R}_{23},{R}_{33})\\ \theta =-\arcsin ({R}_{13})\\ \psi =\operatorname{atan2}({R}_{12},{R}_{11})\end{array}$$

(2)

The primary source of the Euler angles is the gyroscope measurements, which are expressed as angular velocity

$$\omega =\left(\begin{array}{c}{\omega }_{x}\\ {\omega }_{y}\\ {\omega }_{z}\end{array}\right)=\left(\begin{array}{c}\frac{\partial \phi }{\partial t}\\ \frac{\partial \theta }{\partial t}\\ \frac{\partial \psi }{\partial t}\end{array}\right)$$

(3)

The main equation used to update the rotation matrix over time from gyroscope signals is:

$${R}^{T}\left(t+{\rm{d}}t\right)={R}^{T}(t)\left(\begin{array}{ccc}1 & -{\omega }_{z}{\rm{d}}t & {\omega }_{y}{\rm{d}}t\\ {\omega }_{z}{\rm{d}}t & 1 & -{\omega }_{x}{\rm{d}}t\\ {-\omega }_{y}{\rm{d}}t & {\omega }_{x}{\rm{d}}t & 1\end{array}\right)$$

All drift corrections and calibrations were performed as previously described³⁸.
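The gyroscope update and the Euler-angle extraction of equation (2) can be illustrated with the short Python sketch below. It is a minimal example under stated assumptions: the helper names (`dcm_step`, `euler_from_r`) are hypothetical, the first-order update is applied without the renormalization, drift correction or calibration steps referred to above, and the time step of 0.02 s simply mirrors the 50 Hz sensor sampling rate.

```python
import math


def matmul(a, b):
    """3 x 3 matrix product."""
    return [[sum(a[i][k] * b[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]


def transpose(m):
    return [list(row) for row in zip(*m)]


def dcm_step(rt, omega, dt):
    """One first-order update of R^T from gyroscope rates (rad/s)."""
    wx, wy, wz = omega
    m = [[1.0, -wz * dt, wy * dt],
         [wz * dt, 1.0, -wx * dt],
         [-wy * dt, wx * dt, 1.0]]
    return matmul(rt, m)


def euler_from_r(r):
    """Roll (phi), pitch (theta), yaw (psi) from R, following equation (2)."""
    phi = math.atan2(r[1][2], r[2][2])
    theta = -math.asin(max(-1.0, min(1.0, r[0][2])))  # clamp for safety
    psi = math.atan2(r[0][1], r[0][0])
    return phi, theta, psi


# Pure yaw at 0.5 rad/s for 1 s, sampled at 50 Hz (dt = 0.02 s).
rt = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
for _ in range(50):
    rt = dcm_step(rt, (0.0, 0.0, 0.5), 0.02)
phi, theta, psi = euler_from_r(transpose(rt))
print(phi, theta, psi)
```

For this pure-yaw input the recovered yaw is close to 0.5 rad while roll and pitch stay at zero; the small residual error comes from the first-order approximation.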

Motion tuning

The motion tuning of SC neurons was determined by carrying out STAs of head displacements. To compute the STA of motion, the angular head velocity for the 25 temporal bins (0.5 s) preceding and 50 bins (1 s) following the onset of spike were averaged for all spikes and for each of the three Eulerian components. The direction of the head at the onset of each spike was normalized to zero for each Eulerian component. For each spike, the calculated angular head velocities were cumulatively summated for each temporal bin to produce a head displacement for the 0.5 s preceding and 1 s following the onset of spike. The mean and s.e.m. of spike related head displacements were calculated for each temporal bin to illustrate the tuning of neurons. Displacement vectors for each Eulerian component were calculated as the difference between the minimum and maximum of the computed average displacement. The direction of the displacement vector was defined according to the temporal order of the minimum and maximum values of the computed displacement. A neuron was considered to be tuned to either light or dark if the average displacement vector for at least one of the Eulerian components was >5°, with the same direction and ranking >95% compared to a shuffled distribution for both light trials or dark trials, respectively. When angular difference was reported this referred to the magnitude in degrees of the average displacement angle a neuron was tuned to in a specific condition (for example, light condition) minus the displacement angle at another (for example, horizontal stripes).
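The spike-triggered displacement computation for one Eulerian component can be sketched in Python as follows. This is an illustrative reading of the description above rather than the original analysis code: the function names are hypothetical, spikes too close to the recording edges are skipped (an assumption), and the shuffle-based significance test is omitted.

```python
def spike_triggered_displacement(velocity, spike_idx, dt=0.02, pre=25, post=50):
    """Mean head displacement (deg) around spikes for one Euler component.

    `velocity` is angular head velocity (deg/s) sampled every `dt` seconds.
    """
    curves = []
    for s in spike_idx:
        if s - pre < 0 or s + post > len(velocity):
            continue  # skip spikes too close to the recording edges
        heading = [0.0]
        for w in velocity[s - pre:s + post]:
            heading.append(heading[-1] + w * dt)  # cumulative sum of velocity
        zero = heading[pre]  # heading at spike onset is normalized to zero
        curves.append([h - zero for h in heading])
    if not curves:
        raise ValueError("no usable spikes")
    n = len(curves)
    return [sum(c[k] for c in curves) / n for k in range(pre + post + 1)]


def displacement_vector(mean_curve):
    """Signed max-min displacement; positive if the minimum comes first."""
    i_min = mean_curve.index(min(mean_curve))
    i_max = mean_curve.index(max(mean_curve))
    magnitude = mean_curve[i_max] - mean_curve[i_min]
    return magnitude if i_min < i_max else -magnitude


# Synthetic data: a 10 deg/s turn lasting 0.5 s follows each spike.
velocity = [0.0] * 400
spikes = [100, 250]
for s in spikes:
    for k in range(s, s + 25):
        velocity[k] = 10.0
mean_curve = spike_triggered_displacement(velocity, spikes)
vec = displacement_vector(mean_curve)
print(round(vec, 3))
```

In this synthetic case the unit would be assigned a positive displacement of about 5°, which sits exactly at the >5° tuning threshold and would therefore not pass it.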

Visual tuning

Analysis routines for visual tuning were developed in Igor Pro (WaveMetrics). The neuronal response to drifting sinusoidal gratings was averaged for all trials and normalized to the baseline firing rate. The selectivity was then calculated both in direction and orientation space (360° and 180°, respectively) by computing the mean orientation and direction vectors in polar coordinates, described by their modulus (corresponding to the selectivity index) and average angle.

$$\begin{array}{c}\mathrm{SI}=\left|\frac{\sum_{k}R(\theta_{k})\,e^{2i\theta_{k}}}{\sum_{k}R(\theta_{k})}\right|\\ \bar{\theta}=\arctan\left(\frac{\sum_{k}R(\theta_{k})\sin\theta_{k}}{\sum_{k}R(\theta_{k})\cos\theta_{k}}\right)\\ \mathrm{for}\ \cos\theta_{k}<0,\quad \bar{\theta}=\arctan\left(\frac{\sum_{k}R(\theta_{k})\sin\theta_{k}}{\sum_{k}R(\theta_{k})\cos\theta_{k}}\right)+\pi\\ \mathrm{for}\ \sin\theta_{k}<0\ \mathrm{and}\ \cos\theta_{k}>0,\quad \bar{\theta}=\arctan\left(\frac{\sum_{k}R(\theta_{k})\sin\theta_{k}}{\sum_{k}R(\theta_{k})\cos\theta_{k}}\right)+2\pi\end{array}$$

Where R(θ_k) is the response at each sampled direction or orientation θ_k (12 for direction space and 6 for orientation space). Static gratings’ response was averaged before and after drifting and probed only in orientation space. Given that the modulus of the vectors is dependent on the firing pattern and firing rate of the neuron, the same calculation was performed for shuffled spike times to obtain a probability distribution of shuffled DSI and SI for OS neurons (OSI) to probe for both direction selectivity and orientation selectivity (see ‘Generation of shuffled datasets’). To determine whether a neuron was tuned to moving or static gratings 3 parameters were used: the selectivity index (≥0.1), the trial-to-trial angular variance (≤0.8; used as a measure of the reliability of selectivity) and the significance of the SI compared to the shuffled distribution. If two out of the three criteria were fulfilled the neuron was considered tuned. If the criteria were met for both direction and orientation spaces, a neuron was classified as DS if DSI > OSI, and OS if OSI > DSI.
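A minimal Python sketch of the vector-based selectivity calculation and the two-out-of-three tuning rule is given below. Several points are assumptions made for illustration: the function names are hypothetical, the `harmonic` argument selects orientation space (2, matching the exponent in the formula above) or direction space (1, a common convention that is not spelled out in the text), the preferred angle is obtained with a two-argument arctangent, which is equivalent to the quadrant corrections listed above, and the significance level `alpha` is a placeholder because no threshold is stated.

```python
import cmath
import math


def selectivity(responses, angles_deg, harmonic):
    """Selectivity index and preferred angle (deg) from the mean response vector.

    harmonic = 1 for direction space (360 deg), 2 for orientation space (180 deg).
    """
    total = sum(responses)
    vec = sum(r * cmath.exp(1j * harmonic * math.radians(a))
              for r, a in zip(responses, angles_deg))
    si = abs(vec) / total
    pref = (math.degrees(cmath.phase(vec)) / harmonic) % (360 / harmonic)
    return si, pref


def is_tuned(si, angular_variance, p_shuffle, alpha=0.05):
    """Tuned if at least two of the three criteria are met.

    `alpha` is an assumed significance level for the shuffle comparison.
    """
    passed = [si >= 0.1, angular_variance <= 0.8, p_shuffle < alpha]
    return sum(passed) >= 2


# Twelve directions in 30 deg steps; the unit responds most strongly at 90 deg.
angles = list(range(0, 360, 30))
rates = [5.0 if a == 90 else 1.0 for a in angles]
dsi, pref_dir = selectivity(rates, angles, harmonic=1)
osi, pref_ori = selectivity(rates, angles, harmonic=2)
print(round(dsi, 3), round(pref_dir, 1), round(osi, 3), round(pref_ori, 1))
```

The same function could be applied to shuffled spike times to build the null distribution of DSI and OSI values against which the measured index is ranked.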

Direction and orientation selectivity for small moving spots and for Gabor patches was measured as described for gratings. Two separate analyses of selectivity and preferred angle of tuning were performed for Gabor patches: a location-independent analysis by averaging the responses to each direction of movement across all locations and another analysis only considering the responses at the location of maximal average activity.

To determine the spatial tuning of neurons to static stimuli, we averaged the response (firing rate) to flashing 9 cm² squares at each location on the screen (15 by 11 locations) for each colour separately (black or white) and divided it by the response during baseline conditions, obtaining a 2D matrix corresponding to the increase or decrease of firing rate over baseline per location. This matrix was further transformed into z scores. The maximal z score was compared to those obtained by performing the same analysis on 1,000 shuffled trials (see ‘Generation of shuffled datasets’). If the maximal z score was >2 and ranked higher than the top 5% of those obtained from the shuffled distribution, a 2D Gaussian was fitted to the matrix. The centre of the Gaussian fit was used to determine the centre of the ssRF. The same analysis was performed to assess whether neurons had a spatial and kinetic receptive field as measured with Gabor patches. The centre of the location of maximal response was used as the centre of the receptive field. The overlap of the tunings of individual neurons to different visual stimuli was computed using Intervene⁶⁰.
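The acceptance test for a static spatial receptive field can be sketched as follows. This is an illustrative reading of the criteria, not the original analysis code: the z scores are assumed to be computed across the locations of the map, and the ranking criterion is interpreted as fewer than 5% of the shuffled maxima reaching the observed maximum. The function names are hypothetical, and the Gaussian fit itself is omitted.

```python
import statistics

def zscore_map(ratio_map):
    """Z-score a 2D map of response/baseline ratios across all locations."""
    flat = [v for row in ratio_map for v in row]
    mu = statistics.mean(flat)
    sd = statistics.pstdev(flat)
    if sd == 0:
        return [[0.0 for _ in row] for row in ratio_map]
    return [[(v - mu) / sd for v in row] for row in ratio_map]

def has_static_rf(ratio_map, shuffled_max_z, z_min=2.0, top_fraction=0.05):
    """True if the peak z score exceeds z_min and fewer than top_fraction
    of the shuffled peak z scores are at least as large (assumed reading)."""
    z_max = max(max(row) for row in zscore_map(ratio_map))
    n_at_least = sum(1 for s in shuffled_max_z if s >= z_max)
    return z_max > z_min and n_at_least / len(shuffled_max_z) < top_fraction
```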

To investigate the modulation of the tuning to gratings by locomotion, trials for each angle presentation (4 s total) were divided based on whether the mouse was running or not. To identify running bouts we low-pass filtered the angular velocity of the wheel and used a threshold of 20 deg s⁻¹. We averaged the angular velocity of the wheel on each trial of visual stimulation and classified as ‘run’ trials those in which the average running speed exceeded 20 deg s⁻¹.

Eye tracking in head-restrained mice

We used DeepLabCut⁶¹ to extract the pupil position from the eye videos. Four cardinal pupil points located at the top, bottom, left and right extremities of the pupil were tracked (Extended Data Fig. 4h). The pupil position was then computed as the centre of mass of these four points. The pupil position varied across two axes: the horizontal nasal–temporal axis, and the vertical upwards–downwards axis.

The tuning of SC neurons to eye displacement was determined by carrying out STAs, following the same method as for head displacements. The angular amplitude of eye movements relative to the resting position was estimated as: α = atan(d/r), where d is the distance travelled by the pupil centre and r is the radius of the eye, approximated to a sphere. A neuron was considered to be tuned for eye movement if the average displacement vector for at least one of the movement components was >1°, and ranking >95% compared to a shuffled distribution.
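The conversion from pupil displacement to angular amplitude, and the tuning criterion, can be sketched as below. The function names and the representation of the shuffle ranking as a percentile are assumptions of this example. The eye radius must be supplied by the user in the same units as the displacement; no value is given in the text.

```python
import math

def eye_angle_deg(displacement, eye_radius):
    """Angular amplitude alpha = atan(d / r), returned in degrees."""
    return math.degrees(math.atan(displacement / eye_radius))

def is_eye_tuned(mean_displacements_deg, shuffle_percentiles,
                 min_deg=1.0, min_percentile=95.0):
    """True if at least one movement component exceeds 1 deg and ranks
    above the 95th percentile of its shuffled distribution."""
    return any(abs(d) > min_deg and p > min_percentile
               for d, p in zip(mean_displacements_deg, shuffle_percentiles))
```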

Extraction of pupil position in freely moving mice

Eye tracking was performed as previously described⁴⁰. In brief, we tracked the position of the pupil, defined as its centre, together with the nasal and temporal eye corners. The eye corners were used to automatically align the horizontal eye axis. Thirty to fifty randomly selected frames were labelled manually for each recording day. The labelled data were used to train a deep convolutional network via transfer learning using open source code⁶¹ (https://github.com/AlexEMG/DeepLabCut). The origin of the eye coordinate system was defined as the mid-point between the nasal and temporal eye corners. Pixel values in the 2D video plane were converted to angular eye positions using a model-based approach developed for the C57BL/6J mouse line used in this study⁶². Saccades were defined as rapid, high-velocity movements occurring in both eyes with a magnitude exceeding 350 deg s⁻¹.

Generation of shuffled datasets

For each cell, the spike-onset times were temporally shifted by 2–180 s in a wrap-around manner. This works to shift the relationship between the spike times and the recorded heading directions of the mice or the visual stimuli while maintaining the temporal relationship between spiking events. Once these data were shifted, analyses were carried out to determine the mean displacement vector or selectivity index of the temporally shifted data. This process was repeated 1,000 times so as to produce a random distribution.
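A minimal sketch of this wrap-around shuffling is given below. It is not the original code: the function names, the uniform draw of the shift and the fixed random seed are assumptions, and `statistic` stands for whichever analysis (mean displacement vector or selectivity index) is applied to the shifted spike times.

```python
import random

def circular_shift(spike_times, duration, shift):
    """Shift all spike times by `shift` seconds, wrapping around at `duration`."""
    return sorted((t + shift) % duration for t in spike_times)

def shuffled_statistics(spike_times, duration, statistic, n_shuffles=1000,
                        min_shift=2.0, max_shift=180.0, seed=0):
    """Apply `statistic` to n_shuffles circularly shifted copies of the train."""
    rng = random.Random(seed)
    return [statistic(circular_shift(spike_times, duration,
                                     rng.uniform(min_shift, max_shift)))
            for _ in range(n_shuffles)]

def percentile_rank(observed, null_values):
    """Percentage of null values strictly below the observed value."""
    below = sum(1 for v in null_values if v < observed)
    return 100.0 * below / len(null_values)
```

Because every spike receives the same shift, inter-spike intervals are preserved except at the single wrap-around point, which is the property the procedure relies on.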

Visuo-motor alignment

This analysis was performed on neurons that had a significant tuning to moving gratings and a significant tuning to head rotations. In order to determine whether any alignment existed between these 2 types of tuning we first modelled the 3D head-rotation tuning of a neuron as a 2D projection of a vector coming out of the eye of that mouse (corresponding to gaze) into the 2D plane of the screen in which the visual stimuli were shown. We considered the head position at time 0 equal to the position of the mouse head when head-restrained for visual stimuli (see ‘3D head rotations to 2D screen plane transformation model’). We then determined the weighted average of the newly computed motion vectors that make up the 2D trajectory. We weighted the motion vector at each time-point by the instantaneous velocity of the gaze movement. This analysis gave us a vector with an angle ${\bar{\theta }}_{{\rm{gaze}}}$ on the same plane as the gratings presentation and that could be directly compared to the angle of maximum selectivity ${\bar{\theta }}_{{\rm{gratings}}}$. We focussed on comparing the direction of these two vectors by subtraction: ${\bar{\theta }}_{{\rm{gaze}}}-{\bar{\theta }}_{{\rm{gratings}}}$.
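The velocity-weighted averaging and the angular comparison can be illustrated with the sketch below. It assumes the projected gaze trajectory is available as a list of 2D screen coordinates sampled at a fixed interval `dt`, and that each motion vector is weighted by its own speed; both the function names and these choices are assumptions of the example.

```python
import math

def weighted_mean_direction(points, dt):
    """Angle (degrees) of the speed-weighted sum of successive motion vectors."""
    sx = sy = 0.0
    for (xa, ya), (xb, yb) in zip(points, points[1:]):
        dx, dy = xb - xa, yb - ya
        speed = math.hypot(dx, dy) / dt  # instantaneous gaze speed
        sx += speed * dx
        sy += speed * dy
    return math.degrees(math.atan2(sy, sx)) % 360.0

def angle_difference(theta_gaze, theta_gratings):
    """Signed difference theta_gaze - theta_gratings wrapped to [-180, 180)."""
    return (theta_gaze - theta_gratings + 180.0) % 360.0 - 180.0
```

Wrapping the difference keeps, for example, 350° and 10° at a separation of 20° rather than 340°.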

Modelling

3D head rotations to 2D screen plane transformation model

In order to project a gaze vector from the eye of the mouse onto the plane of the screen in which the visual stimuli are shown we first computed the equation of the plane in the laboratory’s reference frame. l is a line from point 1 to point 2 in the screen plane and since it also lies within the xy plane (for the particular choice of points), we have (see Supplementary Fig. 1):

$$\begin{array}{c}{\boldsymbol{l}}\perp \widehat{z}\\ {\boldsymbol{l}}\,{\boldsymbol{\cdot }}\,\widehat{z}=\left(-{x}_{0},{y}_{0},0\right)\,{\boldsymbol{\cdot }}\,\left(0,0,1\right)=0\end{array}$$

Thus, the normal vector to the plane is given by:

$${{\boldsymbol{n}}}^{{\prime} }=\widehat{z\,}\times {\boldsymbol{l}}=\left|\begin{array}{ccc}\widehat{i} & \widehat{j} & \widehat{k}\\ 0 & 0 & 1\\ {-x}_{0} & {y}_{0} & 0\end{array}\right|=(-{y}_{0}\widehat{i},{-x}_{0}\,\widehat{j},0\widehat{k})$$

(4)

For simplicity we pick the plane normal vector as:

$${\boldsymbol{n}}=-{n}^{{\prime} }=(\,{y}_{0}\widehat{i},{x}_{0}\,\widehat{j},0\widehat{k})$$

(5)

Using point 1 coordinates, we can find the equation of the plane:

$$\begin{array}{c}{\bf{n}}\,{\boldsymbol{\cdot }}\,(x-{P}_{1}^{x},y-{P}_{1}^{y},z-{P}_{1}^{z})=0\\ (\,{y}_{0},{x}_{0},0)\,{\boldsymbol{\cdot }}\,(x-{x}_{0},y,z)=0\\ {y}_{0}\left(x-{x}_{0}\right)+{x}_{0}\,y=0\end{array}$$

(6)

Next, we determined the equation of a vector ${{\bf{r}}}_{e}^{H}$ coming out of the mouse eye in the mouse head’s reference frame:

$${{\bf{r}}}_{e}^{H}=\left(\begin{array}{c}{x}^{{\prime} }\\ {y}^{{\prime} }\\ {z}^{{\prime} }\end{array}\right)=\left(\begin{array}{c}{P}_{0}^{{x}^{{\prime} }}\\ {P}_{0}^{{y}^{{\prime} }}\\ {P}_{0}^{{z}^{{\prime} }}\end{array}\right)+{m}^{{\prime} }\left(\begin{array}{c}a\\ b\\ c\end{array}\right)$$

(7)

Where a, b and c are the elements of a vector that determines the direction of the gaze and ${P}_{0}^{{\prime} }$ is the initial location of the pupil in the head frame (see Supplementary Fig. 2).

Next, we need to find the coordinates of ${r}_{e}^{H}$ in the inertial reference frame once the head rotation has occurred:

$${r}_{e}^{I}=R{r}_{e}^{H}=\left(\begin{array}{c}{P}_{0}^{x}\\ {P}_{0}^{y}\\ {P}_{0}^{z}\end{array}\right)+m\left(\begin{array}{c}a\\ b\\ c\end{array}\right)=\left(\begin{array}{c}x\\ y\\ z\end{array}\right)$$

(8)

With x, y and z corresponding to the components of the eye vector in the inertial frame.

$$\begin{array}{c}x={P}_{0\,}^{x}+ma\\ y={P}_{0\,}^{y}+mb\\ z={P}_{0\,}^{z}+mc\end{array}$$

where $m\left(t\right)=R\left(t\right){m}^{{\prime} }$ and ${P}_{0}\left(t\right)=R\left(t\right){P}_{0\,}^{{\prime} }$.

Now, we need to find the intersection of ${r}_{e}^{I}$ with the plane defined in Equation (6) at times t and t + dt:

$${r}_{e}^{I}(t)={P}_{0}\left(t\right)+m(t)\left(\begin{array}{c}a\\ b\\ c\end{array}\right)$$

(9)

$${r}_{e}^{I}(t+{\rm{d}}t)={P}_{0}\left(t+{\rm{d}}t\right)+m(t+{\rm{d}}t)\left(\begin{array}{c}a\\ b\\ c\end{array}\right)$$

(10)

$$\begin{array}{l}{r}_{e}^{I}=\left(\begin{array}{ccc}\cos \theta \cos \psi & \cos \theta \sin \psi & -\sin \theta \\ \sin \phi \sin \theta \cos \psi -\cos \phi \sin \psi & \sin \phi \sin \theta \sin \psi +\cos \phi \cos \psi & \sin \phi \cos \theta \\ \cos \phi \sin \theta \cos \psi +\sin \phi \sin \psi & \cos \phi \sin \theta \sin \psi -\sin \phi \cos \psi & \cos \phi \cos \theta \end{array}\right)\\ \,\,\left[\left(\begin{array}{c}{P}_{0}^{{x}^{{\prime} }}\\ {P}_{0}^{{y}^{{\prime} }}\\ {P}_{0}^{{z}^{{\prime} }}\end{array}\right)+{m}^{{\prime} }\left(\begin{array}{c}a\\ b\\ c\end{array}\right)\right]\end{array}$$

(11)

$${r}_{e,x}^{I}={P}_{0}^{{x}^{{\prime} }}\left(\cos \theta \cos \psi \right)+\,{P}_{0}^{{y}^{{\prime} }}(\cos \theta \sin \psi )-{P}_{0}^{{z}^{{\prime} }}(\sin \theta )+{m}^{{\prime} }[a\cos \theta \cos \psi +b\cos \theta \sin \psi -c\sin \theta ]$$

(12.1)

$$\begin{array}{l}{r}_{e,y}^{I}\,=\,{P}_{0}^{{x}^{{\prime} }}(\sin \phi \sin \theta \cos \psi -\cos \phi \sin \psi )\\ \,+\,{P}_{0}^{{y}^{{\prime} }}(\sin \phi \sin \theta \sin \psi +\cos \phi \cos \psi )+{P}_{0}^{{z}^{{\prime} }}(\sin \phi \cos \theta )\\ \,+\,{m}^{{\prime} }[a(\sin \phi \sin \theta \cos \psi -\cos \phi \sin \psi )\\ \,+\,b(\sin \phi \sin \theta \sin \psi +\cos \phi \cos \psi )+c\sin \phi \cos \theta ]\end{array}$$

(12.2)

$$\begin{array}{c}{r}_{e,z}^{I}={P}_{0}^{{x}^{{\prime} }}(\cos \phi \sin \theta \cos \psi +\sin \phi \sin \psi )\\ \,\,+\,{P}_{0}^{{y}^{{\prime} }}(\cos \phi \sin \theta \sin \psi -\sin \phi \cos \psi )\\ \,\,+\,{P}_{0}^{{z}^{{\prime} }}(\cos \phi \cos \theta )+{m}^{{\prime} }[a(\cos \phi \sin \theta \cos \psi +\sin \phi \sin \psi )\\ \,\,+\,b(\cos \phi \sin \theta \sin \psi -\sin \phi \cos \psi )+c\cos \phi \cos \theta ]\end{array}$$

(12.3)

To find the point of intersection, we can substitute the ${r}_{e}^{I}$ component in the equation of the plane and compute the coefficient m. Starting with Equation (6):

$$\begin{array}{c}{y}_{0}\left(x-{x}_{0}\right)+{x}_{0}\,y=0\\ {y}_{0}\left({r}_{e,x}^{I}-{x}_{0}\right)+{x}_{0}{r}_{e,y}^{I}=0\end{array}$$

(13)
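Solving the plane condition for the scalar along the gaze ray can be made concrete with the sketch below. It is an illustrative implementation under stated assumptions, not the authors' code: the rotation matrix uses the entries of Equation (11), the rotation is applied to both the pupil position and the gaze direction, and the function names are hypothetical. Writing the rotated pupil position as p and the rotated direction as d, the plane condition gives m = [y₀(x₀ − p_x) − x₀p_y]/(y₀d_x + x₀d_y).

```python
import math

def rotation_matrix(phi, theta, psi):
    """Rotation matrix with the entries of Equation (11); angles in radians."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [ct * cp, ct * sp, -st],
        [sf * st * cp - cf * sp, sf * st * sp + cf * cp, sf * ct],
        [cf * st * cp + sf * sp, cf * st * sp - sf * cp, cf * ct],
    ]

def matvec(matrix, vec):
    """Multiply a 3x3 matrix by a 3-vector."""
    return [sum(m * v for m, v in zip(row, vec)) for row in matrix]

def gaze_on_screen(p0_head, d_head, angles, x0, y0):
    """Intersection of the rotated gaze ray with the plane
    y0 * (x - x0) + x0 * y = 0, or None if the ray is parallel to it."""
    rot = rotation_matrix(*angles)
    p = matvec(rot, p0_head)
    d = matvec(rot, d_head)
    denom = y0 * d[0] + x0 * d[1]
    if abs(denom) < 1e-12:
        return None
    m = (y0 * (x0 - p[0]) - x0 * p[1]) / denom
    return [pi + m * di for pi, di in zip(p, d)]
```

Evaluating `gaze_on_screen` at successive head orientations yields the sequence of intersection points that traces the gaze trajectory on the screen. The sketch does not check the sign of m, so intersections behind the eye would need to be filtered separately.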

To better visualize the track produced by the intersection of ${r}_{e}^{I}$ with the screen at different times, we treat each of the intersection points as the endpoint of a vector with its base at the inertial frame origin. We can then rotate these vectors around the z axis of the inertial frame by a desired angle. This is equivalent to rotating the screen by that angle, since the relative geometry of the points on the screen does not change under this rotation. To perform this rotation, we use Rodrigues’ rotation formula⁶³:

$${{\bf{v}}}_{rot}={\bf{v}}\cos \beta +(\widehat{z}\,\times \,{\bf{v}})\sin \beta +\widehat{z\,}(\widehat{z}\,{\rm{\cdot }}\,{\bf{v}})(1-\cos \beta )$$

(14)

Where v is the vector of intersection points and β is the rotation angle around the $\widehat{z}$ axis. If we pick $\beta ={\sin }^{-1}\left(\frac{{y}_{0}}{\sqrt{{x}_{0}^{2}+{y}_{0}^{2}}}\right)$, we practically rotate the screen such that it becomes parallel to $\widehat{x}$ and $\widehat{z}$ axes of the inertial frame and perpendicular to $\widehat{y}$.
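For a rotation about the z axis, the standard form of Rodrigues’ formula, in which the cross-product term is scaled by sin β, reduces to a few lines. The sketch below uses hypothetical function names and checks the stated choice of β on the in-plane line l = (−x₀, y₀, 0).

```python
import math

def rotate_about_z(v, beta):
    """Rodrigues rotation of v = (x, y, z) by beta radians about the z axis."""
    x, y, z = v
    c, s = math.cos(beta), math.sin(beta)
    cross = (-y, x, 0.0)    # z_hat x v
    axial = (0.0, 0.0, z)   # z_hat (z_hat . v)
    return tuple(vi * c + ci * s + ai * (1.0 - c)
                 for vi, ci, ai in zip(v, cross, axial))

def screen_alignment_angle(x0, y0):
    """beta = asin(y0 / sqrt(x0^2 + y0^2)), as chosen in the text."""
    return math.asin(y0 / math.hypot(x0, y0))
```

With x₀ = y₀ = 1, β = π/4 and the line (−1, 1, 0) is rotated onto (−√2, 0, 0), that is, parallel to the x axis, as expected for a screen made perpendicular to the y axis.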

In the above treatment, the gaze vector ${{\bf{r}}}_{e}^{H}$ is fixed in the head frame. However, this is not correct as the pupil would move in the head frame. In order to correct for this effect, we simultaneously recorded eye and head rotations in mice during foraging, as described⁴⁰, and implemented the pupil rotations in the head frame to our model. This effectively makes the direction of the gaze dependent on pitch, roll and yaw. In mathematical terms, in Equation (7) we would make the correction:

$$d=\left(\begin{array}{c}a\\ b\\ c\end{array}\right)\to d\left(\phi ,\theta ,\psi \right)={R}_{{\rm{correction}}}^{H}\left(\phi ,\theta ,\psi \right)\left(\begin{array}{c}a\\ b\\ c\end{array}\right)$$

(15)

The rest of the transformations follows as before. The correction rotation matrix in head frame, ${R}_{{\rm{correction}}}^{H}\left(\phi ,\,\theta ,\psi \right)$, was computed from the head–eye rotations measured and represented in Extended Data Fig. 9f,g.

Neural network model

The neural network model consisted of 3 layers, each with 500 neurons. The first layer of neurons had receptive fields, ${{\bf{z}}}_{i}=\left[{z}_{i,{\rm{NT}}}\,;{z}_{i,{\rm{VD}}}\right]\in {{\mathbb{R}}}^{2}$, spanning a (signed) range of ${z}_{{\rm{NT}}}\in [0,140]$ degrees along the naso-temporal (NT) axis and ${z}_{{\rm{VD}}}\in [0,70]$ degrees along the ventro-dorsal (VD) axis. The angle in the NT–VD plane corresponding to the centre of the receptive field of neuron i was denoted ${\theta }_{i}^{{\rm{RF}}}:\,={\tan }^{-1}({z}_{i,{\rm{VD}}}/{z}_{i,{\rm{NT}}})$. Each neuron in layer 1 connected to a corresponding DS neuron in layer 2, which was selective to movement in direction ${\theta }_{i}^{{\rm{DS}}}={\theta }_{i}^{{\rm{RF}}}+{\rm{\pi }}$. That is, if a RF neuron responded to stimuli in a particular part of the receptive field, the corresponding DS neuron responded to motion from this location towards the agent. Finally, layer 3 consisted of motor neurons that were ‘anti-aligned’ with the DS neurons, ${\theta }_{i}^{{\rm{M}}}={\theta }_{i}^{{\rm{DS}}}+{\rm{\pi }}$. Each motor neuron induced movement in the direction ${{\boldsymbol{m}}}_{i}=\left[\cos {\theta }_{i}^{{\rm{M}}};\sin {\theta }_{i}^{{\rm{M}}}\right]$.

All simulations were run using Euler integration with a discrete timestep of size $\Delta t=0.5$ ms and a neural time constant of ${\tau }_{{\rm{neural}}}\,=\,10$ ms. The firing rates of all neurons evolved according to ${r}_{t+1}={\left[{r}_{t}+\frac{\Delta t}{{\tau }_{{\rm{neural}}}}\left(-{r}_{t}+x+{\epsilon }\right)\right]}_{+}$, where ${\left[\cdot \right]}_{+}$ indicates a rectified linear unit nonlinearity and ${\epsilon } \sim N(0,{\sigma }^{2})$ is Gaussian input noise with s.d. of $\sigma =0.1$. x indicates the input to each neuron, which is described for each layer in the following.
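The update rule can be sketched for a single neuron as follows. This is a minimal illustration, not the published model code (see ‘Code availability’): the function names, the constant input and the fixed random seed are assumptions of the example, while the default Δt, τ_neural and σ follow the values stated above.

```python
import random

def euler_step(r, x, rng, dt=0.5, tau=10.0, sigma=0.1):
    """One Euler update of a rectified-linear rate unit (times in ms)."""
    eps = rng.gauss(0.0, sigma)  # Gaussian input noise
    return max(0.0, r + dt / tau * (-r + x + eps))

def simulate_rate(x, n_steps, r0=0.0, sigma=0.1, seed=0):
    """Integrate the rate for n_steps under a constant input x."""
    rng = random.Random(seed)
    r = r0
    for _ in range(n_steps):
        r = euler_step(r, x, rng, sigma=sigma)
    return r
```

Without noise, the rate relaxes towards the input with time constant τ_neural, so after 200 steps (100 ms, or ten time constants) it lies within 0.1% of a constant positive input, and it remains at zero for a negative input because of the rectification.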

The receptive field neurons responded to a stimulus at location ${\bf{s}}=\left[{s}_{{\rm{NT}}}\,;{s}_{{\rm{VD}}}\right]$ in egocentric coordinates with Gaussian tuning curves of the form ${x}_{i}^{{\rm{RF}}}=1.5\,\exp (-0.6\,\kappa \,{| {\boldsymbol{s}}-{{\boldsymbol{z}}}_{i}| }_{2}^{2})$, where $\kappa =40$.

In the static setting, the DS neurons received input from the RF neurons such that ${x}_{i}^{{\rm{DS}}}={r}_{i}^{{\rm{RF}}}$. In the kinetic setting, the input was given by ${x}_{i}^{{\rm{DS}}}=\gamma \,\exp \left(\kappa \,\left[\cos \left({\theta }^{{\rm{S}}}-{\theta }_{i}^{{\rm{DS}}}+{\rm{\pi }}\right)-1\right]\right)$, where ${\theta }^{{\rm{S}}}={\tan }^{-1}\left({s}_{{\rm{VD}}}/{s}_{{\rm{NT}}}\right)$ is the angle of the stimulus within the visual field in egocentric coordinates. Here, $\gamma ={\left[-\frac{{{\boldsymbol{s}}}^{{\rm{T}}}\dot{{\boldsymbol{s}}}}{\left|{\boldsymbol{s}}\right|\left|\dot{{\boldsymbol{s}}}\right|}\right]}_{+}$ is a scale factor that adjusts the input strength according to the ‘concentricity’ of the stimulus, such that the responses of all DS neurons are stronger when the motion of the stimulus ($\dot{{\bf{s}}}$) is ‘concentric’ to the stimulus location (${\bf{s}}$) in the visual field.
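The kinetic input to a DS neuron can be illustrated with the sketch below. It is not the published model code: the function names are hypothetical, κ = 40 is assumed to be the same constant as in the receptive-field tuning curves, and the stimulus angle is computed with a four-quadrant arctangent.

```python
import math

def concentricity(s, s_dot):
    """gamma: rectified negative cosine between stimulus position and velocity."""
    norm = math.hypot(s[0], s[1]) * math.hypot(s_dot[0], s_dot[1])
    if norm == 0.0:
        return 0.0
    return max(0.0, -(s[0] * s_dot[0] + s[1] * s_dot[1]) / norm)

def ds_input(theta_ds, s, s_dot, kappa=40.0):
    """Kinetic-setting input to a DS neuron with preferred direction theta_ds."""
    theta_s = math.atan2(s[1], s[0])  # stimulus angle in the NT-VD plane
    tuning = math.exp(kappa * (math.cos(theta_s - theta_ds + math.pi) - 1.0))
    return concentricity(s, s_dot) * tuning
```

For a stimulus on the positive NT axis moving towards the origin, γ = 1 and the neuron with preferred direction π receives the maximal input of 1; if the same stimulus moves away from the origin, γ = 0 and all DS neurons receive zero input.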

Finally, motor neurons received input from the DS neurons, ${x}_{i}^{{\rm{M}}}={r}_{i}^{{\rm{DS}}}$, and the motion of the agent was computed as $\Delta {{\bf{a}}}_{t}=\frac{\Delta t}{{\tau }_{m}}{\sum }_{i}{r}_{i}^{M}{{\bf{m}}}_{i}$. Simulations were terminated once (1) 2.5 s had elapsed, (2) the stimulus left the receptive field of the agent, or (3) the total agent motion exceeded 200° in the NT direction or 100° in the VD direction. ${\tau }_{m}$ took a default value of 250 ms and was adjusted to modulate the relative velocity of the agent compared to the stimulus.

For quantitative analyses, energy consumption was computed as proportional to the total cumulative movement speed, $E\propto {\sum }_{t}{\left|\Delta {{\bf{a}}}_{t}\right|}_{2}$. An ‘intercept’ was considered successful if the agent moved within 24° of the stimulus (20% of the receptive field).
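The motor read-out and the two summary measures can be sketched as follows. The function names are hypothetical, positions are taken to be in degrees in the NT–VD plane, and the use of a strict inequality for the intercept criterion is an assumption.

```python
import math

def agent_step(motor_rates, motor_angles, dt=0.5, tau_m=250.0):
    """Displacement of the agent in one time step: (dt / tau_m) * sum_i r_i m_i."""
    scale = dt / tau_m
    dx = scale * sum(r * math.cos(a) for r, a in zip(motor_rates, motor_angles))
    dy = scale * sum(r * math.sin(a) for r, a in zip(motor_rates, motor_angles))
    return dx, dy

def energy(steps):
    """Cumulative movement, proportional to the energy measure E."""
    return sum(math.hypot(dx, dy) for dx, dy in steps)

def intercepted(agent, stimulus, radius_deg=24.0):
    """True if the agent is within radius_deg of the stimulus."""
    return math.hypot(agent[0] - stimulus[0], agent[1] - stimulus[1]) < radius_deg
```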

Statistical methods

Data were tested for normality. If the distribution was not normal, non-parametric tests such as Mann–Whitney and Kruskal–Wallis tests were used instead of t-tests and ANOVA. When the number of recordings per experimental condition was too low to assess the type of distribution, the data were assumed normal.

In all STA analyses, a t-test was applied between the displacement vector and the displacement vectors of the shuffled data with a significance threshold of 0.05. Neurons were considered to be motion tuned if the t-test determined a significant difference between the displacement vectors of the real and shuffled data. Similarly, to determine the significance of the SI, a t-test was applied between the SI and the SIs of the shuffled distribution with a threshold of 0.05. To test the angular alignment of visuo-motor neurons, Watson–Williams tests were performed between the dataset and a random distribution of equal s.d. centred at 0°, 90°, 180° and 270°; an F-statistic value lower than the critical value (3.85) indicates significant similarity between distributions.

All results are presented as mean ± s.e.m. unless otherwise stated. Results were considered statistically significant at *P ≤ 0.05, **P ≤ 0.01, ***P ≤ 0.001. Relevant P values and tests used are reported in the figure legends.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request. When possible, pre-processed electrophysiological data have been uploaded to Zenodo (https://doi.org/10.5281/zenodo.11105001)⁶⁴. Source data are provided with this paper.

Code availability

The code corresponding to the computational neuronal network model developed for Fig. 4 is available at https://github.com/AnaG-R/Visuomotor-model-SC.git.

References

1. Schiller, P. H. & Stryker, M. Single-unit recording and stimulation in superior colliculus of the alert rhesus monkey. J. Neurophysiol. 35, 915–924 (1972).
2. Drager, U. C. & Hubel, D. H. Physiology of visual cells in mouse superior colliculus and correlation with somatosensory and auditory input. Nature 253, 203–204 (1975).
3. Sparks, D., Rohrer, W. H. & Zhang, Y. The role of the superior colliculus in saccade initiation: a study of express saccades and the gap effect. Vision Res. 40, 2763–2777 (2000).
4. Marino, R. A., Rodgers, C. K., Levy, R. & Munoz, D. P. Spatial relationships of visuomotor transformations in the superior colliculus map. J. Neurophysiol. 100, 2564–2576 (2008).
5. Sadeh, M., Sajad, A., Wang, H. Y., Yan, X. G. & Crawford, J. D. Spatial transformations between superior colliculus visual and motor response fields during head-unrestrained gaze shifts. Eur. J. Neurosci. 42, 2934–2951 (2015).
6. Chen, C. Y., Hoffmann, K. P., Distler, C. & Hafed, Z. M. The foveal visual representation of the primate superior colliculus. Curr. Biol. 29, 2109–2119.e7 (2019).
7. Gandhi, N. J. & Katnani, H. A. Motor functions of the superior colliculus. Annu. Rev. Neurosci. 34, 205–231 (2011).
8. Harris, L. R. The superior colliculus and movements of the head and eyes in cats. J. Physiol. 300, 367–391 (1980).
9. Masullo, L. et al. Genetically defined functional modules for spatial orienting in the mouse superior colliculus. Curr. Biol. 29, 2892–2904.e8 (2019).
10. Stryker, M. P. & Schiller, P. H. Eye and head movements evoked by electrical stimulation of monkey superior colliculus. Exp. Brain Res. 23, 103–112 (1975).
11. Masullo, L. & Tripodi, M. Goal-oriented behaviour: the ventral tegmental area in motivated movements. Curr. Biol. 29, R922–R925 (2019).
12. González-Rueda, A. & Tripodi, M. Eloge de la fuite: neural circuits for avoiding dangerous situations. Trends Neurosci. 42, 657–659 (2019).
13. Sparks, D. L. & Nelson, J. S. Sensory and motor maps in the mammalian superior colliculus. Trends Neurosci. 10, 312–317 (1987).
14. Chevalier, G., Vacher, S. & Deniau, J. M. Inhibitory nigral influence on tectospinal neurons, a possible implication of basal ganglia in orienting behavior. Exp. Brain Res. 53, 320–326 (1984).
15. Bolton, A. D. et al. A diencephalic dopamine source provides input to the superior colliculus, where D₁ and D₂ receptors segregate to distinct functional zones. Cell Rep. 13, 1003–1015 (2015).
16. Cang, J., Savier, E., Barchini, J. & Liu, X. Visual function, organization, and development of the mouse superior colliculus. Annu. Rev. Vis. Sci. 4, 239–262 (2018).
17. de Malmazet, D., Kuhn, N. K. & Farrow, K. Retinotopic separation of nasal and temporal motion selectivity in the mouse superior colliculus. Curr. Biol. 28, 2961–2969.e4 (2018).
18. Feinberg, E. H. & Meister, M. Orientation columns in the mouse superior colliculus. Nature 519, 229–232 (2015).
19. Ahmadlou, M. & Heimel, J. A. Preference for concentric orientations in the mouse superior colliculus. Nat. Commun. 6, 6773 (2015).
20. Wheatcroft, T., Saleem, A. B. & Solomon, S. G. Functional organisation of the mouse superior colliculus. Front. Neural Circuits 16, 792959 (2022).
21. Ito, S., Feldheim, D. A. & Litke, A. M. Segregation of visual response properties in the mouse superior colliculus and their modulation during locomotion. J. Neurosci. 37, 8428–8443 (2017).
22. Lee, K. H., Tran, A., Turan, Z. & Meister, M. The sifting of visual information in the superior colliculus. eLife 9, e50678 (2020).
23. Gordon, B. Receptive fields in deep layers of cat superior colliculus. J. Neurophysiol. 36, 157–178 (1973).
24. Jay, M. F. & Sparks, D. L. Sensorimotor integration in the primate superior colliculus. I. Motor convergence. J. Neurophysiol. 57, 22–34 (1987).
25. Mays, L. E. & Sparks, D. L. Dissociation of visual and saccade-related responses in superior colliculus neurons. J. Neurophysiol. 43, 207–232 (1980).
26. Wurtz, R. H. & Goldberg, M. E. Activity of superior colliculus in behaving monkey. 3. Cells discharging before eye movements. J. Neurophysiol. 35, 575–586 (1972).
27. Gale, S. D. & Murphy, G. J. Distinct representation and distribution of visual information by specific cell types in mouse superficial superior colliculus. J. Neurosci. 34, 13458–13471 (2014).
28. Isa, T., Endo, T. & Saito, Y. The visuo-motor pathway in the local circuit of the rat superior colliculus. J. Neurosci. 18, 8496–8504 (1998).
29. Isa, T. & Hall, W. C. Exploring the superior colliculus in vitro. J. Neurophysiol. 102, 2581–2593 (2009).
30. Lee, P. H., Helms, M. C., Augustine, G. J. & Hall, W. C. Role of intrinsic synaptic circuitry in collicular sensorimotor integration. Proc. Natl Acad. Sci. USA 94, 13299–13304 (1997).
31. Graybiel, A. M. A stereometric pattern of distribution of acetylthiocholinesterase in the deep layers of the superior colliculus. Nature 272, 539–541 (1978).
32. Hoy, J. L., Bishop, H. I. & Niell, C. M. Defined cell types in superior colliculus make distinct contributions to prey capture behavior in the mouse. Curr. Biol. 29, 4130–4138.e5 (2019).
33. Jia, H., Rochefort, N. L., Chen, X. & Konnerth, A. Dendritic organization of sensory input to cortical neurons in vivo. Nature 464, 1307–1312 (2010).
34. Fenno, L. E. et al. Comprehensive dual- and triple-feature intersectional single-vector delivery of diverse functional payloads to cells of behaving mammals. Neuron 107, 836–853.e11 (2020).
35. Ciabatti, E., González-Rueda, A., Mariotti, L., Morgese, F. & Tripodi, M. Life-long genetic and functional access to neural circuits using self-inactivating rabies virus. Cell 170, 382–392.e14 (2017).
36. Lee, H. S. et al. Combining long-term circuit mapping and network transcriptomics with SiR-N2c. Nat. Methods 20, 580–589 (2023).
37. Ciabatti, E. et al. Genomic stability of self-inactivating rabies. eLife 12, e83459 (2023).
38. Wilson, J. J., Alexandre, N., Trentin, C. & Tripodi, M. Three-dimensional representation of motor space in the mouse superior colliculus. Curr. Biol. 28, 1744–1755.e12 (2018).
39. Schiller, P. H. & Koerner, F. Discharge characteristics of single units in superior colliculus of the alert rhesus monkey. J. Neurophysiol. 34, 920–936 (1971).
40. Meyer, A. F., O’Keefe, J. & Poort, J. Two distinct types of eye–head coupling in freely moving mice. Curr. Biol. 30, 2116–2130.e6 (2020).
41. Michaiel, A. M., Abe, E. T. & Niell, C. M. Dynamics of gaze control during prey capture in freely moving mice. eLife 9, e57458 (2020).
42. Li, Y. T., Turan, Z. & Meister, M. Functional architecture of motion direction in the mouse superior colliculus. Curr. Biol. 30, 3304–3315.e4 (2020).
43. Itti, L., Koch, C. & Niebur, E. A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. 20, 1254–1259 (1998).
44. Veale, R., Hafed, Z. M. & Yoshida, M. How is visual salience computed in the brain? Insights from behaviour, neurobiology and modelling. Philos. Trans. R. Soc. B 372, 20160113 (2017).
45. Wurtz, R. H. & Albano, J. E. Visual-motor function of the primate superior colliculus. Annu. Rev. Neurosci. 3, 189–226 (1980).
46. Horwitz, G. D. & Newsome, W. T. Separate signals for target selection and movement specification in the superior colliculus. Science 284, 1158–1161 (1999).
47. Engert, F., Tao, H. W., Zhang, L. I. & Poo, M. M. Moving visual stimuli rapidly induce direction sensitivity of developing tectal neurons. Nature 419, 470–475 (2002).
48. Linkenhoker, B. A. & Knudsen, E. I. Incremental training increases the plasticity of the auditory space map in adult barn owls. Nature 419, 293–296 (2002).
49. Evans, D. A. et al. A synaptic threshold mechanism for computing escape decisions. Nature 558, 590–594 (2018).
50. De Franceschi, G., Vivattanasarn, T., Saleem, A. B. & Solomon, S. G. Vision guides selection of freeze or flight defense strategies in mice. Curr. Biol. 26, 2150–2154 (2016).
51. Benavidez, N. L. et al. Organization of the inputs and outputs of the mouse superior colliculus. Nat. Commun. 12, 4004 (2021).
52. Baden, T., Euler, T. & Berens, P. Understanding the retinal basis of vision across species. Nat. Rev. Neurosci. 21, 5–20 (2020).
53. Campagner, D. et al. A cortico-collicular circuit for orienting to shelter during escape. Nature 613, 111–119 (2023).
54. Kadakia, N. et al. Odour motion sensing enhances navigation of complex plumes. Nature 611, 754–761 (2022).
55. Chapman, J. W. et al. Animal orientation strategies for movement in flows. Curr. Biol. 21, R861–R870 (2011).
56. González-Rueda, A., Pedrosa, V., Feord, R. C., Clopath, C. & Paulsen, O. Activity-dependent downscaling of subthreshold synaptic inputs during slow-wave-sleep-like activity in vivo. Neuron 97, 1244–1252.e5 (2018).
57. Anikeeva, P. et al. Optetrode: a multichannel readout for optogenetic control in freely moving mice. Nat. Neurosci. 15, 163–170 (2012).
58. Giovannucci, A. et al. CaImAn: an open source tool for scalable calcium imaging data analysis. eLife 8, e38173 (2019).
59. Meyer, A. F., Poort, J., O’Keefe, J., Sahani, M. & Linden, J. F. A head-mounted camera system integrates detailed behavioral monitoring with multichannel electrophysiology in freely moving mice. Neuron 100, 46–60.e7 (2018).
60. Khan, A. & Mathelier, A. Intervene: a tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinformatics 18, 287 (2017).
61. Mathis, A. et al. DeepLabCut: markerless pose estimation of user-defined body parts with deep learning. Nat. Neurosci. 21, 1281–1289 (2018).
62. Sakatani, T. & Isa, T. PC-based high-speed video-oculography for measuring rapid eye movements in mice. Neurosci. Res. 49, 123–131 (2004).
63. Dai, J. S. Euler–Rodrigues formula variations, quaternion conjugation and intrinsic connections. Mech. Mach. Theory 92, 144–152 (2015).
64. González-Rueda, A. et al. Kinetic features dictate sensorimotor alignment in the superior colliculus. Zenodo https://doi.org/10.5281/zenodo.11105001 (2024).

Acknowledgements

The authors thank the Laboratory of Molecular Biology electronics and mechanical workshops for the help with hardware development, members of the Biological Service Group for their support with animal husbandry and F. Morgese for assistance in viral production. This study was supported by the Medical Research Council core funds to M.T. (MC_UP_1201/2), the UKRI (ERC Consolidator Replacement Grant, EP/X034666/1) to M.T., the European Research Council with an ERC Starting Grant to M.T. (STG 677029), the ERANET-NEURON (Micronet, MC_PC_16036) grant to M.T. and the Wellcome Trust (J.P., 211258/Z/18/Z). A.G.-R. held a Henslow Fellowship, awarded by the Cambridge Philosophical Society and St Edmund’s College (University of Cambridge). K.J. was funded by a Cambridge Gates Scholarship. M.N. was funded by a Trinity Henry Barlow scholarship, by M. C. Payne at the TCM Group and, for the last phase of revisions, by Nokia Bell Labs for research unrelated to this manuscript. D.d.M. was supported by the European Union’s Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie grant agreement no. 894697. J.K. was supported by a PhD studentship funded by Wolfson College, the Department of Physiology, Development and Neuroscience and the School of Biological Sciences DTP at the University of Cambridge.

Author information

Authors and Affiliations

MRC Laboratory of Molecular Biology, Cambridge, UK
Ana González-Rueda, Daniel de Malmazet, Jonathan Wilson, Ernesto Ciabatti, Elena Williams, Guillaume Hennequin & Marco Tripodi
St Edmund’s College, University of Cambridge, Cambridge, UK
Ana González-Rueda
Department of Engineering, University of Cambridge, Cambridge, UK
Kristopher Jensen & Guillaume Hennequin
TCM Group, Cavendish Laboratory, University of Cambridge, Cambridge, UK
Mohammadreza Noormandipour
Nokia Bell Labs, Cambridge, UK
Mohammadreza Noormandipour
Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
Jisoo Kim & Jasper Poort

Authors

Ana González-Rueda
Kristopher Jensen
Mohammadreza Noormandipour
Daniel de Malmazet
Jonathan Wilson
Ernesto Ciabatti
Jisoo Kim
Elena Williams
Jasper Poort
Guillaume Hennequin
Marco Tripodi

Contributions

A.G.-R. and M.T. conceived the project. A.G.-R. performed the experiments with contributions from D.d.M., J.K. and E.W. K.J. built the neuronal network model with input from G.H. J.W. and A.G.-R. set up tetrode recordings. A.G.-R. analysed all data except for the STA of eye movements under head restraint and calcium imaging, which was performed by D.d.M., and the coupling of eye and head movement in freely moving mice, which was performed by J.K. and J.P. E.C. produced viral constructs. M.N. developed the geometrical model converting 3D head rotations into 2D gaze projections and performed associated analysis. A.G.-R. and M.T. wrote the manuscript and implemented input from all authors.

Corresponding authors

Correspondence to Ana González-Rueda or Marco Tripodi.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature thanks Peng Cao, Mark Hübener and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Characteristics of Pitx2^ON-PRE and Pitx2^ON neurons.

a, Full diagram of the experimental design in Fig. 1a and in Fig. 1d (b). c, Reversal potential of direct retinal input onto monosynaptically connected Pitx2^ON-PRE neurons (n = 8 neurons). d, Example neuron displaying a slowly rising depolarisation (ramp) following long retinal stimulation. Owing to their slow rise time, these neurons were considered non-responsive in Fig. 1. e, Time between light pulse start and postsynaptic response initiation (latency). f, Number of action potentials elicited following different current injections. g, Latency of responses per layer also considering Ramp activation. h, Number of spikes elicited by light activation after GABAzine and CGP superfusion. i, Amplitude and duration of activation in response to light stimulation after GABAR blockade. j, Decay time constant of excitatory postsynaptic activity before and after GABAR block for neurons receiving direct (left, one-tailed paired t-test p = 0.0979) and indirect (right, one-tailed paired t-test p = 0.0214) retinal input. k, Resting membrane potential before and after GABAR block (one-tailed paired t-test p = 0.0260). l, Reconstruction obtained from 10 of the neurons included in Fig. 1. m, Example Pitx2^ON-PRE narrow-field neurons traced with G-deleted Rabies virus, image extracted from dataset used for Fig. 1 (N = 4 mice, bars: 200 µm and 50 µm). n, In situ hybridization was performed on SC slices containing Pitx2^ON-PRE neurons traced with G-deleted rabies using RNAscope. o, Example staining of markers of narrow-field (Grp and Cbln4) and wide-field (Ntsr1 and Npnt) neurons (bars: 250 µm). p, Quantification of Npnt⁺ and Cbln4⁺ Pitx2^ON-PRE neurons. Individual replicates are shown in red. All box plots indicate: minimum, 1st quartile, median, 3rd quartile and maximum; cross indicates mean. All other error bars represent ± s.e.m.

Extended Data Fig. 2 Reconstruction and visual properties of neurons recorded in vivo.

a, Reconstruction of biocytin-filled neurons in vivo in whole-cell mode in the Superior Colliculus. Red dashed box indicates recorded neurons tuned to moving gratings with subthreshold and suprathreshold tuning. Yellow indicates neurons tuned to moving gratings only in the subthreshold domain and grey indicates neurons with no significant visual tuning. The layer in which they were recorded is shown in the top corners of each box. b, Comparison of subthreshold and suprathreshold selectivity indexes (SI). c, Increase in firing rate over baseline during the presentation of the direction eliciting maximal response for neurons tuned to moving gratings. The maximal spiking response is shown both when those gratings moved (red) and when they were static (blue). d, Integral (AUC) of the same responses shown in c and of those cells tuned only in the subthreshold domain. e, Distribution per layer of the SI presented in Fig. 2g. f, Increase in firing rate at the most responsive location for neurons with a spatial receptive field measured using flashing squares. Increase in spiking activity is shown for the same neurons in response to black squares (red marker) and white squares (white marker) on a grey background. g, AUC of the maximal response to black (yellow marker) or white squares (white marker). Error bars represent ± s.e.m.

Extended Data Fig. 3 Visual tuning properties in the SC in awake mice.

a, Illustration of the experimental setup to record neuronal responses to visual stimuli in the Superior Colliculus in awake conditions. b, Example Nissl staining of electrode tract in the SC (left) and reconstructed tracts for mice included in Fig. 2 (N = 5, right). c, Example tuning curves of two OS neurons (top), one with positive selectivity over baseline and another below. At the bottom two neurons tuned to moving gratings but showing no specific selectivity are shown: one increased its firing when gratings moved and the other decreased it. These last two behaviours were not considered tuning to moving gratings for further analysis. d, Example tuning curve of a direction selective neuron (DS) in the superficial layers of the SC. e, Example responses of the neuron in d to two grating presentations. f, Z-score of the response of the neuron in d to its preferred direction for all 9 trials in which this direction was presented. The average Z-score is shown underneath. g, Single trial (open circles) and average Selectivity Indexes when gratings were moving (red) or static (blue) and the distribution of the average SI of all 1000 shuffled replicates (top). The red dot indicates how the average SI of this neuron ranks among the shuffled distribution (black). h, Comparison of the orientation SI (OSI) and the direction SI (DSI) for the same neurons. Linear regression y = 0.6164x + 0.04868. 95% CI of slope: 0.5322-0.7006. i, SIs for moving gratings >0.1 in direction space (D) or orientation space (O). j, Three properties were used to determine whether a neuron was tuned to moving gratings: the SI (>0.1), its significance when compared to the shuffled distribution (rank >95%, corresponding to > 0.95 in the probability distribution), and the trial to trial variability shown as the angle mean deviation of the preferred angles per trial (<0.8 rad).
Neurons fulfilling all three criteria were considered strongly tuned; neurons fulfilling only two were still considered tuned, albeit less strongly. We did not consider a neuron tuned if only one criterion was fulfilled. k, Firing rate during baseline (i) and increase in response to preferred direction (ii) for neurons of all 4 classes described in j. l, Change in firing rate during gratings presentation (Kruskal-Wallis p < 0.0001, Dunn’s multiple comparison test p = 0.0005) for neurons tuned to moving gratings across layers. m, SIs for Orientation selective neurons. Paired OSIs for each cell (moving gratings, red, and static gratings, blue) are shown. Mean ± s.e.m. is also displayed. Two-tailed paired t-test p = 9.10 × 10⁻⁵. n, Example receptive field responses for a neuron in the superficial layers of the SC (top) and deep layers (bottom). o, Proportion of all neurons tuned to moving gratings (Grats, black), flashing squares (RF, white) or both (grey) across layers. p, Vgat-cre mice, injected with an AAV-DIO-ChR2 and implanted with optrodes were recorded before and after light stimulation (n = 17 neurons in N = 3 mice; constant 10 ms pulses at 30 Hz). An example neuron losing its directional selectivity to moving gratings (red) when VGat⁺ neurons were activated (blue) is shown. q, Change in SI after light stimulation (opto) for all neurons (left, one-tailed paired t-test p = 0.0051) and only neurons tuned in control conditions (right, one-tailed paired t-test p = 0.0308). r, Change in the rank of the SI compared to a shuffled distribution (significance considered for >= 0.95) before and after light stimulation for all recorded neurons (left, one-tailed paired t-test p = 5.52 × 10⁻⁵) and those tuned in control conditions (right, one-tailed paired t-test p = 0.0036).
s, Maximal response over baseline (left, one-tailed paired t-test p = 0.2076) and baseline firing rate (right, one-tailed paired t-test p = 0.40124), in control conditions (red) and during optogenetic stimulation (blue). All box plots indicate: minimum, 1st quartile, median, 3rd quartile and maximum; cross indicates mean. All other error bars represent ± s.e.m.
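The three-criteria classification in panel j can be sketched in a few lines of Python. This is an illustration only, not the authors' analysis code: the legend does not give the SI formula or the shuffling procedure, so the vector-sum index in `selectivity_index` and the shuffling of single-trial responses across direction labels in `shuffle_rank` are assumptions, and all function names are hypothetical. Only the thresholds (SI > 0.1, rank > 0.95, angular deviation < 0.8 rad) and the 1000 shuffles come from the legend.

```python
import cmath
import math
import random


def selectivity_index(mean_rates, angles_deg):
    """Assumed SI: length of the rate-weighted vector sum, normalised by total rate."""
    total = sum(mean_rates)
    if total <= 0:
        return 0.0
    vector = sum(r * cmath.exp(1j * math.radians(a))
                 for r, a in zip(mean_rates, angles_deg))
    return abs(vector) / total


def mean_per_direction(trials):
    """Average a trials x directions list of responses over trials."""
    n_trials = len(trials)
    return [sum(column) / n_trials for column in zip(*trials)]


def shuffle_rank(trials, angles_deg, n_shuffles=1000, seed=0):
    """Fraction of shuffled SIs below the observed SI (assumed shuffling scheme)."""
    rng = random.Random(seed)
    observed = selectivity_index(mean_per_direction(trials), angles_deg)
    flat = [response for trial in trials for response in trial]
    n_dirs = len(angles_deg)
    below = 0
    for _ in range(n_shuffles):
        rng.shuffle(flat)  # break the link between response and direction label
        shuffled = [flat[i:i + n_dirs] for i in range(0, len(flat), n_dirs)]
        if selectivity_index(mean_per_direction(shuffled), angles_deg) < observed:
            below += 1
    return below / n_shuffles


def classify_tuning(si, rank, angle_deviation_rad):
    """Count how many of the three criteria from panel j are met."""
    met = sum([si > 0.1, rank > 0.95, angle_deviation_rad < 0.8])
    if met == 3:
        return "strongly tuned"
    if met == 2:
        return "tuned"
    return "not tuned"
```

For example, `classify_tuning(0.3, 0.99, 0.2)` meets all three criteria, whereas a neuron with a high SI but a rank of 0.5 meets only two and falls in the weaker "tuned" class.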

Extended Data Fig. 4 Modulation of visual responses in the SC.

a, Diagram of a head restrained mouse running on the wheel. The angular velocity of the wheel was extracted to detect running bouts (green). b, Two example neurons strongly modulated by running and recorded simultaneously. The response of both neurons to 80 running bouts is shown as a raster (ii). The average of the angular velocity of the wheel for all bouts (i), the average change in firing frequency (iii) and its corresponding z-score (iv) are also shown. c, Two neurons (top and bottom graphs respectively) tuned to moving gratings (9 trials, i) with their tuning either positively modulated (bottom) or negatively modulated (top) by running. Neither of these had a strong modulation to running in baseline conditions (ii). d, Cumulative distribution of the correlation between the wheel’s angular velocity and the firing rate of the neurons (red and blue shading indicates the proportion of neurons with a positive >0.1 or negative <0.1 correlation respectively). e, SIs for DS or OS neurons in trials when the mouse was running on the wheel (Run) or not (No_run). f, Cumulative distribution of the change in response of neurons tuned to moving gratings in trials when the mouse was running vs trials when it was not (red and blue shading indicates the proportion of neurons with an increase >20% or decreased <20% response when running). g, Comparison of the preferred grating angle of DS neurons (left) or OS neurons (right) for all trials and those trials when the mouse was running (red) or not (black). h, Frame from a video of the right eye of a mouse during visual stimulation and the markers used for DeepLabCut tracking of the pupil position throughout the recording. i, Example STA of the right eye movements of a neuron tuned during baseline conditions (grey screen, top), but not in darkness (middle) or during visual presentation (bottom).
j, Eye angular displacement in the temporal-anterior axis (positive = anterior movements) and the dorso-ventral axis (positive = downward movements) for neurons tuned to eye movements in darkness (black screen, B) or light (grey screen, G). k, Venn diagram showing the overlap of the neurons tuned to saccadic eye movements in baseline (grey screen), darkness and visuo-motor neurons as described in Fig. 3. All box plots indicate: minimum, 1st quartile, median, 3rd quartile and maximum; cross indicates mean. All other error bars represent ± s.e.m.

Extended Data Fig. 5 Mapping of visual tuning properties in response to drifting Gabor patches, small moving spots and changes in luminescence in the SC.

a, Gabor patches drifting in 8 different directions were randomly presented at 24 locations of the screen. b, Example neuron displaying location-specific tuning to moving Gabor patches measured by averaging all directions of movement per location (left, the length and direction of the overlaid arrows indicate the selectivity index and preferred direction of movement per location), location-independent directional tuning as measured by averaging the response to drifting patches for each direction across all locations (middle, yellow), and a strong direction tuning at the preferred location (right, green). Shaded area in polar plots indicates ± s.e.m. c, Example neuron without a location preference but tuned to a particular direction of movement of the Gabor patches. d, Mice were also presented with a small spot (1.3 cm diameter) starting at 8 locations along the corners and border of the screen (red dots) and moving in 8 directions towards and away from the mouse’s centre of gaze on the screen (red cross). e, Example tuning of a neuron responding to a small spot moving in a particular orientation. Shaded area in polar plots indicates ± s.e.m. Average firing rate for each direction is shown as a heat map and the average change in firing rate over time (2.5 s) across directions is also shown at the bottom (i). The tuning of this neuron corresponds to the spot crossing its spatial receptive field mapped using drifting Gabor patches (ii). f, Example tuning to a small moving spot for a neuron without spatial receptive field. g, Example neuron tuned to full field moving gratings (i), Gabor patches (ii) and moving spot (iii). h, Distribution of neurons responding to particular locations in space mapped using Gabor patches (spatial Gabor, blue), neurons showing direction or orientation selectivity mapped using Gabor patches (Gabor OS/DS, green) or full field gratings (red), neurons tuned to moving spots (spot OS/DS, black) or tuned to head rotations (head tuned, purple).
The number of recorded neurons per depth of recording, indicated by the shading in the SC diagram, is shown in parentheses (N = 4 mice). i, Distribution of neurons tuned to particular locations (spatial Gabor, blue), direction or orientation selective neurons (Gabor OS/DS, green) or both (grey) among all neurons tuned to drifting Gabor patches. j, Overlap of the tunings shown in h for individual neurons, computed using Intervene⁶⁰. Similarly to a Venn diagram, this plot illustrates the logical relation between sets, with darker areas indicating a larger number of neurons at the intersection of sets. Static features are preferentially represented at the top, while kinetic ones are preferentially represented towards the bottom. Note that the number of neurons is represented by the shade of colour and not by the surface of the intersection. k, Angular difference between the preferred angle of tuning measured using full field gratings and moving spots (black) or Gabor patches (green). l, Example neurons recorded with tetrodes in awake head-restrained conditions and responding to changes in luminescence (increased luminescence from black to grey, top, and decrease in luminescence, bottom). For both neurons a raster of one example trial and the average PSTH are shown. m, Firing rate changes from black to grey or grey to black. Number of tuned neurons is shown in parentheses. n, Percentage of neurons displaying a response to luminescence across layers. Total number of neurons recorded is shown in parentheses.

Extended Data Fig. 6 Visual and motor tuning of motor Pitx2^ON and premotor Pitx2^ON-PRE neurons in the SC.

a, Example opto-tagged Pitx2^ON motor neurons. Peri-stimulus time histogram of 900 5 ms-long laser stimulations (right, bottom) and example raster plot of the first 300 stimulations are shown (right, top). b, Example STA of angular head displacements in yaw, pitch and roll for a Pitx2^ON motor neuron (i). The 3D STA was projected onto the 2D plane of the visual stimulation screen to draw the tuned direction of gaze. The decomposed and average gaze direction is shown in ii. c, Tuning to moving gratings of the neuron in b. d, Pitx2^ON-PRE neurons were opto-tagged using a combinatorial Cre^OFF/Flp^ON virus expressing ChR2 and a Self-Inactivating Rabies virus expressing FLP recombinase in Pitx2-cre mice. e, Example Pitx2^ON-PRE neuron displaying tuning to moving gratings (i) and to drifting Gabor patches (ii, red cross indicates centre of gaze). f, Overlap of tuning to all types of visual stimuli tested in premotor neurons and their tuning to head rotations (n = 17 Pitx2^ON-PRE neurons recorded across 10 mice). g, A similar strategy was used to express GCaMP6f in Pitx2^ON-PRE neurons by injecting an AAV leading to FLP recombinase-dependent expression of GCaMP6f. h, Example Pitx2^ON-PRE neuron displaying tuning to moving gratings (i) and to drifting Gabor patches (ii). The average ΔF/F for gratings moving in all 12 directions is shown in red (i, left, shaded area corresponds to ± s.e.m.). The average responses to gratings of two opposing directions (black) and individual trial responses (grey) are also shown. i and j, Distribution of tuned Pitx2^ON-PRE neurons across layers of the SC recorded using optetrodes (i, N = 10 mice, n = 17 Pitx2^ON-PRE neurons) or Ca²⁺ imaging (j, N = 7 mice, n = 74 Pitx2^ON-PRE neurons). k, Proportion of direction and orientation selective Pitx2^ON-PRE neurons. l, Absolute value of the difference in preferred angle of direction of motion of visual stimulus (as measured using gratings or Gabor patches) plotted against the 3D distance in the SC.
The number of pairs is shown in parentheses. DS/OS correspond to pairs of either OS or DS neurons compared in orientation space (180°, left), while DS pairs only consider DS neurons in direction space (360°, right). For OS/DS neurons (N = 29 cells, n = 4 mice), orange line corresponds to the median of the population, lower and upper box boundaries correspond to the 25th and 75th percentile of the population, respectively and notches correspond to the 95% confidence interval of the median calculated by bootstrapping (10000 iterations). For DS pairs (N = 20 cells, n = 3 mice), orange line corresponds to the median of the population and lower and upper box boundaries correspond to the 25th and 75th percentile of the population, respectively. m, All direction and orientation tuned cells to full field gratings or local Gabor patches within a volume in the SC are plotted as spheres and colour-coded according to preferred motion axis (i). A, anterior; M, medial; S, superficial. The same volume with only DS neurons colour-coded indicating their preferred direction of motion is also shown (ii).
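The distinction in panel l between orientation space (180°) and direction space (360°) comes down to the period used when wrapping the angular difference. A minimal sketch, assuming angles are given in degrees; the function name `angular_difference_deg` is hypothetical and not taken from the authors' code:

```python
def angular_difference_deg(a_deg, b_deg, period=360.0):
    """Absolute smallest difference between two angles on a circle of the given period.

    period=360.0 compares preferred directions (result in 0-180 degrees);
    period=180.0 compares preferred orientations (result in 0-90 degrees).
    """
    diff = abs(a_deg - b_deg) % period
    return min(diff, period - diff)
```

Two neurons preferring 10° and 190° are maximally different in direction space (180°) but identical in orientation space (0°), which is why OS and DS neurons can only be pooled in the 180° comparison.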

Extended Data Fig. 7 Distribution and characteristics of collicular neurons tuned to head rotations.

a, Intersection diagram of all 64 types of combinatorial possible tunings of the 3 head rotations (yaw, pitch, roll) tuned in light (_L) or dark (_D) conditions, produced online with Intervene⁶⁰. b, Maximum average displacement angles for neurons tuned to yaw, pitch or roll either in both light and dark conditions (half-sun/half-moon symbol, tuning in light shown), light only (sun) or dark only (moon). The number of neurons tuned to each individual component is shown in parentheses. c, Comparison of the displacement angle in light and dark for neurons tuned in both conditions (motor units). Linear regression: y = 0.7956x – 0.9162. Slope 95% CI: 0.6302-0.9611, shows a conservation of tuning. d, Spike-triggered averages for the two dark trials and the second light trial corresponding to the motor neuron in Fig. 3b. e, Distribution of neurons tuned to head displacements as shown in Fig. 3. Those also tuned to moving gratings, having a receptive field or both are also shown. f, Displacement angles for DS neurons also tuned to head movements (visuo-motor neurons). g, Average change in firing rate for all presentations for direction selective neurons that are also tuned to head displacements showing that neurons tuned only in darkness are overall inhibited by visual stimulation. h, Response to preferred direction of the same neurons in e showing that neurons tuned only in darkness still show an increase in firing over baseline to their preferred orientation. i, Selectivity indexes of neurons in f. j, Change in response to moving gratings while running for neurons in f. k, Example neuron tuned to moving gratings and head displacements only in darkness. l, Neuron recorded in vivo in whole-cell mode displaying a similar selectivity as that in k. The spiking activity of this cell was inhibited by light (iv). All error bars represent ± s.e.m.

Extended Data Fig. 8 Head rotation tuning changes during enforced visual flow foraging.

a, Mice were left to freely forage for droplets of condensed milk while visual flow was enforced by having either horizontal (for 5 min, black dots) or vertical (for 5 min, red open circles) black and white stripes across all 4 walls of the arena (N = 4, n = 44 tuned neurons out of 134 recorded neurons). b, Angular difference between the maximal average displacement angle for all neurons tuned in a white arena and their displacement angle with either horizontal or vertical stripes. c, Breakdown of the results in b. The displacement angles for neurons tuned in Light (top) or Dark (bottom) for yaw (left), pitch (middle) or roll (right) are shown against their displacement angle with horizontal or vertical stripes. Shaded area indicates a change of direction of the tuned cell. The triangle indicates the markers corresponding to the same neuron tuned both in light and dark conditions. The R² values are shown for all conditions.

Extended Data Fig. 9 Characterisation of the alignment of visuo-motor neurons.

a, Visuo-motor alignment of all direction selective neurons also tuned to head displacements (red, also shown in Fig. 3). These were further divided depending on whether they were tuned to head displacements during both light and darkness (grey, also shown in Fig. 3), only dark (black) or only light (sun symbol, light blue). b, Probability distribution centred at 0° of a random distribution of angles with the same SD as those in a. Their means were also centred at 90°, 180° and 270° for Watson-Williams analysis. c, Display of the F-statistics of Watson-Williams tests performed between matching data in a and b, with the distribution at b centred at either 0°, 90°, 180° or 270°. Dotted line marks the critical value: 3.85. Yellow shading indicates statistical significance. d, Distributions of the circular means of 1000 repetitions of bootstrap with replacement for each type of neuron. The 95% confidence intervals of the means and the distribution are indicated. The red dotted lines mark 180° ± 30°. e, Average head rotation angle at the time of spike in yaw (magenta), pitch (blue) and roll (orange) for all neurons included in a-d and Fig. 3h (n = 33 neurons, error bars: ± s.e.m.). f, Simultaneous eye tracking and head pitch, roll and yaw measurement in freely moving mice foraging for milk drops (N = 4 mice). g, Horizontal (blue lines) and vertical eye position (red lines) as a function of head pitch for right eye in freely moving mice (left), head roll (middle) and head velocity during stabilisation periods (right). Eye movements between saccadic gaze shifts counteract head rotations; plot shows mean ± s.e.m. This was implemented on our geometrical model of the head and eye of the mouse that was used to transform 3D head rotations into gaze direction as projected onto the plane of the screen in which visual stimuli are shown. h, Gaze direction for the neuron in Fig. 3g after implementing compensatory eye movements to the model.
Both the trajectory on the screen (i) and the decomposed and average movement vectors are shown (ii). i, Angular difference between the gaze direction of all visuo-motor neurons estimated using the model without (model 1) and with (model 2) compensatory eye movements implemented. j, Visuo-motor alignment between the direction of gaze and preferred direction of movement of a moving spot, for neurons tuned to both head rotations and moving spots (measured as in Extended Data Fig. 5). k, Visuo-motor alignment between the direction of gaze and the vector drawn by the centre of gaze on the screen (red cross on diagram) and the centre of the spatial receptive field measured using drifting gabor patches (red arrow to gabor patch on diagram) for neurons tuned in both conditions. 9 out of 11 of these putative premotor neurons were tuned to head rotations in light conditions only (sun).
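The bootstrap of circular means described in panel d can be sketched as follows. This is a hedged illustration rather than the authors' implementation: the 1000 repetitions, resampling with replacement and the 180° ± 30° window come from the legend, while the percentile method for the 95% interval and all function names are assumptions.

```python
import cmath
import math
import random


def circular_mean_deg(angles_deg):
    """Circular mean of angles in degrees, returned in [0, 360)."""
    vector = sum(cmath.exp(1j * math.radians(a)) for a in angles_deg)
    return math.degrees(cmath.phase(vector)) % 360.0


def bootstrap_circular_means(angles_deg, n_boot=1000, seed=0):
    """Circular means of n_boot resamples drawn with replacement."""
    rng = random.Random(seed)
    n = len(angles_deg)
    means = []
    for _ in range(n_boot):
        sample = [angles_deg[rng.randrange(n)] for _ in range(n)]
        means.append(circular_mean_deg(sample))
    return means


def circular_ci_deg(means, centre_deg, level=0.95):
    """Percentile interval (assumed method) on signed offsets from centre_deg."""
    offsets = sorted(((m - centre_deg + 180.0) % 360.0) - 180.0 for m in means)
    last = len(offsets) - 1
    low = offsets[int(math.floor((1.0 - level) / 2.0 * last))]
    high = offsets[int(math.ceil((1.0 + level) / 2.0 * last))]
    return (centre_deg + low) % 360.0, (centre_deg + high) % 360.0
```

Working on signed offsets from the sample mean avoids the wrap-around at 0°/360° when sorting. An interval that lies entirely within 150° to 210° would correspond to the red dotted lines in panel d.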

Extended Data Fig. 10 Concentricity of individual OS and DS neurons with localised RF.

a, Example DS neuron (i) with a spatial receptive field measured using flashing squares (ii). A 2D Gaussian was fitted to the spatial receptive field data to determine the centre of the receptive field (iii). A black line crossing the centre of the receptive field and oriented following the preferred direction angle as calculated during grating display is overlaid. The red “x” indicates the average location of gaze and the light blue line crosses both that location and the centre of the RF. b, All DS and OS neurons with spatial RFs. The lines indicate the orientation preference of single neurons crossing that cell’s RF centre (i). Same as i but without the outline of the RFs (ii). The angular difference between the orientation/direction preference (corresponding to black line in aiii) and gaze (corresponding to the light blue line in aiii) is also displayed (iii). A difference of 0° indicates perfect concentricity. c, Same as b but only for DS neurons. Direction is indicated by an arrow. d, A second set of experiments was performed to estimate the spatial RFs of neurons across the SC using drifting Gabor patches (N = 4 mice). For neurons displaying spatial and direction selective tuning, a vector was drawn between the centre of the spatial RF (red box and black arrow, ii) and the average centre of gaze on the screen (red cross). The direction of this concentric vector was compared to the preferred direction of Gabor patch movement at the preferred location (green, iii, corresponding to the location marked by the red box in ii). e, The angular difference of those two vectors is compatible with concentric direction selectivity.
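The concentricity measure can be illustrated with a short sketch that compares the angle of the line joining the centre of gaze and the RF centre with a neuron's preferred angle. The screen coordinate convention (x to the right, y upwards, angles counter-clockwise from the x axis), the sign of the vector (from gaze to RF centre) and the function names are assumptions made for this example, not details taken from the legend.

```python
import math


def gaze_to_rf_angle_deg(gaze_xy, rf_centre_xy):
    """Angle of the vector from the centre of gaze to the RF centre, in [0, 360)."""
    dx = rf_centre_xy[0] - gaze_xy[0]
    dy = rf_centre_xy[1] - gaze_xy[1]
    return math.degrees(math.atan2(dy, dx)) % 360.0


def concentricity_offset_deg(gaze_xy, rf_centre_xy, preferred_deg, period=360.0):
    """Angular offset between the preferred angle and the gaze-RF line.

    Use period=180.0 for orientation preference, where the sign of the
    gaze-RF vector does not matter; 0 indicates perfect concentricity.
    """
    line_deg = gaze_to_rf_angle_deg(gaze_xy, rf_centre_xy)
    diff = abs(preferred_deg - line_deg) % period
    return min(diff, period - diff)
```

With the gaze at the origin and an RF centred at (1, 1), a neuron preferring 45° gives an offset of 0°, while one preferring 135° gives 90°, the largest possible offset in orientation space.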

Supplementary information

Supplementary Information

This file contains Supplementary Figs. 1 and 2 and Supplementary Tables 1 and 2

Reporting Summary

Peer Review file

Source data

Source Data Figs. 1–3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

González-Rueda, A., Jensen, K., Noormandipour, M. et al. Kinetic features dictate sensorimotor alignment in the superior colliculus. Nature 631, 378–385 (2024). https://doi.org/10.1038/s41586-024-07619-2


Received: 15 December 2022
Accepted: 28 May 2024
Published: 03 July 2024
Issue Date: 11 July 2024
DOI: https://doi.org/10.1038/s41586-024-07619-2
Springer Nature Limited
