A reafferent and feed-forward model of song syntax generation in the Bengalese finch

Hanuschkin, Alexander; Diesmann, Markus; Morrison, Abigail

doi:10.1007/s10827-011-0318-z

Open access
Published: 15 March 2011

Volume 31, pages 509–532, (2011)

Journal of Computational Neuroscience

Alexander Hanuschkin^1,2,
Markus Diesmann^2,3,4,5 &
Abigail Morrison^1,2,5

Abstract

Adult Bengalese finches generate a variable song that obeys a distinct and individual syntax. The syntax is gradually lost over a period of days after deafening and is recovered when hearing is restored. We present a spiking neuronal network model of the song syntax generation and its loss, based on the assumption that the syntax is stored in reafferent connections from the auditory to the motor control area. Propagating synfire activity in the HVC codes for individual syllables of the song and priming signals from the auditory network reduce the competition between syllables to allow only those transitions that are permitted by the syntax. Both imprinting of song syntax within HVC and the interaction of the reafferent signal with an efference copy of the motor command are sufficient to explain the gradual loss of syntax in the absence of auditory feedback. The model also reproduces for the first time experimental findings on the influence of altered auditory feedback on the song syntax generation, and predicts song- and species-specific low frequency components in the LFP. This study illustrates how sequential compositionality following a defined syntax can be realized in networks of spiking neurons.

1 Introduction

Several experimental studies have shown that the song of the Bengalese finch relies on auditory feedback. The Bengalese finch typically produces a set of ordered sequences of syllables, but after deafening this song syntax is disrupted, i.e. the sequence becomes more random and unstable (Okanoya and Yamaguchi 1997; Woolley and Rubel 1997, 1999; Watanabe and Aoki 1998). In a subsequent experiment Woolley and Rubel (2002) reversibly deafened Bengalese finches and showed that normal vocal behavior can be restored when hearing is restored. They argued that a template of the song exists independently of auditory input. Yamada and Okanoya (2003) reported that the song syntax is also reversibly changed if the Bengalese finch is singing in a helium atmosphere. More recently, studies have demonstrated that the vocal motor control system of the Bengalese finch relies on real time auditory feedback (Sakata and Brainard 2006) and that the HVC activity (caudal nucleus of the ventral hyperpallium or high vocal center, nowadays used as proper name) is influenced instantaneously by feedback perturbation (Sakata and Brainard 2008). These experimental findings suggest that an auditory reafferent signal is necessary for the generation of correct song syntax.

Early experimental studies on the songbird suggested that HVC neurons projecting to the pre-motor nucleus RA (robust nucleus of the arcopallium) only encode for the temporal structure within the song and that a sequence of fixed motor commands is replayed within the RA (Yu and Margoliash 1996; Hahnloser et al. 2002; Nottebohm 2002). Recent results using moderate local cooling (Long and Fee 2008; Glaze and Troyer 2008) have changed our understanding of this motor pathway. Temporal coding not only of the song elements but on all timescales (motif, syllable, note) is located in the HVC, whilst the RA serves to encode the HVC commands into firing rates suitable for the muscles needed to control the vocal output (Yu and Margoliash 1996; Fee et al. 2004). During singing the HVC neurons projecting to RA (HVC_RA) show bursting activity that is time locked with sub-millisecond precision to the stereotyped song motif (Hahnloser et al. 2002).

How the HVC_RA neurons are able to generate the activity pattern with sub-millisecond precision can be explained by two different approaches. The first hypothesis explains the pattern by rhythmic drive to the HVC_RA neurons from an afferent nucleus, based on studies suggesting temporal structuring originating from the Uva (nucleus uvaeformis) or NIf (nucleus interface of the nidopallium) (McCasland 1987; Williams and Vicario 1993; Vu et al. 1994; Coleman and Vu 2005). Several model studies are based on this assumption (Troyer and Doupe 2000a; Drew and Abbott 2003; Katahira et al. 2007; Yamashita et al. 2008; Gibb et al. 2009b). The second hypothesis is that the activity pattern originates from circuits intrinsic to the HVC. Models of neural circuits known as synfire chains that are composed of divergently and convergently connected feed-forward structures can reliably produce activity patterns on a sub-millisecond time scale (Abeles 1991; Herrmann et al. 1995; Diesmann et al. 1999). Mooney and Prather (2005) reported convergent and divergent connection structures in the HVC, which provides supporting evidence for the involvement of such assemblies in song production. Additionally, a recent experiment has revealed that the subthreshold dynamics visible in the intracellular recordings in HVC_RA of freely behaving zebra finches exhibits large and rapid depolarization before spiking, and that spike times are only weakly affected by injected currents. These findings also provide strong support for the hypothesis that HVC contains feed-forward networks (Long et al. 2010). This insight has been influential; a number of recent theoretical studies investigate the functional consequences of synfire chain involvement in HVC (Li and Greenside 2006; Jin et al. 2007; Jin 2009) or the development of the chains themselves (Fiete et al. 2010).

Similarly, there are two hypotheses to account for the generation of sequences of syllables. In the simplest realization the syllable sequence is predefined by the HVC connectivity. This ‘motor tape’ in the HVC is able to reproduce the highly stereotyped sequences generated by adult zebra finches. A simple example is a song starting with an introductory note ‘i’ that is followed by a repeating sequence of syllables ‘ABC’, resulting in ‘iABC − ABC’ (Bottjer and Arnold 1984; Drew and Abbott 2003; Weber and Hahnloser 2007). This can be extended to a more complex realization that reproduces stochastic branching as observed in the song of the Bengalese finch (e.g. ‘iABCD − ABD’; Katahira et al. 2007; Yamashita et al. 2008; Jin 2009). The major shortcoming in predefining the song syntax in the HVC connectivity is that it is static, whereas multiple experimental findings demonstrate that the syntax can be transiently altered, and even entirely lost and regained. An alternative approach is based on the assumption that syntax generation is controlled by a state signal. This signal can be either an efference copy of the motor command (e.g. within HVC; Troyer and Doupe 2000b), a neural feedback signal (e.g. from the brain stem; Gibb et al. 2009a) or a reafferent cue (e.g. from the auditory system; Sakata and Brainard 2006).

Here, we investigate for the first time the reafferent hypothesis in a functional network model. To account for the findings on both undisturbed and disturbed song syntax, we combine the feed-forward model of Jin (2009) with the reafferent hypothesis of Sakata and Brainard (2006) to a unified model. We present a functional network model of spiking neurons with feed-forward circuits in HVC and a reafferent state signal that reproduces key experimental findings on Bengalese finches. We assume all-to-all interconnections between the chains in HVC such that all transitions between syllables are possible. A specific syntax is then generated by priming the transition sites of the synfire chains with auditory feedback generated from the perception of the bird’s own song (BOS). This mechanism is called reafference (the afference evoked by the efference; Holst and Mittelstaedt 1950) and has been suggested to explain the influence of the auditory feedback by Sakata and Brainard (2006).

Our model is able to reproduce the normal sequence generation and the main findings of experiments on auditory feedback disturbance of the Bengalese finch. In the absence of the auditory feedback more random sequences of syllables are produced, whilst perturbing the auditory feedback by playing the bird a syllable different from the one sung results in a stereotyped change in sequencing (Okanoya and Yamaguchi 1997; Watanabe and Aoki 1998; Sakata and Brainard 2006). Moreover, by postulating synaptic plasticity within the model we can reproduce the finding that syntax is gradually lost over a period of days after deafening, but is regained when hearing is restored (Woolley and Rubel 2002).

In our model we investigate two plasticity hypotheses to explain the gradual loss of syntax. If the transition sites between the synfire chains are plastic, an imprinting of the more frequently used syllable transitions occurs. After feedback suppression the syntax is approximately maintained by the imprinted structure until it is eroded. If plastic outgoing connections of HVC_RA realize an efference copy of the song syntax, the causal relationship between the efferent and reafferent signals can be learned. In the absence of the reafferent signal, the efference copy stabilizes the song syntax for a period of time. Interactions of an efference copy with a reafferent signal have previously been reported, for example in the electric sense of the weakly electric fish (Bell 1981).

From a theoretical point of view our study investigates the compositionality of sequences. The syllables of the song are elementary building blocks, called primitives, which are combined following a defined syntax. Previous studies have investigated the generation of sequences of primitives without syntax, i.e. any primitive can follow any other primitive (Chang and Jin 2009; Schrader et al. 2010). A syntax can be imposed on the selection of the next primitive by using a higher level controller (Yamashita and Tani 2008), a pre-defined connectivity within the same level (Jin 2009; Schrader et al. 2010; Hanuschkin et al. 2010c), or a combination of both. Here, we generate the syntax using a combination of a pre-defined all-to-all connectivity between the primitive syllables encoded in HVC together with higher level signals from the auditory system. The all-to-all connectivity allows the system to realize any syntax defined in terms of transitions between pairs of syllables; the control signal consists of the system’s reafferent input and constrains the set of possible syntaxes to a specific one.

The paper is organized as follows: in the last part (Section 1.1) of this introduction we briefly review the relevant songbird anatomy and the neurophysiological details of the HVC neurons. In Section 2 we explain the concept of synfire chains, the details of the numerical simulations and the evaluation of song syntax. In Section 3 we introduce the model and demonstrate its functional consequences. Experimental predictions arising from our study and its limitations and possible future extensions are presented in Section 4.

Preliminary results have been published in abstract form (Hanuschkin et al. 2010a, b, 2011).

1.1 Songbird anatomy

In the adult songbird three main brain regions play key roles in the production and learning of songs. The motor pathway consists of two nuclei: HVC and RA. The anterior forebrain pathway (AFP) resembles a basal ganglia/thalamus equivalent structure and consists of three nuclei: Area X (Area X of the medial striatum), which projects to DLM (dorsolateral thalamic nucleus), which in turn projects to LMAN (lateral magnocellular nucleus of the anterior nidopallium). The two pathways are interconnected; Area X receives input from HVC and LMAN projects to RA and back to Area X. Both Area X and LMAN receive input from dopamine neurons (Brainard and Doupe 2002; Gale and Perkel 2010). The AFP is necessary for song learning but not for reproduction of the learned song (Bottjer et al. 1984; Sohrabji et al. 1990; Scharff and Nottebohm 1991); however, it is responsible for variability in syllable structure (Sakata et al. 2008; Hampton et al. 2009). The third brain region related to the song is the sensory input area consisting of field L (avian primary forebrain auditory area; roughly equivalent to mammalian primary auditory cortex) which projects to HVC via CM (caudal mesopallium) and NIf (Roy and Mooney 2009). NIf and HVC also receive dopaminergic input (Brainard and Doupe 2002). For a review of songbird anatomy and its revised nomenclature, see Brainard and Doupe (2002) and Jarvis et al. (2005).

The HVC neurons can be subdivided into three populations with distinct morphological and neural properties: interneurons (HVC_I) and neurons that project to RA (HVC_RA) or Area X (HVC_X) (Dutar et al. 1998; Mooney 2000). Experiments by Mooney and Prather (2005) on the HVC network structure revealed projections from HVC_RA to HVC_X via HVC_I and divergent connections from HVC_I to HVC_RA and HVC_X. Furthermore, they describe convergent and divergent connection structures. Activity synchrony between the population of interneurons and HVC_X, and weak synchrony between individual HVC_I neurons has been observed during singing (Kozhevnikov and Fee 2007). All three HVC populations increase their firing rate in response to BOS playback due to common excitatory input from NIf or CM (Cardin et al. 2005; Rosen and Mooney 2006; Shaevitz and Theunissen 2007; Bauer et al. 2008; Roy and Mooney 2009; Akutagawa and Konishi 2010) but exhibit differentiated sub-threshold activity. HVC_X neurons show hyper-polarization (Lewicki 1996) while HVC_RA neurons exhibit BOS-specific depolarization (Mooney 2000). The different sub-threshold behaviors can be explained by the local HVC interaction of direct inhibition from HVC_I and indirect inhibition of the HVC_X neurons by the HVC_RA neurons (Mooney 2000; Rosen and Mooney 2003; Mooney and Prather 2005; Rosen and Mooney 2006). It should be noted that nearly all studies on the HVC network structure (Lewicki 1996; Mooney 2000; Rosen and Mooney 2003; Mooney and Prather 2005; Rosen and Mooney 2006; Kozhevnikov and Fee 2007; Poirier et al. 2009) were carried out with zebra finches and that the role of local inhibition in the HVC might be more elaborate in the Bengalese finch, due to its more variable song syntax (Sakata and Brainard 2008).

2 Materials and methods

2.1 Synfire chains

The concept of a convergent and divergent connected feed-forward structure that allows the propagation of synchronous spiking activity was originally introduced to explain precise spike timing patterns in cortical tissue (Abeles 1991). In its simplest realization, neurons are organized in successive pools and each neuron is connected to all neurons in the following pool. The resultant structure of neuronal assemblies is known as a synfire chain (SFC). Activity volleys reliably propagate along a synfire chain under quite general conditions (Herrmann et al. 1995; Diesmann et al. 1999; Goedeke and Diesmann 2008). Here, we consider chains of 20 pools containing 100 neurons each, which is similar to the number of neurons estimated to be active at each moment during song (Fee et al. 2004; Fiete et al. 2010). The number of feed-forward connections made by each neuron is governed by a dilution factor p = 0.5 (Abeles et al. 2004). This is illustrated schematically in Fig. 1a. The activation period of a chain is around 160 ms, which is comparable to the characteristic length of birdsong syllables (Brainard and Doupe 2001; Leonardo and Fee 2005; Sakata and Brainard 2006). Additionally, each SFC neuron makes connections to random targets in a population of inhibitory neurons, which in turn globally inhibits the synfire chains with short synaptic delays (Jin 2009). The neurons in the final pool of each chain make feed-forward connections to the initial pools of each of its potential successor chains. This fundamental architecture is shown in Fig. 1b. Due to the global inhibition, activity in a single chain is the unique attractor so reliable switching from one synfire chain to exactly one of its potential successor chains is assured (Chang and Jin 2009). By applying additional priming in the form of an excitatory stimulus to the initial pool of a chain, the probability of selecting that chain during competition can be increased.
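The feed-forward wiring of a single diluted chain can be sketched in a few lines. This is a minimal illustration and not the authors' NEST script: the function name `build_synfire_chain`, the neuron indexing and the reading of the dilution factor p as a fixed out-degree of round(p × pool size) targets per neuron are assumptions made here for concreteness. The inhibitory population and the links between chains are left out.

```python
import random

def build_synfire_chain(n_pools=20, pool_size=100, p=0.5, seed=0):
    """Return (source, target) neuron index pairs for one diluted chain.

    Neuron k of pool i has the global index i * pool_size + k.
    Assumption: every neuron contacts exactly round(p * pool_size)
    distinct, randomly chosen neurons in the next pool.
    """
    rng = random.Random(seed)
    out_degree = round(p * pool_size)
    connections = []
    for pool in range(n_pools - 1):  # the final pool has no successor within the chain
        next_pool = range((pool + 1) * pool_size, (pool + 2) * pool_size)
        for k in range(pool_size):
            source = pool * pool_size + k
            for target in rng.sample(next_pool, out_degree):
                connections.append((source, target))
    return connections

chain = build_synfire_chain()
# 19 pool-to-pool stages * 100 neurons * 50 targets each
print(len(chain))
```

With the values quoted above (20 pools, 100 neurons per pool, p = 0.5) and the fixed out-degree assumption, the chain contains 19 × 100 × 50 = 95,000 excitatory feed-forward connections.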

The synfire chain connectivity is described in detail in Table 1, the corresponding parameters are specified in Table 3.

Table 1 Summary of model structure after Nordlie et al. (2009)

2.2 Neuron and synapse model

We perform numerical simulations of leaky integrate-and-fire model neurons (Lapicque 1907; Abbott 1999) with post synaptic currents (PSCs) described by the alpha-function. This neuron model reproduces the basic features of cortical nerve cells and can be efficiently simulated (Rotter and Diesmann 1999; Plesser and Diesmann 2009). This enables us to simulate large networks with biologically realistic connectivity, which is necessary for the investigation of collective phenomena such as the propagation of ensemble firing of neuronal groups.

In the experiments involving synaptic plasticity the synaptic weights are altered according to an additive spike-timing dependent plasticity (STDP) rule (Song et al. 2000). The change of weight is defined as

$$ \Delta w=\lambda\begin{cases} -e^{-\left|\Delta t\right|/\tau_{-}} & \text{if }\Delta t\leq0\\ e^{-\Delta t/\tau_{+}} & \text{if }\Delta t>0\end{cases},$$

(1)

where λ defines a step size and τ₊ and τ₋ are the time constants of the STDP window for potentiation and depression, respectively. The time difference Δt = t_post − t_pre is given by the timing of the post-synaptic and pre-synaptic spikes. Between spike times, the synaptic weights decay exponentially with time constant τ_decay:

$$ w(t)=w_{0}e^{-t/\tau_{\mathrm{decay}}}.$$

(2)

Synapses are bounded within a range $\left[W_{\mathrm{min}},W_{\mathrm{max}}\right]$.
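As a reading aid, the weight update of Eq. (1), the decay of Eq. (2) and the hard bounds can be written as two small functions. This is a sketch, not the NEST synapse model used in the simulations: the function names are hypothetical, the numerical values in the usage lines are placeholders rather than the parameters of Table 5, and applying the bounds by clipping after each update is an assumption.

```python
import math

def stdp_update(w, t_pre, t_post, lam, tau_plus, tau_minus, w_min, w_max):
    """Additive STDP step of Eq. (1), then clipping to [w_min, w_max]."""
    dt = t_post - t_pre
    if dt > 0:
        dw = lam * math.exp(-dt / tau_plus)          # pre before post: potentiation
    else:
        dw = -lam * math.exp(-abs(dt) / tau_minus)   # post before or together with pre: depression
    return min(w_max, max(w_min, w + dw))

def decay_weight(w0, elapsed, tau_decay):
    """Exponential weight decay of Eq. (2) over a spike-free interval."""
    return w0 * math.exp(-elapsed / tau_decay)

# Placeholder values for illustration only (not taken from Table 5).
w = stdp_update(10.0, t_pre=0.0, t_post=20.0, lam=1.0,
                tau_plus=20.0, tau_minus=20.0, w_min=0.0, w_max=100.0)
w_later = decay_weight(w, elapsed=1000.0, tau_decay=1000.0)
print(w, w_later)
```

Because the rule is additive, the size of each step does not depend on the current weight; only the bounds and the slow decay keep the weights from drifting without limit.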

All simulations were performed using NEST revision 1.9.8718 (see www.nest-initiative.org and Gewaltig and Diesmann 2007) with a computational step size of $0.1\:\mathrm{ms}$. Simulations were carried out using a single or multiple cores of a 24×SUN X4140 2 quad core machine (AMD Opteron Processor 2218, 2.6 GHz, 8 GB) in parallel.

To avoid synchrony artefacts (Hansel et al. 1998), all simulations without plasticity were performed employing precise simulation techniques with the bisectioning method in a globally time driven framework (Morrison et al. 2007b; Hanuschkin et al. 2010d). Descriptions of the neuronal and synaptic dynamics and the corresponding parameters are provided in Tables 2, 4 and 5. To allow other researchers to perform their own experiments, at the time of publication we are making a module available for download at www.nest-initiative.org containing all relevant scripts.

Table 2 Summary of model dynamics after Nordlie et al. (2009)

2.3 Analysis of stochastic sequences

We use the song definition of Woolley and Rubel (1997), who define a stereotyped set of separated acoustic elements to be one syllable and each song to consist of several syllables. That is, the song starts with an introductory note ‘i’ followed by a sequence of syllables ‘ABC..’. For alternative definitions, see Okanoya and Yamaguchi (1997), Sakata and Brainard (2006), Katahira et al. (2007).

To evaluate the syntax of the song produced by our model, we consider transitions between syllables. For a specific syntax, a transition between two syllables can be deterministic, stochastic or forbidden. We use two measures to characterize the song structure. The sequence stereotype score S_⋆ is defined as the average of sequence linearity and sequence consistency (Scharff and Nottebohm 1991; Woolley and Rubel 1997). The sequence linearity is given by the ratio between the number of different syllables per bout and the number of transition types per bout. The sequence consistency is given by the ratio of the sum over allowed transitions per bout and the sum over total transitions per bout.
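The two ratios and their average can be computed directly from a syllable string. The sketch below follows the definitions given in the text; the function name, the string representation of a bout and the hand-made example bout are assumptions for illustration. The set of allowed transitions is the one listed for the example syntax in Section 3.3.

```python
def stereotype_score(sequence, allowed):
    """Return (linearity, consistency, S) for one bout.

    sequence: string of syllable labels with at least two syllables
    allowed:  set of two-letter transitions permitted by the syntax
    """
    transitions = [a + b for a, b in zip(sequence, sequence[1:])]
    # number of different syllables / number of different transition types
    linearity = len(set(sequence)) / len(set(transitions))
    # allowed transitions / all transitions
    consistency = sum(t in allowed for t in transitions) / len(transitions)
    return linearity, consistency, 0.5 * (linearity + consistency)

allowed = {"AA", "AB", "BB", "BC", "BD", "CD", "DC", "DA"}
bout = "AABBCDCDABDA"  # hand-made bout that uses all eight allowed transition types
print(stereotype_score(bout, allowed))
```

For this bout the four syllables produce eight transition types, all of them allowed, so the linearity is 4/8, the consistency is 1 and the score is 0.75.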

Another measure used to describe the syllable sequencing is the transition entropy

$$ H_{j}=-\sum\limits_{i=1}^{N}p_{i}\mathrm{log_{2}}p_{i},$$

(3)

where p_i is the probability that when syllable j occurs it is followed by syllable i (Sakata and Brainard 2006). The average transition entropy $\hat{H}$ is given by the average over all H_j.
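The transition entropy of Eq. (3) can be estimated from a syllable string by counting which syllables follow each syllable. The sketch below is illustrative only: the function names are hypothetical, successors that never occur are skipped (their contribution p log₂ p is taken as zero), and the average is taken as an unweighted mean over the syllables that have at least one observed successor, which is one possible reading of "the average over all H_j".

```python
import math
from collections import Counter, defaultdict

def transition_entropies(sequence):
    """Return {syllable j: H_j in bits} estimated from observed successors."""
    counts = defaultdict(Counter)
    for current, following in zip(sequence, sequence[1:]):
        counts[current][following] += 1
    entropies = {}
    for j, followers in counts.items():
        total = sum(followers.values())
        entropies[j] = -sum((n / total) * math.log2(n / total)
                            for n in followers.values())
    return entropies

def average_transition_entropy(sequence):
    """Unweighted mean of H_j over all syllables with an observed successor."""
    h = transition_entropies(sequence)
    return sum(h.values()) / len(h)

print(transition_entropies("AABAAB"))
print(average_transition_entropy("AABAAB"))
```

In the toy string "AABAAB", syllable A is followed equally often by A and B (1 bit), while B is always followed by A (0 bits), giving an average of 0.5 bits.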

3 Results

3.1 A reafferent and feed-forward model of song syntax generation

The feed-forward model of the Bengalese finch HVC recently proposed by Jin (2009) is motivated by the resemblance of synfire activity to the activity of HVC_RA neurons: each syllable is represented as a synfire chain, and excitatory connections from the final pools of chains to the initial pools of specific other chains dictate the song syntax. If a chain projects to only one other chain, stereotypical sequences are produced (e.g. ‘CD’ in Fig. 2b), whereas if multiple successor chains are activated, this results in stochastic branching (e.g. ‘AA’ or ‘AB’). Here, we consider only distinguishable states and the transitions between them; an even better fit to the statistics of song syntax can be obtained by including hidden states in the first order Markov process (Jin 2009; Katahira et al. 2010).

However, the feed-forward model neither explains the loss of syntax of the Bengalese finch after deafening and the recovery of the song syntax after the auditory feedback is restored (Okanoya and Yamaguchi 1997; Woolley and Rubel 1997, 1999; Watanabe and Aoki 1998), nor the instantaneous modifications of syntax due to auditory feedback perturbations (Sakata and Brainard 2006, 2008). These findings suggest a sparse song template of syllables where the song syntax is generated by reafferent cues of the auditory system (Sakata and Brainard 2006).

To account for the findings on both undisturbed and disturbed song syntax, we combine the feed-forward model of Jin (2009) with the reafferent hypothesis of Sakata and Brainard (2006) to a unified model as illustrated in Fig. 2a. To produce the syntax shown in Fig. 2b, 4 synfire chains {SFC_A, SFC_B, SFC_C, SFC_D} as described in Section 2.1 code for the four different syllables {A,B,C,D}. The final pool of each synfire chain projects to the initial pool of every chain with the same feed-forward excitatory connectivity as within the chains, thus potentially allowing all syllable transitions (see Fig. 2c). As in the model proposed by Jin (2009), the excitatory neurons are reciprocally connected with the population of fast spiking HVC_I interneurons with short synaptic delays. The activity of the HVC_I interneurons (referred to in the following as the inhibition network, IN) results in dominant global inhibition that stabilizes the HVC network. Supra-threshold input to the neurons of the synfire chain network leads to the spontaneous ignition of synfire activity, but the numerous fast connections from IN to the synfire chains suppress the synfire activity in all but one chain. Consequently, activity in a single chain is the only attractor in the network and ensures reliability of the switching.

The reafferent hypothesis suggests that the vocal output determines or influences the next vocalization through immediate auditory feedback. For the sake of simplicity, we reduce the complex auditory system of the finch to a single network, which we call the auditory network (AN) in the following. The auditory network consists of a set of four balanced networks (Brunel 2000) representing auditory perceptions of the four syllables {A_au,B_au,C_au,D_au}. As the spiking activity of individual HVC_RA neurons is locked to specific syllables (Hahnloser et al. 2002), we approximate the reafferent signal to the auditory system by the HVC_RA activity, assuming a long propagation delay to account for the production and perception of the syllable. Thus, when syllable i is sung, excitatory connections from SFC_i to i_au lead to an increase in firing rate in the auditory sub-network i_au. The excitatory neurons of the auditory sub-networks project to the first pool of the synfire chains selected for priming. The priming of individual synfire chains modifies the switching probabilities (Jin 2009; Hanuschkin et al. 2010c) such that only one of the primed synfire chains can win the competition. The interaction between the synfire chains representing the syllables and the auditory network is illustrated in Fig. 2d.

The interactions between the networks comprising the reafferent and feed-forward model are illustrated in Fig. 3. A tabular description of our model is given in Tables 1 and 2; unless otherwise stated, model parameters are as given in Tables 3, 4 and 5.

Table 3 Specification of connectivity parameters

Table 4 Specification of neuron model parameters

Table 5 Specification of synapse parameters and applied input

3.2 Synaptic plasticity

In the absence of auditory feedback the Bengalese finch loses the song syntax within days. This implies that the song system of the finch does not exclusively rely on auditory feedback for the generation of the song syntax. We extend our network model to realize two different hypotheses of how the syntax is gradually lost.

The first hypothesis is that the song syntax is imprinted on the synfire chain network (SFCN). We therefore incorporate spike-timing dependent plasticity (STDP) at the transition sites between the synfire chains as illustrated in Fig. 4a. The syntax is determined by the priming from the auditory network (see Fig. 2). For example, in the syntax given in Fig. 2b, the transition from ‘A’ to ‘B’ occurs often, whereas the transition from ‘A’ to ‘C’ is not observed. Consequently, the synaptic strengths between SFC_A and SFC_B are potentiated, imprinting the sequence ‘AB’, but the synaptic strengths between SFC_A and SFC_C are depressed. If the reafferent priming is removed, the sequence ‘AB’ is initially more likely but occasionally ‘AC’ occurs. After a period of time the imprinting vanishes, resulting in a more random syntax.

The second hypothesis is based on the argument that an efference copy of the song has to be present to allow learning of the song syntax (Troyer and Doupe 2000a). We propose that such an efference copy could also be used to keep the song syntax stable after the auditory feedback is disrupted. We do not elaborate on the site of the efference copy, but simply postulate that the auditory input converges with the efference copy in the auditory network. We therefore assume plastic connections from the SFCN to the auditory network as illustrated in Fig. 4b. The correlation between the activity in the synfire chains representing the syllables and the elevation of activity in the corresponding auditory sub-networks causes these connections to be potentiated. Consequently, if the auditory input is removed, activity in an individual synfire chain still results in elevated activity in the corresponding auditory sub-network and thus in the correct priming behavior. Over time the absence of external reinforcement weakens the connections, leading to a gradual loss of syntax.

Both hypotheses can be realized with an additive STDP rule (Song et al. 2000) with additional depression on a long timescale. The details of the plasticity model are described in Section 2.2 and the parameters are given in Table 5.

3.3 Song generation

The reafferent and feed-forward model generates sequences of syllables with a syntax determined by the excitatory priming connections from the auditory network. A 5 s example of a generated sequence following the song syntax in Fig. 2c is shown in Fig. 5a. Reliable switching is achieved throughout the simulation in accordance with experimental findings that the combination of syllables is a rare error in intact birds (Woolley and Rubel 1997). The syllable distribution is p_A = 0.17, p_B = 0.17, p_C = 0.33 and p_D = 0.33. In order to quantify the quality of the song produced by the model, we calculate the stereotype score and the average transition entropy as described in Section 2.3. For the syntax shown in Fig. 2c the allowed transitions are ‘AA’, ‘AB’, ‘BB’, ‘BC’, ‘BD’, ‘CD’, ‘DC’ and ‘DA’. A perfect reproduction of the example song syntax would therefore result in $S_{\star}=\frac{1}{2}\left(\frac{4}{8}+1\right)=0.75$. The measured sequence score S_⋆ = 0.79 is slightly larger than this because the sequence ‘BD’ does not occur during the recorded period (see Fig. 5c). The average transition entropy is $\hat{H}=0.68$. As some sequences occur more frequently than others (e.g. ‘DC’ is observed more often than ‘DA’), the average transition entropy is less than the value that would be obtained if all transitions allowed by the song syntax were equally probable ($\hat{H}=1.04$). Both measures indicate that the probabilities of the allowed transitions are unbalanced. This result emerges naturally from the random network connectivity and is characteristic of bird song (Katahira et al. 2007).

Figure 5b shows the modulation in the firing rate in the auditory and HVC networks during song production. The instantaneous firing rate is estimated from the spike data using a Gaussian filter (σ = 5 ms). The firing rate of the auditory sub-networks follows the generated syllables since it is driven by the reafferent input. The song initiation is triggered by a synfire activity ignition in the HVC_RA neurons which results in a brief increase in firing rate of the HVC_I network. Furthermore, at each SFC transition site (i.e. change from one syllable to the other) a local increase in firing rate results from the competition between the chains. These dynamical features of the model activity are in good agreement with experimental findings that HVC activity modulation is locked to the structure of the song: before song initiation the activity rises and at the beginning of each syllable an increase in firing rate is observed (Sakata and Brainard 2008). The synfire chains in our model have an activation period of 160 ms resulting in periodic modulations with a frequency of approximately 6 Hz. We have shown in a model of the motor cortex of the monkey that such modulations evoked by synfire chain competition may be observable in mesoscopic signals such as the LFP (Hanuschkin et al. 2010c).
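The rate estimate and the expected modulation frequency can be illustrated as follows. This is a sketch under stated assumptions: the text specifies only a Gaussian filter with σ = 5 ms, so the unit-area normalization of the kernel, the division by the number of neurons and the function name `gaussian_rate` are choices made here, not details taken from the paper.

```python
import math

def gaussian_rate(spike_times_ms, t_ms, sigma_ms=5.0, n_neurons=1):
    """Firing rate per neuron in Hz at time t_ms.

    Each spike contributes a unit-area Gaussian of width sigma_ms,
    so the time integral of the rate equals the spike count per neuron.
    """
    norm = 1.0 / (sigma_ms * math.sqrt(2.0 * math.pi))   # kernel peak in 1/ms
    density = sum(norm * math.exp(-0.5 * ((t_ms - s) / sigma_ms) ** 2)
                  for s in spike_times_ms)
    return 1000.0 * density / n_neurons                   # convert 1/ms to Hz

peak = gaussian_rate([100.0], 100.0)   # rate at the time of a single spike
modulation_hz = 1000.0 / 160.0         # one chain activation per 160 ms
print(peak, modulation_hz)
```

One chain activation every 160 ms corresponds to 1000/160 = 6.25 Hz, which is the "approximately 6 Hz" modulation quoted above.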

Figure 5d shows a 5 s example of a generated sequence with no auditory feedback. For example, after the syllable ‘C’ is sung there is no auditory perception of the syllable and thus no increase in the firing rate of the auditory sub-network C_au (compare Fig. 5b and e). As a consequence, the temporal cue for the next element in the song sequence is missing. All chains are activated by the final pool of SFC_C due to the all-to-all feed-forward connectivity illustrated in Fig. 2c. As syllable ‘D’ is not primed by the auditory network, all synfire chains compete to follow syllable ‘C’. Since activity of a single chain is the only attractor, only a single one will be selected. Due to the heterogeneities in the connectivity some sequences are still more likely than others, and some sequences may never occur. As a result of the increased number of possible sequences, the original song syntax is lost. Figure 5f shows the transition probabilities measured over the 5 s song sample. The upper panel of Fig. 5d highlights the occurrence of the forbidden transitions ‘BA’, ‘CA’, ‘CB’, ‘CC’ and ‘DB’. The average transition entropy $\hat{H}=0.77$ is increased with respect to the previous example with auditory priming and the sequencing stereotype is decreased to S_⋆ = 0.53. This shows that the generated sequence has much less structure: a purely random syntax would result in $S_{\star}=\frac{1}{2}\left(\frac{4}{16}+\frac{8}{16}\right)=0.375$. This example reproduces the experimental findings on song syntax a few days after deafening (see reviews by Konishi 2004; Woolley 2008). On restoring the auditory pathway, the original song syntax is regained as in Fig. 5b, in agreement with experimental findings (Woolley and Rubel 2002).
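The contrast between primed and unprimed competition can be mimicked with a toy first-order Markov chain. This is a deliberately crude abstraction and not the spiking network model: the competition between synfire chains is replaced by a uniform random choice, whereas in the simulations the heterogeneous connectivity makes some transitions more likely than others. The function names and the uniform choice are assumptions; the allowed transitions are those listed in Section 3.3.

```python
import random

SYLLABLES = "ABCD"
# Successors permitted by the example syntax: AA, AB, BB, BC, BD, CD, DC, DA
ALLOWED = {"A": "AB", "B": "BCD", "C": "D", "D": "CA"}

def sing(n_syllables, feedback=True, seed=0, start="A"):
    """Generate a syllable string.

    feedback=True:  only primed (allowed) successors can win the competition.
    feedback=False: no priming, so all four chains compete on equal terms.
    """
    rng = random.Random(seed)
    song = start
    while len(song) < n_syllables:
        candidates = ALLOWED[song[-1]] if feedback else SYLLABLES
        song += rng.choice(candidates)
    return song

def forbidden_fraction(song):
    """Fraction of transitions in the song that the syntax forbids."""
    pairs = list(zip(song, song[1:]))
    return sum(b not in ALLOWED[a] for a, b in pairs) / len(pairs)

print(forbidden_fraction(sing(2000, feedback=True)))
print(forbidden_fraction(sing(2000, feedback=False)))
```

With priming, no forbidden transition can occur. Without priming, 8 of the 16 possible transitions are forbidden, so in this uniform toy model about half of the transitions violate the syntax, which corresponds to the consistency term 8/16 in the purely random benchmark above.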

3.4 Gradual loss of syntax

After deafening, the Bengalese finch song becomes distorted within 7 days (Okanoya and Yamaguchi 1997; Watanabe and Aoki 1998; Woolley and Rubel 1999). In this section we test the ability of two different hypotheses to reproduce this behavior (see Section 3.2).

We first consider the case that the feed-forward connections from the final pool of each synfire chain to the initial pools of every synfire chain are plastic such that transitions that occur often tend to strengthen those connections, whereas transitions that do not occur often lead to weakened connections (Fig. 4a). In other words, song repetition results in imprinting the song structure on the transition sites between the synfire chains encoding the individual syllables. The details of the spike-timing-dependent plasticity rule with additional decay term can be found in Section 3.2 and the parameters in Table 5. In Fig. 6a the average synaptic weight for each transition SFC_i→SFC_j is shown for a time period of 12,000 s. In the presence of auditory feedback the weights converge reflecting the song syntax, i.e. transitions allowed by the syntax have stronger synaptic weights than those of forbidden transitions (compare the right panel of Figs. 6a and 2b). For example, the average weights from the final pool of SFC_A split into two groups. The average strengths of the synapses to the initial pools of SFC_A and SFC_B are greater than the average strengths of the synapses to the first pools of SFC_C and SFC_D, reflecting the song syntax which permits the sequences ‘AA’ and ‘AB’ but forbids the sequences ‘AC’ and ‘AD’. The outgoing synaptic weights from SFC_C are small compared to the outgoing connections from the other chains. This is a result of the uneven syllable distribution during auditory feedback ($p_A = 0.27\pm0.02$, $p_B = 0.37\pm0.02$, $p_C = 0.06\pm0.02$, $p_D = 0.30\pm0.01$) due to the unbalanced random connectivity of the network. Figure 6b shows the development of the sequencing stereotype and the average transition entropy. While the auditory feedback is present, the sequencing stereotype $S_{\star} = 0.75\pm0.02$ shows that the song syntax is produced accurately ($S_{\star} = 0.75$ for perfect syntax). The average transition entropy reaches a stable value ($\hat{H}=0.58\pm0.08$) at around 2,000 s, the time at which the weights converge.

At $t_{\mathrm{df}}$ = 4,000 s the auditory feedback is suppressed. After this point the average synaptic weights slowly begin to equalize, since in the absence of the priming cue transitions that were previously forbidden can now occur by chance (see for example Fig. 5d). The sequencing stereotype decreases gradually and can be fitted with a power law ($y\left(t-t_{\mathrm{df}}\right)=kt^{n}$ with k = 0.97 and n = −0.07) and the average transition entropy increases rapidly. This indicates a gradual loss of syntax. When the auditory feedback is no longer suppressed (t > 8,000 s) the original song syntax is completely and instantaneously restored with $S_{\star} = 0.73\pm0.02$ and $\hat{H}=0.54\pm0.04$. The sequencing stereotype is marginally lower than before the deafening. This is explained by minor differences in the converged average synaptic weights in the feed-forward connections after the deafening interlude (compare the outgoing synaptic weights of SFC_B before and after auditory feedback suppression period in Fig. 6a).
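A power law of the form $y = kt^{n}$ can be fitted by linear regression in log-log space, as in the sketch below. The fitting procedure actually used in the study is not stated in this section, so the log-log least-squares approach and the function name `fit_power_law` are assumptions of this example. Here `t` stands for the time elapsed since the suppression of feedback and must be strictly positive.

```python
import numpy as np

def fit_power_law(t, y):
    """Least-squares fit of y = k * t**n in log-log space; returns (k, n).

    Assumes t > 0 and y > 0 (t is time since feedback suppression).
    """
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)
    # log y = n * log t + log k, so the slope is n and the intercept is log k
    n, log_k = np.polyfit(np.log(t), np.log(y), 1)
    return float(np.exp(log_k)), float(n)
```

Applied to noise-free synthetic data generated with k = 0.97 and n = −0.07, the function recovers both parameters. With noisy data a log-space fit weights relative rather than absolute errors, which may or may not match the original analysis.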

Note that the slow weight decay term in the synaptic plasticity model (Eq. 2) is a necessary condition for the symmetry breaking in the outgoing connections of the synfire chains and also for the gradual loss of syntax. This is because the neurons in the synfire chains only fire when a volley of activity travels through the chain. Hence, the only activity pattern that occurs between a neuron in the final pool of one chain and a neuron in the initial pool of a successor chain is a pre-synaptic spike before a post-synaptic spike (i.e. Δt > 0). This pattern results in an increase in synaptic strength; in the absence of an additional decay term all synaptic weights converge to the maximum value $W_{\mathrm{max}}$.
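The role of the decay term can be illustrated with a deliberately simplified toy model, which is not the plasticity rule of Eq. 2. It tracks one abstract weight per successor chain, potentiates the weight of whichever transition occurs (pre-before-post pairing only), clips at an upper bound, and optionally relaxes all weights exponentially toward a lower bound. All parameter values, the event-based time axis and the function name `simulate_weights` are hypothetical choices for this sketch.

```python
import math
import random

def simulate_weights(trans_prob, n_events=5000, a_plus=0.05, tau_events=50.0,
                     w_min=0.1, w_max=1.0, seed=1):
    """Toy weight dynamics for the connections leaving one chain.

    trans_prob: {successor: probability that this transition occurs}.
    tau_events: decay time constant in units of transitions; None = no decay.
    All values are illustrative assumptions, not parameters of the model.
    """
    rng = random.Random(seed)
    targets = list(trans_prob)
    probs = [trans_prob[j] for j in targets]
    w = {j: 0.5 for j in targets}
    decay = 1.0 if tau_events is None else math.exp(-1.0 / tau_events)
    for _ in range(n_events):
        # slow exponential relaxation of every weight toward the lower bound
        for j in targets:
            w[j] = w_min + (w[j] - w_min) * decay
        # one transition occurs; only pre-before-post pairing, so potentiation
        j = rng.choices(targets, weights=probs)[0]
        w[j] = min(w_max, w[j] + a_plus)
    return w
```

With `tau_events=None` even rare transitions are only ever potentiated, so every weight eventually reaches the upper bound. With decay, weights of frequent transitions stay high while those of rare transitions settle near the lower bound, which is the symmetry breaking referred to above.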

The second hypothesis to be tested is that additional plastic excitatory connections exist between the synfire chains encoding the syllables and the corresponding auditory sub-networks (Fig. 4b). Through song repetition the connections are strengthened, thus realizing an efference copy of the song in the auditory network. For the sake of simplicity, we assume the same synaptic plasticity model as above. Figure 7a shows that in the presence of auditory feedback, the average synaptic weights converge to stable values. The sequencing stereotype $S_{\star} = 0.749\pm0.003$ and the average transition entropy $\hat{H}=0.81\pm0.02$ show that the song syntax in Fig. 2b is followed accurately (perfect syntax production: $S_{\star} = 0.75$) with nearly balanced transition probabilities (perfect symmetry: $\hat{H}=1.04$) (see Fig. 7b). At $t_{\mathrm{df}}$ = 8,000 s the feedback is suppressed. The connection strengths decrease and finally converge to lower values. The sequencing stereotype gradually decreases and the average transition entropy gradually increases. Both tendencies can be fitted with power laws $y\left(t-t_{\mathrm{df}}\right)=kt^{n}$ with k = 0.73, n = −0.01 and k = 0.77, n = 0.02, respectively. Additionally, the syllable distribution rapidly changes from $p_A = 0.45\pm0.03$, $p_B = 0.17\pm0.01$, $p_C = 0.14\pm0.01$, $p_D = 0.24\pm0.01$ before deafening to $p_A = 0.31\pm0.02$, $p_B = 0.17\pm0.01$, $p_C = 0.18\pm0.02$, $p_D = 0.33\pm0.01$ afterwards. These results indicate that the efference copy hypothesis is also capable of accounting for the gradual loss of syntax.

3.5 Auditory feedback perturbations

In our reafferent model of the songbird the auditory feedback is responsible for the correct sequence generation. By additional and specific excitation from the auditory network, the SFCs in the HVC network are primed to be active in the desired sequence (see Fig. 8a, b). However, at the stochastic branching points of the song the auditory feedback does not completely determine the next syllable, it simply reduces the competition to the syllables allowed by the given song syntax. In this section, we investigate the behavior of the model when the auditory network is disturbed resulting in inconsistent priming cues.

Sakata and Brainard (2006) observed that perturbing the auditory feedback by playing the bird a syllable different from the one sung results in a stereotyped change in sequencing. The effect of altered auditory feedback (AAF) can easily be demonstrated in our model by stimulating a specific auditory sub-network with additional Poisson input. This can activate an auditory feedback cue for a syllable that would not be activated in the usual song sequence. In Fig. 8c, d the auditory sub-network D_au is stimulated with an excitatory Poissonian spike train at 1 kHz with a synaptic strength of 5 pA to mimic the additional perception of syllable ‘D’. In the syntax given in Fig. 2b the sequence ‘CC’ is not possible in the presence of the correct auditory feedback because the auditory sub-network C_au just primes SFC_D. Due to the additional input to D_au, chains SFC_A and SFC_C are also primed so the selection of SFC_C becomes possible (compare Fig. 8a and c). The syntax alteration induced by an additional and unique syllable playback of sufficient amplitude is entirely determined by the pre-defined syntax transition diagram (Fig. 2b).

An alternative hypothesis for how AAF induces syntax change is that the feedback reduces the signal-to-noise ratio (SNR) in the auditory perception of the bird, such that the bird cannot unambiguously identify which syllable it perceived. We simulate this by providing an additional stimulus to the auditory subnetworks A_au, B_au and C_au (excitatory Poissonian spike train at 1 kHz for 200 ms with a variable synaptic strength μ) whenever syllable ‘D’ is sung. We estimate the transition probabilities for a 10 s song excerpt (compare Fig. 5a) where the syllable ‘D’ is sung 20 times, averaged over 2 trials. The SNR is given by the ratio between the firing rate of D_au and the mean firing rate of the auditory subnetworks A_au, B_au and C_au. Increasing the synaptic weight μ decreases the SNR and reveals distinct regions of characteristic transition changes (Fig. 8g). In the first region (I) the probability of the primary transition decreases while the probability of the second transition increases. In the second region (II) forbidden transitions arise with finite probability. The right border of this region is given by SNR = 1, i.e. the point at which all auditory subnetworks receive the same mean excitatory input (μ ≈ 3.75 pA). Increasing the strength of the AAF further has the net effect of suppressing the allowed transitions from ‘D’ as defined in Fig. 2b, such that forbidden transitions become more likely than allowed transitions.
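The two ingredients of this protocol, a Poissonian stimulus of fixed rate and duration and the SNR defined as a ratio of firing rates, can be sketched as follows. The code is an illustration only: the function names are hypothetical, the spike train is generated outside of any network simulation, and the firing rates passed to `auditory_snr` would in practice have to be measured from the simulated auditory sub-networks.

```python
import numpy as np

def poisson_spike_times(rate_hz, duration_ms, rng):
    """Spike times (ms) of a homogeneous Poisson process on [0, duration_ms)."""
    n = rng.poisson(rate_hz * duration_ms / 1000.0)
    return np.sort(rng.uniform(0.0, duration_ms, size=n))

def auditory_snr(rates_hz, signal="D"):
    """Rate of the sung syllable's sub-network divided by the mean rate
    of the remaining auditory sub-networks."""
    others = [r for name, r in rates_hz.items() if name != signal]
    return rates_hz[signal] / (sum(others) / len(others))
```

For example, `poisson_spike_times(1000.0, 200.0, np.random.default_rng(0))` draws a 1 kHz train lasting 200 ms, and `auditory_snr` returns 1 when all four sub-networks fire at the same rate, which corresponds to the right border of region II.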

Experimental findings suggest that a strong feedback perturbation results in the song being broken off and restarted later (Cynx and von Rad 2001; Sakata and Brainard 2006). In Fig. 8e,f we test the response of the system to a brief burst of noise to the auditory system. All neurons in the auditory sub-networks receive additional Poissonian input at 10 kHz with a synaptic strength of 25 pA for 10 ms. The song is immediately interrupted and restarts after a period of a few milliseconds due to the superthreshold drive to the HVC_RA neurons. These results show that feedback perturbations to the auditory network change the sequence ordering and result in violation of the song syntax as observed in experiment.

4 Discussion

The discovery of the sub-millisecond precision of the sparse HVC_RA activity pattern by Hahnloser et al. (2002) led to the hypothesis of a chain-like activity generation within HVC_RA by Fee et al. (2004). This hypothesis has been recently strengthened by a report that the subthreshold dynamics of the HVC_RA neurons in the freely behaving zebra finch exhibit features characteristic of chain-like structures (Long et al. 2010). Here, intrinsic HVC connectivity gives rise to the generation of the activity pattern within HVC rather than a rhythmic external drive. Additionally, Cynx (1990) and Seki et al. (2008) reported that a flash of light interrupts ongoing song at discrete locations in the song which almost always fall between song syllables. As it is difficult to interrupt the stable and reliable propagation of synfire activity within a chain, this observation supports the theory that individual syllables are represented by such feed-forward networks. This hypothesis has been influential in subsequent modeling studies of the HVC (Li and Greenside 2006; Jin et al. 2007; Jin 2009).

A feed-forward model of the HVC of the Bengalese finch was recently proposed by Jin (2009). This model accounts for the sparse sequences of HVC_RA activity and is able to produce the song syntax of Bengalese finches with a fixed inter-chain connectivity. Moreover, Jin (2009) showed that priming can lead to changes in branching probabilities; however, the influence of external areas that could generate such a priming signal was not explicitly modeled. Experimental studies on the effect of auditory feedback perturbations on the song syntax suggest a direct influence of the auditory feedback on syllable sequencing in the Bengalese finch (Woolley and Rubel 1997, 2002; Okanoya and Yamaguchi 1997; Watanabe and Aoki 1998; Sakata and Brainard 2006, 2008; Woolley 2008) and also in the zebra finch (Nordeen and Nordeen 1992, 2010; Leonardo and Konishi 1999; Lombardino and Nottebohm 2000; Cynx and von Rad 2001; Brainard and Doupe 2001; Hough and Volman 2002; Roy and Mooney 2009). These results led to the development of a reafferent model of syntax generation (Sakata and Brainard 2006). In the reafferent model, immediate auditory feedback cues the motor system to generate the correct song syntax. In the current study we merge the feed-forward and the reafferent models. The combined model is able to explain key results of the experimental studies on the adult Bengalese finch song production and enables us to investigate hypotheses which are beyond the scope of today’s experiments. In the following we summarize and motivate the model assumptions and then outline the experimental findings that are reproduced by the model together with its specific predictions. Finally, we discuss the model’s limitations and possible extensions.

4.1 Model assumptions

The spiking of individual HVC_RA neurons is time locked with sub-millisecond precision to distinct song syllables (Hahnloser et al. 2002), motivating our key assumption of feed-forward HVC circuitry. However, whether the observed spiking pattern originates from such intrinsic structures or is rhythmically driven from outside HVC (e.g. Uva) remains an unresolved question (Fee et al. 2004). As we use predefined synfire chains, our study also implicitly assumes that such structures can be developed by the brain. Some studies on the basis of Hebbian synaptic plasticity have reported the development of feed-forward sub-networks (Izhikevich et al. 2004; Buonomano 2005; Doursat and Bienenstock 2006; Jun and Jin 2007; Masuda and Kori 2007; Hosaka et al. 2008; Liu and Buonomano 2009; Waddington et al. 2010), but these findings have so far not been verified by investigations of large-scale model networks with biologically realistic numbers of synapses per neuron (Morrison et al. 2007a; Kunkel et al. 2010). Recently, Fiete et al. (2010) showed that the combination of STDP with heterosynaptic plasticity in a small network generates wide chains with a length distribution similar to the one estimated in the zebra finch HVC. This finding further supports our assumption that HVC_RA neurons are organized into synfire chains.

The neural activity of the HVC_RA is characterized by zero firing rate in the absence of singing or playback of BOS (Hahnloser et al. 2002). We have therefore made the assumption of robust winner-takes-all chain switching on the basis of dominant global inhibition (Chang and Jin 2009). An advantage of this approach is that the presence of superthreshold drive to the HVC_RA neurons assures ongoing song activity. Previously, we have also shown reliable switching from one to several potential successor chains in a model of the motor cortex by combining mutual cross-inhibition and global inhibition (Hanuschkin et al. 2010c). In this case, global dominant inhibition is not a plausible assumption, because the motor cortex is characterized by asynchronous and irregular neural activity (Burns and Webb 1976; Softky and Koch 1993; van Vreeswijk and Sompolinsky 1996; Ponce-Alvarez et al. 2009) which can be reproduced by balanced, rather than dominant, inhibition (Brunel 2000).

We further assume that many biological details of the neurons are not critical for the network function. In general, a specific network behavior can be achieved by completely different parameter sets (Prinz et al. 2004). In our model of the HVC, the feed-forward network structure (reviewed in Kumar et al. 2010) is crucial whereas the precise details of the neuron model are not. It has been shown that synfire chains can reliably propagate pulse packets in the presence of noise (Diesmann et al. 1999; Goedeke and Diesmann 2008), an intra-dilution rate (Hayon et al. 2005), distributed delays (private observation) or unreliable synapses (Guo and Li 2010) and for various forms of neural models such as leaky integrate-and-fire neurons with conductance- or current-based synapses (Kumar et al. 2006; Schrader et al. 2010), intrinsic bursting neurons (Teramae and Fukai 2008) or compartmental bursting neurons (Jin et al. 2007). Hence, we do not model the neurons in the HVC or the auditory network neurons in detail because our level of description is sufficient to draw conclusions on the functional level of the song syntax production.

We have not included the AFP in our model, which is the second major pathway in the song system. The AFP is crucial for song learning in young birds, since lesion of LMAN leads to an early crystallization of the song whereas lesion of Area X prevents song stabilization (Bottjer et al. 1984; Sohrabji et al. 1990; Scharff and Nottebohm 1991). In the adult zebra finch it has been reported that the activity in LMAN drives variability in either syllable structure in isolation (Kao et al. 2005; Aronov et al. 2008; Horita et al. 2008) or together with changes in song sequencing (Brainard and Doupe 2000; Ölveczky et al. 2005; Nordeen and Nordeen 2010). However, in the Bengalese finch only changes in syllable structure have been experimentally verified (Kao and Brainard 2006; Hampton et al. 2009). Furthermore, as song sequencing is fixed in the adult zebra finch, it is difficult to draw conclusions from studies on zebra finch as to the role of LMAN in the variability of sequences (Hampton et al. 2009). As we are investigating song sequencing and not song learning, these experimental results motivate our model assumption that the AFP is not responsible for the song syntax generation in the adult Bengalese finch and thus need not be modeled.

We assume that the song sequence is controlled directly by means of state cues delivered by the auditory system. The state cue consists of only the last syllable produced, based on the assumption that the song syntax can be fully characterized by a first order Markov process. This assumption has been used explicitly in several model studies (Katahira et al. 2007; Jin 2009) and implicitly in numerous transition diagrams (e.g. Yamada and Okanoya 2003; Sakata and Brainard 2006; Wohlgemuth et al. 2010). Recent investigations of the song syntax statistics reveal additional hidden states and adaptation in the first order Markov process (Jin 2009; Katahira et al. 2010; Jin and Kozhevnikov 2010). Common excitatory drive to the HVC from NIf or CM in response to BOS has been found experimentally (Rosen and Mooney 2006; Roy and Mooney 2009). In our model these reafferent signals are directly used to generate the correct song syntax via priming the synfire chain transition sites. Indirect reafferent influences on local circuits within the HVC or on a possible brain stem feedback loop are not investigated because they would neither change the function of the model nor deliver further insight into the song system based on current knowledge.

We test two different hypotheses of how synaptic plasticity could generate a gradual loss of syntax when the auditory feedback is suppressed. We either assume plasticity in the HVC_RA synapses to imprint the song syntax or the interaction of an efference copy with the reafferent signal. The latter is realized in the model by assuming additional plastic connections from the HVC to the auditory network. An efference copy in field L and CM neurons of zebra finches has recently been found by Keller and Hahnloser (2009). An additional efference copy could also be situated in the HVC itself (Troyer and Doupe 2000a); recent experiments on swamp sparrows indicate that an efference copy is established in the connections from HVC_RA to HVC_X via HVC_I (Prather et al. 2008, 2009). The specific site of the interaction of the efference copy or alternatively the brain stem feedback with the reafferent signal does not alter the conclusions we draw from our model. In both cases the synaptic plasticity is modeled by additive STDP with fixed upper and lower bounds of the weights in order to prevent runaway excitation or the disconnection of the chains. A weight-dependent STDP rule would behave similarly, as the results do not depend on symmetry breaking properties; only the causal relationship of the pre- and post-synaptic activity plays a role. STDP coupled to hard synaptic weight boundaries has been shown to prevent the destabilization of network activity by Hebbian plasticity (Abbott and Nelson 2000; Turrigiano and Nelson 2004). Additionally, we assume an exponential decay of weights over time. This is needed to introduce depression, as typically only the pre-before-post spiking pattern is found in the activity of feed-forward networks.

4.2 Reproduction and predictions of experimental findings

By combining the reafferent and feed-forward models it is possible to reliably produce a predefined song syntax that is stored in the excitatory afferent connections from auditory sub-networks to the HVC. In our model each syllable is represented by the synfire activity propagating through one chain in the HVC network; the transformation into sounds via RA and the vocal organs is not modeled. Consequently, the reafferent signal consists of temporal rather than spectral cues, which is in accordance with experimental findings (Woolley and Rubel 1999). The population activity of the HVC_I neurons in our simulated network is modulated with song structure as observed in experiments by Kozhevnikov and Fee (2007) and Sakata and Brainard (2008).

Our model reproduces for the first time the loss of song syntax when the auditory feedback is suppressed and its recovery when the auditory feedback is restored (see Section 3.3) as observed in experiments (e.g. reviewed in Woolley 2008). The model song exhibits an increase in transition entropy (Sakata and Brainard 2006) and a decrease in sequence score (Woolley and Rubel 1997) in the absence of auditory feedback. We test two hypotheses for how plasticity in the model could result in a gradual loss of syntax after the auditory feedback is suppressed (see Section 3.4). Both hypotheses, imprinting of song syntax in the HVC or maintaining an efference copy in additional connections from the HVC to the auditory network, are able to reproduce the experimental observation that the song syntax deteriorates over a long period after deafening (Okanoya and Yamaguchi 1997; Watanabe and Aoki 1998; Woolley and Rubel 1999).

In Section 3.3 we have shown that the heterogeneities in the HVC connectivity result in sequences where some transitions are more likely than others, a typical song property (Katahira et al. 2007). When the auditory feedback is suppressed these heterogeneities prevent the song syntax from becoming completely randomized. This effect has also been reported in experiments (compare Fig. 5 with, for example, Figure 1 of Sakata and Brainard 2006). We therefore conclude that transition probabilities are partly determined by the network connectivity and that the suppression of the auditory feedback reveals this underlying network heterogeneity. By comparing the transition probabilities before and after deafening, the priming bias of the auditory feedback could be quantified.

In order to investigate the gradual loss of syntax we introduced plasticity to the model (see Section 3.4). Both hypotheses presented, efference copy and imprinting of the syntax, can account for the gradual loss of syntax after deafening. Hence, we predict the presence of Hebbian synaptic plasticity in the adult bird’s HVC_RA interconnections or connections from the HVC_RA to the auditory system, with a depressing component on a time scale similar to the ‘loss of syntax period’. This can be investigated by future electrophysiological experiments on the HVC and will reveal whether either of the hypotheses is correct. Independent of which mechanism takes place in the Bengalese finch, it has significant impact on the behavioral level of the bird because it reduces the reliance on auditory feedback and keeps the syntax correct in the absence or perturbation of a constant reafferent input. We therefore predict that birds within the same species that show longer lasting stability in the song syntax will be less affected by online perturbations. Across species such experiments have already been conducted. Indeed, the zebra finch, which maintains its song syntax over several weeks after deafening, is less affected by feedback perturbations than the Bengalese finch, which loses its song syntax within a week (Sakata and Brainard 2008). Additionally, it has been shown that the age at deafening determines the time period over which the song syntax is lost (Lombardino and Nottebohm 2000). The younger the bird, the earlier and faster the song syntax is lost, suggesting that experienced singers possess a more robust memory. This is in accordance with our findings that a memory of the song syntax develops over a substantial period of time, irrespective of the plasticity hypothesis assumed.

An active process of song decrystallization in the adult bird by unlearning or renewed vocal plasticity has been suggested by Roy and Mooney (2009). In our study we show that at least the loss of song syntax can be completely explained by the gradual and passive depression of synaptic weights due to the lack of auditory cues in the deafened bird. Whether the syllable structure is actively altered or passively disturbed by the LMAN in the absence of auditory feedback is beyond the scope of the current study but is an interesting question to be addressed in future research.

The syntax can be changed online with altered auditory feedback (see Section 3.5) resembling experimental findings by Sakata and Brainard (2006, 2008). These experimental studies report the occurrence of novel transitions in response to selective AAF, and also a decrease of primary transition and an increase of secondary transition probabilities. The former effect can be reproduced by our model if an additional syllable is overlaid on the sung syllable. To reproduce the second effect, it is necessary to lower the signal-to-noise ratio of the sung syllable by stimulating the auditory sub-networks of all syllables. These results suggest that in the experiment the bird cannot classify the artificial syllable playback uniquely and that multiple, conflicting syllable responses in BOS selective neurons are generated by the stimulus. An experimental prediction arising from these results is that the type and amount of transition probability changes depend on the value of the signal-to-noise ratio and that distinct regions of different AAF effects can be defined. To observe these regions experimentally, the auditory feedback would have to be manipulated in such a way that the bird’s classification of the feedback into distinct syllables can be controlled. For example, artificial syllables could be generated by gradually superimposing syllables of the BOS or using a combination of notes from different syllables.

We previously reported that synfire chain competition of motor primitives can lead to low frequency oscillations in collective signals such as the LFP (Hanuschkin et al. 2010c). The frequency of these oscillations depends on the characteristic length of competing motor primitives. Experiments have already revealed modulations of the HVC activity with song structure (Kozhevnikov and Fee 2007; Sakata and Brainard 2008). Our model predicts the existence of a low frequency component of the LFP that is characteristic for a given song bird species, where the characteristic frequency is given by the inverse of the characteristic song syllable length. Assuming that the mean syllable length is a fair approximation of the modal syllable length, for the zebra finch (mean syllable duration ∼100 ms, Brainard and Doupe 2001; Leonardo and Fee 2005) we predict a 10 Hz component and in the case of the Bengalese finch (mean syllable duration ∼64 ms, Sakata and Brainard 2006) a 15 Hz component. However, a sharp peak is not to be expected, due to the distribution of syllable lengths (Seki et al. 2008; Fiete et al. 2010) and masking of the effect as a result of the spatial distribution of neurons coding for different syllables and the activity of HVC_X neurons. Moreover, the amplitude of the low frequency component will be reduced in the presence of a strong priming bias or imprinting, as this reduces the competition between the chains. We therefore predict that the low frequency component has a greater amplitude in deafened birds. The discovery of a species characteristic low frequency component would be strong supporting evidence for our model assumption of competing synfire chains for motor pattern construction and hence may shed light on the mechanism of motor pattern generation in general.

4.3 Limitations and extensions

The investigation of auditory responses in the HVC revealed common BOS selective excitation from NIf or CM which results in sparse bursts of HVC_RA and HVC_X neurons, depolarized HVC_RA neurons, and hyperpolarized HVC_X neurons (Rosen and Mooney 2006; Roy and Mooney 2009). Due to the superthreshold drive to the HVC_RA and the auditory input to only the initial pools of the synfire chains, no spontaneous BOS selective spiking responses occur in our model. In order to reproduce the results of the playback of BOS and delayed BOS more accurately, we would need to model the drive from the auditory network in a more elaborate fashion. For example, projections to neurons within the chains in addition to the priming projections would elicit HVC_RA activity without synfire ignition.

Sakata and Brainard (2008) showed that the HVC_I activity decreases in response to feedback perturbations. This is not observed in our simulations, since the auditory input to the HVC_RA is excitatory and followed by an increase in HVC_I activity. This leads to an increase of the total HVC activity. The effect introduced by the feedback perturbations is marginal, but it is strong in the case of a brief burst of noise (compare Fig. 8a, c, e). However, a decrease in total HVC activity could be reproduced if the HVC_X neurons were included in the model, as they are suppressed by increasing HVC_I activity. Alternatively, a modulation of the external drive to the HVC neurons could be assumed.

While pools of neurons in the HVC_I do increase their firing rate in response to specific syllables, the model does not create neurons that are selective for temporal order (Lewicki and Konishi 1995). This feature selectivity has been shown for a model with large time constants in the HVC_I neurons and an appropriately chosen connectivity of the HVC network (Drew and Abbott 2003); it is likely that temporal order selectivity would naturally emerge from our network model if we also made these assumptions. Recently, Nishikawa et al. (2008) reported population coding of the song element sequences rather than temporal order selective neurons in the HVC. We have yet to determine whether such syllable sequence selective assemblies could be extracted from our simulated neuron populations.

We made the assumption that the Bengalese song syntax can be fully characterized by a first order Markov process. However, it has recently been shown that a partially observable Markov model (POMM) provides a better account of the higher-order state dependencies of transition probabilities observed in the Bengalese song syntax (Jin 2009; Katahira et al. 2010). The POMM is an extension of a first order Markov model introducing hidden states which provide multiple encodings of the same syllable but with different transition probabilities. Hidden states are indistinguishable for the observer and can only be deduced from an analysis of the transitions. The model presented here can be trivially extended from a first order Markov process to the POMM. Moreover, it could be extended to include adaptation; Jin and Kozhevnikov (2010) showed that when adaptation is introduced to the POMM the reproduction of the song syntax statistics is improved further and the number of hidden states is substantially reduced.
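The idea that hidden states provide multiple encodings of the same syllable can be illustrated with a toy POMM. The states, emissions and transition probabilities below are hypothetical and are not derived from Bengalese finch data or from the syntax in Fig. 2b. Two hidden states, `A1` and `A2`, both emit the syllable ‘A’ but have different successors, so the observed sequence shows a dependency that a first order Markov process on the syllables alone could not express.

```python
import random

# Hypothetical toy POMM: hidden states "A1" and "A2" both emit syllable "A".
EMIT = {"A1": "A", "A2": "A", "B": "B", "C": "C"}
TRANS = {
    "A1": {"A2": 1.0},           # the first 'A' is always followed by a second 'A'
    "A2": {"B": 0.5, "C": 0.5},  # the second 'A' branches to 'B' or 'C'
    "B": {"A1": 1.0},
    "C": {"A1": 1.0},
}

def sample_pomm(n, start="A1", seed=0):
    """Sample n observed syllables from the toy POMM defined above."""
    rng = random.Random(seed)
    state, out = start, []
    for _ in range(n):
        out.append(EMIT[state])
        nxt = TRANS[state]
        state = rng.choices(list(nxt), weights=list(nxt.values()))[0]
    return "".join(out)
```

In the sampled song every ‘A’ occurs as part of exactly one pair ‘AA’: an observer sees that ‘A’ is sometimes followed by ‘A’ and sometimes by ‘B’ or ‘C’, but which of these occurs is fully determined by whether the preceding syllable was also ‘A’. The hidden states can thus only be deduced from an analysis of the transitions.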

The sequencing stereotype $S_{\star}$ and the average transition entropy $\hat{H}$ quantify the song syntax generated by the model. Even though both measures are widely used in experimental literature (e.g. Scharff and Nottebohm 1991; Woolley and Rubel 1997; Sakata and Brainard 2006) it should be noted that the measures are sensitive to errors in syllable characterization. Deafening tends to alter the syllable structure, and so it is challenging to find a consistent and unambiguous characterization of syllables in such protocols (e.g. Lombardino and Nottebohm 2000; Woolley and Rubel 2002; Horita et al. 2008). A careful examination of song data after deafening with respect to the change of transition probabilities with the assumption of an underlying adaptive POMM might give better insight. Such data would also allow the extension of the presented model to reproduce loss and restoration of syntax on biologically realistic time scales.

Our model predicts that the duration of individual song syllables remains constant after deafening. A shortening of syllables has been reported (Brainard and Doupe 2001), but a more recent study did not reproduce these findings (Nordeen and Nordeen 2010). Similarly, reduction of song tempo as a result of altered auditory feedback has been observed (Sakata and Brainard 2006). Such effects could be reproduced by manipulating the external drive to the neurons of the synfire chain (Wennekers and Palm 1996). Dropped syllables and the emergence of new or unrecognizable syllables have previously been reported (Woolley and Rubel 1997; Watanabe and Aoki 1998; Leonardo and Konishi 1999; Horita et al. 2008), but are not observed in our model even though the activation probability can become low. Increasing the total number of syllables would probably result in the loss of some syllables after deafening.

We postulate synaptic plasticity in the HVC to account for the gradual loss of syntax. The plastic connections either imprint the song structure within HVC_RA or construct an efference copy of the song. Both hypotheses reproduce the loss of syntax over a period of time. Unfortunately, in the current study we cannot further distinguish between these hypotheses. Even though the trajectories of $S_{\star}$ and $\hat{H}$ differ, we cannot derive predictions for the parameters of the synaptic plasticity or the duration of the syntax loss period. Without modeling the behavior states of the bird, it is not feasible to make a comparison of these variables to the real experimental setting. The gradual increase of the sequencing stereotype $S_{\star}$ during hair cell recovery (Woolley and Rubel 2002) is not reproduced by the current study but could easily be modeled by increasing the strength of auditory feedback gradually rather than abruptly.

It has been shown that lesion of the Uva in the zebra finch evokes changes in the syllable sequencing (Williams and Vicario 1993). The original syllable sequencing recovers after a period of time if the Uva is only unilaterally lesioned (Coleman and Vu 2005). These and other experiments on interhemispherical interaction suggest that the Uva delivers nonauditory feedback information during singing (Wild 1994; Vu et al. 1994). Our model does not include the Uva influence on the song syntax generation because we assume the influence of auditory feedback to be the major source of priming in the Bengalese finch. In further extensions to our model an additional source of priming from the Uva to the HVC network via NIf would have to be incorporated.

In the model a constant suprathreshold drive is applied to the HVC network. This ensures ongoing synfire activity; the presence of dominant global inhibition creates strong competition between the chains, which results in robust winner-takes-all dynamics (Jin 2009). However, this approach introduces some limitations because the drive has to be controlled from outside HVC. The reproduction of behavior states such as periods of silence, sleep, habituation to AAF (Sakata and Brainard 2006) or even the relevance of the social context (Jarvis et al. 1998; Sakata et al. 2008) or motivation is beyond the scope of the presented model. By introducing a variable drive to the HVC such behavior states could be incorporated in the model. This drive could originate in other functional brain areas, such as the Uva or the auditory network (Rosen and Mooney 2006; Akutagawa and Konishi 2010) and may also be controlled by neuromodulation (Dave et al. 1998; Sakata et al. 2008), context-dependent gene expression (Jarvis et al. 1998) and hormones (Pröve 1974).

A further limitation is the lack of defined song initiation and termination by introductory notes and terminal states respectively. Additional drive to a synfire chain representing the initial note would ensure that the song always starts at that point, but the termination would have to be controlled by reducing the common external drive to the HVC. The constant high drive also results in shorter interruptions following bursts of noise than are observed experimentally (Sakata and Brainard 2006), which further demonstrates that the constant drive is over-simplified.

The finding that zebra finch song is most interruptible between song syllables (Cynx 1990) fits well with the concept that individual syllables are represented by propagating synfire activity, because the interruption of such activity is generally difficult. Recently, Seki et al. (2008) showed that flashes of light also tend to interrupt the Bengalese finch song between syllables, with high probability transitions exhibiting greater robustness. However, modeling the HVC with synfire chains that are driven by a suprathreshold input masks the robustness of the synfire propagation, as any failure in propagating activity leads to a direct restart of the same or another chain in the network.

An alternative approach to the suprathreshold drive is to apply a subthreshold drive to the individual neurons that raises their average membrane potential to the proximity of the threshold. Synfire activity in the network is initiated by applying a synchronous stimulus to the initial pool of an individual chain or a set of competing chains. In a model of the motor cortex of a monkey performing 2-dimensional movements, we have shown that robust competition can be achieved in such a network, but that it requires fine tuning of the mutual cross-inhibition between competing chains. The working regime can be enlarged by priming signals, but the probability that activity fails to propagate from a chain to one of its successor chains rises with increasing number of possible successor chains (Hanuschkin et al. 2010c). In such a network, the song would automatically stop after some repetitions of the song motif and it would predict that substantially shorter songs are produced in the absence of auditory priming. Additional predictions of such an approach would be that interruptions occur between syllables (Cynx 1990; Seki et al. 2008), that altered auditory feedback is most effective at a specific delay (Sakata and Brainard 2006) and that combinations of syllables can occur in some circumstances (Woolley and Rubel 1997).

In the investigation of the song syntax generation in the adult Bengalese finch presented here we have not touched upon the question of the acquisition of the song syntax by the juvenile bird. Our key assumption is that the song syntax is stored in the afferent connections from the auditory system to the HVC. This syntax is probably learned through the interplay of AFP, auditory network and synaptic plasticity within HVC. Dopaminergic input to NIf, HVC and the AFP nuclei Area X and LMAN has been reported (Brainard and Doupe 2002; Gale and Perkel 2010), which could represent a reinforcement signal (Hollerman and Schultz 1998; Potjans et al. 2011). Our model could therefore be extended to incorporate dopaminergic reinforcement signals to investigate song syntax learning. A similar approach to learning syllable structure has already yielded promising results (Fiete et al. 2007).

In summary, our findings support the theory that song syntax memory after song crystallization is stored in the connections from auditory related areas to the HVC. Our combined model is not only capable of reproducing a variety of experimental findings and generating specific predictions for future experiments, but also lends itself to extensions that allow many other aspects of song learning and production to be studied.

References

Abbott, L. F. (1999). Lapicque’s introduction of the integrate-and-fire model neuron (1907). Brain Research Bulletin, 50(5/6), 303–304.
Abbott, L. F., & Nelson, S. B. (2000). Synaptic plasticity: Taming the beast. Nature Neuroscience, 3, 1178–1183.
Abeles, M. (1991). Corticonics: Neural circuits of the cerebral cortex (1st ed.). Cambridge: Cambridge University Press.
Abeles, M., Hayon, G., & Lehmann, D. (2004). Modeling compositionality by dynamic binding of synfire chains. Journal of Computational Neuroscience, 17(2), 179–201.
Akutagawa, E., & Konishi, M. (2010). New brain pathways found in the vocal control system of a songbird. Journal of Comparative Neurology, 518, 3086–3100.
Aronov, D., Andalman, A. S., & Fee, M. S. (2008). A specialized forebrain circuit for vocal babbling in the juvenile songbird. Science, 320(5876), 630–634.
Bauer, E. E., Coleman, M. J., Roberts, T. F., Roy, A., Prather, J. F., & Mooney, R. (2008). A synaptic basis for auditory-vocal integration in the songbird. Journal of Neuroscience, 28(6), 1509–1522.
Bell, C. (1981). An efference copy which is modified by reafferent input. Science, 214(4519), 450–453.
Bottjer, S., & Arnold, A. (1984). The role of feedback from the vocal organ. I. Maintenance of stereotypical vocalizations by adult zebra finches. Journal of Neuroscience, 4(9), 2387–2396.
Bottjer, S., Miesner, E., & Arnold, A. (1984). Forebrain lesions disrupt development but not maintenance of song in passerine birds. Science, 224(4651), 901–903.
Brainard, M. S., & Doupe, A. J. (2000). Interruption of a basal ganglia-forebrain circuit prevents plasticity of learned vocalizations. Nature, 404, 762–766.
Brainard, M. S., & Doupe, A. J. (2001). Postlearning consolidation of birdsong: Stabilizing effects of age and anterior forebrain lesions. Journal of Neuroscience, 21(7), 2501–2517.
Brainard, M. S., & Doupe, A. J. (2002). What songbirds teach us about learning. Nature, 417, 351–358.
Brunel, N. (2000). Dynamics of sparsely connected networks of excitatory and inhibitory spiking neurons. Journal of Computational Neuroscience, 8(3), 183–208.
Buonomano, D. V. (2005). A learning rule for the emergence of stable dynamics and timing in recurrent networks. Journal of Neurophysiology, 94, 2275–2283.
Burns, B. D., & Webb, A. C. (1976). The spontaneous activity of neurones in the cat’s visual cortex. Proceedings of the Royal Society of London, B 194, 211–223.
Cardin, J. A., Raksin, J. N., & Schmidt, M. F. (2005). Sensorimotor nucleus NIf is necessary for auditory processing but not vocal motor output in the avian song system. Journal of Neurophysiology, 93(4), 2157–2166.
Chang, W., & Jin, D. Z. (2009). Spike propagation in driven chain networks with dominant global inhibition. Physical Review E, 79(5), 051917.
Coleman, M. J., & Vu, E. T. (2005). Recovery of impaired songs following unilateral but not bilateral lesions of nucleus uvaeformis of adult zebra finches. Journal of Neurobiology, 63, 70–89.
Cynx, J. (1990). Experimental determination of a unit of song production in the zebra finch (Taeniopygia guttata). Journal of Comparative Psychology, 104(1), 3–10.
Cynx, J., & von Rad, U. (2001). Immediate and transitory effects of delayed auditory feedback on bird song production. Animal Behaviour, 62(2), 305–312.
Dave, A. S., Yu, A. C., & Margoliash, D. (1998). Behavioral state modulation of auditory activity in a vocal motor system. Science, 282(5397), 2250–2254.
Diesmann, M., Gewaltig, M.-O., & Aertsen, A. (1999). Stable propagation of synchronous spiking in cortical neural networks. Nature, 402(6761), 529–533.
Doursat, R., & Bienenstock, E. (2006). The self-organized growth of synfire patterns. In 10th international conference on cognitive and neural systems (ICCNS), Massachusetts. Boston University.
Drew, P. J., & Abbott, L. F. (2003). Model of song selectivity and sequence generation in area HVc of the songbird. Journal of Neurophysiology, 89(5), 2697–2706.
Dutar, P., Vu, H. M., & Perkel, D. J. (1998). Multiple cell types distinguished by physiological, pharmacological, and anatomic properties in nucleus HVC of the adult zebra finch. Journal of Neurophysiology, 80(4), 1828–1838.
Fee, M. S., Kozhevnikov, A. A., & Hahnloser, R. H. (2004). Neural mechanisms of vocal sequence generation in the songbird. Annals of the New York Academy of Sciences, 1016, 153–170.
Fiete, I. R., Fee, M. S., & Seung, H. S. (2007). Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. Journal of Neurophysiology, 98(4), 2038–2057.
Fiete, I. R., Senn, W., Wang, C. Z. H., & Hahnloser, R. H. R. (2010). Spike-time-dependent plasticity and heterosynaptic competition organize networks to produce long scale-free sequences of neural activity. Neuron, 65, 563–576.
Gale, S. D., & Perkel, D. J. (2010). A basal ganglia pathway drives selective auditory responses in songbird dopaminergic neurons via disinhibition. Journal of Neuroscience, 30(3), 1027–1037.
Gewaltig, M.-O., & Diesmann, M. (2007). NEST (neural simulation tool). Scholarpedia, 2(4), 1430.
Gibb, L., Gentner, T. Q., & Abarbanel, H. D. I. (2009a). Brain stem feedback in a computational model of birdsong sequencing. Journal of Neurophysiology, 102(3), 1763–1778.
Gibb, L., Gentner, T. Q., & Abarbanel, H. D. I. (2009b). Inhibition and recurrent excitation in a computational model of sparse bursting in song nucleus HVC. Journal of Neurophysiology, 102(3), 1748–1762.
Glaze, C. M., & Troyer, T. (2008). Neuroscience: Cool songs. Nature, 456, 187–188.
Goedeke, S., & Diesmann, M. (2008). The mechanism of synchronization in feed-forward neuronal networks. New Journal of Physics, 10, 015007.
Guo, D., & Li, C. (2010). Signal propagation in feedforward neuronal networks with unreliable synapses. Journal of Computational Neuroscience. doi:10.1007/s10827-010-0279-7.
Hahnloser, R. H., Kozhevnikov, A. A., & Fee, M. S. (2002). An ultra-sparse code underlies the generation of neural sequences in a songbird. Nature, 419(6902), 65–70.
Hampton, C. M., Sakata, J. T., & Brainard, M. S. (2009). An avian basal ganglia-forebrain circuit contributes differentially to syllable versus sequence variability of adult Bengalese finch song. Journal of Neurophysiology, 101(6), 3235–3245.
Hansel, D., Mato, G., Meunier, C., & Neltner, L. (1998). On numerical simulations of integrate-and-fire neural networks. Neural Computation, 10(2), 467–483.
Hanuschkin, A., Diesmann, M., & Morrison, A. (2010a). Functional compositionality realized in biological realistic spiking neural networks by synfire chain competition. Proceedings of the 40th annual meeting of the Society for Neuroscience.
Hanuschkin, A., Diesmann, M., & Morrison, A. (2010b). A reafferent model of song syntax generation in the Bengalese finch. BMC Neuroscience, 11(Suppl 1), P33.
Hanuschkin, A., Diesmann, M., & Morrison, A. (2011). Plasticity in the HVC of the Bengalese finches is crucial for song syntax stability. Proceedings of the 9th Göttingen Meeting of the German Neuroscience Society.
Hanuschkin, A., Herrmann, J. M., Morrison, A., & Diesmann, M. (2010c). Compositionality of arm movements can be realized by propagating synchrony. Journal of Computational Neuroscience, doi:10.1007/s10827-010-0285-9.
Hanuschkin, A., Kunkel, S., Morrison, A., & Diesmann, M. (2010d). A general and efficient method for incorporating precise spike times in globally time-driven simulations. Frontiers in Neuroinformatics, 4, 113.
Hayon, G., Abeles, M., & Lehmann, D. (2005). A model for representing the dynamics of a system of synfire chains. Journal of Computational Neuroscience, 18, 41–53.
Herrmann, M., Hertz, J. A., & Prügel-Bennett, A. (1995). Analysis of synfire chains. Network, 6, 403–414.
Hollerman, J. R., & Schultz, W. (1998). Dopamine neurons report an error in the temporal prediction of reward during learning. Nature Neuroscience, 1, 304–309.
Holst, E., & Mittelstaedt, H. (1950). Das Reafferenzprinzip. Naturwissenschaften, 37(20), 464–476.
Horita, H., Wada, K., & Jarvis, E. D. (2008). Early onset of deafening-induced song deterioration and differential requirements of the pallial-basal ganglia vocal pathway. European Journal of Neuroscience, 28, 2519–2532.
Hosaka, R., Araki, O., & Ikeguchi, T. (2008). STDP provides the substrate for igniting synfire chains by spatiotemporal input patterns. Neural Computation, 20, 415–435.
Hough, G. E., II, & Volman, S. F. (2002). Short-term and long-term effects of vocal distortion on song maintenance in zebra finches. Journal of Neuroscience, 22(3), 1177–1186.
Izhikevich, E. M., Gally, J. A., & Edelman, G. M. (2004). Spike-timing dynamics of neuronal groups. Cerebral Cortex, 14, 933–944.
Jarvis, E. D., Gunturkun, O., Bruce, L., Csillag, A., Karten, H., Kuenzel, W., et al. (2005). Avian brains and a new understanding of vertebrate brain evolution. Nature Reviews Neuroscience, 6, 151–159.
Jarvis, E. D., Scharff, C., Grossman, M. R., Ramos, J. A., & Nottebohm, F. (1998). For whom the bird sings: Context-dependent gene expression. Neuron, 21(4), 775–788.
Jin, D. Z. (2009). Generating variable birdsong syllable sequences with branching chain networks in avian premotor nucleus HVC. Physical Review E, 80(5), 051902.
Jin, D. Z., & Kozhevnikov, A. A. (2010). A compact statistical model of the song syntax in Bengalese finch. arXiv. 1011.2998v1 [q-bio.NC].
Jin, D. Z., Ramazanoglu, F. M., & Seung, H. S. (2007). Intrinsic bursting enhances the robustness of a neural network model of sequence generation by avian brain area HVC. Journal of Computational Neuroscience, 23(3), 283–299.
Jun, J. K., & Jin, D. Z. (2007). Development of neural circuitry for precise temporal sequences through spontaneous activity, axon remodeling, and synaptic plasticity. PLoS ONE, 2(8), e723.
Kao, M. H., & Brainard, M. S. (2006). Lesions of an avian basal ganglia circuit prevent context-dependent changes to song variability. Journal of Neurophysiology, 96(3), 1441–1455.
Kao, M. H., Doupe, A. J., & Brainard, M. S. (2005). Contributions of an avian basal ganglia-forebrain circuit to real-time modulation of song. Nature, 433, 638–643.
Katahira, K., Okanoya, K., & Okada, M. (2007). A neural network model for generating complex birdsong syntax. Biological Cybernetics, 97(5–6), 441–448.
Katahira, K., Suzuki, K., Okanoya, K., & Okada, M. (2010). Complex sequencing rules of birdsong can be explained by simple hidden Markov processes. arXiv. 1011.2575v1 [q-bio.NC].
Keller, G. B., & Hahnloser, R. H. R. (2009). Neural processing of auditory feedback during vocal practice in a songbird. Nature, 457, 187–190.
Konishi, M. (2004). The role of auditory feedback in birdsong. Annals of the New York Academy of Sciences, 1016, 463–475.
Kozhevnikov, A., & Fee, M. S. (2007). Singing-related activity of identified HVC neurons in the zebra finch. Journal of Neurophysiology, 97, 4271–4283.
Kumar, A., Rotter, S., & Aertsen, A. (2006). Propagation of synfire activity in locally connected networks with conductance-based synapses. In Computational and Systems Neuroscience (Cosyne) 2006.
Kumar, A., Rotter, S., & Aertsen, A. (2010). Spiking activity propagation in neuronal networks: Reconciling different perspectives on neural coding. Nature Reviews Neuroscience, 11, 615–627.
Kunkel, S., Diesmann, M., & Morrison, A. (2010). Limits to the development of feed-forward structures in large recurrent neuronal networks. Frontiers in Computational Neuroscience, 4, 160.
Lapicque, L. (1907). Recherches quantitatives sur l’excitation electrique des nerfs traitee comme une polarization. Journal de physiologie et de pathologie générale, 9, 620–635.
Leonardo, A., & Fee, M. S. (2005). Ensemble coding of vocal control in birdsong. Journal of Neuroscience, 25(3), 652–661.
Leonardo, A., & Konishi, M. (1999). Decrystallization of adult birdsong by perturbation of auditory feedback. Nature, 399, 466–470.
Lewicki, M. S. (1996). Intracellular characterization of song-specific neurons in the zebra finch auditory forebrain. Journal of Neuroscience, 16(18), 5854–5863.
Lewicki, M. S., & Konishi, M. (1995). Mechanisms underlying the sensitivity of songbird forebrain neurons to temporal order. PNAS, 92(12), 5582–5586.
Li, M., & Greenside, H. (2006). Stable propagation of a burst through a one-dimensional homogeneous excitatory chain model of songbird nucleus HVC. Physical Review E, 74(1), 011918.
Liu, J. K., & Buonomano, D. V. (2009). Embedding multiple trajectories in simulated recurrent neural networks in a self-organizing manner. Journal of Neuroscience, 29(42), 13172–13181.
Lombardino, A. J., & Nottebohm, F. (2000). Age at deafening affects the stability of learned song in adult male zebra finches. Journal of Neuroscience, 20(13), 5054–5064.
Long, M. A., & Fee, M. S. (2008). Using temperature to analyse temporal dynamics in the songbird motor pathway. Nature, 456, 189–194.
Long, M. A., Jin, D. Z., & Fee, M. S. (2010). Support for a synaptic chain model of neuronal sequence generation. Nature, 468, 394–399.
Masuda, N., & Kori, H. (2007). Formation of feedforward networks and frequency synchrony by spike-timing-dependent plasticity. Journal of Computational Neuroscience, 22, 327–345.
McCasland, J. (1987). Neuronal control of bird song production. Journal of Neuroscience, 7(1), 23–39.
Mooney, R. (2000). Different subthreshold mechanisms underlie song selectivity in identified HVC neurons of the zebra finch. Journal of Neuroscience, 20(14), 5420–5436.
Mooney, R., & Prather, J. F. (2005). The HVC microcircuit: The synaptic basis for interactions between song motor and vocal plasticity pathways. Journal of Neuroscience, 25(8), 1952–1964.
Morrison, A., Aertsen, A., & Diesmann, M. (2007a). Spike-timing dependent plasticity in balanced random networks. Neural Computation, 19, 1437–1467.
Morrison, A., Diesmann, M., & Gerstner, W. (2008). Phenomenological models of synaptic plasticity based on spike-timing. Biological Cybernetics, 98, 459–478.
Morrison, A., Straube, S., Plesser, H. E., & Diesmann, M. (2007b). Exact subthreshold integration with continuous spike times in discrete time neural network simulations. Neural Computation, 19(1), 47–79.
Nishikawa, J., Okada, M., & Okanoya, K. (2008). Population coding of song element sequence in the Bengalese finch HVC. European Journal of Neuroscience, 27(12), 3273–3283.
Nordeen, K., & Nordeen, E. (1992). Auditory feedback is necessary for the maintenance of stereotyped song in adult zebra finches. Behavioral and Neural Biology, 57, 58–66.
Nordeen, K. W., & Nordeen, E. J. (2010). Deafening-induced vocal deterioration in adult songbirds is reversed by disrupting a basal ganglia-forebrain circuit. Journal of Neuroscience, 30(21), 7392–7400.
Nordlie, E., Gewaltig, M.-O., & Plesser, H. E. (2009). Towards reproducible descriptions of neuronal network models. PLoS Computational Biology, 5(8), e1000456.
Nottebohm, F. (2002). Birdsong’s clockwork. Nature Neuroscience, 5, 925–926.
Okanoya, K., & Yamaguchi, A. (1997). Adult Bengalese finches (Lonchura striata var. domestica) require real-time auditory feedback to produce normal song syntax. Journal of Neurobiology, 33(4), 343–356.
Ölveczky, B. P., Andalman, A. S., & Fee, M. S. (2005). Vocal experimentation in the juvenile songbird requires a basal ganglia circuit. PLoS Biology, 3(5), e153.
Plesser, H. E., & Diesmann, M. (2009). Simplicity and efficiency of integrate-and-fire neuron models. Neural Computation, 21, 353–359.
Poirier, C., Boumans, T., Verhoye, M., Balthazart, J., & Van der Linden, A. (2009). Own-song recognition in the songbird auditory pathway: Selectivity and lateralization. Journal of Neuroscience, 29(7), 2252–2258.
Ponce-Alvarez, A., Kilavik, B. E., & Riehle, A. (2009). Comparison of local measures of spike time irregularity and relating variability to firing rate in motor cortical neurons. Journal of Computational Neuroscience, 29(1–2), 351–365.
Potjans, W., Diesmann, M., & Morrison, A. (2011). An imperfect dopaminergic error signal can drive temporal-difference learning. PLoS Computational Biology (in press).
Prather, J. F., Nowicki, S., Anderson, R. C., Peters, S., & Mooney, R. (2009). Neural correlates of categorical perception in learned vocal communication. Nature Neuroscience, 12, 221–228.
Prather, J. F., Peters, S., Nowicki, S., & Mooney, R. (2008). Precise auditory-vocal mirroring in neurons for learned vocal communication. Nature, 451, 305–310.
Prinz, A. A., Bucher, D., & Marder, E. (2004). Similar network activity from disparate circuit parameters. Nature Neuroscience, 7, 1345–1352.
Pröve, E. (1974). Der Einfluß von Kastration und Testosteronsubstitution auf das Sexualverhalten männlicher Zebrafinken (Taeniopygia guttata castanotis Gould). Journal für Ornithologie, 115, 338–347.
Rosen, M. J., & Mooney, R. (2003). Inhibitory and excitatory mechanisms underlying auditory responses to learned vocalizations in the songbird nucleus HVC. Neuron, 39, 177–194.
Rosen, M. J., & Mooney, R. (2006). Synaptic interactions underlying song-selectivity in the avian nucleus HVC revealed by dual intracellular recordings. Journal of Neurophysiology, 95(2), 1158–1175.
Rotter, S., & Diesmann, M. (1999). Exact digital simulation of time-invariant linear systems with applications to neuronal modeling. Biological Cybernetics, 81(5/6), 381–402.
Roy, A., & Mooney, R. (2009). Song decrystallization in adult zebra finches does not require the song nucleus NIf. Journal of Neurophysiology, 102(2), 979–991.
Sakata, J. T., & Brainard, M. S. (2006). Real-time contributions of auditory feedback to avian vocal motor control. Journal of Neuroscience, 26(38), 9619–9628.
Sakata, J. T., & Brainard, M. S. (2008). Online contributions of auditory feedback to neural activity in avian song control circuitry. Journal of Neuroscience, 28(44), 11378–11390.
Sakata, J. T., Hampton, C. M., & Brainard, M. S. (2008). Social modulation of sequence and syllable variability in adult birdsong. Journal of Neurophysiology, 99(4), 1700–1711.
Scharff, C., & Nottebohm, F. (1991). A comparative study of the behavioral deficits following lesions of various parts of the zebra finch song system: Implications for vocal learning. Journal of Neuroscience, 11(9), 2896–2913.
Schrader, S., Diesmann, M., & Morrison, A. (2010). A compositionality machine realized by a hierarchic architecture of synfire chains. Frontiers in Computational Neuroscience, 4, 154.
Seki, Y., Suzuki, K., Takahasi, M., & Okanoya, K. (2008). Song motor control organizes acoustic patterns on two levels in Bengalese finches (Lonchura striata var. domestica). Journal of Comparative Physiology, 194(6), 533–543.
Shaevitz, S. S., & Theunissen, F. E. (2007). Functional connectivity between auditory areas field L and CLM and song system nucleus HVC in anesthetized zebra finches. Journal of Neurophysiology, 98(5), 2747–2764.
Softky, W. R., & Koch, C. (1993). The highly irregular firing of cortical cells is inconsistent with temporal integration of random EPSPs. Journal of Neuroscience, 13(1), 334–350.
Sohrabji, F., Nordeen, E. J., & Nordeen, K. W. (1990). Selective impairment of song learning following lesions of a forebrain nucleus in the juvenile zebra finch. Behavioral and Neural Biology, 53(1), 51–63.
Song, S., Miller, K. D., & Abbott, L. F. (2000). Competitive Hebbian learning through spike-timing-dependent synaptic plasticity. Nature Neuroscience, 3(9), 919–926.
Teramae, J.-n., & Fukai, T. (2008). Complex evolution of spike patterns during burst propagation through feed-forward networks. Biological Cybernetics, 99(2), 105–114.
Troyer, T. W., & Doupe, A. J. (2000a). An associational model of birdsong sensorimotor learning I. Efference copy and the learning of song syllables. Journal of Neurophysiology, 84(3), 1204–1223.
Troyer, T. W., & Doupe, A. J. (2000b). An associational model of birdsong sensorimotor learning II. Temporal hierarchies and the learning of song sequence. Journal of Neurophysiology, 84(3), 1224–1239.
Turrigiano, G. G., & Nelson, S. B. (2004). Homeostatic plasticity in the developing nervous system. Nature Reviews Neuroscience, 5, 97–107.
van Vreeswijk, C., & Sompolinsky, H. (1996). Chaos in neuronal networks with balanced excitatory and inhibitory activity. Science, 274, 1724–1726.
Vu, E., Mazurek, M., & Kuo, Y. (1994). Identification of a forebrain motor programming network for the learned song of zebra finches. Journal of Neuroscience, 14(11), 6924–6934.
Waddington, A., Appleby, P. A., de Kamps, M., & Cohen, N. (2010). Triphasic spike-time-dependent plasticity organizes networks to produce robust sequences of neural activity. (submitted).
Watanabe, A., & Aoki, K. (1998). The role of auditory feedback in the maintenance of song in adult male Bengalese finches Lonchura striata var. domestica. Zoological Science, 15, 837–841.
Weber, A. P., & Hahnloser, R. H. R. (2007). Spike correlations in a songbird agree with a simple Markov population model. PLoS Computational Biology, 3(12), e249.
Wennekers, T., & Palm, G. (1996). Controlling the speed of synfire chains. In C. von der Malsburg, W. von Seelen, J. C. Vorbrüggen, & B. Sendhoff (Eds.), Artificial neural networks – ICANN 96 (pp. 451–456). Berlin, Springer-Verlag.
Wild, J. M. (1994). Visual and somatosensory inputs to the avian song system via nucleus uvaeformis (Uva) and a comparison with the projections of a similar thalamic nucleus in a nonsongbird, Columba livia. Journal of Comparative Neurology, 349, 512–535.
Williams, H., & Vicario, D. (1993). Temporal patterning of song production: Participation of nucleus uvaeformis of the thalamus. Journal of Neurobiology, 24(7), 903–912.
Wohlgemuth, M. J., Sober, S. J., & Brainard, M. S. (2010). Linked control of syllable sequence and phonology in birdsong. Journal of Neuroscience, 30(39), 12936–12949.
Woolley, S. M., & Rubel, E. W. (1997). Bengalese finches Lonchura striata domestica depend upon auditory feedback for the maintenance of adult song. Journal of Neuroscience, 17(16), 6380–6390.
Woolley, S. M., & Rubel, E. W. (1999). High-frequency auditory feedback is not required for adult song maintenance in Bengalese finches. Journal of Neuroscience, 19(1), 358–371.
Woolley, S. M. N. (2008). Neuroscience of birdsong, Chapter 19. Auditory feedback and singing in adult birds, pp. 228–239. Cambridge University Press.
Woolley, S. M. N., & Rubel, E. W. (2002). Vocal memory and learning in adult Bengalese finches with regenerated hair cells. Journal of Neuroscience, 22(17), 7774–7787.
Yamada, H., & Okanoya, K. (2003). Song syntax changes in Bengalese finches singing in a helium atmosphere. Neuroreport, 14(13), 1725–1729.
Yamashita, Y., Takahasi, M., Okumura, T., Ikebuchi, M., Yamada, H., Suzuki, M., et al. (2008). Developmental learning of complex syntactical song in the Bengalese finch: A neural network model. Neural Networks, 21(9), 1224–1231.
Yamashita, Y., & Tani, J. (2008). Emergence of functional hierarchy in a multiple timescale neural network model: A humanoid robot experiment. PLoS Computational Biology, 4(11), e1000220.
Yu, A. C., & Margoliash, D. (1996). Temporal hierarchical control of singing in birds. Science, 273(5283), 1871–1875.

Acknowledgments

Partially funded by DIP F1.2, BMBF Grant 01GQ0420 to BCCN Freiburg, EU Grant 15879 (FACETS), EU Grant 269921 (BrainScaleS), Helmholtz Alliance on Systems Biology (Germany), Next-Generation Supercomputer Project of MEXT (Japan), Neurex, and the Junior Professor Program of Baden-Württemberg. The authors would like to thank Jun Nishikawa and Kentaro Katahira for stimulating and fruitful discussions. The computations were conducted on the high performance computer cluster of the CNPSN group at RIKEN BSI, Wako, Japan.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Author information

Authors and Affiliations

Functional Neural Circuits Group, Faculty of Biology, Albert-Ludwig University of Freiburg, Schänzlestrasse 1, 79104, Freiburg, Germany
Alexander Hanuschkin & Abigail Morrison
Bernstein Center Freiburg, Hansastr. 9A, 79104, Freiburg, Germany
Alexander Hanuschkin, Markus Diesmann & Abigail Morrison
Institute of Neuroscience and Medicine (INM-6), Computational and Systems Neuroscience, Research Center Jülich, Jülich, Germany
Markus Diesmann
RIKEN Computational Science Research Program, Wako City, Saitama, Japan
Markus Diesmann
RIKEN Brain Science Institute, Wako City, Saitama, Japan
Markus Diesmann & Abigail Morrison

Authors

Alexander Hanuschkin
Markus Diesmann
Abigail Morrison

Corresponding author

Correspondence to Alexander Hanuschkin.

Additional information

Action Editor: M. D. Israel Nelken

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

About this article

Cite this article

Hanuschkin, A., Diesmann, M. & Morrison, A. A reafferent and feed-forward model of song syntax generation in the Bengalese finch. J Comput Neurosci 31, 509–532 (2011). https://doi.org/10.1007/s10827-011-0318-z

Received: 08 November 2010
Revised: 28 January 2011
Accepted: 03 February 2011
Published: 15 March 2011
Issue Date: November 2011
DOI: https://doi.org/10.1007/s10827-011-0318-z
