Toward a study of gene regulatory constraints to morphological evolution of the Drosophila ocellar region

Aguilar-Hidalgo, Daniel; Becerra-Alonso, David; García-Morales, Diana; Casares, Fernando

doi:10.1007/s00427-016-0541-8

Toward a study of gene regulatory constraints to morphological evolution of the Drosophila ocellar region

Original Article
Open access
Published: 01 April 2016

Volume 226, pages 221–233, (2016)
Cite this article

Download PDF

You have full access to this open access article

Development Genes and Evolution Aims and scope Submit manuscript

Toward a study of gene regulatory constraints to morphological evolution of the Drosophila ocellar region

Download PDF

Daniel Aguilar-Hidalgo^1,2,
David Becerra-Alonso³,
Diana García-Morales¹ &
…
Fernando Casares¹

3576 Accesses
5 Citations
9 Altmetric
Explore all metrics

Abstract

The morphology and function of organs depend on coordinated changes in gene expression during development. These changes are controlled by transcription factors, signaling pathways, and their regulatory interactions, which are represented by gene regulatory networks (GRNs). Therefore, the structure of an organ GRN restricts the morphological and functional variations that the organ can experience—its potential morphospace. Therefore, two important questions arise when studying any GRN: what is the predicted available morphospace and what are the regulatory linkages that contribute the most to control morphological variation within this space. Here, we explore these questions by analyzing a small “three-node” GRN model that captures the Hh-driven regulatory interactions controlling a simple visual structure: the ocellar region of Drosophila. Analysis of the model predicts that random variation of model parameters results in a specific non-random distribution of morphological variants. Study of a limited sample of drosophilids and other dipterans finds a correspondence between the predicted phenotypic range and that found in nature. As an alternative to simulations, we apply Bayesian networks methods in order to identify the set of parameters with the largest contribution to morphological variation. Our results predict the potential morphological space of the ocellar complex and identify likely candidate processes to be responsible for ocellar morphological evolution using Bayesian networks. We further discuss the assumptions that the approach we have taken entails and their validity.

Genetic variation of morphological scaling in Drosophila melanogaster

Article 06 March 2023

Interlocking of co-opted developmental gene networks in Drosophila and the evolution of pre-adaptive novelty

Article Open access 15 September 2023

Gene expression analysis of potential morphogen signalling modifying factors in Panarthropoda

Article Open access 29 September 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

The evolution of animals has resulted in a staggering diversity of forms. But, what are the limits to morphological variation? The answer to this question requires considering that the shape of body parts is controlled by complex genetic programs operating during embryonic development. These programs integrate the action of many genes across growing fields of cells forming extensive developmental gene regulatory networks (“GRN”) (Arnone and Davidson 1997). Therefore, if form is determined to a large extent by gene networks, it follows that these networks should restrict the potential evolutionary routes to morphological variation (Oster et al. 1988; Kauffman 1993; Arthur 2006; Davidson and Erwin 2006; Felix 2012; Jaeger and Monk 2014), an idea first formulated by C. H. Waddington (Waddington 1957). Determining the potential range of phenotypes allowed by a particular GRN, however, is not straightforward, because gene networks are complex and their analysis often entails the combined use of model organisms and mathematical simulations. Examples of this combined approach in animal development are studies analyzing the contribution of gene network organization (or topology) to morphological variation of teeth (Salazar-Ciudad and Jernvall 2010; Harjunmaa et al. 2014), the number and pattern of digits in the tetrapod limb (Lopez-Rios et al. 2014; Raspopovic et al. 2014), the patterning of the Drosophila eggshell epithelium (Fauré et al. 2014), or the segmentation of the early Drosophila embryo ((Jaeger et al. 2004); see also Felix (2012) for a recent review).

Another experimental system well suited to study the relation between a developmental GRN and morphological variation is the ocellar region in dipterans. The ocellar region is part of the visual system of insects and is morphologically simple: it comprises three single-lens eyes (the ocelli) located at the vertices of a triangular cuticle patch on the insect dorsal head (Fig. 1a). Therefore, main quantitative traits in this system are the sizes of the ocelli and their separating (“interocellar”) distance. Interestingly, the ocellar region shows morphological variation in different fly species (Fig. 1c, d), which permits to explore not only the phenotypic variation induced experimentally in one model organism (D. melanogaster), but also the variation generated during evolution across species. Recently, our group generated a GRN model of the ocellar region patterning (Aguilar-Hidalgo et al. 2013). In this GRN, the evolutionary conserved Hedgehog (Hh) signaling pathway plays a pivotal role, controlling the specification of the two major fates (retina/ocellus and interocellar cuticle), as well as their size and spacing (Royet and Finkelstein 1996; Royet and Finkelstein 1997; Blanco et al. 2009; Brockmann et al. 2011; Aguilar-Hidalgo et al. 2013; Dominguez-Cejudo and Casares 2015) (Fig. 1b). One of the most interesting predictions derived from this GRN model was that random variations in parameter sets resulted in a non-random specific morphological space. Therefore, one potential application of the GRN model analysis could be the identification of the parameters controlling the paths to morphological variation within this restricted space. However, the GRN model in Aguilar-Hidalgo et al. (2013) was very complex (1 partial differential equation and 12 ordinary differential equations with 68 parameters, of which 32 were studied) which makes this sort of analysis cumbersome.

Here, we used a reduced “three-node” GRN that still recapitulates the expression patterns of key genes in the ocellar region. Our results indicate that the topology of the ocellar GRN defines a particular potential morphological space for the ocellar complex. In this GRN, quantitative changes in parameter values seem sufficient to explain the quantitative morphological variation found in nature without the need of gene network rewiring. Our analysis further identifies likely candidate processes to be responsible for ocellar morphological evolution.

Materials and methods

Fly species and Drosophila melanogaster strains

Drosophila melanogaster (strain Oregon-R), D. gunungcola, D. lutescens, D. lulchrella, D. guttifera, D. prolongata, D. ustulata, D. deflecta, D. fuyamai, D. suzukii, D. biarmipes, D. pseudoobscura, D. bipectinata, D. ananassae, D. sechellia, D. mauritiana, D. yakuba, D. parabipectinata, D. kikkawai, D. teissieri, D. santomea, D. takahashii, D. eugracilis, D. simulans, D. orena, D. erecta, D. willistoni, and Chymomyza pararufithorax were obtained as EtOH-preserved specimens from B. Prud’homme (IBDML, Marseille); D. virilis from J. Vieira (IBMC/I3S, Oporto); Megaselia abdita and Episyrphus balteatus (EtOH-preserved) from J. Jeager/K. Wotton (CRG, Barcelona/KLI, Vienna); Calliphora vicina from P. Simpson (U. Cambridge, Cambridge); and Ceratitis capitata and Bactrocerus oleae (EtOH-preserved) from M. Averof (IGFL, Lyon). D. hydei (strain KS13) was established as a culture at the CABD (Seville). Megaselia scalaris specimens were captured at the CABD fish facility; Musca domestica and other dipteran specimens were captured from the wild. The phylogenetic range of this collection spans about 150 million years (Myrs), with Phoridae (M. abdita and M. scalaris) having the oldest origin. The divergence time of Syrphidae has been set about 95 Myrs ago. The remaining species belong to Schizophora, with an estimated origin 75 Myrs ago (for an updated and detailed Dipteran phylogeny, please check (Wiegmann et al. 2011).

In addition, the following D. melanogaster strains were used: en-Z (en[xho25]; Flybase: FBti0002246); an hh-GAL4, UAS-GFP::Hh strain was used to monitor the Hh expression domain in the ocellar complex (Callejo et al. 2008).

Head cuticle preparation and measurements

Dorsal head cuticle pieces were dissected from adult or late female pharate heads in PBS and mounted in Hoyers’ solution/acetic acid (1:1), as described in (Casares and Mann 2000). Images were obtained in a Leica DM500B microscope with a Leica DFC490 digital camera. Measurements were carried out using the line measurement tool of ImageJ (http://imagej.nih.gov/ij/).

Immunostaining and imaging

Immunofluorescence in eye imaginal discs and embryos was carried out according to standard protocols. Antibodies used were mouse anti-eya (10H6; from Developmental Studies Hybridoma Bank, University of Iowa (http://dshb.biology.uiowa.edu/)) 1/200; rabbit anti-β-galactosidase antibody (Cappel), 1/1000; mouse anti-Ptc (gift from I. Guerrero, CBM-SO, Madrid), 1/100; rabbit anti-GFP (A11122, Molecular Probes), 1/1000. Alexa-conjugated anti-rabbit-488 and anti-mouse-555 secondary antibodies were used at a 1/1000 dilution. Image acquisition was carried out in a Leica SP2 AOBS confocal microscope. Images were processed with Adobe Photoshop CS5.

Model simulation

To simulate the three-node GRN ocellar region model, we first assume that the Hh profile is in steady state. We can assume this as we want to compare signaling patterns with sizes of differentiated tissues in adult flies, thus the development of the ocellar region is in steady state. Additionally, we do not consider tissue growth, but instead the Hh profile grows in a fixed-size grid. We solved the reaction–diffusion equation for Hh (equation S1) in steady state analytically, the solution of which is a spatial-dependent function Hh(x) (Eq. 1). This function serves as input to the three ordinary differential equations that show the spatial pattern for PtcHh, cubitus interruptus (CiA), and engrailed (En) (see Eqs. 2, 3, and 4 in Fig. 2d). Due to the high coupling between the three equations, which makes the analytical study of these equations difficult, we solved this system numerically following a finite differences scheme. We impose homogeneous initial conditions for the three variables and run the simulation with a stop criterion satisfying stationary profiles to the three variables. Specifically, we use as stop criterion that the Norm-2 of difference between the profile of each variable and the previous one in the finite differences scheme is less than 0.01. The model was implemented using Matlab software.

Parameter sensitivity analysis and phenotypic phase space

To perform the parameter sensitivity analysis, we run simulations in the model fixing all the parameters to a control value but one, which is randomized over two orders of magnitude around its control value. This process is repeated for each parameter. The resulting CiA pattern of the simulations (A) is compared to the pattern obtained by the control set of parameters (B). We measure the Euclidean distance (λ) between the two normalized patterns to obtain a goodness value for the randomized simulation (Eq. ##6).

$$ \lambda =\left\Vert \overrightarrow{AB}\right\Vert =\sqrt{{\displaystyle \sum_i{\left({b}_i-{a}_i\right)}^2}} $$

(6)

where a _i and b _i are the components of vectors A and B, respectively.

The distance distributions are shown in Fig. S1 (considered as complementary distance, 1-λ) for all the parameters. From this analysis, we can extract important information about which parameters are more sensitive or more insensitive to variations away from the control parameter values. A complementary distance value of 0.8 was selected as a “goodness” threshold, as every pattern checked for a parameter set with a complementary distance value equal or higher to this value fits the target ocellar pattern. Following this “goodness” threshold, every parameter whose distance distribution falls below 0.8 is considered “sensitive.” We find that all the parameters in the simplified model can be considered as sensitive.

To evaluate whether the simplified model shows a restricted phenotype space of the ocellar region, we performed simulations (N = 9000) with randomized parameters, modifying random seeds, within three goodness intervals 1-λ ≥ 0.8 (“good”), 0.8 > 1-λ ≥ 0.6 (“medium”), and 0.6 > 1-λ ≥ 0.4 (“bad”), (3000 simulations each) (Fig. 3a). Effective Hh diffusion coefficient D was varied in the following ranges: good = [0.068, 0.109], medium = [0.068, 0.010], and bad = [0.068, 0.010] in μm² s⁻¹. The effective turnover of Hh, β_Hh, was varied in the following ranges: good = [2.1, 2.5], medium = [1.5, 2.1], and bad = [1.0, 1.5] in 10⁻⁴ s⁻¹. Figure S2 shows two morphospace samples of the three-node GRN (A) and including parameters D and β_Hh (B). Both samples contain 9000 points each with the same random seed.

Phenotypic classification using Bayesian networks

In this work, the same dataset of parameters is used to attempt the prediction of three types of phenotype class: ocellar size (OC), interocellar cuticle size (IOC), and Near/Far (NF). Thus, three different classification problems are attempted with the same machine learning method. For each parameter set (instance), we calculated λ_CiA, λ_En, and (λ_CiA ² + λ_En ²)^1/2 for the class OC, IOC, and NF, respectively. In OC, values with λ_CiA < 0 (λ_CiA > 0) received class value 0 (1). In IOC, values with λ_En < 0.15 (λ_En ≥ 0.15) received class value 0 (1). And, in NF, values with (λ_CiA ² + λ_En ²)^1/2 < 0.3 ((λ_CiA ² + λ_En ²)^1/2 ≥ 0.3) received class value 0 (1). Learning takes place in the following way:

1.
The instances in the dataset are divided in ten subsets. Each subset must have a collection of instances that is representative of the whole dataset.
2.
Subset 1 is chosen as a test subset, while the remaining subsets are used to train the learning method.
3.
The machine learning method takes the training subset and infers the relationship between parameters needed to determine the phenotype class for every instance.
4.
This learning process is then validated using the test subset, comparing the actual phenotype classes with the ones predicted by the machine learning method. The success rate (percentage of classes correctly predicted) is called predictive accuracy.
5.
Steps 2–4 are repeated using each one of the ten subsets as test subsets, while the other nine subsets are used as training subsets in each case. This system of swapping subsets as tests is called 10-fold cross validation. It is used to increase the chances of having a representative test sample.
6.
The test test predictive accuracies obtained from this repetition are averaged, giving a final predictive accuracy for this method, using this dataset.

A BN learns from the data provided by arranging the parameters in an ascending network, where the relative probability between parameters is established. Thus, the heuristics of BN returns a network of relative probabilities between parameters. Parameters are related to one another by probability distributions, according to the frequency (combined or not), with which a certain parameter has a certain value. For example, in order to establish the statistical relationship between a parent parameter and a child parameter, the question being asked is as follows: provided that this child parameter (for this particular instance) has a certain value X, what range of values are expected on this parent parameter, and what probabilities are assigned to those ranges? These probabilities are expressed with the basic formula of Bayes’ theorem:

Knowing

The frequency P(Y) with which a parent parameter has a value Y.
The frequency P(X) with which a child parameter has a value X.
The relative frequency P(X|Y) with which, having Y in the parent parameter, we have X in the child parameter.

We can obtain the relative frequency P(Y|X) with which, having X in the child parameter, we have Y in the parent parameter, according to

$$ P\left(Y\left|X\right.\right)=\frac{P(Y)P\left(X\left|Y\right.\right)}{P(X)} $$

As we climb up the network of parameters, these ranges and probabilities are refined in accordance to an optimal classification, thus maximizing the predictive accuracy. The final inference is made from the topmost parameter or parameters to the class. This is when the network class is decided. The connections in these networks are averaged in one final network that represents the overall connections of the parameters of a certain dataset needed to correctly classify instances. Since, in this work, the aim is to classify three different phenotypic classes using the same set of parameters, it follows that three networks (one per classification problem) were obtained from the BN method.

A BN, as represented in this work (see Fig. 4b), is read from the bottom up. At the top of the network lies the phenotype class. Parameters on higher levels of the network are considered as parents of the parameters immediately below. Parameters are related to one another by probability distributions, according to the frequency (combined or not), with which a certain parameter has a certain value (or lies within a certain interval). As we climb up the network of parameters, the intervals and probabilities are refined in accordance to an optimal classification, thus maximizing the predictive accuracy. The final inference is made from the topmost parameter or parameters to the class. This is when the class is decided. Classification with Bayesian networks was performed using WEKA 3.7.11 (Hall 2009).

Results

A simplified GRN model recapitulates the ocellar pattern and predicts a specific morphological space for the ocellar region

The ocellar region (Fig. 1a) arises from the fusion along the dorsal midline of the left and right cephalic primordia (often called “eye-antennal imaginal discs”; Fig. 1b). In each primordium, a single Hedgehog (Hh)-producing domain provides cells with positional information, by generating a signaling gradient. Signaling activity can be visualized using the expression levels of patched (Ptc) as its readout (Fig. 2a, d). This is so because Ptc, in addition to being the Hh receptor, is a positive target of the pathway—i.e., the levels of Ptc increase as the signal intensity increases (Chen and Struhl 1996). Activation of the Hh pathway leads to the stabilization of the activator form of CiA, the Gli-type transcription factor that mediates the nuclear transduction of the pathway (Alexandre et al. 1996). The Hh signaling gradient is then translated into two cell fates. At its highest levels, and basically coinciding with the Hh-producing cells, the pathway activates the expression of the transcriptional repressor En. This leads to a pathway shut off, as En represses the transcription of ci and ptc. This signaling-off region gives rise to the interocellar cuticle (IOC). Maintenance of En expression in the IOC region requires Delta (Dl)/Notch signaling (Aguilar-Hidalgo et al. 2013). Flanking the Hh-producing/En-expressing domain, graded Hh signaling results in the stabilization of CiA which, in turn, activates the expression of genes that specify the ocellar retinas, including eyes absent (eya) (Blanco et al. 2009) on both sides of the Hh-producing domain (Fig. 2b). During metamorphosis, as the two cephalic primordia fuse, the two anterior eya domains merge into the anterior (or medial), unpaired ocellus (aOC), while the two posterior domains remain separate and form the two posterior (or lateral) ocelli (pOC). As mentioned above, the region in between the two eya patches expresses the transcription factor En and forms the intervening IOC in the adult (see Fig. 1b). Therefore, the early patterning of the ocellar region entails the generation of basically two cell fates (OC and IOC), the control of their respective size, and their spacing into an “OC–IOC–OC” pattern.

As mentioned, the evolutionary conserved Hh signaling pathway plays a pivotal role in these processes of fate assignment and size control. Although the pattern is bidimensional, it can be simplified as a monodimensional process along the anteroposterior axis (Aguilar-Hidalgo et al. 2013), and described by two variables, the lengths of the OC and the IOC distance (schematized in Fig. 2d). A previous model of the detailed GRN, including 13 molecules (such as CiA) or molecular complexes (such as Ptc/Hh) as network’s nodes, predicted that the phenotypic space available to the GRN (i.e., the sets of OC and IOC lengths) was limited. This being so, the analysis of the model could identify the parameter, or subset of parameters with the largest impact on size variation. However, the size and complexity of the model makes this analysis difficult. To make this analysis more tractable, we resorted to a simplified GRN model that retains critical genetic/molecular interactions and which we showed previously that recapitulates the ocellar pattern (Aguilar Hidalgo et al. 2015) (see Fig. 2c). Pattern in this GRN is dependent on the specific topology of a core regulatory network motif containing an activator–repressor regulatory mechanism describing the dynamics of 3 variables with 16 parameters, what we call the 3-node GRN (Aguilar Hidalgo et al. 2015). We solved the model to find the steady state pattern—i.e., the final, stable pattern that is reflected in the adult ocellar complex. As the equations of the three-node GRN contain nonlinear terms, we chose to solve these numerically. Hh (Eq. 1) then serves as source for PtcHh complex production (Ptc being Hh receptor, Eq. 2), which activates the production of CiA (Eq. 3). CiA favors the maintenance of PtcHh and can activate expression of En (Eq. 4) and eya, the two readouts of the model. En is a low-sensitivity Hh target and a repressor of the pathway components CiA and PtcHh (and therefore, of eya). Above a certain concentration threshold ζ_En, En is self-maintained (genetically, this step requires the Dl/Notch pathway (Aguilar-Hidalgo et al. 2013), Eqs. 4 and 5) and becomes independent on the Hh signaling. Due to En being a low-sensitivity target, En is only self-maintained in the zone of maximal Hh concentration that closely corresponds to the Hh-producing domain. The En-expressing domain gives rise to the IOC region. In regions adjacent to the Hh-producing domain, where the Hh concentrations are not enough as to activate En, CiA is stabilized and eya expression is induced, generating the OC domains. Because eya expression is induced by CiA, in the model, CiA is used as a marker of OC identity. Therefore, the variables that define the morphology of the ocellar complex are lengths of the En and CiA domains, which represent the IOC and OC regions, respectively.

In order to find the parameters for which small variations caused significant deviations from “control” OC and IOC lengths (see the “Materials and methods” section), which represents D. melanogaster, we first performed an individual sensitivity analysis for each of the 16 three-node GRN parameters. To find a metric for this deviation, we calculated the distance between the control pattern and the patterns generated by varying each of the parameters. We established three thresholds for the complementary of this distance (1-λ): 1-λ ≥ 0.8 (good), 0.8 > 1-λ ≥ 0.6 (medium), and 0.6 > 1-λ ≥ 0.4 (bad), with 1-λ ≥ 0.8 giving the patterns closest to the control. This analysis showed that every parameter in the three-node GRN is sensitive to small variations, as their distributions mostly fall below the 0.8-threshold (Fig. S1). Then, we performed simulations using randomized values (from the good, medium, and bad intervals) for every parameter simultaneously to generate a point (a “phenotype”) in the phase space. Therefore, this phase-space is a “phenotype space” or “morphospace”. The axes of this phenotype space represent the deviations of the lengths of the CiA and En expression domains (λ_CiA, λ_En) from the control (at (0, 0)). For example, −0.20, 0.25 would be an ocellar complex with smaller OC (λ_CiA = −0.20) and larger IOC (λ_En = +0.25) than the control. We found that (1) the simulations with randomized parameter sets show a non-random distribution, yielding a sort of “butterfly wing” pattern in the phenotype space (Fig. 3a); in addition, (2) the model may yield very similar phenotypes even when the randomized parameters come from different goodness intervals (i.e., the results, expressed as a point (λ_CiA, λ_En) in the morphospace, lie close to one another) (Fig. 3a). (3) However, we also find that the “goodness” of parameters biases the distribution of solutions in the morphospace. Thus, parameter values chosen from the “good” interval mostly result in larger OC than the control (i.e., positive λ_CiA), while medium and bad parameter values avoid larger OC and smaller IOC values. In addition, globally considered, parameter variation in the three-node GRN tends to yield ocellar regions with larger IOC (i.e., λ_En > 0) (Fig. 3a). Although our study focuses on the intracellular GRN driving the ocellar pattern, we analyzed to what extent the variation of parameters affecting the gradient of Hh affected the shape of the morphospace. Specifically, we varied the effective Hh diffusion coefficient D and the effective turnover of Hh, β_Hh, as these parameters together define the gradient’s length scale λ = (D/β_Hh)^1/2 (see Eq. 1). We found that the extended morphospace that resulted distributed medium and bad parameter spreads slightly further away from the control values. However, globally, the extended morphospace is very similar to the three-node GRNs with a fixed Hh gradient (Fig. S2). Therefore, the intracellular GRN determines, to a great extent, the ocellar complex phenotype space. In what follows, we continue our analysis of the intracellular three-node GRN without considering variations in the extracellular Hh gradient.

Quantitative phenotypic variation of the ocellar region in different fly species

The study of the phenotype space allowed by the three-node GRN predicted that simultaneous variation of all parameters (by assigning each parameter a random value within a certain interval; see “Materials and methods” section) should result in non-random phenotypes—i.e. the phenotypic space available for morphological variation is limited. To test whether this prediction agrees with the phenotypic variation observed in actual fly species, we measured the length of the anterior and posterior OC and the IOC distance in a sample of 41 fly species (Fig. 3b). To account for body size differences, these measurements were normalized using the inter-anterior occipital bristle distance, as a proxy of head width. Only females were measured. The species set surveyed is not comprehensive across Schizophoran flies and is strongly biased toward Drosophilidae species close to D. melanogaster, for which we had the easiest access to (see “Materials and methods” section). When plotted, the distribution of (λ_OC, λ_IOC), which represents the variation in the respective OC and IOC lengths (only pOC were used) relative to D. melanogaster, showed a pattern resembling the butterfly wing pattern predicted by the model (Fig. 3b).

In general, we find that species belonging to groups far away from Drosophilidae show the most divergent morphologies. Such is the case of M. abdita (Phoridae, no. 32 in Fig. 3b), E. balteatus (Syrphidae, no. 40 in Fig. 3b), or M. domestica (Muscidae, no. 33 in Fig. 3b). This qualitative similarity in distributions is best observed when the predicted and measured phenotypic spaces are overlapped (Fig. 3c). Although the similarity noticed is purely qualitative and based on a limited sample of species, and therefore still has to be regarded as preliminary, we find that it lends support to the idea that, in nature, the phenotypic variability available to the ocellar region is also restricted and follows similar patterns as those predicted by the model.

Machine learning method finds parameter relations defining ocellar and interocellar sizes

For each parameter set, the three-node GRN yields a value for the OC and IOC lengths—i.e., defines a point in the phenotypic space. But, does every parameter contribute equally to localize a point in this space or, instead, one parameter (or a subset of parameters) has a major contribution to determining the localization of this point—that is, to morphological variation? If the latter were the case, the identification of this set of control parameters may point to genetic/molecular links of particular relevance in controlling the OC and IOC lengths.

In order to establish a relationship between the parameters in the three-node GRN and the morphological variation of the ocellar region, we can envision a number of potential approaches. A developmental genetics approach, without prior knowledge, would entail the systematic perturbation of the genetic links implicit in the 16 parameters of the model, alone and in combination. A quantitative genetics approach (QTL) would be capable of identifying important elements of the network, but it would be limited to cross-hybridizing species showing significant differences in ocellar morphology. In addition, a QTL approach could be capable of identifying causes for existent variation, not for all potential variation. From a numerical perspective, the full parameter space is vast. An alternative to dynamical model simulation analysis could be the use of classification methods to infer morphological variation directly from the randomized parameter vectors. One such method is Bayesian networks (BNs) (Pazzani 1996; Friedman et al. 1998; Keogh and Pazzani 1999). A BN is an acyclic, directed graph connecting a series of variables linked by their conditional probabilities (non-linked variables are independent from each other). These BNs can be used to compute the probability of a given output. In our case, the variables are the 13 parameters of the 3-node GRN, and the output is whether a phenotype (a point in the (λ_CiA, λ_En) plane) falls within a given region of this space. As we climb up the network of parameters, the conditional probabilities maximize the predictive accuracy (for a more detailed description of the BN learning method and classification, please see the “Materials and methods” section). Specifically, we used this method to try to identify relevant parameters for morphological variation.

We subdivided the phenotypic space into three different morphological classes: (1) OC smaller or larger than the control (left: λ_CiA < 0 or right: λ_CiA > 0, respectively); (2) small or large IOC (up: λ_En < 0.15 or down: λ_En ≥ 0.15, respectively), and (3) Near/Far (N/F), which distinguishes between positions in the phenotypic space that are more or less similar (“near” or “far”, respectively) to the control. In this case, we impose the same sign to the size variation of the OC and IOC—that is, large OC with large IOC and small OC with small IOC. Specifically, a point is near the control (i.e., it is “similar”) if it is located inside a circumference with radius 0.3. If the point is located outside the circumference, it is classified as far from the control (see Fig. 4a). Note that we consider only points with λ_En ≥ 0 due to the low number of points with λ_En < 0 (i.e., the model does not yield many cases of ocelli smaller than the control). We applied BN analysis to identify parameters which, when covaried, localize points to one of these zones. For each class, the BN heuristics returned a network of relative probabilities between parameters, with very good classification results (90.35 % for N/F, 96.23 % for OC, and 94.58 % for IOC). The analysis of the three networks, which establish a hierarchy of relations between parameters (in Fig. 4b, the networks includes the set of eight parameters with the highest classification value), resulted in a number of observations. First, the three BNs show the same nodes in a similar hierarchy, despite the fact that they inform about different phenotype classes. This implies that the same genetic interactions (represented by parameters in the model) control the variation of different phenotypic classes. Second, the three topmost parameters in each BN suffice for a good classification. These three parameters include, with decreasing relevance, the one determining the transcriptional efficiency by which CiA activates Ptc expression (α_CiA–PtcHh), the intensity of repression of CiA by En (α_En–CiA), and α_En–En, which controls en autoregulation.

To validate the BN results, we compared the morphospace generated when the three predicted control parameters (α_CiA–PtcHh, α_En–CiA, and α_En–En) were randomly covaried with the morphospace resulting from the overlap of the three simulations generated when each of the parameters were varied individually. While the morphospace resulting from parameter covariation recapitulated most of the butterfly wing pattern (Fig. 4 (C1)), the ones resulting from varying the parameters individually matched the butterfly wing pattern much more poorly (Fig. 4 (C2)). Still, covariation of the three top-ranked parameters missed the “right forewing” (i.e., λ_CiA > 0, λ_En > 0). We sought among the five remaining parameters in the BNs the parameter or parameters that, when covaried, showed the missing “wing.” We found that β_En, which correspond to the degradation rate of En, when covaried with the three top parameters in the BNs, yielded the “butterfly wing” pattern (Fig. 4 (C3)). Again, this pattern was just sketched when the four parameters were independently randomized and their patterns overlapped (Fig. 4 (C4)). This analysis indicates that the control of morphological variations in the ocellar region requires the cooperation of four major parameters. In addition, we noted that, of the 16 parameters, those corresponding to non-linear terms in the model, such as Hill coefficients, have the least relevance in the classification in the three BNs (OC, IOC, and N/F) (not included in the BNs in Fig. 4). Finally, although similar, the exact topology of the three networks varies, with the BN for OC size being the most connected.

Discussion

In this paper, we have studied the ocellar GRN, as an example of gene network regulated by the Hh morphogen, to predict the range of available phenotypic space for morphological variation and tried to predict parameters within this network with a major effect in controlling that morphological variation. We have found that a simple three-node GRN that recapitulates the pattern of the ocellar region predicts restrictions to variations in the size of the ocelli (OC) and the distance in between the ocelli (IOC). When measured, the distribution of OC and IOC lengths from a sample of dipteran species seemed to follow, qualitatively, the same distribution in the phenotypic space predicted by the model. We take this result as lending support to the notion that the GRN structure indeed restricts the evolvability not only of the model’s output but also of its real surrogate, as these restrictions would be reflected by the actual phenotypes found in nature. However, as we noted, this conclusion is tentative. First, because the sample of species is not sufficiently large and comprehensive across the higher dipterans; second, because the morphologies in extant species may as well be the result of natural selection—i.e., the pattern of morphologies observed having been shaped by functional constraints, such as ocellar regions having an IOC length above a certain limit, to allow the aOC and pOC to scan separate regions of vision (however, for this particular example, we note that the model also predicts that too short IOC distances are unlikely). We believe that most likely, the actual phenotypes have resulted from the action of natural selection of the advantageous phenotypes from the morphospace allowed by the GRN’s structure.

To more precisely define the contribution of gene regulatory steps to shaping the ocellar morphospace, we envision two approaches. A developmental genetics approach in which, by using a priori information of the most likely relevant parameters, the morphological variation of allelic series in genes affecting those parameters is used to compare the predicted to the actual phenotypes measured in each allelic combination. A second approach would be a comparative one: to increase the size and breadth of the sample of dipteran species studied to examine how closely their ocellar morphology maps are within the predicted butterfly-shaped morphospace, so that the closer the correlation, the more likely that the phenotypic range is determined by the GRN structure.

Basic to our approach to studying, the role that gene network structure has in controlling the evolvability of the ocellar region (as a model of an Hh-patterned organ) is the assumption that the GRN structure remains constant in the species we examine. This allows us to compare different morphologies generated by the same GRN structure through the sole quantitative variation of its parameters. Although this assumption may seem a strong one, we think it is justified. The three-node GRN comprises a set of Hh-related regulatory linkages that have been shown to be operating in other developmental contexts, including an Hh source and a steady-state Hh gradient; the basic Hh signal transduction path hh→Ptc/Hh→CiA→Ptc/Hh or the CiA→En–ΙCiA repression feedback. This likely also extends to the activation of retinal genes, such as eya, by the Hh signaling pathway—i.e., they can be considered conserved regulatory modules, or “kernels” (Davidson et al. 2003), and therefore they are likely to be invariant in the network. Even if new nodes were to appear during evolution, it is conceivable that their effect could be incorporated as a quantitative variation of some of the parameters that define the network. For example, recent work (Dominguez-Cejudo and Casares 2015) has shown that the Six3-type transcription factor Optix is expressed in the aOC and not in the pOC during development in D. melanogaster. During larval development the aOC, primordium is smaller than the pOC primordium (DGM, FC, unpublished). One hypothesis is that Optix would modify some OC-controlling parameters in the network leading to a smaller-sized aOC. If this were the case, Optix’s action could be modeled implicitly as the variation of one parameter (specifically affecting the aOC) without the need to add it explicitly to the network model. Therefore, the network would still be of use to explore the potential range of morphologies even if not containing explicitly all the playing genes and interactions, provided that these elements and interactions can be represented implicitly in the model equations, and that they do not alter the three-node network’s structure (note that our three-node model is symmetrical—i.e., it does not consider potential regulatory differences between anterior and posterior OC). We have circumscribed our analysis to dipterans as we can more confidently assume the conservation of the GRN structure. Whether this model is applicable to other insects depends on whether the ocellar GRN is conserved beyond dipterans in these groups.

In principle, one of the advantages of the use of models is the possibility to extract information relevant to the behavior of the biological process modeled. If we accept the assumption that the GRN structure remains constant within higher Diptera (see above), an important point is to determine how parameter variation impacts morphological variation. The parameters in the model are surrogates of biochemical rate constants, including those for protein–protein interactions (i.e., activation of the Ptc receptor (as PtcHh) by its ligands), protein degradation and, most importantly, activating or repressing protein–DNA interactions between transcription factors and cis-regulatory elements. As sequence variation is generated in a given population, a mixture of variants will be combined in each individual of this population. Therefore, it is of interest to analyze the combined effects of allelic variants (i.e., parameter variants), rather than of individual variants, on the final morphology of the system. Even in our relatively simple three-node GRN, a comprehensive analysis of parameter covariation entails long calculations. Although doable, we have opted to introduce an alternative approach: the use of Bayesian networks to identify the most relevant parameters in defining a particular morphological class and their probabilistic relationship. This approach has been recently used to identify critical interleukins within the murine cytokine–hormonal network (Field et al. 2015). In our BN analysis, four parameters stand out as most relevant: α_CiA–PtcHh, α_En–CiA, α_En–En, and β_En. The first three are transcriptional regulatory steps. α_CiA–PtcHh represents the activation rate of Ptc (which engages with Hh in an active PtcHh signaling complex) by the activator form of the Gli transcription factor ci: CiA. α_En–CiA reflects the repressing action of En on ci transcription (represented in the model as CiA repression), a regulatory step that controls the establishment of the IOC; and α_En–En, which maintains the IOC region in the CiA-repressed region. We propose that these parameters, jointly, may be responsible for most of the morphological variation seen in the ocellar region in different species.

Another observation derived from the BN analysis is that variation in OC length is defined, at least probabilistically, by a more connected network than for the IOC length. This suggests to us that morphological variation of OC size is genetically more complex than that of the IOC. The N/F BN shows an intermediate complexity, as it reflects the phenotypic covariation of OC and IOC. Finally, we noted that the eight parameters with significant contribution to defining morphological classes were linear terms in our model. The non-linear terms that include, for example, the Hill constants have been shown to be required for the system’s stability (Aguilar Hidalgo et al. 2015). Therefore, from a modeling perspective, morphological variation is basically defined by the linear terms (transcriptional activations and repression and decay constants).

This study, combining GRN modeling and machine learning with biological measurements, indicates that morphological variation in the ocellar region is limited by the specific topology of its GRN and identifies a very short list of biochemical parameters, mostly representing transcriptional regulatory steps that jointly control such variation. These results reinforce the notion that, as a general principle, the potential for morphological variation of organs is limited by the specific regulatory interactions governing their development, and that morphological variation can be the result of combination of genetic variants that modify, simultaneously, several biochemical parameters within those interactions.

References

Aguilar Hidalgo D, Lemos MC, Cordoba A (2015) Core regulatory network motif underlies the ocellar complex patterning in Drosophila melanogaster. Physica D: Nonlinear Phenom 295–296:91–102
Article Google Scholar
Aguilar-Hidalgo D, Dominguez-Cejudo MA, Amore G, Brockmann A, Lemos MC, Cordoba A, Casares F (2013) A Hh-driven gene network controls specification, pattern and size of the Drosophila simple eyes. Development 140(1):82–92
Article CAS PubMed Google Scholar
Alexandre C, Jacinto A, Ingham PW (1996) Transcriptional activation of hedgehog target genes in Drosophila is mediated directly by the cubitus interruptus protein, a member of the GLI family of zinc finger DNA-binding proteins. Genes Dev 10(16):2003–2013
Article CAS PubMed Google Scholar
Arnone MI, Davidson EH (1997) The hardwiring of development: organization and function of genomic regulatory systems. Development 124(10):1851–1864
CAS PubMed Google Scholar
Arthur W (2006) Biased embryos and evolution. Cambridge University Press
Blanco J, Seimiya M, Pauli T, Reichert H, Gehring WJ (2009) Wingless and Hedgehog signaling pathways regulate orthodenticle and eyes absent during ocelli development in Drosophila. Dev Biol 329(1):104–115
Article CAS PubMed Google Scholar
Brockmann A, Dominguez-Cejudo MA, Amore G, Casares F (2011) Regulation of ocellar specification and size by twin of eyeless and homothorax. Dev Dyn 240(1):75–85
Article PubMed Google Scholar
Callejo A, Culi J, Guerrero I (2008) Patched, the receptor of Hedgehog, is a lipoprotein receptor. Proc Natl Acad Sci U S A 105(3):912–917
Article CAS PubMed PubMed Central Google Scholar
Casares F, Mann RS (2000) A dual role for homothorax in inhibiting wing blade development and specifying proximal wing identities in Drosophila. Development 127(7):1499–1508
CAS PubMed Google Scholar
Chen Y, Struhl G (1996) Dual roles for patched in sequestering and transducing Hedgehog. Cell 87(3):553–563
Article CAS PubMed Google Scholar
Davidson EH, Erwin DH (2006) Gene regulatory networks and the evolution of animal body plans. Science 311(5762):796–800
Article CAS PubMed Google Scholar
Davidson EH, McClay DR, Hood L (2003) Regulatory gene networks and the properties of the developmental process. Proc Natl Acad Sci U S A 100(4):1475–1480
Article CAS PubMed PubMed Central Google Scholar
Dominguez-Cejudo MA, Casares F (2015) Anteroposterior patterning of Drosophila ocelli requires an anti-repressor mechanism within the hh pathway mediated by the Six3 gene Optix. Development 142(16):2801–2809
Article CAS PubMed Google Scholar
Fauré A, Vreede BMI, Sucena É, Chaouiya C (2014) A discrete model of drosophila eggshell patterning reveals cell-autonomous and juxtacrine effects. PLoS Comput Biol 10(3):e1003527
Felix MA (2012) Evolution in developmental phenotype space. Curr Opin Genet Dev 22(6):593–599
Article CAS PubMed Google Scholar
Field SL, DAsgupta T, Cummings MR, Savage RS, Adebayo J, McSara H, Gunawardena J, Orsi NM (2015) Bayesian modeling suggests that IL-12 (p40), IL-13 and MCP-1 drive murine cytokine networks in vivo. BMC Syst Biol 9:76
Article PubMed PubMed Central Google Scholar
Friedman N, Goldszmidt M, Lee TJ (1998) Bayesian network classification with continuous attributes: getting the best of both discretization and parametric fitting. ICML 98:179–187
Google Scholar
Hall M (2009) The WEKA data mining software: an update. ACM SIGKDD Explor Newsl 11(1):10–18
Article Google Scholar
Harjunmaa E, Seidel K, Hakkinen T, Renvoise E, Corfe IJ, Kallonen A, Zhang ZQ, Evans AR, Mikkola ML, Salazar-Ciudad I et al (2014) Replaying evolutionary transitions from the dental fossil record. Nature 512(7512):44–48
CAS PubMed PubMed Central Google Scholar
Jaeger J, Monk N (2014) Bioattractors: dynamical systems theory and the evolution of regulatory processes. J Physiol 592(Pt 11):2267–2281
Article CAS PubMed PubMed Central Google Scholar
Jaeger J, Surkova S, Blagov M, Janssens H, Kosman D, Kozlov KN, Manu, Myasnikova E, Vanario-Alonso CE, Samsonova M et al (2004) Dynamic control of positional information in the early Drosophila embryo. Nature 430(6997):368–371
Article CAS PubMed Google Scholar
Kauffman SA (1993) The origins of order. Oxford University Press
Keogh E, Pazzani M (1999) Learning augmented bayesian classifiers: a comparison of distribution-based and classification-based approaches. Seventh International Workshop on Artificial Intelligence and Statistics
Lopez-Rios J, Duchesne A, Speziale D, Andrey G, Peterson KA, Germann P, Unal E, Liu J, Floriot S, Barbey S et al (2014) Attenuated sensing of SHH by Ptch1 underlies evolution of bovine limbs. Nature 511(7507):46–51
Article CAS PubMed Google Scholar
Oster G, Shubin N, Murray JD, Alberch P (1988) Evolution and morphogenetic rules: the shape of the vertebrate limb in ontogeny and phylogeny. Evolution 42(5):862–884
Article Google Scholar
Pazzani MJ (1996) Searching for Dependencies in Bayesian Classifiers. Springer
Raspopovic J, Marcon L, Russo L, Sharpe J (2014) Modeling digits. Digit patterning is controlled by a Bmp-Sox9-Wnt turing network modulated by morphogen gradients. Science 345(6196):566–570
Article CAS PubMed Google Scholar
Royet J, Finkelstein R (1996) Hedgehog, wingless and orthodenticle specify adult head development in Drosophila. Development 122(6):1849–1858
CAS PubMed Google Scholar
Royet J, Finkelstein R (1997) Establishing primordia in the Drosophila eye-antennal imaginal disc: the roles of decapentaplegic, wingless and hedgehog. Development 124(23):4793–4800
CAS PubMed Google Scholar
Salazar-Ciudad I, Jernvall J (2010) A computational model of teeth and the developmental origins of morphological variation. Nature 464(7288):583–586
Article CAS PubMed Google Scholar
Waddington CH (1957) The strategy of the genes: a discussion of some aspects of theoretical biology. Ruskin House/George Allen and Unwin Ltd., London
Google Scholar
Wiegmann BM, Trautwein MD, Winkler IS, Barr NB, Kim JW, Lambkin C, Bertone MA, Cassel BK, Bayless KM, Heimberg AM et al (2011) Episodic radiations in the fly tree of life. Proc Natl Acad Sci U S A 108(14):5690–5695
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgments

Open access funding provided by Max Planck Society (Max Planck Institute for the Physics of Complex Systems). We thank J. Culí, B. Prud’Homme, P. Simpson, J. Jaeger, K. Wotton, and M. Averof for the flies and D. G. Míguez and M. Popovic for critically reading the manuscript. Research at the Casares laboratory is funded by the Spanish Ministry for Economy and Innovation (MINECO) and Feder Funds through grant BFU2012-34324 to FC. DGM is a MINECO PhD Fellow. DBA was supported in part by the Spanish Inter-Ministerial Commission of Science and Technology under Project TIN2014-54583-C2-1-R, the European Regional Development fund, and the “Junta de Andalucía” (Spain), under Project P2011-TIC-7508. We also thank A. Iannini for technical assistance, the Developmental Studies Hybridoma Bank, University of Iowa, for antibodies, and the CABD Advanced Light Microscopy Facility for their help with confocal imaging and members of the Casares lab for discussions.

Author information

Authors and Affiliations

CABD (Andalusian Centre for Developmental Biology), CSIC-UPO-JA, Campus Universidad Pablo de Olavide, 41013, Seville, Spain
Daniel Aguilar-Hidalgo, Diana García-Morales & Fernando Casares
Max Planck Institute for the Physics of Complex Systems, Nöthnitzer Straße 38, 01187, Dresden, Germany
Daniel Aguilar-Hidalgo
Universidad Loyola Andalucía (AYRNA), 41014, Seville, Spain
David Becerra-Alonso

Authors

Daniel Aguilar-Hidalgo
View author publications
You can also search for this author in PubMed Google Scholar
David Becerra-Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Diana García-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Casares
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Daniel Aguilar-Hidalgo or Fernando Casares.

Additional information

Communicated by Nico Posnien and Nikola-Michael Prpic

This article is part of the Special Issue “Size and Shape: Integration of morphometrics, mathematical modeling, developmental and evolutionary biology”, Guest Editors: Nico Posnien—Nikola-Michael Prpic.

Electronic supplementary material

Below is the link to the electronic supplementary material.

ESM 1

(PDF 221 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Aguilar-Hidalgo, D., Becerra-Alonso, D., García-Morales, D. et al. Toward a study of gene regulatory constraints to morphological evolution of the Drosophila ocellar region. Dev Genes Evol 226, 221–233 (2016). https://doi.org/10.1007/s00427-016-0541-8

Download citation

Received: 18 November 2015
Accepted: 28 February 2016
Published: 01 April 2016
Issue Date: June 2016
DOI: https://doi.org/10.1007/s00427-016-0541-8

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Toward a study of gene regulatory constraints to morphological evolution of the Drosophila ocellar region

Abstract

Similar content being viewed by others

Genetic variation of morphological scaling in Drosophila melanogaster

Interlocking of co-opted developmental gene networks in Drosophila and the evolution of pre-adaptive novelty

Gene expression analysis of potential morphogen signalling modifying factors in Panarthropoda

Introduction

Materials and methods

Fly species and Drosophila melanogaster strains