An agent-based model of the Notch signaling pathway elucidates three levels of complexity in the determination of developmental patterning
The Notch signaling pathway is involved in cell fate decision and developmental patterning in diverse organisms. A receptor molecule, Notch (N), and a ligand molecule (in this case Delta or Dl) are the central molecules in this pathway. In early Drosophila embryos, these molecules determine neural vs. skin fates in a reproducible rosette pattern.
We have created an agent-based model (ABM) that simulates the molecular components for this signaling pathway as agents acting within a spatial representation of a cell. The model captures the changing levels of these components, their transition from one state to another, and their movement from the nucleus to the cell membrane and back to the nucleus again. The model introduces stochastic variation into the system using a random generator within the Netlogo programming environment. The model uses these representations to understand the biological systems at three levels: individual cell fate, the interactions between cells, and the formation of pattern across the system. Using a set of assessment tools, we show that the current model accurately reproduces the rosette pattern of neurons and skin cells in the system over a wide set of parameters. Oscillations in the level of the N agent eventually stabilize cell fate into this pattern. We found that the dynamic timing and the availability of the N and Dl agents in neighboring cells are central to the formation of a correct and stable pattern. A feedback loop to the production of both components is necessary for a correct and stable pattern.
The signaling pathways within and between cells in our model interact in real time to create a spatially correct field of neurons and skin cells. This model predicts that cells with high N and low Dl drive the formation of the pattern. This model also be used to elucidate general rules of biological self-patterning and decision-making.
KeywordsAgent-based modeling Notch signaling pathway Self-patterning
Decision making during development is a widely studied problem. Generally speaking, biologists have genetically dissected fate pathways to identify molecules that interact through extracellular receptors with other cells and their environment to alter their transcription patterns through signaling pathways. However, the route to cell fate is not straightforward and a detailed understanding of how groups of cells develop spatial relationships necessitates a systems-level approach.
One of the canonical pathways for fate decisions and patterning is the Notch pathway. This pathway is used in multiple tissue types and across evolutionary time; it is the most common pathway used by adjacent cells to make binary fate decisions . The Drosophila early embryo is the best studied Notch pathway, where Notch (N), an extracellular receptor, and its ligand Delta (Dl) act to determine the fate of a sheet of ectodermal cells, with cells adopting either a neuronal or epidermal fate in a reproducible rosette spatial pattern [2, 3]. Cells with high levels of N protein assume an epidermal fate, while a low level of N produces the neuronal fate. This signaling pathway yields a reproducible geometry and consistent number of neural and epidermal cells, however it is unclear how the identical cellular pathways within each cell interact to create pattern within a group of cells.
In addition to the molecular genetic work, mathematical and computational models of the N signaling have been created [15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25]. Many of these models uses equations or algorithms to represent the interactions between cells, although other approaches, such viewing the system as a dynamic network, have been utilized. These models replicate the in vivo pattern accurately and make important contributions to the understanding of the system, but model the development of fate choice within a single cell, or the dynamics between cells that results in the pattern within the system. No existing model captures the dynamics that connect the underlying molecular signaling pathway that produces the intercellular interactions to the overall patterning of cells during development.
We have built a fine-grained and hierarchical model using Agent-Based Modeling (ABM) that reproduces and assesses three levels of complexity: intracellular dynamics, intercellular interactions, and system dynamics [26, 27, 28]. Informed by the myriad molecular studies done with the Notch pathway, we created agents that mimic the actual cellular and protein components for a population of cells in a sheet and that undergo transformations with each cycle of the model. We believe it is necessary to include the intracellular components of the system, since the dynamics of these components set up the interaction between cells. The signaling pathways within and between cells in our model interact in real time to create a spatially correct field of neurons and skin cells. As agents interact with each other, an evolving dynamic picture of an emergent system can be observed and analyzed. We developed tools to track the stabilization of individual cell fate and the stable formation of pattern in the system and used these tools to evaluate the contributions of individual signal components to cell fate and system pattern. We altered both the levels and timing of these components using model features that control the production and transitions between agents. Our model suggests that cell fate and larger system pattern are generated through oscillations of the number of N agents and that out of sync oscillations created by the stochastic nature of the model produced alternative fates in neighboring cells. Each individual cell charts its own path to a cell fate choice, but interactions between cells cement those cell fates and the larger system pattern as the model progresses. The model replicates a number of features of the biological system and produces a number of testable hypotheses that can be explored further in both in silico and in vivo.
To address the problem of multicellular patterning, we chose to model the N signaling pathway described above used since the current understanding of this pathway has broad experimental support . An attractive aspect of using this signaling pathway in our modeling of this problem is that there is a single central signaling component, N, and its receptor. Both N and Dl are absolutely required for cell fate, cell interaction and the larger pattern. There is no cascade and the N signal in a cell is not amplified at any step, therefore the exact level of N protein controls cell fate. Using ABM, we can focus on one product, and add or control effectors in a stepwise fashion to see which are important to the overall patterning. We describe the basic features of the model below, with more details in the Methods.
Agents, their movements and transformations
Movement and/or transformation
Variable names within the code
Radius size around nucleus
Cell radius and position # of lipid agents
Dl as initially transcribed
Dl moves towards Mem and converts to Dlm
Chance that a single Dl will be produced
Dl associated with membrane
Dlm moves laterally from Mem to Mem
Dlm converts Dlm’
Time period before Dlm is converted to Dlm’
Dl form that interacts with N from another cell
moves laterally from Mem to Mem
Time period a Dl will exist
N as initially transcribed
N moves towards Mem and converts to Nm
Chance that a single N will be produced
N associated with membrane, interacts with Dl from another cell, cleaved
Nm moves laterally from Mem to Mem
Nm converts to Nc when across from Dlm’ on another cell
Cleaved N moves from cell membrane to nucleus
Nc converts to Nn when it reaches the nucleus
Sets randomness, direction and time frame of movement
Nuclear form of N, transcription factor
Alters N transcription
Alters Dl transcription
Time period a N will exist
Alters N and Dl initial transcription rates
The second type of agents are active components representing various states of N and Dl that model the steps seen in Fig. 1. Agents Dl and N are generated at a variable initial rate and tagged as belonging to each cell’s Nuc. The N/Dl agents are assigned a random heading and move towards Mem agents and transition to Nm and Dlm as they associate with these agents (step 1). Both agents move randomly between Mem agents after production. Dlm is converted to an active form called Dlm’ at a variable rate within the model (step 2). When Nm and Dlm’ agents are opposite each other on adjacent cells, Nm is converted to Nc (the first cleavage step, step 3). The average time of the second cleavage step and subsequent migration of Nc to the nucleus (the conversion of Nc to Nn) can also be varied within the model (step 4). The number of Nn agents provides feedback on the system, upregulating N and downregulating Dl average transcription settings.
Transitions between agents are mediated in various ways within the model. N/Dl is converted to Nm/Dlm, when they approach a Mem agent. Likewise, the Nm to Nc transition occurs when a Nm and Dlm’ move to adjacent Mems across from each other on the adjoining cell. In other cases, transitions are generated by rates that have stochastic features that model complex molecular interactions or transit in cell compartments. Such stochastic features are a property of biological systems and random elements of fate determination for the N pathway have been confirmed in vivo . For example, the initial production of N and Dl are set as a variable that creates a distribution of values around a mean. Nc transitions to Nn using a random step feature with a rate set as an average distribution. The transition from Dlm to Dlm’ can be manipulated by a setting that creates an average time but not an exact time for the transition. The stochastic features of the model are produced using pseudorandom number generator that is part of the Netlogo programming language (see methods). To prevent a simple buildup of agents over time, we also have an age-out feature for all N and Dl agents. Agent “death” is set relative to a birth date for an average number of ticks and then the agents are removed from the model space. The age-out feature models the biological lifespan of molecules. These transition variables can be set initially within the NetLogo model or specified by a driver program that directs multiple runs of the model.
Running the model
We can observe a single run of the model using the NetLogo program; each agent has its own color designation and we can watch as the agents move within the model space (Fig. 2). Generally speaking though, we perform multiple runs of the same experiment of the model using a program that specifies a range of parameters and distributes the model runs across a cluster of computers. Production of agents, transitions, feedback and age-out occur in a specific order for each tick of a model run. The NetLogo program calculates two outputs from the model runs: an integer count of Nn agents (the signal level-N count) in each cell produced as a ordered string at each tick, and a raw count of the neurons at each tick. Neurons are defined as having a zero Nn count, while other cells are defined as nonneurons or skin cells.
Analysis of model data
The developing pattern can also be analyzed dynamically since the number string can be remapped in a visual representation. It can also be reduced to a numerical value to make comparisons easier. A roset pattern, a central neuron surrounded by six epidermal cells in a repeating pattern, is the goal developmental pattern for the whole sheet of cells, with an optimal number of rosettes of 27 or 28 for 77 cells. Using the string that relates signal level to position, we defined rosettes as neurons with six non-neuron neighbors, and then accounted for edge effects by adding in edge neurons that also had an edge base optimum of non-neuron neighbors. Occasionally one imperfect rosette is generated at the edge, but rosette counts that stabilized and were within two neurons of the 27/28 optimum were consistently found to have the correct pattern over a sample of the data. Therefore, our measure of rosette counts is a good proxy for overall pattern.
Once rosettes and stabilization times have been determined, these measures are displayed on color-coded graphs with hot colors representing high numbers of rosettes and high stabilization times and cool colors representing low numbers for these measures. In addition, graphs that combine these measures were created to give an overview by assigning category designations and colors. In addition, dynamics can be measured by tracking change of signal levels. Using the string that related signal level to position, we can use a variation of Hamming distance (derived from information theory) to measure the amount of dis-similarity between two strings of codes at different time points. In traditional hamming distance, two binary codes are compared for differences and a count of these differences becomes the distance between the codes. With our approach, two strings are compared and a new string is be generated that flags any differences with an asterisk, representing where the two strings do not match and retaining the character code when there is no change. Thus, variation in time can be captured in a progressive way and used to characterize model stabilization during a run.
Model output-single runs
Conditions for model stability and patterning
Model parameter settings
Representation in the model
N agents initially transcribed
8 to 24 agents in increments of 2
Dl agents initially transcribed
8 to 24 agents in increments of 2
Dlm to Dlm’
Transition from form associated with membrane to form that interacts with N
0, 50, 100, 150 ticks
Nc to Nn
Transition from cleaved form to nuclear form
50, 75, 175, 225, 275 ticks
The other parameter settings do produce some or all runs where the model works. The shortest stabilization time is about 2500 ticks and this end of the data represents both runs that stabilize with the correct number and some that do not. There is a peak of runs that stabilize very late or not at all (about 30%). If a run doesn’t reach stability by 20,000 ticks, it is assigned that value. These runs are difficult to assess as a group since our rosette measure may inaccurately assess the pattern for runs that are still oscillating at the end of the 20,000 tick time frame. By looking at a sample of runs in this peak, we are convinced that some of these appear to converge on a stabilization time with good pattern beyond the 20,000 tick time frame of the experiment. In addition, there are some long time frame runs that appear to never stabilize and continue to oscillate with no suggestion of stability or that crash to produce either no neurons or no skin cells. The distribution of rosette then is skewed to the right, since many runs produce correct pattern.
In comparing the three sets of graphs, there are several clear conclusions from the data. First, stability and pattern are not strictly correlated. High rosette pattern is found with a range of stabilization times. Some very short stabilization times do not produce good pattern. Systems that go through at least a few oscillations of Nn levels have a better probability of developing proper pattern. Second, stabilization time is strongly altered by the parameter settings, however many of the parameter sets generate rosette numbers representative of a proper pattern. It seems as if the model drives towards correct pattern and the parameter settings may largely determine when it gets to a stable place. Third, initial N levels are more important in determining a successful run than initial Dl levels, although both are important for stabilization time and pattern. Finally, the model has a sweet spot for the various parameters (best observed using Fig. 8c). For initial settings of N and Dl, the spot is on the diagonal, where there is close to equal levels of N and Dl. For the Nc to Nn transition, intermediate values again are best. Dlm to Dlm’ transition is best with lower values. This tells us that there is an optimum timing for the system that involves all parameters.
The most dramatic effect on the model is obtained by varying the timing of the transition between Nc and Nn. This parameter would alter the timing of the availability of Nn to provide feedback within the cell and presumably alter the timing of oscillations in the system. Transition times of 125 and 175 ticks produce an overall sweet spot for the model. Decreasing transition time to 75 produces long stabilization times. Increasing transition time to 225 ticks only works when Nn initial levels are high. This reinforces our previous idea that longer transition times may work less well as they approach Nn age-out settings of 350.
The initial levels of both N and Dl are important since their agent numbers determine the probability of the Nm cleavage and subsequent effects on downstream transcription of the agent through feedback. The initial level is modified by the level of Nn at each tick. The best pattern and stabilization times are produced when initial N is low and initial Dl is greater than (for 75,125) or equal to (for 175, 225) initial N. It would seem that the level of initial N is most influential in setting up the stability properties for the system. However, the levels of initial Dl are not completely without consequence. Increasing amounts of initial Dl relative to initial N generates problems with pattern as seen in the higher Nc to Nn transition times.
The timing of Dlm’ availability controlled by the Dlm to Dlm’ transition time seems to somewhat impact stabilization time, with little impact on pattern. This was confirmed by looking at a small number of individual runs, most of which stabilize beyond 18,000 ticks with the proper pattern. The model works best when Dlm’ is immediately available with longer availability times increasing model stability times.
An exception to these trends is present when both transition times are low (top left in Fig. 8c). High initial N requires less initial Dl to work. This is the only set of parameters where the majority of white box runs fail to stabilize based on a sampling of a small number of runs.
Feedback within the model is essential
Results of Feedback experiments
Nn turns up N transcription and turns down Dl transcription
Good stability and pattern across a wide set of parameters
Nn turns up N transcription only
Model crashes with 0 neuron count and fails to oscillate
Nn turns down Dl transcription only
Low neuron and rosette count (< 5), no oscillating runs that stabilize
Model crashes with 0 neuron count and fails to oscillate
We have created an agent-based model (ABM) that simulates the molecular components for the N signaling pathway as agents acting within a spatial representation of a cell. Using a set of assessment tools, we show that the current model accurately reproduces the rosette pattern of neurons and skin cells in the system over a wide set of parameters and looked at the impact of levels, timing and feedback for the N and Dl components of the model. The data presented in this paper looks at aggregated aspects of the system, like number of neurons and pattern.
We expected that the initial rates of both N and Dl agents would be an important feature of the model since the level of these protein products in the biological system would determine the probability of the N cleavage and subsequent effects on downstream transcription. Based on the hypothesis presented above, different ratios of N and Dl agents would change the timing of the oscillations, as well as having some impact on pattern. Indeed, increased N levels result in longer stabilization times and larger oscillations. For a given level of N, Dl level does not have a big impact on stability or pattern, except when there is a considerable mismatch in levels.
The Nc to Nn transition was most influential in the outcomes from the model, suggesting timing is important to the formation of system pattern. Biologically, the transition Nc to Nn consists of two cleavages, one at the cell membrane and a second an endocytic compartment that releases a fragment that can then enter the nucleus and alter transcription [21, 22]. We simply represented those multiple steps as one transition, and represented this transition as a timing variable. This parameter interacts with other features of the model in interesting ways. Long transition times interfere with the action of N and essentially limit feedback in the model. Short transition times cause faster oscillations, but also long stability times, most likely because the chance for substantial asynchrony to develop between cells would be limited and the system swings back and forth with little progress in fate assignment. Within the sweet spot for this transition, we see more of the effects of other parameters.
Feedback is absolutely necessary for the model to work. This is expected since it is the chief mechanism by which change in the system occurs, but our model suggests that feedback is required for both N and Dl components. Although, there is limited evidence that N regulates its own transcription or Dl transcription, such feedback has been seen in a few instances and has been postulated to part of this regulatory circuit [1, 10, 11, 12, 13]. Feedback to N or Dl transcription may be directly performed by N or by one of the downstream genes, but our model confirms its importance for the biological decision-making carried out by the system.
Many of the experiments that have been done over the last 20 years have been aimed at discovering the biological players in the system and the downstream components necessary for the expression of cell fate. The focus has been on the cell as the unit of function, and molecular biology experiments have created complete loss of function or gain of function for the system. In most cases, the pattern has been completely disrupted, generating all neurons or all skin cells. Less emphasis has been placed on altered pattern from these early developmental steps.
However, several outcomes of this model are supported by the existing literature. The model mirrors the findings of much of the literature that manipulation of N levels and its processing alone can impact both fate and pattern. The stoichiometric nature of the N pathway had been postulated from genetic experiments since the N locus is both haplo-insufficient and triplo-mutant (both ½ dose and 3 doses lead to a altered phenotype) and our model findings are consistent with that interpretation . Genetic data suggests that a ratio of the levels of the N ligand and its Dl receptor is important in fate and in establishing an asymmetry in the levels of these proteins that develop over time through feedback loops . Our model is consistent with the idea that both initial and relative levels of these proteins over time determine both fate and pattern and that the process is dependent on feedback. Our model also produces oscillations of these components. These oscillations have been observed in the biological system for N protein levels and downstream targets [32, 33].
We believe there are further experiments that could be done to validate the model. Levels of N and Dl protein might be controlled through RNAi constructs in a mutant background or using antibodies or small molecules that modulate signaling [33, 34, 35, 36]. There are chemical inhibitors of the proteases involved in the cleavage step that could be used at varying dosages to produce alterations in the efficiency of the process [37, 38]. The pattern can be monitored using antibodies targeted at various nervous system molecules and a technique has been developed to look at oscillations of the downstream proteins Hes 1 and 7 . These types of experiments are necessary for further validation of the model. The N signaling system produces other patterns beside rosettes and one of the challenges will be to see if our model can also produce those other patterns and how we might produce them with the basic components we have already incorporated.
How our model compares to other models
Several models of the Notch signaling pathway have been created previously. Waddington was one of the first to look conceptually at the process of fate determination as that of the cell moving on an energy-landscape molded by the effects of the transcribing genome [15, 16]. Waddington represented these influences as a system of differential equations implementing the continuous effects of the various interacting genes. Many of the current models have followed suit, by implementing individual cells in the population as compartments within a differential equation system. The Collier model is the most successful of these recent efforts with the general idea of the model modified and improved by others [17, 18, 19, 24, 39, 40]. Initially analytical and numerical analyses focused on system dynamics while some later work extended the Collier results, focusing on the molecular interactions between cell compartments . The Marnellos group relied exclusively on numerical analysis, but used an evolutionary optimization technique to tune the model dynamics . Finally, a modeling approach has been used to simulate genetic expression state propagation across cell populations in the spinal nervous system during vertebrate development . All of these models show proper pattern of the basic rosette structure. A recent modification has allowed the model to be extended to produce other patterns associated with N signaling . Other approaches to N pathway modeling include gene-regulatory networks [20, 21], extended Boolean Networks , and discrete stochastic processes . One recent model that uses a differential equation approach takes into account the volume and geometry of the cells making up the system and shows the impact of these factors in producing pattern .
While all of these models represent the patterning of the N signaling system well, they focus on either the interactions between cells and those pattern dynamics, or they look at the development of fate choice within the cell. None of them integrate the three levels of hierarchy inherent in our model. These models are also harder to manipulate than ours, often requiring altered or additional equations to ask experimental questions. Some of the models assume a role for inhibition in the model, while our model allows the molecular interactions to illuminate the process. Our modeling environment is robust and does not require a precise description as seen in the differential equation based systems. Overall, our model represents the biology better, where the overall behavior of molecules is not precise and systems can often handle some deviation.
Several authors have suggested that ABM may be a fruitful approach to modeling biological systems. Walker and Southgate (2009) suggest that spatially hierarchical models such as agent-based systems are a good approach to modeling biological phenomena at multiple scales . However, they state that few multi-scale models had been developed as of that time. Bartocci and Lió (2016) suggest that agent-based models are well suited to understanding how cellular interactions produce systems characteristics . Richmond et al. (2010) in an earlier paper had made similar points about the utility of agent-based models and developed a modeling environment using an agent-based program call FLAME that runs on both single and clustered systems and provides a template for these types of modeling experiments . Our choice of ABM and the NetLogo programming environment embrace the advantages laid out by the authors in these papers and facilitates a multi-scale analysis of this developmental process. It allows us to easily manipulate components and the processing steps associated with the pathway, and make it easy to add more steps into the model. For example, we have already built the model with Nm and Dlm’ cis inhibition interactions that are observed in the biological system . Dl in this case acts to sequester N from intramembrane cleavage, essentially making too much Dl a detriment to the system. We look forward to confirming additional biological features in this system.
Our model is based on an understanding of the components and their actions discovered through traditional methods. It is strongly based in the biological literature, although we have made choices and assumptions on implementation. The hypothesis-driven framework creates an internalized reconstruction of the subcellular process and an external analysis of the system dynamics that allows for an integrated exploration of the role of the subcellular in the multicellular pattern. In addition, a fuller understanding and a formal description of how our model generates this dynamic pattern may inform computation problems. The N signaling system and its biological approach to pattern formation inspired an algorithm that addresses a key problem in distributed programming . The flow of information between individual components in a distributed system as presented by our model may be used in a similar way to approach these problems in computational systems.
We have created an agent-based model that simulates the molecular components of the N signaling pathway within representational cells capable of creating a multicellular pattern consistent with what is observed in the biological system. The model has 3 levels of complexity: the specific timing and level of each molecular component within each cell, the interactions between cells, and the formation of pattern across the system. The signaling pathways within and between cells in our model interact in real time to create a spatially correct field of neurons and skin cells. The model produces a stable correct pattern for the system under a variety of model parameter settings. We found that the dynamic timing and availability of N and Dl components of the system were central to the formation of a cell fate and a correct and stable pattern. Levels of the Nn agent oscillate up and down within individual cells and in the system. Positive feedback to N levels and negative feedback on Dl levels provided by N levels over time are essential to the model. According to our model, cells that have high levels of N and low levels of Dl engage with neighbors that have low or no N and high Dl levels, stabilizing these cells into their fate. The timing of oscillations between neighboring cells most likely establish stable fate at the middle level of complexity and the construction and deconstruction of pattern is necessary for the whole field to stabilize correctly, since model runs that have short stabilization times often do not have correct pattern. Therefore, the components N and Dl control cell fate, neighboring cell fate, and the larger pattern. This model can be used to make predictions about the N signaling system, but can also be used to elucidate general rules of biological self-patterning and decision-making.
NetLogo is a modeling tool developed following in the long tradition of Logo and directly descended from StarLogo [16, 24]. Conceptually, NetLogo allows for the visualization of agents that are autonomous software entities embodied in a virtual environment that are capable of navigating, sensing, and manipulating that environment, including other agents. These agents are called turtles, following from Logo, but can be assigned to a specific population called a breed. Agents are akin to objects in an Object Oriented (OO) language. These populations can be called upon to perform different actions and agents can create other agents, allowing for a mechanism fundamental to the model where one agent produces a range of agents to construct a sophisticated architecture. Globals set up the initial starting features of the environment and the placement of the agents within that environment as well as configure model output. The program is constructed in a modular fashion as described in the following sections. The complete code for the model and driver and the data for the experiments in the paper can be found in supporting files as follows: Additional files 1, 2, 3, 4, 5 and 6.
There are four categories of globals: configuration, which sets up the initial structural features of the model environment such as cell radius and unit move based on that radius; model, which governs the actions of the agents within the model; general, which sets up how information is collected during model runs; and reporting, which directs the model output.
The model is constructed from four general types of agents (Nuc, Mem, Dl and N) with a total of nine different breeds within the programming environment (Table 1). Nuc and Mem are structural components that provide the major organizing features for the model. The Nuc breed provides points of orientation for cells in the structure and act as factories for production of Mem, Dl, and N agents. These agents are then linked with their progenitor Nuc (their parent). The Mem breed is more complex. The Mem represent the lipid membrane that form the walls of the cells around a parent nucleus, but also form the basis of locations within cells and in interactions between cells. The placement of the Mem is determined by a cell radius setting from the configuration global, and the number and spacing of the Mem agents can be altered to impact the granularity of the model. The Nuc and Mem agents are put into place during setup procedures and do not change during the model run. N and Dl agents are breeds that are dynamically created and destroyed during model run, providing the signaling mechanisms in the model (Table 1). There are 3 different Dl category breeds (Dl, Dlm, Dlm’) and 4 different N category breeds (N, Nm, Nc, Nn) that represent the transitions of these proteins during signaling (Table 1). N and Dl agents move through the model environment with their location in the model tracked and their spatial association with other agents often results in one breed being transformed into another.
Procedures within the model
The model is initiated through a setup procedure that clears the model space, and initiates globals and model settings that are variable. In general, the model takes direction from a head program (described below) that defines specific parameters and the range of parameters that will be tested within an “experiment” and then initiates the Go command below. Each experiment is given a unique designation or seed. Setup then initiates the Nuc breed and lays out a sheet of cells with Nuc and their children Mem agents in a hexagonal pattern. Relative position of each cell is determined in the sheet via the build-neighbors command that groups a set of detailed commands responsible for assigning agents to a specific parent Nuc and then determining neighboring agents and their parents. Action within the model takes place within the context of this sheet of cells. Different layout commands implement cell topology within the sheet, nuclear topology and cell radius.
Remove components that have hit an age limit.
Ask each nucleus to produce some number of Dl and N agents.
Move existing Dl and N breeds within the system, depending on their type.
Allow agents to manipulate neighboring agents if applicable.
Increment time. Time begins for the model at t = 0.
Step 1: The age-limit (age-out) for N and Dl agents are set as one of the initial model parameters. This age limit removes agents from the system and is dependent on breed type. Thus, when a Dl agent is created the clock begins and is never reset even though it goes through a set of breed transformations. Regular destruction of agents is essential for the model to run, since the definitions within the model for a neuron is based on a zero N count, which would never be achieved if components did not “age-out.”
Step 2: At each tick, additional N and Dl agents are added to the system. The initial agents are Dl or N breeds. There is an initial transcription setting for these components specified by the head program and then each additional tick brings a reassessment of the transcription rate based on the amount of Nn. The initial setting establishes a mean for production of agents, and variation around that mean is based the number of Nn and on a pseudorandom number generator that sets the value of that rate (described below).
The N and Dl agents move in a stepwise fashion to the membrane, moving one patch space per tick with the distance defined by the cell and nucleus radius.
When N and Dl agents move within one patch distance occupied by a Mem agent, they transition to Nm or Dlm respectively.
Dlm agents transitions to the Dlm’ breed based on a parameter setting for the transition
Nm, Dlm or Dlm’ agents move laterally from Mem agent to Mem agent.
Once Nm is converted to Nc (see step 4), Nc then moves from the membrane to the nucleus. These movements are encoded as a diffuse-proteins command, where each agent has code related to their movement (See below).
Nc agent within 1 patch of the nucleus transition to Nn.
Step 4: This is the most complex part of the model. In the current iteration of the model, this step is about the transition of Nm to Nc. In our implementation, the transition is coded to look at positions of Dl associated with Mem agents on a neighboring cell in relation to the position of N associated with Mem agents on the home cell. If a Dlm’ is within a defined set of locations across from a Nm agent, then Nm transitions to Nc.
Step 5: This step moves forward the model timing mechanism and assigns a tick number to all events within the model.
Stochastic features of the model
The model is designed to implement random features to ensure that a regular ordering of events does not introduce artifacts into the results and to match the stochastic nature of the biological system. The stochastic features rely on a pseudorandom number generator using the Mersene-Twister algorithm implemented in NetLogo . Random agent selection is implemented when a specific set of agents is required to act, and randomized action selection is implemented when there is a range of possible choices. Random agent selection is used primarily to ensure that order of agent selection does not bias the results in the favor of one agent or another based on one agent always performing its action first. Random action selection is based on the notion that proteins have the Markov property in that they do not retain a memory of previous actions upon which to make future decisions beyond their current state. Through a combination of these two random features, the model should prevent errors from being introduced due to one component having greater privileged over other competing components.
Much of the biological “noise” of the model is implemented using the random command feature of NetLogo. Each run of the model is seeded, which supplies a series of pseudorandom values that are used to generate variable parameters of the model. Parameters varied in this paper have some “noise” incorporated. Initial transcription and subsequent transcription rates of N and Dl are randomly generated around a mean setting. Transition between Dlm and Dlm’ has similar stochastic components. The movement of Nc towards the Nuc has a random component since Nc agents use the diffuse-protein command for movement. This command directs movement towards the nucleus based on a Gaussian distribution around the parameter setting for the Nc to Nn transition.
A driver program provides an isolated execution environment between individual runs, stores model output as a set of nested directories based on the parameter space, and captures results of how the model was run, to allow for later examination and confirmation of the execution environment. The driver program functions by running and manipulating the model in headless-mode (without visualization) within the Java Virtual Machine (JVM), which is a designed feature of NetLogo. Once the JVM is started, Java classes are loaded into the environment, allowing the driver to load the model as if a human user were working with the software directly. Once running, the driver iterates, in a nested order, the five parameters we varied in this paper: Notch transcription initial rate, Delta transcription initial rate, Delta membrane transform time, cleaved Notch diffusion time, and random seed. This implementation starts with a directory named for the first value of notch transcription initial rate, which will contain all other combinations of the remaining parameters. At the bottom of the directory structure is a directory identifying the random seed. The nested directory structure can be later merged with similarly structured directories. This allows for a larger data-set to be built incrementally from small sets of data. This also allows for a common structure to analyze and aggregate data for later assessment of model behavior and performance.
Output from the model and measures
From each model run, two files are generated: notchNucCounts.txt, and neuronCnts.txt. The file notchNucCounts.txt contains the individual cell signal levels. The file neuronCnts.txt contains the count of neurons (based on a zero signal-level metric) of the system for each time tick as two columns. These basic files are then manipulated to produce the seed or aggregated data. This data then is pulled into graphing programs to create the graphs represented in this manuscript. See Additional file 6 for more information.
This measure is determined using neuronCnts.txt and scanning from the last time point to identify the place where the neuron count for a given run does not vary by more than one neuron. The earliest tick frame where that occurs is the stabilization time.
Difference assessment provides a different perspective on stabilization of the model. Each tick produces a string of data that catalogues the number of Nn agents in the each cell, notchNucCounts.txt. The strings are compared using a Hamming distance function. The number of changes over time is then graphed. Changes over time can be mapped for an individual run or an aggregate set of runs.
Pattern is assessed using a rosette count. For a rosette to be tallied, a neuron must have 6 neighbors that are skin cells, except in the case of edge neurons, which have an expected number of neighbors based on their position. For every cell, the numbers of the surrounding cells are catalogued and then the comparisons are made on a tick by tick basis or at the time end point of 20,000 ticks.
Step 1. Is the average stability time less than or equal to 18,000, goes to Step 2
If not goes to cat 3, white boxes
Step 2. Is the rosette count greater than 25, goes to Step 3
If not goes to cat 2, light green boxes
Step 3 All remaining run are assigned cat 1, dark green boxes
Graphs are produced using Mathematica or Gnuplot. Individual graphs for experiments are assembled into a grid.
This project was funded by Lafayette College. The College had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Availability of data and materials
All data generated or analyzed during this study are included in this published article [and its supplementary information files].
ERR was responsible for project and model conceptualization, implementation and development of methodology, investigation, formal analysis, project administration and supervision, validation, visualization, and writing-original draft preparation. JOP was responsible for project and model conceptualization, implementation and development of software and methodology, investigation, formal analysis, data curation, resources, project administration and supervision, validation, visualization, and writing-review and editing. RH was responsible for implementation and development of software and methodology, investigation, formal analysis, data curation and visualization. CS was responsible for implementation and development of software and methodology, and investigation. All authors read and approved the final manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
- 19.Marnellos G, Deblandre G, Mjolsness E, Kintner C. Delta-Notch lateral inhibitory patterning in the emergence of ciliated cells in Xenopus: experimental observations and a gene network model. Pac Symp Biocomput. 2000;2000:329–40.Google Scholar
- 22.Schaub MA, Henzinger TA, Fisher J. Qualitative networks: a symbolic approach to analyze biological signaling networks. BMC Syst Biol. 2007;1(4).Google Scholar
- 24.Formosa- Jordan P, Sprinzak D. Modeling Notch Signaling: a practical tutorial. In: Bellen HJ, Yamamoto S, editors. Notch Signaling: methods and protocols. Methods in molecular biology, vol. 1187. New York: Springer; 2014. p. 285–310.Google Scholar
- 26.Resnick M. Turtles, termites, and traffic jams: explorations in massively parallel microworlds. 1st ed. Cambridge: MIT Press; 1994.Google Scholar
- 29.Wilensky U. Netlogo. Center for connected learning and computer-based modeling, Northwestern University 1999. http://ccl.northwestern.edu/netlogo/. Accessed 27 Dec 2017.
- 33.Shimojo H, Harima Y, Kageyama R. Visualization of Notch signaling oscillation in cells and tissues. In: Bellen HJ, Yamamoto S, editors. Notch Signaling: methods and protocols. Methods in molecular biology, vol. 1187. New York: Springer; 2014. p. 169–79.Google Scholar
- 36.Bellen HJ, Yamamoto S. Notch Signaling: Methods and protocols. Methods in molecular biology Vol 1187. New York: Springer; 2014.Google Scholar
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.