Abstract
We present a theoretical framework for the analysis of the effect of a fully differentiated cell population on a neighboring stem cell population in Multi-Cellular Organisms (MCOs). Such an organism is constituted by a set of different cell populations, each set of which converges to a different cycle from all possible options, of the same Boolean network. Cells communicate via a subset of the nodes called signals. We show that generic dynamic properties of cycles and nodes in random Boolean networks can induce cell differentiation.
Specifically we propose algorithms, conditions and methods to examine if a set of signaling nodes enabling these conversions can be found. Surprisingly we find that robust conversions can be obtained even with a very small number of signals. The proposed conversions can occur in multiple spatial organizations and can be used as a model for regeneration in MCOs, where an islet of cells of one type (representing stem cells) is surrounded by cells of another type (representing differentiated cells). The cells at the outer layer of the islet function like progenitor cells (i.e. dividing asymmetrically and differentiating). To the best of our knowledge, this is the first work showing a tissue-like regeneration in MCO simulations based on random Boolean networks.
We show that the probability to obtain a conversion decreases with the log of the node number in the network, showing that the model is relevant for large networks as well. We have further checked that the conversions are not trivial, i.e. conversions do not occur due to irregular structures of the Boolean network, and the converting cycle undergoes a respectable change in its behavior.
Finally we show that the model can also be applied to a realistic genetic regulatory network, showing that the basic mathematical insight from regular networks holds in more complex experiment-based networks.
Similar content being viewed by others
Notes
A directed graph is strongly connected (SC), if a directed pathway between each of its nodes exists. Strongly connected components of a directed graph are its maximal strongly connected sub-graphs. A giant SCC (GSCC) is the largest SCC of a directed graph.
References
Ahn, S., Wang, R. T., Park, C. C., Lin, A., Leahy, R. M., Lange, K., & Smith, D. J. (2009). Directed mammalian gene regulatory networks using expression and comparative genomic hybridization microarray data from radiation hybrids. PLoS Comput. Biol., 5, e1000407.
Alonso, L., & Fuchs, E. (2003). Stem cells of the skin epithelium. Proc. Natl. Acad. Sci. USA, 100, 11830–11835. doi:10.1073/pnas.1734203100.
Badorff, C., Brandes, R. P., Popp, R., Rupp, S., Urbich, C., Aicher, A., Fleming, I., Busse, R., Zeiher, A. M., & Dimmeler, S. (2003). Transdifferentiation of blood-derived human adult endothelial progenitor cells into functionally active cardiomyocytes. Circulation, 107, 1024–1032.
Barker, N., Bartfeld, S., & Clevers, H. (2010). Tissue-resident adult stem cell populations of rapidly self-renewing organs. Cell Stem Cell, 7, 656–670.
Bertrand, V., & Hobert, O. (2010). Lineage programming: navigating through transient regulatory states via binary decisions. Curr. Opin. Genet. Dev., 20, 362–368.
Cui, Q., Ma, Y., Jaramillo, M., Bari, H., Awan, A., Yang, S., Zhang, S., Liu, L., Lu, M., O’Connor-McCourt, M., Purisima, E. O., & Wang, E. (2007). A map of human cancer signaling. Mol. Syst. Biol., 3, 152. doi:10.1038/msb4100200.
Damiani, C., Serra, R., Villani, M., Kauffman, S., & Colacci, A. (2011). Cell–cell interaction and diversity of emergent behaviours. IET Syst. Biol., 5, 137–144.
Elowitz, M. B., Levine, A. J., Siggia, E. D., & Swain, P. S. (2002). Stochastic gene expression in a single cell. Science, 297, 1183–1186. doi:10.1126/science.1070919.
Ferguson-Smith, A. C., & Surani, M. A. (2001). Imprinting and the epigenetic asymmetry between parental genomes. Science, 293, 1086.
Gama-Castro, S., Salgado, H., Peralta-Gil, M., Santos-Zavaleta, A., Muñiz-Rascado, L., Solano-Lira, H., Jimenez-Jacinto, V., Weiss, V., García-Sotelo, J. S., & López-Fuentes, A. (2011). RegulonDB version 7.0: transcriptional regulation of Escherichia coli K-12 integrated within genetic sensory response units (Gensor units). Nucleic Acids Res., 39, D98–D105.
Harris, S. E., Sawhill, B. K., Wuensche, A., & Kauffman, S. (2002). A model of transcriptional regulatory networks based on biases in the observed regulation rules. Complexity, 7, 23–40.
Jackson, E. R., Johnson, D., & Nash, W. G. (1986). Gene networks in development. J. Theor. Biol., 119, 379–396.
Jaenisch, R., & Bird, A. (2003). Epigenetic regulation of gene expression: how the genome integrates intrinsic and environmental signals. Nat. Genet., 33, 245–254.
Kauffman, S. A. (1969). Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theor. Biol., 22, 437–467.
Kauffman, S. A. (1993). The origins of order: self-organization and selection in evolution.
Kauffman, S., Peterson, C., Samuelsson, B., & Troein, C. (2004). Genetic networks with canalyzing Boolean rules are always stable. Proc. Natl. Acad. Sci. USA, 101, 17102–17107. doi:10.1073/pnas.0407783101.
Krause, D. S., Theise, N. D., Collector, M. I., Henegariu, O., Hwang, S., Gardner, R., Neutzel, S., & Sharkis, S. J. (2001). Multi-organ, multi-lineage engraftment by a single bone marrow-derived stem cell. Cell, 105, 369–377. doi:10.1016/S0092-8674(01)00328-2.
Maduro, M. F., & Rothman, J. H. (2002). Making worm guts: the gene regulatory network of the Caenorhabditis elegans endoderm. Dev. Biol., 246, 68–85.
Pittenger, M. F., Mackay, A. M., Beck, S. C., Jaiswal, R. K., Douglas, R., Mosca, J. D., Moorman, M. A., Simonetti, D. W., Craig, S., & Marshak, D. R. (1999). Multilineage potential of adult human mesenchymal stem cells. Science, 284, 143.
Polynikis, A., Hogan, S., & Di Bernardo, M. (2009). Comparing different ODE modelling approaches for gene regulatory networks. J. Theor. Biol., 261, 511–530.
Samuelsson, B., & Troein, C. (2003). Superpolynomial growth in the number of attractors in Kauffman networks. Phys. Rev. Lett., 90. doi:10.1103/PhysRevLett.90.098701.
Serra, R., Villani, M., Barbieri, A., Kauffman, S., & Colacci, A. (2010). On the dynamics of random Boolean networks subject to noise: attractors, ergodic sets and cell types. J. Theor. Biol., 265, 185–193.
Silva, H., & Martins, M. (2003). A cellular automata model for cell differentiation. Phys. A, Stat. Mech. Appl., 322, 555–566. doi:10.1016/s0378-4371(02)01807-1.
Takahashi, K., & Yamanaka, S. (2006). Induction of pluripotent stem cells from mouse embryonic and adult fibroblast cultures by defined factors. Cell, 126, 663–676.
Takahashi, K., Tanabe, K., Ohnuki, M., Narita, M., Ichisaka, T., Tomoda, K., & Yamanaka, S. (2007). Induction of pluripotent stem cells from adult human fibroblasts by defined factors. Cell, 131, 861–872.
Tarjan, R. E., Korf, R. E., Kumar, V., Kao, M., & Aggarwal, A. (1972). Depth-first search. SIAM J. Comput., 1.
Urnov, F. D., & Wolffe, A. P. (2001). Above and within the genome: epigenetics past and present. J. Mammary Gland Biol. Neoplasia, 6, 153–167.
Villani, M., Barbieri, A., & Serra, R. (2011). A dynamical model of genetic networks for cell differentiation. PLoS ONE, 6, e17703.
Vohradsky, J. (2001). Neural model of the genetic network. J. Biol. Chem., 276, 36168–36173. doi:10.1074/jbc.M104391200.
Acknowledgements
We would like to thank Professor N. Linial from the School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel, for a fruitful discussion and the MOSIX team for providing cluster computing support.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A
In this appendix, we explore the option of using a slightly different set of candidate nodes to induce UQC. The algorithm for detecting candidate nodes first eliminates nodes that behave similarly in the two cycles, and then eliminates nodes that cannot close a directed circle or that do not link between directed circles in the remaining graph. Finally, the algorithm selects the nodes that are fixed in both cycles. However, if the decision which cycle is going to be the inducing one (potentially) is taken in advance, an alternative algorithm can be proposed, with the small difference that at the last step of the algorithm, nodes fixed only in the potentially inducing cycle are chosen.
Such nodes actually enable UQC to occur in MCOs as well. However, such nodes are problematic. We will refer to UQC obtained from such nodes as Raw UQC. The focus of the work is not on Raw UQC. Since tissue stem cells are located in small stable groups surrounded by differentiated tissue cells, it is important that the conversions will take place layer by layer with only external conversion of stem cells by differentiated cells and no natural differentiation of stem cells induced by other stem cells, as would have happened if nodes twinkling in the induced set of cells are used as signaling nodes. We have tested that Raw UQC properties in MCOs are similar to the ones of UQC in general (Table 3).
We have checked whether all possible unsynchronized conversions in MCOs are the result of Raw UQC. There were a few cases (0.5 % of cycle pairs), where a conversion occurred in MCO although there was no Raw UQC (the conversion occurred using the default choice of using nodes with different patterns as signals). In these cases, the cell populations first synchronized themselves and then conversion occurred. However, those cases under a little different MCO dimensions lose the ability to self synchronize as well as the ability to convert.
Thus, while Raw UQC does not cover all the possible unsynchronized conversions between any two cycles, the results indicate that there are no unsynchronized conversions that are not sensitive to the MCOs’ dimensions that do not follow Raw UQC.
From another perspective, in order to determine whether there will be conversion in cases that have potential inducing nodes but do not induce UQC in the two cell system, 1000 pairs of such cycles were tested and no MCO conversion was found for any of them.
Appendix B
In this appendix, we validate that the UQC conversions are not trivial in terms of the locations of the nodes that enable the conversions in the Boolean network and also in terms of validating that the induced cycle undergoes a significant change in its character, otherwise, its conversion is meaningless. For each network size, UQC was checked in 10,000 random Boolean networks.
2.1 B.1 Relation Between UQC and the Giant Connectivity Component
We have here studied K=2 homogeneous networks. Such networks have a giant strongly connected componentFootnote 1 (GSCC) (Tarjan et al. 1972), but they still may have other small strongly connected components. Note that the small SCCs can be connected to the GSCC in the equivalent undirected network. It is easier to induce UQC in a small network than in a large one (Fig. 4). Thus one may wonder if the results in the large networks are not an artifact of the presence of small strongly connected components. What happens in real genomes is unclear; however, we do know that there are strongly connected components (SCCs) in real genomes containing hundreds of genes, so it is important to measure the fraction of cases where all nodes inducing UQC belong to the GSCC. This fraction was found to be constant at around 40 % and was not sensitive to the network size (Fig. 6A). For clarity, in the following sections, the UQC with all the inducing nodes in the GSCC will be referred as case C, where the regular UQC cycle pairs will be referred as case B and a random pair of cycles will be referred as case A.
2.2 B.2 Lengths of the Converting and Converted Cycles
The lengths of the cycles are also important. Very long cycles are obviously non-biological. On the other hand, too short cycles might be meaningless as well. Very short cycles may not reflect the dynamic character of living cells. We ensured that there were pairs of cycles that follow the UQC, where the lengths of both the induced and the target cycles were equal or bigger than the median cycle length. This fraction was between 25–35 % (Fig. 6A). Thus, UQC is not an artifact of short cycles. Those cycle pairs will be further referred to as case D.
2.3 B.3 Difference Between the Length of the Inducing and Induced Cycle
An interesting case is pairs of UQC cycles that have different lengths. Mathematically these cases are interesting, since they must require that a set of fixed nodes will become unfixed or the opposite—all this being done via signaling fixed nodes (mostly bimodal). Biologically these cases are interesting since they may reflect arousal/shutting of some dynamical cellular activity. This fraction was between 5–15 % (Fig. 6A). Those cases will be referred as case E. Finally, we introduce one more category (case F), which is the combination of the three previously described categories: (A) all required nodes belong to the GSCC, (B) both the induced and target cycles have lengths equal or bigger than the median, and (C) their cycle length is different. This fraction is shown in Fig. 6A, and was found to be very small.
2.4 B.4 Similarity Between the Converting and Converted Cycles
Conversion between two cycles is meaningless if the induced and target cycles are practically the same. We have thus checked if conversion is indeed limited to the practically meaningless conversion of almost equal cycles. Two measures of similarity were proposed. The first is the fraction of nodes with a different pattern between the cycles. The second measure is the correlation between the average value vectors of each cycle (i.e. the fraction of time each node receives a value of 1 in each cycle (see Sect. 4)). Note that we do expect some correlation, since even if 10 % of the nodes are changed, this may represent a significant pattern change.
From a biological point of view, such a correlation is equivalent to the correlation between the average mRNA expression levels in two types of asynchronized cell population. We have tested the correlation for all the cycle pair groups mentioned above. One can clearly observe (Fig. 6B and 6C) that as the complexity requests for the cycles increases, the fraction of different nodes increases and the similarity decreases. On the other hand, as the network size increases, the similarity increases.
To summarize, when comparing the length of the cycles in all the above mentioned categories, the cycles in groups E and F are the longest and are the most different from one another. This was consistent over all network sizes (number of nodes) tested.
Rights and permissions
About this article
Cite this article
Bodaker, M., Louzoun, Y. & Mitrani, E. Mathematical Conditions for Induced Cell Differentiation and Trans-differentiation in Adult Cells. Bull Math Biol 75, 819–844 (2013). https://doi.org/10.1007/s11538-013-9837-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11538-013-9837-2