Predictor Ranking using Modified Zhegalkin Functions

  • Pey-Chang Kent Lin
  • Sunil P. Khatri


Inference of the underlying gene regulatory network structure (i.e. predictors and functions) from gene expression is an important challenge in genomics. With continuing improvements in microarray technology, the ability to measure expression levels of many genes has improved significantly, making available large amount of gene expression data for analysis. In previous chapters, all gene expressions have been assumed to be digital in nature. However, actual gene expressions (from microarrays for example) are continuous. On the other hand, many genes have been observed to exhibit switch-like or Boolean behavior. In this chapter, we utilize modified Zhegalkin polynomials to express the Boolean behavior of gene expression in an analog or continuous manner. Given gene expression data in the form of microarray measurements normalized to the unit interval, we present a method for ranking and selecting predictors which fits the data with the least mean square error according to the modified Zhegalkin function. Our methods are validated on synthetic gene expressions from a mutated mammalian cell-cycle network and then demonstrated on measured gene expressions from a melanoma network study. The results of our approach can be used to identify potential genes in future expression experiments or for possible targeted drug development experiments.


  1. 1.
    Kauffman, S. A.: Metabolic stability and epigenesis in randomly constructed genetic nets. J. Theoreti. Biolo. 22(3), 437–467 (1969)CrossRefGoogle Scholar
  2. 2.
    Pal, R., Ivanov, I., Datta, A., Bittner, M.L., Dougherty, E.R.: Generating Boolean networks with a prescribed attractor structure. Bioinformat. 21(21), 4021–4025 (2005)CrossRefGoogle Scholar
  3. 3.
    Layek, R., Datta, A., Dougherty, E.R.: From biological pathways to regulatory networks. Decision and Control (CDC), 2010 49th IEEE Conference on, pp. 5781–5786. (2010)Google Scholar
  4. 4.
    Lin, P. K., Khatri, S.P.: Inference of gene predictor set using Boolean satisfiability,” in Genomic Signal Processing and Statistics (GENSIPS), 2010 IEEE International Workshop on, pp. 1–4. (2010)Google Scholar
  5. 5.
    Layek, R., Datta, A., Bittner,M., E.R. Dougherty: Cancer therapy design based on pathway logic. Bioinform. 27(4), 548–555 (2011)CrossRefGoogle Scholar
  6. 6.
    Lin, P. K., Khatri, S.P.: Efficient cancer therapy using Boolean networks and Max-SAT-based ATPG. Genomic Signal Processing and Statistics (GENSIPS), 2011 IEEE International Workshop on. IEEE, pp. 87–90. (2011)Google Scholar
  7. 7.
    Chen, T., He, H.L., Church, G.M. et al.: Modeling gene expression with differential equations. Pacific. Symp. Biocomput. 4, 4 (1999)Google Scholar
  8. 8.
    Wu, F.X., Zhang, W.J., Kusalik, A.J.: Modeling gene expression from microarray expression data with state-space equations. Pacific. Symp. Biocomput. 9, 581–592 (2004)Google Scholar
  9. 9.
    Geard, N., Wiles, J.: A gene network model for developing cell lineages. Artif. Life, 11,(3), 249–268 (2005)CrossRefGoogle Scholar
  10. 10.
    Arkin,A., Ross,J., McAdams, H.H.: Stochastic kinetic analysis of developmental pathway bifurcation in phage lambda-infected escherichia coli cells. Genet. 49, 1633–1648 (1998)Google Scholar
  11. 11.
    Faisal, S., Lichtenberg, G., Werner, H.: Canalizing Zhegalkin polynomials as models for gene expression time series data. Engineering of Intelligent Systems, 2006 IEEE International Conference on. IEEE, pp. 1–6. (2006)Google Scholar
  12. 12.
    Faisal, S., Lichtenberg, G., Werner, H.: An approach using Zhegalkin polynomials for modelling microarray timeseries data of eucaryotes. Proceedings of International Conference on Systems Biology. p. 312. (2004)Google Scholar
  13. 13.
    Stone, M.H.: The theory of representation for Boolean algebras. Trans. American Math. Soc. 40(1), 37–111 (1936)Google Scholar
  14. 14.
    Dhaeseleer, P., Liang, S., Somogyi, R.: Gene expression data analysis and modeling. Pacific. Symp. Biocomput. 99, (1999)Google Scholar
  15. 15.
    Zien, A., Aigner, T., Zimmer, R., Lengauer, T.: Centralization: a new method for the normalization of gene expression data. Bioinfor. 17(1) 323–331 (2001)CrossRefGoogle Scholar
  16. 16.
    Faryabi, B., Chamberland, J.-F., Vahedi, G., Datta, A., Dougherty, E.R.: Optimal intervention in asynchronous genetic regulatory networks. IEEE J. Sel. Top. Signal. Process. 2(3), 412–423 (2008)Google Scholar
  17. 17.
    Bittner, M. et al.: Molecular classification of cutaneous malignant melanoma by gene expression profiling. Nature. 406(3), 536–540 (2000)CrossRefGoogle Scholar
  18. 18.
    Datta, A., Dougherty, E.R.: Introduction to Genomic Signal Processing with Control. CRC Press, (2007)Google Scholar
  19. 19.
    Xiao, Y., Dougherty, E.R.: The impact of function perturbations in Boolean networks. Bioinforma. 23(10), 1265–1273 (2007)CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2014

Authors and Affiliations

  1. 1.Electrical and Computer EngineeringTexas A&M UniversityCollege StationUSA

Personalised recommendations