Decision Making Association Rules for Recognition of Differential Gene Expression Profiles

  • C. Rubio-Escudero
  • Coral del Val
  • O. Cordón
  • I. Zwir
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4224)


The rapid development of methods that select over/under expressed genes from RNA microarray experiments have not yet satisfied the need for tools that identify differential profiles that distinguish between experimental conditions such as time, treatment and phenotype. We evaluate several microarray analysis methods and study their performance, finding that none of the methods alone identifies all observable differential profiles, nor subsumes the results obtained by the other methods. Therefore, we propose a machine learning based methodology that identifies and combines the abilities of microarray analysis methods to recognize differential profiles. We encode the results of this methodology in decision making association rules able to decide which method or method-aggregation is optimal to retrieve a set of genes exhibiting a common profile. These solutions are optimal in the sense that they constitute partial ordered subsets of all method-aggregations bounded by the most specific and the most sensitive available solution. This methodology was successfully applied to a study of inflammation and host response to injury data set derived from the analysis of longitudinal blood microarray profiles of human volunteers treated with intravenous endotoxin compared to placebo. Our approach was able to uncover a cohesive set of differentially expressed genes and novel members exhibiting previously studied differential profiles. This guideline serves as a means to support decisions on new microarray problems.


Association Rule Differential Gene Expression Multiobjective Optimization Microarray Gene Expression Data Differential Expression Profile 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Durbin, R., Eddy, S., Krogh, A., Mitchison, G.: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. Cambridge University Press, Cambridge (1998)MATHCrossRefGoogle Scholar
  2. Brown, P., Botstein, D.: Exploring the new world of the genome with DNA microarrays. Nature Genet 21, 33–37 (1999)CrossRefGoogle Scholar
  3. Inza, I., Larranaga, P., Blanco, R., Cerrolaza, A.J.: Filter versus wrapper gene selection approaches in DNA microarray domains. Artif. Intell. Med. 31(2), 91–103 (2004)CrossRefGoogle Scholar
  4. Pan, W., Lin, J., Le, C.: A mixture model approach to detecting differentially expressed genes with microarray data. Funct. Integr. Genomics 3(3), 117–124 (2001)Google Scholar
  5. Park, T., Yi, S.G., Lee, S., Lee, S.Y., Yoo, D.H., Ahn, J.I., Lee, Y.S.: Statistical tests for identifying differentially expressed genes in time-course microarray experiments. Bioinformatics 19(6), 694–703 (2003)CrossRefGoogle Scholar
  6. Tusher, V.G., Tibshirani, R., Chu, G.: Significance analysis of microarrays applied to the ionizing radiation response. Proc. Natl. Acad. Sci. USA 98, 5116–5121 (2001)MATHCrossRefGoogle Scholar
  7. Vaquerizas, J.M., Conde, L., Yankilevich, P., Cabezon, A., Minguez, P., Diaz-Uriarte, R., Al-Shahrour, F., Herrero, J., Dopazo, J.: GEPAS, an experiment-oriented pipeline for the analysis of microarray gene expression data. Nucleic Acids Res. 1(33)(Web Server issue), 616–620 (2005)Google Scholar
  8. Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: Buneman, P., Jajodia, S. (eds.) Proceedings of the ACM SIGMOD. International Conference on Management of Data, Washington, DC, pp. 207–216 (1993)Google Scholar
  9. Zwir, I., Shin, D., Kato, A., Nishino, K., Latifi, K., Solomon, F., Hare, J.M., Huang, H., Groisman, E.A.: Dissecting the PhoP regulatory network of Escherichia coli and Salmonella enterica. Proc. Natl. Acad. Sci. 102, 2862–2867 (2005)CrossRefGoogle Scholar
  10. Zwir, I., Huang, H., Groisman, E.A.: Analysis of Differentially-Regulated Genes within a Regulatory Network by GPS Genome Navigation. Bioinformatics 21(22), 4073–4083 (2005b)CrossRefGoogle Scholar
  11. lI, C., Wong, W.H.: DNA-Chip Analyzer (dChip). In: Parmigiani, G., Garrett, E.S., Irizarry, R., Zeger, S.L. (eds.) The analysis of gene expression data: methods and software. Springer, Heidelberg (2003)Google Scholar
  12. Der, G., Everitt, B.S.: Handbook of Statistical Analyses using SAS. Chapman and Hall/CRC (2001)Google Scholar
  13. Calvano, S.E., Xiao, W., Richards, D.R., Feliciano, R.M., Baker, H.V., Cho, R.J., Chen, R.O., Brownstein, B.H., Cobb, J.P., Tschoeke, S.K., Miller-Graziano, C., Moldawer, L.L., Mindrinos, M.N., Davis, R.W., Tompkins, R.G., Lowry, S.F.: The Inflammation and Host Response to Injury Large Scale Collaborative Research Program. A Network-Based Analysis of Systemic Inflammation in Humans. Nature 13, 437(7061), 1032–1037 (2005)Google Scholar
  14. Gao, X., Song, P.: Nonparametric tests for differential gene expression and interaction effects in multi-factorial microarray experiments. BMC Bioinformatics 21(6), 186 (2005)CrossRefGoogle Scholar
  15. Romero-Zaliz, R., Rubio-Escudero, C., Cordón, O., Harari, O., del Val, C., Zwir, I.: Mining Structural Databases: An Evolutionary Multi-Objective Conceptual Clustering Methodology. In: Rothlauf, F., Branke, J., Cagnoni, S., Costa, E., Cotta, C., Drechsler, R., Lutton, E., Machado, P., Moore, J.H., Romero, J., Smith, G.D., Squillero, G., Takagi, H. (eds.) EvoWorkshops 2006. LNCS, vol. 3907, pp. 159–171. Springer, Heidelberg (2006)CrossRefGoogle Scholar
  16. Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J., Church, G.M.: Systematic determination of genetic network architecture. Nat. Genet. 22, 281–285 (1999)CrossRefGoogle Scholar
  17. Cheeseman, P., Oldford, R.W.: Selecting models from data: artificial intelligence and statistics IV. Springer, New York (1994)MATHGoogle Scholar
  18. Cooper, G., Herskovits, E.: Bayesian method for the induction of probabilistic networks from data. Machine Learning 9, 309–347 (1992)MATHGoogle Scholar
  19. Ruspini, E.: Introduction to Longitudinal Research. In: Bulmer, M. (ed.) Social Research Today, Routledge, London (2002)Google Scholar
  20. Bezdek, J.C.: Pattern Analysis. In: Pedrycz, W., Bonissone, P.P., Ruspini, E.H. (eds.) Handbook of Fuzzy Computation, Institute of Physics, Bristol F6.1.1-F6.6.20 (1998)Google Scholar
  21. Mitchell, T.: Machine Learning. McGraw-Hill, New York (1997)MATHGoogle Scholar
  22. Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. John Wiley & Sons, New York (1973)MATHGoogle Scholar
  23. Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. On Pattern Analysis and Machine Intelligence 1(2), 224–227 (1979)CrossRefGoogle Scholar
  24. Klir, G.J., Yuan, B.: Fuzzy Sets and Fuzzy Logic: Theroy and Applications. Prentice-Hall, Englewood Cliffs (2005)Google Scholar
  25. Deb, K.: Multi-objective optimization using evolutionary algorithms. John Wiley & Sons, Chichester, New York (2001)MATHGoogle Scholar
  26. Cordón, O., del Jesus, M.J., Herrera, F.: A Proposal on Reasoning Methods in Fuzzy Rule-Based Classification Systems. International Journal of Approximate Reasoning 20, 21–45 (1999)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • C. Rubio-Escudero
    • 1
  • Coral del Val
    • 1
  • O. Cordón
    • 1
    • 2
  • I. Zwir
    • 1
    • 3
  1. 1.Department of Computer Science and Artificial IntelligenceUniversity of GranadaSpain
  2. 2.European Center for Soft ComputingMieresSpain
  3. 3.Howard Hughes Medical InstituteWashington University School of MedicineSt. Louis

Personalised recommendations