Estimation Distribution of Algorithm for Fuzzy Clustering Gene Expression Data

  • Feng Liu
  • Juan Liu
  • Jing Feng
  • Huaibei Zhou
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4222)


With the rapid development of genome projects, clustering of gene expression data is a crucial step in analyzing gene function and relationship of conditions. In this paper, we put forward an estimation of distribution algorithm for fuzzy clustering gene expression data, which combines estimation of distribution algorithms and fuzzy logic. Comparing with sGA, our method can avoid many parameters and can converge quickly. Tests on real data show that EDA converges ten times as fast as sGA does in clustering gene expression data. For clustering accuracy, EDA can get a more reasonable result than sGA does in the worst situations although both methods can get the best results in the best situations.


Fuzzy Logic Convergence Speed Fuzzy Cluster Acute Lymphoblastic Leukaemia Gray Code 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Baldi, P., Hatfield, G.W.: DNA Microarrays and Gene Expression: From Experiments to Data Analysis and Modeling. Cambridge University Press, Cambridge (2002)CrossRefGoogle Scholar
  2. 2.
    Han, J., Kamber, M.: Data Mining: Concepts and Techniques, pp. 355–395. Morgan Kaufmann, San Francisco (2000)Google Scholar
  3. 3.
    Zhao, L., Tsujimura, Y., Gen, M.: Genetic Algorithm For Fuzzy Clustering. In: International Conference on Evolutionary Computation, pp. 716–719 (1996)Google Scholar
  4. 4.
    Larranga, P., Lozano, J.A.: Estimation of Distribution Algorithms: a New Tool For Evolution Computation. Kluwer Academic Press, Boston (2001)Google Scholar
  5. 5.
    Ruspini, E.: A New Approach to Clustering. Inf. Control 15, 22–32 (1969)MATHCrossRefGoogle Scholar
  6. 6.
    Zimmermann, H.J.: Fuzzy set Theory and Its Applications, 4th edn. Kluwer Academic Publishers, Dordrecht (2001)Google Scholar
  7. 7.
    Henrion, M.: Propagation of Uncertainty In Bayesian Networks By Probabilistic Logic Sampling. In: Uncertainty in Artificial Intelligence, vol. 2, pp. 149–163. Elsevier, North-Holland (1988)Google Scholar
  8. 8.
    Murali, T.M., Kasif, S.: Extracting Conserved Gene Expression Motifs From Gene Expression Data. In: PSB, vol. 8 (2003)Google Scholar
  9. 9.
    Su, Y., Murali, T.M., Pavlovic, V., et al.: RankGene: Identification of Diagnostic Genes Based on Expression Data. Bioinformatics 19(12), 1578–1579 (2003)CrossRefGoogle Scholar
  10. 10.
    Golub, T.R.: Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression Monitoring. Science 286(15), 531–537 (1999)CrossRefGoogle Scholar
  11. 11.
    Cutcliffe, C., Kersey, D., et al.: Clear Cell Sarcoma of The Kidney: Up-regulation of NeuralMarkers With Activation of The Sonic Hedgehog and Akt Pathways. Clin Cancer Res. 11(22), 7986–7994 (2005)CrossRefGoogle Scholar
  12. 12.
    Dunn, J.C.: A graph theoretic analysis of pattern classification via Tamura’s fuzzy relation. IEEE Trans. SMC 4(3), 310–313 (1974)MATHGoogle Scholar
  13. 13.
    Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)MATHGoogle Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Feng Liu
    • 1
  • Juan Liu
    • 1
  • Jing Feng
    • 1
  • Huaibei Zhou
    • 2
  1. 1.Computer School of Wuhan UniversityWuhan UniversityWuhanChina
  2. 2.International School of SoftwareWuhan UniversityWuhanChina

Personalised recommendations