Integrating Gene Expression Data from Microarrays Using the Self-Organising Map and the Gene Ontology

  • Ken McGarry
  • Mohammad Sarfraz
  • John MacIntyre
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4774)


The self-organizing map (SOM) is useful in bioinformatics research because of its clustering and visualization capabilities. The SOM is a vector quantization method that reduces the dimensionality of the original measurements and visualizes individual tumor samples in a SOM component plane. The data are taken from cDNA microarray experiments on the Diffuse Large B-Cell Lymphoma (DLBCL) data set of Alizadeh. The objective is for the SOM to discover biologically meaningful clusters of genes that are active in this particular form of cancer. Despite their powers of visualization, SOMs cannot provide a full explanation of their structure and composition without further detailed analysis. The only method to have gone some way towards filling this gap is the unified distance matrix (U-matrix) technique, which is used here to provide a better understanding of the nature of the discovered gene clusters. We enhance the work of previous researchers by integrating the clustering results with the Gene Ontology for deeper analysis of biological meaning, identifying diversity in gene expression among the DLBCL tumors, and reflecting the variations in tumor growth rate.
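As a rough illustration of the two techniques named in the abstract, the sketch below trains a small rectangular SOM and computes a U-matrix from its weights. It is not the authors' implementation: the function names `train_som` and `u_matrix`, the grid size, the linear decay schedules and the synthetic two-cluster data standing in for DLBCL expression profiles are all assumptions made for illustration, and the Gene Ontology integration is not shown.

```python
import numpy as np


def train_som(data, rows=6, cols=6, epochs=20, lr0=0.5, sigma0=2.0, seed=0):
    """Train a small rectangular SOM online; returns weights of shape (rows, cols, dim).

    Hypothetical helper: the decay schedules are simple assumptions.
    """
    rng = np.random.default_rng(seed)
    n, dim = data.shape
    # Start all units near the data mean with small random perturbations.
    weights = data.mean(axis=0) + 0.1 * rng.normal(size=(rows, cols, dim))
    # Grid coordinates of every unit, used by the neighbourhood function.
    grid = np.stack(
        np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij"), axis=-1
    )
    total_steps = epochs * n
    step = 0
    for _ in range(epochs):
        for i in rng.permutation(n):
            frac = step / total_steps
            lr = lr0 * (1.0 - frac)                 # learning rate decays linearly
            sigma = sigma0 * (1.0 - frac) + 0.5     # neighbourhood radius shrinks
            x = data[i]
            # Best-matching unit: the unit whose weight vector is closest to x.
            dist = np.linalg.norm(weights - x, axis=2)
            bmu = np.unravel_index(np.argmin(dist), dist.shape)
            # Gaussian neighbourhood centred on the best-matching unit.
            grid_d2 = np.sum((grid - np.array(bmu)) ** 2, axis=2)
            h = np.exp(-grid_d2 / (2.0 * sigma ** 2))
            weights += lr * h[..., None] * (x - weights)
            step += 1
    return weights


def u_matrix(weights):
    """Mean distance from each unit's weight vector to its 4-connected neighbours."""
    rows, cols, _ = weights.shape
    u = np.zeros((rows, cols))
    for r in range(rows):
        for c in range(cols):
            dists = []
            for dr, dc in ((-1, 0), (1, 0), (0, -1), (0, 1)):
                rr, cc = r + dr, c + dc
                if 0 <= rr < rows and 0 <= cc < cols:
                    dists.append(np.linalg.norm(weights[r, c] - weights[rr, cc]))
            u[r, c] = np.mean(dists)
    return u


# Synthetic stand-in data: two groups of "genes" with different expression levels.
demo_rng = np.random.default_rng(42)
demo_data = np.vstack([
    demo_rng.normal(loc=-2.0, scale=0.3, size=(20, 4)),
    demo_rng.normal(loc=2.0, scale=0.3, size=(20, 4)),
])
demo_weights = train_som(demo_data)
demo_u = u_matrix(demo_weights)
```

High values in `demo_u` mark units whose neighbours have dissimilar weight vectors, which is how a U-matrix suggests boundaries between clusters on the map; low values indicate regions of similar units.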


Gene Ontology, Gene Expression Data, Lateral Connection, Biological Process Term, Visualization Capability


  1. Berkum, N., Holstege, F.: DNA microarrays raising the profile. Current Opinion in Biotechnology 12(1), 48–52 (2001)
  2. Soinov, L., Krestyaninova, M., Brazma, A.: Towards reconstruction of gene networks from expression data by supervised learning. Genome Biology 4(1), 1–10 (2003)
  3. Sherlock, G.: Analysis of large-scale gene expression data. Current Opinion in Immunology 12, 201–205 (2000)
  4. Alizadeh, A., Eisen, M., Davis, R., Ma, C.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000)
  5. Kuo, P., Kim, E., Trimarchi, J., Jenssen, T., Vinterbo, S., Ohno-Machado, L.: A primer on gene expression and microarrays for machine learning researchers. Journal of Biomedical Informatics 37, 293–303 (2004)
  6. Hughes, T., et al.: Functional discovery via a compendium of expression profiles. Cell 102, 109–126 (2000)
  7. Lu, Y., Han, J.: Cancer classification using gene expression data. Information Systems 28, 242–268 (2003)
  8. Peterson, C., Ringer, M.: Analyzing tumor gene expression profile. Artificial Intelligence in Medicine 28(1), 59–74 (2003)
  9. Moreau, Y., Aerts, S., Moor, B.D., DeStrooper, B., Dabrowski, M.: Comparison and meta-analysis of microarray data: from the bench to the computer desk. Trends in Genetics 19(10), 570–577 (2004)
  10. Kuo, P., Jenssen, T., Butte, A., Ohno-Machado, L., Kohane, I.: Analysis of matched mRNA measurements from two different microarray technologies. Bioinformatics 18(3), 405–412 (2003)
  11. Quackenbush, J.: Computational analysis of microarray data. Nature Reviews Genetics 2, 418–427 (2001)
  12. Quackenbush, J.: Microarray data normalisation and transformation. Nature Genetics Supplement 32, 496–501 (2002)
  13. Kohonen, T., Oja, E., Simula, O., Visa, A., Kangas, J.: Engineering applications of the self-organizing map. Proceedings of the IEEE 84(10), 1358–1383 (1996)
  14. Kaski, S., Nikkilä, J., Törönen, P., Castrén, E., Wong, G.: Analysis and visualization of gene expression data using self-organizing maps. In: Proceedings of NSIP-01, IEEE-EURASIP Workshop on Nonlinear Signal and Image Processing 2001, Baltimore, USA (2001)
  15. Nikkilä, J., Kaski, S., Törönen, P., Castrén, E., Wong, G.: Analysis and visualization of gene expression data using self-organizing maps. Neural Networks 8(9), 953–966 (2002)
  16. Bard, J., Rhee, S.: Ontologies in biology: design applications and future challenges. Nature Reviews Genetics 5, 213–222 (2004)
  17. Ashburner, M.: Gene ontology: tool for the unification of biology. Nature Genetics 25, 25–29 (2000)
  18. Ultsch, A., Siemon, H.P.: Kohonen's self organizing feature maps for exploratory data analysis. In: Proceedings of the International Neural Network Conference, pp. 305–308 (1990)
  19. Kaski, S.: Dimensionality reduction by random mapping: Fast similarity computation for clustering. In: Proceedings of IJCNN 1998, International Joint Conference on Neural Networks, Piscataway, NJ, vol. 1, pp. 413–418 (1998)
  20. Malone, J., McGarry, K., Bowerman, C., Wermter, S.: Rule extraction from Kohonen neural networks. Neural Computing Applications Journal 15(1), 9–17 (2006)

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Ken McGarry (1)
  • Mohammad Sarfraz (1)
  • John MacIntyre (1)

  1. School of Computing and Technology, University of Sunderland, St Peters Campus, St Peters Way, SR6 0DD, UK
