Skip to main content

Gene Selection via Discretized Gene-Expression Profiles and Greedy Feature-Elimination

  • Conference paper
Methods and Applications of Artificial Intelligence (SETN 2004)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3025))

Included in the following conference series:

Abstract

Analysis and interpretation of gene-expression profiles, and the identification of respective molecular- or, gene-markers is the key towards the understanding of the genetic basis of major diseases. The problem is challenging because of the huge number of genes (thousands to tenths of thousands!) and the small number of samples (about 50 to 100 cases). In this paper we present a novel gene-selection methodology, based on the discretization of the continuous gene-expression values. With a specially devised gene-ranking metric we measure the strength of each gene with respect to its power to discriminate between sample categories. Then, a greedy feature-elimination algorithm is applied on the rank-ordered genes to form the final set of selected genes. Unseen samples are classified according to a specially devised prediction/matching metric. The methodology was applied on a number of real-world gene-expression studies yielding very good results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alizadeh, A.A., et al.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403(3), 503–511 (2000)

    Article  Google Scholar 

  2. Baim, P.W.: A Method for Attribute Selection in Inductive Learning Systems. IEEE PAMI 10(6), 888–896 (1988)

    Google Scholar 

  3. Bassett, D.E., Eisen, M.B., Boguski, M.S.: Gene expression informatics: it’s all in your mine. Nature Genetics 21(Suppl. 1), 51–55 (1999)

    Article  Google Scholar 

  4. Brazma, A., Parkinson, H., Schlitt, T., Shojatalab, M.: A quick introduction to elements of biology - cells, molecules, genes, functional genomics, microarrays. EMBL- European Bioinformatics Institute (EBI) (October 2001), http://www.ebi.ac.uk/microarray/biology_intro.html (accessed October 2003)

  5. Fayyad, U., Irani, K.: Multi-interval discretization of continuous-valued attributes for classification learning. In: Procs of the 13th Inernational Joint Conference of Artificial Intelligence, pp. 1022–1029. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  6. Friend, H.F.: How DNA microarrays and expression profiling will affect clinical practice. Br. Med. J. 319, 1–2 (1999)

    Google Scholar 

  7. Ginsburg, G.S., McCarthy, J.J.: Personalized medicine: revolutionizing drug discovery and patient care. Trends Biotechnol 19(12), 491–496 (2001)

    Article  Google Scholar 

  8. Golub, T.R., Slonim, D.K., Tamayo, P., Huard, C., Gaasenbeek, M., Mesirov, J.P., Coller, H., Loh, M.L., Downing, J.R., Caligiuri, M.A., Bloomfield, C.D., Lander, E.S.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286, 531–537 (1999)

    Article  Google Scholar 

  9. Guttmacher, A.E., Collins, F.S.: Genome Medicine. Special issue of N Engl. Med. 349 (2003)

    Google Scholar 

  10. Hall, M.A.: Correlation-based Feature Selection for Machine Learning. PhD thesis, University of Waikato (1999)

    Google Scholar 

  11. Hedenfalk, I., et al.: Gene-expression profiles in hereditary breast cancer. N. Engl. J. Med. 344(8), 539–548 (2001)

    Article  Google Scholar 

  12. Kinzler, K.W., Vogelstein, B.: Lessons from hereditary colorectal cancer. Cell 87(2), 159–170 (1996)

    Article  Google Scholar 

  13. Kohane, I.S.: Bioinformatics and Clinical Informatics: The Imperative to Collaborate. JAMIA 7, 512–516 (2000)

    Google Scholar 

  14. Kohavi, R., John, G.: Wrappers for feature subset selection. Artificial Intelligence (special issue on Relevance) 97(1-2), 273–324 (1996)

    Google Scholar 

  15. Li, L., Weinberg, C.R., Darden, T.A., Pedersen, L.G.: Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17(12), 1131–1142 (2001)

    Article  Google Scholar 

  16. Maojo, V., Iakovidis, I., Martín-Sánchez, F., Crespo, J., Kulikoswki, C.: Medical Informatics and Bioinformatics: European efforts to facilitate synergy. Journal of Biomedical Informatics 34(6), 423–427 (2001)

    Article  Google Scholar 

  17. Nadon, R., Shoemaker, J.: Statistical issues with microarrays: Processing and analysis. Trends in Genetics 15, 265–271 (2002)

    Article  Google Scholar 

  18. Pomeroy, S.L., et al.: Prediction of central nervous system embryonal tumour outcome based on gene expression. Nature 415, 436–442 (2002)

    Article  Google Scholar 

  19. Potamias, G.: Utilizing Gene Functional Classification in Microarray Data Analysis: a Hybrid Clustering Approach. In: 9th Panhellenic Conference in Informatics, Thessaloniki, Greece, November 21-23 (2003)

    Google Scholar 

  20. Quinlan, J.R.: Induction of decision trees. Machine Learning 1(81), 81–106 (1986)

    Google Scholar 

  21. Su, A.I., et al.: Molecular Classification of Human Carcinomas by Use of Gene Expression Signatures. Cancer Research 61, 7388–7399 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Potamias, G., Koumakis, L., Moustakis, V. (2004). Gene Selection via Discretized Gene-Expression Profiles and Greedy Feature-Elimination. In: Vouros, G.A., Panayiotopoulos, T. (eds) Methods and Applications of Artificial Intelligence. SETN 2004. Lecture Notes in Computer Science(), vol 3025. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24674-9_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24674-9_27

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21937-8

  • Online ISBN: 978-3-540-24674-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics