Skip to main content

A Tool for Biomedical – Documents Classification Using Support Vector Machines

  • Conference paper
  • First Online:
Computational Intelligence and Decision Making

Abstract

This paper presents a web tool for text mining of biomedical literature using clustering and support vector machines. The study is specific to the domain of Peptidases, based on curated literature. It has been evaluated the use of ontologies in the text mining and feature selection process, and the preliminary results show that the classifier performance may be improved along with a reduction of the number of features.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. NCBI Pubmed Database (2011) http://www.ncbi.nlm.nih.gov/pubmed/

  2. Correia D, Campos D, Pereira C, Verissimo P, Dourado A (2010) A platform for intelligent search and classification of biomedical literature. Paper presented at the workshop on applications of computational intelligence, WACI’10 - Workshop Application of Computacional Intelligence, Coimbra

    Google Scholar 

  3. Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet 25(1):25–29, citeulike-article-id:212874

    Article  Google Scholar 

  4. Lu Z (2011) PubMed and beyond: a survey of web tools for searching biomedical literature. Database 2011. doi:10.1093/database/baq036

  5. Doms A, Schroeder M (2005) GoPubMed: exploring PubMed with the gene ontology. Nucleic Acids Res 33(2):W783–W786. doi:10.1093/nar/gki470

    Article  Google Scholar 

  6. Glenisson P, Coessens B, Van Vooren S, Mathys J, Moreau Y, De Moor B (2004) TXTGate: profiling gene groups with text-based information. Genome Biol 5(6):1–12. doi:10.1186/gb-2004-5-6-r43

    Article  Google Scholar 

  7. Delfs R, Doms A, Kozlenkov A, Schroeder M (2004) GoPubMed: ontology-based literature search applied to Gene Ontology and PubMed. In: German Conference on Bioinformatics, pp 169–178

    Google Scholar 

  8. Lloyd SP (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28(2):129–137. doi:10.1109/TIT.1982.1056489

    Article  MathSciNet  MATH  Google Scholar 

  9. Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. doi:10.1007/bf00994018

    MATH  Google Scholar 

  10. Vapnik VN (1995) The nature of statistical learning theory. Springer, New York

    MATH  Google Scholar 

  11. Joachims T (2002) Learning to classify text using support vector machines: methods, theory and algorithms. Kluwer Academic Publishers, Norwell

    Book  Google Scholar 

  12. NCBI Entrez Programming Utilities (2011) http://www.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html

  13. Wurst M (2007) The word vector tool-user guide/operator reference/developer tutorial. Can be found in here: http://ftp.heanet.ie/disk1/sourceforge/r/project/ra/rapidminer/2.%20Text%20Plugin/4.0beta/rapidminer-wvtool-4.0beta-tutorial.pdf

  14. Carbon S, Ireland A, Mungall CJ, Shu S, Marshall B, Lewis S, Hub tA, Group tWPW (2009) AmiGO: online access to ontology and annotation data. Bioinformatics 25(2):288–289. doi:10.1093/bioinformatics/btn615

    Article  Google Scholar 

  15. Rawlings ND, Barrett AJ, Bateman A (2010) MEROPS: the peptidase database. Nucleic Acids Res 38(Database issue):D227–233. doi:10.1093/nar/gkp971

    Article  Google Scholar 

  16. Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):1–27. doi:10.1145/1961189.1961199

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by FCT (Foundation for Science and Technology) and FEDER through Program COMPETE (QREN) executed under the project FCOMP-01-0124-FEDER-010160 (PTDC/EIA/71770/2006), designated BIOINK - Incremental Kernel Learning for Biological Data Analysis. The platform used services from third parties to whom we thank and reference.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to João Oliveira .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer Science+Business Media Dordrecht

About this paper

Cite this paper

Oliveira, J., Correia, D., Pereira, C., Veríssimo, P., Dourado, A. (2013). A Tool for Biomedical – Documents Classification Using Support Vector Machines. In: Madureira, A., Reis, C., Marques, V. (eds) Computational Intelligence and Decision Making. Intelligent Systems, Control and Automation: Science and Engineering, vol 61. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-4722-7_38

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-4722-7_38

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-4721-0

  • Online ISBN: 978-94-007-4722-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics