Abstract
This paper presents a web tool for text mining of biomedical literature using clustering and support vector machines. The study is specific to the domain of peptidases and is based on curated literature. The use of ontologies in the text mining and feature selection process was evaluated, and preliminary results show that classifier performance may be improved while the number of features is reduced.
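As a rough illustration of how ontology-based feature selection can shrink a feature space, the sketch below restricts a bag-of-words vocabulary to terms found in an ontology. It is not the tool's implementation: the term set `ONTOLOGY_TERMS`, the sample documents, and the helper names `tokenize`, `build_vocabulary`, and `vectorize` are all hypothetical, and a real system would load terms from an ontology such as the Gene Ontology and pass the resulting vectors to a classifier.

```python
import re
from collections import Counter

# Hypothetical ontology vocabulary (assumption, for illustration only).
ONTOLOGY_TERMS = {"peptidase", "proteolysis", "hydrolase", "catalytic", "substrate"}

# Hypothetical sample abstracts (assumption, not taken from the paper).
DOCS = [
    "The peptidase cleaves its substrate at the catalytic site.",
    "Proteolysis by this hydrolase was measured in vitro.",
]


def tokenize(text):
    """Lowercase the text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def build_vocabulary(documents, allowed_terms=None):
    """Return the sorted vocabulary, optionally restricted to allowed_terms."""
    vocab = set()
    for doc in documents:
        vocab.update(tokenize(doc))
    if allowed_terms is not None:
        vocab &= allowed_terms  # keep only tokens that are ontology terms
    return sorted(vocab)


def vectorize(text, vocabulary):
    """Map a document to term counts in vocabulary order."""
    counts = Counter(tokenize(text))
    return [counts[term] for term in vocabulary]


full_vocab = build_vocabulary(DOCS)                  # every token is a feature
onto_vocab = build_vocabulary(DOCS, ONTOLOGY_TERMS)  # ontology terms only

if __name__ == "__main__":
    print(len(full_vocab), "features without the ontology filter")
    print(len(onto_vocab), "features with the ontology filter")
    print(vectorize(DOCS[0], onto_vocab))
```

Filtering by ontology terms is only one possible selection strategy; whether it improves a given classifier has to be checked empirically.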
References
NCBI Pubmed Database (2011) http://www.ncbi.nlm.nih.gov/pubmed/
Correia D, Campos D, Pereira C, Verissimo P, Dourado A (2010) A platform for intelligent search and classification of biomedical literature. Paper presented at the Workshop on Applications of Computational Intelligence (WACI’10), Coimbra
Ashburner M, Ball CA, Blake JA, Botstein D, Butler H, Cherry JM, Davis AP, Dolinski K, Dwight SS, Eppig JT, Harris MA, Hill DP, Issel-Tarver L, Kasarskis A, Lewis S, Matese JC, Richardson JE, Ringwald M, Rubin GM, Sherlock G (2000) Gene ontology: tool for the unification of biology. The gene ontology consortium. Nat Genet 25(1):25–29
Lu Z (2011) PubMed and beyond: a survey of web tools for searching biomedical literature. Database 2011. doi:10.1093/database/baq036
Doms A, Schroeder M (2005) GoPubMed: exploring PubMed with the gene ontology. Nucleic Acids Res 33(2):W783–W786. doi:10.1093/nar/gki470
Glenisson P, Coessens B, Van Vooren S, Mathys J, Moreau Y, De Moor B (2004) TXTGate: profiling gene groups with text-based information. Genome Biol 5(6):1–12. doi:10.1186/gb-2004-5-6-r43
Delfs R, Doms A, Kozlenkov A, Schroeder M (2004) GoPubMed: ontology-based literature search applied to Gene Ontology and PubMed. In: German Conference on Bioinformatics, pp 169–178
Lloyd SP (1982) Least squares quantization in PCM. IEEE Trans Inf Theory 28(2):129–137. doi:10.1109/TIT.1982.1056489
Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20(3):273–297. doi:10.1007/bf00994018
Vapnik VN (1995) The nature of statistical learning theory. Springer, New York
Joachims T (2002) Learning to classify text using support vector machines: methods, theory and algorithms. Kluwer Academic Publishers, Norwell
NCBI Entrez Programming Utilities (2011) http://www.ncbi.nlm.nih.gov/entrez/query/static/eutils_help.html
Wurst M (2007) The word vector tool: user guide, operator reference, developer tutorial. http://ftp.heanet.ie/disk1/sourceforge/r/project/ra/rapidminer/2.%20Text%20Plugin/4.0beta/rapidminer-wvtool-4.0beta-tutorial.pdf
Carbon S, Ireland A, Mungall CJ, Shu S, Marshall B, Lewis S, the AmiGO Hub, the Web Presence Working Group (2009) AmiGO: online access to ontology and annotation data. Bioinformatics 25(2):288–289. doi:10.1093/bioinformatics/btn615
Rawlings ND, Barrett AJ, Bateman A (2010) MEROPS: the peptidase database. Nucleic Acids Res 38(Database issue):D227–233. doi:10.1093/nar/gkp971
Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2(3):1–27. doi:10.1145/1961189.1961199
Acknowledgments
This work was supported by FCT (Foundation for Science and Technology) and FEDER through Program COMPETE (QREN), under the project FCOMP-01-0124-FEDER-010160 (PTDC/EIA/71770/2006), designated BIOINK - Incremental Kernel Learning for Biological Data Analysis. The platform used services from third parties, whom we thank and reference.
Copyright information
© 2013 Springer Science+Business Media Dordrecht
Cite this paper
Oliveira, J., Correia, D., Pereira, C., Veríssimo, P., Dourado, A. (2013). A Tool for Biomedical – Documents Classification Using Support Vector Machines. In: Madureira, A., Reis, C., Marques, V. (eds) Computational Intelligence and Decision Making. Intelligent Systems, Control and Automation: Science and Engineering, vol 61. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-4722-7_38
DOI: https://doi.org/10.1007/978-94-007-4722-7_38
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-4721-0
Online ISBN: 978-94-007-4722-7
eBook Packages: Engineering, Engineering (R0)