Soft Computing

, Volume 11, Issue 4, pp 369–373

Using SVM to Extract Acronyms from Text

Authors

    • College of SoftwareNankai University
  • Yalou Huang
    • College of SoftwareNankai University
Focus

DOI: 10.1007/s00500-006-0091-5

Cite this article as:
Xu, J. & Huang, Y. Soft Comput (2007) 11: 369. doi:10.1007/s00500-006-0091-5

Abstract

The paper addresses the problem of extracting acronyms and their expansions from text. We propose a support vector machines (SVM) based approach to deal with the problem. First, all likely acronyms are identified using heuristic rules. Second, expansion candidates are generated from surrounding text of acronyms. Last, SVM model is employed to select the genuine expansions. Analysis shows that the proposed approach has the advantages of saving over the conventional rule based approaches. Experimental results show that our approach outperforms the baseline method of using rules. We also show that the trained SVM model is generic and can adapt to other domains easily.

Keywords

AcronymExpansionClassificationSupport vector machines

Copyright information

© Springer-Verlag 2006