Abstract
An automatic terminology extraction method in specific domain is proposed based on condition random fields (CRF) in this paper. We treat extraction of terminology in one domain as a sequence labeling problem, and terminology distribution characteristics as features of the CRF model. Then we use the CRF model to train a template for the terminology extraction. Experimental results show that the method is effective and efficient with common domains.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
The ACE 2005 Evaluation Plan (2005), http://www.ldc.upenn.edu/Projects/ACE/Annotation
Muslea, I.: Extraction Patterns for Information Extraction Tasks: A Survey. In: Proceedings of AAAI 1999 Workshop on Machine Learning for Information Extraction, Orlando, Florida, USA (1999)
Freitag, D., McCallum, A.: Information Extraction with HMM Structures Learned by Stochastic Optimization. In: Proceedings of the 17th National Conference on Artificial Intelligence, Texas, USA, pp. 584–589 (2000)
Freitag, D., McCallum, A.: Information Extraction with HMMs and Shrinkage. In: Proceedings AAAI 1999 Workshop on Machine Learning for Information Extraction. MIT Press, Orlando (1999)
Gruber, T.R.: A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition (5), 199–220 (1993)
Borst, W.N.: Construction of Engineering Ontologies for Knowledge Sharing and Reuse. PhD thesis, University of Twente, Enschede (1997)
Studer, R., Benjamins, V.R., Fensel, D.: Knowledge Engineering, Principles and Methods. Data and Knowledge Engineering 25(122), 161–197 (1998)
Gruber, T.R.: Towards Principles for the Design of Ontologies Used for Knowledge Sharing. International Journal of Human Computer Studies 43(6), 907–928 (1995)
Park, J., Barbosa, D.: Adaptive Record Extraction From Web Pages. In: Poceedings of WWW 2007, Banff, Alberta, Canada (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this paper
Cite this paper
Chen, F. (2012). Automatic Extraction of Terminology under CRF Model. In: Hu, W. (eds) Advances in Electric and Electronics. Lecture Notes in Electrical Engineering, vol 155. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28744-2_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-28744-2_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28743-5
Online ISBN: 978-3-642-28744-2
eBook Packages: EngineeringEngineering (R0)