Asia Information Retrieval Symposium

AIRS 2006: Information Retrieval Technology pp 633-641

Towards Automatic Domain Classification of Technical Terms: Estimating Domain Specificity of a Term Using the Web

  • Takehito Utsuro
  • Mitsuhiro Kida
  • Masatsugu Tonoike
  • Satoshi Sato
Conference paper

DOI: 10.1007/11880592_56

Volume 4182 of the book series Lecture Notes in Computer Science (LNCS)
Cite this paper as:
Utsuro T., Kida M., Tonoike M., Sato S. (2006) Towards Automatic Domain Classification of Technical Terms: Estimating Domain Specificity of a Term Using the Web. In: Ng H.T., Leong MK., Kan MY., Ji D. (eds) Information Retrieval Technology. AIRS 2006. Lecture Notes in Computer Science, vol 4182. Springer, Berlin, Heidelberg

Abstract

This paper proposes a method of domain specificity estimation of technical terms using the Web. In the proposed method, it is assumed that, for a certain technical domain, a list of known technical terms of the domain is given. Technical documents of the domain are collected through the Web search engine, which are then used for generating a vector space model for the domain. The domain specificity of a target term is estimated according to the distribution of the domain of the sample pages of the target term. Experimental evaluation results show that the proposed method achieved mostly 90% precision/recall.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Takehito Utsuro
    • 1
  • Mitsuhiro Kida
    • 2
  • Masatsugu Tonoike
    • 3
  • Satoshi Sato
    • 4
  1. 1.Graduate School of Systems and Information EngineeringUniversity of TsukubaTsukubaJapan
  2. 2.Nintendo Co.,Ltd.Kyoto-shiJapan
  3. 3.Graduate School of InformaticsKyoto UniversityKyotoJapan
  4. 4.Graduate School of EngineeringNagoya UniversityNagoyaJapan