The Naïve Bayes Model for Unsupervised Word Sense Disambiguation

Aspects Concerning Feature Selection

  • Florentina T. Hristea

Part of the SpringerBriefs in Statistics book series (BRIEFSSTATIST)

Table of contents

  1. Front Matter
    Pages i-xiii
  2. Florentina T. Hristea
    Pages 1-8
  3. Florentina T. Hristea
    Pages 17-33
  4. Florentina T. Hristea
    Pages 35-53
  5. Back Matter
    Pages 69-70

About this book


This book presents recent advances (from 2008 to 2012) concerning use of the Naïve Bayes model in unsupervised word sense disambiguation (WSD).

While WSD, in general, has a number of important applications in various fields of artificial intelligence (information retrieval, text processing, machine translation, message understanding, man-machine communication etc.), unsupervised WSD is considered important because it is language-independent and does not require previously annotated corpora. The Naïve Bayes model has been widely used in supervised WSD, but its use in unsupervised WSD has led to more modest disambiguation results and has been less frequent. It seems that the potential of this statistical model with respect to unsupervised WSD continues to remain insufficiently explored.

The present book contends that the Naïve Bayes model needs to be fed knowledge in order to perform well as a clustering technique for unsupervised WSD and examines three entirely different sources of such knowledge for feature selection: WordNet, dependency relations and web N-grams. WSD with an underlying Naïve Bayes model is ultimately positioned on the border between unsupervised and knowledge-based techniques. The benefits of feeding knowledge (of various natures) to a knowledge-lean algorithm for unsupervised WSD that uses the Naïve Bayes model as clustering technique are clearly highlighted. The discussion shows that the Naïve Bayes model still holds promise for the open problem of unsupervised WSD.


62-XX, 91B70, 68-XX, 68Txx, 68T50 Bayesian classification Expectation-Maximization algorithm Feature selection Naïve Bayes model Word sense disambiguation

Authors and affiliations

  • Florentina T. Hristea
    • 1
  1. 1.Department of Computer ScienceUniversity of Bucharest Faculty of Mathematics & Computer SciencBucharestRomania

Bibliographic information

  • DOI
  • Copyright Information The Author(s) 2013
  • Publisher Name Springer, Berlin, Heidelberg
  • eBook Packages Mathematics and Statistics
  • Print ISBN 978-3-642-33692-8
  • Online ISBN 978-3-642-33693-5
  • Series Print ISSN 2191-544X
  • Series Online ISSN 2191-5458
  • Buy this book on publisher's site