Adaptive Term Weighting through Stochastic Optimization
Term weighting strongly influences the performance of text mining and information retrieval approaches. Term weights are usually determined by statistical estimates based on static weighting schemes. Such static approaches lack the capability to generalize across domains and data sets. In this paper, we introduce an on-line learning method that adapts term weights in a supervised manner. Using stochastic optimization, we determine a linear transformation of the term space that approximates expected similarity values among documents. We evaluate our approach on 18 standard text data sets and show that using adaptive term weighting as a preprocessing step improves the performance of a k-NN classifier by between 1% and 12%. Further, we provide empirical evidence that our approach is efficient enough to cope with larger problems.
Keywords: Information Retrieval, Linear Transformation, Gradient Descent, Weighting Scheme, Text Mining
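The abstract does not give the objective function or the form of the transformation, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a diagonal transformation (one weight per term), a target similarity of 1.0 for documents with the same label and 0.0 otherwise, and a squared-error loss minimized by stochastic gradient descent over document pairs. The function names, the toy data, and the hyperparameters are all hypothetical.

```python
import random


def weighted_similarity(weights, x, y):
    """Dot product of two documents after scaling each term by its weight."""
    return sum((w * a) * (w * b) for w, a, b in zip(weights, x, y))


def sgd_term_weights(docs, labels, epochs=200, lr=0.05, seed=0):
    """Learn one weight per term by SGD on the squared similarity error.

    Assumption (not from the paper): the expected similarity of a pair is
    1.0 when both documents share a label and 0.0 otherwise.
    """
    rng = random.Random(seed)
    n_terms = len(docs[0])
    weights = [1.0] * n_terms  # start from uniform term weights
    pairs = [(i, j) for i in range(len(docs)) for j in range(i + 1, len(docs))]
    for _ in range(epochs):
        rng.shuffle(pairs)  # visit the pairs in a fresh random order each epoch
        for i, j in pairs:
            x, y = docs[i], docs[j]
            target = 1.0 if labels[i] == labels[j] else 0.0
            error = weighted_similarity(weights, x, y) - target
            # Gradient of (sim - target)^2 w.r.t. w_k is 4 * error * w_k * x_k * y_k.
            for k in range(n_terms):
                weights[k] -= lr * 4.0 * error * weights[k] * x[k] * y[k]
    return weights


# Toy term-frequency vectors: term 0 marks class "a", term 1 marks class "b",
# and term 2 occurs in every document, so it carries no class information.
docs = [
    [1.0, 0.0, 1.0],
    [1.0, 0.0, 1.0],
    [0.0, 1.0, 1.0],
    [0.0, 1.0, 1.0],
]
labels = ["a", "a", "b", "b"]

weights = sgd_term_weights(docs, labels)
same_class = weighted_similarity(weights, docs[0], docs[1])
cross_class = weighted_similarity(weights, docs[0], docs[2])
print("learned term weights:", [round(w, 3) for w in weights])
print("same-class similarity:", round(same_class, 3))
print("cross-class similarity:", round(cross_class, 3))
```

In this toy setting the weight of the uninformative shared term shrinks relative to the two discriminative terms, which is the kind of effect a supervised weighting scheme aims for. The reweighted vectors could then be passed to a k-NN classifier as a preprocessing step.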