Abstract
In this paper document classification methods using multinomial naïve Bayes are improved in a number of ways. We use the value weighting method, a new fine-grained weighting method, to calculate the weights of the feature values. Our experiments show that the proposed approach outperforms other state-of-the-art methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
McCallum, A., Nigam, K.: A comparison of event models for Naive Bayes text classification. In: Proc. AAAI-1998 Workshop on Learning for Text Categorization. AAAI Press, pp. 41–48 (1998)
Rennie, J., Shih, L., Teevan, J., Karger, D.: Tackling the poor assumptions of naive bayes text classifiers. In: Proceedings of the 20th International Conference on Machine Learning (ICML), pp. 616–623 (2003)
Kullback, S., Leibler, R.A.: On Information and Sufficiency. The Annals of Mathematical Statistics, 79–86 (1951)
Lee, C.H., Gutierrez, F., Dou., D.: Calculating Feature Weights in Naive Bayes with Kullback-Leibler Measure. In: 11th IEEE International Conference on Data Mining, pp. 1146–1151. IEEE Press, Vancouver (2011)
Kim, S., Han, K., Rim, H., Myaeng, S.: Some effective techniques for naive bayes text classification. IEEE Transactions on Knowledge and Data Engineering 18(11), 1457–1466 (2006)
Tunedit Data Mining Blog. http://tunedit.org/
UCI Machine Learning Repository. http://archive.ics.uci.edu/ml/
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Song, SH., Lee, CH. (2015). Improving Document Classification Using Fine-Grained Weights. In: Ali, M., Kwon, Y., Lee, CH., Kim, J., Kim, Y. (eds) Current Approaches in Applied Artificial Intelligence. IEA/AIE 2015. Lecture Notes in Computer Science(), vol 9101. Springer, Cham. https://doi.org/10.1007/978-3-319-19066-2_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-19066-2_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19065-5
Online ISBN: 978-3-319-19066-2
eBook Packages: Computer ScienceComputer Science (R0)