A Comparative Study of Classifiers for Extractive Text Summarization
Automatic text summarization (ATS) is a widely used approach to condensing text. Over the years, various techniques have been implemented to produce summaries. Extractive summarization is a traditional mechanism for information extraction, in which the important sentences that reflect the basic concepts of the article are selected. In this paper, extractive summarization is treated as a classification problem. Machine learning techniques have been applied to classification problems in various domains. To solve the summarization problem, this paper takes a machine learning approach: K-nearest neighbours (KNN), random forest, support vector machine, multilayer perceptron, decision tree and logistic regression classifiers are implemented on the Newsroom dataset.
Keywords: Text summarization, Extractive, Sentence scoring, Machine learning
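To make the classification framing concrete, the sketch below labels each sentence as "include in summary" (1) or "exclude" (0) with a small K-nearest-neighbours classifier, one of the algorithms listed in the abstract. The abstract does not state which sentence features, hyperparameters or libraries were used, so the three features (relative position, relative length, mean term frequency), the value of `k`, the toy sentences and their labels are all assumptions made for illustration, and the classifier is written in plain Python rather than with any particular toolkit.

```python
import math
import re
from collections import Counter


def sentence_features(sentences):
    """Return [position, length, frequency] per sentence, each scaled to [0, 1].

    These three features are hypothetical stand-ins for sentence-scoring
    features; the abstract does not say which features the paper uses.
    """
    tokenized = [re.findall(r"[a-z]+", s.lower()) for s in sentences]
    tf = Counter(tok for toks in tokenized for tok in toks)
    top_tf = max(tf.values(), default=1)
    max_len = max((len(toks) for toks in tokenized), default=0) or 1
    n = len(sentences)
    feats = []
    for i, toks in enumerate(tokenized):
        position = 1.0 - i / n            # earlier sentences score higher
        length = len(toks) / max_len      # relative to the longest sentence
        freq = sum(tf[t] for t in toks) / (len(toks) * top_tf) if toks else 0.0
        feats.append([position, length, freq])
    return feats


def euclidean(p, q):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))


def knn_predict(train_x, train_y, x, k=3):
    """Majority vote (labels 0/1) over the k training vectors nearest to x."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: euclidean(p[0], x))[:k]
    votes = sum(label for _, label in nearest)
    return 1 if votes * 2 > len(nearest) else 0


# Toy training article with made-up labels (1 = sentence belongs in the summary).
train_sentences = [
    "The city council approved the new transit budget on Monday.",
    "The budget adds three bus lines and extends transit hours.",
    "Council members thanked staff.",
    "The meeting ended at noon.",
]
train_labels = [1, 1, 0, 0]

# Unseen article: classify each sentence, keep those predicted as 1.
test_sentences = [
    "Researchers released a new dataset of news summaries.",
    "The dataset pairs news articles with human-written summaries.",
    "Lunch was served afterwards.",
]

train_x = sentence_features(train_sentences)
test_x = sentence_features(test_sentences)
predicted = [knn_predict(train_x, train_labels, x, k=3) for x in test_x]
summary = [s for s, keep in zip(test_sentences, predicted) if keep == 1]
print(summary)
```

Any of the other listed classifiers could take the place of `knn_predict` here, since each only needs a feature vector per sentence and a binary label; the extractive summary is then simply the set of sentences predicted as positive, kept in document order.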