Sentence-Based Topic Modeling Using Lexical Analysis
- 476 Downloads
Data is not meaningful unless its information could be extracted. In every second in this world, we are generating millions of data over the Internet in different form. Most of them are in text format. Usually, data is written based on any topic, or sometimes on few topics. Following this, identifying topic of any text data is very important. Topic identification may help text summarization tools, text classification tool, etc. Machine learning applications may need less training on their data, only if once the topic of text is identified. Therefore, the demand of topic modeling is higher than ever right now. Data scientists are working day and night to make it more effective and accurate using different methods. Topic modeling focuses on the keywords that can express or identify the topic discussed in the document. Topic modeling can save a lot of time by releasing its user from page-to-page manual reviewing. In this paper, a model has been proposed to find out topic of a document. This model works based on the relations between most frequent words and their relation with sentences in the document. This model can be used to increase the accuracy of the topic modeling.
KeywordsTopic Model Text Summarization Text Categorization Tools Latent Dirichlet Allocation (LDA) Valid Word
We would like to thank Daffodil International University and DIU NLP and Machine Learning Research LAB for all their support and help.
- 2.Rosen-Zvi, M., Griffiths, T., Steyvers, M., Smyth, P.: The author-topic model for authors and documents. In: Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 487–494 (2004)Google Scholar
- 4.Liu, Y., Niculescu-Mizil, A., Gryc, W.: Topic-link LDA: joint models of topic and author community. In: Proceedings of the 26th Annual International Conference on Machine Learning, pp. 665–672 (2009)Google Scholar
- 5.Rakshit, G., Ghosh, A., Bhattacharyya, P., Haffri, G.: Automated analysis of Bangla poetry for classifiation and poet identifiation. IITB-Monash Research Academy, India, IIT Bombay, India Monash University, AustraliaGoogle Scholar
- 6.Das, A., Bandyopadhyay, S.: Topic-based Bengali opinion summarizationGoogle Scholar
- 7.Jiang, H., Zhou, R., Zhang, L., Zhang, Y.: A topic model based on poisson decomposition. In: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management, Singapore, pp 1489–1498, November (2017)Google Scholar
- 8.Ruohonen, J.: Classifying web exploits with topic modeling. In: 28th International Workshop on Database and Expert Systems Applications, Lyon, France (2017). https://doi.org/10.1109/DEXA.2017.35
- 9.Karami, A., Gangopadhyay, A., Zhou B., Kharrazi, H.: Fuzzy approach topic modeling for health and medical corpora. Int. J. Fuzzy Syst. (2017)Google Scholar
- 10.Zhai, C.: Probabilistic topic models for text data retrieval and analysis. In: 40th International ACM SIGIR Conference, Shinjuku, Tokyo, Japan, pp. 1399–1401 (2017)Google Scholar