
Sentence-Based Topic Modeling Using Lexical Analysis

  • Shahinur Rahman
  • Sheikh Abujar
  • S. M. Mazharul Hoque Chowdhury
  • Mohd. Saifuzzaman
  • Syed Akhter Hossain
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 814)

Abstract

Data is not meaningful unless information can be extracted from it. Every second, millions of data items are generated over the Internet in different forms, most of them as text. A text is usually written about one topic, or sometimes a few topics, so identifying the topic of text data is very important. Topic identification can help tools such as text summarizers and text classifiers. Machine learning applications may need less training on their data once the topic of the text has been identified. The demand for topic modeling is therefore higher than ever, and data scientists are working to make it more effective and accurate using different methods. Topic modeling focuses on the keywords that express or identify the topic discussed in a document, and it can save a lot of time by freeing its users from page-by-page manual review. In this paper, a model is proposed to find the topic of a document. The model is based on the relations among the most frequent words and the relations between those words and the sentences in the document. It can be used to increase the accuracy of topic modeling.
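
The abstract describes the approach only at a high level, so the following is a minimal illustrative sketch of the general idea (frequent words related to the sentences that contain them) and not the authors' actual model. The stop-word list, the scoring rule (word frequency multiplied by the number of sentences containing the word), and all function names are assumptions made for illustration.

```python
import re
from collections import Counter

# Hypothetical stop-word list; the paper's own word-filtering rules are not given here.
STOP_WORDS = {"a", "an", "the", "is", "are", "of", "in", "on",
              "and", "to", "it", "for", "with", "that", "this"}


def split_sentences(text):
    """Split text into sentences on ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def tokenize(sentence):
    """Lowercase a sentence and keep alphabetic words that are not stop words."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower())
            if w not in STOP_WORDS]


def topic_keywords(text, top_n=3):
    """Rank words by (total frequency) x (number of sentences containing the word).

    This scoring rule is an assumption, chosen only to illustrate combining
    word frequency with word-to-sentence relations.
    """
    tokenized = [tokenize(s) for s in split_sentences(text)]
    frequency = Counter(w for words in tokenized for w in words)
    coverage = Counter(w for words in tokenized for w in set(words))
    scores = {w: frequency[w] * coverage[w] for w in frequency}
    # Sort by descending score, breaking ties alphabetically for determinism.
    return sorted(scores, key=lambda w: (-scores[w], w))[:top_n]


if __name__ == "__main__":
    sample = "Topic models find topics. A topic model reads text. Cats sleep."
    print(topic_keywords(sample, top_n=1))
```

In this sketch a word that is frequent but confined to one sentence scores lower than a word spread across many sentences, which is one plausible way to favor document-level topics over local detail.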

Keywords

  • Topic Model
  • Text Summarization
  • Text Categorization Tools
  • Latent Dirichlet Allocation (LDA)
  • Valid Word
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Notes

Acknowledgements

We would like to thank Daffodil International University and DIU NLP and Machine Learning Research LAB for all their support and help.

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  • Shahinur Rahman (1)
  • Sheikh Abujar (1)
  • S. M. Mazharul Hoque Chowdhury (1)
  • Mohd. Saifuzzaman (1)
  • Syed Akhter Hossain (1)

  1. Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
