Skip to main content

An Approach for Bengali News Headline Classification Using LSTM

  • Conference paper
  • First Online:
Emerging Technologies in Data Mining and Information Security

Abstract

Headline is called the soul of news. Headline carries a very important meaning. Generally, many people start reading the news after seeing the headlines. It is very important for the user to classify the headlines that he/she preferred. Classifying news type based on their headlines is a problem of text classification which lies under natural language processing (NLP) research. In different languages, there are many works done but none of them observed in Bengali. In our work we tried to visualize our very first approach to solve this problem. A LSTM-based architecture is used for solving this problem. A total of 4580 headlines are trained in our model. Finally, our model gives us 91.22% accuracy. The main challenge of our work is finding the right word vector. As far headlines are made up with different types of words and there is no similarity between any of them, so it is difficult to map them.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Hassan, S., Zaidi A.: Urdu News Headline, Text Classification by using different machine learning algorithms (2019)

    Google Scholar 

  2. Liu, X., Rujia, G., Liufu, S.: Internet news headlines classification method based on the N-Gram language model. In: 2012 International Conference on Computer Science and Information Processing (CSIP)

    Google Scholar 

  3. Deshmukh, R.R., Kirange, D.K.: Classifying news headlines for providing user centered e-newspaper using SVM. Int. J. Emerg. Trends & Technol. Comput. Sci. (IJETTCS) 2(3) 2013. www.ijettcs.org

  4. Rana, M.I., Khalid, S., Akbar, M.U.: News classification based on their headlines: a review. ISBN: 978–1–4799–5754–5/14/\$26.00 ©2014 IEEE

    Google Scholar 

  5. Drury, B., Torgo, L., Almeida, J.J.: Classifying news stories to estimate the direction of a stock market index. Paper presented at the information systems and technologies (CISTI) 6th Iberian conference (2011)

    Google Scholar 

  6. Heb, A., Dopichaj, P., Maab, C.: Multi-value classification of very short texts. In: Proceedings of the 31st Annual German Conference on Advances in Artificial Intelligence, pp.70–77. Springer, Berlin (2008)

    Google Scholar 

  7. Kirange, D.K., Deshmukh, R.R.: Emotion classification of news headlines using SVM. Asian J. Comput. Sci. Inf. Technol. 104–106 (2012)

    Google Scholar 

  8. Jia, Y., Chen, Z., Yu, S.: Reader emotion classification of news headlines. In: Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering (NLP-KE), pp. 1–6 (2009)

    Google Scholar 

  9. Dilrukshi, I., De Zoysa, K., Caldera, A.: Twitter news classification using SVM. In: 2013 8th International Conference on Computer Science & Education (ICCSE), pp. 287–291. IEEE (2013)

    Google Scholar 

  10. Rana, M.I., Khalid, D.S., Abid, F., Ali, A., Durrani, M.Y., Aadil, F.: News headlines classification using probabilistic approach. VAWKUM Trans. Comput. Sci. 7(1) (2015)

    Google Scholar 

  11. Yin, Z., Tang, J., Ru, C., Luo, W., Luo, Z., Ma, X.: A Semantic representation enhancement method for Chinese news headline classification. In: Huang, X., et al. (eds.) NLPCC 2017, LNAI 10619. Springer International Publishing AG, pp. 318–328 (2018)

    Google Scholar 

  12. Dadgar, S.M.H., Araghi, M.S., Farahani, M.M.: A novel text mining approach based on TF-IDF and support vector machine for news classification. In: 2nd IEEE International Conference on Engineering and Technology (ICETECH), 17th & 18th March 2016, Coimbatore, TN, India (2016)

    Google Scholar 

  13. Daskalopoulos, V.: Stock price prediction from natural language understanding of news headlines (2021)

    Google Scholar 

  14. Latest Bangladesh national & local news, photo, video. (n.d.). Prothomalo. https://www.prothomalo.com/bangladesh

  15. Beautiful Soup Documentation — Beautiful Soup 4.9.0 documentation. (n.d.). https://www.crummy.com/software/BeautifulSoup/bs4/doc/

  16. Kingma, D.P., Jimmy L.B.: Adam: a method for stochastic optimization.” ICLR 2015: International Conference on Learning Representations (2015)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Rafiuzzaman Bhuiyan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bhuiyan, M.R., Keya, M., Masum, A.K.M., Hossain, S.A., Abujar, S. (2021). An Approach for Bengali News Headline Classification Using LSTM. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1286. Springer, Singapore. https://doi.org/10.1007/978-981-15-9927-9_30

Download citation

Publish with us

Policies and ethics