Abstract
Text summarization compresses a long document into a shorter version that preserves the main points of the original text. The growing volume of long posts online makes summarization increasingly valuable, because reading an entire document and writing a good summary by hand takes time and effort. An automated text summarization system built with machine learning and natural language processing can solve this problem. Our proposed system automatically generates an abstractive summary of a long text in a short time. We carried out the entire analysis on Bengali text. Our model uses a sequence-to-sequence RNN with LSTM in the encoding layer. The architecture consists of an RNN encoder and decoder: the encoder takes the text document as input, and the decoder generates a short summary as output. The system contributes in two ways: it improves summarization and establishes benchmark performance with low training loss. To train the model, we use our own dataset, built from various online media, articles, Facebook, and individuals' personal posts. The main challenges we faced were Bengali text processing, limited text length, and a lack of sufficient resources for collecting text.
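The encoder-decoder flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the weights are random and untrained, the class names (`LSTMCell`, `Seq2Seq`), the layer sizes, and the integer token ids standing in for Bengali words are all assumptions made for illustration. It only shows how an LSTM encoder compresses the input into a state that an LSTM decoder then unrolls, one token at a time, into a shorter output.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def make_matrix(rows, cols, rng):
    # Small random weights; a real model would learn these during training.
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class LSTMCell:
    """A single LSTM layer with untrained weights (illustration only)."""

    GATES = ("input", "forget", "output", "candidate")

    def __init__(self, input_size, hidden_size, rng):
        self.weights = {
            gate: make_matrix(hidden_size, input_size + hidden_size, rng)
            for gate in self.GATES
        }

    def step(self, x, h, c):
        z = x + h  # concatenate the input with the previous hidden state
        pre = {gate: matvec(self.weights[gate], z) for gate in self.GATES}
        i = [sigmoid(v) for v in pre["input"]]
        f = [sigmoid(v) for v in pre["forget"]]
        o = [sigmoid(v) for v in pre["output"]]
        g = [math.tanh(v) for v in pre["candidate"]]
        c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
        h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
        return h_new, c_new


class Seq2Seq:
    """Hypothetical encoder-decoder wrapper; sizes are arbitrary assumptions."""

    def __init__(self, vocab_size, embed_size=8, hidden_size=16, seed=0):
        rng = random.Random(seed)
        self.hidden_size = hidden_size
        self.embed = make_matrix(vocab_size, embed_size, rng)
        self.encoder = LSTMCell(embed_size, hidden_size, rng)
        self.decoder = LSTMCell(embed_size, hidden_size, rng)
        self.out = make_matrix(vocab_size, hidden_size, rng)

    def encode(self, token_ids):
        # Read the whole source text and keep only the final LSTM state.
        h = [0.0] * self.hidden_size
        c = [0.0] * self.hidden_size
        for token in token_ids:
            h, c = self.encoder.step(self.embed[token], h, c)
        return h, c

    def summarize(self, token_ids, start_id, end_id, max_len=5):
        # Greedy decoding: start from the encoder state and pick the
        # highest-scoring token at each step until end_id or max_len.
        h, c = self.encode(token_ids)
        token, summary = start_id, []
        for _ in range(max_len):
            h, c = self.decoder.step(self.embed[token], h, c)
            scores = matvec(self.out, h)
            token = max(range(len(scores)), key=scores.__getitem__)
            if token == end_id:
                break
            summary.append(token)
        return summary


model = Seq2Seq(vocab_size=20)
source = [5, 7, 9, 11, 13, 4]  # made-up token ids for a source text
print(model.summarize(source, start_id=1, end_id=2))
```

Because the weights are untrained, the printed token ids carry no meaning. In a trained system, the same loop would be preceded by Bengali tokenization and followed by mapping the predicted ids back to words.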
Acknowledgements
We are grateful to the NLP laboratory of Daffodil International University, which provided all the facilities for our work. We are also grateful to our department head and our supervisor, who helped us overcome the obstacles we faced in this work.
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Fouzia, F.A., Rahat, M.A., Alie - Al - Mahdi, M.T., Masum, A.K.M., Abujar, S., Hossain, S.A. (2021). A Bengali Text Summarization Using Encoder-Decoder Based on Social Media Dataset. In: Hassanien, A.E., Bhattacharyya, S., Chakrabati, S., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 1300. Springer, Singapore. https://doi.org/10.1007/978-981-33-4367-2_51
DOI: https://doi.org/10.1007/978-981-33-4367-2_51
Publisher Name: Springer, Singapore
Print ISBN: 978-981-33-4366-5
Online ISBN: 978-981-33-4367-2
eBook Packages: Intelligent Technologies and Robotics (R0)