Text Classification Using Deep Learning: A Survey

Bhawsar, Samarth; Dubey, Sarthak; Kushwaha, Shashwat; Sharma, Sanjeev

doi:10.1007/978-981-19-2126-1_16

Samarth Bhawsar⁷,
Sarthak Dubey⁷,
Shashwat Kushwaha⁷ &
…
Sanjeev Sharma⁷

Part of the book series: Algorithms for Intelligent Systems ((AIS))

284 Accesses

Abstract

In this paper, we have briefly reviewed the previous paper in this domain. The paper presents some of the state-of-the-art text classification techniques. We have discussed some of the best deep learning classification techniques and word representations. After reviewing several papers, we found that some of the authors had improved their performance by doing better preprocessing while some of them have made changes in the algorithms for better accuracy. We have compared models on different data sets based on their accuracy score. We have also discussed some metrics for evaluating the text classification models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Aggarwal CC, Zhai CX (2012) A survey of text classification algorithms, pp 163–222
Google Scholar
Anaby-Tavor A, Carmeli B, Goldbraich E, Kantor A, Kour G, Shlomov S, Tepper N, Zwerdling N (2020) Do not have enough data? Deep learning to the rescue! 34(05):7383–7390
Google Scholar
Arora G (2002) inltk: Natural language toolkit for indic languages. arXiv preprint arXiv:2009.12534
Cai J, Li J, Li W, Wang J (2018) Deep learning model used in text classification, pp 123–126
Google Scholar
Chen J, Hu Y, Liu J, Xiao Y, Jiang H (2019) Deep short text classification with knowledge powered attention 33(01):6252–6259
Google Scholar
Feldman R (2013) Techniques and applications for sentiment analysis. Commun ACM 56(4):82–89
Google Scholar
Gallo I, Nawaz S, Landro N, La Grassa R (2020) Visual word embedding for text classification, pp 339–352
Google Scholar
Heidarysafa M, Kowsari K, Brown DE, Meimandi KJ, Barnes LE (2018) An improvement of data classification using random multimodel deep learning (rmdl). arXiv preprint arXiv:1808.08121
Joulin A, Grave E, Bojanowski P, Douze M, Jégou H, Mikolov T (2016) Fasttext.zip: compressing text classification models. arXiv preprint arXiv:1612.03651
Joulin A, Grave E, Bojanowski P, Mikolov T (2016) Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759
Karisani P, Karisani N (2021) Semi-supervised text classification via self-pretraining. In: Proceedings of the 14th ACM international conference on web search and data mining, pp 40–48
Google Scholar
Korde V, Namrata Mahender C (2012) Text classification and classifiers: a survey. Int J Artif Intelli Appl 3(2):85
Google Scholar
Kowsari K, Brown DE, Heidarysafa M, Meimandi KJ, Gerber MS, Barnes LE (2017) Hdltex: hierarchical deep learning for text classification, pp 364–371
Google Scholar
Kamran K, Meimandi KJ, Heidarysafa M, Mendu S, Barnes L, Brown D (2019) Text classification algorithms: a survey. Information 10(4):150
Google Scholar
Le Q, Mikolov T (2014) Distributed representations of sentences and documents, pp 1188–1196
Google Scholar
Li Q, Peng H, Li J, Xia C, Yang R, Sun L, Yu PS, He L (2020) A survey on text classification: from shallow to deep learning. arXiv preprint arXiv:2008.00364
Meng Y, Zhang Y, Huang J, Xiong C, Ji H, Zhang C, Han J (2020) Text classification using label names only: a language model self-training approach. arXiv preprint arXiv:2010.07245
Minaee S, Nal K, Erik C, Narjes N, Meysam C, Jianfeng G (2021) Deep learning-based text classification: a comprehensive review. ACM Comput Surv (CSUR) 54(3):1–40
Google Scholar
Nguyen DQ, Vu T, Nguyen AT (2020) Bertweet: a pre-trained language model for english tweets. arXiv preprint arXiv:2005.10200
Ohashi S, Takayama J, Kajiwara T, Arase Y (2021) Distinct label representations for few-shot text classification. In: Proceedings of the 59th annual meeting of the association for computational linguistics and the 11th international joint conference on natural language processing (vol 2: Short Papers), pp 831–836
Google Scholar
Pavlinek M, Vili P (2017) Text classification method based on self-training and lda topic models. Expert Syst Appl 80:83–93
Google Scholar
Wang B, Li C, Pavlu V, Aslam J (2017) Regularizing model complexity and label structure for multi-label text classification. arXiv preprint arXiv:1705.00740
Wang G, Li C, Wang W, Zhang Y, Shen D, Zhang X, Henao R, Carin L (2018) Joint embedding of words and labels for text classification. arXiv preprint arXiv:1805.04174
Wei J, Zou K (2019) Eda: easy data augmentation techniques for boosting performance on text classification tasks. arXiv preprint arXiv:1901.11196
Wu LY, Fisch A, Chopra S, Adams K, Bordes A, Weston J (2018) Starspace: embed all the things!
Google Scholar
Yamada I, Shindo H (2019) Neural attentive bag-of-entities model for text classification. arXiv preprint arXiv:1909.01259
Zhang C, Wu J, Zhu H, Xu K (2021) Tent: text classification based on encoding tree learning. arXiv preprint arXiv:2110.02047
Zhang X, LeCun Y (2017) Which encoding is the best for text classification in Chinese, English, Japanese and Korean? arXiv preprint arXiv:1708.02657

Download references

Author information

Authors and Affiliations

Indian Institute of Information Technology, Pune, India
Samarth Bhawsar, Sarthak Dubey, Shashwat Kushwaha & Sanjeev Sharma

Authors

Samarth Bhawsar
View author publications
You can also search for this author in PubMed Google Scholar
Sarthak Dubey
View author publications
You can also search for this author in PubMed Google Scholar
Shashwat Kushwaha
View author publications
You can also search for this author in PubMed Google Scholar
Sanjeev Sharma
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Samarth Bhawsar .

Editor information

Editors and Affiliations

Indian Institute of Information Technology, Pune, India
Ritu Tiwari
University of Catania, Catania, Italy
Mario F. Pavone
Indian Institute of Information Technology, Pune, India
Ranjith Ravindranathan Nair

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bhawsar, S., Dubey, S., Kushwaha, S., Sharma, S. (2023). Text Classification Using Deep Learning: A Survey. In: Tiwari, R., Pavone, M.F., Ravindranathan Nair, R. (eds) Proceedings of International Conference on Computational Intelligence. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-2126-1_16

Download citation

DOI: https://doi.org/10.1007/978-981-19-2126-1_16
Published: 04 October 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-2125-4
Online ISBN: 978-981-19-2126-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics