Abstract
Twitter data has been used to improve political campaigns, product quality, and sentiment analysis over the last several years. An essential and collaborative effort for many businesses is to classify tweets based on user sentiment. A machine learning classifier is proposed in this research in order to aid in sentiment analysis for these types of organizations. The classifier employs a soft voting process based on Logistic Regression (LR) to determine the final prediction. Based on the content and tone of the tweets, we divided them into three categories: “good,” “negative,” and “neutral.” Additionally, the accuracy and F1-scores were used to assess the performance of several machine learning classifiers. Classification accuracy was also examined in terms of feature extraction strategies, such as term frequencies, Inverse Document Frequencies (TF-IDF), and Words-To-Vectors (W2V). Furthermore, the performance of the Deep Long-Term Memory (DLTM) network was evaluated on the dataset. Compared to other classifiers, the presented classifier performs better. With TF-IDF feature extraction, the LR can attain an accuracy of 0.9616 and an F1-score of 0.7633. According to these findings, ensemble classifiers outperform non-ensemble classifiers. According to experiments, using TF-IDF as a feature extraction approach improves the performance of machine learning classifiers. The extraction of W2V features is less efficient than the extraction of TF-IDF features.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Khalid M, Ashraf I, Mehmood A et al (2020) GBSVM: sentiment classification from unstructured reviews using ensemble classifier. Appl Sci 10:2788. https://doi.org/10.3390/app10082788
Shoumy N, Ang L, Seng K et al (2020) Multimodal big data affective analytics: a comprehensive survey using text, audio, visual and physiological signals. J Netw Comput Appl 149:102447. https://doi.org/10.1016/j.jnca.2019.102447
Schnebly J, Sengupta S (2019) Random forest twitter bot classifier. In: 2019 IEEE 9th annual computing and communication workshop and conference (CCWC). IEEE, pp 0506–0512
Charbuty B, Abdulazeez A (2021) Classification based on decision tree algorithm for machine learning. J Appl Sci Technol Trends 2(01):20–28
Olaru C, Wehenkel L (2003) A complete fuzzy decision tree technique. Fuzzy Sets Syst 138(2):221–254
Rathi M, Malik A, Varshney D, Sharma R, Mendiratta S (2018) Sentiment analysis of tweets using machine learning approach. In: 2018 eleventh international conference on contemporary computing (IC3). IEEE, pp 1–3
Zhu L, Yang Y (2016) Improvement of decision tree ID3 algorithm. In: International conference on collaborative computing: networking, applications and worksharing. Springer, Cham, pp 595–600
Kaewrod N, Jearanaitanakij K (2018) Improving ID3 algorithm by ignoring minor instances. In: 2018 22nd international computer science and engineering conference (ICSEC). IEEE, pp 1–5
Hamad Y, Mohammed OKJ, Simonov K (2019) Evaluating of tissue germination and growth rate of ROI on implants of electron scanning microscopy images. In: Proceedings of the 9th international conference on information systems and technologies, pp 1–7
Devi BL, Bai VV, Ramasubbareddy S, Govinda K (2020) Sentiment analysis on movie reviews. In: Emerging research in data engineering systems and computer communications. Springer, Singapore, pp 321–328
Guerreiro J, Rita P (2020) How to predict explicit recommendations in online reviews using text mining and sentiment analysis. J Hosp Tour Manag 43:269–272
Mehta RP, Sanghvi MA, Shah DK, Singh A (2020) Sentiment analysis of tweets using supervised learning algorithms. In: First international conference on sustainable technologies for computational intelligence. Springer, Singapore, pp 323–338
Zhang J (2020) Sentiment analysis of movie reviews in Chinese
López-Chau A, Valle-Cruz D, Sandoval-Almazán R (2020) Sentiment analysis of Twitter data through machine learning techniques. In: Software engineering in the era of cloud computing. Springer, Cham, pp 185–209
Addi HA, Ezzahir R, Mahmoudi A (2020) Three-level binary tree structure for sentiment classification in Arabic text. In: Proceedings of the 3rd international conference on networking, information systems & security, pp 1–8
Patel R, Passi K (2020) Sentiment analysis on Twitter data of world cup soccer tournament using machine learning. IoT 1(2):218–239
Wang Y, Chen Q, Shen J, Hou B, Ahmed M, Li Z (2021) Aspect-level sentiment analysis based on gradual machine learning. Knowl-Based Syst 212:106509
Baccouche A, Garcia-Zapirain B, Elmaghraby A (2018) Annotation technique for health-related tweets sentiment analysis. In: 2018 IEEE international symposium on signal processing and information technology (ISSPIT). IEEE, pp 382–387
Hameed Z, Garcia-Zapirain B (2020). Sentiment classification using a single-layered BiLSTM model. IEEE Access 8:73992–74001
Zhang M (2020) E-commerce comment sentiment classification based on deep learning. In: 2020 IEEE 5th international conference on cloud computing and big data analytics (ICCCBDA). IEEE, pp 184–187
Mandloi L, Patel R (2020) Twitter sentiments analysis using machine learning methods. In: 2020 international conference for emerging technology (INCET). IEEE, pp 1–5
Misopoulos F, Mitic M, Kapoulas A, Karapiperis C (2014) Uncovering customer service experiences with Twitter: the case of airline industry. Manage Decis
Hamad YA, Simonov K, Naeem MB (2019) Lung boundary detection and classification in chest X-rays images based on neural network. In: International conference on applied computing to support industry: innovation and technology. Springer, Cham, pp 3–16
Kirasich K, Smith T, Sadler B (2018) Random forest vs logistic regression: binary classification for heterogeneous datasets. SMU Data Sci Rev 1(3):9
Meier L, Van De Geer S, Bühlmann P (2008) The group lasso for logistic regression. J Roy Stat Soc Ser B (Stat Methodol) 70(1):53–71
Nelder JA, Wedderburn RW (1972) Generalized linear models. J Roy Stat Soc Ser A (Gen) 135(3):370–384
Kabaev E, Hamad Y, Simonov K, Zotin A (2020) Visualization and analysis of the shoulder joint biomechanics in postoperative rehabilitation. In: SibDATA, pp 34–41
Cameron AC, Windmeijer FA (1997) An R-squared measure of goodness of fit for some common nonlinear regression models. J Econometrics 77(2):329–342
Ayer T, Chhatwal J, Alagoz O, Kahn CE Jr, Woods RW, Burnside ES (2010) Comparison of logistic regression and artificial neural network models in breast cancer risk estimation. Radiographics 30(1):13–22
Cummins N, Amiriparian S, Ottl S, Gerczuk M, Schmitt M, Schuller B (2018) Multimodal bag-of-words for cross domains sentiment analysis. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP). IEEE, pp 4954–4958
Kadhim AI (2019) Term weighting for feature extraction on Twitter: a comparison between BM25 and TF-IDF. In: 2019 international conference on advanced science and engineering (ICOASE). IEEE, pp 124–128
Soares ER, Barrére E (2019) An optimization model for temporal video lecture segmentation using word2vec and acoustic features. In: Proceedings of the 25th Brazillian symposium on multimedia and the web, pp 513–520
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Shihab, F.F., Ekmekci, D. (2023). Tweet Classification on the Base of Sentiments Using Deep Learning. In: Shukla, P.K., Singh, K.P., Tripathi, A.K., Engelbrecht, A. (eds) Computer Vision and Robotics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-7892-0_12
Download citation
DOI: https://doi.org/10.1007/978-981-19-7892-0_12
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-7891-3
Online ISBN: 978-981-19-7892-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)