Abstract
Sarcasm is primary reason behind the faulty classification of the tweets. The tweets of sarcastic nature appear in the different compositions, but mainly deflect the meaning different than their actual composition. This confuses the classification models and produces false results. In the paper, the primary focus remains upon the classification of sarcastic tweets, which has been accomplished using the textual structure. This involves the expressions of speech, part of speech features, punctuations, term sentiment, affection, etc. All of the features are extracted individually from the target tweet and combined altogether to create the cumulative feature for the target tweet. The proposed model has been observed with accuracy slightly higher than 84%, which depicts the clear improvement in comparison with existing models. The random forest-based classification model has outperformed all other candidates deployed under the experiment. The random forest classifier is observed with accuracy of 84.7, which outperforms the SVM (78.6%), KNN (73.1%), and Maximum entropy (80.5%).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
http://language.worldofcomputing.net/category/machine-translation
S. Tanwar et al., Multimedia big data computing and internet of things applications: a taxonomy and process model. J. Netw. Comput. Appl. (2018)
E. Riloff, A. Qadir, P. Surve, L. De Silva, N. Gilbert, R. Huang, Sarcasm as contrast between a positive sentiment and negative situation, in Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) (2013), pp. 704–714
M.A. Walker, J.E.F. Tree, P. Anand, R. Abbott, J. King, A corpus for research on deliberation and debate, in LREC (2012), pp. 812–817
E. Fersini, F.A. Pozzi, E. Messina, Detecting irony and sarcasm in microblogs: the role of expressive signals and ensemble classifiers, in Proceedings of the IEEE Conference on Data Science and Advanced Analytics, IEEE (2015), pp. 1–8
M. Bouazizi, T. Ohtsuki, A pattern-based approach for sarcasm detection on Twitter. IEEE 4, 5477–5488 (2016)
S. Poria, E. Cambria, D. Hazarika, P. Vij, A deeper look into sarcastic tweets using deep convolutional neural networks (2016)
A. Joshi, S. Agrawal, P. Bhattacharyya, M. Carman, Expect the unexpected: harnessing sentence completion for sarcasm detection, in Proceedings of the International Conference of the Pacific Association for Computational Linguistics (Springer, Singapore, 2017), pp. 275–287
S.K. Bharti, R. Pradhan, K.S. Babu, S.K. Jena, Sarcastic sentiment detection based on types of sarcasm occurring in twitter data. Int. J. Sem. Web Inf. Syst. (IJSWIS) 13, 89–108 (2017)
S.K. Bharti, K.S. Babu, S.K. Jena, Parsing-based sarcasm sentiment recognition in twitter data, in Proceedings of the International Conference on Advances in Social Networks Analysis and Mining, IEEE/ACM (2015), pp. 1373–1380
J. Kaur et al., Text analytical models for data collected from micro-blogging Portal–a review. J. Emerg. Technol. Innov. Res. (JETIR) 5, 81–86 (2018)
M. Khodak, N. Saunshi, K. Vodrahali, A large self-annotated corpus for Sarcasm (2018)
S. Saha, J. Yadav, P. Ranjan, Proposed approach for sarcasm detection in Twitter. Indian J. Sci. Technol. 10 (2017)
P. Deshmukh, S. Solanke, Review paper: sarcasm detection and observing user behavioral. Int. J. Comput. Appl. 166 (2017)
V. Haripriya, P.G. Patil, A survey of sarcasm detection in social media. Int. J. Res. Appl. Sci. Eng. Technol. (IJRASET) (2017)
A. Joshi, P. Bhattacharya, M.J. Carman, Automatic sarcasm detection: a survey. ACM Comput. Surv. (CSUR) 50 (2017)
A.D. Dave, N.P. Desai, A comprehensive study of classification techniques for sarcasm detection on textual data, in Proceedings of the International Conference on Electrical, Electronics and Optimization Techniques (ICEEOT) (2016), pp. 1985–1991
W. Medhat, A. Hassan, H. Korashy, Sentiment analysis algorithms and applications: a survey. Eng. J. 5, 1093–1113 (2014)
Tomas Ptacek, Ivan Habernal and Jun Hong, “Sarcasm detection on Czech and English twitter”, Proceedings of the 25th International Conference on Computational Linguistics: Technical Papers, COLING, pp. 213–223, 2014
D. Khurana, A. Koli, K. Khatter, S. Singh, Natural language processing: state of the art, current trends and challenges (2017)
E. Lunando, A. Purwarianti, Indonesian social media sentiment analysis with sarcasm detection, in Proceedings of the International Conference on Advanced Computer Science and Information Systems (ICACSIS), IEEE ( 2013), pp. 195–198
D. Dmitry, O. Tsur, A. Rappoport, Enchanced sentiment learning using Twitter hashtags and smileys, in Proceedings of the 23rd International Conference on Computational Linguistics: posters (2010), pp. 241–249
B. Ohana, B. Tierney, Sentiment classification of reviews using SentiWordNet (2009)
S.B. Kotsiantis, I. Zaharakis, P. Pintelas, Supervised machine learning: a review of classification techniques, in Emerging Artificial Intelligence Applications in Computer Engineering (2007)
A. Pak, P. Paroubek, Twitter as a corpus for sentiment analysis and opinion mining. LREC 10, 1320–1326 (2010)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this chapter
Cite this chapter
Kumar, R., Kaur, J. (2020). Random Forest-Based Sarcastic Tweet Classification Using Multiple Feature Collection. In: Tanwar, S., Tyagi, S., Kumar, N. (eds) Multimedia Big Data Computing for IoT Applications. Intelligent Systems Reference Library, vol 163. Springer, Singapore. https://doi.org/10.1007/978-981-13-8759-3_5
Download citation
DOI: https://doi.org/10.1007/978-981-13-8759-3_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8758-6
Online ISBN: 978-981-13-8759-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)