Random Forest-Based Sarcastic Tweet Classification Using Multiple Feature Collection

Kumar, Rajeev; Kaur, Jasandeep

doi:10.1007/978-981-13-8759-3_5

Part of the book series: Intelligent Systems Reference Library (ISRL, volume 163)

Abstract

Sarcasm is a primary cause of faulty tweet classification. Sarcastic tweets appear in many different compositions, but they generally convey a meaning that differs from their literal wording. This confuses classification models and produces false results. The primary focus of the paper is the classification of sarcastic tweets, which is accomplished using textual structure. The features involved include expressions of speech, part-of-speech features, punctuation, term sentiment, affection, etc. Each feature is extracted individually from the target tweet, and all are then combined into a cumulative feature for that tweet. The proposed model achieves an accuracy slightly higher than 84%, a clear improvement over existing models. The random forest-based classification model outperformed all other candidates deployed in the experiment: the random forest classifier achieved an accuracy of 84.7%, compared with SVM (78.6%), KNN (73.1%), and maximum entropy (80.5%).
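The extract-then-combine step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the toy lexicon, the interjection list, the function names, and the choice of counts as features are all assumptions made for illustration, and the part-of-speech and affection features are omitted because they would require an external tagger or resource.

```python
import re

# Hypothetical toy sentiment lexicon (assumption; the abstract names no lexicon).
TOY_LEXICON = {"love": 1, "great": 1, "wonderful": 1,
               "hate": -1, "awful": -1, "stuck": -1}

# Hypothetical interjections standing in for "expressions of speech" (assumption).
TOY_INTERJECTIONS = {"wow", "yay", "oh", "ugh"}


def punctuation_features(tweet):
    """Counts of exclamation marks, question marks, and ellipses."""
    return [tweet.count("!"), tweet.count("?"), tweet.count("...")]


def sentiment_features(tokens):
    """Number of positive and number of negative lexicon terms."""
    scores = [TOY_LEXICON.get(token, 0) for token in tokens]
    positives = sum(1 for score in scores if score > 0)
    negatives = sum(1 for score in scores if score < 0)
    return [positives, negatives]


def interjection_features(tokens):
    """Number of tokens found in the toy interjection list."""
    return [sum(1 for token in tokens if token in TOY_INTERJECTIONS)]


def cumulative_features(tweet):
    """Extract each feature group separately, then concatenate them."""
    tokens = re.findall(r"[a-z']+", tweet.lower())
    return (punctuation_features(tweet)
            + sentiment_features(tokens)
            + interjection_features(tokens))


vector = cumulative_features("Wow, I love being stuck in traffic!!!")
print(vector)  # [3, 0, 0, 1, 1, 1]
```

A vector of this kind, computed for every labeled tweet, could then be passed to any standard classifier, for example scikit-learn's `RandomForestClassifier`, to compare models as the abstract reports.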

Author information

Authors and Affiliations

DAV Institute of Engineering and Technology, Jalandhar, Punjab, India
Rajeev Kumar & Jasandeep Kaur

Corresponding author

Correspondence to Jasandeep Kaur.

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, Institute of Technology, Nirma University, Ahmedabad, Gujarat, India
Sudeep Tanwar
Department of Electronics and Communication Engineering, Thapar Institute of Engineering and Technology, Deemed University, Patiala, Punjab, India
Sudhanshu Tyagi
Department of Computer Science and Engineering, Thapar Institute of Engineering and Technology, Deemed University, Patiala, Punjab, India
Neeraj Kumar

About this chapter

Cite this chapter

Kumar, R., Kaur, J. (2020). Random Forest-Based Sarcastic Tweet Classification Using Multiple Feature Collection. In: Tanwar, S., Tyagi, S., Kumar, N. (eds) Multimedia Big Data Computing for IoT Applications. Intelligent Systems Reference Library, vol 163. Springer, Singapore. https://doi.org/10.1007/978-981-13-8759-3_5

DOI: https://doi.org/10.1007/978-981-13-8759-3_5
Published: 18 July 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8758-6
Online ISBN: 978-981-13-8759-3
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
