Abstract
As election day approaches, politics, elections, and candidates come up regularly in conversation. Citizens hope that their favored candidate will win, and they try to estimate how likely each candidate is to be elected. The goal of this article is to forecast the election outcome from Twitter data using NLP and machine learning algorithms. NLP techniques were applied to preprocess the dataset: punctuation and stop words were removed, and lowercasing, tokenization, stemming, and lemmatization were applied so that the machine learning models would perform better. Seven classifiers (LGBMClassifier, LogisticRegression, ExtraTreeClassifier, DecisionTreeClassifier, RandomForestClassifier, GaussianNB, and KNeighborsClassifier) were then trained separately on two input features: the tweet text and the user information. With the tweet text as the input feature, the models reached an accuracy of about 80%, whereas the user information feature yielded an accuracy of roughly 60%. This finding suggests that the tweet column is a far more informative feature than the user information column. In future work, pre-trained models will be used with the goal of achieving a higher accuracy score and applying the approach to the next Korean election.
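The preprocessing steps named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' code: the stop-word list, the `strip_suffix` helper, and the `preprocess` function are hypothetical names introduced here, and the crude suffix stripping only stands in for a real stemmer or lemmatizer, which the abstract does not specify.

```python
import string

# Tiny illustrative stop-word list (an assumption; a real run would use a fuller list).
STOP_WORDS = {"a", "an", "the", "is", "are", "to", "for", "of", "and", "in", "on", "will"}

# Suffixes removed by the stand-in stemmer, checked in this order.
SUFFIXES = ("ing", "ed", "es", "s")


def strip_suffix(token):
    """Crude suffix stripping; a placeholder for a real stemmer or lemmatizer."""
    for suffix in SUFFIXES:
        # Only strip when at least three characters of the token remain.
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def preprocess(tweet):
    """Lowercase, drop punctuation, tokenize on whitespace, drop stop words, stem."""
    lowered = tweet.lower()
    no_punct = lowered.translate(str.maketrans("", "", string.punctuation))
    tokens = no_punct.split()
    kept = [t for t in tokens if t not in STOP_WORDS]
    return [strip_suffix(t) for t in kept]


if __name__ == "__main__":
    print(preprocess("Voting for the candidates is starting!"))
```

The resulting token lists would then be vectorized and passed to the classifiers listed above. Note that removing all punctuation also strips the `#` and `@` markers from hashtags and mentions, which may or may not be desirable for tweet data.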
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Park, J., Cheon, M., Hou, S., Lee, O. (2023). Forecasting Election Result via Artificial Intelligence Approach: NLP and Machine Learning. In: Kumar, S., Hiranwal, S., Purohit, S.D., Prasad, M. (eds) Proceedings of International Conference on Communication and Computational Technologies. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-3951-8_57
DOI: https://doi.org/10.1007/978-981-19-3951-8_57
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3950-1
Online ISBN: 978-981-19-3951-8
eBook Packages: Intelligent Technologies and Robotics (R0)