Abstract
Facebook, Twitter, LinkedIn and Tumblr are online social networking platforms where the users send and receive messages on the topic of their choice and express their sentiments. The usage of these sites has exponentially increased over the last few years, thereby increasing the information posted on online social media sites. The quantity of information/tweets keeps increasing on a daily basis. Twitter has become a stable platform to identify personality-related indicators and encrypted in user profiles and pages related to a subject. In this proposed work, we present a scalable real-time system for sentiment analysis of Twitter data. This work will collect tweets of the users in real time and thus provide a basis to identify each tweet into either positive or negative based on the mind-set of the user, thereby providing a real-time analysis of the users regarding a certain topic. The system relies on feature extraction from the tweets generated in real time. A supervised learning approach based on ensemble learning is used to train various classifiers based on the features extracted. A design and implementation in Flask and Celery has been carried out which contains the feature extraction and classification tasks. The system is scalable with respect to the size of the input data and the rate of data arrival. The merits of the proposed system in terms of scalability, performance and classification accuracy was evaluated experimentally.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
I. Hemalatha, G.P.S. Varma, A. Govardhan, Social network analysis and mining using machine learning techniques. Assoc. Adv. Artif. Intell. 0020–0190, 603–607 (2012)
B. Liu. Sentiment Analysis and Opinion Mining. Assoc. Advancement of Artif. Intell. 1947–4040 (2013)
M.A. Olowe, M.M. Gaber, F. Stahl. A Survey of data mining techniques for social media analysis. ICAISC, Int. Conf. Artif. Intell. Soft Comput. J. Data Mining Digital Humanit. ACM Sigmod Record 34(2), 18–26 (2014)
B. Pang, L. Lee, Opinion mining and sentiment analysis. Found. Trends Inform. Retrieval 2(1–2), 1–135 (2008)
D. Markovikj, S. Gievska, M. Kosinski, D. Stillwell. Mining facebook data for predictive personality modelling. Assoc. Advancement Artifi. Intell. (www.aaai.org), 110(15), 5802–5805 (2013)
C. Costea, D. Joyeux, O. Hasan, L. Brunie. A Study and Comparison of Sentiment Analysis Methods for Reputation Evaluation. A. Collomb.Laboratoire d’InfoRmatique en Image et Systèmes d’information, Rapport de Recherche, RR-LIRIS-2014–002 (2014)
E.F.G.M. Beig, Data mining techniques for web mining: A review. Appl. Math. Eng. Manage. Technol. 3(5), 81–90 (2015)
L.R. Goldberg. A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. 7–29 (1992)
K. Tao, C. Hauff, G.J. Houben, F. Abel, G. Wachsmuth. Facilitating twitter data analytics: platform, language and functionality. IEEE Int. Conf. 421–430 (2014)
A. Pak, P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. in Proceedings of the Seventh Conference on International Language Resources and Evaluation, (2010), pp. 1320–1326
R. Parikh, M. Movassate. Sentiment analysis of user-generated twitter updates using various classification techniques. CS224 N Final Report (2009)
A. Go, R. Bhayani, L. Huang. Twitter sentiment classification using distant supervision. Stanford University, Technical Paper (2009)
A. Agarwal, B. Xie, I. Vovsha, O. Rambow R. Passonneau. Sentiment analysis of twitter data. in Proceedings of the ACL 2011 Workshop on Languages in Social Media, (2011), pp. 30–38
D. Davidova, A. Rappoport. Enhanced sentiment learning using twitter hash tags and smiley’s Cooling: Poster. 241–249 (2010)
B. Wagh, J.V. Shinde, P.A. Kale, A Twitter sentiment analysis using NLTK and machine learning techniques. Int. J. Emerg. Res. Manage. Technol. 6(12), 2278–9359 (2017)
F. Rosenblatt, Principles of neurodynamics: Perceptrons and the theory of brain mechanisms (Spartan Books, Washington DC, 1961)
D.E. Rumelhart, G.E. Hinton, R.J. Williams. Learning internal representations by error propagation. Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1 (Foundation, MIT Press, 1986)
G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics Control, Signals Syst. 2(4), 303–314 (1989)
A.O. Steinskog J.F. Therkelsen. Characterizing Twitter data using sentiment analysis and topic modeling. Artificial Intelligence Group Department of Computer and Information Science Faculty of Information Technology, Mathematics and Electrical Engineering. Master’s Thesis, Spring (2016)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sengupta, A., Ghosh, A. (2020). Mining Social Network Data for Predictive Personality Modelling by Employing Machine Learning Techniques. In: Maharatna, K., Kanjilal, M., Konar, S., Nandi, S., Das, K. (eds) Computational Advancement in Communication Circuits and Systems. Lecture Notes in Electrical Engineering, vol 575. Springer, Singapore. https://doi.org/10.1007/978-981-13-8687-9_11
Download citation
DOI: https://doi.org/10.1007/978-981-13-8687-9_11
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-8686-2
Online ISBN: 978-981-13-8687-9
eBook Packages: EngineeringEngineering (R0)