Skip to main content

Mining Social Network Data for Predictive Personality Modelling by Employing Machine Learning Techniques

  • Conference paper
  • First Online:
Computational Advancement in Communication Circuits and Systems

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 575))

Abstract

Facebook, Twitter, LinkedIn and Tumblr are online social networking platforms where the users send and receive messages on the topic of their choice and express their sentiments. The usage of these sites has exponentially increased over the last few years, thereby increasing the information posted on online social media sites. The quantity of information/tweets keeps increasing on a daily basis. Twitter has become a stable platform to identify personality-related indicators and encrypted in user profiles and pages related to a subject. In this proposed work, we present a scalable real-time system for sentiment analysis of Twitter data. This work will collect tweets of the users in real time and thus provide a basis to identify each tweet into either positive or negative based on the mind-set of the user, thereby providing a real-time analysis of the users regarding a certain topic. The system relies on feature extraction from the tweets generated in real time. A supervised learning approach based on ensemble learning is used to train various classifiers based on the features extracted. A design and implementation in Flask and Celery has been carried out which contains the feature extraction and classification tasks. The system is scalable with respect to the size of the input data and the rate of data arrival. The merits of the proposed system in terms of scalability, performance and classification accuracy was evaluated experimentally.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. I. Hemalatha, G.P.S. Varma, A. Govardhan, Social network analysis and mining using machine learning techniques. Assoc. Adv. Artif. Intell. 0020–0190, 603–607 (2012)

    Google Scholar 

  2. B. Liu. Sentiment Analysis and Opinion Mining. Assoc. Advancement of Artif. Intell. 1947–4040 (2013)

    Google Scholar 

  3. M.A. Olowe, M.M. Gaber, F. Stahl. A Survey of data mining techniques for social media analysis. ICAISC, Int. Conf. Artif. Intell. Soft Comput. J. Data Mining Digital Humanit. ACM Sigmod Record 34(2), 18–26 (2014)

    Google Scholar 

  4. B. Pang, L. Lee, Opinion mining and sentiment analysis. Found. Trends Inform. Retrieval 2(1–2), 1–135 (2008)

    Article  Google Scholar 

  5. D. Markovikj, S. Gievska, M. Kosinski, D. Stillwell. Mining facebook data for predictive personality modelling. Assoc. Advancement Artifi. Intell. (www.aaai.org), 110(15), 5802–5805 (2013)

  6. C. Costea, D. Joyeux, O. Hasan, L. Brunie. A Study and Comparison of Sentiment Analysis Methods for Reputation Evaluation. A. Collomb.Laboratoire d’InfoRmatique en Image et Systèmes d’information, Rapport de Recherche, RR-LIRIS-2014–002 (2014)

    Google Scholar 

  7. E.F.G.M. Beig, Data mining techniques for web mining: A review. Appl. Math. Eng. Manage. Technol. 3(5), 81–90 (2015)

    Google Scholar 

  8. L.R. Goldberg. A broad-bandwidth, public domain, personality inventory measuring the lower-level facets of several five-factor models. 7–29 (1992)

    Google Scholar 

  9. K. Tao, C. Hauff, G.J. Houben, F. Abel, G. Wachsmuth. Facilitating twitter data analytics: platform, language and functionality. IEEE Int. Conf. 421–430 (2014)

    Google Scholar 

  10. A. Pak, P. Paroubek. Twitter as a corpus for sentiment analysis and opinion mining. in Proceedings of the Seventh Conference on International Language Resources and Evaluation, (2010), pp. 1320–1326

    Google Scholar 

  11. R. Parikh, M. Movassate. Sentiment analysis of user-generated twitter updates using various classification techniques. CS224 N Final Report (2009)

    Google Scholar 

  12. A. Go, R. Bhayani, L. Huang. Twitter sentiment classification using distant supervision. Stanford University, Technical Paper (2009)

    Google Scholar 

  13. A. Agarwal, B. Xie, I. Vovsha, O. Rambow R. Passonneau. Sentiment analysis of twitter data. in Proceedings of the ACL 2011 Workshop on Languages in Social Media, (2011), pp. 30–38

    Google Scholar 

  14. D. Davidova, A. Rappoport. Enhanced sentiment learning using twitter hash tags and smiley’s Cooling: Poster. 241–249 (2010)

    Google Scholar 

  15. B. Wagh, J.V. Shinde, P.A. Kale, A Twitter sentiment analysis using NLTK and machine learning techniques. Int. J. Emerg. Res. Manage. Technol. 6(12), 2278–9359 (2017)

    Google Scholar 

  16. F. Rosenblatt, Principles of neurodynamics: Perceptrons and the theory of brain mechanisms (Spartan Books, Washington DC, 1961)

    Book  Google Scholar 

  17. D.E. Rumelhart, G.E. Hinton, R.J. Williams. Learning internal representations by error propagation. Parallel Distributed Processing: Explorations in the Microstructure of Cognition, vol. 1 (Foundation, MIT Press, 1986)

    Google Scholar 

  18. G. Cybenko. Approximation by superpositions of a sigmoidal function. Mathematics Control, Signals Syst. 2(4), 303–314 (1989)

    Article  MathSciNet  Google Scholar 

  19. A.O. Steinskog J.F. Therkelsen. Characterizing Twitter data using sentiment analysis and topic modeling. Artificial Intelligence Group Department of Computer and Information Science Faculty of Information Technology, Mathematics and Electrical Engineering. Master’s Thesis, Spring (2016)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arjun Sengupta .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sengupta, A., Ghosh, A. (2020). Mining Social Network Data for Predictive Personality Modelling by Employing Machine Learning Techniques. In: Maharatna, K., Kanjilal, M., Konar, S., Nandi, S., Das, K. (eds) Computational Advancement in Communication Circuits and Systems. Lecture Notes in Electrical Engineering, vol 575. Springer, Singapore. https://doi.org/10.1007/978-981-13-8687-9_11

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-8687-9_11

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-8686-2

  • Online ISBN: 978-981-13-8687-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics