Skip to main content

Collecting and Processing Multilingual Streaming Tweets for Sentiment Analysis

  • Conference paper
  • First Online:
International Conference on Information Technology and Communication Systems (ITCS 2017)

Abstract

Sentiment analysis is a new field of study that allows to transform the publications on the social networks into exploitable data to analyze trends, probing consumer opinion or direct advertising campaigns. Many studies have focused on the sentiment analysis for the English language. However, few studies have focused on the Arabic language which is a native language for Millions of people who use social network. This paper addresses some approaches in sentiment analysis for multilingual tweets. At first we have installed and configured a platform for real-time collecting and preprocessing multilingual tweets (Arabic, French and English). In the second time, we have applied factorial correspondence and multiple correspondence analysis for analyzing tweets. We have used this platform for sentiment analysis on the Mawazine festival, which took place in Rabat between May 20 and 28, 2016.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Nasukawa, T., Yi, J.: Sentiment analysis. In: Proceedings of the International Conference on Knowledge Capture - K-CAP 2003 (2003)

    Google Scholar 

  2. Dave, K., Lawrence, S., Pennock, D.: Mining the peanut gallery. In: Proceedings of the Twelfth International Conference on World Wide Web - WWW 2003 (2003)

    Google Scholar 

  3. Wang, X., Wei, F., Liu, X., Zhou, M., Zhang, M.: Topic sentiment analysis in twitter. In: Proceedings of the 20th ACM International Conference on Information and Knowledge Management - CIKM 2011 (2011)

    Google Scholar 

  4. Pang, B., Lee, L.: A sentimental education. In: Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics - ACL 2004 (2004)

    Google Scholar 

  5. Saif, H., He, Y., Alani, H.: Semantic sentiment analysis of twitter. In: The Semantic Web ISWC 2012, pp. 508–524 (2012)

    Google Scholar 

  6. Kumar, A., Sebastian, T.: Sentiment analysis on twitter. IJCSI Int. J. Comput. Sci. 9, 372–378 (2012)

    Google Scholar 

  7. Kiritchenko, S., Zhu, X., Saif, M.M.: Sentiment analysis of short informal texts. J. Artif. Intell. Res. 50, 723–762 (2014)

    Google Scholar 

  8. Tang, J., Nobata, C., Dong, A., Chang, Y., Liu, H.: Propagation-based sentiment analysis for microblogging data. In: Proceedings of the 2015 SIAM International Conference on Data Mining, pp. 577–585 (2015)

    Google Scholar 

  9. Zhou, S., Chen, Q., Wang, X.: Fuzzy deep belief networks for semi-supervised sentiment classification. Neurocomputing 131, 312–322 (2014)

    Article  Google Scholar 

  10. Farra, N.,Challita, E., Assi, R., et al.: Sentence-level and document-level sentiment mining for arabic texts. In: 2010 IEEE International Conference on Data Mining Workshops (ICDMW), pp. 1114–1119. IEEE (2010)

    Google Scholar 

  11. Abdul-Mageed, M., Diab, M., Korayem, M.: Subjectivity and sentiment analysis of modern standard arabic. In: Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, vol. 2, pp. 587–591 (2011)

    Google Scholar 

  12. Refaee, E., Rieser, V.: An arabic twitter corpus for subjectivity and sentiment analysis. In: LREC, pp. 2268–2273 (2014)

    Google Scholar 

  13. Ibrahim, S., Abdou, H.M., Gheith, M.: Sentiment analysis for modern standard arabic and colloquial. Int. J. Nat. Lang. Comput. 4, 95–109 (2015)

    Article  Google Scholar 

  14. Ingle, A., Kante, A., Samak, S., Kumari, A.: Sentiment analysis of twitter data using hadoop. Int. J. Eng. Res. Gener. Sci. 3, 144–147 (2015)

    Google Scholar 

  15. Han, J., Kamber, M., Pei, J.: Data Mining. Elsevier, New York (2011)

    MATH  Google Scholar 

  16. Morstatter, F., Pfeffer, J., Liu, H., et al.: Is the sample good enough? comparing data from twitter’s streaming api with twitter’s firehose. arXiv preprint arXiv:1306.5204 (2013)

  17. Penchalaiah, C., Murali, G.: Effective sentiment analysis on twitter data using: apache flume and hive. IJISET 1, 101–105 (2014)

    Google Scholar 

  18. Crockford, D.: The application/json media type for javascript object notation (json) (2006). https://tools.ietf.org/html/rfc4627

  19. Patel, A., Birla, M., Nair, U.: Addressing big data problem using Hadoop and Map Reduce. In: 2012 Nirma University International Conference on Engineering (NUiCONE) (2012)

    Google Scholar 

  20. Kotwal, A., Fulari, P., Jadhav, D., et al.: Improvement in sentiment analysis of twitter data using hadoop. Imperial J. Interdisc. Res. 2, 440 (2016)

    Google Scholar 

  21. Assiri, A., Emam, A., Aldossari, H.: Arabic sentiment analysis: a survey. Int. J. Adv. Comput. Sci. Appl. 6, 75–85 (2015)

    Google Scholar 

  22. Alhumoud, S., Altuwaijri, M., Albuhairi, T.: Survey on arabic sentiment analysis in twitter. Int. Sci. Index 9, 364–368 (2015)

    Google Scholar 

  23. Mawazine Festival website. http://www.festivalmawazine.ma/shows

  24. h24info website. www.h24info.ma/maroc/mawazine-cette-star-marocaine-battu-le-record-daffluence/43215

  25. Mellinger, M.: Correspondence analysis: the method and its application. Chemometr. Intell. Lab. Syst. 2, 61–77 (1987)

    Article  Google Scholar 

  26. Husson, F., Le, S., Pages, J.: Exploratory Multivariate Analysis by Example Using R. CRC Press, Boca Raton (2011)

    MATH  Google Scholar 

  27. Di Franco, G.: Multiple correspondence analysis: one only or several techniques? Qual. Quant. 50, 1299–1315 (2015)

    Article  Google Scholar 

  28. Greenacre, M.: Interpreting multiple correspondence analysis. Appl. Stoch. Models Data Anal. 7, 195–210 (1991)

    Article  Google Scholar 

  29. Bendixen, M.: A practical guide to the use of correspondence analysis in marketing research. Mark. Res. On-Line 1, 16–36 (1996)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Abdeljalil Elouardighi .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Elouardighi, A., Hammia, H., Maghfour, M. (2018). Collecting and Processing Multilingual Streaming Tweets for Sentiment Analysis. In: Noreddine, G., Kacprzyk, J. (eds) International Conference on Information Technology and Communication Systems. ITCS 2017. Advances in Intelligent Systems and Computing, vol 640. Springer, Cham. https://doi.org/10.1007/978-3-319-64719-7_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-64719-7_2

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-64718-0

  • Online ISBN: 978-3-319-64719-7

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics