Skip to main content

The Method of Data Analysis from Social Networks using Apache Hadoop

  • Conference paper
  • First Online:
Information Technology - New Generations

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 558))

Abstract

This article analyzes data from social networks. The social microblogging system called Twitter is taken as a data source. In the model of distributed computing MapReduce has been used for the implementation of the algorithm for searching the user communities. Apache Hadoop has been chosen as a platform for distributed computing. The program code was developed for retrieving tweets and distributed processing. The analysis of the interests of users of Twitter was conducted.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. http://www.vcloudnews.com/every-day-big-data-statistics-2-5-quintillion-bytes-of-data-created-daily/

  2. http://www.osp.ru/os/2013/08/13037856

  3. https://ru.wikipedia.org/wiki/Hadoop

  4. Ryabov, S., & Korshunov, A. (2011). The distributed algorithm for finding communities of users in social networks. – 2011. p. 215.

    Google Scholar 

  5. https://dev.twitter.com/overview/documentation

  6. Twitter4J: http://twitter4j.org/en/index.html

  7. http://texterra.ru/blog/kakoy-tip-kontenta-imeet-samyy-vysokiy-potentsial-rasprostraneniya-v-twitter.html

  8. Boranbayev, S., Altayev, S., & Boranbayev, A. (2015). Applying the method of diverse redundancy in cloud based systems for increasing reliability. In Proceedings of the 12th International Conference on Information Technology: New Generations, ITNG 2015 (pp.796–799) Las Vegas, April 13–15.

    Google Scholar 

  9. Boranbayev, S., Boranbayev, A., Altayev, S., & Nurbekov A. (2014). Mathematical model for optimal designing of reliable information systems. In Proceedings of the 8th IEEE International Conference on Application of Information and Communication Technologies, AICT 2014 (pp.123–127), Astana, October 15–17, 2014.

    Google Scholar 

  10. Boranbayev, A.S., & Boranbayev, S.N. (2010). Development and optimization of information systems for health insurance billing. In Proceedings of the 7th International Conference on Information Technology: New Generations, ITNG 2010 (pp.1282–1284), Las Vegas, April 12–14.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Askar Boranbayev .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG

About this paper

Cite this paper

Boranbayev, A., Shuitenov, G., Boranbayev, S. (2018). The Method of Data Analysis from Social Networks using Apache Hadoop. In: Latifi, S. (eds) Information Technology - New Generations. Advances in Intelligent Systems and Computing, vol 558. Springer, Cham. https://doi.org/10.1007/978-3-319-54978-1_39

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-54978-1_39

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-54977-4

  • Online ISBN: 978-3-319-54978-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics