Abstract
This article analyzes data from social networks. The social microblogging system called Twitter is taken as a data source. In the model of distributed computing MapReduce has been used for the implementation of the algorithm for searching the user communities. Apache Hadoop has been chosen as a platform for distributed computing. The program code was developed for retrieving tweets and distributed processing. The analysis of the interests of users of Twitter was conducted.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
http://www.vcloudnews.com/every-day-big-data-statistics-2-5-quintillion-bytes-of-data-created-daily/
Ryabov, S., & Korshunov, A. (2011). The distributed algorithm for finding communities of users in social networks. – 2011. p. 215.
Twitter4J: http://twitter4j.org/en/index.html
Boranbayev, S., Altayev, S., & Boranbayev, A. (2015). Applying the method of diverse redundancy in cloud based systems for increasing reliability. In Proceedings of the 12th International Conference on Information Technology: New Generations, ITNG 2015 (pp.796–799) Las Vegas, April 13–15.
Boranbayev, S., Boranbayev, A., Altayev, S., & Nurbekov A. (2014). Mathematical model for optimal designing of reliable information systems. In Proceedings of the 8th IEEE International Conference on Application of Information and Communication Technologies, AICT 2014 (pp.123–127), Astana, October 15–17, 2014.
Boranbayev, A.S., & Boranbayev, S.N. (2010). Development and optimization of information systems for health insurance billing. In Proceedings of the 7th International Conference on Information Technology: New Generations, ITNG 2010 (pp.1282–1284), Las Vegas, April 12–14.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Boranbayev, A., Shuitenov, G., Boranbayev, S. (2018). The Method of Data Analysis from Social Networks using Apache Hadoop. In: Latifi, S. (eds) Information Technology - New Generations. Advances in Intelligent Systems and Computing, vol 558. Springer, Cham. https://doi.org/10.1007/978-3-319-54978-1_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-54978-1_39
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-54977-4
Online ISBN: 978-3-319-54978-1
eBook Packages: EngineeringEngineering (R0)