The sheer volume of information available on Cloud, and the rate at which new data is being generated, is overwhelming the capacity of enterprises to manage it and people to use it in a meaningful manner. We examine a typical day in the life of Internet. Such a data deluge has surpassed the capacity of existing data centers to store and process it in a timely manner. This gave rise to a new class of algorithms, such as MapReduce, which we shall study in a later section. In this chapter, we will introduce MapReduce, Hadoop, and give examples of Amazon’s MapReduce (AMR). A class project of Twitter sentimental analysis using Cloud is presented, which was able to predict the outcome of 2016 US presidential elections a full year in advance.
- 4.Hadoop: https://hadoop.apache.org
- 5.Amazon Elastic Map-Reduce: https://aws.amazon.com/emr/
- 6.Kouloumpis E et al (2011) Twitter sentiment analysis: the good, the bad and the OMG!. In: Processings of the fifth international AAAI conference on weblogs and social media, pp 538–541Google Scholar
- 8.Domo, marketing data company’s 5th annual “data never sleeps” infographics (2017)Google Scholar