Skip to main content

Streaming Analytics

Insight from Data in Motion

  • Chapter
  • First Online:
Disruptive Analytics

Abstract

Streaming analytics is the application of analytic operations to streaming data. We begin the chapter with a short history of streaming analytics, followed by a review of streaming fundamentals. In the third section of the chapter, we review popular streaming data sources, such as Apache Kafka and Amazon Kinesis, followed by a survey of the top open source streaming engines. We close the chapter with some examples of streaming analytics in action, and some observations about the economics of streaming.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 29.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 37.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.marketsandmarkets.com/PressReleases/streaming-analytics.asp

  2. 2.

    https://www.idc.com/getdoc.jsp?containerId=257402

  3. 3.

    http://www.gartner.com/it-glossary/real-time

  4. 4.

    http://home.business.utah.edu/finmh/moallemi.pdf

  5. 5.

    http://www.amazon.com/Two-Second-Advantage-Succeed-Anticipating-Future--Just/dp/0307887650/ref=sr_1_1?s=books&ie=UTF8&qid=1462718451&sr=1-1&keywords=the+two+second+advantage

  6. 6.

    https://spark-summit.org/east-2016/events/5-reasons-enterprise-adoption-of-spark-is-unstoppable/

  7. 7.

    http://www.slideshare.net/sbaltagi/flink-vs-spark

  8. 8.

    http://www.computerweekly.com/opinion/Why-real-time-CRM-analytics-is-hot

  9. 9.

    http://www.trademarkia.com/information-bus-74089524.html

  10. 10.

    http://www.risk.net/operational-risk-and-regulation/feature/1507883/goldman-to-roll-out-teknekron-middleware-transaction-platform

  11. 11.

    http://www.amazon.com/Business-Applications-Neural-Networks-State/dp/9810240899

  12. 12.

    http://www.cbronline.com/news/datamind_boosts_business_intelligence

  13. 13.

    http://www.nytimes.com/1999/11/17/business/company-news-epiphany-agrees-to-acquirerightpoint.html.

  14. 14.

    Renamed the Defense Advanced Research Projects Agency in March 1996.

  15. 15.

    http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.56.876&rep=rep1&type=pdf

  16. 16.

    http://www.softwareag.com/special/thingalytics/john-bates.html

  17. 17.

    https://www.finextra.com/news/fullstory.aspx?newsitemid=13477

  18. 18.

    http://www.computerweekly.com/feature/What-can-science-do-for-IT

  19. 19.

    http://www.nytimes.com/1993/12/18/business/company-news-reuters-is-buying-teknekron.html

  20. 20.

    http://cs.brown.edu/research/aurora/sigmoddemo.pdf

  21. 21.

    http://www.siliconinvestor.com/readmsgs.aspx?subjectid=34520&msgnum=23107&batchsize=10&batchtype=Next

  22. 22.

    http://www.prnewswire.com/news-releases/streambase-systems-secures-11-million-to-expand-sales-and-marketing-activities-66325312.html

  23. 23.

    http://www.bis.org/review/r100909e.pdf

  24. 24.

    http://blogs.forrester.com/holger_kisker/10-05-13-sap_acquires_sybase_%E2%80%93_what%E2%80%99s_strategic_intent_behind_deal

  25. 25.

    http://www.geek.com/chips/ibm-releases-system-s-real-time-stream-computing-analysis-and-reporting-773531/

  26. 26.

    http://www-03.ibm.com/press/us/en/pressrelease/27508.wss

  27. 27.

    http://www.enterrasolutions.com/media/docs/2012/01/SystemS_2008-1001.pdf

  28. 28.

    http://cs.ucsb.edu/~ckrintz/papers/gedik_et_al_2008.pdf

  29. 29.

    http://www.sec.gov/Archives/edgar/data/1085280/000108528014000020/tibx1130201310k.htm

  30. 30.

    https://www.sec.gov/Archives/edgar/data/876167/000087616714000013/a201310-kmaster.htm

  31. 31.

    https://www.forrester.com/report/The+Forrester+Wave+Big+Data+Streaming+Analytics+Q1+2016/-/E-RES129023

  32. 32.

    https://cs.brown.edu/~ugur/fits_all.pdf

  33. 33.

    http://www.thenational.ae/business/the-curious-case-of-the-dog-that-did-not-bark

  34. 34.

    http://www.gartner.com/it-glossary/complex-event-processing

  35. 35.

    http://www.isn.ucsd.edu/pubs/nips00_inc.pdf

  36. 36.

    ftp://ftp.sas.com/pub/neural/FAQ2.html#A_styles_batch_vs_inc

  37. 37.

    http://research.microsoft.com/apps/pubs/default.aspx?id=69588

  38. 38.

    http://spark.apache.org/docs/latest/mllib-clustering.html#streaming-k-means

  39. 39.

    http://spark.apache.org/docs/latest/mllib-linear-methods.html#streaming-linear-regression

  40. 40.

    https://www.openhub.net/p/activemq

  41. 41.

    https://cwiki.apache.org/confluence/display/KAFKA/Powered+By

  42. 42.

    http://cdn.oreillystatic.com/en/assets/1/event/118/The%20Evolution%20of%20Hadoop%20at%20Spotify-%20Through%20Failures%20and%20Pain%20Presentation.pdf

  43. 43.

    http://opensoc.github.io/

  44. 44.

    https://www.openhub.net/p/apache-kafka

  45. 45.

    https://www.openhub.net/p/rabbitmq

  46. 46.

    http://apex.apache.org/announcements.html

  47. 47.

    https://wiki.apache.org/incubator/ApexProposal

  48. 48.

    https://www.openhub.net/p/apache_apex

  49. 49.

    http://stratosphere.eu/

  50. 50.

    https://www.openhub.net/p/flink

  51. 51.

    https://engineering.linkedin.com/data-streams/apache-samza-linkedins-low latency-stream-processing-framework

  52. 52.

    https://www.openhub.net/p/samza

  53. 53.

    https://storm.apache.org/about/scalable.html

  54. 54.

    http://storm.apache.org/documentation/Powered-By.html

  55. 55.

    https://www.openhub.net/p/apache-storm

  56. 56.

    http://www.slideshare.net/SparkSummit/realtime-risk-management-using-kafka-python-and-spark-streaming-by-nick-evans

  57. 57.

    http://www.slideshare.net/SparkSummit/realtime-anomoly-detection-with-spark-mlib-akka-and-cassandra-by-natalino-busa

  58. 58.

    http://www.slideshare.net/FlinkForward/flink-case-study-capital-one

  59. 59.

    http://data-artisans.com/flink-at-bouygues-html/

  60. 60.

    http://www.slideshare.net/SparkSummit/big-telco-yousun-jeong

  61. 61.

    http://www.jeremyfreeman.net/share/talks/spark5/ #/

  62. 62.

    http://thunder-project.org/

  63. 63.

    http://lightning-viz.org/

  64. 64.

    http://mybinder.org/

  65. 65.

    http://www.slideshare.net/SparkSummit/enable-breakthrough-in-parkinson-disease-research-ido-karavany

  66. 66.

    http://www.tibco.com/company/news/releases/2014/tibco-to-be-acquired-by-vista-equity-partners-for-24-00-per-share-in-cash

  67. 67.

    http://www.it-director.com/blogs/banks-statement/2014/9/is-tibco-a-worrying-sign-of-a-different-malaise/

  68. 68.

    http://www.internetnews.com/bus-news/article.php/321151/Epiphany+Buys+Octane+Software+for+32+Billion.htm

  69. 69.

    http://searchcrm.techtarget.com/news/1112932/SSA-Global-buys-into-CRM-with-Epiphany-acquisition

  70. 70.

    http://www.gartner.com/newsroom/id/3114217

  71. 71.

    Usability researchers report varying maximum acceptable response times; context matters. For example, see https://www.nngroup.com/articles/powers-of-10-time-scales-in-ux/ for a discussion of standards in customer-facing applications. For internal applications, standards are lower. See https://www.microstrategy.com/it/press-releases/microstrategy-introduces-new-high-performance-standards-for-business-intelligence .

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Thomas W. Dinsmore

About this chapter

Cite this chapter

Dinsmore, T.W. (2016). Streaming Analytics. In: Disruptive Analytics. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-1311-7_6

Download citation

Publish with us

Policies and ethics