NOSQL Design for Analytical Workloads: Variability Matters

Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9974)

Abstract

Big Data has recently gained popularity and has strongly questioned relational databases as universal storage systems, especially in the presence of analytical workloads. As result, co-relational alternatives, commonly known as NOSQL (Not Only SQL) databases, are extensively used for Big Data. As the primary focus of NOSQL is on performance, NOSQL databases are directly designed at the physical level, and consequently the resulting schema is tailored to the dataset and access patterns of the problem in hand. However, we believe that NOSQL design can also benefit from traditional design approaches. In this paper we present a method to design databases for analytical workloads. Starting from the conceptual model and adopting the classical 3-phase design used for relational databases, we propose a novel design method considering the new features brought by NOSQL and encompassing relational and co-relational design altogether.

Keywords

NOSQL DW Big data Relational Co-relational Database design 

Notes

Acknowledgments

We would like to thank Antoni Olivé for revising the paper.

References

  1. 1.
    Alagiannis, I., et al.: H2O: a hands-free adaptive store. In: SIGMOD (2014)Google Scholar
  2. 2.
    Ambler, S.: Agile Database Techniques: Effective Strategies for the Agile Software Developer. Wiley, New York (2003)Google Scholar
  3. 3.
    Blaha, M.: On reverse engineering of vendor databases. In: WCRE (1998)Google Scholar
  4. 4.
    Blaha, M.: Patterns of Data Modeling. CRC Press, Inc., Boca Raton (2010)Google Scholar
  5. 5.
    Bondiombouy, C., Kolev, B., Levchenko, O., Valduriez, P.: Integrating big data and relational data with a functional SQL-like query language. In: Databaseand Expert Systems Applications - 26th International Conference, DEXA 2015, Valencia, Spain, 1–4 September 2015, Proceedings, Part I, pp. 170–185 (2015). http://dx.doi.org/10.1007/978-3-319-22849-5_13
  6. 6.
    Bugiotti, F., Cabibbo, L., Atzeni, P., Torlone, R.: Database design for NoSQL systems. In: Yu, E., Dobbie, G., Jarke, M., Purao, S. (eds.) ER 2014. LNCS, vol. 8824, pp. 223–231. Springer, Heidelberg (2014). doi:10.1007/978-3-319-12206-9_18 Google Scholar
  7. 7.
    Garcia, S., et al.: DSS from an RE perspective: a systematic mapping. J. Syst. Softw. 117, 488–507 (2016)CrossRefGoogle Scholar
  8. 8.
    Garcia-Molina, H., et al.: Database Systems - The Complete Book. Pearson Education, Harlow (2009)Google Scholar
  9. 9.
    Gartner: Focus on the ’Three Vs’ of Big Data Analytics: Variability, Veracity and Value. https://www.gartner.com/doc/2921417/focus-vs-big-data-analytics
  10. 10.
    Inmon, W.H., et al.: Corporate Information Factory. Wiley, New York (2001)Google Scholar
  11. 11.
    Jagadish, H.V., et al.: Big data and its technical challenges. Commun. ACM 57(7), 86–94 (2014)CrossRefGoogle Scholar
  12. 12.
    Kimball, R.: The Data Warehouse Toolkit: Practical Techniques for Building Dimensional Data Warehouses. Wiley, New York (1996)Google Scholar
  13. 13.
    Mazón, J.-N., Trujillo, J., Lechtenbörger, J.: A set of QVT relations to assure the correctness of data warehouses by using multidimensional normal forms. In: Embley, D.W., Olivé, A., Ram, S. (eds.) ER 2006. LNCS, vol. 4215, pp. 385–398. Springer, Heidelberg (2006). doi:10.1007/11901181_29 CrossRefGoogle Scholar
  14. 14.
    Meijer, E., Bierman, G.M.: A co-relational model of data for large shared data banks. Commun. ACM 54(4), 49–58 (2011)CrossRefGoogle Scholar
  15. 15.
    OCDE: Data-driven Innovation for Growth and Well-being. http://www.oecd.org/sti/inno/data-driven-innovation-interim-synthesis.pdf
  16. 16.
    Özsu, M.T., Valduriez, P.: Principles of Distributed DB Systems. Springer, New York (2011)Google Scholar
  17. 17.
    Romero, O., et al.: Tuning small analytics on big data: data partitioning and secondary indexes in the Hadoop ecosystem. Inf. Syst. 54, 336–356 (2015)CrossRefGoogle Scholar
  18. 18.
    Sadalage, P., Fowler, M.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence. Addison-Wesley Professional, Upper Saddle River (2012)Google Scholar
  19. 19.
  20. 20.
    Stonebraker, M., et al.: C-store: a column-oriented DBMS. In: VLDB (2005)Google Scholar
  21. 21.
    TDWI: TDWI Best Practices Report, Achieving Greater Agility with Business Intelligence. https://tdwi.org/research/2013/01/tdwi-best-practices-report-achieving-greater-agility-with-business-intelligence.aspx
  22. 22.
    Wiese, L.: Advanced Data Management for SQL, NoSQL, Cloud and Distributed Databases. DeGruyter, Boston (2015)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  1. 1.Universitat Politècnica de Catalunya - BarcelonaTechBarcelonaSpain

Personalised recommendations