Distributed Query Processing

  • M. Tamer Özsu
  • Patrick Valduriez
Chapter

Abstract

By hiding the low-level details of the physical organization of the data, relational database languages allow complex queries to be expressed concisely and simply. In particular, the user does not specify the exact procedure for constructing the answer to a query; that procedure is devised by a module called the query processor. This relieves the user of query optimization, a time-consuming task that is best handled by the query processor, since it can exploit a large amount of useful information about the data.
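
As a small illustration of this division of labor, the sketch below states a query declaratively and then asks the engine which procedure it chose. It uses Python's built-in `sqlite3` module, which is a centralized engine and not a distributed one, so it only shows the declarative-versus-procedural split. The `emp` and `dept` tables and their contents are hypothetical and do not come from the chapter.

```python
import sqlite3

# Hypothetical two-table schema, used only for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE emp (eno INTEGER PRIMARY KEY, ename TEXT, dno INTEGER);
    CREATE TABLE dept (dno INTEGER PRIMARY KEY, dname TEXT);
    INSERT INTO dept VALUES (1, 'Research'), (2, 'Sales');
    INSERT INTO emp VALUES (10, 'Ana', 1), (11, 'Bo', 2), (12, 'Cy', 1);
""")

# The query says WHAT is wanted: no access path, no join order.
query = """
    SELECT e.ename
    FROM emp AS e JOIN dept AS d ON e.dno = d.dno
    WHERE d.dname = 'Research'
    ORDER BY e.ename
"""
rows = [name for (name,) in conn.execute(query)]

# The procedure (HOW) is devised by the engine's query processor.
# The last column of each EXPLAIN QUERY PLAN row is a textual plan step.
plan = [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + query)]

print(rows)
for step in plan:
    print(step)
```

The exact plan text depends on the SQLite version, so it is printed and not assumed here; the point is only that the query text itself never mentions scans, indexes, or join order.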

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • M. Tamer Özsu, Cheriton School of Computer Science, University of Waterloo, Waterloo, Canada
  • Patrick Valduriez, Inria and LIRMM, University of Montpellier, Montpellier, France