Abstract
The efficient execution of data-intensive computations over services is a challenging task: data are retrieved from remote sources and therefore are not available in the query engine until after the execution of these calls, but the system must be inherently efficient thereafter, by guaranteeing that data is immediately cached and processed efficiently, according to the best query plan. In this chapter, we present a flexible execution model for search computing queries, named Panta Rhei. The proposed execution engine paradigm adopts the producer/consumer model and supports both data-driven and event-driven synchronization, and their interplay. Query plans are modeled as directed graphs, whose nodes are processing units and whose edges are either control or data flows. While control flows synchronize service calls and unit execution, data flows transfer data between units that process data flows to produce query results. We present the specification of Panta Rhei by formally defining the units for data production, consumption, manipulation, and caching, as well as the control and data flows. Finally, we discuss how a query plan is expressed in terms of a query execution plan.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Abadi, D.J., Ahmad, Y., Balazinska, M., Cetintemel, U., Cherniack, M., Hwang, J.H., Lindner, W., Maskey, A.S., Rasin, A., Ryvkina, E., Tatbul, N., Xing, Y., Zdonik, S.: The Design of the Borealis Stream Processing Engine. In: Proceedings of Second Biennial Conference on Innovative Data Systems Research (CIDR 2005), Asilomar, CA, USA (January 2005)
Buyya, R., Abramson, D., Giddy, J., Stockinger, H.: Economic Models for Resource Management and Scheduling in Grid Computing. Concurrency and Computation: Practice and Experience 14(13-15), 1507–1542 (2002)
Catarci, T., Costabile, M.F., Levialdi, S., Batini, C.: Visual Query Systems for Databases: A survey. Journal of Visual Languages and Computing 8(2), 215–260 (1997)
Chaudhuri, S.: Query Optimizers: Time to Rethink the Contract? In: SIGMOD 2009: Proceedings of the 35th SIGMOD international conference on Management of Data, pp. 961–968. ACM, New York (2009)
Chaudhuri, S., Narasayya, V., Ramamurthy, R.: A Pay-As-You-Go Framework for Query Execution Feedback. Proc. VLDB Endow. 1(1), 1141–1152 (2008)
Cole, R.L., Graefe, G.: Optimization of Dynamic Query Evaluation Plans. In: Snodgrass, R.T., Winslett, M. (eds.) Proceedings of the 1994 ACM SIGMOD International Conference on Management of Data, Minneapolis, Minnesota, May 24-27, pp. 150–160. ACM Press, New York (1994)
Eckerson, W.W.: Performance Dashboards: Measuring, Monitoring, and Managing Your Business. John Wiley & Sons, Chichester (2006)
Evrendilek, C., Dogac, A., Nural, S., Ozcan, F.: Multidatabase Query Optimization. Distrib. Parallel Databases 5(1), 77–114 (1997)
Fagin, R.: Combining Fuzzy Information from Multiple Systems. J. Comput. Syst. Sci. 58(1), 83–99 (1999)
Goodenough, J.B.: Exception Handling: Issues and a Proposed Notation. Commun. ACM 18(12), 683–696 (1975)
Gounaris, A., Paton, N.W., Fernandes, A.A.A., Sakellariou, R.: Self-Monitoring Query Execution for Adaptive Query Processing. Data Knowl. Eng. 51(3), 325–348 (2004)
Graefe, G.: Query Evaluation Techniques for Large Databases. ACM Comput. Surv. 25(2), 73–169 (1993)
Graefe, G.: Iterators, Schedulers, and Distributed-Memory Parallelism. Softw. Pract. Exper. 26(4), 427–452 (1996)
Grossniklaus, M., Norrie, M.C.: An Object-Oriented Version Model for Context-Aware Data Management. In: Benatallah, B., Casati, F., Georgakopoulos, D., Bartolini, C., Sadiq, W., Godart, C. (eds.) WISE 2007. LNCS, vol. 4831, pp. 398–409. Springer, Heidelberg (2007)
Ives, Z.G., Florescu, D., Friedman, M., Levy, A., Weld, D.S.: An Adaptive Query Execution System for Data Integration. SIGMOD Rec. 28(2), 299–310 (1999)
Kabra, N., DeWitt, D.J.: Efficient Mid-Query Re-Optimization of Sub-Optimal Query Execution Plans. SIGMOD Rec. 27(2), 106–117 (1998)
Kossmann, D.: The State of the Art in Distributed Query Processing. ACM Comput. Surv. 32(4), 422–469 (2000)
Manolescu, I., Bouganim, L., Fabret, F., Simon, E.: Efficient Querying of Distributed Resources in Mediator Systems. In: Meersman, R., Tari, Z., et al. (eds.) CoopIS 2002, DOA 2002, and ODBASE 2002. LNCS, vol. 2519, pp. 468–485. Springer, Heidelberg (2002)
Rao, J., Pirahesh, H., Mohan, C., Lohman, G.: Compiled Query Execution Engine Using JVM. In: ICDE 2006: Proceedings of the 22nd International Conference on Data Engineering, p. 23. IEEE Computer Society, Washington (2006)
van Reeuwijk, C.: Maestro: A Self-Organizing Peer-to-Peer Dataflow Framework Using Reinforcement Learning. In: HPDC 2009: Proceedings of the 18th ACM International Symposium on High Performance Distributed Computing, pp. 187–196. ACM, New York (2009)
Srivastava, U., Munagala, K., Widom, J., Motwani, R.: Query Optimization Over Web Services. In: Dayal, U., Whang, K.Y., Lomet, D.B., Alonso, G., Lohman, G.M., Kersten, M.L., Cha, S.K., Kim, Y.K. (eds.) VLDB, pp. 355–366. ACM, New York (2006)
Tanin, E., Beigel, R., Shneiderman, B.: Incremental Data Structures and Algorithms for Dynamic Query Interfaces. SIGMOD Rec. 25(4), 21–24 (1996)
Urhan, T., Franklin, M.J.: XJoin: A Reactively-Scheduled Pipelined Join Operator. IEEE Data Eng. Bull. 23(2), 27–33 (2000)
Urhan, T., Franklin, M.J., Amsaleg, L.: Cost-Based Query Scrambling for Initial Delays. SIGMOD Rec. 27(2), 130–141 (1998)
Whiting, P.G., Pascoe, R.S.V.: A History of Data-Flow Languages. IEEE Ann. Hist. Comput. 16(4), 38–59 (1994)
Wong, E., Youssefi, K.: Decomposition—A Strategy for Query Processing. ACM Trans. Database Syst. 1(3), 223–241 (1976)
Yu, J., Buyya, R.: A Taxonomy of Scientific Workflow Systems for Grid Computing. SIGMOD Rec. 34(3), 44–49 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Braga, D., Ceri, S., Corcoglioniti, F., Grossniklaus, M. (2010). Chapter 12: Panta Rhei: Flexible Execution Engine for Search Computing Queries. In: Ceri, S., Brambilla, M. (eds) Search Computing. Lecture Notes in Computer Science, vol 5950. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12310-8_12
Download citation
DOI: https://doi.org/10.1007/978-3-642-12310-8_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12309-2
Online ISBN: 978-3-642-12310-8
eBook Packages: Computer ScienceComputer Science (R0)