Abstract
Probabilistic data management has recently drawn much attention of the database research community. This paper investigates safe plans of queries on block independent disjoint (BID) probabilistic databases. This problem is fundamental to evaluate queries whose time complexity is PTIME. We first introduce two new probabilistic table models which are the correlated table and the correlated block table, and a hybrid project which executes a disjoint project and then performs an independent project in an atomic operation on BID tables. After that, we propose an algorithm to find safe plans for queries on BID probabilistic databases. Finally, we present the experimental results to show that the proposed algorithm can find safe plans for more queries than the state-of-the-art and the safe plans generated by the proposed algorithm are efficient and scale well.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Transaction Processing Performance Council, TPC \(BENCHMARK^{TM}\) H Standard Specification (revision 2.9.0)
Boulos, J., Dalvi, N., Mandhani, B., Mathur, S., Re, C., Suciu, D.: Mystiq: a system for finding more answers by using probabilities. In: SIGMOD, pp. 891–893 (2005)
Cavallo, R., Pittarelli, M.: The theory of probabilistic databases. In: VLDB, pp. 71–81 (1987)
Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic database. The VLDB Journal 16(4), 523–544 (2007)
Dalvi, N., Suciu, D.: Management of probabilistic data: foundations and challenges. In: PODS, pp. 1–12 (2007)
Dey, D., Sarkar, S.: A probabilistic relational model and algebra. ACM Transactions on Databases Systems 21(3), 339–369 (1996)
Green, T., Tannen, V.: Models for incomplete and probabilistic information. IEEE Data Engineering Bulletin 29(1), 17–24 (2006)
Ives, Z. G., Khandelwal, N., Kapur, A., Cakir, M.: Orchestra: rapid, collaborative sharing of dynamic data. In: CIDR, pp. 41–46 (2005)
Jha, A., Suciu, D.: Knowledge compilation meets database theory: Compiling queries to decision diagrams. Theory of Computing Systems 52(3), 403–440 (2013)
Lakshmanan, L., Leone, N., Ross, R., Subrahmanina, V.: Probview: A flexible probabilistic database system. ACM Transactions on Database Systems 22(3), 419–469 (1997)
Olteanu, D., Huang, J.: Secondary-storage confidence computation for conjunctive queries with inequalities. In: SIGMOD, pp. 389–402 (2009)
Olteanu, D., Huang, J., Koch, C.: Sprout: lazy vs. eager query plans for tuple-independent probabilistic databases. In: ICDE, pp. 640–651 (2009)
Qin, B., Xia, Y.: Generating efficient safe query plans for probabilistic databases. Data & Knowledge Engineering 67(3), 485–503 (2008)
Ré, C., Dalvi, N., Suciu, D.: Query evaluation on probabilistic databases. IEEE Data Engineering Bulletin 29(1), 25–31 (2006)
Ré, C., Suciu, D.: Materialized views in probabilistic databases for information exchange and query optimization. In: VLDB, pp. 51–62 (2007)
Sarma, A., Theobald, M., Widom, J.: Exploiting lineage for confidence computation in uncertain and probabilistic databases. In: ICDE, pp. 1023–1032 (2008)
Sen, P., Deshpande, A., Getoor, L.: Prdb: managing and exploiting rich correlation in probabilistic databases. The VLDB Journal 18(5), 1065–1090 (2009)
Sen, P., Deshpande, A., Getoor, L.: Read-once functions and query evaluation in probabilistic databases. In: VLDB, pp. 1068–1079 (2010)
Singh, S., Mayfield, C., Shah, R., Prabhakar, S., Hambrusch, S.: Database support for probabilistic attributes and tuples. In: ICDE, pp. 1053–1061 (2008)
Widom, J.: Trio: a system for integrated management of data, accuracy, and lineage. In: ICDR, pp. 262–276 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Qin, B. (2015). Efficient Queries Evaluation on Block Independent Disjoint Probabilistic Databases. In: Renz, M., Shahabi, C., Zhou, X., Cheema, M. (eds) Database Systems for Advanced Applications. DASFAA 2015. Lecture Notes in Computer Science(), vol 9050. Springer, Cham. https://doi.org/10.1007/978-3-319-18123-3_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-18123-3_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-18122-6
Online ISBN: 978-3-319-18123-3
eBook Packages: Computer ScienceComputer Science (R0)