Abstract
MystiQ is a system that allows users to define a probabilistic database, then to evaluate SQL queries over this database. MystiQ is a middleware: the data itself is stored in a standard relational database system, and MystiQ is providing the probabilistic semantics. The advantage of a middleware over a re-implementation from scratch is that it can leverage the infrastructure of an existing database engine, e.g. indexes, query evaluation, query optimization, etc. Furthermore, MystiQ attempts to perform most of the probabilistic inference inside the relational database engine. MystiQ is currently available from mystiq.cs.washington.edu.
Supported in part by NSF grants IIS-0415193, IIS-0627585, IIS-0513877, IIS-0428168, and a gift from Microsoft.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ambite, J.L., Chaudhri, V.K., Fikes, R., Jenkins, J., Mishra, S., Muslea, M., Uribe, T.E., Yang, G.: Design and implementation of the CALO query manager. In: AAAI (2006)
Antova, L., Koch, C., Olteanu, D.: 10^(10^6) worlds and beyond: Efficient representation and processing of incomplete information. In: ICDE (2007)
Antova, L., Koch, C., Olteanu, D.: MayBMS: Managing incomplete information with probabilistic world-set decompositions (demonstration). In: ICDE (2007)
Antova, L., Koch, C., Olteanu, D.: World-set decompositions: Expressiveness and efficient algorithms. In: Schwentick, T., Suciu, D. (eds.) ICDT 2007. LNCS, vol. 4353, pp. 194–208. Springer, Heidelberg (2006)
Barbara, D., Garcia-Molina, H., Porter, D.: The management of probabilistic data. IEEE Trans. Knowl. Data Eng. 4(5), 487–502 (1992)
Benjelloun, O., Das Sarma, A., Halevy, A., Widom, J.: ULDBs: Databases with uncertainty and lineage. In: VLDB, pp. 953–964 (2006)
Benjelloun, O., Das Sarma, A., Hayworth, C., Widom, J.: An introduction to ULDBs and the Trio system. IEEE Data Eng. Bull. 29(1), 5–16 (2006)
Boulos, J., Dalvi, N., Mandhani, B., Mathur, S., Re, C., Suciu, D.: Mystiq: A system for finding more answers by using probabilities. In: SIGMOD, system demo (2005)
Cheng, R., Prabhakar, S.: Managing uncertainty in sensor databases. SIGMOD Record 32(4), 41–46 (2003)
Dalvi, N., Re, C., Suciu, D.: Query evaluation on probabilistic databases. IEEE Data Engineering Bulletin 29(1), 25–31 (2006)
Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic databases. In: VLDB, Toronto, Canada (2004)
Dalvi, N., Suciu, D.: The dichotomy of conjunctive queries on probabilistic structures. In: PODS, pp. 293–302 (2007)
Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic databases. VLDBJ 16(4), 523–544 (2007)
Dalvi, N., Suciu, D.: Management of probabilistic data: Foundations and challenges. In: PODS, pp. 1–12, Beijing, China (invited talk) (2007)
Das Sarma, A., Benjelloun, O., Halevy, A., Widom, J.: Working models for uncertain data. In: ICDE (2006)
Jampani, R., Xu, F., Wu, M., Perez, L.L., Jermaine, C.M., Haas, P.J.: MCDB: a Monte Carlo approach to managing uncertain data. In: SIGMOD, pp. 687–700 (2008)
Jayram, T.S., Kale, S., Vee, E.: Efficient aggregation algorithms for probabilistic data. In: SODA (2007)
Jha, A., Rastogi, V., Suciu, D.: Evaluating queries in the presence of soft key constraints. In: PODS (2008)
Karp, R., Luby, M.: Monte-Carlo algorithms for enumeration and reliability problems. In: Proceedings of the annual ACM symposium on Theory of computing (1983)
Koch, C., Olteanu, D.: Conditioning probabilistic databases. In: VLDB (2008)
Lakshmanan, L., Leone, N., Ross, R., Subrahmanian, V.S.: Probview: A flexible probabilistic database system. ACM Trans. Database Syst. 22(3) (1997)
Re, C., Dalvi, N., Suciu, D.: Efficient Top-k query evaluation on probabilistic data (extended version). Technical Report 2006-06-05, University of Washington (2006)
Re, C., Dalvi, N., Suciu, D.: Efficient Top-k query evaluation on probabilistic data. In: ICDE (2007)
Re, C., Suciu, D.: Efficient evaluation of having queries on a probabilistic database. In: Arenas, M., Schwartzbach, M.I. (eds.) DBPL 2007. LNCS, vol. 4797, Springer, Heidelberg (2007)
Re, C., Suciu, D.: Materialized views in probabilistic databases for information exchange and query optimization. In: Proceedings of VLDB (2007)
Re, C., Suciu, D.: Approximate lineage for probabilistic databases. In: VLDB (2008)
Sen, P., Deshpande, A.: Representing and querying correlated tuples in probabilistic databases. In: ICDE (2007)
Wang, T.Y., Re, C., Suciu, D.: Implementing not exists predicates over a probabilistic database. In: Proceedings of MUD (2008)
Widom, J.: Trio: A system for integrated management of data, accuracy, and lineage. In: CIDR, pp. 262–276 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Re, C., Suciu, D. (2008). Managing Probabilistic Data with MystiQ: The Can-Do, the Could-Do, and the Can’t-Do. In: Greco, S., Lukasiewicz, T. (eds) Scalable Uncertainty Management. SUM 2008. Lecture Notes in Computer Science(), vol 5291. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87993-0_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-87993-0_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87992-3
Online ISBN: 978-3-540-87993-0
eBook Packages: Computer ScienceComputer Science (R0)