Skip to main content

Managing Probabilistic Data with MystiQ: The Can-Do, the Could-Do, and the Can’t-Do

  • Conference paper
Scalable Uncertainty Management (SUM 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5291))

Included in the following conference series:

Abstract

MystiQ is a system that allows users to define a probabilistic database, then to evaluate SQL queries over this database. MystiQ is a middleware: the data itself is stored in a standard relational database system, and MystiQ is providing the probabilistic semantics. The advantage of a middleware over a re-implementation from scratch is that it can leverage the infrastructure of an existing database engine, e.g. indexes, query evaluation, query optimization, etc. Furthermore, MystiQ attempts to perform most of the probabilistic inference inside the relational database engine. MystiQ is currently available from mystiq.cs.washington.edu.

Supported in part by NSF grants IIS-0415193, IIS-0627585, IIS-0513877, IIS-0428168, and a gift from Microsoft.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Ambite, J.L., Chaudhri, V.K., Fikes, R., Jenkins, J., Mishra, S., Muslea, M., Uribe, T.E., Yang, G.: Design and implementation of the CALO query manager. In: AAAI (2006)

    Google Scholar 

  2. Antova, L., Koch, C., Olteanu, D.: 10^(10^6) worlds and beyond: Efficient representation and processing of incomplete information. In: ICDE (2007)

    Google Scholar 

  3. Antova, L., Koch, C., Olteanu, D.: MayBMS: Managing incomplete information with probabilistic world-set decompositions (demonstration). In: ICDE (2007)

    Google Scholar 

  4. Antova, L., Koch, C., Olteanu, D.: World-set decompositions: Expressiveness and efficient algorithms. In: Schwentick, T., Suciu, D. (eds.) ICDT 2007. LNCS, vol. 4353, pp. 194–208. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  5. Barbara, D., Garcia-Molina, H., Porter, D.: The management of probabilistic data. IEEE Trans. Knowl. Data Eng. 4(5), 487–502 (1992)

    Article  Google Scholar 

  6. Benjelloun, O., Das Sarma, A., Halevy, A., Widom, J.: ULDBs: Databases with uncertainty and lineage. In: VLDB, pp. 953–964 (2006)

    Google Scholar 

  7. Benjelloun, O., Das Sarma, A., Hayworth, C., Widom, J.: An introduction to ULDBs and the Trio system. IEEE Data Eng. Bull. 29(1), 5–16 (2006)

    Google Scholar 

  8. Boulos, J., Dalvi, N., Mandhani, B., Mathur, S., Re, C., Suciu, D.: Mystiq: A system for finding more answers by using probabilities. In: SIGMOD, system demo (2005)

    Google Scholar 

  9. Cheng, R., Prabhakar, S.: Managing uncertainty in sensor databases. SIGMOD Record 32(4), 41–46 (2003)

    Article  Google Scholar 

  10. Dalvi, N., Re, C., Suciu, D.: Query evaluation on probabilistic databases. IEEE Data Engineering Bulletin 29(1), 25–31 (2006)

    Google Scholar 

  11. Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic databases. In: VLDB, Toronto, Canada (2004)

    Google Scholar 

  12. Dalvi, N., Suciu, D.: The dichotomy of conjunctive queries on probabilistic structures. In: PODS, pp. 293–302 (2007)

    Google Scholar 

  13. Dalvi, N., Suciu, D.: Efficient query evaluation on probabilistic databases. VLDBJ 16(4), 523–544 (2007)

    Article  Google Scholar 

  14. Dalvi, N., Suciu, D.: Management of probabilistic data: Foundations and challenges. In: PODS, pp. 1–12, Beijing, China (invited talk) (2007)

    Google Scholar 

  15. Das Sarma, A., Benjelloun, O., Halevy, A., Widom, J.: Working models for uncertain data. In: ICDE (2006)

    Google Scholar 

  16. Jampani, R., Xu, F., Wu, M., Perez, L.L., Jermaine, C.M., Haas, P.J.: MCDB: a Monte Carlo approach to managing uncertain data. In: SIGMOD, pp. 687–700 (2008)

    Google Scholar 

  17. Jayram, T.S., Kale, S., Vee, E.: Efficient aggregation algorithms for probabilistic data. In: SODA (2007)

    Google Scholar 

  18. Jha, A., Rastogi, V., Suciu, D.: Evaluating queries in the presence of soft key constraints. In: PODS (2008)

    Google Scholar 

  19. Karp, R., Luby, M.: Monte-Carlo algorithms for enumeration and reliability problems. In: Proceedings of the annual ACM symposium on Theory of computing (1983)

    Google Scholar 

  20. Koch, C., Olteanu, D.: Conditioning probabilistic databases. In: VLDB (2008)

    Google Scholar 

  21. Lakshmanan, L., Leone, N., Ross, R., Subrahmanian, V.S.: Probview: A flexible probabilistic database system. ACM Trans. Database Syst. 22(3) (1997)

    Google Scholar 

  22. Re, C., Dalvi, N., Suciu, D.: Efficient Top-k query evaluation on probabilistic data (extended version). Technical Report 2006-06-05, University of Washington (2006)

    Google Scholar 

  23. Re, C., Dalvi, N., Suciu, D.: Efficient Top-k query evaluation on probabilistic data. In: ICDE (2007)

    Google Scholar 

  24. Re, C., Suciu, D.: Efficient evaluation of having queries on a probabilistic database. In: Arenas, M., Schwartzbach, M.I. (eds.) DBPL 2007. LNCS, vol. 4797, Springer, Heidelberg (2007)

    Google Scholar 

  25. Re, C., Suciu, D.: Materialized views in probabilistic databases for information exchange and query optimization. In: Proceedings of VLDB (2007)

    Google Scholar 

  26. Re, C., Suciu, D.: Approximate lineage for probabilistic databases. In: VLDB (2008)

    Google Scholar 

  27. Sen, P., Deshpande, A.: Representing and querying correlated tuples in probabilistic databases. In: ICDE (2007)

    Google Scholar 

  28. Wang, T.Y., Re, C., Suciu, D.: Implementing not exists predicates over a probabilistic database. In: Proceedings of MUD (2008)

    Google Scholar 

  29. Widom, J.: Trio: A system for integrated management of data, accuracy, and lineage. In: CIDR, pp. 262–276 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Re, C., Suciu, D. (2008). Managing Probabilistic Data with MystiQ: The Can-Do, the Could-Do, and the Can’t-Do. In: Greco, S., Lukasiewicz, T. (eds) Scalable Uncertainty Management. SUM 2008. Lecture Notes in Computer Science(), vol 5291. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87993-0_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-87993-0_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-87992-3

  • Online ISBN: 978-3-540-87993-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics