The Price Is Right

Models and Algorithms for Pricing Data
  • Ruiming Tang
  • Huayu Wu
  • Zhifeng Bao
  • Stéphane Bressan
  • Patrick Valduriez
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8056)

Abstract

Data is a modern commodity. Yet the pricing models in use on electronic data markets either focus on the usage of computing resources, or are proprietary, opaque, most likely ad hoc, and not conducive of a healthy commodity market dynamics. In this paper we propose a generic data pricing model that is based on minimal provenance, i.e. minimal sets of tuples contributing to the result of a query. We show that the proposed model fulfills desirable properties such as contribution monotonicity, bounded-price and contribution arbitrage-freedom. We present a baseline algorithm to compute the exact price of a query based on our pricing model. We show that the problem is NP-hard. We therefore devise, present and compare several heuristics. We conduct a comprehensive experimental study to show their effectiveness and efficiency.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Andrei, S., Robert, V.: The limits of arbitrage. Journal of Finance (1997)Google Scholar
  2. 2.
    Balazinska, M., Howe, B., Suciu, D.: Data markets in the cloud: An opportunity for the database community. PVLDB 4(12), 1482–1485 (2011)Google Scholar
  3. 3.
    Brynjolfsson, E., Hitt, L.M., Kim, H.: Strength in numbers: How does data-driven decision-making affect firm performance? In: ICIS (2011)Google Scholar
  4. 4.
    Buneman, P., Khanna, S., Tan, W.-C.: Why and Where: A Characterization of Data Provenance. In: ICDT, pp. 316–330 (2001)Google Scholar
  5. 5.
    Cheney, J., Chiticariu, L., Tan, W.C.: Provenance in Databases: Why, How, and Where. Foundations and Trends in Databases 1(4), 379–474 (2009)CrossRefGoogle Scholar
  6. 6.
    Cui, Y., Widom, J.: Practical Lineage Tracing in Data Warehouses. In: ICDE, pp. 367–378 (2000)Google Scholar
  7. 7.
    Durkee, D.: Why cloud computing will never be free. Commun. ACM (2010)Google Scholar
  8. 8.
    Hey, T., Tansley, S., Tolle, K.M. (eds.): The Fourth Paradigm: Data-Intensive Scientific Discovery. Microsoft Research (2009)Google Scholar
  9. 9.
    Green, T.J., Karvounarakis, G., Tannen, V.: Provenance Semirings. In: PODS (2007)Google Scholar
  10. 10.
    Khanna, S., Sudan, M., Trevisan, L., Williamson, D.P.: The approximability of constraint satisfaction problems. SIAM J. Comput. 30(6), 1863–1920 (2000)MathSciNetCrossRefGoogle Scholar
  11. 11.
    Koutris, P., Upadhyaya, P., Balazinska, M., Howe, B., Suciu, D.: Query-based data pricing. In: PODS (2012)Google Scholar
  12. 12.
    Li, X.Y.: Optimization algorithms for the minimum-cost satisfiability problemGoogle Scholar
  13. 13.
    A. LLC. AggData, http://www.aggdata.com/
  14. 14.
    Microsoft. Windows Azure Marketplace, https://datamarket.azure.com/
  15. 15.
    Muschalle, A., Stahl, F., Loser, A., Vossen, G.: Pricing approaches for data markets. In: BIRTE (2012)Google Scholar
  16. 16.
    Prugovečki, E.: Quantum Mechanics in Hilbert Space, 2nd edn. Dover Books on Physics Series. Dover Publications (2006)Google Scholar
  17. 17.
    Tang, R., Wu, H., Bao, Z., Bressan, S., Valduriez, P.: Technical report: The price is right – a model and algorithms for the price of data, http://www.comp.nus.edu.sg/~tang1987/tech-reports/pricing-provenance.pdf
  18. 18.
    Wu, S., Banker, R.D.: Best pricing strategy for information services. J. AIS 11(6) (2010)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2013

Authors and Affiliations

  • Ruiming Tang
    • 1
  • Huayu Wu
    • 2
  • Zhifeng Bao
    • 3
  • Stéphane Bressan
    • 1
  • Patrick Valduriez
    • 4
  1. 1.School of ComputingNational University of SingaporeSingapore
  2. 2.Institute for Infocomm ResearchSingapore
  3. 3.IDMINational University of SingaporeSingapore
  4. 4.INRIA & LIRMMMontpellierFrance

Personalised recommendations