Abstract
This work concerns optimal customer data acquisition and selection problem. We first propose using a divide-and-conquer technique to find empirical distribution of the customer data. We then formulate a data acquisition problem as an optimization problem that maximizes the quality of the acquired data while keeping the cost of acquisition as low as possible. We propose using generalized second-price (GSP) auction to acquire customer data and show that when the number of bidders is large, GSP is a truth-telling mechanism. We derive the analytical solution for the optimization problem, which finds a set of data that best represents the probability distribution of the target population relative to the acquisition cost. An experimental study is conducted to demonstrate the effectiveness of the proposed approach.
References
Brand, R., Domingo-Ferrer, J., Mateo-Sanz, J.M.: Reference data sets to test and compare SDC methods for protection of numerical microdata. European Project IST-2000-25069 CASC (2002)
Chaudhuri, A., Stenger, H.: Survey Sampling: Theory and Methods. Marcel Dekker Inc., Florence (1992)
Edelman, B., Ostrovsky, M., Schwarz, M.: Internet advertising and the generalized second-price auction: selling billions of dollars worth of keywords. Am. Econ. Rev. 97(1), 242–259 (2007)
Friedman, J.H., Bentley, J.L., Finkel, R.A.: An algorithm for finding best matches in logarithmic expected time. ACM Trans. Math. Softw. 3(3), 209–226 (1977)
Gomes, R., Sweeney, S.: Bayes-Nash equilibria of the generalized second price auction. http://ssrn.com/abstract=1429585
Groves, R.M.: Nonresponse rates and nonresponse bias in household surveys. Public Opin. Q. 70(5), 646–675 (2006)
Krishna, V.: Auction Theory. Elsevier, Amsterdam (2010)
Mookerjee, V., Dos Santos, B.: Inductive expert system design: maximizing system value. Inf. Syst. Res. 4(4), 111–131 (1993)
Mookerjee, V., Mannino, M.: Redesigning case retrieval to reduce information acquisition costs. Inf. Syst. Res. 8(1), 51–69 (1997)
Moore, J., Whinston, A.: A model of decision-making with sequential information-acquisition (Part 1). Decis. Support Syst. 2(4), 285–307 (1986)
Moore, J., Whinston, A.: A model of decision-making with sequential information-acquisition (Part 2). Decis. Support Syst. 3(1), 47–73 (1987)
Saar-Tsechansky, M., Melville, P., Provost, F.: Active feature-value acquisition. Manage. Sci. 55(4), 664–684 (2009)
Varian, H.R.: Position auctions. Int. J. Ind. Organ. 25(6), 1163–1178 (2007)
Zheng, Z., Padmanabhan, B.: Selectively acquiring customer information: a new data acquisition problem and an active learning-based solution. Manag. Sci. 52(5), 697–712 (2003)
Acknowledgments
Xiao-Bai Li’s research was supported in part by the National Library of Medicine of the National Institutes of Health (NIH) under Grant Number R01LM010942. The content is solely the responsibility of the authors and does not necessarily represent the official views of NIH.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Proof of Theorem 1 (GSP is asymptotically truth-telling).
Let v be the true value of a bidder and \( \beta (v) \) be the bidder’s bid. For a GSP with N bidders (large) and S slots (fixed), Gomes and Sweeney [5] show the following relationship:
where F (with the density f) represents the distribution of bidders’ values and \( \gamma_{s} (v) \) is a function of v with a complicated expression. We have derived the result \( \beta (v) = v \) by showing that
The details are omitted due to space limitations. Instead, we provide a sketch of the proof here in a Nash equilibrium context. First, suppose that the bidder underbid with \( \beta^{ - } (v) < v \) and wins. He will be paid an amount next to his price \( \beta * > \beta^{ - } (v) \). If \( \beta * > v \), then the bidder would win with bid v anyway. If \( \beta^{ * } \le v \), then the payoff to the bidder would be \( \beta^{ * } - v \le 0 \). So, underbidding does not improve the bidder’s payoff. Now, suppose that the bidder overbid with \( \beta^{ + } (v) > v \). Since N is large and S is fixed, there will be a large number of bids within \( [v,\beta^{ + } (v)] \) and this number will be larger than S. Consequently, the bidder will not win by overbidding.
Proof of Theorem 2 (Solutions to the optimization problem (1)).
We standardize the minimization problem (1) to
Applying the KKT conditions to this problem, we have for Lagrange multipliers \( \lambda_{i} , \, i = 1, \ldots ,m \),
It follows from (A.1), (A.2) and (A.3) that
or
For non-boundary solutions, \( x_{i} > 1 \). It then follows from (A.4) that \( \lambda_{i} = 0, \, i = 1, \ldots ,m \). Thus,
i.e.,
or
Substituting (A.3) into the above equation, we have
Solving it for μ, we get \( \mu = 2 - \frac{1}{{\sum\limits_{i = 1}^{m} {\frac{{p_{i} }}{{(2 + wnp_{i} d_{i} )}}} }} \). Substituting it into (A.5), we have
This solution satisfies KKT condition and the objective function is convex, so it is the unique solution.
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Liu, X., Li, XB. (2016). Acquiring High Quality Customer Data with Low Cost. In: Sugumaran, V., Yoon, V., Shaw, M. (eds) E-Life: Web-Enabled Convergence of Commerce, Work, and Social Life. WEB 2015. Lecture Notes in Business Information Processing, vol 258. Springer, Cham. https://doi.org/10.1007/978-3-319-45408-5_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-45408-5_5
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-45407-8
Online ISBN: 978-3-319-45408-5
eBook Packages: Business and ManagementBusiness and Management (R0)