Abstract
Uplift modeling refers to approaches to quantify net difference in outcome between applying a treatment and not applying it to an individual. It is a typical causal inference problem which allowing us to design a refined decision rule that only targets those susceptible. The core difficulty of the existing methods is that there is no direct supervision label for uplifts. In this paper, we propose an efficient Causal Enhanced Uplift Model (CEUM) to excavate potential uplift information. Specifically, we first construct contrastive pairs following the properties of partial order relation of uplifts and maintain the structural correlation and consistency during training. Then, we seek for a promising average causal effect in batches to approach the mean of individual estimated uplift. Finally, we benchmark the efficacy of the proposed method by conducting comprehensive experiments and the results show that CEUM achieves the state-of-the-art performance on two real-world datasets.
X. He and G. Xu contribute equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Athey, S., Imbens, G.W.: Machine learning methods for estimating heterogeneous causal effects. Stat. 1050(5), 1–26 (2015)
Betlei, A., Diemert, E., Amini, M.R.: Uplift modeling with generalization guarantees. In: Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pp. 55–65 (2021)
Chen, H., Harinen, T., Lee, J.Y., Yung, M., Zhao, Z.: Causalml: python package for causal machine learning. arXiv preprint arXiv:2002.11631 (2020)
Devriendt, F., Van Belle, J., Guns, T., Verbeke, W.: Learning to rank for uplift modeling. IEEE Trans. Knowl. Data Eng. (2020)
Eustache, D., Artem, B., Renaudin, C., Massih-Reza, A.: A large scale benchmark for uplift modeling. In: Proceedings of the AdKDD and TargetAd Workshop, KDD, London, United Kingdom, 20 August 2018, ACM (2018)
Grimmer, J., Messing, S., Westwood, S.J.: Estimating heterogeneous treatment effects and the effects of heterogeneous treatments with ensemble methods. Polit. Anal. 25(4), 413–434 (2017)
Hansotia, B.J., Rukstales, B.: Direct marketing for multichannel retailers: issues, challenges and solutions. J. Database Market. Customer Strat. Manage. 9(3), 259–266 (2002)
Hillstrom, K.: The minethatdata e-mail analytics and data mining challenge (2018). https://blog.minethatdata.com/2008/03/minethatdata-e-mail-analytics-and-data.html
Jaskowski, M., Jaroszewicz, S.: Uplift modeling for clinical trial data. In: ICML Workshop on Clinical Data Analysis, vol. 46 (2012)
Johansson, F., Shalit, U., Sontag, D.: Learning representations for counterfactual inference. In: International Conference on Machine Learning, pp. 3020–3029. PMLR (2016)
Ke, G., et al.: LightGBM: a highly efficient gradient boosting decision tree. Adv. Neural Inf. Process. Syst. 30, 3146–3154 (2017)
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
Künzel, S.R., Sekhon, J.S., Bickel, P.J., Yu, B.: Metalearners for estimating heterogeneous treatment effects using machine learning. Proc. Natl. Acad. Sci. 116(10), 4156–4165 (2019)
Li, S., Vlassis, N., Kawale, J., Fu, Y.: Matching via dimensionality reduction for estimation of treatment effects in digital marketing campaigns. In: IJCAI, pp. 3768–3774 (2016)
Lo, V.S.: The true lift model: a novel data mining approach to response modeling in database marketing. ACM SIGKDD Explor. Newsl. 4(2), 78–86 (2002)
Loh, W.Y.: Classification and regression trees. Wiley Interdiscipl. Rev. Data Mining Knowl. Disc. 1(1), 14–23 (2011)
Mouloud, B., Olivier, G., Ghaith, K.: Adapting neural networks for uplift models. arXiv preprint arXiv:2011.00041 (2020)
Radcliffe, N.J., Surry, P.D.: Real-world uplift modelling with significance-based uplift trees. White Paper TR-2011-1, Stochastic Solutions, pp. 1–33 (2011)
Rosenbaum, P.R., Rubin, D.B.: The central role of the propensity score in observational studies for causal effects. Biometrika 70(1), 41–55 (1983)
Rubin, D.B.: Estimating causal effects of treatments in randomized and nonrandomized studies. J. Educ. Psychol. 66(5), 688 (1974)
Rubin, D.B.: Causal inference using potential outcomes: design, modeling, decisions. J. Am. Statist. Assoc. 100(469), 322–331 (2005)
Schwab, P., Linhardt, L., Karlen, W.: Perfect match: a simple method for learning representations for counterfactual inference with neural networks. arXiv preprint arXiv:1810.00656 (2018)
Shalit, U., Johansson, F.D., Sontag, D.: Estimating individual treatment effect: generalization bounds and algorithms. In: International Conference on Machine Learning, pp. 3076–3085. PMLR (2017)
Shi, C., Blei, D.M., Veitch, V.: Adapting neural networks for the estimation of treatment effects. arXiv preprint arXiv:1906.02120 (2019)
Wager, S., Athey, S.: Estimation and inference of heterogeneous treatment effects using random forests. J. Am. Statist. Assoc. 113(523), 1228–1242 (2018)
Yao, L., Li, S., Li, Y., Huai, M., Gao, J., Zhang, A.: Representation learning for treatment effect estimation from observational data. Adv. Neural Inf. Process. Syst. 31, 1–10 (2018)
Zaniewicz, Ł., Jaroszewicz, S.: Support vector machines for uplift modeling. In: 2013 IEEE 13th International Conference on Data Mining Workshops, pp. 131–138. IEEE (2013)
Zhang, W., Li, J., Liu, L.: A unified survey of treatment effect heterogeneity modelling and uplift modelling. ACM Comput. Surv. (CSUR) 54(8), 1–36 (2021)
Zhao, Y., Fang, X., Simchi-Levi, D.: A practically competitive and provably consistent algorithm for uplift modeling. In: 2017 IEEE International Conference on Data Mining (ICDM), pp. 1171–1176. IEEE (2017)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
He, X. et al. (2022). Causal Enhanced Uplift Model. In: Gama, J., Li, T., Yu, Y., Chen, E., Zheng, Y., Teng, F. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2022. Lecture Notes in Computer Science(), vol 13282. Springer, Cham. https://doi.org/10.1007/978-3-031-05981-0_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-05981-0_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-05980-3
Online ISBN: 978-3-031-05981-0
eBook Packages: Computer ScienceComputer Science (R0)