Abstract
A vital component in strategic segmentation is the typing tool. Little is known about their prediction performance. Even less is known how well they perform at the segment-level, in imbalanced situations, and how well they predict the smallest (minority) segment. We investigate using simulated and real-life data, how well typing tools perform overall and at the specific segment-level and we show the following. One, even when overall prediction accuracy is good, specific segments may be predicted poorly. Two, for valuable (minority) segments with high targeting costs misclassification can have a substantial impact on the profitability of the segmentation strategy. Poor prediction of a minority segment can happen in high and mildly imbalanced segments. Three, prediction of minority segments can vary substantially across different base classifiers and across imbalance correction methods. We find that performance can vary substantially across base classifiers and that support vector machines, overall, perform best. Four, the prediction of a (minority) segment can always be improved by using imbalance correction methods, and overall random under-sampling performs best.
Similar content being viewed by others
Change history
09 May 2022
Corresponding author e-mail address is missing and it has been updated
Notes
The reason why this is often called a typing tool is because firms in addition to the predictive model need a simple software solution (e.g. an Excel macro) that will automatically map customers in the database into the identified market segments.
References
Ahn, G., Y.-J. Park, and S. Hur. 2021. A membership probability-based under-sampling algorithm for imbalanced data. Journal of Classification 38 (2): 2–15.
Allaway, A.W., G. D’Souza, D. Berkowitz, and K. Kim. 2014. Dynamic segmentation of loyalty program behavior. Journal of Marketing Analytics 2 (1): 18–32.
Allenby, G., G. Fennell, A. Bemmaor, V. Bhargava, F. Christen, J. Dawley, P. Dickson, Y. Edwards, M. Garrat, J. Ginter, A. Sawyer, R. Staelin, and S. Yanmg. 2002. Market segmentation research: Beyond within and across group differences. Marketing Letters 13 (3): 233–243.
Andersen, L., and C. Ho 2011. Life is iterative, so is segmentation. Ipsos Point-of-View white paper.
Balakrishnan, P.V., S. Kumar, and P. Han. 2011. Dual objective segmentation to improve targetability: An evolutionary algorithm approach. Decision Sciences 42 (4): 831–856.
Barandela, R., J.S. Sánchez, V. Garcia, and E. Rangel. 2003. Strategies for learning in class imbalance problems. Pattern Recognition 36: 849–851.
Beane, T.P. and D.M. Ennis. 1987. Market segmentation: A review. European Journal of Marketing, 21(5): 20-42.
Bennett, K.P., and C. Campbell. 2000. Support vector machines: Hype or Hallelujah. SIGKDD Exploration 2 (2): 1–13.
Beque, A., and S. Lessmann. 2017. Extreme learning machines for credit scoring: An empirical evaluation. Expert Systems with Applications 86: 42-53.
Breiman, L., J.H. Friedman, R.A. Olshen, and C.J. Stone. 1984. Classification and regression trees. Belmont: Wadsworth.
Brudvig, S., M.J. Brusco, and J.D. Cradit. 2019. Joint selection of variables and clusters recovering the underlying structure of marketing data. Journal of Marketing Analytics 7 (1): 1–12.
Bruwer, J., and E. Li. 2017. Domain-specific market segmentation using latent class mixture modelling approach and wine-related lifestyle (WRL) algorithm. European Journal of Marketing 5 (9/10): 1552–1576.
Bult, J.R. 1993. Semi-parametric versus parametric classification models: An application to direct marketing. Journal of Marketing Research XXX: 380–390.
Caruana, R., and A. Niculescu-Mizil. 2006. An empirical comparison of supervised learning algorithms. In Proc. 23rd international conference on machine learning. CiteSeerX 10.1.1.122.5901
Chang, C.C., and C.J. Lin. 2011. LIBSVM: a library for support vector machines. ACM transactions om Intelligent Systems & Technology 2(3): 1-27.
Chawla, N.V., K.W. Bowyer, L.O. Hall, and W.P. Kegelmeyer. 2002. SMOTE: Synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16: 321–357.
Christensen, C.M., S. Cook, and T. Hall. 2005. Marketing malpractice: The cause and the cure. Harvard Business Review 83 (12): 74–83.
Cortes, C., and V. Vapnik. 1995. Support-vector networks. Machine Learning 20: 273–297.
Coussement, K., F.A.M. Van den Bosche, and K.W. De Bock. 2014. Data accuracy’s impact on segmentation performance: Benchmarking RFM analysis, logistic regression, and decision trees. Journal of Business Research 67: 2751–2758.
Coussement, K., and D. Van den Poel. 2008. Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques. Expert Systems and Applications 34: 313–327.
Coussement, K., M. Phan, A. De Gaigny, D.F. Benoit, and A. Raes. 2020. Predicting student drop-out in subscription-based online learning environments: The beneficial impact of the logit leaf model. Decision Support Systems 135: 113325.
DeSarbo, W.S., M. Wedel, M. Vriens, and V. Ramsawamy. 1992. Latent class metric conjoint analysis. Marketing Letters 3 (3): 273–288.
Dibb, S. 1999. Criteria guiding segmentation implementation: Reviewing the evidence. Journal of Strategic Marketing 7: 107–129.
Dibb, S., and L. Simkin. 2001. Market segmentation: Diagnosing and treating the barriers. Industrial Marketing Management 30: 609–625.
Dibb, S., and L. Simkin. 2010. Judging the quality of customer segments: Segmentation effectiveness. Journal of Strategic Marketing 18 (2): 113–131.
Dolcinar, S. 2002. A review of data driven market segmentation in Tourism. Journal of Travel and Tourism Marketing 12 (1): 1–22.
Dolcinar, S., and K. Lazarevski. 2009. Methodological reasons for the theory/practice divide in market segmentation. Journal of Marketing Management 25 (3–4): 357–373.
Douzas, J., and F. Bação. 2017. Effective data generation for imbalanced learning using conditional generative adversarial networks. Expert Systems with Applications 91: 464–471.
Estabrooks, A., T. Jo, and N. Japkowicz. 2004. A multiple re-sampling method for learning from imbalanced datasets. Computational Intelligence 20 (1): 18–36.
Gibson, L.D. 2001. Is something rotten in segmentation? Marketing Research Magazine 13: 20–25.
Gini, C. 1936. On the measure of concentration with special reference to income and statistics. Colorado College Publication, General Series 208: 73–79.
Gordini, N., and V. Veglio. 2017. Customer churn prediction and marketing retention strategies. An application of support vector machines based on the AUC parameter-selection technique in B2B e-commerce industry. Industrial Marketing Management 62: 100–1007.
Guyon, I. 2003. Design of experiments for the NIPS study 2003 variable selection benchnmark.
Hastie, T., R. Tibshirani, and J. Friedman. 2001. The elements of statistical learning. New York: Springer.
He, H., and E.A. Garcia. 2009. Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering 21 (9): 1263–1284.
Hiziroglu, A. 2013. A neuro-fuzzy two-stage clustering approach to customer segmentation. Journal of Marketing Analytics 1 (4): 202–221.
Japkowicz, N., and S. Stephen. 2002. The class imbalance problem: A systematic study. Intelligent Data Analysis 6 (5): 429–449.
Kamakura, W.A., M. Wedel, and J. Agrawal. 1994. Concomitant variable latent class analysis for conjoint analysis. International Journal of Research in Marketing 11: 451–464.
Kim, T., and H.-Y. Lee. 2011. External validity of market segmentation methods. European Journal of Marketing 45 (1–2): 153–169.
King, G., and L. Zeng. 2001. Logistic regression in rare events data. Political Analysis 9 (2): 137–163.
Kotler, P. 1967. Marketing management. Englewood Cliffs, NJ: Prentice Hall.
Laurikkala, J. 2001. Improving identification of difficulty small classes by balancing class distribution. In: Proceedings of the conference of AI in Medicine in Europe: Artificial Intelligence Medicine, pp 63–66.
Lee, P.H. 2014. Resampling methods improve the predictive power of modeling in class-imbalanced datasets. International Journal of environmental research and Public Health 11: 9776–9789.
Lemmens, A., and C. Croux. 2006. Bagging and boosting classification trees to predict churn. Journal of Marketing Research XLIII: 276–286.
Liu, Y., S. Ram, R.F. Lusch, and M. Brusco. 2010. Multi-criterion market segmentation: A new model, implementation, and evaluation. Marketing Science 29 (5): 880–984.
Liu, Y., M. Kiang, and M. Brusco. 2012. A unified framework for market segmentation and its applications. Expert Systems with Applications 39: 10292–10302.
Louviere, J.J., T.N. Flynn, and A.A.J. Marley. 2015. Best-worst scaling: Theory, methods and applications. Cambridge: Cambridge University Press.
Magidson, J. 1994. The CHAID approach to segmentation modeling: Chi-squared automatic interaction detection. In Advanced methods of marketing research, ed. R.P. Bagozzi, 118–159. Cambridge: Basil Blackwell.
Marinakos, G., and S. Daskalaki. 2017. Imbalanced customer classification for bank direct marketing. Journal of Marketing Analytics 5: 14–30.
Marqués, A.I., V. García, and J.S. Sánchez. 2013. On the suitability of re-sampling techniques for the class imbalance problem in credit scoring. Journal of the Operational Research Society 64 (7): 1060–1070.
Natter, M. 1999. Conditional market segmentation by neural networks: A Monte Carlo Study. Journal of Retailing & Consumer Services 6: 237–248.
McCarty, J.A., and M. Hastak. 2007. Segmentation approaches in data mining: A comparison of RFM, CHAID and logistic regression. Journal of Business Research 60: 656–662.
Miguéis, V.L., A. Freitas, P.J.V. Garcia, and A. Silva. 2018. Early segmentation of students according to their academic performance: A predictive modeling approach. Decision Support Systems 115: 36–51.
Mora Cortez, R., A.H. Clarke, and P.V. Freytag. 2021. B2B market segmentation: A systematic review and research agenda. Journal of Business Research 126: 415–428.
Nanni, L., C. Fantozzi, and N. Lazzarini. 2015. Coupling different methods for overcoming the class imbalance problem. Neurocomputing 158: 48–61.
Ngai, E.W.T., Y. Hu, Y. Chen, and X. Sun. 2011. The application of data mining techniques in financial fraud detection: A classification framework and an academic review of the literature. Decision Support Systems 50: 559–569.
Pedregosa, F., G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, D. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J.T. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. OPerrot, and E. Duchesnay. 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research 12: 2825–2830.
Pires, G.D., J. Stanton, and P. Stanton. 2011. Revisiting the substantiality criterion: From ethnic marketing to market segmentation. Journal of Business Research 64: 988–996.
Reibstein, J. 1987. Benefit segmentation in industrial markets. Journal of Business Research 14: 463–486.
Rousseeuw, P.J. 1987. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. Computational and Applied Mathematics. 20: 53–65.
Smith, W. 1956. Product differentiation and market segmentation as alternative marketing strategies. Journal of Marketing 21: 3–8.
Trevenen, L. 2018. Why segmentation matters. In Innovation in pricing, ed. A. Hinterhuber, and S.M. Liozu, 174–189.
Thoeni, A.T., G.W. Marshall, and S.M. Campbell. 2016. A resource-advantage theory typology of strategic segmentation. European Journal of Marketing 50 (12): 2192–2215.
Thomas, J.W. 1980. Market segmentation. Quarterly Review of Marketing 6 (1): 25–28.
Tonks, D.G. 2009. Validity and the design of market segments. Journal of Marketing Management 25 (3–4): 341–356.
Tuma, M.N., R. Decker, and S. Scholz. 2011. A survey of challenges and pitfalls of cluster analysis applications in market segmentation. International Journal of Market Research 53 (3): 391–414.
Van der Heijden, P.G.M., J. Dessens, and U. Böckenholt. 1996. Estimating the concomitant variable latent-class model with the EM algorithm. Journal of Educational and Behavioral Statistics 21 (3): 215–229.
Van Rijsbergen, C.J. 1979. Information retrieval, 2nd ed. London: Butterworth-Heinemann.
Vriens, M., M. Wedel, and T. Wilms. 1996. Metric conjoint segmentation methods: A Monte Carlo comparison. Journal of Marketing Research 33 (1): 73–85.
Vriens, M. 2001. Market segmentation: Analytical developments and application guidelines. New York: Millward Brown Intelliquest.
Vidden, C., M. Vriens, and C. Song. 2016. Comparing clustering methods for market segmentation: A simulation study. Applied Marketing Analytics, 2(3): 225–238.
Wedel, M., and W.A. Kamakura. 2000. Market segmentation: Conceptual and methodological foundations. Dordrecht: Kluwer Academic Publishers.
Wedel, M., and W.S. DeSarbo. 2002. Market segment derivation and profiling via a finite mixture model framework. Marketing Letters 13 (1): 17–25.
Wedel, M., and J.-B. Steenkamp. 1989. Fuzzy cluster-wise regression approach to benefit segmentation. International Journal of Research in Marketing 6 (4): 241–258.
West, P.M., P.L. Brockett, and L.L. Golden. 1997. A comparative analysis of neural networks and statistical methods for predicting consumer choice. Marketing Science 16 (4): 370–391.
Wind, Y. 1978. Issues and advances in segmentation research. Journal of Marketing Research 15: 317–337.
Yankelovich, D., and D. Meer. 2006. Rediscovering market segmentation. Harvard Business Review 84 (2): 122–166.
Young, S., L. Ott, and B. Feigin. 1978. Some practical considerations in market segmentation. Journal of Marketing Research XV: 405–512.
Zeng, G. 2009. On the existence of maximum likelihood estimates for weighted logistic regression. Communication in Statistics 46 (22): 11194–11203.
Zhu, B., B. Baesens and S.K.L.M. vandenBroucke. 2017. An empirical comparison of techniques for the class imbalance problem in churn prediction. Information Sciences, 408: 84–99.
Acknowledgements
We would like to thank the three anonymous reviewers for their constructive comments.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix 1
Some selected market segmentation applications: Number of segments and imbalance
No. | Studies | Number of segments | Segment imbalance |
---|---|---|---|
1 | Otoo, F.E., Kim, S. & Park, J. (2020). Motivation-based segmentation of Chinese travelers: The role of preferences, sociodemograhics and travel-related features. Journal of Vacation Marketing, 26, 4, 457–472 | 5 | Balanced |
2 | Delley, M., & Brunner, T. A. (2020). A segmentation of Swiss fluid milk consumers and suggestions for target product concepts. Journal of Dairy Science, 103, 4, 3095–3106 | 3 | 1:3 |
3 | Chakrabarti, A., Campbell, B. L., & Shonkwiler, V. (2019). Eliciting Consumer Preference and Willingness to Pay for Mushrooms: A Latent Class Approach. Journal of Food Distribution Research, 50, 1, 46–62 | 3 | Balanced |
4 | Albayrak, T., Dursun, A. & Űnal, C. (2019). Do tourists have different motivations for online tyravel purchasing? A segmentation of the Russian market. Journal of Vacation Marketing, 25, 4, 432–443 | 3 | 1:3 |
5 | Janardhanan, S. & Muthalagu, R. (2020). Market segmentation for profit maximization using Machine Learning. Journal of Physics: Conference Series, 1706 | 4 | 1:3 |
6 | Jürkenbeck, K., Spiller, A. & Meyerding, G.H. (2019). Tomato attributes and consumer preferences – a consumer segmentation approach. British Food Journal, 122, 1, 328–344 | 6 | 1:2 |
7 | Scheuffelen, S., Kemper, J. & Brettel, M. (2019). How do human attitudes and values predict online marketing responsiveness: Comparing consumer segmentation bases toward brand purchase and marketing response. Journal of Advertising Research, 142–157 | 6 | Balanced |
8 | Elhai, J. D., & Contractor, A. A. (2018). Examining latent classes of smartphone users: Relations with psychopathology and problematic smartphone use. Computers in Human Behavior, 82, 159–166 | 2 | Balanced |
9 | Liu, H.-Z., Zheng, Y., Rao, L.-L., Wang, F., Sun, Y., Huang, G.-H., Li, S., & Liang, Z.-Y. (2018). Not all gamblers are created equal: gambling preferences depend on individual personality traits. Journal of Risk Research, 21, 7, 885–898 | 3 | 1:2 |
10 | Erdem, S. (2018). Who do UK consumers trust for information about nanotechnology? Food Policy, 77, 133–142 | 2 | 1:2 |
11 | Griva, A., Bardaki, C., Pranmatari, K., Papakiriakopoulos, D. (2018). Retail business analytics: Customer visit segmentation using market basket data. Expert systems with Applications, 100, 1–16 | 10 | 1:3 |
12 | Gengler, C.E., Mulvey, M.S. (2017). Planning pre-launch positioning: Segmentation via willingness-to-pay and means-end brand differentiators. Journal of Brand Management, 24, 230–249 | 4 | 1:6 |
13 | Nadeem, W., Juntunen, M., & Juntunen, J. (2017). Consumer segments in social commerce: A latent class approach. Journal of Consumer Behaviour, 16, 3, 279–292 | 3 | 1:10 |
14 | Pomarici, E., Lerro, M., Chrysochou, P., Vecchio, R., & Krystallis, A. (2017). One size does (obviously not) fit all: Using product attributes for wine market segmentation. Wine Economics and Policy, 6(2), 98–106 | 4 | 1:2 |
15 | Bruwer, J., & Li, E. (2017). Domain-specific market segmentation using a latent-class mixture modelling approach and wine-related lifestyle (WRL) algorithm. European Journal of Marketing, 51, 9–10, 1552–1576 | 5 | 1:2 |
16 | Palma, M. A., Ness, M. L., & Anderson, D. P. (2017). Fashionable food: a latent class analysis of social status in food purchases. Applied Economics, 49, 3, 238–250 | 4 | 1:6 |
17 | Kolhede, E., & Gomez-Arias, J. T. (2017). Distinctions Between Frequent Performing Arts Consumers: Implications for Segmentation and Positioning. International Journal of Arts Management, 20, 1, 31–53 | 3 | Balanced |
18 | Birkenmaier, J., & Fu, Q. (2016). Who Uses Alternative Financial Services? A Latent Class Analysis of Consumer Financial Knowledge and Behavior. Journal of Social Service Research, 42, 3, 412–424 | 5 | 1:4 |
19 | Chan, C.C.H., Hwang, Y-R., Wu, H-C. (2016). Marketing segmentation using the particle swarm optimization algorithm. Journal of Ambient Intelligent Human Computing, 7, 855–863 | 4 | 1:4 |
20 | Verain, M.C.D., Sijtsema, S.J., Antonides, G. (2016). Consumer segmentation based on food category attribute importance: The relationship with healthiness and sustainability perceptions. Food Quality and Preference, 48, 99–106 | 3 | 1:3 |
21 | Adasme-Berios, C., Sanches, M., Mora, M. Schnettler, B.M, Lobos, G. & Diaz, J. (2016). Segmentation of consumer preferences for food safety label on vegetables: Consumer profiles in central and south-central Chile. British Food Journal, 118, 10, 2550–2566 | 3 | 1:3 |
22 | Weijters, B., & Goedertier, F. (2016). Understanding today’s music acquisition mix: a latent class analysis of consumers’ combined use of music platforms. Marketing Letters, 27, 3, 603 | 4 | 1:4 |
23 | Díaz, E., Martín-Consuegra, D., & Estelami, H. (2016). A persuasive-based latent class segmentation analysis of luxury brand websites. Electronic Commerce Research, 16, 3, 401 | 3 | 1:3 |
24 | Segovia, M. S., & Palma, M. A. (2016). Buying your way into a healthier lifestyle: a latent class analysis of healthy food purchases. Applied Economics, 48, 21, 1965–1977 | 2 | Balanced |
25 | Hosseini, M. & Shabani, M. (2015).New approach to customer segmentation based on changes in customer value. Journal of Marketing Analytics, 3, 3, 110–121 | 11 | 1:7 |
26 | Bassi, F. (2015). Forecasting financial products acquisition via dynamic segmentation: an application to the Italian market. International Journal of Market Research, 57, 6, 909 | 5 | 1:20 |
27 | Rundle-Thiele, S., Kubacki, K., Tcazzynski, A., Parkinson, J. (2014). Using two-step cluster analysis to identify homogeneous physical activity groups. Marketing Intelligence and Planning, 33, 4, 522–537 | 4 | 1:3 |
28 | Alexandra, D., Miragaia, M., Maryins, M.A.B. (2015). Mix between satisfaction and attributes destination choice: A segmentation criterion to understand the ski resorts consumers. International journal of tourism research, 17, 313–324 | 6 | 1:3 |
29 | Realini, C.E., Kallas, Z., Pérez-Juan, M., Gómez, J., Olleta, J.L., Beriain, M.J., Alberti, P., & Sañuda, C. (2014). Relative importance of cues underlying Spanish consumers’ beef choice and segmentation, and consumer liking of beef enriched with n-3 and CLA fatty acids. Food Quality & Preference, 33, 74–85 | 3 | 1:2 |
30 | Campbell, C., Ferraro, C., Sands, S. (2014). Segmenting consumer reactions to social network marketing. European Journal of Marketing, 48, 3–4, 432–452 | 5 | 1:3 |
31 | Liu, H.B., McCarthy, B., Chen, T. (2013). The Chinese wine market: a market segmentation study. Asian Pacific Journal of Marketing & Logistics, 26, 3, 450–471 | 3 | 1:10 |
32 | Simunaniemi, A. ‐M., Nydahl, M., & Andersson, A. (2013). Cluster analysis of fruit and vegetable-related perceptions: An alternative approach of consumer segmentation. Journal of Human Nutrition and Dietetics, 26, 1, 38–47 | 2 | 1:2 |
33 | Riefler, P., Diamantopoulos, A., & Siguaw, J. A. (2012). Cosmopolitan consumers as a target group for segmentation. Journal of International Business Studies, 143, 285–305 | 4 | 1:3 |
34 | Liu, Y., Kiang, M. & Brusco, M. (2012). A unified framework for market segmentation and its applications. Expert Systems with Applications, 39, 10292–10302 | 3 | 1:3 |
35 | Davis, S. (2012). Choosing the right baskets for your eggs: deriving actionable customer segments using supervised genetic algorithms. International Journal of Market Research, 54, 5, 689 | 4 | Balanced |
36 | Ko, E., Taylor, C.A., Sung, H., Lee, J., Wagner, U., Navarro, Martin-Consuegra, D., & Wang, F. (2012). Global market segmentation usefulness in the sportwear industry, 65, 1565–1575 | 4 | 1:2 |
37 | Barber, N., & Taylor, C. (2011). Equity benefits of smaller wine regions and life- style segmentation. Journal of Brand Management, 19, 2, 158–175 | 8 | 1:3 |
38 | Kim, T. & Lee, H-Y. (2011). External validity of market segmentation methods: A study of buyers of prestige cosmetic brands. European Journal of Marketing, 45, 1–2, 153–169 | 3 | 1:2 |
39 | Ponnam, A., Sahoo, D., & Balaij, M.S. (2011). Satisfaction-based segmentation: Application of the Kano model in Indian fast-food industry. Journal of Targeting, Measurement, and Analysis for Marketing, 19, 3/4, 195–205 | 4 | 1:2 |
40 | Cui, G., & Wang, Y. (2010). Consumers’ SKU choices in an online supermarket: a latent class approach. Journal of Marketing Management, 26, 5–6, 495–514 | 3 | NA |
41 | Liu, Y., Ram, S., Lusch, R.F. & Brusco, M. (2010). Multicriterion market segmentation: A new model, implementation, and evaluation. Marketing Science, 29, 5, 880–894 | 4 | 1:10 |
42 | Gil-Saura, I., Ruiz-Molina, M-E. (2009). Customer segmentation based on commitment and ICT use. Industrial Management & Data, 109, 2, 206–223 | 4 | 1:3 |
43 | Gabay, G., Paulus, K., Aarts, P., Beckley, J., Ashman, H., & Moskowitz, H. R. (2009). Cross-National Segmentation for Messaging about Cheese: Towards a New Approach to Consumer Understanding, Directed Development, and Targeted Marketing. Italian Journal of Food Science, 21, 4, 407–428 | 3 | 1:2 |
44 | Shiftan, Y., Outwater, M/L., Zhou, Y. (2008). Transit market research using structural equation modeling and attitudinal segmentation. Transport Policy, 15, 186–195 | 8 | 1:17 |
45 | Varela Mallou, J., Rial Boubeta, A., Braña Tobío, T., & Voces López, C. (2008). Application of latent class analysis to the investigation of customer loyalty in service companies. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 4, 3, 87–96 | 4 | 1:25 |
46 | Bruning, E.R., Hu, M.Y., Hao, W. (2007). Cross-national segmentation: An application in the NAFTA airline passenger market. European Journal of Marketing, 43, 11/12, 1498–1522 | 5 | 1:2 |
47 | Bassi, F. (2007). Latent class factor models for market segmentation: an application to pharmaceuticals. Statistical Methods and Applications, 16, 279–287 | 3 | 1:4 |
48 | Brey, E.T., So, S-I., Kim, D-Y., Morrison, A.M. (2007). Web-based permission marketing: Segmentation for the lodging industry. Tourism Management, 28, 1408–1416 | 3 | 1:3 |
49 | Sajeev Varki, & Pradeep K. Chintagunta. (2004). The Augmented Latent Class Model: Incorporating Additional Heterogeneity in the Latent Class Model for Panel Data. Journal of Marketing Research, 41(2), 226 | 5 | 1:7 |
50 | Sewitch, M.J., KLeffondre, K., Dobkin, P.L. (2004). Clustering patients according to healthcare perceptions: relationships to psychosocial characteristics and medication non-adherence. Journal of Psychosomatic Research, 56, 323–332 | 5 | Balanced |
51 | Bhatnagar, A. & Ghose, S. (2004). A latent class segmentation analysis of e-shoppers. Journal of Business Research, 57, 758–767 | 3 | 1:9 |
52 | Ter Hofstede, F., Steenkamp. J-B. E.M., & Wedel, M. (1999). International market segmentation based on consumer-product relations. Journal of Marketing Research, XXXVI, 1–17 | 4 | 1:7 |
53 | Steenkamp, J-B. & Wedel, M. (1993). Fuzzy clusterwise regression in benefit segmentation: Application and investigation into its validity. Journal of Business Research, 26, 3, 237–249 | 3 | 1:2 |
54 | Morwitz, V.G. & Schmittlein, D. (1992). Using segmentation to improve sales forecasts based on purchase intent: Which “intenders” actually buy. Journal of Marketing Research, XXIX, 391–405 | 5 | 1:9 |
55 | Carroll, N.V., Gagon, J.P. (1983). Identifying consumer segments in health services markets: An application of conjoint and cluster analyses to the ambulatory care pharmacy market. Journal of Health Care Marketing, 3, 3, 22–34 | 5 | 1:10 |
Appendix 2
The SMOTE algorithm:
-
Provide the algorithm with the number of minority segment samples, T, the number of synthetic minority segment samples to generate, N, and the number of nearest neighbors to find for each sample, k.
1. For all T minority segment samples, compute the k nearest neighbors (or nearest samples) based on Euclidean distance.
2. While N is greater than 0:
2.a Select a sample i from the T minority segment samples. (If run out of samples from T we just loop over it again).
2.b Randomly select a nearest neigbor for sample i, which we refer to as nn.
2.c For all possible attributes:
2.c. (1) Compute the difference between sample nn’s and sample i’s attribute value, store this value as diff.
2.c (2) Randomly select a number between 0 and 1, store this value as gap.
2.c (3) Set the synthetic datapoint attribute value to.
sample[i][attribute] + gap*diff.
2.d Store the new synthetic datapoint.
2.e N = N − 1.
Rights and permissions
About this article
Cite this article
Vriens, M., Bosch, N., Vidden, C. et al. Prediction and profitability in market segmentation typing tools. J Market Anal 10, 360–389 (2022). https://doi.org/10.1057/s41270-021-00145-4
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1057/s41270-021-00145-4