Skip to main content
Log in

Using segment level stability to select target segments in data-driven market segmentation studies

  • Published:
Marketing Letters Aims and scope Submit manuscript

Abstract

Market segmentation is widely used by industry to select the most promising target segment. Most organisations are interested in finding one or a small number of target segments to focus on. Yet, traditional criteria used to select a segmentation solution assess the global quality of the segmentation solution. This approach comes at the risk of selecting a segmentation solution with good overall quality criteria which, however, does not contain groups of consumers representing particularly attractive target segments. The approach we propose helps managers to identify segmentation solutions containing attractive individual segments (e.g., more profitable), irrespective of the quality of the global segmentation solution. We demonstrate the functioning of the newly proposed criteria using two empirical data sets. The new criteria prove to be able to identify segmentation solutions containing individual attractive segments which are not detected using traditional quality criteria for the overall segmentation solution.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4

Similar content being viewed by others

References

  • Ball, G.H., & Hall, D.J. (1965). ISODATA, a novel method of data analysis and pattern classification (Tech. Rep. NTIS No. AD 699616). Menlo Park, CA: Stanford Research Institute.

    Google Scholar 

  • Breckenridge, J.N. (1989). Replicating cluster analysis: method, consistency, and validity. Multivariate Behavioral Research, 24(2), 147–161.

    Article  Google Scholar 

  • Breckenridge, J.N. (2000). Validating cluster analysis: consistent replication and symmetry. Multivariate Behavioral Research, 35(2), 261–285.

    Article  Google Scholar 

  • Brock, G., Pihur, V., Datta, S., & Datta, S. (2008). Clvalid: an r package for cluster validation. Journal of Statistical Software, 25(4), 1–22.

    Article  Google Scholar 

  • Calinski, R.B., & Harabasz, J. (1974). A dendrite method for cluster analysis. Communications in Statistics, 3, 1–27.

    Google Scholar 

  • Chiang, M.M.T., & Mirkin, B. (2010). Intelligent choice of the number of clusters in K-Means clustering: an experimental study with different cluster spreads. Journal of Classification, 27, 3–40.

    Article  Google Scholar 

  • Dolnicar, S. (2004). Beyond “Commonsense segmentation” – a systematics of segmentation approaches in tourism. Journal of Travel Research, 42(3), 244–250.

    Article  Google Scholar 

  • Dolnicar, S., & Leisch, F. (2010). Evaluation of structure and reproducibility of cluster solutions using the bootstrap. Marketing Letters, 21(1), 83–101.

    Article  Google Scholar 

  • Dudoit, S., & Fridlyand, J. (2002). A prediction-based resampling method to estimate the number of clusters in a data set. Genome Biology, 3(7), 1–21.

    Article  Google Scholar 

  • Everitt, B.S. (1974). Cluster analysis. London: Heinemann Educational Books.

    Google Scholar 

  • Everitt, B.S., Landau, S., Leese, M., & Stahl, D. (2011). Cluster analysis, 5th edn. Chichester: Wiley.

    Book  Google Scholar 

  • Fraley, C., & Raftery, A.E. (1998). How many clusters? Which clustering method? Answers via model-based cluster analysis. The Computer Journal, 41(8), 578–588.

    Article  Google Scholar 

  • Grün, B., & Leisch, F. (2004). Bootstrapping finite mixture models. In J. Antoch (Ed.) COMPSTAT 2004 (pp. 1115–22). Heidelberg: Physica.

    Google Scholar 

  • Hartigan, J.A. (1975). Clustering algorithms. New York, NY: Wiley.

    Google Scholar 

  • Hennig, C. (2007). Cluster-wise assessment of cluster stability. Computational Statistics and Data Analysis, 52, 258–271.

    Article  Google Scholar 

  • Iacobucci, D. (2013). Marketing models: multivariate statistics and marketing analytics. Mason, OH: South-Western.

    Google Scholar 

  • Kaufman, L., & Rousseeuw, P.J. (1990). Finding groups in data. New York: Wiley.

    Book  Google Scholar 

  • Lange, T., Roth, V., Braun, M.L., & Buhman, J.M. (2004). Stability-based validation of clustering solutions. Neural Computation, 16(6), 1299–323.

    Article  Google Scholar 

  • Leisch, F. (2006). A toolbox for K-Centroids cluster analysis. Computational Statistics and Data Analysis, 51(2), 526–544.

    Article  Google Scholar 

  • Lilien, G.L., & Rangaswamy, A. (2003). Marketing engineering, 2nd edn. Upper Saddle River: Pearson Education.

    Google Scholar 

  • Mazanec, J.A. (2000). Market segmentation. In J. Jafari (Ed.), Encyclopedia of tourism. London: Routledge.

    Google Scholar 

  • McDonald, M., & Dunbar, I. (2012). Market segmentation: how to do it and how to profit from it, 4th edn. Wiley: Hoboken.

    Book  Google Scholar 

  • Milligan, G.W., & Cooper, M.C. (1985). An examination of procedures for determining the number of clusters in a data set. Psychometrika, 50(2), 159–79.

    Article  Google Scholar 

  • Myers, J.H., & Tauber, E. (1977). Market structure analysis. Chicago: American Marketing Association.

    Google Scholar 

  • Papadimitriou, C., & Steiglitz, K. (1982). Combinatorial optimization: algorithms and complexity. Prentice Hall: Englewood Cliffs.

    Google Scholar 

  • Paulhus, D.L. (1991). Measurement and control of response bias. In J. P. Robinson, P. R. Shaver & L. S. Wrightsman (Eds.), Measures of personality and social psychological attitudes (pp. 17–59). San Diego: Academic Press.

    Chapter  Google Scholar 

  • Putler, D.S., & Krider, R.E. (2012). Customer and business analytics: applied data mining for business decision making using R. London: Chapman&Hall/CRC.

    Google Scholar 

  • R Development Core Team (2016). R: a language and environment for statistical computing r foundation for statistical computing. Vienna, Austria.

  • Roberts, J. (2000). The intersection of modelling potential and practice. International Journal of Research in Marketing, 17, 127–134.

    Article  Google Scholar 

  • Shannon, C.E. (1948). A mathematical theory of communication. The Bell Systems Technical Journal, 27, 379–423.

    Article  Google Scholar 

  • Steinley, D. (2008). Stability analysis in K-Means clustering. British Journal of Mathematical and Statistical Psychology, 61, 255–273.

    Article  Google Scholar 

  • Steinley, D., & Brusco, M.J. (2011). Choosing the number of clusters in K-Means clustering. Psychological Methods, 16(3), 285–297.

    Article  Google Scholar 

  • Tibshirani, R., & Walther, G. (2005). Cluster validation by prediction strength. Journal of Computational and Graphical Statistics, 14(3), 511–28.

    Article  Google Scholar 

  • Vinh, N.X., Epps, J., & Bailey, J. (2010). Information theoretic measures for clusterings comparison: variants, properties, normalization and correction for chance. Journal of Machine Learning Research, 11, 2837–2854.

    Google Scholar 

  • Wedel, M., & Kamakura, W.A. (2000). Market segmentation conceptual and methodological foundations, 2nd edn. Boston: Kluwer Academic Publishers.

    Book  Google Scholar 

Download references

Acknowledgements

We thank the Australian Research Council for contributing to the funding of this study (ARC, DP110101347). We also thank our research assistants Alexander Chapple and Aaron Eden for their assistance with literature searches. Special thanks to Martin Natter, Bettina Grun, Dominik Ernst, Christina Yassouridis, Homa Hajibaba, and Nazila Babakhani for their feedback on previous versions of the manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sara Dolnicar.

Appendix: Technical Appendix

Appendix: Technical Appendix

1.1 Relabelling algorithm required for the calculation of pertinaciousness

For series of partitions we propose a new relabelling algorithm which makes it possible to track segments over partitions with different numbers of clusters. Let again P 1, P 2,…, P m be a series of m partitions with numbers of clusters k 1<k 2<…<k m .

figure a

Note that if k i+1 = k i +1, then only one column needs to be inserted in step 4. However, the algorithm also works for the more general case.

1.2 Calculating segment-wise rerun stability

figure b

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Dolnicar, S., Leisch, F. Using segment level stability to select target segments in data-driven market segmentation studies. Mark Lett 28, 423–436 (2017). https://doi.org/10.1007/s11002-017-9423-8

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11002-017-9423-8

Keywords

Navigation