Segmentation: Clustering and Classification

Chapman, Chris; Feit, Elea McDonnell

doi:10.1007/978-3-319-14436-8_11

Chris Chapman⁶ &
Elea McDonnell Feit⁷

Part of the book series: Use R! ((USE R))

18k Accesses
1 Citations

Abstract

In this chapter, we tackle a canonical marketing research problem: finding, assessing, and predicting customer segments. In previous chapters we’ve seen how to assess relationships in the data (Chap. 4), compare groups (Chap. 5), and assess complex multivariate models (Chap. 10). In a real segmentation project, one would use those methods to ensure that data has appropriate multivariate structure, and then begin segmentation analysis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 64.99; Price excludes VAT (USA)

Softcover Book: USD 84.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.
Article MATH Google Scholar
Caldon, P. (2013). to.dendrogram. http://stats.stackexchange.com/a/45161.
Collins, L. M., & Lanza, S. T. (2010). Latent class and latent transition analysis: With applications in the social, behavioral, and health sciences. New York: Wiley.
Google Scholar
Everitt, B. S., Landau, S., Leese, M., & Stahl, D. (2011). Cluster analysis (5th ed.). Wiley Series in Probability and Statistics. Chichester: Wiley.
Book MATH Google Scholar
Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems? Journal of Machine Learning Research, 15, 3133–3181.
MathSciNet MATH Google Scholar
Fraley, C., & Raftery, A. E. (2002). Model-based clustering, discriminant analysis, and density estimation. Journal of the American Statistical Association, 97(458), 611–631.
Article MathSciNet MATH Google Scholar
Fraley, C., Raftery, A. E., Murphy, T. B., & Scrucca, L. (2012). mclust version 4 for R: Normal mixture modeling for model-based clustering, classification, and density estimation (Tech. Rep. 597). Seattle: University of Washington.
Google Scholar
Grün, B., & Leisch, F. (2008). FlexMix version 2: finite mixtures with concomitant variables and varying and constant parameters. Journal of Statistical Software, 28(4), 1–35. http://www.jstatsoft.org/v28/i04/.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). New York: Springer.
Book MATH Google Scholar
Hornik, K. (2005). A CLUE for CLUster ensembles. Journal of Statistical Software, 14(12), 1–25.
Article Google Scholar
Hubert, L., & Arabie, P. (1985). Comparing partitions. Journal of Classification, 2(1), 193–218.
Article MATH Google Scholar
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning: With applications in R. New York: Springer.
Book MATH Google Scholar
Kruschke, J. K. (2010). Doing Bayesian data analysis: A tutorial introduction with R. New York: Academic.
MATH Google Scholar
Kuhn, M., Wing, J., Weston, S., Williams, A., Keefer, C., Engelhardt, A., et al. (2014). caret: Classification and Regression Training. http://CRAN.R-project.org/package=caret, R package version 6.0-22.
Leisch, F. (2004). FlexMix: a general framework for finite mixture models and latent class regression in R. Journal of Statistical Software, 11(8), 1–18. http://www.jstatsoft.org/v11/i08/.
Leisch, F., & Dimitriadou, E. (2010). mlbench: Machine learning benchmark problems. R package version 2.1-1.
Google Scholar
Liaw, A., & Wiener, M. (2002). Classification and regression by randomforest. R News, 2(3), 18–22. http://CRAN.R-project.org/doc/Rnews/.
Linzer, D. A., & Lewis, J. B. (2011). poLCA: an R package for polytomous variable latent class analysis. Journal of Statistical Software, 42(10), 1–29. http://www.jstatsoft.org/v42/i10/.
Maechler, M., Rousseeuw, P., Struyf, A., Hubert, M., & Hornik, K. (2014). cluster: Cluster analysis basics and extensions. R package version 1.15.2.
Google Scholar
Meyer, D., Dimitriadou, E., Hornik, K., Weingessel, A., & Leisch, F. (2014). e1071: Misc Functions of the Department of Statistics (e1071), TU Wien. http://CRAN.R-project.org/package=e1071, R package version 1.6-3.
Raftery, A. E. (1995). Bayesian model selection in social research. Sociological Methodology, 25, 111–164.
Article Google Scholar
Rand, W. M. (1971). Objective criteria for the evaluation of clustering methods. Journal of the American Statistical Association, 66(336), 846–850.
Article Google Scholar
Ross, S. M. (2010). Introduction to probability models (10th ed.). New York: Academic.
MATH Google Scholar
Sokal, R. R., & Rohlf, F. J. (1962). The comparison of dendrograms by objective methods. Taxon, 11(2), 33–40.
Article Google Scholar
Wedel, M., & Kamakura, W. A. (2000). Market segmentation: Conceptual and methodological foundations (2nd ed.). International Series in Quantitative Marketing. Boston: Kluwer Academic.
Book Google Scholar
Wolpert, D. H., & Macready, W. G. (1997). No free lunch theorems for optimization. IEEE Transactions on Evolutionary Computation, 1(1), 67–82.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Google, Inc., Seattle, WA, USA
Chris Chapman
LeBow College of Business, Drexel University, Philadelphia, PA, USA
Elea McDonnell Feit

Authors

Chris Chapman
View author publications
You can also search for this author in PubMed Google Scholar
Elea McDonnell Feit
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Chris Chapman .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chapman, C., Feit, E.M. (2015). Segmentation: Clustering and Classification. In: R for Marketing Research and Analytics. Use R!. Springer, Cham. https://doi.org/10.1007/978-3-319-14436-8_11

Download citation

DOI: https://doi.org/10.1007/978-3-319-14436-8_11
Published: 06 January 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14435-1
Online ISBN: 978-3-319-14436-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics