Abstract
Having an effective segmentation strategy is key to the viability of any organization. This is particularly true for colleges, universities, and other nonprofit organizations—who have seen sharp declines in private contributions, endowment income, and government grants in the past few years, and face fierce competition for donor dollars (Wall Str J p. R1, 2011). In this paper, we present a finite-mixture model framework to segment the alumni population of a university in the Midwestern United States based on the monetary value of annual contributions. A salient feature of the model is that it exploits longitudinal data, i.e., contribution sequences. Another important feature of the model is that it supports the identification of unobserved heterogeneities in the population’s giving behavior. Our empirical study presents substantive insights gained through the processing of the full contribution sequences, and establishes the presence of seven distinct segments of alumni in the population. Results provide a basis to support the design of segment-tailored solicitations, and guide the allocation of resources (e.g., telemarketing dollars) to fundraising activities.






Similar content being viewed by others
Notes
Alumni who contribute at higher levels are not targeted via direct-mailings or phon-a-thons by the development office. Major donors and major donor prospects are generally engaged via face-to-face meetings.
We note that segmentation bases can be unobserved due to missing data.
The parameter estimates and mixture proportions for Normal mixture models with one to eight segments, as well as the scaled LL, the LL divided by the total number of observations, and CAIC associated with each model is presented in Table 10, in "Goodness-of-Fit Measures and Segment Estimates" Appendix.
Alumni assigned to segments 5 through 7 are not shown because those segments require a different (larger) scale.
In this context, the organization asks for a low amount (FitD level), a high amount (DitF level), and the amount it hopes to get (the target level).
References
Andreoni, J. (1989). Giving with impure altruism: Applications to charity and ricardian equivalence. Economics of Education Review, 97(6), 1447–1458.
Baade, R., & Sundberg, J. (1996). What determines alumni generosity. Economics of Education Review, 15(1), 75–81.
Becker, G. S. (1974). The theory of social interactions. Journal of Political Economy, 82(6), 1063–1093.
Bekkers, R., & Wiepking, P. (2011a). A literature review of empirical studies of philanthropy: Eight mechanisms that drive charitable giving. Nonprofit and Voluntary Sector Quarterly, 40(5), 924–973.
Bekkers, R., & Wiepking, P. (2011b). Who gives? A literature review of predictors of charitable giving. Part I: Religion, education, age, and socialization. Voluntary Sector. Review, 2(3), 337–365.
Beranek, W., Kamerschen, D. R., & Timberlake, R. H. (2010). Charitable donations and the estate tax: A tale of two hypotheses. American Journal of Economics and Sociology, 69(3), 1054–1078.
Bozdogan, H. (1987). Model selection and Akaike’s information criterion (AIC): The general theory and its analytical extensions. Psychometrika, 52(3), 345–370.
Bristol, R, Jr. (1990). The life cycle of alumni donations. Review of Higher Education, 13(4), 503–518.
Bronfman, C., Solomon, J., & Edwards, M. (2011). Should philantropies operate like businesses? The Wall Street Journal, page R1.
Bruggink, T. H., & Siddiqui, K. (1995). An econometric model of alumni giving: A case study for a liberal arts college. The American Economist, 39(2), 53–60.
Clotfelter, C. T. (2001). Who are the alumni donors? giving by two generations of alumni from selective colleges. Nonprofit Management and Leadership, 12(2), 119–138.
Clotfelter, C. T. (2003). Alumni giving to elite private colleges and universities. Economics of Education Review, 22(2), 109–120.
Collins, L. M., & Lanza, S. T. (2010). Latent class and latent transition analysis: With applications in the social, behavioral, and health sciences. Hoboken: Wiley.
Council for Aid to Education (2012). Voluntary support of education report.
Dempster, A., Lair, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the em algorithm (with discussion). Journal of the Royal Statistical Society B, 39(1), 1–38.
Durango-Cohen, E., Torres, R., & Durango-Cohen, P. (2013a). Donor segmentation: When summary statistics don’t tell the whole story. Journal of Interactive Marketing, 27(3), 172–184.
Durango-Cohen, P., Durango-Cohen, E., & Torres, R. (2013b). A Bernoulli–Gaussian mixture model of donation likelihood and monetary value: An application to alumni segmentation in a university setting. Computers & Industrial Engineering, 66(4), 1085–1095.
Ehrenberg, R., & Smith, C. (2003). The sources and uses of annual giving at selective private research universities and liberal arts colleges. Economics of Education Review, 22(3), 223–235.
Feldstein, M. (1975). The income tax and charitable contributions: Part II the impacton religious, educational, and other organizations. National Tax Journal, 28, 209–226.
Gaier, S. (2005). Alumni satisfaction with their undergraduate academic experience and the impact of alumni giving and participation. International Journal of Educational Advancement, 5(4), 279–288.
Goodman, L. A. (1974a). The analysis of systems of qualitative variables when some of the variables are unobservable. Part I - a modified latent structure approach. American Journal of Sociology, 79, 1179–1259.
Goodman, L. A. (1974b). Exploratory latent structure analysis using both identifiable and unidentifiable models. Biometrika, 61, 215–231.
Gottfried, M. A. (2010). School urbanicity and financial generosity: Can neighborhood context predict donative behavior in spite of the economy? International Journal of Education Advancement, 9(4), 220–233.
Grant, J. H., & Lindauer, D. L. (1986). The economics of charity life-cycle patterns of alumnae contributions. Eastern Economic Journal, 12(2), 129–141.
Green, P. (1977). A new approach to market segmentation. Business Horizons, 20, 61–73.
Ha, S., Bae, S., & Park, S. (2002). Customers time-variant purchase behavior and corresponding marketing strategies: An online retailers case. Computers and Industrial Engineering, 43, 801–820.
Holmes, J. (2009). Prestige, charitable deductions and other determinants of alumni giving: Evidence from a highly selective liberal arts college. Economics of Education Review, 28, 18–28.
Kim, M., Gibson, H., & Ko, J. Y. (2011). Understanding donors to university performing arts programs: Who are they and why do they contribute? Managing Leisure, 16, 17–35.
Kim, S., Fong, D. K., & Desarbo, W. S. (2012). Model-based segmentation featuring simultaneous segment-level variable selection. Journal of Marketing Research, 49(5), 725–736.
Le Blanc, L., & Rucks, C. (2009). Data mining of university philanthropic giving: Cluster-discriminant analysis and pareto effects. International Journal of Education Advancement, 9(2), 64–82.
Lindahl, W., & Winship, C. (1992). Predictive models for annual fundraising and major gift fundraising. Nonprofit Management and Leadership, 3(1), 43–63.
Marr, K. A., Mullin, C. H., & Siegfried, J. J. (2005). Undergraduate financial aid and subsequent alumni giving behavior. The Quarterly Review of Economics and Finance, 45, 123–143.
McLachlan, G., & Thriyambakam, K. (2001). The EM algorithm and extensions. New York: Wiley.
McLachlan, G. J., & Peel, D. (2000). Finite mixture models. New York: Wiley.
Meer, J. (2011). Brother, can you spare a dime? peer pressure in charitable solicitation. Journal of Public Economics, 95, 926–941.
Meer, J., & Rosen, H. S. (2009a). Altruism and the child cycle of alumni donations. American Economic Journal, 1(1), 258–286.
Meer, J., & Rosen, H. S. (2009b). The impact of athletic performance on alumni giving: An analysis of microdata. Economics of Education Review, 28, 287–294.
Monks, J. (2003). Patterns of giving to one’s alma mater among young graduates from selective institutions. Economics of Education Review, 22, 121–130.
Okunade, A. (1996). Graduate school alumni donations to academic funds: Micro-data evidence. American Journal of Economics and Sociology, 55(2), 213–229.
Okunade, A. A., Wunnava, P. V., & Walsh, R. J. (1994). Charitable giving of alumni: Micro-data evidence from a large public university. The American Journal of Economics and Sociology, 53(1), 73–84.
Olsen, K., Smith, A. L., & Wunnava, P. V. (1989). An empirical study of the life-cycle hypothesis with respect to alumni donations. The American Economist, 33(2), 60–63.
Pearson, J. (1999). Comprehensive research on alumni relationships: Four years of market research at stanford university. New Directions for Institutional Research, 101, 5–21.
Peterson, R. (1991). Small business usage of target marketing. Journal of Small Business Management, 29(4), 79–85.
Rhoads, T., & Gerking, S. (2000). Educational contributions, academic quality, and athletic success. Contemporary Economic Policy, 18(2), 248–259.
Rodafinos, A. S., Vucevic, A., & Ideridis, G. D. (2005). The effectiveness of compliance techniques: Foot in the door versus door in the face. Journal of Social Psychology, 145, 237–239.
Sun, X., Hoffman, S., & Grady, M. (2007). A multivariate causal model of alumni giving: Implications for alumni fundraisers. International Journal of Educational Advancement, 7(4), 307–332.
Taylor, A., & Martin, J. (1995). Characteristics of alumni donors and nondonors at a research I, public university. Research in Higher Education, 36(3), 283–302.
Tucker, I. B. (2004). A reexamination of the effect of big-time football and basketball success on graduation rates and alumni giving rates. Economics of Education Review, 23, 665–661.
Verhoef, P., Spring, P., Hoekstra, J., & Leeflang, P. (2002). The commercial use of segmentation and predictive modeling techniques for database marketing in the Netherlands. Decision Support Systems, 34(4), 471–481.
Wedel, M., & Kamakura, W. (2000). Market segmentation: Conceptual and methodological foundations. Norwell: Kluwer Academic Publishers.
Wedel, M., Kamakura, W., Arora, N., Bemmaor, A., Chiang, J., Elrod, T., et al. (1999). Discrete and continuous representations of unobserved heterogeneity in choice modeling. Marketing Letters, 10(3), 219–232.
Weerts, D., & Ronca, J. (2008). Characteristics of alumni donors who volunteer at their alma mater. Research in Higher Education, 49, 274–292.
Weerts, D. J., & Ronca, J. M. (2009). Using classification trees to predict alumni giving for higher education. Education Economics, 17(1), 95–122.
Wunnava, P., & Lauze, M. (2001). Alumni giving at a small liberal arts college: Evidence from consistent and occasional donors. Economics of Education Review, 20, 533–543.
Yoo, Y., & Harrison, W. (1989). Altruism in the market for giving and receiving: A case of higher education. Education of Economics Review, 8, 367–379.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
1. EM Algorithm Implementation for Normal Mixture Models
In this section, we discuss the parameter estimation problem for Normal mixture models. Given the alumni contribution sequences, and having specified the finite mixture model via (1), we define the data likelihood for \(\varvec{\lambda }\) and \(\varvec{\theta }\) as
The objective then is to find parameters \(\varvec{\lambda }, \varvec{\theta }\) that maximize (3). To solve this problem, as is commonly done in the estimation of finite mixture models, we rely on the EM Algorithm. The EM Algorithm, formalized by Dempster et al. (1977), is a numerical method to solve MLE problems in cases were data are missing. As applied in the estimation of finite mixture models, the EM algorithm relies on the fact that if individual memberships, \(\varvec{z}_m, \forall m\), were known, the ensuing estimation problem would be simplified. To this end, we can write the complete data likelihood for \(\varvec{\lambda }\) and \(\varvec{\theta }\) as
where \(\varvec{z}\equiv \left\{ \varvec{z}_m \right\} _{m=1}^M\). The associated complete data LL is
The EM Algorithm is a numerical method to maximize the complete data LL function (4). The algorithm consists of two steps: An expectation step, E-Step, where we evaluate the expectation of (4) over the possible realizations of \(\varvec{z}\), given the observed data, \(\varvec{y}\), and estimates of \(\varvec{\lambda }\) and \(\varvec{\theta }\), denoted \(\hat{\varvec{\lambda }}\) and \(\hat{\varvec{\theta }}\); and a maximization step, M-Step, where we update \(\hat{\varvec{\lambda }}\) and \(\hat{\varvec{\theta }}\) with arguments that maximize the expectation of \(\ln L_c(\cdot )\). The EM algorithm alternates between the E and M steps (until a convergence criterion is met). Mathematically, the EM algorithm can be written as follows:
-
Step 0: Initialization Choose randomly generated values for the parameters that define the segments, \(\hat{\varvec{\theta }}^0\), and the mixture proportions \(\hat{\varvec{\lambda }}^0\). The values for the mixture proportions are randomly generated from a Dirichlet prior distribution. Set the solution index \(k\) \(\leftarrow 0\).
-
Step 1: Expectation Step (E-Step) Calculate the expected value of the complete-data LL function, shown in (4), with respect to the conditional distribution of the (unobserved) data on segment membership, \(\varvec{z}\), given the observed contribution sequences, \(\varvec{y}\), and the estimated parameters, \(\hat{\varvec{\theta }}^k\), and \(\hat{\varvec{\lambda }}^k\) at iteration \(k\). We calculate the expected value of the LL function, \(\mathcal {Q}\), as follows:
$$\begin{aligned} \mathcal {Q} \left( \hat{\varvec{\lambda }}^k, \hat{\varvec{\theta }}^k \right) = \sum _{m=1}^M \sum _{s=1}^S \hat{p}_{ms} \left[ \ln \left( f_s\left( \varvec{y}_m|\hat{p}_{ms}, \hat{\varvec{\theta }}_s^k\right) \right) + \ln \left( \hat{\lambda }_s^k\right) \right] \end{aligned}$$(5)where, \(\hat{p}_{ms}\) is the probability of member \(m\) belonging to segment \(s\), or the expected value of \(z_{ms}\), \(\mathbb {E}[z_{ms}|\varvec{y}_m]\).
-
Step 2: Maximization Step (M-Step) Calculate the mixture proportions, \(\hat{\varvec{\lambda }}^{k+1}\), and the parameters defining the segments, \(\hat{\varvec{\theta }}^{k+1}\), that maximize the expected value of the complete-data LL calculated in Step 1, assuming that the missing data, \(\varvec{z}_m\), are known—this is done replacing each \(z_{ms}\) by its expected value, \(\hat{p}_{ms}\), at iteration \(k\). We calculate the mixture proportions, \(\hat{\varvec{\lambda }}^{k+1}\), for \(s = 1, \ldots , S\), as follows:
$$\begin{aligned} \hat{\lambda }^{k+1}_s = \frac{1}{M}\sum _{m=1}^M \hat{p}_{ms} \end{aligned}$$(6)To calculate the optimizing parameters that define the normal distributions describing the distribution of annual contributions in each of the segments, \(\hat{\varvec{\theta }}_s^{k+1} \equiv \left( \hat{\mu }_s^{k+1}, \hat{\sigma }_s^{k+1}\right)\), we do, for \(s = 1, \ldots , S\), as follows:
$$\begin{aligned} \hat{\mu }_s^{k+1}&= \frac{1}{M \hat{\lambda }_s^{k+1}} \sum _{m=1}^M \hat{p}_{ms} \frac{1}{T} \sum _{t=1}^T y_m^t \end{aligned}$$(7)$$\begin{aligned} \hat{\sigma }_s^{k+1}&= \sqrt{\frac{1}{M \hat{\lambda }_s^{k+1}} \sum _{m=1}^M \hat{p}_{ms} \frac{1}{T} \sum _{t=1}^T \left( y_m^t - \hat{\mu }_s^{k+1}\right) ^2} \end{aligned}$$(8)Set the solution index \(k \leftarrow k+1\).
-
Step 3: Check Stop Criteria While \(k<K\) and
$$\begin{aligned} \left| \left( \mathcal {Q}(\hat{\varvec{\lambda }}^{k+1}, \hat{\varvec{\theta }}^{k+1}) -\mathcal {Q}(\hat{\varvec{\lambda }}^k, \hat{\varvec{\theta }}^k)\right) /\mathcal {Q}(\hat{\varvec{\lambda }}^{k}, \hat{\varvec{\theta }}^{k})\right| > \epsilon \end{aligned}$$continue to iterate steps 1 and 2. The stopping criteria were set to \(\epsilon = 1 \times 10^{-5}\), or a maximum of \(K = 100\) iterations.
Relevant properties of the EM Algorithm, such as those related to its convergence, are discussed in references such as McLachlan and Thriyambakam (2001).
2. Derivation of Posterior Membership Probabilities
In this section, we derive expression presented in (2), which is used to update the membership probabilities in response to a contribution sequence. In particular, given a probability density function of \(\varvec{\lambda }\), \(f(\cdot )\), and a contribution sequence, \(\varvec{y}_m\), the probability that individual \(m\) is assigned to segment \(s\), \(p_{ms}\), is updated as follows:
where Bayes’ Law yields:
Substituting (10) into Eq. (9), we have:
3. Goodness-of-Fit Measures and Segment Estimates
The CAIC is given as \(-2\cdot LL + (p\cdot S + S-1 1)\left( \ln (n)+1\right)\), where \(p\), \(S\) and \(n\) represent the number of parameters per segment (2 per segment in our analysis, \(\mu _s, \sigma _s\), for each segment \(s\)), the number of segments in the model and the number of observations, respectively. The CAIC renders AIC consistent such that Akaikes principle of minimizing the Kullback–Leibler information quantity is not violated (Bozdogan 1987).
The parameter estimates and mixture proportions for Normal mixture models with one to eight segments, as well as the scaled LL, the LL divided by the total number of observations, and CAIC (as shown in Fig. 7) associated with each model is presented in Table 10. We observe that the one-segment model estimates for the mean and standard deviation value of annual contributions equal the overall population mean and standard deviation values, as expected. In other words, these (maximum likelihood) estimates are analogous to modelling a homogeneous alumni population.
4. The Effect of Added Contribution Data on Convergence Probabilities
See Table 11
Rights and permissions
About this article
Cite this article
Durango-Cohen, E.J., Balasubramanian, S.K. Effective Segmentation of University Alumni: Mining Contribution Data with Finite-Mixture Models. Res High Educ 56, 78–104 (2015). https://doi.org/10.1007/s11162-014-9339-6
Received:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11162-014-9339-6


