Skip to main content
Log in

Black swan models for the entertainment industry with an application to the movie business

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

Success in popular entertainment is highly unpredictable. Yet it is the high-impact low-probability events—known as ‘black swans’—that drive entertainment industry profitability because success is highly concentrated on a small number of winners. In this research, we apply recently developed statistical tools to model motion-picture success; these tools explicitly account for rare extreme events while permitting valid statistical inferences to be made on how product attributes are associated with product success. The specific empirical application relates the attributes of a film and its theatrical release to the distribution of worldwide cumulative box-office revenue. A regression model with skew-stable random disturbances is applied to a large sample of motion pictures to quantify the correlates of film success while explicitly accounting for skewness and heavy tails. The skew-stable estimates are compared to estimates obtained from symmetric-stable regression, ordinary least-squares regression, and several alternative robust-to-outliers regression models. Failure to control explicitly for heavy tails and skewness generates misleading statistical inferences, particularly regarding the impact of production budget, opening week screens, and star power on a film’s success at the box-office.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Notes

  1. Several authors, including Walls (1997), Hand (2001), McKenzie (2008) and others fit models with Paretian tails to theatrical film returns data; Maddison (2004) performs a similar exercise for Broadway shows. Hadida (2009) discusses a number of conceptual and methodological issues relating to motion-picture performance.

  2. Even the encyclopedic Hennig-Thurau and Houston (2019) volume on entertainment science—applying data analytics and practical theory to the movie, gaming, book, and music industries—contains not a single mention of applied statistical models that account for skewness or heavy tails.

  3. It should be emphasized here that the stable distribution generalizes the normal distribution model; it does not rule it out. Because the normal model is nested within the general stable model, we can test the normal model against the stable model with heavy tails and asymmetry.

  4. In the Walls (2005a, b) paper, the logarithm of revenues is analyzed, which is not particularly skewed. But in general, it is strongly preferred to have a model that flexibly accommodates asymmetry. Walls (2005a, b) applies the skew-student-t regression model to film returns to accommodate skewness and heavy tails; although the model is ad hoc, it does accommodate skewness and heavy tails.

  5. See, for example, the discussion in Uchaikin and Zolotarev (1999).

  6. Ijiri and Simon (1977) investigated dynamic processes that possessed the momentum characteristic, where firm-level growth depends on firm size. They showed that these processes converged on distributions with Paretian tails and that this result is not sensitive to changes in initial conditions.

  7. Competing candidate distributions, such as the student-t, can be used to model heavy tails, and the skewed-student-t (Azzalini and Capitanio 2003) can accommodate skewness and heavy tails, but they are ad hoc. Other alternatives include nonparametric estimators such as Racine and Li (2004) and semi-parametric estimators such as those discussed in Hallin et al. (2008). Another flexible approach accounting for skewness and extreme outliers is the Tukey g-and-h family developed and fully operationalized by and Xu and Genton (2015, 2017) and applied to accessible linear modeling by Ricci et al. (2016).

  8. See, for example, the volume edited by Adler et al. (1998) for many papers that extend or apply the stable distribution model and Rachev and Mittnik (2000) and McCulloch (1996) for specific applications in finance. Nolan (2008) has compiled an exhaustive bibliography relating to nearly all aspects of stable modeling.

  9. See Nolan (1998a) for a useful discussion of the various parameterizations of the stable distribution.

  10. This property makes the stable model particularly appealing because it provides the connection between the behavioral model generating the data and the statistical model applied to data.

  11. The material in this section follows closely the exposition and notation of McCulloch (1998). See Judge et al. (1985, pp. 825–826) for an overview of econometric models with infinite variance.

  12. The parameter estimates do not have heavy tails. While the data may follow a stable distribution characterized by skewness and heavy tails, the parameter estimates are asymptotically normal as they would be for any maximum likelihood estimation.

  13. Due to the nature of the data set, the empirical results from this analysis will clearly be more appropriate for the mainstream commercial motion-picture industry seeking profits than for lower-budget limited-release ‘art’ films focusing solely on artistic integrity.

  14. The Ulmer Scale provides a metric for film industry talent classification: The Ulmer metric indexes on a 100-point scale that is based on the survey of dozens of film and television industry deal-makers. Complete details of the Ulmer Scale are available at http://www.ulmerscale.com. Multiple editions of the Hot List are available, and each edition covers more actors than previous editions—the most recent edition used in this research featured 1400 actors. Rankings change throughout time in movement with industry perceptions about the particular actor’s ‘bankability.’ As a guide, the cumulative distribution of rankings in the sixth edition were (A+) 0.5%, (A) 2.6%, (B+) 8.4% and (B) 17.1%. Given the relatively long time dimension of our study—and the fact that stars are ephemeral—we use various volumes of Ulmer’s Hot List to classify a star. In particular, we used volume 1 for the years 1997–1999, volume 3 for the years 2000–2002, volume 5 for the years 2003–2005, and volume 6 for 2006–2007.

  15. For a more complete listing and discussion of empirical studies on the economics of the film industry, see McKenzie (2012) and the references therein. See Walls (2013) for a survey of empirical methods used to quantify bestsellers and blockbusters in economics.

  16. The stable regression models were estimated using John Nolan’s R package ‘stable,’ version 5.1.10, serial number #231; see Nolan and Ojeda-Revah (2013) for details of the estimation procedure. Alternative approaches to estimation of stable regression models are provided by Lambert and Lindsey (1999) and Achcar and Lopes (2016).

  17. In the context of the stable model, the tail weight parameter falls into the region where the mean is defined and the variance is not defined.

  18. See, for example, the expositions in Rousseeuw and Leroy (1987), Huber (1981), Goodall (1983), and Berk (1990).

  19. The Tukey-M-regression estimates are presented for purposes of comparison only. The second-generation robust regression models presented as follows have superior Gaussian efficiency.

References

  • Achcar JA, Lopes SR (2016) Linear and non-linear regression models assuming a stable distribution. Rev Colomb Estad 39:109–128

    Google Scholar 

  • Adler RJ, Feldman RE, Taqqu MS (eds) (1998) A practical guide to heavy tails: statistical techniques and applications. Birkhäuser, Berlin

    Google Scholar 

  • Azzalini A, Capitanio A (2003) Distributions generated by perturbation of symmetry with emphasis on a multivariate skew-\(t\) distribution. J R Stat Soc B65:367–389

    Google Scholar 

  • Beaton AE, Tukey JW (1974) The fitting of power series, meaning polynomials, illustrated on band-spectroscopic data. Technometrics 16:147–185

    Google Scholar 

  • Berk RA (1990) A primer on robust regression. In: Fox J, Long JS (eds) Modern methods of data analysis. Sage, Newbury Park, pp 292–324

    Google Scholar 

  • Blattberg R, Sargent T (1971) Regression with non-Gaussian stable disturbances: some sampling results. Econometrica 39(3):501–510

    Google Scholar 

  • Box G, Cox D (1964) An analysis of transformations. J R Stat Soc B 26:211–243

    Google Scholar 

  • Brosen BW, Prekel PV (1993) Linear regression with stably distributed residuals. Commun Stat Theory Methods 22:659–667

    Google Scholar 

  • Caves RE (2000) Creative industries: contracts between art and commerce. Harvard University Press, Cambridge

    Google Scholar 

  • De Vany AS, Walls WD (1996) Bose-Einstein dynamics and adaptive contracting in the motion picture industry. Econ J 439(106):1493–1514

    Google Scholar 

  • De Vany AS, Walls WD (2002) Does Hollywood make too many R-rated movies? Risk, stochastic dominance, and the illusion of expectation. J Bus 75(3):425–451

    Google Scholar 

  • De Vany AS, Walls WD (2004) Motion picture profit, the stable Paretian hypothesis, and the curse of the superstar. J Econ Dyn Control 28(6):1035–1057

    Google Scholar 

  • De Vany AS, Walls WD (2005) Big budgets, movie stars, and wide releases: empirical analysis of the blockbuster strategy. In: De Vany AS (ed) Hollywood economics: how extreme uncertainty shapes the film industry. Routledge, London

    Google Scholar 

  • Dehon C, Gassner M, Verardi V (2009) Beware of good outliers and overoptimistic conclusions. Oxf Bull Econ Stat 71(3):437–452

    Google Scholar 

  • Fama EF (1963) Mandelbrot and the stable paretian hypothesis. J Bus 36:420–429

    Google Scholar 

  • Fama E (1965) The behavior of stock market prices. J Bus 38(1):34–105

    Google Scholar 

  • Goodall C (1983) M-estimators of location: an outline of the theory. In: Hoaglin DC, Mosteller F, Tukey JW (eds) Understanding robust and exploratory data analysis. Wiley, New York, pp 339–431

    Google Scholar 

  • Hadida AL (2009) Motion picture performance: a review and research agenda. Int J Manag Rev 11(3):297–335

    Google Scholar 

  • Hallin M, Vermandele C, Werker BJM (2008) Semiparametrically efficient inference based on signs and ranks for median-restricted models. J R Stat Soc Ser B 70(2):389–412

    Google Scholar 

  • Hand C (2001) Increasing returns to information: further evidence from the UK film market. Appl Econ Lett 8(6):419–421

    Google Scholar 

  • Hennig-Thurau T, Houston MB (2019) Entertainment science: data analytics and practical theory for movies, games, books, and music. Springer, Basel

    Google Scholar 

  • Hill BM (1974) The rank-frequency form of zipf’s law. J Am Stat Assoc 69:1017–1026

    Google Scholar 

  • Huber PJ (1964) Robust estimation of a location parameter. Ann Math Stat 35:73–101

    Google Scholar 

  • Huber PJ (1981) Robust statistics. Wiley, New York

    Google Scholar 

  • Ijiri Y, Simon HA (1977) Skew distributions and the sizes of business firms. North-Holland, New York

    Google Scholar 

  • Judge G, Griffiths W, Hill R, Lutkepohl H, Lee T (1985) The theory and practice of econometrics, 2nd edn. Wiley, New York

    Google Scholar 

  • Lambert P, Lindsey JK (1999) Analysing financial returns using regression models based on non-symmetric stable distributions. Appl Stat 48:409–424

    Google Scholar 

  • Litman BR, Ahn H (1998) Predicting financial success of motion pictures: the early ’90s experience. The motion picture mega-industry, chapter 10. Allyn and Bacon, Boston, pp 172–197

  • Maddison D (2004) Increasing returns to information and the survival of broadway theatre productions. Appl Econ Lett 11(10):639–643

    Google Scholar 

  • Mandelbrot B (1963a) New methods in statistical economics. J Polit Econ 71:421–440

    Google Scholar 

  • Mandelbrot B (1963b) The variation of certain speculative prices. J Bus 36:394–419

    Google Scholar 

  • Mandelbrot B (1967) The variation of some other speculative prices. J Bus 40:393–413

    Google Scholar 

  • Maronna RA, Yohai VJ (2000) Robust regression with both continuous and categorical predictors. J Stat Plan Inference 89(1–2):197–214

    Google Scholar 

  • McCulloch JH (1996) Financial applications of stable distributions. In: Maddala GS, Rao CR (eds) Statistical methods in finance, vol 14. North-Holland, New York, pp 393–425 Handbook of Statistics

  • McCulloch JH (1998) Linear regression with stable disturbances. In: Adler RJ, Feldman RE, Taqqu MS (eds) A practical guide to heavy tails: statistical techniques and applications. Birkhäuser, Berlin, pp 359–378

    Google Scholar 

  • McKenzie J (2008) Bayesian information transmission and stable distributions: motion picture revenues at the Australian box office. Econ Rec 84(266):338–53

    Google Scholar 

  • McKenzie J (2012) The economics of movies: a literature survey. J Econ Surv 26(1):42–70

    Google Scholar 

  • Nolan JP (1998a) Parameterizations and modes of stable distributions. Stat Probab Lett 38:187–195

    Google Scholar 

  • Nolan JP (1998b) Univariate stable distributions: parameterizations and software. In: Adler RJ, Feldman RE, Taqqu MS (eds) A practical guide to heavy tails: statistical techniques and applications. Birkhäuser, Berlin, pp 527–533

    Google Scholar 

  • Nolan NP (1999) Fitting data and assessing goodness-of-fit with stable distributions. In: Nolan NP, Swami A (eds) Proceedings of the ASA-IMS conference on heavy tailed distributions. American University, Washington, DC

  • Nolan JP (2001) Maximum likelihood estimation of stable parameters. In: Barndor-Nielsen OE, Mikosch T, Resnick SI (eds) Lévy processes: theory and applications. Birkhäuser, Boston, pp 379–400

    Google Scholar 

  • Nolan JP (2005) STABLE computer program. Available for download at http://academic2.american.edu/~jpnolan/stable/stable.html

  • Nolan JP (2008) Bibliography on stable distributions, processes and related topics. Mathematics Department, American University, Washington, D.C, Mimeograph

    Google Scholar 

  • Nolan JP (2011) Stable distributions–models for heavy tailed data. Birkhäuser, Boston

    Google Scholar 

  • Nolan JP, Ojeda-Revah D (2013) Linear and nonlinear regression with stable errors. J Econom 172(2):186–194

    Google Scholar 

  • Prag J, Casavant J (1994) An empirical study of determinants of revenues and marketing expenditures in the motion picture industry. J Cult Econ 18(3):217–35

    Google Scholar 

  • Rachev ST, Mittnik S (2000) Stable Paretian models in finance. Wiley, New York

    Google Scholar 

  • Racine J, Li Q (2004) Nonparametric estimation of regression functions with both categorical and continuous data. J Econom 119:99–130

    Google Scholar 

  • Ravid SA (1999) Information, blockbusters and stars: a study of the film industry. J Bus 72:463–486

    Google Scholar 

  • Ravid SA, Basuroy S (2004) Managerial objectives, the R-rating puzzle, and the production of violent films. J Bus 77(Supp 2):155–192

    Google Scholar 

  • Ricci L, Verardi V, Vermandele C (2016) A highly efficient regression estimator for skewed and/or heavy-tailed distributed errors. Working papers 19, European stability mechanism

  • Rousseeuw PJ, Leroy AM (1987) Robust regression and outlier detection. Wiley, New York

    Google Scholar 

  • Rousseeuw PJ, Yohai V (1984) Robust regression by means of s-estimators. In: Franke J, Härdle W, Martin D (eds) Robust and nonlinear time series analysis, number 26 in lecture notes in statistics. Springer, Berlin, pp 256–272

    Google Scholar 

  • Rousseeuw PJ, van Zomeren BC (1990) Unmasking multivariate outliers and leverage points. J Am Stat Assoc 85(411):633–639

    Google Scholar 

  • Samorodnitsky G, Taqqu MS (1994) Stable non-Gaussian random processes. Chapman and Hall, New York

    Google Scholar 

  • Taleb NN (2007) The black swan: the impact of the highly improbable. Random House, New York

    Google Scholar 

  • Uchaikin VV, Zolotarev VM (1999) Chance and stability : stable distributions and their applications. VSP, Utrecht

    Google Scholar 

  • Walls WD (1997) Increasing returns to information: evidence from the Hong Kong movie market. Appl Econ Lett 4(5):187–190

    Google Scholar 

  • Walls WD (2005a) Modeling movie success when ‘nobody knows anything’: conditional stable-distribution analysis of film returns. J Cult Econ 29(3):177–190

    Google Scholar 

  • Walls WD (2005b) Modeling skewness and heavy tails in film returns. Appl Financ Econ 15(17):1181–1188

    Google Scholar 

  • Walls WD (2009a) Robust analysis of movie earnings. J Med Econ 22(1):20–35

    Google Scholar 

  • Walls WD (2009b) Screen wars, star wars, and sequels: nonparametric reanalysis of movie profitability. Empir Econ 37(2):447–461

    Google Scholar 

  • Walls WD (2013) Bestsellers and blockbusters: movies, music, and books. In: Ginsburgh VA, Throsby D (eds) Handbook of the economics of art and culture, vol 2. Elsevier, Masterdam, pp 185–214

    Google Scholar 

  • Walls WD, McKenzie J (2012) The changing role of Hollywood in the global movie market. J Med Econ 25(4):198–219

    Google Scholar 

  • Xu G, Genton MG (2015) Efficient maximum approximate likelihood inference for Tukey’s g-and-h distribution. Comput Stat Data Anal 91:78–91

    Google Scholar 

  • Xu G, Genton MG (2017) Tukey g-ang-h random fields. J Am Stat Assoc 112(519):1236–1249

    Google Scholar 

  • Yohai V (1987) High breakdown-point and high efficiency estimates for regression. Ann Stat 15:642–665

    Google Scholar 

  • Zolotarev VM (1986) One-dimensional stable distributions, volume 65 of American Mathematical Society Translations of Mathematics Monographs. American Mathematical Society, Providence (Translation of the original 1983 Russian edition)

Download references

Funding

This study was not funded by any granting agency.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to W. D. Walls.

Ethics declarations

Conflict of interest

Both authors declare that they have no conflict of interest.

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

An earlier version of this paper was presented at the 20th Anniversary Carol and Bruce Mallen Scholars and Practitioners Conference in Filmed Entertainment Economics held at Film Universität Babelsberg, Germany, September 27–28, 2018, and at the Asian Meeting of the Econometric Society held at Xiamen University, Amoy, China, June 14-16, 2019. We are grateful to conference participants, and to two diligent referees, for comments that have helped to improve the content and clarity of the paper.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Walls, W.D., McKenzie, J. Black swan models for the entertainment industry with an application to the movie business. Empir Econ 59, 3019–3032 (2020). https://doi.org/10.1007/s00181-019-01753-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-019-01753-x

Keywords

Navigation