Data Mining and Knowledge Discovery

, Volume 31, Issue 6, pp 1622–1642 | Cite as

Adjusting for scorekeeper bias in NBA box scores

  • Matthew van Bommel
  • Luke Bornn
Part of the following topical collections:
  1. Sports Analytics


Box score statistics in the National Basketball Association are used to measure and evaluate player performance. Some of these statistics are subjective in nature and since box score statistics are recorded by scorekeepers hired by the home team for each game, there exists potential for inconsistency and bias. These inconsistencies can have far reaching consequences, particularly with the rise in popularity of daily fantasy sports. Using box score data, we estimate models able to quantify both the bias and the generosity of each scorekeeper for two of the most subjective statistics: assists and blocks. We then use optical player tracking data for the 2015–2016 season to improve the assist model by including other contextual spatio-temporal variables such as time of possession, player locations, and distance traveled. From this model, we present results measuring the impact of the scorekeeper and of the other contextual variables on the probability of a pass being recorded as an assist. Results for adjusting season assist totals to remove scorekeeper influence are also presented.


Basketball Optical tracking Scorekeeper bias Fantasy sports Adjusted box score 



This work was partially supported by U.S. National Science Foundation grant 1461435, by DARPA under Grant No. FA8750-14-2-0117, by ARO under Grant No. W911NF-15-1-0172, by Amazon, by NSERC, and by the National Association of Basketball Coaches.


  1. Acharya RA, Ahmed AJ, D’Amour AN, Lu H, Morris CN, Oglevee BD, Peterson AW, Swift RN (2008) Improving major league baseball park factor estimates. J Quant Anal Sports 4(2):Article 4. doi: 10.2202/1559-0410.1108
  2. Baghal T (2012) Are the “four factors” indicators of one factor? an application of structural equation modeling methodology to NBA data in prediction of winning percentage. J Quant Anal Sports 8(1):1559-0410. doi: 10.1515/1559-0410.1355 Google Scholar
  3. Basketball Reference (2015) Calculating win shares. Accessed 2 Dec 2015
  4. Cervone D, D’Amour A, Bornn L, Goldsberry K (2014) A multiresolution stochastic process model for predicting basketball possession outcomes. arXiv:1408.0777. Accessed 2 Dec 2015
  5. Craggs T (2009) The confessions of an NBA scorekeeper. Accessed 2 Dec 2015
  6. Deshpande SK, Jensen ST (2016) Estimating an NBA players impact on his teams chances of winning. J Quant Anal Sports 12(2):51–72. doi: 10.1515/jqas-2015-0027 Google Scholar
  7. DraftKings (2015) Daily fantasy basketball league rules. Accessed 20 Jan 2016
  8. Contributors (2015) Daily fantasy basketball: building blocks, fades for Nov. 17. Accessed 20 Jan 2016
  9. FanDuel (2015) Rules and scoring. Accessed 20 Jan 2016
  10. Fearnhead P, Taylor BM (2011) On estimating the ability of NBA players. J Quant Anal Sports 7(3):Article 11. doi: 10.2202/1559-0410.1298
  11. Gramacy RB, Jensen ST, Taddy M (2013) Estimating player contribution in hockey with regularized logistic regression. J Quant Anal Sports 9(1):97–111. doi: 10.1515/jqas-2012-0001 Google Scholar
  12. Groll A, Schauberger G, Tutz G (2015) Prediction of major international soccer tournaments based on team-specific regularized poisson regression: an application to the FIFA world cup 2014. J Quant Anal Sports 11(2):97–115. doi: 10.1515/jqas-2014-0051 Google Scholar
  13. Hamrick J, Rasp J (2011) Using local correlation to explain success in baseball. J Quant Anal Sports 7(4):Article 5. doi: 10.2202/1559-0410.1278
  14. Hollinger J (2004) Pro basketball forecast 2004–2005. Brasseys, Washington, DCGoogle Scholar
  15. Macdonald B (2012) Adjusted plus–minus for NHL players using ridge regression with goals, shots, fenwick, and corsi. J Quant Anal Sports 8(3):1–22. doi: 10.1515/1559-0410.1447 Google Scholar
  16. NBA (2013) Basketball U on assists. Accessed 2 Dec 2015
  17. Neal D, Tan J, Hao F, Wu SS (2010) Simply better: using regression models to estimate major league batting averages. J Quant Anal Sports 6(3):Article 12. doi: 10.2202/1559-0410.1229
  18. Oberstone J (2009) Differentiating the top English premier league football clubs from the rest of the pack: identifying the keys to success. J Quant Anal Sports 5(3):Article 10. doi: 10.2202/1559-0410.1183
  19. Okamoto DM (2011) Stratified odds ratios for evaluating NBA players based on their plus/minus statistics. J Quant Anal Sports 7(2):1–10. doi: 10.2202/1559-0410.1320 Google Scholar
  20. O’Keeffe K (2015) Daily fantasy-sports operators await reality check. Washington Post. Accessed 20 Jan 2016
  21. Price J, Wolfers J (2010) Racial discrimination among NBA referees. Q J Econ 125(4):1859–1887CrossRefGoogle Scholar
  22. Schuckers M, Macdonald B (2014) Accounting for rink effects in the national hockey league’s real time scoring system. Accessed 2 Dec 2015
  23. Teramoto M, Cross CL (2010) Relative importance of performance factors in winning NBA games in regular season versus playoffs. J Quant Anal Sports 6(3):Article 2. doi: 10.2202/1559-0410.1260
  24. Tibshirani R (1994) Regression shrinkage and selection via the lasso. J R Stat Soc Ser B 58:267–288MathSciNetMATHGoogle Scholar
  25. Yue Y, Lucey P, Carr P, Bialkowski A, Matthews I (2014) Learning fine-grained spatial models for dynamic sports play prediction. In: Proceeding of IEEE international conference on data mining, pp 670–679Google Scholar

Copyright information

© The Author(s) 2017

Authors and Affiliations

  1. 1.Department of StatisticsSimon Fraser UniversityBurnabyCanada

Personalised recommendations