Skip to main content

Performing Under Difficulty: The Magical Pressure Index

  • Chapter
  • First Online:
Cricket Performance Management

Abstract

This chapter introduces an interesting concept of measuring performance of cricketers considering the match situation under which the player performed. The difficulty level, under which the team is performing, is quantified introducing a measure called as the pressure index in the chapter. The pressure index is an inclusive measure that can be utilized to measure batting performance both individually and between partners, bowling performance, determining turning point of the match, comparing run chases and predicting the outcome of the match. Considering the huge area of applicability of the index, we call it magical. The pressure index also opens up several another area of research for people interested in sports analytics. The concept of pressure index can be extended to other sports as well. We believe that the pressure index has in its store several other hidden dimensions which are left for the readers to explore.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The test is commonly used for assessing the goodness of fit of a model. The test is similar to a χ2 goodness of fit test and has the advantage of partitioning the observations into groups of approximately equal size. The observations are grouped into deciles based on their predicted probabilities. The test statistic is calculated based on the observed and expected counts for both the wins and losses. The statistic has an approximate χ2 distribution with 8 degrees of freedom (=10 deciles—2 variables in the model). For this case the p value of the test = 0.123 indicates that the number of wins are not significantly different from those predicted by the model and that the model is a good fit.

  2. 2.

    The significance can be ascertained from the Wald statistic which follows χ2 statistic with 1 df. The Wald χ2 statistics are used to test the significance of individual coefficients in the model and are calculated by \(\left( {\frac{B}{SE(B)}} \right)^{2}\)

References

  • Allsopp, P. E. (2005). Measuring team performance and modeling the home advantage effect in cricket. Unpublished Ph. D dissertation.

    Google Scholar 

  • Allsopp, P. E., & Clarke, S. R. (2004). Rating teams and analysing outcomes in one-day and test cricket. Journal of the Royal Statistical Society, Series A (Statistics in Society), 167(4), 657–667.

    Google Scholar 

  • Bailey, M., & Clarke, S. R. (2006). Predicting the match outcome in one day international cricket matches, while the game is in progress. Journal of Sports Science and Medicine, 5, 480–487.

    Google Scholar 

  • Bandulasiri, A. (2008). Predicting the winner in one day international cricket. Journal of Mathematical Sciences & Mathematics Education, 3(1), 1–12.

    Google Scholar 

  • Barnett, T. & Clarke, S. R. (2002). Using Microsoft Excel to model tennis match. In G. Cohen & T. Langtry (Eds.), Proceedings of the 6th conference on Mathematics and Sports (pp. 63–68).

    Google Scholar 

  • Barr, G. D. I., Holdsworth, C. G., & Kantor, B. S. (2008). General evaluating performances at the 2007 Cricket World Cup. South African Statistical Journal, 42, 125–142.

    Google Scholar 

  • Beaudoin, D., & Swartz, T. B. (2003). The best batsmen and bowlers in one-day cricket. South African Statistical Journal, 37, 203–222.

    Google Scholar 

  • Bhattacharjee, D., & Lemmer, H. H. (2016). Quantifying the pressure on the teams batting or bowling in the second innings of limited overs cricket matches. International Journal of Sport Science and Coaching, 12(1), 683–692.

    Article  Google Scholar 

  • Bhattacharya, R., Gill, P. S., & Swartz, T. B. (2011). Duckworth-Lewis and Twenty20 Cricket. Journal of the Operational Research Society, 62, 1951–1957.

    Article  Google Scholar 

  • Brooks, R. D., Faff, R. W., & Sokulsky, D. (2002). An ordered response model of test cricket performance. Applied Economics, 34, 2353–2365.

    Article  Google Scholar 

  • Carter, M., & Guthrie, G. (2002). Cricket interruptions, fairness and incentive in interrupted cricket matches. Journal of the Operations Research Society, 55(8), 822–829.

    Article  Google Scholar 

  • Clarke, S. R. (1988). Dynamic programming in one-day cricket-optimal scoring rates. Journal of the Operational Research Society, 39, 331–337.

    Article  Google Scholar 

  • Clowes, S., Cohen, G., & Tomljanovic, L. (2002). Dynamic evaluation of conditional probabilities of winning a tennis match. In Proceedings of the Sixth Australian Conference on Mathematics and Computers in Sport, 1–3 July 2002, at Bond University, Gold Coast, Queensland 4229, Australia (pp. 112–118).

    Google Scholar 

  • Cui, T., Li, J., Woodward, J. R., & Parkes, A. J. (2013). An ensemble based genetic programming system to predict English football premier league games. In Proceedings of the 2013 IEEE Conference On Evolving and Adaptive Intelligent Systems, Eais 2013–2013 IEE Symposium Series On Computational Intelligence (pp. 138–143). https://doi.org/10.1109/eais.2013.6604116.

  • Damodaran, U. (2006). Stochastic dominance and analysis of ODI batting performance: the Indian cricket team, 1989–2005. Journal of Sports Science and Medicine, 5, 503–508.

    Google Scholar 

  • Daud, Ali & Muhammad, Faqir. (2013). Ranking Cricket Teams through Runs and Wickets, p. 8210. https://doi.org/10.1007/978-3-319-02750-0_16.

    Chapter  Google Scholar 

  • de Silva, R. (2011) A method to calculate the target score in rain interrupted limited over cricket matches. In International Conference on Mathematics and Science (ICOMSc) (pp. 1–8), Surabaya, Indonesia.

    Google Scholar 

  • de Silva, B. M., Pond, G. R., & Swartz, T. B. (2001). Estimation of magnitude of victory in one-day cricket. Australian and New Zealand Journal of Statistics, 43(3), 259–268.

    Article  Google Scholar 

  • Dey, P. K., Ghosh, D. N., & Mondal, A. C. (2015). IPL team performance analysis: a multi-criteria group decision approach in fuzzy environment. International Journal of Information Technology and Computer Science, 8, 8–15.

    Article  Google Scholar 

  • Douglas, J. M., & Tam, N. (2010). Analysis of team performances at the ICC world twenty20 cup 2009. International Journal of Performance Analysis in Sport, 10(1), 45–53.

    Article  Google Scholar 

  • Duckworth, F. C., & Lewis, A. J. (1998). A fair method of resetting the target in interrupted one-day cricket matches. Journal of the Operational Research Society, 49(3), 220–227.

    Article  Google Scholar 

  • Elderton, W. P., & Elderton, E. M. (1909). Primer of statistics. London: Black.

    Google Scholar 

  • Farazdaghi, H., & Harris, P. M. (1968). Plant competition and crop yield. Nature, 217, 289–290.

    Article  Google Scholar 

  • Graybill, F. A., & Iyer, H. K. (1994). Regression analysis: Concepts and applications. Belmont, Calif: Duxbury Press.

    Google Scholar 

  • Heazlewood, T. (2006). Prediction versus reality: The use of mathematical models to predict elite performance in swimming and athletics at the Olympic games. Journal of Sports Science and Medicine, 5, 541–547.

    Google Scholar 

  • Huang, K. Y., & Chang, W. L. (2010). A neural network method for prediction of 2006 world cup football game. In The 2010 International Joint Conference on Neural Networks (IJCNN) (pp. 1–8).

    Google Scholar 

  • Hucaljuk, J., & Rakipovic, A. (2011). Predicting football scores using machine learning techniques. In: MIPRO, 2011, Proceedings of the 34th International Convention. IEEE, 2011 (pp. 1623–1627).

    Google Scholar 

  • Iyengar, N. S., & Sudarshan, P. (1982). A method of classifying regions from multivariate data. Economic and Political Weekly. 18, 2048–2052.

    Google Scholar 

  • James, B. (1980). The Bill James abstract. Self-published.

    Google Scholar 

  • Jayadevan, V. (2002). A new method for the computation of target scores in interrupted limited-over cricket matches. Current Science, 83(5), 577–586.

    Google Scholar 

  • Joseph, A., Fenton, N. E., & Neil, M. (2006). Predicting football results using Bayesian nets and other machine learning techniques. Knowledge-Based Systems, 19(7), 544–553.

    Article  Google Scholar 

  • Lata, K., & Gupta, P. (2016). Forecasting English premier league match results. International Journal of Modern Trends in Engineering and Research, 03(02), 261–268.

    Google Scholar 

  • Lemmer, H. H. (2005). A method for the comparison of the bowling performances of bowlers in a match or series of matches. South African Journal for Research in Sport, Physical Education and Recreation, 27(1), 91–103.

    Google Scholar 

  • Lemmer, H. H. (2011). Predicting the names of the best teams after the knock-out phase of a cricket series. European journal of sport science, 14(sup1), S191–S196.

    Article  Google Scholar 

  • Lemmer, H. H. (2015). A method to measure choking in cricket. International Journal of Sports Science and Coaching, 10(2 + 3), 461–477.

    Article  Google Scholar 

  • Noubary, R. (2005). A procedure for prediction of sports records. Journal of Quantitative Analysis in Sports, 1(1), 1–12. Retrieved 27 May. 2017, from https://doi.org/10.2202/1559-0410.1001.

  • O’Riley, B. J., & Ovens, M. (2006). Impress your friends and predict the final score: An analysis of the psychic ability of four target resetting methods used in one-day international cricket. Journal of Sports Science and Medicine, 5(4), 488–494.

    Google Scholar 

  • Cricinfo: Sri Lanka v. Pakistan, T20I Series. (2015). Retrieved from http://www.espncricinfo.com/sri-lanka-v-pakistan-2015/engine/match/860281.html.

  • Pollard, R., Benjamin, B., & Reep, C. (1977). Sport and the negative binomial distribution. In S. P. Ladany & R. E. Machol (Eds.), Optimal strategies in sports (pp. 188–195). New York: North-Holland.

    Google Scholar 

  • Preston, I., & Thomas, J. (2000). Batting strategy in limited overs cricket. The Statistician, 49(1), 95–106.

    Google Scholar 

  • Preston, I., & Thomas, J. (2002). Rain rules for limited overs cricket and probabilities of victory. Journal of the Royal Statistical Society Series D, 51(2), 189–202.

    Article  Google Scholar 

  • Ramakrishnan, V., Sethuraman, K., & Parameshwaran, R. (2010). Target score prediction in the game of cricket. Retrieved from https://people.ucsc.edu/~praman1/static/pub/ML_project_CS7641_report.pdf.

  • Rosenfeld, J. W., Fisher, J. I., Adler, D., & Morris, C. (2010). Predicting overtime with the pythagorean formula. Journal of Quantitative Analysis in Sports, 6(2), Article 1.

    Google Scholar 

  • Scarf, P., & Shi, X. (2005). Modelling match outcomes and decision support for setting a final innings target in test cricket. IMA Journal of Management Mathematics, 16, 161–178.

    Article  Google Scholar 

  • Scarf, P., Shi, X., & Akhtar, S. (2011). On the distribution of runs scored and batting strategy in Test cricket. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(2), 471–497.

    Article  Google Scholar 

  • Shah, P., & Shah, M. (2014). Pressure index in Cricket. IOSR Journal of Sports and Physical Education, 1(5), 9–11.

    Article  Google Scholar 

  • Shinozaki, K., & Kira, T. (1956). Intraspecific competition among higher plants. VII logistic theory of the C-D effect. Journal of the Institute of Polytechnics, Osaka City University, D7, 35–72.

    Google Scholar 

  • Somboonphokkaphan, A., Phimoltares, S., & Lursinsap, C. (2009). Tennis winner prediction based on time-series history with neural modeling, In Proceedings of the International MultiConference of Engineers and Computer Scientists 2009, IMECS (Vol. I, pp. 127–132), March 18–20, 2009, Hong Kong.

    Google Scholar 

  • Tan, A., & Zhang, D. (2001). Distribution of batting scores and opening wicket partnerships in cricket. Mathematical Spectrum, 34(1), 13–17.

    Google Scholar 

  • Valero, J., & Swartz, T. B. (2012). An investigation of synergy between batsmen in opening partnerships. Sri Lankan Journal of Applied Statistics, 13, 87–98.

    Article  Google Scholar 

  • Wickramasinghe, I. P. (2014). Predicting the performance of batsmen in test cricket. Journal of Human Sport and Exercise, 9(4), 744–751.

    Article  Google Scholar 

  • Yang, T. Y., & Swartz, T. (2004). A two-stage Bayesian model for predicting winners in major league baseball. Journal of Data Science, 2(1), 61–73.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hemanta Saikia .

Appendices

Appendix 7.1: Wicket Weights of Different Batting Positions as in Lemmer (2005)

Batting position (i)

Wicket weight wi

1

1.30

2

1.35

3

1.40

4

1.45

5

1.38

6

1.18

7

0.98

8

0.79

9

0.59

10

0.39

11

0.19

Appendix 7.2: Binary Logistic Regression

In some cases, it so happens that subjects under consideration can be classified into one of the two groups based on their performance on a set of explanatory variables (X, say). In such a case, we are rather interested to compute the probability that a subject belongs to one of the two categories given the values of the explanatory variables. Let us call the proportion as p. Estimating the value of p based on a set of explanatory variables (X) or building models for estimating the value of p given values of the explanatory variables X is called logistic regression.

Binary type data, i.e., data of the form Yes and No comes from a random sample that has a binomial distribution with probability of success p. Often the value of the probability of success is unknown and is to be estimated using a model given the data. Since the binary type data cannot follow a normal distribution, an essential condition for using ordinary regression, so one needs a new type of regression model to do this job called the logistic regression.

A logistic regression model ultimately gives you an estimate of p, the probability that a particular outcome will occur in a binary set up. The estimate is based on information from one or more explanatory variables X = (X1, X2, …, Xk).

Logistic Regression can be used whenever an individual is to be classified into one of two populations. However, if the population into which an individual to be classified is more than two then we reach to the multinomial logistic regression.

  • Assumptions of Binary Logistic Regression

  • Since it assumes that P(Y = 1) is the probability of the event occurring, so it is essential that the dependent variable (also referred to as the response variable) is being coded accordingly. The factor level 1 (i.e. Y = 1) of the dependent variable should represent the desired outcome.

  • The independent variables (also referred to as the explanatory variables) should be independent of each other.

  • It requires quite a large number of samples because the coefficients are estimated by MLE method which is less powerful than the OLS method.

  • Derivation of the Regression Equation

Here, we try to understand the binomial case:

Let Y be a binary variable having two outcomes 0 and 1, such that,

$$P\left( {Y = 1} \right) + P\left( {Y = 0} \right) = 1$$

We are in search of a relation of the form

$$P\left( {Y = 1} \right) = a + b^{\prime} {\mathbf{X}}$$

Let us now introduce the example concept of odds in favor. The odds in favor of an event Y = 1 is the ratio between P(Y = 1): P(Y ≠ 1). Thus,

$${\text{Odds}}\left( {Y = 1} \right) = \frac{P(Y = 1)}{P(Y \ne 1)} = \frac{P(Y = 1)}{P(Y = 0)} = \frac{P(Y = 1)}{1 - P(Y = 1)}$$

Unlike P(Y = 1), odds (Y = 1) does not have an upper limit, however like probability odds ratio cannot be negative. However, with log transformation the value of the variable ranges from −∞ to ∞. Thus,

loge [Odds (Y = 1)] = loge\(\left[ {\frac{P(Y = 1)}{1 - P(Y = 1)}} \right]\) is called as logit of Y.

So, logit (Y) = loge\(\left[ {\frac{P(Y = 1)}{1 - P(Y = 1)}} \right].\)

Let us assume that the logit is a function of some independent variables X1, X2, …, Xk etc. which may be discrete or continuous, i.e.,

logit (Y) = \(a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k}\)

⇒ log [odds(Y = 1)] = \(a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k}\)

⇒ odds(Y = 1) = \(e^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }}\)

⇒ \(\frac{P(Y = 1)}{1 - P(Y = 1)} = e^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }}\)

⇒ \(P(Y = 1) = {e}^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }} - P(Y = 1){e}^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }}\)

⇒ \((1 + e^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }} )P(Y = 1) = {e}^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }}\)

⇒ \(P(Y = 1) = \frac{{{e}^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }} }}{{1 + {e}^{{a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} }} }}\)

Or more precisely we can write P(Y = 1) as P(Y = 1|X1, X2, …, Xk) writing, \(a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k}\) = z, we have,

$$P\left( Y \right) = \frac{{{e}^{z} }}{{1 + {e}^{z} }} = \frac{1}{{\frac{{1 + {e}^{z} }}{{{e}^{z} }}}} = \frac{1}{{1 + {e}^{ - z} }} = \frac{1}{{1 + \exp \{ - (a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k} )\} }}$$

Thus, the logit model is a form of the logistic curve and hence the name.

  • Understanding the coefficients of the model

We have,

$${\text{logit}}\left( Y \right) = a + b_{1} X_{1} + b_{2} X_{2} + \cdots + b_{k} X_{k}$$

Now,

$$\begin{aligned} a & = {\text{log odds for individual for all}}\;\;X_{i} = 0 \\ & = {\text{the odds that would have resulted for a logistic model}} \\ & {\text{without any }}X{\text{ at all}}. \\ \end{aligned}$$

Now, in order to understand the b’s let us consider the change in logit (Y) when one of the X’s varies when all others are kept fixed.

Say: Fix X2, X3, …, Xk to 0 and vary X1 from 0 to 1.

Let, logit P1(Y) when X1 = 1 and

logit P0(Y) when X1 = 0, while all others are fixed at 0 as mentioned earlier. Thus,

$${\text{logit }}P_{1} \left( Y \right) - {\text{logit }}P_{0} \left( Y \right) = a + b_{1} X_{1} - a = b_{1} X_{1} = b_{1} \;\;{\text{as}}\;\;X_{1} = 1$$

Thus, b1 represents the change in the log odds i.e. logit that would result from a one unit change in the variable X1 when other variables are fixed.

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Saikia, H., Bhattacharjee, D., Mukherjee, D. (2019). Performing Under Difficulty: The Magical Pressure Index. In: Cricket Performance Management. Indian Statistical Institute Series. Springer, Singapore. https://doi.org/10.1007/978-981-15-1354-1_7

Download citation

Publish with us

Policies and ethics