1. Introduction

Rating of teams and players is an important part of most sports. Ratings are used for the selection of teams and players for tournaments, and for seeding players and teams within tournaments (eg, Association of Tennis Professionals (ATP), 2012). They are of interest in their own right (eg, Castrol Football, 2012; FA Premier League (FAPL), 2012; Federation International of Football Association (FIFA), 2012). A rating is a numerical value allocated to each player or team based on their contributions, while a ranking is the ordinal position based on the ratings (Stefani, 2011). Broadly speaking, the rating of teams in team sports and the rating of individual players in individual sports is relatively straightforward and well developed (eg, Stefani, 1977; Elo, 1978; Harville, 1980). Recently, Stefani (2011) reviewed rating systems and discussed the three basic types of sport rating systems, namely, the subjective, objective and accumulative. Typically, objective rating systems use data on results to estimate competitors’ strengths. Estimated strengths then directly provide a rating, and rankings can be established from these (eg, Koning, 2009). Many proprietary systems use a combination of the objective and subjective, with results forming the objective component, and the scaling of results according to other factors, such as the importance of a match or tournament, the subjective component (eg, ATP, 2012; FIFA, 2012). The rating of individual players in team sports is more challenging since individual contributions have to be determined in the presence of a team effect and due to the fact that individuals may make different types of contributions according to their specialisms (McHale et al, 2012). The rating of individual players in test cricket is the focus of this paper.

In test cricket, each team consists of 11 players, a combination of batters and bowlers and a wicket keeper. The standard measures of batting and bowling averages have been used to assess player performance over many years. The Reliance ICC (2011) player rankings are the official ICC rankings, and therein are separate measures for batsmen, bowlers and all-rounders. All-rounders are players who bat and bowl to a frontline standard; all-rounders are rated using a combination of their batting and bowling averages. Interest in estimating measures of player performance in cricket goes back to Johnston et al (1993), who assessed player performance in 1-day cricket using dynamic programming. Beaudoin and Swartz (2003) proposed a new statistic for assessing players’ performances in 1-day cricket using the Duckworth/Lewis (D/L) methodology. Lewis (2005) calculates player performance in 1-day cricket. The author also uses the D/L methodology to evaluate players’ performances in relation to the runs expected for the stage of the innings and in relation to the resources that are consumed when batting, or contributed when bowling. Lewis (2008) extended this work to evaluate player performance over a longer term. However, test matches are different because in 1-day matches there is no notion of playing out the time remaining for a draw. Recently, Borooah and Mangan (2010) proposed new ways of computing test match batting averages to account for two deficiencies. First, they take account of a player’s career consistency. Second, they take account of a player’s batting contribution relative to their team’s contribution (total). However, fundamentally these adjustments do not consider the influence of a contribution on match outcome.

We propose a new rating system based on players’ performances (batting, bowling and fielding) during each session of a test match. (Note that each day of a 5-day test match is divided into three sessions: morning, afternoon and evening, each nominally 2 h in length.) There are advantages to looking at players’ performances session by session. First, a session-by-session analysis can take account of the state of the batting and bowling conditions; thus, scoring runs on the first day in a match may be quite different from scoring runs on the fourth or fifth day because test pitches deteriorate with the passage of time and batting becomes more difficult in the later stages of the match. Second, a session-by-session analysis can take account of the state of the match (pressure situation); thus, a player who scores 50 runs when his team score is 100 runs for 5 wickets may be making a more valuable contribution and should be rated more highly for this contribution than a player who scores 50 runs when his team score is 100 runs for 1 wicket. A similar argument may be put forward for bowlers and wicket-taking contributions. The system we develop also takes account of the wicket value, that is, whether a wicket of a recognized batsman or ‘tail-ender’ is taken. Furthermore, we propose a rating system that measures batting and bowling (and fielding) performances on the same scale. Then in principle the rating system can be used to choose the best player in a match or in a series (of matches), or to determine the new emerging player in a particular year, or the best player in a year, whether a batsman, a bowler, an all-rounder (a player who is a frontline batsman and a frontline bowler) or a wicket keeper. Best batting and best bowling performances over a period can also be determined. The system we propose has certain similarities to that developed by Scarf and McHale (2005) and McHale et al (2012) for rating soccer players.

To illustrate the importance of different match contexts, consider the following example: the first test match between India and Australia at Chandigarh, India in 2010. The batting scorecard for this match is shown in Table 1. The match was very closely contested and India won by 1 wicket. Watson scored 126 runs in the first innings and 56 runs in the second innings. Tendulkar scored 98 runs and 38 runs in his two innings. These players scored the maximum number of runs for their respective sides during the match. However, it is difficult to determine how important these batting contributions were in the match from Table 1; this is because the state of the match at the time it was scored is not clear. The session-by-session scorecard (Table 2) shows that Watson and Tendulkar scored runs in relatively easy situations in the match. On the other hand, the most crucial and decisive period appears to be the last four sessions in which Laxman played a very important role, scoring 73 runs and arguably playing the decisive innings for his team in this match. Traditional measures of performance ignore the context of the match, and thus the Reliance ICC player ranking, which is based on traditional measures, does likewise.

Table 1 Batting scorecard for first test, India against Australia in India in October 2010
Table 2 Session-by-session scorecard for match in Table 1

In order to take account of the match situation when runs are scored or wickets taken, we analyse the net contribution of players using a model of match outcome probabilities given the position at the end of each session that was proposed by Akhtar and Scarf (2012). This model was developed to predict match outcomes ‘in-play’. It is similar in principle to those developed by Brooks et al (2002), Scarf and Shi (2005), Scarf and Akhtar (2011) and Scarf et al (2011a). In particular, Brooks et al (2002) use an ordered probit model to predict match outcome given the batting and bowling strengths at the start of a match, Scarf and Shi (2005) use logistic regression to predict match outcome given the end of third innings position, Scarf and Akhtar (2011) extend this latter work to the end of first and second innings positions, and finally Scarf et al (2011a) consider declaration strategy using the third innings position model.

Using the session-by-session match outcome probabilities, we measure the effect of a contribution (runs scored or wickets taken during a session) on the match outcome probabilities; the greater the effect, the greater the value of the contribution and the more highly the contribution should be rated in the rating system. This principle of rewarding contributions by players in a team according to the effect of the contribution on the match outcome was essentially put forward in the development of the EA Sports Player Performance Index for soccer (McHale et al, 2012). The novelty of our paper lies in implementing this principle in the context of a sport in which players make fundamentally different kinds of contributions. Early development of the work is described in Scarf et al (2011b) and Akhtar (2011).

The structure of the paper is as follows. The session-by-session prediction model is briefly described. We then describe how the prediction model is used to determine player contributions. We then compare different potential method measures of player contribution. Furthermore, we illustrate player rating with examples. Lastly, we introduce a weighted average rating scheme.

2. Measuring batting, bowling and fielding contributions

We use match outcome probabilities to measure the contributions of players. The probabilities are predicted using a nominal multinomial logistic regression fitted session by session to data on 146 test matches. Data were obtained from the ESPNcricinfo website (ESPN Cricinfo, 2010), and the complete data set of 146 matches relates to all the test matches in the period between November 2005 and March 2010, excluding those matches for which the session-by-session data were not available and in which more than 90 overs were lost to poor weather. Session-by-session information is not generally available before November 2005. The data were processed in order to obtain session-by-session information (eg, Table 2) for each match.

The nominal multinomial logistic regression model is as follows. With match outcome Y taking values (1, 0, −1) to denote a win, draw and loss, respectively, covariates denoted by X and taking a draw (0) as a reference category, the model assumes Y has a multinomial distribution, that is, Y~MN(p 1, p 0, p −1; ∑p i =1), where p 1, p 0 and p −1 represent the probability of a win, a draw and a loss, with

A sequence of models with covariates lead, ground effect and total wickets lost for each team is obtained. These models are given in Table 3. The coefficients are shown in Table 4. There are two points to note here. First, the win–draw comparison and the loss–draw comparison depend on the covariates in different ways—this is a property of nominal as opposed to ordinal multinomial logistic regression. Second, the wickets covariate is transformed to wicket resources before fitting—this allows the value of the tenth wicket, for example, to be less than that of the first wicket in the wickets-lost effect. The models of Akhtar and Scarf (2012) included further covariates: pre-match team strengths and home advantage. We do not use these here because our view is that the worth of the player should be based on the current performance. Finally, note that the number of matches available for model fitting for later sessions decreases because not all matches last the full 5 days.

Table 3 Best fitting multinomial logistic regression model at each stage, and showing Nagelkerke R 2 and number of matches used to estimate the model
Table 4 The fitted parameter estimates for each session of the match with covariates lead (L), total wickets down by reference team (W 1 ), total wickets down opposing team (W 2 ) and ground effect (G)

Using these models, we can predict the match outcome probabilities (win, draw and loss) given the match position at the end of each session t (t=1, 2, 3 … 15). These are given by

where P t A denotes the probabilities at the end of session t, and W 1, W 2 and L are, respectively, the total wickets down of the reference team, the total wickets down of the opponent team and the lead of the reference team. The reference team bat first in the match. The actual match position at the end of session t is (L=l t ,W 1,=w 1, t , W 2=w 2, t ).

Next, to measure performance session by session, we define hypothetical positions for the batting team and for the bowling team. For the batting team, we define a hypothetical position at (the end of) session t in which batsmen have not scored any runs in session t, and obtain match outcome probabilities given the hypothetical position:

Thus, the hypothetical position for the batting side at the end of session t is (L=l t−1,W 1,=w 1,t , W 2=w 2,t ); if the batters score nothing in the session then their lead does not increase and remains at l t−1, which is the lead at the end of the previous session t−1.

Similarly for the bowling side, we define a hypothetical position at the end of session t in which bowlers have not taken any wickets in session t and obtain match outcome probabilities given the hypothetical position

Thus, the hypothetical position for the bowling team in session t is (L=l t , W 1=w 1, t−1, W 2=w 2, t−1). In this way, if the bowling team take no wickets in the session then the wickets down for the other team remains at w t−1. Of course, the wickets down for the bowling team remains unchanged during the session.

At the end of each session, to assess the worth of players’ contributions, we use the difference between the hypothetical match outcome probabilities and the actual match outcome probabilities. These differences, for batters and for bowlers, are our measures of total player contribution during the session. The probability difference (total contribution) is then awarded to a batter according to his share of the runs scored in the session, and to a bowler according to his share of wickets taken in the session. For bowling contributions, however, a share is also given to fielders. The details of these procedures are given below.

However, matters are not straightforward. This is because we can look at probabilities in relation to alternative outcomes: winning, drawing or at least one of these, or equivalently, not losing. Hence, we can calculate in the manner described above the value of a contribution to winning or the value of a contribution to not losing. Different situations have been investigated, and we propose one criterion for batters and another for bowlers. For batters, we look at probability differences with respect to ‘not losing’ and at bowlers with respect to winning. We believe that looking at not losing is a reasonable approach to rate the batters in test cricket. This is because batting teams act conservatively and early in a game will aim to bat towards a position from which they cannot lose. Even later in a game, a team will bat for a win only if a target or a lead is moderate; otherwise they play for a draw. On the other hand, bowlers and fielders predominantly play for a win—the notion of bowlers and fielders playing for a draw does not really exist, despite the fact that fields might be set to be defensive. The numerical results considered later in Section 4 (Figure 1) provide a further justification of these choices of criteria. Alternatively, one might use an expected ‘points’ gained criterion for both batters and wicket takers that assumes 1 point for a win and 2 points for a draw.

Figure 1
figure 1

For individual players: (a) total batting points (not lose criterion) versus runs in the match; (b) total wicket-taking points (not lose criterion) versus wickets share; (c) total batting points (win criterion) versus runs in the match; (d) total wicket-taking points (win criterion) versus wickets share; 1 (win), 0 (draw), −1 (loss) from point of view batting and bowling team; respectively (104 matches).

The value of a player’s contribution is computed at the end of each session in the following way. Defining the event {win} that the match outcome is a win for the reference side, and likewise for {lose}, and {not lose}={windraw}, the batting and bowling contributions of all players in session t are

The value of an individual player’s batting contribution is

where r i,t is the runs scored by player i in session t and r t is the total runs scored by his team in session t.

For wickets taken, we proceed in the same way in principle to runs scored, sharing the total points among those who take wickets. However, we make allowances for fielding contributions (catches, run-outs and stumpings), and award a share of the points for a wicket to the fielder if indeed a fielder contributed. The choice of the number to express such a share for catches and stumpings is arbitrary. We suppose there are j=1, …, n types of wicket-taking contribution, each of which has share α j of the points. Thus, j=1 corresponds to a wicket taken by the bowler (bowled, lbw, α 1=1), j=2 corresponds to a catch taken by a fielder (α 2=0.25) and j=3 corresponds to the bowler contribution in this case (α 3=0.75). Run-out contributions are shared between the fielders involved. Stumpings are treated similarly to catches. For player i in session t, the number of contributions of type j, z itj , can be obtained from the scorecard. If the total number of wickets taken in session t is z t then the wicket-taking points awarded to player i in session t is

The net contribution of player i in the match is then the sum of contributions from all sessions:

We assume in these expressions that there is no change of innings during a session. When there is, we calculate the actual and hypothetical probabilities in a slightly different way. We suppose in effect that the within-session change of innings occurs at a virtual session-end. The outcome probabilities for the virtual session-end are calculated by interpolating between the probabilities that can be calculated for the actual session-ends immediately before and after the virtual session-end. We carry out the interpolation linearly with weights determined by the number of overs into the session at which the change of innings occurs. We omit the details of the calculation here. A player will only score batting and wicket-taking points in the same session if there is a change of innings in the session. The following should also be noted. First, in modelling match outcome, lead is calculated session by session throughout the match with respect to the reference team, the team batting first in the match. Therefore, all the above formulae are strictly applicable to the reference team. For the opponents, the team bowling first, the bowling criterion {win} of the reference team becomes {lose} of the reference team and vice versa. Second, the change in the probabilities at the end of a match (last session) is calculated by comparison of the hypothetical position with the actual match result. Third, we could use different criteria, for example, {win} for batters and {not lose} for bowlers. This is explored in the example below. Finally, our intention is to produce a rating system where total batting and wicket-taking points over all time should be the same. However, we are not yet in a position to do this as we have not collected data on sufficient matches.

3. Comparison of potential measures of contribution

We now compare the approaches described above. Batting and wicket-taking points for individual players are calculated by the above stated approaches, and plotted in Figure 1, for 104 test matches played in 2010, 2011 and 2012. To calculate the ratings of not too large a number of players for a reasonably large number of matches, we only consider the top eight teams (Australia, New Zealand, Pakistan, India, South Africa, West Indies, Sri Lanka and England). Figure 1(a) clearly shows a positive relationship between batting points under the {not lose} criterion and total score. On the other hand, using the {win} criterion for batters, some of the players earn negative points (Figure 1(c)). This is because batters do not always play for a win; sometimes batters bat slowly for a draw when the opponent’s lead is large. When a batter bats slowly the win and loss probabilities decrease and the draw probability increases. Therefore, those batters who play for a draw can receive negative points under the {win} criterion. By using the {not lose} criterion for wicket taking (Figure 1(b)), some players received negative points. On the other hand, Figure 1(d)({win} criterion for wicket-taking points) gives a stronger positive relationship. Thus, these plots would appear to support the idea of using the {win} criterion for wicket-takers and the {not lose} criterion for batting points.

4. Examples

4.1. Example 1

Consider now the third test between Australia and England, Perth, 2010. Table 5 provides the summary of players’ batting, bowling and fielding contributions in the match. Batting contributions use {not lose} and bowling and fielding contributions use {win}. Australia batted first, scoring 268 runs in their first innings and 309 runs in their second innings. In reply, England scored 187 runs in their first and 123 runs in the second innings. Australia won this match by 267 runs. Hussey scored maximum runs in the match and received maximum batting points (0.53). On the other hand, Johnson scored 63 runs and took nine wickets in the match. He received the maximum points (0.66) with this all-round performance and was declared man of the match. Note that wicket-taking points are adjusted (rescaled) so that total batting points and total wicket-taking points are equal; we do this to standardize the contributions. Such standardization might be carried out over a series of matches, or even over all matches played in a specified period. The rationale for this is that ultimately, if enough matches are considered, run-scoring and wicket-taking contributions ought to be balanced.

Table 5 Player contributions in the third Ashes test, Australia versus England, Perth, 2010

4.2. Example 2

Consider now the 2010/2011 Ashes, a five-match test series between England and Australia played in Australia. The results of adopting the contribution system are shown in Table 6. This series was an interesting one for all stakeholders: fans, teams, bookmakers and the communication-media organizations. Squads for each team contained up to 17 players so that more than 22 players took part in at least one match. For brevity, only the top 20 players are presented. Note also that wicket-taking points are rescaled so that total batting points and total wicket-taking points over the series are equal in order to standardize the contributions. Using the combined contribution measure to evaluate the player of the series or tournament award, we see that Alastair Cook, the England opening batsman, is placed in top position. Behind him in second is his countryman Anderson, a bowler whose contribution with the ball was also outstanding. It is difficult to argue against these players being the best batsman and bowler, respectively. Cook in fact was awarded the man of the series accolade. England players occupy four of the top five positions; Hussey is the exception at third. England won the series 4–0.

Table 6 Ashes series 2010/2011: Players in rank order of total contribution, batting and wicket taking (bowling and fielding)

4.3. Example 3

Consider the two-match test series between South Africa and India in India, February 2010. The series was tied at 1–1. Tables 7 and 8 summarize the net wicket-taking (bowling and fielding) and batting contributions of the players. To calculate the batting points, we tried both the earlier stated criteria ({win} and {not lose}). However, under the {win} criterion, a significant number of batsmen received negative batting points. Therefore, batting contributions are calculated using the {not lose} criterion. Wicket-taking contributions are calculated using the {win} criterion. Raw unadjusted contributions are shown.

Table 7 Raw player contributions for 2010 test series between India and South Africa in India, first test match
Table 8 Raw player contributions for 2010 test series between India and South Africa in India, second test match

Table 7 summarizes the net batting and net wicket-taking contributions in the first test match. South Africa won this match by an innings and 6 runs. Amla scored 253 runs in the match and received the maximum batting points (0.187). On the other hand, Steyn received the maximum wicket-taking points (0.878) in the match. Note that Sehwag and Tendulkar scored almost the same number of runs but received quite different numbers of points. This is because Tendulkar made 100 runs in the second innings when the defeat of his team was almost certain. Table 8 summarizes the net batting and wicket-taking contributions in the second test. India won the match by an innings and 57 runs. Amla once again scored maximum runs in the match but received fewer batting points than Sehwag and Tendulkar. The Sehwag and Tendulkar partnership was the crucial and decisive one from the point of view of the match result. Laxman and Dhoni also made hundreds in the match but received fewer points than Sehwag and Tendulkar. This is because India were already in a strong position in the match and faced the less pressured situation.

Table 9 shows the total scores over the 2-match series for the top 10 players in rank order, with wicket-taking contributions rescaled so that the total batting and wicket-taking contributions over the series are equal. Harbhajan Singh is placed in top position with maximum total points on the basis of his 7.5 wickets (bowling plus fielding) in the second test; arguably, all these wickets were decisive and played a vital role in the win. Amla scored a massive 490 runs in the two-match series but is some way behind Singh in the adjusted contribution score. Sehwag, who scored 290 runs in the series, achieved nearly as many adjusted contribution points as Amla. This is because Sehwag’s runs came at a more decisive time than Amla’s, particularly in the second match of the series. It is quite clear that Amla was the most consistent batsman in the series, but he received fewer points than Harbhajan Singh. This is perhaps because one good bowling spell can change the situation in the match completely and in a very short time; therefore, under the system bowlers can accrue a large number of points. On the other hand, batters take time to make the position strong. This suggests that bowlers are more likely to receive the maximum points, even in the adjusted contribution system.

Table 9 Total adjusted player contributions for the 2010 test series between India and South Africa in India (top 10 players)

5. Rating test match players

To rate players on the above measures of contribution may not be totally satisfactory. For instance, consider the above stated Example 3, the two-match series between India and South Africa. First, Amla scored 490 runs in the series at an average of 490 runs per innings and received the maximum batting points (0.288), while Sehwag scored 290 runs in the series at an average of 97 runs per innings and received 0.282 points, nearly as many. There is a large difference between their runs but they received almost the same number of contribution points. This is in part because the contribution points system takes little account of contributions after a point when the win or draw probability of any team is close to unity. Second, in both matches, bowlers stand out in the contribution points table: Steyn in the first test (Table 7) and Harbhajan Singh (Table 8) in the second test. Furthermore, there is the fact that negative points can be achieved, and this appears to be more prevalent for batters. To overcome these problems, it may be sensible to use the contributions as one component of a weighted average rating system. The other component might be the raw runs and wickets in the match.

Such a scheme could be developed in a number of ways. Under one such scheme, the batting, bowling and fielding rating points might be calculated in the following way. Let the batting points of player i (i=1, 2, … ,22) under a weighted average rating system in a match be

where r i is total runs (raw) scored by player i in the match and R T is the total runs scored by both teams (match total). Note that t is indexing each of the 15 sessions in a match. Let the wicket-taking (bowling and fielding) points of player i under this weighted average rating system in the match be

where w i is total wickets share of player i in the match. To standardize the contribution scores under this system, we multiply the net player contribution (sum of the batting, bowling and fielding points) by 1038—the average score per match, based on all test matches from 1877 to 2007. In this way, points achieved are put on a ‘runs-like’ scale. In addition, the maximum batting (runs) and wickets points will be reserved up to 1038 points per match.

To illustrate this rating system, Table 10 summarizes the net contributions of the top 26 players in the 104 test matches in 2010, 2011 and 2012 played by the top eight teams. The top two positions are secured by bowlers. DW Steyn, in the number one spot, scored 381 runs and had a wicket share of 109.25. GP Swann secured second place with 593 runs and a wicket share of 102.75. MJ Clarke in third scored 3120 runs. JH Kallis secured the seventh position with his all-round performance. The wicket-keeper batsmen MJ Prior and MS Dhoni were in sixteenth and twenty-fifth position, respectively.

Table 10 Batting, wicket taking and net player contributions (points) for 104 matches in 2010, 2011 and 2012

Another potential application of our rating system that may be appealing relates to the selection of man of the match, as the choice is often between a batsman or a bowler and occasionally between a player on the winning team or the losing team. Selecting the highest rated player as man of the match has an objective quality. Counting the instances in which the man of the match is on the winning or losing team as opposed to the highest rated player being on the winning or losing team (Table 11), we can see: (a) in only 2 matches (of the 104 matches in 2010–2012) was the man of the match on the losing team; (b) if the man of the match was awarded to the highest rated player (by our method) then there would have been 17 occasions for which the man of the match was on the losing side. Thus, the table suggests that the subjective choice of man of the match is biased towards winners.

Table 11 2 × 2 cross-classification table for 104 test matches in 2010–2012 played by the top eight teams (30 drawn matches are ignored), classified by actual man of the match (MOM) on winning or losing side and highest rated player on winning or losing side

Not only would a man-of-the-match award based on maximum rating points be fairer to losers, but also it brings a better balance between batters, bowlers and all-rounders. Among these 104 matches, the man of the match was a batter, bowler, all-rounder in 51, 44, 5% of occasions, respectively. The corresponding figures for the highest rated player are 41, 48, 11%, respectively. Thus, these figures suggest that bowlers play a more decisive role than they are given credit for in man-of-the-match awards.

A drawback of our rating system is that the total number of points achieved is the same for every match. Thus, for matches in which the advantage often changes between the teams, while contribution points will be increased standardization moderates this effect. Thus, the unadjusted contributions method will give greater rewards to players in tight contests—this would seem to be an advantage of this method. To overcome these drawbacks, different components of a weighted average scheme might be combined once total scores on each component are determined for a series of matches. Thus, suppose in a five-match series r runs are scored and w wickets taken, and thus 1 wicket is equivalent to r/w runs. Suppose also that the raw total (over all five matches) contribution points are n for batting and m for wicket taking. Then a possible scheme that equally weights batting and wicket taking, and actual runs scored and wickets taken and contributions for each, would be:

where r i , w i , n i and m i are the runs scored, wickets taken, raw batting contribution points and raw wicket-taking contribution points of player i in the series, and T i is their total weighted average points scored in the series on a ‘run-equivalent’ scale. Note again that no adjustment accrues to batsmen for wickets lost or to bowlers for runs conceded.

6. Discussion

The purpose of this paper is to evaluate player performances based on batting, bowling and fielding contributions in the match, session by session. We propose a new rating system for bowlers and batters using a common scale. Nominal multinomial logistic regression models are used to model match outcome probabilities for hypothetical and actual positions. Measures of player performance are then calculated using changes in such probabilities session by session. The advantage of looking at the contribution at the end of each session is that we can evaluate the performance of the player taking into account the stage of match in which runs and wicket are earned and conceded. Another advantage of this rating system is that it provides measures (points) for all-rounders, wicket keepers and fielders, again on the same common scale. The ICC rating does not take into account fielding contributions of players. Furthermore, our rating system has the potential to select man of the match, man of the series, best player of the year and new emerging player of the year. These measures could also be helpful for team selectors and management to select a best possible team based on their current performance.

A limitation of this work is that batting contributions and wicket-taking contributions are measured using subtly different criteria: wicket takers are assumed to contribute to winning while run scorers are assumed to contribute to the contrary, that is, to not losing. These different contribution criteria necessitate the rescaling of contributions to a common base. This can be achieved in principle by choosing a relative rescaling factor such that the total run-scoring contributions balance with the total wicket-taking contributions over the ‘history’ of the game. This would present a challenge for implementation. A more fundamental challenge, however, is that in assigning wicket-taking contributions to fielders (for catches and run-outs) we must subjectively assign a value for the share due to a fielder. If run-scoring and wicket-taking contributions were determined using the same criterion then we might have chosen the fielder’s share objectively in the same way. Unfortunately, we cannot resolve two unknowns with a single constraint.

Another limitation of this work is the weakness of the match prediction model. For example, if both the teams lose wickets on the first day of play then selected models are not able to take into account the number of wickets down for the team batting second (Akhtar and Scarf, 2012). Therefore, the bowlers and fielders of the reference team will not be awarded at this stage. The collection of data on more matches would be beneficial to overcome this problem.

In addition, bowlers are not penalized for giving away runs, and fielders are not rewarded for saving runs, or penalized for giving them away through miss-fields. The latter events are not recorded in match scorecards and thus their incorporation requires a different form of data recording. For runs conceded by bowlers, the concept of a hypothetical match position at session-end cannot be extended because hypothetically a bowler could concede an unlimited number of runs during a session. One might instead consider runs conceded relative to certain norms; this is an avenue for further investigation. For example, one might consider the runs conceded by a bowler during a session relative to a figure for runs conceded that is ‘loss neutral’ over the session.

However, more positively, it is our view that the rating system shows potential for the more reliable assessment of player performances in batting and wicket taking and in comparing performances between these two disciplines. This potential has been shown not just for an individual match but also for a series of matches. The contributions approach might be used in combination with more traditional performance measures, as we illustrate. If the methodology is to be incorporated into or to supplant the existing standard and commercial measures of performance and rankings, further evaluation of the method is necessary, particularly regarding how the measure would perform in the longer term. To achieve this, many more matches need to be analysed.

It is implicit in the system proposed that only those who play can make a contribution, and furthermore that frontline batters and bowlers have a greater opportunity to make a contribution. Thus, the system will not measure who are the best players. Instead, it will measure who played the ‘best’ cricket. If one were to consider a rating based on contribution rate (per match played) then the best players might emerge, although again, frontline batters and bowlers would tend to be favoured. An average (eg, mean) contribution per match would also tend to favour (and disfavour in equal measure) those who play few matches due to the increasing variability in the mean as the number of observations decreases. However, one could in principle adjust for this by considering the problem of rating as one of estimating player strengths and shrinking the estimated strengths towards an overall mean strength in the manner described, for example, by McHale and Szczepański (2013) in the context of football.

Finally, the contribution measure might be used to determine the top cricketers of all time. It would be fascinating to determine, for example, whether the great all-rounders of the past (eg, Imran Khan, Ian Botham, Garry Sobers) head such a list, and where the best batters (eg, Don Bradman, Brian Lara, Sachin Tendulkar) lie in relation to the best bowlers (eg, Muttiah Muralitharan, Richard Hadlee, Michael Holding).