Review time, that is the number of days between the invitation phase and report phase, is one of the most direct and tangible measure of the efficiency of the review process. Since our sample contains information about the beginning and end of each phase, we were able to acquire distributions of review time for known and other reviewers, as well as partial distributions of days between all intermediate phases. These partial distributions are especially interesting, as they can serve as building blocks with which one can create a simulation of the entire review process and recreate the cumulative distribution of review time under various assumptions.
The distribution of review time can be reassembled using partial distributions in the following way (see Fig. 4). To each node (phase) j of the review process graph (Figs. 1, 2 and 3) one can assign the probability \(q_j\) that a realisation of the process will pass through node j and the probability distribution \(G_j(t)\) of days between the invitation phase and phase j. Similarly, each edge is characterised by the probability \(p_{i,j}\) that the review process will pass from phase i to j and the probability distribution \(P_{i,j}(t)\) of days associated with such a transition. Given all these probabilities, \(G_j(t)\) can be calculated as follows
$$\begin{aligned} G_j(t) = \sum _{\{i\}_j} w_{i, j}\ (G_{i}*P_{i, j})(t) \end{aligned}$$
(1)
where the summation is over set \(\{i\}_j\) of all predecessors of node j and symbol \(*\) represents the discrete convolution
$$\begin{aligned} (G_{i}*P_{i, j})(t) = \sum _{t'=0}^{t} G_{i}(t') P_{i, j}(t-t'). \end{aligned}$$
(2)
Weights \(w_{i, j}\) are defined as
$$\begin{aligned} w_{i, j} = \frac{q_i p_{i,j}}{q_j}. \end{aligned}$$
(3)
and the probability \(q_j\) can be expressed as
$$\begin{aligned} q_j = \sum _{\{i\}_j} q_{i} p_{i, j}. \end{aligned}$$
(4)
Equations (1–4) are recursive. The distribution \(G_j(t)\) associated with node j depends on the corresponding distributions associated with predecessors of node j and probabilities \(q_j\) exhibit similar dependence. As such, these equations can be solved recursively if one assumes appropriate initial conditions for nodes without parents (in our case it is \(q_{\text {invitation}} = 1\) and \(G_{\text {invitation}}(t)=\delta _{0,t}\) for the node that corresponds to the invitation phase) and acquires probabilities \(P_{i, j}\) and \(p_{i,j}\) from the sample. One last fact worth noting is that the quantity \(q_i p_{i,j}\) from the numerator in Eq. (3) is actually the same as the probability in Figs. 1, 2 and 3 next to each edge.
Using the aforementioned procedure we recreated the distribution of review times for both known and other reviewers which we then compared with the corresponding empirical distributions from the sample (Figs. 5, 6, 7 and 8). According to our theoretical calculations based on Eqs. (1–4) the average review time for known reviewers is 23 days with standard deviation of 12 days which is in agreement with the average review time acquired from the sample. As for other reviewers, the theoretical average review time is 20 days with standard deviation of 11 days and the sample, again, yields the same values. One-sample Kolmogorov–Smirnov test performed to compare the theoretical distribution with the sample gives p value 0.88 for known reviewers and 0.97 for other reviewers. It means that the distributions of review times calculated using partial distributions are essentially the same as the ones obtained directly from data.
This is an important and non-obvious observation, as the only underlying assumption behind Eqs. (1–4) is that the review process is memoryless (Markovian)—that is the partial distributions assigned to edges do not depend on the history of the process. Results presented thus far seem to confirm this reasonable assumption. Moreover, the findings are reinforced even further in the following section through simulations of the model.
Other than the validity of theoretical distributions, there are two main conclusions that can be drawn from results presented in Figs. 5, 6, 7 and 8. Firstly, the review time distribution is bimodal. Reviewers who either confirmed or sent in their reviews after receiving the invitation are the ones who contribute to the leftmost maximum (and they are in the majority of those who actually completed the reports—69 % of other and 82 % of known). Secondly, distributions of review time are similar for known or other reviewers. The difference between means and standard deviations for both groups is negligible from any practical standpoint: a two-sample Kolmogorov–Smirnov test for both empirical distributions gives p value \(\simeq\)0.40. Based on these facts one can make a very strong assumption that the distribution of review time is the same across the entire population of reviewers and does not depend on the reviewer group.
While in our work we were mostly interested in the time that is needed to acquire a given number of reviews, it should be mentioned that technically this is only the first major stage of the full peer review process. The second stage begins when the reviews are sent to authors and ends with the notification of acceptance or rejection. However, the dynamics of that second stage are rather linear and straightforward. In the case of our data from JSCS, one revision of the original manuscript was necessary to address the remarks of reviewers (though one has to keep in mind that we only had access to data pertaining to accepted manuscripts). On average, it took authors 34 days to deliver the revised version and final notifications were sent after 8 more days. Thus, manuscripts were accepted on average 42 days after the sub-editor received all reviews. It means that the second stage of the peer review process is longer than the first one, which is consistent with findings of other researchers (Trimble and Ceja 2011).