Introduction

Distribution of credit among the coauthors of publications represents one of the main problems in assessing publications of individuals. The most simple and popular solution to the problem is the application of the partial authorship method (fractional allocation, see Kolthun & Hafner, 2021). Accordingly, the credit of a journal paper (pi = 1) published and citations (ci) received to the paper are equally distributed among the coauthors (ai), i.e.: 1/ai and ci/ai, respectively.

Although numerous sophisticated methods have been suggested in the literature (e.g. Ioannidis et al., 2019; Kolthun & Hafner, 2021; Lukovits & Vinkler, 1995; Prathap, 2021; De Solla Price & Beaver, 1966; Sangwall, 2022; Vinkler, 1993), it does not exist a widely accepted procedure for distributing the share of credit of publications and citations for the coauthors. Even the possibility to find a standard method has been questioned (Osório, 2018).

Indicators applied in scientometric evaluations of individuals, organizations or countries have been generally calculated from publication and citation data of total publication sets. However, after the introduction of the Hirsch-index (Hirsch, 2005), indicators referring to papers in the “core” (or elite) subsets have been preferably used. Oberesch and Groppe (2017) published a comprehensive survey on the variations of h-index. The paper deals also with possible methods for fractional allocation of citations in calculating the h-index.

The central idea behind the use of core (elite) set indicators is the assumption that indices derived from the most cited publications may characterize the relevant publication impact of the total set more appropriately than the mean indices of total sets (Jin et al., 2007; Leydesdorff, 2012; Schreiber, 2010; Vinkler, 2017b).

The idea for applying core journal papers in assessments finds support by the well-known fact that the distribution of citations over papers is skewed (Seglen, 1992). In addition, it is well known that scientific progress is advanced primarily through information in publications acknowledged by relatively high number of citations (Aksnes, 2003; Plomp, 1990; Vinkler, 2010a, 2017a). Consequently, publications with high influence may be revealed by determining relatively highly cited journal papers.

Core (elite) subsets can be obtained by different statistics: h-statistics, g-statistics, percentage statistics, π-statistics, πv-statistics, etc. The core or elite part of publication sets is termed as h-core, g-core, π-core, or top-1%, 10%, etc. of total, according to the method applied. Naturally, the publications in the sets studied are ranked according to the decreasing citation frequency by each method.

The preferred use of elite set indicators has resulted in a plethora of scientometric impact indices (Leydesdorff, 2012; Schreiber, 2010; Schreiber et al., 2012; Todeschini & Baccini, 2016).

The impact indices derived through the mentioned statistics, e.g.: h-index (Hirsch, 2005) and h-related indices (Bornmann et al., 2011), g-index (Egghe, 2006), π-index (Vinkler, 2009), πv-index (Vinkler, 2010b), I3-index, indices of percentiles and percentile classes, PR(2, 10) or PR(2, 50) or PR(6) (Bornmann & Marx, 2014; Bornmann et al., 2013; Leydesdorff, 2012), top-1% highly-cited papers (Wagner, Lin Zhang., & Leydesdorff, 2022), etc. strongly depend on the bibliometric features of the corresponding field.

There are some methods applying outside standards for obtaining elite subsets. One of these methods takes those publications as “elite” to which the number of citations obtained is higher than the mean citation rate of the corresponding field (Vinkler, 2017a). According to the most wide spread method, the journal papers (e.g. of a country) in the top-1% of highly cited papers in the corresponding field may be accepted as most influential on world level (see e.g. WoS Essential Science Indicators).

Kolthun and Hafner (2021) tried to verify the h-index by comparing the appropriate values of individuals with their scientific awards. They conclude: „…fractional allocation (of citations) improves the effectiveness and predictive power of research metrics, and h-frac is consistently the most reliable bibliometric indicator. Our results suggest that the (recent) use of the h-index in ranking scientists should be reconsidered…”.

The publications in elite subsets of a scientific field may represent hot topics or relevant core information of the field depending on the time period applied. Science is developing permanently. Therefore, the dynamic study of the publications in elite subsets is highly relevant (Egghe, 2007).

The present paper compares the h-index and π-index calculated by attributing total credit (both publication and citation) to one of the coauthors (i.e. to the evaluated scientist) of multiauthored papers with h-index and π-index obtained by attributing only the relevant part of papers and citations to the scientist studied. The methods calculating impact indices from data of papers (and citations) in the relevant part of multiauthored publications may be called as part-impact methods. Accordingly, the research tasks of the present publication may be summarized as follows:

  • Study of impact of the number and rank of authors in the byline of publications on π-index and h-index applying Full Authorship Method (FAM), Partial Authorship Method (PAM), Practical Rank Score Method (PRSM), and Single plus First Authorship Method (SFAM). (For FAM, PAM, PRSM, and SFAM, see Data and methods.)

  • Comparison of the percentage share of journal papers and citations in the following elite subsets: π-set, h-set, 2π-set, 2h-set.

  • Revealing the difference in the mean citation rate (C/P) of papers in the elite subsets (π, 2π, h, and 2h-set).

  • Study of the application of part-impact methods on the rank of scientists according to h-index and π-index.

Data and methods

For investigating the impact of the number of coauthors of papers of a scientist on h-index and π-index, total sets of journal papers published by 20 Price medallists were collected. The persons selected (T Braun, B Cronin, L Egghe, W Glänzel, P Ingwersen, L Leydesdorff, B R Martin, K W McCain, H F Moed, F Narin, O Persson, R Rousseau, A Schubert, H Small, M Thelwall, AFJ vanRaan, P Vinkler, L Waltmann, H White, M Zitt) may be regarded as scientists with outstanding contribution to the field of quantitative studies of science.

The present study does not intend to evaluate the publication activity of the mentioned scientists. The publication and citation data of the studied researchers are taken as model for investigating features of some elite set indicators calculated with part-impact methods.

The publication and citation data of the mentioned scientists were collected from Web of Science in Mai, 2021. Only papers classified by WoS as: Information Science Library Science and Business Economics were selected (ILBE papers). Each ILBE paper was reviewed and only those belonging to scientometrics, bibliometrics or informetrics were taken into account (TP papers). The total number of TP publications: 2718, to which 99,396 citations were arrived up to the mentioned time.

The following indicators were calculated for each scientist: total number of papers (TP), total number of citations (TC) to TP papers, mean citation rate, TC/TP, number and per cent of papers cited more frequently than the mean (ECP and ECP%, resp.), per cent of papers in the elite subsets (π, 2π, h, 2h) and that in the rest of total [(TP-P(2h)]%. The number of papers in the π-set = √TP (Vinkler, 2009). The number of papers in the h-set is equal to the h-index. The number of papers in the 2h or 2π set is equal to two times the number of papers in the h or π set, respectively.

The number and rank of coauthors of TP papers was taken from WoS. Sharing citations among the coauthors was made by the full credit method and by three part-impact methods:

  • Full Authorship Method (FAM): total number of citations received by a journal paper is given to the author studied independent of number of coauthors and rank of the assessed author in the byline of the paper.

  • Partial Authorship Method (PAM): the studied author is given ci/ai share of citations to the paper (ci), where ai is the number of authors of the i-th paper.

  • Practical Rank Score Method (PRSM). According to this method, the number of authors and rank of the studied author in the byline of the paper is taken into account in calculating the share in citations of the studied author (Table 1).

Table 1 Cooperation factors for applying Practical Rank Score Method (PRSM)

The citations of the author studied will be obtained as a product of the total number of the citations to the paper (ci) multiplied by the corresponding cooperation factor (Table 1). The cooperation factors applied are calculated according to Vinkler (1993) and Lukovics and Vinkler (1995). (E.g. ci = 30 to a paper with ai = 3 authors where the scientist studied is the 3rd author. Accordingly, the corresponding share: 30 × 0.20 = 6, see Tables 1, 2).

Table 2 Model for demonstrating the change in rank and citations to the papers on h-index by applying Full Authorship Method (FAM), Partial Authorship Method (PAM), and Practical Rank Score Method (PRSM)

The authors ranked higher than sixth, each will be given a factor of 0.05. This way the sum of the cooperation factors of the authors would be higher than unity. Therefore, the factors of the individual authors should be normalized into unity. In case of e.g. 8 authors: the sum of the cooperation factors = 1 + 2 × 0.05 = 1.1. After normalization, the first, second, etc. author will obtain 0.35/1.1 = 0.318; 0.25/1.1 = 0.227; 0.15/1.1 = 0.136; 0.1/1.1 = 0.09; 0.1/1.1 = 0.09; 0.05/1.1 = 0.045; 0.05/1.1 = 0.045; 0.05/1.1 = 0.045 scores. The sum of scores so obtained = 0.996 (rounded to unity).

  • Single plus First Authorship Method (SFAM): the papers authored only by the studied scientist, plus that where this author ranks as first, independent of the number of coauthors, are classified as SFA-papers.

It is obvious that in the PA, PRS, and SFA-set the rank of the papers by the number of citations will be different from that in the original set (Full Authorship Method, FA-set). Accordingly, also h-index, π-index, etc. obtained for the FA-set will be different from the indices calculated for the PA, PRS and SFA-set (see Table 2).

The statistical calculations and figs. were made with TIBCO Statistica 13.4/13.5 program.

Results and discussion

Share of papers in the elite sets

The total number of journal papers (TP) published by the scientists studied, mean number (TC/TP) of citations (TC) received, percentage share of papers cited more frequently than the mean (ECP%), and that in π, 2π, h, 2h, and (TP-2h)-set are given in Table 3. The studied scientists published 135.9 papers (SD = 105.5) in average during the investigated period (1975–2021). The dynamic range of the data is rather wide (from 27 up to 369).

Table 3 Total number of papers (TP) published by the studied scientists, mean citation rate, TC/TP, number and per cent of papers cited more frequently than the mean (ECP and ECP%, resp.), per cent of papers in elite subsets (π, 2π, h, 2h) of different size and that in the rest of total [(TP-P(2h)]%

The data in Table 3 show that the mean percentage rate of the papers increases in the rank: P(π), P(2π), ECP, P(h), P(2h). The mean share of papers in the h-set (32.83, SD = 14.07) is 3.13 times higher than that in the π-set (10.50, SD = 3.73). The difference between the mean P(π)% and P(h)% is highly significant (p < 0.001). The number of papers in h-set may be similar to that in π-set only in extreme cases (Vinkler, 2017a).

The ratio of papers within a given set with higher number of citations than the mean (Table 3) may be characteristic for the distribution of citations over papers. The mean ECP% of the studied scientists is 26.56, whereas the standard deviation of the mean is rather low (SD = 5.78). Accordingly, the share of papers cited higher than the average seems to be similar in the publication sets of eminent scientometricians.

The data in Table 3 also show, the mean percentage rate of papers in the π-set, P(π)% (10.50, SD = 3.73) is significantly (p < 0.001) lower than the share of papers with higher citation rate than the mean (ECP% = 26.56, SD = 5.78). At the same time, mean P(h)% does not differ significantly (p = 0.073) from the mean ECP%, whereas its value is somewhat higher (32.83%, SD = 14.07). The difference may be attributed to the higher number of highly cited papers in the h-core compared to π-core.

The Spearman rank-order correlation coefficients reveal significantly high and positive correlation (r = 0.88) between the percentage share of papers in the π and h-set, P(π)%, P(h)%, resp. It is obvious however; the greater the size of the publication set (TP), the lower the share of papers in the π and 2π-set, relatively. Therefore, the correlation coefficients are negative: r = − 1.00 and r = − 1.00, respectively. The coefficient between TP and P(h)% is also negative but it differs from unity: r = − 0.88. In contrast, the share of papers in the (TP-2h)-set seems to be greater with greater number of papers (r = 0.69). I.e. publishing relatively many papers, it would mean that beyond a limit, the papers would receive fewer citations. These papers are published in journals with lower impact factor, in average.

It is unexpected however that no significant correlation (at p < 0.05) is found between ECP% and share of papers in the elite sets, P(π)%, P(h)%: r = 0.14 and r = 0.31, respectively. The reason for that may be the skewed distribution of citations over papers.

Share of citations in the elite sets

Indicators applying citations represent a preferred role in evaluative scientometrics. The average number of total citations obtained by the scientists in the period studied is 5038.6 (SD = 3806.1) (Table 4). The dynamic range is rather wide: 16,051 (Leydesdorff) – 1190 (Vinkler). Table 4 shows also the share of citations in the different elite subsets.

Table 4 Total number of citations (TC) obtained to TP papers, and percentage share of citations (C) received by papers in π (C(π)%), 2π (C(2π)%), h (C(h)%), 2h-set (C(2h)%), and that in the set of total papers (TP) minus papers in the 2h-set ([TC-C(2h)]%)

The percentage share of citations in the π-set takes 54.90% (SD = 11.36), in average, whereas that in the h-set is as high as 82.05% (SD = 11.58). The h-set contains 1.53 (SD = 0.23) times more citations than the π-core, in average. The number of citations in 2h-sets approaches to the total (94.25%, SD = 7.42). This fact may be attributed to ranking the papers by the number of citations and to the skewed distribution of citations over papers. The share of papers ranked higher than 2h, i.e. those in the (TP-2h) set, is equal to 38.36% (SD = 23.81) (Table 3), whereas the share of citations to those papers is only 6.19% (SD = 7.18) (Table 4).

The data in Table 5 reveal significant positive correlation (r = 0.76) between TP and TC. The share of citations in the elite subsets, C(π)%, C(2π)%, C(h)%, and C(2h)% was found to correlate negatively (and mostly significantly) both with total papers, TP and citations, TC. The correlation coefficient is however not significant for the TC-C(h)% (r = − 0.44) and TC-C(2h)% relation (r = − 0.39). The share of citations in the elite sets, e.g. C(π)% and C(h)% correlates with each other positively and significantly (r = 0.73).

Table 5 Spearman rank-order correlation coefficients between the total number of papers, TP and total number of citations, TC = 100% of the whole publication set of the scientists studied and percentage share of citations to papers in the π-set, C(π)%, 2π-set, C(2π)%, h-set, C(h)%, and 2h-set, C(2h)%

Difference in citation rate (C/P) of papers in elite sets

In agreement with the expectations, the citation rate of papers, (C/P) in the individual elite sets decreases as follows: π-set: 232.13 (SD = 107.31), 2π-set: 151.48 (SD = 68.50), h-set: 111.57 (SD = 37.36), 2h-set: 64.20 (SD = 20.83) (Table 6). The difference between the mean of the total set (M = TC/TP = 42.92, SD = 21.38) and that of the elite sets, C/P(π), C/P(2π), C/P(h), C/P(2h) was found highly significant (p < 0.003). Similarly, all differences between the mean C/P value of the elite sets are highly significant (p < 0.008).

Table 6 Mean citations per paper values, M = (C/P) for papers in the total set (TP) of the studied scientists, C/P values in the elite subsets, and Single plus First Authorship Rates (SFAR%)

The Spearman correlation coefficients between C/P(π), C/P(2π), C/P(h), C/P(2h) of the elite sets are positive and significant (Table 7).The C/P values referring to the TP-P(2h)-set do not correlate with other C/P-data. This may be due to the relatively low and changing number of citations to papers in this set.

Table 7 Spearman rank-order correlation coefficients between citation rate (C/P) values of the different elite sets and that of the total set (TC/TP)

Figure 1 represents linear relationship between mean C/P of papers in the elite sets and mean citation rate (M = TC/TP) of the corresponding total sets. The slope of the lines increases in the rank: TP-P(2h), 2h, h, 2π, and π-set (0.026, 0.734, 1.296, 2.821, and 4.315, respectively). The citation rate of papers (C/P) in the Hirsch-core, termed here as C/P(h), corresponds to the A index suggested by Jin et al. (2007).

Fig. 1
figure 1

Relationship between the citation rate of papers in the total set (TC/TP) against the citation (C) per paper (P) index of papers in the elite sets (π = pi, 2π, h, and 2h) and that in the [TP-P(2h)]-set

The mean share of single authored papers plus that signed by the studied scientist as first author (SFAR%) was found 61.27% (SD = 22.11) (Table 6). The dynamic range: 25.62% (Rousseau) – i.e. 74.38% of the papers are published in cooperation where the mentioned scientist is not the first author – 100.00% (Vinkler) – i.e. all papers are single or first authored.

Study on the relationship between π and h-index and Mean Authorship Rate (MAR)

The Mean Authorship Rate (MAR) was calculated as a specific index according to the formula: MAR = TA/P, where TA is the total number of authorships of papers in the total set or in the corresponding elite sets (π, 2π, h, 2h), further in (TP-2h)-set, and P is the number of papers in the corresponding set. The number of authorships of a paper is equal to the number of authors listed in the by-line. The number of authorships in a set can be obtained as the sum of authorships of the corresponding papers.

The following Mean Authorship Rate (MAR) indices were found for the total set of the scientists: MAR = 2.23 (SD = 0.52), whereas that for π, 2π, h and 2h-set: 2.31 (SD = 0.59), 2.31 (SD = 0.64), 2.26 (SD = 0.53), 2.28 (SD = 0.55), respectively (Table 8). The MAR index for the papers cited rarely, i.e. that in (TP-2h)-set: 2.18 (SD = 0.73) (Table 8). Significance of the difference between Mean Authorship Rate (MAR) values referring to the different sets was also calculated. No significant difference (at p < 0.05) was found between the corresponding MAR values. Accordingly, both the most cited papers in the π-set and the less cited ones in the (TP-2h)-set show similar number of authors.

Table 8 Mean Authorship Rate (MAR) of papers in the total set, TP and in the elite sets: π-set, 2π-set, h-set, 2h-set, and (TP-2h)-set

Table 9 shows the Spearman rank-order correlation coefficients between the MAR index of the total set (TP) and that in the different elite sets with total number of papers (TP, Table 3) and citations (TC, Table 4). Relations of MAR index with h and π-index of the total set (Table 10, FA-set) are also given.

Table 9 Spearman rank-order correlational coefficients between Mean Authorship Rate (MAR) in the different sets and total number of papers (TP), total number of citations (TC), π-index, h-index of the total set (for more explanation see the text)
Table 10 π-index and h-index calculated for FA-set (i.e. the original total set, TP with full credit to the studied author), PA-set, PRS-set, and SFA-set of papers of the scientists studied

The data show that the correlation coefficients between the MAR index of the different sets are significant (at p < 0.05). The MAR index of total papers (TP in FA-set) however does not correlate significantly with total number of papers (TP) (r = − 0.11). The MAR index does not correlate significantly neither with total citations (TC) (r = 0.31) nor with π-index (r = 0.23) and h-index (r = 0.20) at p < 0.05. Accordingly, the authorship rate certainly has no influence on π-index and h-index of total set.

It seems to be interesting that the mean authorship rate (MAR) referring to π and 2π-set is relatively weekly but significantly related with both TC (r = 0.56, 0.52, resp.), π-index (0.51, 0.45, resp.) and h-index (0.45, 0.52, resp.) of the total set (TP). At the same time the MAR index (referring to h-set and 2h-set) show no significant correlation with h-index and π-index of the total set (TP). Nevertheless, the difference between the significant and non-significant correlation coefficients is low.

The Spearman rank-order correlation coefficient of π-index and h-index with total papers (TP) (r = 0.68, 0.85, resp.) and total citations (TC) (r = 0.89, 0.95, resp.) is relatively high and significant. This is because both impact indices depend on the number of papers and citations, plus on the distribution of citations over papers (Iglesias & Pecharromán, 2007).

Effect of the number and rank of authors on the h and π-index

The effect of the number and rank of authors on two elite set indicators (h-index, π-index) was studied according to Full Authorship Method (FAM), Partial Authorship Method (PAM), Practical Rank Score Method (PRSM), and Single plus First Authorship Method (SFAM) (see Tables 2 and 10). The set of papers obtained by the mentioned methods is termed as FA, PA, PRS, and SFA-set, respectively.

The mean π-index of the studied scientists taking into account their FA-set (i.e. total set where total credit of the papers is attributed to the scientist studied) was found as 25.40 (SD = 15.30) (Table 10). The dynamic range is rather wide: 5.25 (Vinkler) – 70.06 (Leydesdorff). The latter value is extremely high in scientometrics taking into account the relatively low number of journals and scientists working in this field. The h-index calculated for the mentioned scientist (h-index = 64) represents also an extremely high value. The mean h-index of the studied scientists was found as 32.70 (SD = 13.39). The dynamic range: 16–64.

The π, h, 2π, and 2hc-index of FA-set (the FA-set corresponds to TP-set, i.e. total set with attributing full credit to the researcher studied) depends linearly on total number of citations (TC) (Fig. 2). The slope of the lines increases in the following rank: h, π, 2π, and 2hc-index (0.0033, 0.0037, 0.0051, 0.0083, resp.). 2hc-index of a set of papers can be calculated as the number of citations received by 2h number of papers multiplied with 0.01. Accordingly e.g., if h = 20, and the first 40 papers received 1147 citations, 2hc-index = 11.47. The 2hc-set of the studied scientists contains 93.52% (SD = 7.79) of total citations, in average.

Fig. 2
figure 2

Scatterplot of total citations (TC) against pi(TP) = π(TP), 2pi(TP) = 2π(TP), h(TP), and 2hc(TP) index (number of papers in the 2hc-set = 2h) calculated for the total set (TP = FA)

The PA-set of the scientists obtained by Partial Authorship Method, also contains all papers (TP) but the rank of the papers according to the number of citations differs from that in FA-set (see Table 2). This is because the number of citations obtained to a paper of the studied author was calculated as ci/ai, where ci is the number of citations received and ai is the number of authors of the i-th paper.

The mean π and h-index was found as 14.25 (SD = 8.24) and 23.25 (SD = 7.97), respectively. The dynamic range of π-index: 3.24 (Zitt) – 41.04 (Leydesdorff) and that for h-index: 13 (Zitt) – 45 (Leydesdorff) (Table 10).

The number of papers in the PRS-set is equal to that in the total set (TP in FA-set). Nevertheless, the PRS-method takes into account not only the number but also the rank of the assessed authors according to Tables 1 and 2. The mean π and h-index of the scientists studied was found as follows: 14.69 (SD = 7.67) and 24.05 (SD = 8.83), respectively. The dynamic range of π-index: 4.32 (Zitt) – 35.15 (Leydesdorff) and that for h-index: 12 (Zitt) – 48 (Leydesdorff) (Table 10).

According to the Single and First Authorship Method (SFAM) only those papers are taken into consideration which are signed by the studied scientist as single author or of which first author is the studied researcher (SFAR%, Table 6). Naturally, the number of those papers may be lower or significantly lower than the number of total publications (see Table 2). The mean percentage rate of SFA papers of the scientists studied is 61.27 (SD = 22.11) (Table 6). The mean π and h-index of the papers in the SFA-sets: 15.07 (SD = 8.12) and 23.60 (SD = 10.59), respectively (Table 10). The dynamic range of π-index: 4.44 (Rousseau) – 33.97 (Thelwall), and h-index: 8 (Person) – 52 (Leydesdorff).

From the data in Table 10 the corresponding significance values of the difference between the mean h and π-index of the sets obtained by Full Authorship Method (FAM), Partial Authorship Method (PAM), Practical Rank Score Method (PRSM), and Single plus First Authorship Method (SFAM) were calculated (Table 11).

Table 11 Significance level of differences between the mean π and h-index of the different sets (see Table 10)

The difference between the mean π-index of the sets obtained by Full Authorship Method (FAM), PAM, PRSM or SFAM was found significant (p < 0.025, bold values in the first column in Table 11). The conclusion is valid also for h-index (second column in Table 11). In contrast, no significant difference was found between the π and h-values obtained by PAM, PRSM and SFA-Method (values in italics and bold italics, resp., Table 11).

Comparing the h-index obtained by FAM with that obtained by PAM, PRSM, and SFAM, two trends may be observed (Table 10). There are persons with h-index of their papers in the FA-set that does not change at all or changes only slightly by applying any of the mentioned methods (PAM, PRSM, or SFAM): Cronin (32, 26, 27, and 29), Martin (16, 14, 13, and 10), McCain (22, 18, 20, and 18), Small (26, 24, 25, and 25), Vinkler (20, 20, 20, and 20), White (20, 17, 18, and 19). In contrast, there are scientists with significantly higher change in h-index: Glänzel (55, 30, 35, and 40), Ingwersen (26, 16, 18, and 15), Leydesdorff (64, 45, 48, and 52), Moed (42, 27, 30, and 31) Schubert (36, 23, 22, and 19), Waltman (35, 23, 24, and 20).

It may be assumed that π and h-index of the FA-set compared to that of PA, PRS, and SFA-set could be positively and significantly related to the Single plus First Authorship Rate (SFAR%) of the scientists. Therefore, the corresponding Spearman rank-order correlations were calculated (Table 12). In contrast to the expectations, for the FA, PA and PRS-set no significant and positive correlations were found. The correlation coefficient between SFAR% and SFA(π), SFA(h) is positive but not significant.

Table 12 Spearman rank-order correlation coefficients between the Single plus First Authorship Rate (SFAR%) and π-index, h-index of sets obtained by Full Authorship Method (FAM), Partial Authorship Method (PAM), Practical Rank Score Method (PRSM), and Single plus First Authorship Method (SFAM)

Figure 3 demonstrates the linear relationship of h(FA) with h(SFA), h(PRS), h(PA), and 2h(FA). The slope of the lines: 0.685, 0.613, 0.562, and 2.358, resp. 

Fig. 3
figure 3

Scatterplot of h(FA) index against 2hc(FA), h(PA), h(PRS), and h(SFA)-index of the studied scientists. (For FA, PA, PRS, and SFA see the text)

Figure 4 demonstrates also linear relationship of π(FA) with 2π(FA), π(PA), π(PRS), and π(SFA). The slope of the lines increases by the following rank: π(SFA): 0.398, π(PRS): 0.463, π(PA): 0.500, and 2π(FA): 1.319.

Fig. 4
figure 4

Scatterplot of pi = π-values calculated with attributing full credit of papers (FA) to the author studied against 2pi(FA) = 2π(FA)-index and pi(PA) = π(PA)-index. The latter mentioned index is calculated with ci/ai part of citations (ci) attributed to the studied scientist, where ai is the number of authors of i-th paper. The pi(PRS) = π(PRS)-index refers to papers calculated with citations taking into account number and rank of authors, whereas the pi(SFA) = π(SFA) index is calculated according to the single and first authorship method

From the above we may conclude that applying any of the part-impact methods, i.e. distributing credit of publications and citations among the coauthors, may decrease the value of h and π-index by about of 55–60 and 70–75 percent, respectively. However, the decrease is not homogeneous. Table 13 shows the effect of decrease on the rank of the scientists. The rank value of the scientists according to h or π-index (or both) obtained from the data of the (total) FA-set differs from that calculated for PA-set in several cases (e.g. Braun, Egghe, McCain, Narin, Rousseau, Small, Vinkler, and White). In contrast, there are scientists (e.g. Leydesdorff, Martin, Moed, Person, vanRaan, Waltmann, and Zitt) showing similar rank values. Similar differences may be observed in the rank comparing data from FA and PRS-set.

Table 13 Rank of the studied scientists according to π-index and h-index calculated from the data of FA-sets (i.e. all papers, TP attributing full credit to the studied author), PA-sets, PRS-sets, and SFA-sets (attributing only the relevant part of the credit to the author studied)

The discrepancies between the ranks by h or π-index obtained from the FA, PA, PRS and SFA-set of the authors may be demonstrated by the mean difference in the rank number of the scientists (Table 14, in brackets). As another characteristic index: the number of cases with a difference of three or greater in the rank number may be also applied (Table 14).

Table 14 Number of cases with a difference of three or greater in the rank numbers of the studied scientists by h-index and π-index obtained from data of FA, PA, PRS, and SFA-set. Mean change in the rank number (in brackets) of the scientists by changing the set from which the impact indices are calculated

The rank by the indices obtained from FA-set against that calculated from PA-set changes at a greater extent for the scientists publishing together with relatively many coauthors. Applying the PRS-set, the change may be high for scientists ranked as third, fourth, fifth, etc. in the list of authors in the byline of the paper.

The SFA-set shows the greatest mean differences in rank numbers and number of differences of three or higher than three of h-index (3.37, 13, respectively). There are scientists with extreme great difference in h or π-index (or both), e.g. Egghe (FA: 10, 10, SFA: 4, 5.5, resp.), Rousseau (FA: 8, 7.5, SFA: 20, 14.5, resp.), White (FA: 9, 17.5, SFA: 3, 12.5, resp.), whereas the rank numbers of other persons (e.g. Leydesdorff, Martin, and Moed) are practically unchanged. The reason may be that the number of journal papers published solely, without coauthors may strongly depend on the personality and possibilities of the scientist. Similarly, the rank of authors in the by-line of journal papers may depend on several non-scientific factors. There are senior scientists who insist on the first position whilest others prefer the last one. The measure of contribution to the paper that is acknowledged by authorship differs from laboratory to laboratory (Vinkler, 1993).

In applying PA and PRS-set for calculating h-index or π-index, low number of differences of three or greater between the corresponding rank values: 2 and 3, respectively, and low mean rank differences: 1.77 and 1.10, respectively can be observed (Table 14). Accordingly, it may be concluded that the difference between the the respective indices is relatively small in most cases (see Table 10).

Summary and conclusions

In assessing publication performance of individuals, the importance of dividing the total set of journal papers of the studied scientists according to science fields was demonstrated in an earlier publication (Vinkler, 2021). The mentioned selection seems to be important because the bibliometric features (Vinkler, 1988) depend on the field. It is known that both quantity (e.g. number of papers) and impact indicators (e.g. number of total citations, citations/paper, Hirsch-index, π-index, etc.) strongly depend on the field. Without separating the journal papers of similar bibliometric features, the evaluation of total sets would yield incorrect results.

Scientific investigation is conducted in natural sciences mostly as teamwork. Consequently, multiauthored publications are common phenomenon. However, there are many situations in the practice (e.g. selection of applicants for granting, recommendation of persons for job, rewarding scientists with prices, study of performance of a country or laboratory, etc.) when individual performance should be determined. Consequently, it is relevant to distribute the credit of journal publications and citations among the coauthors.

For evaluation of individuals, teams or countries impact indicators of the elite sets of publications are preferably applied, recently. In the present study the following elite sets are applied: √P, h, 2√P, and 2h number of publications ranked by decreasing citation frequency of the journal papers of 20 scientometricians (P is equal to the total number of papers and h is equal to h-index).

The data in Table 3 show that the mean percentage rate of papers in the elite sets increases according to the rank: P(π), P(2π), ECP, P(h), P(2h), where P(π) is the number of papers in √P = π-set, and P(h) is that in h-set. The papers in ECP-set are cited more frequently than the mean TC/TP value of the corresponding total set. The mean share of papers in h-set (32.83%, SD = 14.07) was found 3.13 times higher than that in π-set (10.50%, SD = 3.73). The difference between the mean P(π)% and P(h)% values is highly significant (p < 0.001).

The share of citations in the elite subsets, C(π)%, C(2π)%, C(h)%, and C(2h)% was found to correlate negatively with total papers, TP and total citations, TC (Table 5). C(π)% and C(h)% correlates with each other positively and significantly (r = 0.73).

In agreement with the expectations, it was found that the mean citation rate of papers (C/P) in the individual elite sets decreased as follows: π-set: 232.13 (SD = 107.31), 2π-set: 151.48 (SD = 68.50), h-set: 111.57 (SD = 37.36), 2h-set: 64.20 (SD = 20.83).

The Mean Authorship Rate (MAR) index was calculated for the total and for the elite sets of papers of the studied scientists. The calculations show that MAR of total sets does not correlate significantly with total number of papers (TP) (r = − 0.11), total citations (TC) (r = 0.23), π-index (r = 0.23), and h-index (r = 0.20) (Table 9). Accordingly, mean authorship rate of papers in the total set may not influence the mentioned indices. Nevertheless, the low but significant correlation coefficients between the number of authors of papers in the π-set and π-index, h-index (r = 0.51, 0.45, respectively) may indicate some impact of the number of authors. In contrast, no significant correlation was obtained between the number of authors in h and 2h-set with π-index (r = 0.31, 0.15, respectively). Similarly, no significant correlation was detected between MAR in h or 2h-set and h-index (r = 0.43, 0.24, respectively).

According to the results presented here, in analysing publication impact of scientists, teams or countries distributing citations among the coauthors is highly relevant. A recent paper of Ioannidis et al. (2019) also points to the importance of distributing credit among coauthors. The study of Egghe (2008) gives a mathematical theory of h and g-index in case of fractional counting of authorship.

Scientometric evaluations can be performed by taking into account several viewpoints. The type of the part-impact method applied, should be selected accordingly. If the evaluation emphasizes e.g., the importance of cooperation between scientists or teams, we have to select the total set (FA). This way evaluating e.g. grant applications, data obtained from the FA-set should be preferably applied. If we were interested in the personal contribution by a scientist, PA, PRS or SFA-set should be used. The contribution of the person to be assessed is emphasized by using PRS-set, assuming that rank of coauthors runs parallel with measure of contribution. The greatest stress is laid on personal contribution by applying the SFA-method. Anyway, it would seem most appropriate to calculate the elite set indicators according to each method. Possible weightings may be made according to relevant expert decisions.

Several important requirements for the evaluation processes (e.g. specifying the purpose of the assessment, specifying the system to be assessed, setting criteria, selecting methods, specifying indicators for each criterion, setting the time-period, selecting appropriate standards, etc. (Moravcsik, 1988; Vinkler, 2010a) cannot be tackled here.