Introduction

Quantitative bibliometric indicators have become important tools for research assessments. Their advent has elated, frustrated, and haunted investigators, institutions and funding organizations. It is well documented that metrics have both strengths and limitations and that they need to be used with due caution. For example, the Leiden manifesto summarizes such a cautious, judicious approach to the use of bibliometric and other scientometric indicators [1].

However, misuse and gaming of metrics are rampant [2, 3]. The urge to “publish or perish” (or even “get cited or perish”) creates an environment where gaming of metrics is amply incentivized. A whole generation of old and new tricks tries to make CVs look better and more impactful than they really are. Many of these gaming tricks can reach extravagant levels, as in the case of paper mills, massive self-citations, or citation cartels [4,5,6].

Concurrently, there are several efforts to improve metrics and make them available for all scientists, authors, and papers in ways that allow for proper standardization and more legitimate use [7,8,9]. Healthy efforts in bibliometrics and scientometrics should try to counter gaming and flawed practices. In the same way that antivirus software can detect and eliminate software viruses, proper metrics may be used to detect and correct for flawed, biased, gamed metrics. This review examines several gaming practices, including authorship-based, citation-based, editorial-based, and journal-based gaming, as well as gaming with outright fabrication. We show how quantitative metrics may be used to detect, correct, and hopefully pre-emptively diminish the chances of gaming and other flawed, manipulative research practices. Such an approach may help improve more broadly the standards of research conducted worldwide.

Authorship-based gaming

Authorship of a scientific paper carries credit, responsibility, and accountability. Unfortunately, responsibility and accountability are often forgotten and the focus is placed on how to extract maximal credit from large numbers of co-authored publications (Table 1). Gift authorship (honorary authorship) refers to the phenomenon where authors are listed who have not made a contribution sufficient to justifiably deserve authorship [10, 11]. The Vancouver criteria specify the types of contributions that are necessary for authorship credit. However, violations of these criteria are likely frequent. The exact frequency of gift authorship is difficult to pinpoint, but several surveys suggest high prevalence [12,13,14,15,16,17,18,19,20]. Estimates from such surveys may even be gross underestimates, since admitting gift authorship is a risky disclosure. Gift authorship particularly thrives with specific researcher profiles and situations. The classic stereotype is the department chair placed as an author (often as the senior author) on most or all publications issued from that team, regardless of the level of contribution.

Gift authorship may co-exist with ghost authorship [21,22,23,24,25], where the author(s) who really wrote the paper do not even appear, while one or more gift authors take their place. The classic stereotype is when industry employees do the work and draft manuscripts published with names of academic gift authors, while the industry employees are invisible ghosts. Ghostwriting aims to confer academic prestige to the paper and minimize the perception of sponsor bias.

The advent of the concept of contributorship [26] has allowed more granular information to be provided on the type of contributions made by each listed author of a scientific article. Moreover, efforts to standardize contribution types, in particular the CRediT system [27, 28], may allow a fairer appraisal of contributions. In theory, quantitative approaches can process and analyze at massive scale the contributorship of each scientist across his/her papers and place these values against the distribution of similar values from other authors in the same field. However, this is meaningful and feasible only for papers with standardized (or at least comparable) contribution types. More importantly, credit allocation may be gamed in the same way as plain authorship [29]. There is hardly any way to verify whether the listed contributions are genuine, let alone at what level they occurred. Instead, one may use information on authorship patterns to assess whether some scientists exhibit unusual behavior suggestive of gaming.

In particular, large-scale bibliometric databases [30] have allowed the detection of hyper-prolific scientists, e.g., those publishing more than one full article every 5 days (excluding editorials, commentaries, notes, and letters). This pattern is particularly suggestive of loose authorship criteria when massive changes in a scientist’s productivity are linked to the assumption of powerful positions, e.g., one can track that a scientist was in the 20th percentile of productivity in his field but moved to the top 0.01% after becoming a powerful administrator (a position unlikely to leave much time for doing research).
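As a rough sketch of how such screening could be automated, the snippet below flags author-years exceeding roughly one full article every 5 days. The record fields (author_id, year, doc_type) and the document-type labels are hypothetical placeholders for whatever a given bibliometric database exports; the threshold simply encodes the rate mentioned above.

```python
from collections import Counter

# Hypothetical record format: each record is a dict with an author identifier,
# publication year, and document type, as exported from a bibliometric database.
FULL_ARTICLE_TYPES = {"article", "review"}   # excludes editorials, commentaries, notes, letters
HYPERPROLIFIC_THRESHOLD = 73                 # ~ one full article every 5 days (365 / 5)

def flag_hyperprolific(records):
    """Return {(author_id, year): n_full_articles} for author-years above the threshold."""
    counts = Counter(
        (r["author_id"], r["year"])
        for r in records
        if r["doc_type"] in FULL_ARTICLE_TYPES
    )
    return {key: n for key, n in counts.items() if n > HYPERPROLIFIC_THRESHOLD}
```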

In some teams and institutions, inordinate credit does not accrue to a single person but is diffused across many team members. This may be common in multi-center work, where authorship is awarded to many members from each of the participating components or local teams. There is an issue of balance here. On the one hand, credit needs to be given to many people, otherwise those left out would be mistreated. Mistreatment is common and often has other structural societal inequities as contributing factors (e.g., gender bias) [31]. On the other hand, there may be over-attribution of authorship to too many people, i.e., thin salami slicing of credit. The unfairness becomes more obvious when scientists from a team that over-attributes authorship credit compete with scientists from teams that are less generous with authorship credit (or even inappropriately withhold such credit). For example, for the same amount of work, one epidemiological consortium may decide to list 100 authors, while another may list only 10, and a third only 3.

Quantitative approaches can help sort out the co-author network of each author. They can generate citation metrics that account for co-authorship [32,33,34] and/or author position, and even for relative contributions (if such information is available) and field-specific practices [35]. Therefore, two authors may have the same number of authored papers but differ markedly in their relative placement and co-authorship patterns in these papers: one may have many papers as a single author or with few co-authors, while the other may routinely have 50 or more co-authors. Similarly, two authors may have the same h-index for overall citations [36] but differ many-fold in a co-authorship-adjusted index, such as Schreiber’s hm index [31].
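To make the contrast concrete, here is a minimal sketch (not the specific implementations of the cited works) of the classic h-index alongside Schreiber’s fractionalized hm index, in which each paper contributes a rank of 1/(number of authors). The example authors and citation counts are hypothetical.

```python
def h_index(citations):
    """Classic Hirsch h-index: the largest h such that h papers have >= h citations."""
    ranked = sorted(citations, reverse=True)
    return sum(1 for rank, c in enumerate(ranked, start=1) if c >= rank)

def hm_index(papers):
    """Schreiber's hm index. `papers` is a list of (citations, n_authors) pairs.
    Papers are ranked by citations; each contributes a fractional rank 1/n_authors,
    and hm is the largest effective rank r such that the paper reaching rank r
    still has at least r citations."""
    ranked = sorted(papers, key=lambda p: p[0], reverse=True)
    hm, r_eff = 0.0, 0.0
    for citations, n_authors in ranked:
        r_eff += 1.0 / n_authors
        if citations >= r_eff:
            hm = r_eff
        else:
            break
    return hm

# Two hypothetical authors with identical citation counts but different co-authorship:
solo = [(20, 1), (15, 2), (12, 1), (10, 3), (9, 2)]        # few co-authors per paper
team = [(20, 40), (15, 35), (12, 50), (10, 45), (9, 30)]   # 30-50 co-authors per paper
print(h_index([c for c, _ in solo]), h_index([c for c, _ in team]))  # both 5
print(round(hm_index(solo), 2), round(hm_index(team), 3))            # ~3.33 vs ~0.13
```

With heavy co-authorship the same citation record translates into a much smaller hm, which is the kind of adjustment that can flag authors whose headline metrics rest almost entirely on very large author lists.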

Citation-based gaming

Many flawed evaluation systems still emphasize the number of publications, although this measure should not matter in itself. A good scientist may publish one, a few, many, or a huge number of papers. What should matter is their impact, and citation metrics are used as surrogates of impact. However, these measures can also be gamed, and different metrics differ in their gaming potential.

First, publishing more papers may by itself lead to more citations. Citations are not fully rational, and many scientists cite papers without even having read them [37]. While some papers are never cited, this proportion has probably decreased over time, and the frequent quote that half of all papers are never cited is a myth [38]. One may penalize publishing large numbers of papers, and some have even argued that there should be a cap on how many total words a scientist can publish [39]. Such penalizing and capping is ill-advised, however. It may intensify selective reporting and publication bias, as scientists would strive to publish only extreme, nice-looking results that may attract more attention. It is probably more appropriate not to pay any attention to the number of publications (except for the extreme tail of hyper-prolific authors) and to allow scientists to disseminate their work in whatever way and volume they deem most appropriate. However, one may examine other quantitative metrics such as citations per paper and place these metrics in a percentile ranking against papers from the same field; e.g., a scientist may have 100 publications and 1000 citations and be at the 25th percentile of his field for citations per paper (1000/100 = 10). Another scientist may also have 1000 citations, but with 1000 publications may be in the bottom 0.1% for citations per paper (1000/1000 = 1), suggesting he/she is publishing very trivial work.
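A minimal sketch of this kind of normalization follows, assuming one already has a field-wide distribution of citations-per-paper values; the field distribution below is a hypothetical illustration, not real data.

```python
from bisect import bisect_left

def citations_per_paper(total_citations, n_papers):
    """Citations per paper; 0.0 if the author has no papers."""
    return total_citations / n_papers if n_papers else 0.0

def field_percentile(value, field_values):
    """Percentile rank (0-100) of `value` against the same metric computed
    for all authors in the same field."""
    ranked = sorted(field_values)
    return 100.0 * bisect_left(ranked, value) / len(ranked)

# Worked example from the text: same total citations, very different profiles.
print(citations_per_paper(1000, 100))    # 10.0 citations per paper
print(citations_per_paper(1000, 1000))   # 1.0 citation per paper

# Hypothetical field distribution of citations-per-paper values for context:
field_values = [0.5, 1.0, 2.0, 4.0, 8.0, 12.0, 20.0, 35.0]
print(field_percentile(10.0, field_values))  # 62.5
print(field_percentile(1.0, field_values))   # 12.5
```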

Self-citation is a classic mechanism that increases one’s citation count [40, 41]. Self-citations can be defined in different ways. A strict definition includes only references that an author makes to his/her own work. A more inclusive definition also includes references to one’s work made by any of the co-authors of that work. Many self-citations are entirely appropriate [42]. Science requires continuity and linking of current work to relevant previous work. In fact, avoiding such linking and not using self-citations would be inappropriate and even ethically reprehensible; e.g., it may mislead readers into thinking that some work is entirely novel, and/or could lead to undeclared self-plagiarism [43]. Self-citations may also have both a direct effect in increasing total citations and an indirect effect: when a work is mentioned more frequently, other scientists may notice it and cite it as well [44] (Table 1).

Table 1 Examples of gaming behaviors and how quantitative metrics may help

Evaluating whether each individual self-citation is appropriate would require an impossibly strenuous in-depth review. However, centralized bibliometric databases [5, 45] allow the proportion of self-citations of an author to be placed as a percentile ranking against the self-citations of other authors in the same scientific field. Extreme outliers (adjusting for field and possibly also age [5]) may be characteristic of gaming behavior (Table 2).
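A minimal sketch of computing that proportion under both definitions is shown below; the citation-link representation is a hypothetical simplification of what a citation database would provide, and the resulting share can then be fed to a percentile helper such as the one sketched earlier so that only extreme outliers are flagged.

```python
def self_citation_share(citation_links, author_id, inclusive=False):
    """Proportion of an author's incoming citations that are self-citations.
    Each link is a (citing_author_ids, cited_author_ids) pair for one citation
    received. Strict definition: the citing paper lists the author him/herself;
    inclusive definition: the citing paper lists any co-author of the cited paper."""
    if not citation_links:
        return 0.0
    def is_self(citing, cited):
        return bool(citing & cited) if inclusive else (author_id in citing)
    n_self = sum(1 for citing, cited in citation_links
                 if is_self(set(citing), set(cited)))
    return n_self / len(citation_links)
```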

Table 2 Proportion of self-citations (PSS) among the total number of citations received: 1% and 5% thresholds among top-cited established authors in different scientific fields

Self-citation practices may also take complex forms. Occasionally, authors may collude to cite each other’s works, even though they are not co-authors. Such citation cartels (citation farms) usually involve a small number of authors. The flow of citations may not be directed equally towards all members of the cartel. For instance, one or a few members may be cited, while the others may enjoy other repayments. The members of the citation farm may be in different institutions and countries. Again, quantitative metrics are the best way to diagnose a cartel. Usually, a large number of scientists cite a given author’s work, and citations from each citing author account for a very small portion of the total citations. Conversely, in a citation farm, a handful of citing authors may account for > 50% or even > 80% of the citations received.
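A minimal sketch of this concentration check, assuming one can identify the citing author for each incoming citation; the function name and thresholds are illustrative.

```python
from collections import Counter

def top_citer_concentration(citing_author_ids, k=5):
    """Share of an author's incoming citations contributed by the k most frequent
    citing authors; `citing_author_ids` holds one entry per incoming citation."""
    counts = Counter(citing_author_ids)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    top_k = sum(n for _, n in counts.most_common(k))
    return top_k / total

# In a normal profile this share is small; values above ~0.5-0.8 contributed by a
# handful of citing authors (illustrative thresholds from the text) point to a farm.
```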

Some of the inappropriate self-citing or citation-farming behavior may even aim to selectively inflate a specific citation metric considered most important. For example, the Hirsch h-index has enjoyed inordinate popularity since its introduction in 2005 [35]. The h-index can be gamed more easily than the total number of citations. Self-citers preferentially (“strategically”) cite papers that readily boost the h-index [44]. Again, quantitative metrics can help detect this behavior, for instance by examining the ratio of total citations over the square of the h-index. Average values for this ratio are about 4 [35]. Very small values suggest that citations have been targeted at papers that boost the h-index, while total citations are relatively more difficult to manipulate.
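The check itself is trivial to compute once total citations and the h-index are known; the numbers in the example below are hypothetical.

```python
def citations_over_h_squared(total_citations, h):
    """Ratio of total citations to the square of the h-index; values around 4 are
    typical, and very small values suggest citations targeted at h-boosting papers."""
    return total_citations / (h * h) if h else float("inf")

# Hypothetical comparison:
print(citations_over_h_squared(3600, 30))  # 4.0  -> unremarkable
print(citations_over_h_squared(1000, 30))  # ~1.1 -> possible h-index targeting
```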

Editorial-based gaming

Journals may not treat all the authors who submit work to them equally. Some authors may be favored. The proportion of submissions accepted may vary markedly across authors. Often this is entirely normal: some scientists truly submit better work than others. However, difficulties arise when the submitting and publishing authors are directly involved with the journal, as editors-in-chief or as staff members. With the rapid proliferation of journals, including mega-journals [46], the numbers of editors-in-chief, guest editors, and staff members have also increased markedly.

Editors are fully entitled (and even encouraged as part of their job) to write editorials and commentaries on topics that they consider important. This activity is fully legitimate. These editorial pieces may go through no or very limited review and get published quickly on hot topics. Some high-profile journals, such as Nature, Science, and BMJ, have numerous staff writers and science journalists (on staff or as freelancers) who write news and feature stories, often massively. An empirical analysis [47] has shown that some of these writers have published 200–2000 papers in venues where most scientists would consider publishing even a single article a career milestone. Most of these authors are not competing in the world of academia. However, exceptions do occur where editorialists publishing massively in one journal may be academics. Other editors may give up their editorial careers at some point and move to competitive academia. Another concern is that these editorial publications often carry no disclosures of potential conflicts of interest [47]. Some editors have great power to shape science narratives in good or bad ways. Quantitative metrics can separate the impact of authors due to non-peer-reviewed editorial material from that due to peer-reviewed full articles.
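A minimal sketch of such a split, assuming the database tags each record with a document type and a citation count; the type labels and field names are hypothetical stand-ins.

```python
from collections import defaultdict

# Document-type labels are hypothetical stand-ins for whatever a database provides.
EDITORIAL_TYPES = {"editorial", "comment", "news", "letter"}

def citations_by_material(records):
    """Split an author's citations into non-peer-reviewed editorial material
    versus peer-reviewed full articles, based on document type."""
    totals = defaultdict(int)
    for r in records:
        bucket = "editorial" if r["doc_type"] in EDITORIAL_TYPES else "full_article"
        totals[bucket] += r["citations"]
    return dict(totals)
```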

A more contentious situation arises when an editor-in-chief publishes original full articles in his/her own journal. This is possible and not reproachable if done sporadically (especially if the paper is handled by another editor), but concerns arise when the practice is common. Empirical analyses have shown the prevalence of editorial nepotism [48]: in a survey of 5,468 biomedical journals, 5% of the journals had more than 10.6% of their published articles authored by a single person, typically the editor-in-chief and/or other highly favored members of the editorial board. Quantitative analyses can map the distribution of an author’s papers across different journals and identify whether there is an inordinate concentration of full, original papers in journals where the author is editor-in-chief.
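A minimal sketch of that concentration measure; both inputs are hypothetical stand-ins for data that would come from a bibliometric database and from journal mastheads.

```python
def own_journal_concentration(paper_journals, edited_journals):
    """Fraction of an author's full, original articles that appear in journals
    where the author serves as editor-in-chief. `paper_journals` lists the journal
    of each such paper; `edited_journals` is the set of journals the author edits."""
    if not paper_journals:
        return 0.0
    in_own = sum(1 for journal in paper_journals if journal in edited_journals)
    return in_own / len(paper_journals)
```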

Journal-based gaming

Most of the gaming at the level of journals involves efforts to boost the journal impact factor [49]. A detailed description of the multiple well-known problems and gaming practices for this metric is beyond the scope of this paper. Nevertheless, many of the gaming practices used by single scientists have equivalents at the journal level, e.g., coercive journal self-citation (requests by the editor to cite other papers published by the journal) and citation cartels involving journals rather than single authors (“citation stacking”) [50]. Many of these processes and gaming tools can be detected by bibliometric analysis of journal self-citation and co-citation patterns. Journal impact factor manipulation may also yield gaming gains for specific researchers, in particular the editors, as described above. Journals with higher impact factors get cited more frequently, even for papers that are published identically in other journals (e.g., reporting guideline articles) [51].

Gaming with outright fabrication

The gaming practices described so far typically do not have to involve fabrication. The gamed published and cited material is real, even though its quality may be suboptimal, given the inflated productivity. However, there are also escalating gaming practices that involve entirely fabricated work.

In paper mills, a for-profit company produces papers (typically fraudulent, fabricated ones) and sells authorship slots in them to interested scientists. The papers are for sale before submission or even after acceptance [52,53,54]. An increasing proportion of retractions in the last 7 years has been for paper mill-generated articles [55]. It is unknown, though, whether this is just the tip of the iceberg, with the retracted papers being those where the fabrication is most egregious and thus readily discernible. The advent of more powerful large language models may make paper mill products more sophisticated and difficult to identify [56]. Software is evolving to detect the use of such large language models, but it is unclear whether detection software will be able to keep up. The involvement of artificial intelligence in writing scientific papers is an evolving challenge for both genuine and fraudulent papers. Several journals have tried to tackle this challenge, but reactions have not been uniform [57,58,59,60].

There are many other egregious developments in the publishing world, a consequence of publish-or-perish pressure. Predatory journals (journals publishing content for a fee but practically without peer review) are widely prevalent, but their exact prevalence is difficult to ascertain, given the difficulty of agreeing on which journals are predatory [61,62,63]. Some of the most notorious phenomena are hijacked journals and the publication of spurious content. Hijacking happens when a website formerly belonging to a discontinued legitimate journal is taken over by a predator who uses the name and prestige of the previous journal to operate the predatory business [64]. Some papers also get published in journals with totally unrelated aims/mission/subject matter coverage; such spurious content is an indication of fraudulent behavior (e.g., it may be associated with both paper mills and predatory publishing).

Again, bibliometric, quantitative indicators can be used to place the prevalence of such behaviors in publication corpora of single authors, teams, institutions, and journals into perspective. Indicators may include frequency of documented paper mill products, hints of inappropriate use of large language models, hints of predatory or other inappropriate journal behavior (e.g., percentage of papers published in journals that lost their journal impact factor), and percentage of papers with content unrelated to the other published content of the journal.

Even in very serious journals, the proportion of fabricated papers may be increasing over time. John Carlisle, an editor of a prestigious specialty journal (Anaesthesia), requested the raw data of over 150 randomized trials submitted to his journal and concluded that in 30–40% of them the data were so flawed and/or improbable that he called these trials zombie trials [65, 66]. Zombie trials tend to come from particular institutions and countries. Such trials involve demanding clinical research designs that are difficult to perform, let alone to perform massively. Quantitative bibliometric analysis can allow the detection of sudden, massive production of papers with demanding study designs for which a scientist or team has no prior tradition or resources, e.g., a sudden, massive production of randomized trials in some institutions in less developed countries [67].
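A minimal sketch of such a surge check, comparing recent annual output of a demanding design against an author’s or institution’s prior baseline; the window and multiplication factor are illustrative assumptions, not established thresholds.

```python
def design_surge(yearly_counts, window=3, factor=5):
    """Flag a sudden surge in papers with a demanding design (e.g., randomized
    trials): compare mean annual output in the most recent `window` years against
    the prior baseline. `yearly_counts` maps year -> number of such papers;
    `window` and `factor` are illustrative, not established thresholds."""
    years = sorted(yearly_counts)
    if len(years) <= window:
        return False
    recent = sum(yearly_counts[y] for y in years[-window:]) / window
    baseline_years = years[:-window]
    baseline = sum(yearly_counts[y] for y in baseline_years) / len(baseline_years)
    return recent >= factor * max(baseline, 1.0)
```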

Metrics of best research practices and of poor research practices

Most bibliometric and scientometric indicators to date have focused on counting the numbers and citations of scholarly publications. However, it is also very important to capture information on research practices, good and bad. These research practices may in fact often be well reflected in the publication corpora themselves. For example, good research practices include wide data sharing, code sharing, protocol registration, and replications. It is currently possible to capture for each scientist how often he/she used these standards in his/her published work. For example, a free, publicly available resource covers all the open-access biomedical literature for these indicators [68].

It is also possible to capture systematically the use of several poor research practices. For example, image manipulation is a common problem across many types of investigation. There are already appraisal efforts that have tried to generate data on signs of image manipulation across large publication corpora rather than doing this exercise painstakingly one paper at a time [69, 70].

Another potential sign of poor research practices is retractions. At least one science-wide assessment of top-cited scientists currently excludes from consideration those with retracted papers, based on the inclusive Retraction Watch database (https://retractionwatch.com/2022/11/15/why-misconduct-could-keep-scientists-from-earning-highly-cited-researcher-designations-and-how-our-database-plays-a-part/). The majority of retractions may signal some sort of misconduct. However, in a non-negligible proportion of cases, they may actually signal honest acknowledgment of error, a sign of a good scientist that should be praised and encouraged if we wish to see better self-correction in the scientific record. Therefore, when retractions are present, they need to be scrutinized on a case-by-case basis regarding their provenance and rationale. Making wider use of available resources, such as the Retraction Watch database, and improving and standardizing retraction notices [71] may help add another important dimension to research appraisals.

Putting it together

Table 3 lists a number of quantitative metrics and indicators that are currently readily available (or can be relatively easily obtained) from centralized databases. The examples of scientists shown are entirely hypothetical and do not correspond to specific real individuals; they are provided for illustrative purposes. All three scientists are highly cited, in the top 1.8%, 0.9%, and 0.7% of their scientific domain, respectively. However, two of the three show problematic markers and/or score very low on markers of transparency and reproducibility.

Table 3 An illustrative list of quantitative metrics and indicators to appraise a scientist

Efforts should be devoted to making such datasets more comprehensive, routinely covering such indicators across all of scientific investigation, with percentile rankings adjusted for scientific field. Each metric should be used with full knowledge of its strengths and limitations. Attention should focus particularly on extreme outliers; modest differences between two authors should not be seen as proof that one’s work is necessarily superior to the other’s. Even with extreme values, metrics should not be used to superficially and hastily heroize or demonize people. For example, very high productivity may reflect some of the best, committed, devoted scientists; some recipients of massive gift authorships; and some outright fraudsters. While single metrics may not suffice to differentiate these groups fully reliably, the complete, multi-dimensional picture can usually separate clearly who is who.