Introduction

The retraction of scientific papers has increased in recent years, with research misconduct now being the primary cause rather than honest research error (Campos-Varela & Ruano-Ravina, 2019; Fang et al., 2012; Nath et al., 2006). Although the term “research misconduct” is generally defined as fabrication, falsification, and plagiarism (FFP) (ICMJE, 2022), research integrity guidance is highly heterogeneous on crucial aspects at both the international and the national level (Godecharle et al., 2014), including the very definition of this phenomenon (National Academies of Sciences, Engineering, and Medicine, 2017; Resnik, 2019). Common definitions may include further violations of good research practice, such as duplication, misleading image manipulation, deliberate failure to disclose relationships and activities, and other unethical behavior such as failure to disclose conflicts of interest (Li & Cornelis, 2020; Resnik et al., 2015).

Research misconduct has been identified as a structural problem in research (Grieneisen & Zhang, 2012). In this regard, previous research in Europe has shown that a considerable proportion of researchers have engaged in scientific misconduct. An Italian study found that 21.2% of the researchers surveyed admitted to some form of misconduct (Mabou Tagne et al., 2020), and even higher rates were found in studies carried out in Spain (43.3%) (Candal-Pedreira et al., 2023a, 2023b) and Norway (39.5%) (Kaiser et al., 2021). In a 2009 meta-analysis of researcher surveys conducted worldwide, 2% of researchers acknowledged having fabricated data and 33.7% admitted to other questionable practices (Fanelli, 2009).

The increase in retractions due to scientific misconduct highlights the need to establish mechanisms and strategies that are truly effective in preventing, detecting, and even sanctioning these unethical behaviors. In the European context, there is no supranational body that supervises research in cases where research misconduct is suspected. Only 14 out of 46 countries have national ethics committees to promote research integrity, only six of these committees have the power to investigate, and none can take disciplinary action (Candal-Pedreira et al., 2021). This becomes even more relevant considering that previous research has concluded that countries with policies regulating scientific misconduct have a lower likelihood of having publications retracted (Fanelli et al., 2015).

Moreover, previous research has shown that the authors’ country of origin is a contributing factor to retractions and scientific misconduct. In Portugal and Brazil, 60.1% and 55.9% of retractions, respectively, were due to misconduct, with plagiarism being the primary reason (Candal-Pedreira et al., 2023a, 2023b). Other studies have pointed out that fraud or suspicion of fraud is the main reason for misconduct-related retractions in the USA and Germany, whereas duplication is more common in China and Turkey (Fang et al., 2012). A study conducted in Brazil (Stavale et al., 2019a, 2019b) concluded that the most frequent type of scientific misconduct was plagiarism, as did another study conducted in India (Elango et al., 2019). Amos (2014) concluded that Italy had the highest ratio of retractions for plagiarism, whereas Finland had the highest ratio of retractions for duplication. Taking this into account, it is important to monitor retractions and categorize them by country, since analyzing retractions is a tool for identifying problems related to research misconduct within a particular country.

Previous studies have analyzed scientific misconduct in Europe (Marco-Cuenca et al., 2021) or in samples including European countries (Zhang et al., 2020). Marco-Cuenca et al. focused only on the analysis of fabrication, falsification and plagiarism in both biomedical and non-biomedical papers, while Zhang et al. included only biomedical papers and assigned a single retraction reason to each article. Marco-Cuenca et al. (2021) concluded that biomedical research accounts for the majority of retractions due to scientific misconduct. Moreover, fraud in biomedical research can have serious consequences (Stern et al., 2014), as fraudulent results may be incorporated into clinical practice. It is worth noting that retracted articles usually have multiple retraction reasons, and analyzing all of them can provide valuable information. However, to the best of our knowledge, no previous study has comprehensively analyzed the reasons for retraction, i.e., taking all of them into account, in the biomedical field. Therefore, this study analyzed biomedical research papers by European-affiliated authors retracted from 2000 to 2021, in order to characterize research misconduct in Europe and identify trends in the countries with the highest number of retractions due to misconduct.

Methods

Study design and data extraction

We conducted a cross-sectional study of papers retracted for any reason from 01/01/2000 through 30/06/2021 whose corresponding author was affiliated with a European institution, using the Retraction Watch database as our main data source (The Retraction Watch Database). This database has been used in previous research with objectives similar to ours (Marco-Cuenca et al., 2021), and its functioning has been described previously (Candal-Pedreira et al., 2022). In summary, it is the most comprehensive database on retractions, as its coverage is larger than that of other databases, such as PubMed. Retraction Watch identifies retractions by accessing various sources and assigns causes based on retraction notices, misconduct investigations, and input from PubPeer. The last access to the Retraction Watch database was on 27/09/2021. We included biomedical original articles, reviews, case reports and letters. We excluded expressions of concern and corrections, papers falling outside the field of biomedical sciences, non-research papers such as commentaries and editorials, and papers not published in English, Spanish or Portuguese.

The country assigned to each included paper was based on the corresponding author’s affiliation. For study purposes, the term “Europe” was construed as encompassing current European Union Member States plus the United Kingdom, Norway, Iceland, Liechtenstein, Serbia, Turkey, Russia, Switzerland, and Belarus.

In order to include retractions pertaining to the health sciences, we selected those categorized as “Basic Life Sciences (BLS)” and/or “Health Sciences (HSC)” in the “Subject” field of the Retraction Watch database. The “Basic Life Sciences (BLS)” heading covers specialized fields such as biology, microbiology, toxicology, environmental sciences, biochemistry, and virology, while the “Health Sciences (HSC)” category includes all medical specialties (medical, surgical, and cross-disciplinary specialties such as pathological anatomy and clinical analysis).
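As an illustration, this inclusion filter can be expressed in a few lines of code. The sketch below assumes the Retraction Watch export has been saved as a CSV file with a “Subject” column containing the database’s subject tags; the file name and the exact tag format are assumptions for illustration, not the study’s actual pipeline.

import pandas as pd

# Hypothetical export of the Retraction Watch database
df = pd.read_csv("retraction_watch_export.csv")

# Keep papers tagged as Basic Life Sciences (BLS) and/or Health Sciences (HSC)
# in the "Subject" field (tags assumed to contain the "(BLS)" / "(HSC)" codes)
biomedical = df[df["Subject"].str.contains(r"\(BLS\)|\(HSC\)", na=False)]

print(f"{len(biomedical)} biomedical retractions selected out of {len(df)}")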

The following Retraction Watch variables were extracted for each retracted paper included: title of paper; type of paper (original paper, systematic review, meta-analysis, letter, guidelines, case report); date of paper publication and retraction; number of authors; country of corresponding author; corresponding author’s affiliation (university, hospital, research center, health center, industry, independent); and reason for retraction.

In addition, data relating to the journal of publication (quartile, category, Journal Citation Indicator and impact factor (IF) of the year of publication) were extracted from Journal Citation Reports (Clarivate Analytics).

Retraction causes were obtained from the Retraction Watch database and verified against the publishers’ retraction notices; when a notice was unavailable, the information recorded by Retraction Watch was used. All the reasons identified and labeled by Retraction Watch were taken into account.

For the categorization of the reasons for retraction, we relied on international documents related to scientific integrity and on the practices considered misconduct or unacceptable/questionable practices by different bodies (ALLEA, 2023; COPE, 2019). In this study, we consider both the practices included in the traditional definition of research misconduct (fabrication, falsification, and plagiarism) and other practices commonly referred to as unacceptable or questionable. No distinction is made in this study between unacceptable/questionable practices and research misconduct, and we refer to all such practices under the term “research misconduct”. The reasons for retraction were categorized as follows:

  • No research misconduct: this group included papers identified as having various types of errors unrelated to research misconduct, as well as retracted papers in which a reason for research misconduct had not been explicitly identified (all kinds of errors, such as method errors, imaging errors or cell line contamination, and unreliable data and/or results because of error).

  • Research misconduct was classified into the following categories: (a) falsification/fabrication of images, data and/or results, false peer review, paper mills and manipulation of results; (b) duplication: papers, images or parts of the text published in other journals by the same authors, “salami slicing”, or duplication of data; (c) plagiarism, whether of complete papers or parts of these, including text and images; (d) problems with authorship and/or affiliation: problems of copyright, dubious authorship, lack of approval of author or third party, objections by the company or institution, or false authorship or affiliation; (e) unreliable data because of misconduct: unreliable data or images, original data not provided, problems of bias or lack of balance, or problems with data; (f) unreliable results because of misconduct; (g) ethical and legal problems: breach of ethics by the author, conflict of interests, lack of informed consent, civil or criminal proceedings, lack of approval by the competent bodies; and, (h) any case in which no specific reason was given but the Retraction Watch database stated that the paper had been retracted for “misconduct”.

  • Reason not specified: papers retracted without a specific reason stated in the Retraction Watch database or the notice of retraction (those categorized as “withdrawn” or “retracted” without further explanation, and those stating “unreliable data” or “unreliable results” without any other reason suggesting either misconduct or its absence).

We excluded two articles with “miscommunication by the author, company, institution or journal” as the only reason for retraction, due to the impossibility of categorizing them in the established groups.

Papers may be associated with more than one reason for retraction; the Retraction Watch database records every applicable reason for each paper. Hence, we took each of these reasons into account independently and used the total number of reasons as the unit of measure for our analysis.
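A minimal sketch of how this categorization and reason-level counting might be implemented is shown below, assuming each record carries a semicolon-separated “Reason” field; the keyword-to-category mapping and file name are illustrative only and do not reproduce the exact rules applied in the study.

import pandas as pd

# Illustrative (not exhaustive) mapping from Retraction Watch reason labels to study categories
CATEGORY_KEYWORDS = {
    "falsification/fabrication": ["falsification", "fabrication", "fake peer review", "paper mill"],
    "duplication": ["duplication", "duplicate"],
    "plagiarism": ["plagiarism"],
    "authorship/affiliation": ["authorship", "copyright", "affiliation"],
    "unreliable data (misconduct)": ["unreliable data", "original data not provided"],
    "unreliable results (misconduct)": ["unreliable results"],
    "ethical/legal": ["ethic", "informed consent", "conflict of interest", "legal"],
    "misconduct, unspecified": ["misconduct"],
}

def categorize(reason: str) -> str:
    # Map one reason label to a study category using simple keyword matching
    r = reason.lower()
    for category, keywords in CATEGORY_KEYWORDS.items():
        if any(k in r for k in keywords):
            return category
    return "no misconduct / not specified"

# Each paper may list several reasons; explode them so each reason is counted independently
papers = pd.read_csv("retracted_biomedical_papers.csv")  # hypothetical extraction of included papers
reasons = (papers.assign(Reason=papers["Reason"].str.split(";"))
                 .explode("Reason"))
reasons["Category"] = reasons["Reason"].str.strip().map(categorize)
print(reasons["Category"].value_counts())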

Statistical analysis

First, we performed a descriptive analysis of biomedical retractions in Europe, broken down by cause of retraction (honest research error versus research misconduct), with categorical variables expressed as absolute and relative frequencies, and quantitative variables as medians and interquartile ranges (25th to 75th percentile). The chi-square test was used to assess differences between groups.
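For instance, the comparison of a categorical characteristic (such as journal quartile) between the misconduct and honest-error groups amounts to a chi-square test on the corresponding contingency table. The sketch below uses scipy; the file and column names are illustrative, not those of the study’s dataset.

import pandas as pd
from scipy.stats import chi2_contingency

# One row per retracted paper, with its retraction group and journal quartile (hypothetical file)
papers = pd.read_csv("retracted_biomedical_papers.csv")

# Contingency table: retraction group (misconduct vs. honest error) x journal quartile
table = pd.crosstab(papers["group"], papers["journal_quartile"])

chi2, p_value, dof, expected = chi2_contingency(table)
print(f"chi2 = {chi2:.2f}, df = {dof}, p = {p_value:.4f}")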

Then, we excluded retractions due to honest research error and those in which no specific reason was given, focusing on retractions due to research misconduct. To compare countries and years for a given reason for retraction, we calculated rates by dividing the number of papers retracted for that reason by the total number of papers published in that country and year, expressed per 100,000 publications. The total number of papers assigned to each country and year was extracted from PubMed, with the search filtered by individual country and year of the study period. To characterize the trend of retractions for research misconduct, both for European countries overall and for the countries ranked highest by number of retractions, a cross-sectional analysis was performed at 5-year intervals. We calculated the number of papers ascribed to each of the main reasons and expressed the results using the total number of papers as the common denominator. The Mann–Kendall test was used to analyze trends in retractions in European countries overall, by main reason for retraction. For the year-to-year comparison among countries and by main reason, the trend analysis was limited to 2020, since data for 2021 only cover the first half of the year.
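A minimal sketch of the rate calculation and trend analysis is given below, assuming yearly retraction counts (per reason and country) and yearly PubMed publication totals are already available as tables; the Mann–Kendall trend test is approximated here by Kendall’s tau of the yearly rate against the year, and all file and column names are assumptions for illustration.

import pandas as pd
from scipy.stats import kendalltau

# Hypothetical inputs: retraction counts per country/year/reason, and PubMed totals per country/year
retractions = pd.read_csv("misconduct_retractions_by_country_year_reason.csv")  # country, year, reason, n_retracted
pubmed_totals = pd.read_csv("pubmed_totals_by_country_year.csv")                # country, year, n_published

# Rate of retraction for a given reason per 100,000 publications in that country and year
rates = retractions.merge(pubmed_totals, on=["country", "year"])
rates["rate_per_100k"] = 100_000 * rates["n_retracted"] / rates["n_published"]

# Europe-wide trend for one reason (2000-2020); assumes zero counts are present as explicit rows
europe = (rates[rates["reason"] == "duplication"]
          .groupby("year", as_index=False)
          .agg(n_retracted=("n_retracted", "sum"), n_published=("n_published", "sum"))
          .query("year <= 2020"))
europe["rate_per_100k"] = 100_000 * europe["n_retracted"] / europe["n_published"]

# Monotonic-trend test: Kendall's tau between rate and year (stand-in for the Mann-Kendall test)
tau, p_value = kendalltau(europe["year"], europe["rate_per_100k"])
print(f"Kendall tau = {tau:.2f}, p = {p_value:.3f}")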

We described the trend in misconduct-related causes of retraction across the study period, analyzed the main variations between countries with at least 10 retractions, and examined the main distinguishing patterns by reference to the corresponding author’s country of affiliation. In the results, detailed information is provided for the 4 countries with the highest number of retractions, while in the supplementary material, data for the remaining countries with at least 10 retractions can be accessed.

All analyses were performed using the Microsoft Excel 2016 and Stata v.17 statistics software programs.

Ethical aspects

The manuscript was written following the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) reporting guideline for cross-sectional studies. The study did not require institutional review board approval because it was based on publicly available information and involved no patient records.

Results

We included 2069 papers retracted from 01/01/2000 to 30/06/2021 by authors affiliated with European institutions identified in the Retraction Watch database (Fig. 1). The number of retractions was 10.7 per 100,000 publications in 2000 and 44.8 per 100,000 publications in 2020. Overall, 1383 (66.8%) retracted papers involved some type of research misconduct, while 322 (15.6%) were retracted due to honest research error. Between the two groups, there were differences in type of publication (p < 0.001), journal quartile (p = 0.002), number of categories in which the journal is indexed (p < 0.001) and number of authors (p < 0.001) (Table 1).

Fig. 1

Flow chart of sample selection

Table 1 Main characteristics of papers retracted due to research misconduct and honest research error

Causes for retractions due to research misconduct

Among the 1383 papers retracted because of research misconduct, we found 2466 reasons for retraction (an article may have more than one cause for retraction). Of these, the most frequent reasons were duplication (n = 542, 22.0%) and misconduct for which no subtype was specified (n = 386, 15.7%) (Table 2).

Table 2 Papers retracted due to any cause of research misconduct in Europe across the study period (2000–2021)

The most common causes of misconduct-related retraction varied over time (Fig. 2). In 2000, the main causes were duplication, problems relating to authorship and affiliations, and ethical and legal problems (2.5 per 100,000 publications). Duplication remained the main cause in 2010, 2015 and 2020 (7.6, 15.3 and 8.6 per 100,000 publications, respectively). Across the period 2000 through 2020, a significant increase was observed in falsification and/or fabrication (p = 0.02), which rose from 0.8 to 5.9 per 100,000 publications. The increase in plagiarism was also significant (p = 0.03), rising from 0.8 per 100,000 publications in 2000 to 27.2 per 100,000 in 2010, with a subsequent downturn until 2020. In 2020, the main causes of research misconduct retractions were duplication (8.6 per 100,000 publications) and unreliable data because of misconduct (8.4 per 100,000 publications).

Fig. 2

Trend in the main causes of research misconduct in Europe (number of papers retracted per 100,000 papers)

Research misconduct by country

Four countries accounted for the largest number of retracted papers during the study period: the United Kingdom, Germany, Italy, and Spain (Table 3). In the United Kingdom, falsification and/or fabrication was identified as the main cause of misconduct in 2000, 2005, 2010 and 2020 (9.7, 3.7, 4.6 and 6.2 per 100,000 publications, respectively), whereas in 2015 the main cause was ethical and legal problems (8.8 per 100,000 publications). In the German case, duplication was the main retraction reason in 2000 (4.3 per 100,000 publications). In 2010 and 2015, 14.1 and 11.2 articles per 100,000 publications were retracted because of falsification and/or fabrication, while in 2020 this figure fell to zero; in that year, duplication and problems relating to authorship and affiliations were the main causes (2.1 per 100,000 publications). In the case of Spain, falsification and/or fabrication and ethical and legal problems were the main causes of retraction in 2005 and 2010, respectively. In subsequent years, the main cause was duplication, rising from 5.0 articles per 100,000 publications in 2015 to 13.2 in 2020. In Italy, retractions due to plagiarism accounted for 16.7 per 100,000 publications in 2010, and in 2020 the main cause of misconduct was duplication, with 15.6 articles per 100,000 publications.

Table 3 Trend in the main causes of research misconduct in European countries with the highest number of retractions (number of retracted papers per 100,000 publications)

This information can also be found in Supplementary Table 1 for all years of the series, for countries with 10 or more retractions.

Discussion

Our study found that research misconduct was the primary cause of paper retraction in Europe from 2000 to 2021, but that the specific reasons varied over time and by country. This is the first study to systematically analyze research misconduct across European institutions taking into account an expanded definition of misconduct beyond FFP, as used in this type of study.

Our findings indicate that research misconduct has become more prevalent in Europe over the last two decades, even after adjusting for the increase in the number of papers published. The reasons for retraction due to misconduct also changed over time in all countries studied. Initially, copyright and authorship issues were the primary causes, but their prevalence decreased significantly by the end of the period, possibly due to the implementation of authorship control systems and increased researcher awareness.

Since 2010, duplication has been the main reason for retraction in Europe, although the prevalence of plagiarism and duplication may be underestimated. These results agree with other studies that report a higher prevalence of plagiarism compared with other types of misconduct (Marco-Cuenca et al., 2021; Mousavi & Abdollahi, 2020; Stavale et al., 2019a, 2019b). While the incipient use of duplication-detection software in the early 2000s was said to be the cause of an increase in the identification of plagiarism (Van Noorden, 2011), its continued use should have contributed to its decline. In the same vein, lack of access to these tools in low-income countries may explain why copyright and plagiarism issues remain the primary reason for research misconduct-related retractions in those regions (Rossouw et al., 2020). It is essential for plagiarism control and surveillance systems to stay one step ahead: tools for detecting plagiarism and duplication should be used routinely, both for scientific papers and for dissertations by university undergraduate and postgraduate students (Bordewijk et al., 2021; Fischhoff et al., 2021; Mousavi & Abdollahi, 2020).

It is a matter of concern that, in addition to duplication, the use of unreliable data has been identified as a growing threat in some countries, such as Italy (Capodici et al., 2022; Perez-Neri et al., 2022; Rapani et al., 2020), where the proportion of papers retracted for unreliable data increased dramatically in 2020. This rise may be due to journals using the phrasing “unreliable data” or “invalid data” when they lack the resources to investigate, or to the massive retractions of paper-mill publications that have taken place in recent years (The Retraction Watch Database).

Individualized analysis of the four countries with the highest number of retractions reveals interesting patterns that have not been previously described, and shows that retraction patterns differ between countries. Falsification and/or fabrication is the main reason in the United Kingdom, whereas in Spain and Italy duplication ranks as the leading cause of research misconduct. This may suggest a difference between southern European countries and northern/central European countries, and is in line with other studies conducted in Europe. Thus, a study undertaken in Scandinavian countries found that the proportion of cases corresponding to fabrication and falsification exceeded the proportion corresponding to plagiarism (Hofmann et al., 2020). Another study conducted in Belgium found a prevalence of 40% for guest authorship and 4% for plagiarism (Godecharle et al., 2018). In Portugal, 12.9% of retractions were due to plagiarism and duplication (Candal-Pedreira et al., 2023a, 2023b).

Various explanations for differences in research misconduct between countries can be found in the scientific literature. According to Fanelli et al. (2015), authors from Germany, Australia, China, and South Korea have a higher likelihood of having their papers retracted than authors from the USA, while authors from France and the Netherlands have a lower likelihood. This may be due to national-level policies introduced to manage misconduct: countries with such policies and with structures to supervise misconduct experience fewer retractions. This supports the need for a central European scientific integrity office to which national offices can refer cases (Candal-Pedreira et al., 2021).

Furthermore, the sociocultural context may also be a determinant of unethical research conduct (Asplund, 2019; Aubert Bonn et al., 2017). It is possible that southern European countries lead in retractions due to duplication and plagiarism because plagiarism tends not to be perceived there as research misconduct (Fanelli, 2009). In addition, previous studies have observed that being in the early stages of a research career is a predictive factor both for retraction and for engaging in misconduct (Asplund, 2019; Aubert Bonn et al., 2017; Fanelli et al., 2015; Gopalakrishna et al., 2022). This suggests that education in scientific integrity is inadequate and should be reinforced, especially for undergraduate and postgraduate students (El Bairi et al., 2024; Ljubenkovic et al., 2021).

This study’s principal strength resides in the sample size achieved by using the Retraction Watch database as its data source (Kocyigit & Akyol, 2022; Marco-Cuenca et al., 2021). This database contains over 50,000 retractions and is currently the best source of information on retractions thanks to its high coverage, so we believe that the number of retracted papers missed by this study is minimal. Another strength is that this study considers all the reasons identified by Retraction Watch as causes of retraction; in our opinion, this ensures that no reason is underestimated, thus reflecting the reality of the problem more accurately.

This study also has limitations. Some retracted papers could not be accessed, a frequent limitation in this type of study (Bordewijk et al., 2021; Frias-Navarro et al., 2021; Marco-Cuenca et al., 2021); retracted papers should not be deleted but rather identified and flagged as retracted, as recommended by the Committee on Publication Ethics (COPE) (COPE, 2019). A further limitation lies in the fact that attribution of misconduct is based on the corresponding author’s affiliation, which may overrepresent the misconduct attributed to European institutions in articles with authors from several countries. An additional limitation is the lack of detailed information on countries with fewer than 10 retractions; however, given their limited sample sizes, we believe that presenting detailed data for these countries would hinder rather than aid the comprehension of our results.

In conclusion, this study shows that the main reason for retraction in Europe is research misconduct, and that the causes of such misconduct have changed over the last 20 years. Currently, duplication and unreliable data are the main reasons for misconduct-related retractions in Europe. Furthermore, there are differences in the causes of research misconduct between the countries with the highest numbers of retractions. In view of this, the causes of research misconduct should be analyzed on an individualized basis within the context of each country. Having such information is essential for implementing mechanisms of control, surveillance and discipline that ensure that scientific research fulfills the highest quality standards.