Background

Scientific investigation is difficult and fallible, and its practitioners are only human. Results believed to be firm may turn out to be irreproducible, outright wrong, or even faked. When such problems are detected, the article is retracted. Retraction is a dramatic event: it damages careers and may incur large costs [1, 2]. How publishers should handle retractions is currently debated [3].

Retraction of a published article is a rare event, but its incidence is on the rise, from roughly one in 100,000 articles before the year 2000 to one in 10,000 in the last decade [4]. The reasons for retraction vary and fall roughly into two categories: scientific misconduct on the one hand, and error or lack of reproducibility on the other. Earlier research found error to be the main cause of retraction [5]. Later studies find that misconduct is the main cause [6–8], although Couzin and co-workers point out that even outright fraud does not always lead to retraction [9]. The behaviour of both authors and institutions is said to account for the rise of misconduct among retractions [10]. A few repeat offenders heavily bias retraction rates and are said to be responsible for roughly half of all retraction cases [4].

Although the retraction of a paper is normally interpreted as a signal that the results of the paper are flawed, this is not guaranteed. In our case study (see below), the matter appears far from settled. Even fraud may turn up results that are later found to be correct. We will therefore not speak of “flawed” or “erroneous” results but rather of “retracted” results.

An important question is what damage is done by a retracted article. A retracted result is formally no longer part of the body of science. Therefore, retractions must be advertised to prevent the spreading of retracted results. The blog Retraction Watch provides an invaluable service in this respect. A review of the literature on retractions has recently been published by the initiators of the blog, Marcus and Oransky [11]. Retracted results pollute their citation environments [11, 12]. Examples of a retracted result still being cited years after its retraction have been reported [13–15]. A number of studies report on how often retracted articles are cited both before and after retraction [16–18]. Where one report finds that citation rates drop by approximately 35 % after retraction [16], another finds no significant decrease [17]. Retracted and non-retracted articles alike are subject to attention decay, with the result that most are eventually largely forgotten [19].

Particularly in the medical literature, there is the danger that patients are put at risk by what is concluded in articles that later have to be retracted. The Wakefield case is probably the most famous example. Wakefield and co-workers claimed to have found an association between measles vaccine and autism [20] but their article was retracted because of fraud 12 years later [21]. The false association has lingered on since then and may have caused unnecessary deaths through parents refusing measles vaccination of their children [22]. Treatments based on retracted articles put patients at risk [23]. Neale and co-authors find no such cases in their study involving 102 articles retracted because of misconduct [17], while Begley and co-authors conclude the opposite [24] and Couzin and co-workers provide a concrete example [9].

Chen and co-workers point to the following scenario [25]. Suppose an article A is retracted and that A has been cited in a positive way by B, C and D. In the worst case, A’s retracted findings support conclusions drawn in these papers. B, C and D, in turn, are cited by yet other papers, and A’s retracted results may again be essential ingredients of the arguments of those papers. Because A’s conclusions are retracted, the conclusions in all these papers should be re-examined. The large-scale investigation conducted by Chen and co-workers precluded them from inspecting individual articles [25]; consequently, they could not find concrete examples of their scenario. Fulton and co-workers, on the other hand, have studied a single case in detail but concentrated only on articles that directly cite the retracted article [26]. Like them, we focus on a single case because this allows us to study the contents of the papers involved. We study articles that directly cite a retracted article both before and after retraction. Unlike Fulton and co-workers [26], we identify the entire citation environment of the retracted paper. We thus also inspect articles that are connected to the retracted article through a chain of citations, in order to find out whether in this case the scenario identified in [25] has become a reality.

Methods

We have selected a particular paper published in December 2012 because it appeared in Nature and because it deals with necrosis and with sirtuins (a class of proteins). Briefly, in [27] (called “the Narayan paper” from now on), Narayan and co-workers claim that inhibition of sirtuin-2 blocks cellular necrosis induced by TNF-α. The Narayan paper was retracted in February 2014 [28] when a number of groups reported that they were unable to reproduce its findings [29]. Meanwhile, the National Institutes of Health, the parent organisation of Narayan and most co-authors, had advertised an invention based on the Narayan paper as being available for licensing [30]. We have not found a retraction of this notice. When we inspected the list of publications at the personal website of the senior (last) author, T. Finkel, in September, 2015, the Narayan paper was there but the retraction went unmentioned. What is more, two papers published too late to be included in the present research suggest that the results of the Narayan paper are not flawed after all [31, 32]. There is no overlap between the authors of these two papers and the authors of the Narayan paper, nor is there any overlap with the authors of [29], the paper that prompted the retraction.

We need some lightweight formal apparatus to describe our definitions. We base these definitions on the primitive relation Cites(x,y) with the obvious meaning that document x cites document y. A citation chain is an ordered list $L = \langle D_1, \ldots, D_i, D_{i+1}, \ldots, D_n \rangle$ such that

$$ \forall i \ \left(D_{i} \in L \wedge D_{i+1} \in L\right) \Rightarrow \text{Cites}(D_{i+1}, D_{i}) $$

In other words, every document in the chain (except, for trivial reasons, the last) is cited by the document following it. $D_2$, the second document in the chain, is a document that directly cites the first document in the chain, $D_1$. All documents further along the citation chain, that is, all $D_i$ with $i > 2$, will be said to indirectly cite $D_1$, even though these documents do not acknowledge the existence of $D_1$. In our case, $D_1$ is always the Narayan paper.
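These definitions carry over almost verbatim into Prolog, the language we used to process the citation data (see below). A minimal sketch, with toy facts and predicate names of our own choosing:

    % cites(Citing, Cited) mirrors the primitive relation Cites(x, y).
    % The two facts below are toy examples, not real data.
    cites(b, a).                   % b directly cites a
    cites(c, b).                   % c cites b, hence c indirectly cites a

    :- table indirectly_cites/2.   % tabling (SWI-Prolog) guards against
                                   % the citation cycles we report below

    % indirectly_cites(X, Y): X cites Y through a chain of length >= 2.
    indirectly_cites(X, Y) :-
        cites(X, Z),
        (   cites(Z, Y)
        ;   indirectly_cites(Z, Y)
        ).

With these toy facts, the query indirectly_cites(c, a) succeeds, matching the definition of indirect citation above.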

For a given paper P, we define the citing collection C as the set of all papers that either directly or indirectly cite P:

$$ C = \{ x \ | \ \text{Cites}(x,P) \ \vee \ \exists y \ (y \in C \wedge \text{Cites}(x, y)) \} $$

Finally, we define the citation network of P as the directed graph 〈N,E〉 with

$$ N = \{ P \} \cup C $$

with C the citing collection of P as above and

$$ E = \{ \langle x,y \rangle \ | \ x \in N \wedge y \in N \wedge \text{Cites}(x,y) \} $$

In our case, P is the Narayan paper.
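For concreteness, these set definitions translate into a short Prolog sketch along the same lines as above; node/2 and edge/3 spell out N and E, and the predicate names are again ours:

    :- table in_citing_collection/2.   % protects against citation cycles

    % in_citing_collection(X, P): X is in the citing collection C of P,
    % i.e. X directly or indirectly cites P.
    in_citing_collection(X, P) :-
        cites(X, P).
    in_citing_collection(X, P) :-
        cites(X, Y),
        in_citing_collection(Y, P).

    % node(P, X) and edge(P, X, Y) together spell out the citation
    % network <N, E> of P as defined above.
    node(P, P).
    node(P, X) :- in_citing_collection(X, P).

    edge(P, X, Y) :-
        node(P, X),
        node(P, Y),
        cites(X, Y).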

We inspected citations in two sessions, the first in March, 2014, and the second 1 year later, in March, 2015. In both sessions, we used Scopus, Elsevier’s search engine for scientific publications. Starting with the papers that cite the Narayan paper, we followed all citations until we arrived at a paper that at the time was not, or not yet, cited. We had Scopus produce lists of citing papers in BibTeX format. BibTeX needs a unique identifier for each entry. Scopus constructs this identifier by concatenating the name of the first author, the year of publication, and the page number at which the article starts. Thus, the identifier for the Narayan paper becomes “Narayan2012199”. For reasons having to do with limitations of the programmes we used, we had to turn the identifiers allocated by Scopus into a simple ASCII form by removing diacriticals and non-alphanumeric characters: “Martínez-Redondo” becomes “MartinezRedondo”, “Nührenberg” becomes “Nuhrenberg”, and so on. The way in which Scopus constructs its identifiers and our further simplification of them may lead to the same identifier pointing to two (or even more) different articles, but in the restricted set used for the present experiment we have not found such a clash. We used the programming language Prolog to process the files of citing papers produced by Scopus. We thus obtained two complete citation networks, one for 2014 and one for 2015.
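The identifier clean-up can be sketched as follows in SWI-Prolog. The diacritical table shown is illustrative only; the real mapping has to cover every diacritical encountered.

    % simplify_id(+Raw, -Simple): map diacriticals to plain letters and
    % drop every remaining non-alphanumeric character.
    simplify_id(Raw, Simple) :-
        atom_chars(Raw, Chars),
        maplist(strip_diacritical, Chars, Mapped),
        include(alnum_char, Mapped, Kept),
        atom_chars(Simple, Kept).

    alnum_char(C) :- char_type(C, alnum).

    strip_diacritical(C, Plain) :- diacritical(C, Plain), !.
    strip_diacritical(C, C).

    diacritical('í', i).   % illustrative entries only
    diacritical('ü', u).
    diacritical('é', e).

A query such as simplify_id('Martínez-Redondo', Id) then yields Id = 'MartinezRedondo'.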

We read all articles that directly cite the Narayan paper to find out which text accompanies the citation. In particular, we were interested to learn whether the retraction had been acknowledged. Furthermore, to find out whether the results reported in the Narayan paper had spread to papers that indirectly cite it, we reasoned that any such paper should match keywords such as “sirt”, “sirtuin”, “SIRT2”, “necrosis”, “necrotic”, “necroptosis”, and the like. We found that there is sometimes a time gap between the publication of an article and the moment it is incorporated into the Scopus database. Allowing for this latency, in July, 2015, we used Scopus to perform a literature search for articles published after 2011 that match the search term sirt* AND necro*, where the asterisk acts as a Kleene star, matching zero or more non-whitespace characters. We then determined the overlap between this set, on the one hand, and the 2014 and 2015 citing collections, on the other. Any article that is a member of the overlap set and, furthermore, does not directly cite the Narayan paper is a candidate for inspection for spreading of the retracted result through a citation chain; see the sketch below. We read all those papers, too, to find out whether the results of the Narayan paper are mentioned as such and, if so, whether we can trace this back to the Narayan paper by following the citation chain.
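In the sketch notation introduced above, this selection is a small query; keyword_match/1 (standing for membership of the Scopus search result) and narayan/1 are hypothetical predicates:

    % candidate(Doc): Doc matched the sirt* AND necro* search, sits in
    % the citing collection of the Narayan paper, yet does not cite it
    % directly; such papers must be inspected by hand.
    candidate(Doc) :-
        narayan(P),
        keyword_match(Doc),
        in_citing_collection(Doc, P),
        \+ cites(Doc, P).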

Results and discussion

Briefly, articles that directly cite the Narayan paper just repeat the (retracted) result, with two exceptions. By contrast, in papers that indirectly cite the Narayan paper there is no trace of the retracted result.

Results

In the two sessions, we collected two complete citation networks. The networks are not proper trees because citation cycles occur in both. The 2014 network (Fig. 1) is a subgraph of the 2015 network (Fig. 2). The growth is spectacular; see Table 1 for the main counts. The supplementary material contains, for every article that directly cites the Narayan paper, the sentences or passages that contain the citation (Additional file 1). The supplementary material also contains a complete specification of the 2014 and 2015 networks in the form of dot files [33] (Additional files 2 and 3). A visual rendering of the 2014 graph, split over two figures for readability, is provided as scalable PDF files (Additional files 4 and 5). The accompanying BibTeX file (Additional file 6) relates the identifiers in the dot files and in the figures to the complete bibliographic descriptions.
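Such dot files can be generated mechanically from the citation data; a minimal sketch of a generator in SWI-Prolog, again over the illustrative cites/2 facts:

    % write_dot(+File): dump all citation edges as a dot graph [33].
    write_dot(File) :-
        setup_call_cleanup(
            open(File, write, Out),
            (   format(Out, "digraph citations {~n", []),
                forall(cites(X, Y),
                       format(Out, "  \"~w\" -> \"~w\";~n", [X, Y])),
                format(Out, "}~n", [])
            ),
            close(Out)
        ).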

Fig. 1

The 2014 citation network for the Narayan paper. Node names have been replaced by dots. Every node stands for a paper. Every arrow stands for a citation relation. The arrow points from the citing paper to the paper that is being cited. The Narayan paper is represented by the blue circle with the N inside

Fig. 2

The 2015 citation network of the Narayan paper. The network is shown here as a combination of the 2014 network (blue) and the 2015 additions (red). As in Fig. 1, every node stands for a paper and every arrow stands for a citation relation. The Narayan paper is represented by the blue circle with the N inside

Table 1 Summary of counts; see the main text for details

The retraction of the Narayan paper is absent from the 2014 network even though 2 weeks had passed between the retraction and our Scopus search; Scopus displays some latency. In the two networks, about two thirds of the papers had not (or not yet) been cited at the time the network was collected. Of the papers that are cited, the median citation count is 1. The distribution of citation frequencies follows a Zipf law, in line with what has been reported in the literature [34]. In our case, only a few papers are cited more than once. The most often cited paper in both networks is the review paper by Kaczmarek and co-workers [35], which directly cites the Narayan paper. The Kaczmarek paper had collected 42 citations in 2014 and 111 citations in 2015. The next most often cited paper is the Narayan paper itself.
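For reference, a Zipf law here means that the citation frequency $f$ of the paper at rank $r$ (papers ranked by decreasing citation count) decays as a power of the rank:

$$ f(r) \propto r^{-s}, \qquad s > 0 $$

This pattern is consistent with what we observe: a median citation count of 1 alongside a handful of heavily cited papers such as [35].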

Of the 37 papers in the 2014 network that directly cite the Narayan paper, one is the paper that prompted the retraction [29], and one [36] is in fact a summary of the Narayan paper in the Reviews and Comments section of the Nature issue in which the Narayan paper was published. In the 2015 network, there is one further directly citing article, characterised as a note. For further data, see Table 1. With two exceptions, both from the 2015 network, none of the directly citing papers shows any awareness of the retraction, even though most papers of the 2015 network were published well after the retraction appeared. The first exception [37], an original contribution, calls the Narayan result “controversial” while citing [29]. The second [38], a review article, notes both [29] and the retraction itself.

The rapid expansion of the citation network generated by the Narayan paper is remarkable. It must be ascribed to the paper’s subject and to the fact that it appeared in Nature. Moreover, its exposure was enhanced by [36] in the Reviews and Comments section of the same Nature issue. This perhaps also explains why almost half of the primary citations are review articles. It is also evident that every review except one summarises the main finding of the Narayan paper as established fact. This is significant because being cited in a review is considered the first step in the canonisation of new knowledge; see also [14].

The Narayan paper is not only cited by reviews but also by original contributions. In these, the citation serves as background knowledge in the Introduction section or as relevant evidence in the Discussion section. We have found that in this group of papers, diverse aspects of the work reported in the Narayan paper are cited. In one case [39], part of the experimental method of Narayan and co-authors is cited. This raises the interesting question of whether retraction of a paper also means that its experimental methods have to be removed from the annals of science.

In both citation networks, we have looked for papers that directly cite the Narayan paper and share at least one author with it. We have identified two such papers in the 2015 network: [40] (published December 8, 2013, corrected December 16, 2013, with an erratum dated February 2014) and [41] (published online February 11, 2014). The dates of publication and correction are quite close to the date of retraction of the Narayan article, February 27, 2014. In neither paper, nor in the correction and erratum to the first, can one find any indication of the impending retraction.

The July 2015 search in Scopus for papers published after 2011 that match the search term sirt* AND necro* yielded 391 articles. (Precisely the same search on PubMed on the same day yielded 120 articles.) We checked for the presence of the Narayan paper, its retraction, and [29]: all three are present in the search result. For the overlap with the 2014 and 2015 citing collections, see Table 1. Obviously, the 2014 overlap is a subset of the 2015 overlap. The 2015 overlap has 10 papers that indirectly cite the Narayan paper: [42–45] (the last of which is the only one also overlapping with the 2014 collection), [46] (in Chinese), and [47–51]. We read all 10 papers. Of these, only one contains a passage that might refer to the Narayan paper without citing it ([43], p. 91):

  • “Previous studies have also indicated that SIRT2 is a mediator of cell death. In particular, SIRT2 inhibition was shown to decrease the injury in cellular and animal models of PD and HD [2].”

The only citation that accompanies this statement (“[2]”, which corresponds to our reference [52]) occurs in neither of the two citing collections for the Narayan paper. The passage we quote cannot plausibly count as a reference to the Narayan paper. Our inspection of the papers that indirectly cite the Narayan paper thus turned up no instance of propagation.

Discussion

Even when a paper has been retracted, it can be cited in good faith. Citing a paper before it is retracted is of course done in good faith. There is normally a time gap between publication of an article and its retraction. In one exceptional case, the gap was 24 years; the paper was still cited at that time [18].

The author who wants to avoid citing a paper that has been retracted will experience difficulties in finding out about the retraction [11]. Moreover, although Nature put the word “RETRACTED” in capitals and red print on every digital page of the Narayan paper once the retraction was a fact, a researcher may have recourse to the hard-copy issue of Nature or may have added the digital paper to a private collection before it was retracted [53]. In our experience, none of the popular search engines Google Scholar, Scopus and Web of Science adds a warning to a retracted paper in its list of search results. PubMed does, but in our experiments, PubMed was seen to retrieve far fewer documents than Scopus did. Chen and co-workers provide screenshots to support the claim that Google Scholar explicitly marks retracted papers [25]. Our screenshot, Fig. 3, shows that this is not done consistently. Although that particular Google Scholar search was performed more than a year after publication of the retraction, and although the title of the retraction is identical to that of the original article except for the one word “Retraction”, the retraction itself does not occur among the first six hits. (In fact, it did not even occur on the first page of search results.) In search engines in general, searching on title or author may or may not turn up the retraction in the result list, and if it does, adding a year of publication to the search criteria is almost always sufficient to hide the retraction. Search engines also have inevitable latency: in March, 2014, at least Scopus did not list the retraction in its search results when searching on “Narayan”, “NAD-dependent” and “deacetylase”; in September, 2014, the retraction was there. See also [54] for a discussion of search engines and retractions.

Fig. 3

Screenshot of a Google Scholar search. The search was performed in March 2015 and used the keywords at the top. The Narayan paper is first on the list. Clearly, it is not marked as being retracted. The retraction itself, even though its title is exactly that of the original preceded by the word “Retraction”, is also not among the results shown. It does turn up on the second page, however (not shown here)

Finding out about a retraction becomes even more difficult when we do not look for entire articles but for passages instead. Modern information retrieval research investigates so-called passage retrieval, the retrieval of relevant passages rather than entire articles ([55], ch. 13). A paper is always retracted as a whole even though parts of it may be unaffected by the reasons for retraction. To be useful for practising scientists, a passage retrieval search engine will have to incorporate provisions for retrieving the retracted status of the paper from which the passage stems. To enable search engines to do this, publishers will have to make the status known in a structured way readable by a computer programme.
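Once the status is available in machine-readable form, the provision itself is small. A sketch in the Prolog notation used above, where retracted/1 and passage_of/2 are hypothetical predicates standing in for a publisher-supplied status feed:

    % passage_status(?Passage, ?Doi, -Status): attach the retraction
    % status of the source article to every retrieved passage.
    passage_status(Passage, Doi, retracted) :-
        passage_of(Passage, Doi),
        retracted(Doi), !.
    passage_status(Passage, Doi, intact) :-
        passage_of(Passage, Doi).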

The very least that can be done is to keep track of retractions. The Retraction Watch blog provides an invaluable service here, but its contents are not yet available to automated methods. Recently, the blog announced it had received a grant to set up a database of retractions [56]. PubMed explicitly marks retractions. If this is done in a machine-readable way, PubMed’s retraction list is a good starting point for a database of retractions. Also, the commercial service CrossMark by CrossRef promises to keep its users informed of retractions [57]. CrossMark relies on the voluntary participation of publishers, and although the current list of participating publishers is impressive, it is by no means complete. We feel that journal publishers should have done this long ago and for free, because they publish both the original articles and their retractions.

With or without a database of retractions, it is feasible to automatically construct a citation network for a retracted paper. After all, any citation network is a subgraph of the graph defined by the citation relations identified by Scopus, Web of Science or Google Scholar. The programme can be written such that it continually monitors bibliographic descriptions added to the database, keeping the network up to date; see the sketch below. It would be interesting to find out the extent to which the published literature cites retracted papers, either directly or through a citation chain. The two citation networks for the Narayan paper suggest that the proportion of papers that occur in the citation network of a retracted paper may be considerably higher than one would expect. Our research suggests that we can concentrate on directly citing articles to find propagation of a retracted result. With current, off-the-shelf passage retrieval techniques, it is possible to extract the citing passages in such articles automatically.
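A minimal sketch of the monitoring step, assuming a dynamic predicate in_network/1 that holds the current membership of the network (the names are ours, for illustration):

    :- dynamic in_network/1.

    % absorb(+NewDoc): a newly indexed record joins the network as soon
    % as it cites a paper that is already a member.
    absorb(NewDoc) :-
        (   cites(NewDoc, Old),
            in_network(Old)
        ->  assertz(in_network(NewDoc))
        ;   true
        ).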

Authors of a previously published paper should be warned when one of their citations is retracted. For this to be feasible, a publicly accessible database of retractions is a prerequisite. Authors should then be given the opportunity to revise their paper if they think their conclusions are affected by the retraction. At the very least, they may want to flag the offending citation as retracted.
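Given such a database, determining who should be warned is a simple join; newly_retracted/1 and author_of/2 below are again hypothetical predicates:

    % must_warn(-Author, -Paper, -Retracted): Author published Paper,
    % and Paper cites an article that has just been retracted.
    must_warn(Author, Paper, Retracted) :-
        newly_retracted(Retracted),
        cites(Paper, Retracted),
        author_of(Author, Paper).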

Where automatic construction of citation networks is eminently feasible, assessing propagation beyond the primary citation in an automatic way is far more complicated, if possible at all. One possible route towards such a system exploits a proposal by Anicich to annotate every item in the list of references with markers indicating whether the citation supports the work, contradicts it, and so on [58]. Proponents of replacing normal text by hypertext documents have proposed similar markers for the relations connecting pieces of text; see for example [59].
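With such markers in place, the papers most at risk after a retraction could be selected mechanically. A sketch, assuming a hypothetical typed citation relation cites/3 in the spirit of Anicich’s proposal [58]:

    % cites(Citing, Cited, Stance): Stance records how the citation is
    % used, e.g. supports, contradicts or uses_method. The relation is
    % hypothetical; no such markers exist in today's reference lists.
    at_risk(Doc, Retracted) :-
        cites(Doc, Retracted, supports).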

A more thorough analysis would involve reasoning about the content of a paper. This presupposes that we have been able to translate what the paper says into a language that can be manipulated by a computer programme. Such a language is called a (knowledge) representation language ([60], ch. 12). Progress has been made in having a programme prepare such a translation (see, for example, [61]), but we are far from being able to capture the relevant parts of what a text says. Complex sentences, anaphora and modalities (“we believe”, “we think”, “it is plausible”, and so on) all pose difficulties that have not yet been solved for routine use. It is not clear on what timescale these issues will be solved to the extent that automatic assessment of the damage done by a retracted paper becomes possible.

Conclusions

To conclude, in line with what earlier authors have found [16–18, 26], propagation of retracted results through directly citing articles is a real scenario. On the other hand, in our case study, we have not seen propagation of a retracted result beyond those directly citing articles. Our result suggests that in this case authors display proper citing behaviour. More specifically, authors who publish on the relation between sirtuin-2 and necroptosis will cite the Narayan paper; in our study, such authors therefore end up in the list of directly citing articles. This is aided by the fact that the Narayan paper is highly visible: it was published in Nature and, moreover, an editorial comment drew readers’ attention to it. Although a single case study can of course never rule out that retracted results propagate through articles with indirect citations (the scenario of [25]), we think that in environments with accessible literature and proper citing behaviour, the spreading of retracted results through indirectly citing articles is not a probable event. In other words, the high visibility of a document published in a top-ranked journal makes it probable that its results spread, but those results can be linked to their source. Documents published in a low-tier journal, on the other hand, are not very visible. One may therefore speculate that such results do not spread as quickly, but that if they do, the link to their source may be lost. In all this, the citing behaviour in a scientific community is a key factor.

In the search for automated support for handling retractions, there appear to be two extremes, neither of which is attractive. One extreme is handling retractions fully by hand, which is impossible because the amount of labour involved is prohibitively large. The other extreme, handling retractions fully automatically, is currently infeasible and will remain so for some time to come. We therefore propose an approach that combines the best of both worlds: a highly interactive computer programme operated by domain experts. The computer is good at following citation chains and highlighting passages in which a primary citation occurs, while the domain expert is good at judging the impact of retracted results. Modern computing environments involving highly interactive, very large displays enable the expert to view a large amount of information simultaneously. When a lot of material has been collected this way, we may be able to answer questions such as the following: how many generations of citations must be followed before citations even further away can safely be ignored; is the influence of review articles indeed greater than that of original contributions; and, most importantly, are there original contributions whose conclusions have to be retracted because they crucially rely on assumptions that have been retracted? Finally, even for a single paper like the Narayan paper, following all citation chains is a lot of work. It seems worthwhile only if the results can be shared. A further question thus is how the results of such an exercise should be communicated.