Many attempts have been made to define nature of viruses and to uncover their origin. Our aim within this work was to show that there are different perceptions of viruses and many concepts to explain their emergence: the virus-first concept (also called co-evolution), the escape and the reduction theories. Moreover, a relatively new concept of polyphyletic virus origin called “three RNA cells, three DNA viruses” proposed by Forterre is described herein. In this paper, not only is each thesis supported by a body of evidence but also counter-argued in the light of various findings to give more insightful considerations to the readers. As the origin of viruses and that of living cells are most probably interdependent, we decided to reveal ideas concerning nature of cellular last universal common ancestor (LUCA). Furthermore, we discuss monophyletic ancestry of cellular domains and their relationships at the molecular level of membrane lipids and replication strategies of these three types of cells. In this review, we also present the emergence of DNA viruses requiring an evolutionary transition from RNA to DNA and recently discovered giant DNA viruses possibly involved in eukaryogenesis. In the course of evolution viruses emerged many times. They have always played a key role through horizontal gene transfer in evolutionary events and in formation of the tree of life or netlike routes of evolution providing a great deal of genetic diversity. In our opinion, future findings are crucial to better understand past relations between viruses and cells and the origin of both.
Nowadays, to give a concise definition of virus nature is troublesome. Researchers of different standpoints have proposed several interpretations. Viruses by their nature seem to be entities somewhere in between inert and living worlds . For decades viruses were simply considered as pathogenic biochemical entities composed of two major elements: nucleic acid (RNA or DNA) constituting their genome and protein coat (capsid). Many viral particles (virions) are even more complex and contain lipid-protein envelope or an additional capsid, and specific viral enzymes required for replication [2, 3]. On the other hand, viruses can also be considered as living organisms since upon infection of cells they turn them into virocells [4–6]. Moreover, a concept of a greater virus world has recently been formulated covering bona fide capsid-encoding viruses and other capsidless replicons such as plasmids, transpozons and viroids. The major feature of this world is not presence of a capsid but genetic, informational parasitism . These capsidless replicons were also named orphan replicons . Emergence of capsid coding sequences and proteins was a big evolutionary step as appearance of these vehicles to transfer and protect nucleic acids was one of prerequisites for evolution. A few years ago, a new division for all living organisms into two distinct groups has been proposed: ribosome-encoding organisms (REOs) and capsid-encoding organisms (CEOs) . Similarly to viruses, life itself is also difficult to define and throughout history of science from Aristotle to K. Ruiz-Mirazo definition of life has been modified many times and since life is a process and not a substance, it is challenging to confine “life” in a simple, yet exhaustive formula. A very detailed timeline with changing definitions of life or living beings is nicely depicted by Moreira and Lopez-Garcia . It is important to know these different explanations for the sake of further discussion presented herein.
The range of viral genome sizes spans three orders of magnitude and simple size-based distinction between viruses and cells valid for over a century cannot be used any longer after the discovery of giant viruses, also known as giruses . One of the smallest double-stranded DNA (dsDNA) viruses is hepatitis B virus (HBV) containing a 3.2 kb genome with only several genes. Even smaller “subviral” partner of HBV is human hepatitis delta virus (HDV), quite similar to viroids in many regards. It contains a 1.7 kb genome encoding one antigen and shows ribozyme activity [11, 12]. On the other hand, the largest dsDNA viruses, Pandoraviruses, have genomes of 2.5 Mb encompassing some of 2500 coding sequences . According to the Baltimore classification developed by David Baltimore in the early ‘70s, there are seven types of all known viruses depending on the nucleic acid content and its replication mode: dsDNA, ssDNA, dsRNA, (+)ssRNA and (−)ssRNA, ssRNA-RT and dsDNA-RT viruses, each with a different replication strategy within infected cells . Cellular forms of life use the canonical DNA-RNA-protein replication-expression pattern, whereas viruses are totally dependent on cells for multiplication and exploit all possible DNA and RNA interconversions . Retroviruses and hepadnaviruses can reverse transcribe their RNA to DNA and this process is rare to occur in cells, although exceptions have been recently reported such as telomere synthesis and presence of reverse transcriptase-related cellular genes in eukaryotes [16–18]. Eukaryotic telomeres essential for the linear chromosome organization most probably derived from an ancient retroviral activity .
Many researchers postulate that viruses are of polyphyletic origin and different RNA and DNA viruses derived independently as opposed to monophyletic cellular domains coming from one ancient ancestor LUCA (last universal common ancestor), which is a logical consequence of the binary mechanism of cell division . There is no physical ‘fossil record’ of viruses; virions persist for short time periods, and rapidly degrade leaving no direct trace of their existence. However, many viral genomes have always had the capacity to integrate into cellular genomes and the study of this genomic ‘fossil record’, called paleovirology, helps to understand the long-term evolutionary history of virus–host interactions . Viruses have been major players in the evolution by imposing high selection pressure on their hosts and manipulating the whole environment . According to recent hypotheses, viruses might have played a direct role in the origin of DNA and DNA replication mechanisms , cellular envelopes , of pathogenicity , alternative genetic codes  and formation of the three domains of life: Archaea, Bacteria and Eukarya , which were identified by 16/18S rRNA comparison . A previously described link between reverse transcriptase activity and telomeres indicates a possible early retroviral colonization of large dsDNA viruses, which are putative ancestors of the eukaryotic nucleus , although a different concept on the origin of nucleus was also reported .
With the advent of this new knowledge it was a bit unthoughtfully proposed by Carl Woese that cellular evolution could not be solely explained by the classical Darwinian mode of thinking . However, it has been recently pointed out that the core of Darwin concept of evolution relying on variation/selection processes is still sufficient to explain the history of life. Horizontal, also called lateral gene transfer (HGT or LGT) should be considered a special case of genetic variation along with mutations, recombination, and different kinds of ploidies and others [29–31]. All these processes enrich biodiversity and influence cellular evolution , and thus HGT is supplementary to vertical evolutionary mechanisms. If a proto-cell was simple and highly modular in organization, it implies that HGT could have played a greater role in evolution . This modularity of ancient RNA cells is somehow reflected by structure of current viral genomes built of major functional blocks of genes (modules): 1) replicon - ORFs involved in replication, 2) structural genes encoding coat proteins and 3) elements manipulating metabolism of infected cells. Phage genomes could be considered as collections of functional modules that evolved independently in host genomes and were acquired over time by the phage . However, nowadays an overall similarity of viral and cellular proteins having probably resulted from horizontal gene transfer is small .
If we imagine that 1 ml of seawater contains 1 million bacteria cells and ten or even a thousand times more viral sequences (up to 109 virions/ml), it can be determined that 1031 bacteriophages infect 1024 bacteria cells per second [36, 37]. This abundance and replication rate of viruses have also been one of the sources of novel functions in cellular lineages via the insertion of genes of viral origin into cellular genomes. It has been recently suggested that viruses are real nature’s genomic laboratory and a virocentric perspective on the evolution of life was put forward [15, 38, 39]. However, the viral insertion should be distinguished from the real HGT consisting of DNA exchange between cells by transformation, conjugation and transduction; in the latter viruses play the role of vehicles for cellular gene exchange . These different processes can be exemplified by a prophage providing the acquisition of more than 100 new genes in a single genome editing event  or an insertion sequence named IS607 and carried by Phycodnaviridae (members of nucleo-cytoplasmic large DNA viruses, NCLDVs) as well as by Amoeba and Algae. It suggests that these viruses could mediate horizontal transfers between different cellular genomes . The majority of sequences in viromes represent a so-called “dark matter”, they have no detectable homologues in the current databases . In case of phages, it means that their ability to transduce cellular genes does not translate into domination of these cellular “hitchhiker” genes in the phage genomic reservoir . Although the number of sequenced genomes now included in the databases dramatically increased in the last years, the percentage of unknown sequences within bacteriophages has not really decreased .
Hypotheses of virus origin
Viruses cannot multiply or carry out living processes outside the cells, therefore the ancestry of both is most probably highly intertwined. Origin of viruses is enigmatic and controversial in the light of cellular theory of life. Comparison of viral and cellular sequences shed more light on hypotheses of virus origin. It is important to know that evolutionarily, three classes of viral genes can be distinguished: 1) genes with detectable homologues in cellular life forms, 2) virus-specific genes such as ORFans and 3) viral hallmark genes with only distant homologues in cellular organisms . None of the theories is exhaustive and each has gaps difficult to explain, and for each theory pros and cons have been discussed in literature (Fig. 1).
The virus-first (or co-evolution) hypothesis was first proposed by d’Herelle who claimed that viruses are ancestral to cells . Others suggested that viruses originated in the pre-cellular world using a soup as a host [15, 46]. Evolution of life started with a virus-like stage and the advent of modern-type cells was a comparatively late event . At the dawn of life there were no cellular forms but only first RNA molecules possessing enzymatic activities and capable of self-replication. Some of these subviral forms still exist in the current world - they are viroids – the smallest (from ~250 to ~400 nucleotides) and simplest replicating RNA molecules known today . Viroids of the family Avsunviroidae possess a hammerhead ribozyme structure and can carry out cleavage of oligomeric forms of RNA to the monomeric forms, which potentially makes them descendants of the earliest biomolecules present on Earth [48, 49]. Viroids are good candidates for being survivors of the RNA world as they have a number of special features such as a small size imposed by error-prone replication, high G + C content, lack of protein-coding ability consistent with a ribosome-free habitat and several others. On the timeline of evolution, when DNA and proteins molecules already existed, those protoviroids (ancient viroids) lost some abilities to become the plant parasites and today they are dependent on cellular enzymes such as RNA polymerase, RNAaseH and RNA ligase for replication . However, it is more reasonable to claim that these protoviroids would have always relied on efficient cellular metabolism producing ATP and other ribonucleotides, and therefore ancient viroids and cells co-evolved.
Even though we have no insight, whether there were the same rules in this ancient realm, according to our knowledge of the current living world there is a strong inverse relationship between genome size and mutation rate across all replication systems, therefore it is possible that pre-LUCA genomes were both small and highly error prone and hence RNA virus-like . In the era of nucleic acid life in a niche of “supramolecular aggregates” (or SMAs), the nucleic acids evolved to accommodate available peptides , or RNA molecules could have evolved independently of host proto-cells as their “parasites”, inhabiting a common environment. RNA viruses evolved first from the nucleoprotein world, followed by retroid elements, and DNA viruses . Ancient RNA viruses, specifically (+)ssRNA, are relics of the RNA world, retroviruses and hepatitis B viruses relics of an RNA-to-DNA transition in evolutionary history of life. This seems to be confirmed by the existence of tRNA-like structures (TLSs), which are involved in virus replication (link between replication and translation) and by the discovery of reverse transcriptase . Transfer RNA-like structures (TLSs) that are sophisticated functional mimics of tRNAs are found at the 3′-termini of the genomes of a number of plant positive strand RNA viruses and three natural aminoacylation identities are represented: valine, histidine, and tyrosine . Indeed, tRNA-like motifs could be inherited from RNA replication signals accommodated to assist in the translation process. Plant RNA viruses make use of TLSs to control translation initiation and viral RNA replication .
There is a growing body of evidence that viruses arose even before LUCA, that more appropriately should be denoted as Last Universal Cellular Ancestral State (LUCAS) . Moreover, while discussing concepts of virus origin, it is crucial to distinguish viruses evolved before ancient cells and viruses evolved before modern cells, the descendants of LUCA. The theory of ancient virus origin is supported by the presence of homologous capsids and homologous packaging ATPases among diverse viruses infecting the three domains of life. Capsid protein is the most prominent example, and the sole protein found in most viruses and not in cellular organisms [51, 55, 56]. Several years ago Abrescia and colleagues identified major viral lineages based on structural comparison of non homologous capsid proteins and non homologhous packaging ATPases, where genomic similarities are no longer observable. At least two lineages of DNA viruses predating LUCA, adenovirus/PRD1 containing the double-jelly roll fold and the Hong Kong fold in the HK97 lineage were described . In the context of these two completely different structures a criticism of the virus-first theory based on structural convergence of most viral capsids adopting to a small number of simple geometrical structures can be refuted. Thus, the convergence towards similar folds for adaptation of certain capsid proteins, as their tertiary conformation is subject to strong constraints, concerns only a part of the viral world and cannot be ground for a universal evolutionary concept. Furthermore, the invention of a self-assembling capsid is very difficult to achieve and its formation by evolutionary mechanisms is very rare. It suggests that structure based lineages may tend to reflect homology rather than structural convergence . To conclude it should be noted that there is no a single gene, or a coding sequence that would be common to all the viruses, hence a common pre-LUCA viral ancestor is often questioned .
The escape or vagrancy (cell-first) hypothesis describes viruses as derived from cellular RNA or/and DNA fragments such as plasmids and transpozons, which escaped from cells. When such RNA or DNA fragments acquired protein coat they became independent entities capable of infecting cells from which they had escaped previously . As already mentioned, ancient RNA genomes were modular (“RNA chromosomes”), and were randomly distributed from cells to cells . During asymmetrical cell fission a vesicle (smaller cell-like entity) could have formed engulfing a self replicating RNA and a coat encoding RNA segment. The translation apparatus was not transferred to the newly formed vesicle and thus an ancient RNA virus emerged . In a model for early virus evolution, viruses can be regarded less as having derived from proto-cells and more as being partners in their mutual co-evolution . This model somehow merges the virus-first and the escape hypotheses into one more complex theory. On the other hand, it would be difficult to demonstrate how nucleic acids released from cells started to code for coat proteins. It can be easily imagined that plasmids evolved quite late from dsDNA viruses (not the other way round) and lost genes encoding coat proteins. Otherwise, it would be hard to prove how viruses evolved from plasmids and acquired the ability to encode capsids in the absence of already existing capsid modules [60, 61]. Furthermore, viruses resemble plasmids which do not encode cellular homologues including proteins involved in DNA replication such as rolling-circle Rep proteins and DNA polymerase E . This would indicate common evolutionary tract for plasmids and viruses. Moreover, viruses derived from cells should share a high sequence homology with their hosts, yet proteins encoded by bacteriophage T4 are more similar to eukaryotic proteins or eukaryotic viral proteins than to their bacterial homologues , and most proteins encoded by viral genomes are deprived of their cellular homologues . However, it is easier to defend the escape theory in the context of a pre-LUCA scenario for virus origin. Since viruses derived from genome fragments escaped from cells predating LUCA, any specific relationship between proteins encoded by viruses and those encoded by their hosts are not expected anymore . A good documented example of new viruses being created through gene escape events is human hepatitis delta virus (HDV), which has been shown to contain a ribozyme sequence that is closely related to the CPEB3 ribozyme present in a human intron . HDV is found only in humans and requires human hepatitis B virus to replicate. Thus, HDV probably derives from the human transcriptome, and not necessarily from a pre-LUCA world .
The reduction or degeneracy (cell-first) hypothesis states that viruses come from small primordial cells (not necessarily primitive), which lost their cellular elements in the course of evolution. They maintained, however, their genetic material and certain elements needed for replication. For a long time it has been believed that there is no intermediary form between a cell and a virus, because parasites known for the three domains of life have kept their cellular character; they still have ribosomes and are able to synthesize ATP. It is also for that reason that the reduction theory can be easily counter-argued. But then again, it is much easier to imagine this reduction leading to a virus emergence in a world of RNA cells, because these cells were much simpler than the modern ones. RNA-cell living as a parasitic endosymbiont in another RNA cell could have lost its own machinery for protein synthesis and for energy production, using instead those of the host . The presence of virus hallmark genes may be considered as evidence for their possible origin from virocells or these sequences may have been recruited from ancient cells now extinct. For instance, it was described that both human adenovirus and Bacillus subtilis bacteriophage Ф29, use a similar atypical protein-priming mechanism to replicate their DNA (unknown in the cellular world) and encode a unique type of DNA polymerase from the subfamily of polymerases B. It can use such a DNA template to initiate its own replication and has no representatives in currently living cells . It seems likely, that the DNA polymerase is a viral hallmark gene in disguise , and that these two viruses originated from a common ancestor that had existed before the divergence between Eukarya and Bacteria . However, it must be mentioned here that a small set of virus hallmark genes encoding essential functions shared by a vast range of viruses is a strong evidence, especially for positive-strand RNA, that viruses are direct descendants of the primordial RNA-protein world .
In recent years, the reduction hypothesis was revived by the discovery and genomic characterization of Acanthamoeba polyphaga mimivirus (APMV)  with a very complex set of genes (1,2 Mb genome and 911 genes) showing little horizontal gene transfer. It strongly suggested a process of reductive evolution from an even more complex ancestor that had been endowed with a protein synthetic capability . Furthermore, sequence and phylogenetic analyses of the components of the packaging machinery present in APMV show that some large DNA viruses such as mimivirus, vaccinia virus, and pandoravirus are remarkably more similar to prokaryotes (bacteria and archaea) than to other viruses in the way they process their newly synthesized genetic material to make sure that only one copy of the complete genome is generated and meticulously placed inside a newly synthesized viral particle . The discovery of giruses such as Mimiviruses , Megaviruses (Megavirus chilensis) , Pandoraviruses , and Pithoviruses  created a continuum in genome size and functional complexity between the virosphere and cells. Megavirus retained all of the genomic features unique to Mimivirus, in particular its genes encoding key-elements of the translation apparatus (seven aminoacyl-tRNA synthetases), a trademark of cellular organisms. It could suggest that large DNA viruses derived from an ancestral cellular genome by reductive evolution, which can be supported further by the presence of a large number of enzymes in genomes of giruses like various hydrolases, proteases, kinases, phosphatases and many others involved in cellular metabolic processes. The nature of this cellular ancestor remains hotly debated [70–72]. It has been pointed out by Claverie and Ogata that despite life being an all or nothing concept, “living” organisms span a continuum of autonomy and complexity in which large DNA viruses (giruses) largely overlap the smallest bacteria. It is a well described evolutionary scenario for Bacteria and Archaea to become parasites by reductive evolution. Since giruses could have predated the divergence of today’s three cellular domains, their case may be similar supported by the presence of bacterial-like, archaeal-like and eukaryan-like genes in their genome . That is why it has recently been proposed that giruses coexisted with the cellular ancestors and represent a distinct supergroup along with superkingdoms Archaea, Bacteria and Eukarya .
However, this evolutionary theory suffers from several major weaknesses. More than 93 % of Pandoraviruses genes resemble nothing known in all available sequence databases, therefore their origin cannot be traced back to any known cellular lineage , quite similarly to previously described bacteriophages. Furthermore, the term “fourth domain” is controversial and many arguments were given by opponents against viruses belonging to the tree of life (actually, the tree of cells), among others, inability to produce and capture energy or inexistence of integrated fully developed metabolic pathways [6, 9]. NCLDVs genomes do not display any characteristics of genome decay that have been observed in intracellular bacteria such as Rickettsia or parasitic protists such as microsporidia, where presence of pseudogenes, non-coding DNA, shorter genes, massive gene loss and disappearance of metabolic pathways were noted. This picture is blurred even more by the fact that Megaviruses are related to small DNA viruses and could have derived from them using a complex process of genomic accordion. It implies successive steps of genome expansions (duplication and gene transfers) and genome reduction, in addition to movement and amplification of diverse genetic elements . Furthermore, giruses can be infected by their own viruses called virophages such as Sputnik that could be a vehicle mediating lateral gene transfer between them . As Sputnik multiplies in giant factories, it resembles satellite viruses of animals (adeno-associated virus or hepatitis D virus). However, Sputnik reproduction cycle seems to impair the production of normal APMV virions significantly, indicating that it is a genuine parasite, a first virus described to propagate at the expense of its viral host . According to Krupovic and Koonin Megaviruses evolved from virophages, which in turn derived from Polintons and Tectiviridae as it is shown by homology of the major capsid protein (MCP) in these groups. The evolution of giant viruses had been pushed to the extreme, which explains their big genome size [3, 77, 78]. To conclude, one should avoid supporting the reduction concept of virus origin using NCLDVs biology.
The origin of cells and nuclei
The co-evolution of viral elements and cellular forms has also been described as incessant arms race with various forms of cooperation . It started 3 or 4 billion years ago, when LUCA also emerged  to give life to all cellular organisms we know nowadays with universal genetic code from bacterial to human cells, wherein basic processes are similar. To reconstruct LUCA as it was back in time is extremely difficult because organisms have lost many genes in the course of evolution, and additionally a horizontal gene transfer (HGT) interfered. The very nature of LUCA is still under discussion. According to a group of researchers, although it does not seem very likely in the light of more robust theories, LUCA could have been an inorganically housed assemblage of expressed and replicable genetic elements. The evolution of the enzymatic systems for DNA replication, membrane and cell wall biosynthesis, enabled independent escape of the first archaebacterial and eubacterial cells from their hydrothermal hatchery, within which the LUCA itself remained [81, 82]. A concept of LUCA growing on the H2/CO2 couple, and being naturally chemiosmotic is among many other hypotheses. This point goes a long way towards explaining why chemiosmosis, and the proteins that harness ion gradients, are universal among living cells . LUCA could have used proton gradients to drive carbon and energy metabolism, but only if the membranes were leaky. This requirement precluded ion pumping and the early evolution of phospholipid membranes . However, other researchers demonstrated in an evidence-based manner that LUCA was enclosed by a lipid membrane with secretory and insertion apparatus of protein nature. Comparative genomic analyses showed that LUCA already encoded several critical membrane-bound proteins [85, 86] as well as ATP-ase, contained ribosomes and most likely DNA [28, 87]. These sophisticated ribosomes of LUCA were built of 34 proteins that are shared by all ribosome-encoding organisms [8, 88]. The following issue after deciphering LUCA’s nature in tracing early cellular evolution is to explain the differences in the membrane composition (cytoplasmic, nuclear and belonging to reticulum) among the three major domains of life that came after LUCA. Eukarya and Bacteria are much more similar to each other in this regard than Archaea. Eukaryan lipids are bacteria-like and have an opposite chirality as compared to Archaea . Two viruses with related DNA replication systems could have infected RNA cells with different types of lipids, and some cellular lineages ended up using specifically one of the two types of lipids to produce Archaea and Eukarya .
Viruses can be considered as living organisms only when they redirect cellular metabolism to reproduce virions, hence infection transforms the ribocell (cell encoding ribosomes and dividing by binary fission) into a virocell (cell producing virions) or ribovirocell (cell that produces virions but can still divide by binary fission) [4, 5]. This nomenclature is in line with a well documented observation of a variety of nonrelated viruses inducing a recruitment of organelles, usually to the perinuclear area, and building a new structure called “virus factory” that functions in viral replication, assembly, or both. The virus factory is enclosed by a membrane, contains ribosomes and cytoskeletal elements and it can also recruit mitochondria, from which it obtains ATP . At this stage of NCLDVs replication cycle the virus factory is very similar to small unicellular parasites such as bacteria. From this perspective it is much easier to consider NCLDVs as entities linking inert world and living cells. Another interesting aspect is that a large poxvirus-like dsDNA virus might be at the origin of the eukaryotic nucleus, enclosed by an ancestral cell and adapted as an organelle. This virus factory in ancient times was very similar to a “viral nucleus” that could have evolved into a modern eukaryotic nucleus according to eukaryogenesis hypothesis . The nucleus could have already appeared in a RNA LUCA and two independent transfers of DNA from viruses to cells were suggested to explain the existence of two nonhomologous DNA replication machineries – one in Bacteria, the other in Archaea and Eukarya, which for that reason are placed on a common branch of the tree of life as opposed to Bacteria (Fig. 2) [91–93].
Later, it was proposed that DNA replication machineries of each domain could have also originated from three different viruses that helped create three major branches of life: LACA – last archaeal common ancestor, LBCA – last bacterial common ancestor, and finally LECA – last eukaryotic common ancestor . This concept of polyphyletic ancestry of viruses is called “Three RNA cells, three DNA viruses”. It is interesting to denote that RNA viruses might have been at the origin of DNA biochemistry. RNA-based viruses replicating in RNA-based cells would have acquired an RNA-to-DNA modification system to resist cellular RNA-degrading enzymes (Darwinian selection). For this to happen, RNA viruses acquired the ribonucleotide reductase for conversion of diphosphate-ribonucelotides to diphosphate-deoxyribonucelotides, and thymidylate synthase to make dTMP from dUMP, cellular RNA was then replaced by DNA of possible viral origin in the course of evolution . The genetic DNA-RNA takeover may have been driven by a combination of increased chemical stability, increased genome size and irreversibility as it was demonstrated experimentally several years ago . This scenario is supported by the fact that many modern viruses encode viral-specific versions of ribonucleotide reductases and thymidylate synthases. Interestingly, to further support the above, deoxyuridine is known to replace thymidine in the DNA of several bacteriophages . Given the complexity of ribosomes and sophisticated nature of aforementioned enzymes it would be really difficult to imagine that they originated in the world without proto-cells. The RNA-to-DNA transition must have taken place in a cellular context .
Conclusion and future perspectives
It is legitimate to say that the tree of life is composed of cells/organisms coding for ribosomes and multiplying by binary fission, and viruses are excluded from the tree of life (cells) as entities encoding capsid proteins and undergoing intracellular process in order to propagate . They are actually molecular genetic parasites. All life must survive this viral-laden habitat and survivors generally retain prophage (or provirus) or their defectives . Viruses from the very origin of life were one of major sources of global genetic biodiversity by participation in altering genomic structures (mutations) and functions. They also served as vehicles to transfer host genes horizontally between cells from different species and even distant taxa. From the time of LUCA, viruses have coevolved with their hosts, and citation by Forterre seems appropriate: “viral lineages can be viewed as lianas wrapping around the trunk, branches and leaves of the tree of life” . Koonin ventures to postulate that the concept of the tree of life should be replaced by a “complex network of treelike and netlike routes of evolution to depict the history of life”, which indeed may better reflect reality . For decades virologists have tried to understand and explain the origin of viruses. In our opinion, we probably need to cope with the idea that all concepts on virus origin described in this review are complementary. Viroids and HDV by their nature may support the co-evolution theory by the former and the escape concept by the latter. A recent discovery of mimivirus (more or less a decade ago) and other giant viruses has opened for some scientists a new perspective on evolution of these viruses from an extinct fourth cellular domain. However, the concept of this new domain is very controversial in the scientific community . This late discovery of giruses is a good example of how difficult it was to leave a dogma of viruses being the smallest entities passing through the finest filters as opposed to bacteria [99, 100] and that perhaps some new great discoveries of the viral world are still ahead of us. Viruses are especially neglected in phylogenetic studies because they lack a unifying genetic marker, similar to rRNA for cells and because their genetic activity is underestimated . The ancestors of current life forms no longer exist and it makes extremely difficult to go back to the dawn of evolution that took place several billion years ago. Paradoxically, future findings in the viral world and even more powerful tools of bioinformatics for comparative studies may help better understand the first evolutionary events.
Last universal common ancestor
Last universal common ancestor state
Last archaeal-eukaryal common ancestor
Last archaeal common ancestor
Last bacterial common ancestor
Last eukaryotic common ancestor
Horizontal gene transfer
Lateral gene transfer
Nucleo-cytoplasmic large DNA viruses
Hepatitis delta virus
Hepatitis B virus
Acanthamoeba polyphaga mimivirus
Benson SD, Bamford JK, Bamford DH, Burnett RM. Does common architecture reveal a viral lineage spanning all three domains of life? Mol Cell. 2004;3(16):673–85.
Rossmann MG. Structure of viruses: a short history. Q Rev Biophys. 2013;46:133–80.
Abrescia NG, Bamford DH, Grimes JM, Stuart DI. Structure unifies the viral universe. Annu Rev Biochem. 2012;81:795–822.
Forterre P. The virocell concept and environmental microbiology. ISME J. 2013;7:233–6.
Forterre P. The great virus comeback [Article in French]. Biol Aujourdhui. 2013;207:153–68.
Forterre P, Krupovic M, Prangishvili D. Cellular domains and viral lineages. Trends Microbiol. 2014;22:554–8.
Koonin EV, Dolja VV. Virus world as an evolutionary network of viruses and capsidless selfish elements. Microbiol Mol Biol Rev. 2014;78:278–303.
Raoult D, Forterre P. Redefining viruses: lessons from Mimivirus. Nat Rev Microbiol. 2008;6:315–9.
Moreira D, López-García P. Ten reasons to exclude viruses from the tree of life. Nat Rev Microbiol. 2009;7:306–11.
La Scola B, Audic S, Robert C, Jungang L, de Lamballerie X, Drancourt M, et al. A giant virus in amoebae. Science. 2003;299:2033.
Flores R, Ruiz-Ruiz S, Serra P. Viroids and hepatitis delta virus. Semin Liver Dis. 2012;32:201–10.
Fallot G, Neuveut C, Buendia MA. Diverse roles of hepatitis B virus in liver cancer. Curr Opin Virol. 2012;2:467–73.
Maumus F, Epert A, Nogué F, Blanc G. Plant genomes enclose footprints of past infections by giant virus relatives. Nat Commun. 2014;5:4268.
Baltimore D. Expression of animal virus genomes. Bacteriol Rev. 1971;35:235–41.
Koonin EV, Dolja VV. A virocentric perspective on the evolution of life. Curr Opin Virol. 2013;3:546–57.
Xiong Y, Eickbush TH. Origin and evolution of retroelements based upon their reverse transcriptase sequences. EMBO J. 1990;9:3353–62.
Jones SA, Hu J. Hepatitis B virus reverse transcriptase: diverse functions as classical and emerging targets for antiviral intervention. Emerg Microbes Infect. 2013;2, e56.
Gladyshev EA, Arkhipova IR. A widespread class of reverse transcriptase-related cellular genes. Proc Natl Acad Sci U S A. 2011;108:20311–6.
Witzany G. The viral origins of telomeres and telomerases and their important role in eukaryogenesis and genome maintenance. Biosemiotics. 2008;2:191–206.
Katzourakis A. Paleovirology: inferring viral evolution from host genome sequence data. Philos Trans R Soc Lond B Biol Sci. 2013;368:20120493.
Rohwer F, Thurber RV. Viruses manipulate the marine environment. Nature. 2009;459:207–12.
Forterre P. The origin of DNA genomes and DNA replication proteins. Curr Opin Microbiol. 2002;5:525–32.
Jalasvuori M, Bamford JK. Structural co-evolution of viruses and cells in the primordial world. Orig Life Evol Biosph. 2008;38:165–81.
Brüssow H. Bacteria between protists and phages: from antipredation strategies to the evolution of pathogenicity. Mol Microbiol. 2007;65:583–9.
Shackelton LA, Holmes EC. The role of alternative genetic codes in viral evolution and emergence. J Theor Biol. 2008;254:128–34.
Woese CR, Kandler O, Wheelis ML. Towards a natural system of organisms: proposal for the domains Archaea, Bacteria, and Eucarya. Proc Natl Acad Sci U S A. 1990;87:4576–9.
Takemura M. Poxviruses and the origin of the eukaryotic nucleus. J Mol Evol. 2001;52:419–25.
Woese CR. On the evolution of cells. Proc Natl Acad Sci U S A. 2002;99:8742–7.
Forterre P. Darwin’s goldmine is still open: variation and selection run the world. Front Cell Infect Microbiol. 2012;2:106.
Echeverría N, Moratorio G, Cristina J, Moreno P. Hepatitis C virus genetic variability and evolution. World J Hepatol. 2015;7:831–45.
Mau M, Lovell JT, Corral JM, Kiefer C, Koch MA, Aliyu OM, et al. Hybrid apomicts trapped in the ecological niches of their sexual ancestors. Proc Natl Acad Sci U S A. 2015;112:E2357–65.
Gogarten JP, Townsend JP. Horizontal gene transfer, genome innovation and evolution. Nat Rev Microbiol. 2005;3:679–68.
Woese CR. Interpreting the universal phylogenetic tree. Proc Natl Acad Sci U S A. 2000;97:8392–6.
Campbell, A, Botstein, D. Evolution of the lambdoid phages. In Lambda II. Edited by Hendrix, R. et al. Cold Spring Harbor (NY): Cold Spring Harbor Laboratory Press; 1983. p. 365–380.
Rappoport N, Linial M. Viral proteins acquired from a host converge to simplified domain architectures. PLoS Comput Biol. 2012;8, e1002364.
Tettelin H, Masignani V, Cieslewicz MJ, Donati C, Medini D, Ward NL, et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: implications for the microbial “pan-genome”. Proc Natl Acad Sci U S A. 2005;102:13950–5.
Rosario K, Breitbart M. Exploring the viral world through metagenomics. Curr Opin Virol. 2011;1:289–97.
Abroi A, Gough J. Are viruses a source of new protein folds for organisms? Virosphere structure space and evolution. Bioessays. 2011;33:626–35.
Rohwer F, Youle M, Maughan H, Hisakaw N. Life in our phage world. Edited by Wholon: San Diego California; 2014.
Campbell A. Phage integration and chromosome structure. A personal history. Annu Rev Genet. 2007;41:1–11.
Gilbert C, Cordaux R. Horizontal transfer and evolution of prokaryote transposable elements in eukaryotes. Genome Biol Evol. 2013;5:822–32.
Kristensen DM, Mushegian AR, Dolja VV, Koonin EV. New dimensions of the virus world discovered through metagenomics. Trends Microbiol. 2010;18:11–9.
Kristensen DM, Cai X, Mushegian A. Evolutionarily conserved orthologous families in phages are relatively rare in their prokaryotic hosts. J Bacteriol. 2011;193:1806–14.
Koonin EV, Senkevich TG, Dolja VV. The ancient virus world and evolution of cells. Biol Direct. 2006;1:29.
D’Herelle F. The bacteriophage; its role in immunity. Williams and Willkins: Baltimore; 1922.
Prangishvili D, Stedman K, Zillig W. Viruses of the extremely thermophilic archaeon Sulfolobus. Trends Microbiol. 2001;9:39–43.
Kovalskaya N, Hammond RW. Molecular biology of viroid-host interactions and disease control strategies. Plant Sci. 2014;228:48–60.
Diener TO. Circular RNAs: relics of precellular evolution? Proc Natl Acad Sci U S A. 1989;86:9370–4.
Flores R, Gago-Zachert S, Serra P, Sanjuán R, Elena SF. Viroids: survivors from the RNA world? Annu Rev Microbiol. 2014;68:395–414.
Holmes EC. What does virus evolution tell us about virus origins? J Virol. 2011;85:5247–51.
Forterre P. The origin of viruses and their possible roles in major evolutionary transitions. Virus Res. 2006;117:5–16.
Dreher TW. Role of tRNA-like structures in controlling plant virus replication. Virus Res. 2009;139:217–29.
Piñeiro D, Martinez-Salas E. RNA structural elements of hepatitis C virus controlling viral RNA translation and the implications for viral pathogenesis. Viruses. 2012;4:2233–50.
Koonin EV. On the origin of cells and viruses: primordial virus world scenario. Ann N Y Acad Sci. 2009;1178:47–64.
Khayat R, Tang L, Larson ET, Lawrence CM, Young M, Johnson JE. Structure of an archaeal virus capsid protein reveals a common ancestry to eukaryotic and bacterial viruses. Proc Natl Acad Sci U S A. 2005;102:18944–9.
Bamford DH, Grimes JM, Stuart DI. What does structure tell us about virus evolution? Curr Opin Struct Biol. 2005;15:655–63.
Claverie JM. Viruses take center stage in cellular evolution. Genome Biol. 2006;7:110.
Poole AM, Jeffares DC, Penny D. The path from the RNA world. J Mol Evol. 1998;46:1–17.
Hendrix RW, Lawrence JG, Hatfull GF, Casjens S. The origins and ongoing evolution of viruses. Trends Microbiol. 2000;8:504–8.
Forterre P. The two ages of the RNA world, and the transition to the DNA world: a story of viruses and cells. Biochimie. 2005;87:793–803.
Forterre P. Three RNA cells for ribosomal lineages and three DNA viruses to replicate their genomes: a hypothesis for the origin of cellular domain. Proc Natl Acad Sci U S A. 2006;103:3669–74.
Lipps G, Röther S, Hart C, Krauss G. A novel type of replicative enzyme harbouring ATPase, primase and DNA polymerase activity. EMBO J. 2003;22:2516–25.
Miller ES, Kutter E, Mosig G, Arisaka F, Kunisawa T, Rüger W. Bacteriophage T4 genome. Microbiol Mol Biol Rev. 2003;67:86–156.
Prangishvili D, Garrett RA. Exceptionally diverse morphotypes and genomes of crenarchaeal hyperthermophilic viruses. Biochem Soc Trans. 2004;32:204–8.
Salehi-Ashtiani K, Lupták A, Litovchick A, Szostak JW. A genomewide search for ribozymes reveals an HDV-like sequence in the human CPEB3 gene. Science. 2006;313:1788–92.
Filée J, Forterre P, Sen-Lin T, Laurent J. Evolution of DNA polymerase families: evidences for multiple gene exchange between cellular and viral proteins. J Mol Evol. 2002;54:763–73.
Raoult D, Audic S, Robert C, Abergel C, Renesto P, Ogata H, et al. The 1.2-megabase genome sequence of Mimivirus. Science. 2004;306:1344–50.
Chelikani V, Ranjan T, Zade A, Shukla A, Kondabagil K. Genome segregation and packaging machinery in Acanthamoeba polyphaga mimivirus is reminiscent of bacterial apparatus. J Virol. 2014;88:6069–75.
Arslan D, Legendre M, Seltzer V, Abergel C, Claverie JM. Distant Mimivirus relative with a larger genome highlights the fundamental features of Megaviridae. Proc Natl Acad Sci U S A. 2011;108:17486–91.
Philippe N, Legendre M, Doutre G, Couté Y, Poirot O, Lescot M, et al. Pandoraviruses: amoeba viruses with genomes up to 2.5 Mb reaching that of parasitic eukaryotes. Science. 2013;341:281–6.
Legendre M, Arslan D, Abergel C, Claverie JM. Genomics of Megavirus and the elusive fourth domain of Life. Commun Integr Biol. 2012;5:102–6.
Ludmir EB, Enquist LW. Viral genomes are part of the phylogenetic tree of life. Nat Rev Microbiol. 2009;7:615. author reply 615.
Claverie JM, Ogata H. Ten good reasons not to exclude giruses from the evolutionary picture. Nat Rev Microbiol. 2009;7:615.
Nasir A, Kim KM, Caetano-Anolles G. Giant viruses coexisted with the cellular ancestors and represent a distinct supergroup along with superkingdoms Archaea, Bacteria and Eukarya. BMC Evol Biol. 2012;12:156.
Filée J. Route of NCLDV evolution: the genomic accordion. Curr Opin Virol. 2013;3:595–359.
La Scola B, Desnues C, Pagnier I, Robert C, Barrassi L, Fournous G, et al. The virophage as a unique parasite of the giant mimivirus. Nature. 2008;455:100–4.
Koonin EV, Krupovic M, Yutin N. Evolution of double-stranded DNA viruses of eukaryotes: from bacteriophages to transposons to giant viruses. Ann N Y Acad Sci. 2015;1341:10–24.
Durzyńska J. Giant viruses - enfants terribles in the microbal Word. Future Virol 2015; doi:10.2217/FVL.15.27.
Forterre P, Prangishvili D. The great billion-year war between ribosome- and capsid-encoding organisms (cells and viruses) as the major source of evolutionary novelties. Ann N Y Acad Sci. 2009;1178:65–77.
Glansdorff N, Xu Y, Labedan B. The last universal common ancestor: emergence, constitution and genetic legacy of an elusive forerunner. Biol Direct. 2008;3:29.
Koonin EV, Martin W. On the origin of genomes and cells within inorganic compartments. Trends Genet. 2005;21:647–54.
Martin W, Russell MJ. On the origins of cells: a hypothesis for the evolutionary transitions from abiotic geochemistry to chemoautotrophic prokaryotes, and from prokaryotes to nucleated cells. Philos Trans R Soc Lond B Biol Sci. 2003;358:59–83. discussion 83–85.
Lane N, Allen JF, Martin W. How did LUCA make a living? Chemiosmosis in the origin of life. Bioessays. 2010;32:271–80.
Sojo V, Pomiankowski A, Lane N. A bioenergetic basis for membrane divergence in archaea and bacteria. PLoS Biol. 2014;12, e1001926.
Peretó J, López-García P, Moreira D. Ancestral lipid biosynthesis and early membrane evolution. Trends Biochem Sci. 2004;29:469–77.
Lombard J, López-García P, Moreira D. The early evolution of lipid membranes and the three domains of life. Nat Rev Microbiol. 2012;10:507–15.
Jékely G. Did the last common ancestor have a biological membrane? Biol Direct. 2006;1:35.
Yutin N, Puigbò P, Koonin EV, Wolf YI. Phylogenomics of prokaryotic ribosomal proteins. PLoS One. 2012;7, e36972.
Novoa RR, Calderita G, Arranz R, Fontana J, Granzow H, Risco C. Virus factories: associations of cell organelles for viral replication and morphogenesis. Biol Cell. 2005;97:147–72.
Bell PJ. The viral eukaryogenesis hypothesis: a key role for viruses in the emergence of eukaryotes from a prokaryotic world environment. Ann N Y Acad Sci. 2009;1178:91–105.
Brochier-Armanet C, Forterre P, Gribaldo S. Phylogeny and evolution of the Archaea: one hundred genomes later. Curr Opin Microbiol. 2011;14:274–81.
Zhao S, Burki F, Bråte J, Keeling PJ, Klaveness D, Shalchian-Tabrizi K. Collodictyon-an ancient lineage in the tree of eukaryotes. Mol Biol Evol. 2012;29:1557–68.
Chun J, Rainey FA. Integrating genomics into the taxonomy and systematics of the Bacteria and Archaea. Int J Syst Evol Microbiol. 2014;64:316–24.
Leu K, Obermayer B, Rajamani S, Gerland U, Chen IA. The prebiotic evolutionary advantage of transferring genetic information from RNA to DNA. Nucleic Acids Res. 2011;39:8135–47.
Takahashi I, Marmur J. Replacement of thymidylic acid by deoxyuridylic acid in the deoxyribonucleic acid of a transducing phage for Bacillus subtilis. Nature. 1963;197:794–5.
Villarreal LP, Witzany G. Viruses are essential agents within the roots and stem of the tree of life. J Theor Biol. 2010;262:698–710.
Koonin EV. The two empires and three domains of life in the postgenomic age. Nat Educ. 2010;3:27.
Yutin N, Wolf YI, Koonin EV. Origin of giant viruses from smaller DNA viruses not from a fourth domain of cellular life. Virology. 2014;466–467:38–52.
Glass JI, Assad-Garcia N, Alperovich N, Yooseph S, Lewis MR, Maruf M, et al. Essential genes of a minimal bacterium. Proc Natl Acad Sci U S A. 2006;103:425–30.
Boyer M, Yutin N, Pagnier I, Barrassi L, Fournous G, Espinosa L, et al. Giant Marseillevirus highlights the role of amoebae as a melting pot in emergence of chimeric microorganisms. Proc Natl Acad Sci U S A. 2009;106:21848–53.
Nasir A, Forterre P, Kim KM, Caetano-Anollés G. The distribution and impact of viral lineages in domains of life. Front Microbiol. 2014;5:194.
This work was supported by KNOW RNA Research Centre in Poznań No.01/KNOW2/2014 (Poland). We would like to thank our two anonymous reviewers for their assistance in improving this review and a number of their pertinent comments. Also, we would like to thank Andrzej Pietrzak from Adam Mickiewicz University Press for the linguistic help.
The authors declare that they have no competing interests.
JD drafted and wrote the manuscript, and designed the artwork. AGJ initiated the concept of the article andverified the content of the final version before submission. All authors read and approved the finalmanuscript.
About this article
Cite this article
Durzyńska, J., Goździcka-Józefiak, A. Viruses and cells intertwined since the dawn of evolution. Virol J 12, 169 (2015). https://doi.org/10.1186/s12985-015-0400-7
- Giant virus (girus)
- Horizontal gene transfer
- Tree of life
- Domain of life