The genomes of tupanviruses are linear double-stranded DNA molecules with ~ 28% G+C. The TPV-SL genome is 1,439,508 bp in length and contains 1276 genes, while the TPV-DO has a genome of 1,516,267 bp that contains 1359 genes (Table 1). Around 30% of the predicted genes for the tupanviruses are completely new, named ORFans (open reading frames with no similarities in the databases). Among the ORFs with known function, up to 50% exhibit best hits in other viruses, especially members of the family Mimiviridae, while the other genes have best hits in cellular organisms, mainly eukaryotes and bacteria [15]. The structure and genome architecture are well conserved between the two tupanviruses, except for the terminal region of the genome of TPV-SL, which appears to be translocated and inverted in the genome of TPV-DO. In contrast, these genomes exhibit a strongly different synteny when compared to the genomes of the prototype viruses of the genera Mimivirus and Cafeteriavirus (Fig. 2). Moreover, the tupanviruses have the promoter motif AAAATTGA and a codon and amino acid usage frequency similar to that reported for other mimiviruses [3, 15, 21].
The tupanviruses have the most complete arsenal of genes related to the protein translation process among the members of the virosphere. Surprisingly, these viruses encode 67-70 tRNAs, 20 aminoacyl-tRNA synthetases (aaRS), and 41-44 other proteins involved in the translation process, such as translation factors (initiation, elongation and release), and factors related to maturation of tRNA and mRNA, among others (Table 1). The amount and diversity of these genes far exceed what was previously known for other giant viruses, even including the klosneuviruses, which possess up to 19 aaRS [14], and Bodo Saltans virus. The origin of these genes is still a matter of a hot debate in the scientific community, and a consensus has not yet been reached on whether these genes came from several events of lateral gene transfer from hosts and/or sympatric organisms or were already contain in the ancestors of mimiviruses [22,23,24]. Furthermore, the tupanviruses present two copies of an intronic region of the 18S rRNA gene, which are highly expressed during the viral replication cycle, but their function remains to be elucidated [15]. It is noteworthy that no exonic region of the 18S rRNA gene was found in tupanviruses or other viruses, nor do they possess genes related to the energy synthesis process [15].
Proteomic analysis of TPV-SL particles revealed the presence of 127 proteins, of which 67 have no known function, and 11 are encoded by ORFans. No aaRS or translation factors were detected in the viral particles. Among the proteins with a known function, several structural components were observed, such as the major capsid protein and the core protein, as well as enzymes involved in nucleic acid metabolism, such as a DNA polymerase, a DNA-dependent RNA polymerase, a RNA helicase, and an mRNA capping enzyme. It is notable that fewer than 50% of the proteins found in TPV-SL particles are shared with APMV or CroV [15].