Background

Parvoviruses consist of small non-enveloped, autonomously replicating, single-stranded DNA viruses with genome length between 4.5 - 5.5 kb [1]. Bocavirus, a genus of the family Parvoviridae, is characterized by the presence of a third major ORF named NP1. Bocaviruses are known to infect multiple mammalian species including humans [2], cows [3], pigs [46], gorillas [7], chimpanzees [8], California sea lions [9], dogs [1013], cats [11], bats [14], and pine martens [15]. Bocavirus infections can cause respiratory and gastrointestinal symptoms in young animals and humans, but are also often subclinical in adults [2, 16]. While many bocaviruses were initially identified in feces or respiratory secretion they can also be found in blood [2, 16].

Minute virus of canines (MVC) was the first known bocavirus infecting dogs. MVC was isolated in 1967 in the feces of a clinical healthy dog, and later recognized as causing neonatal diseases and fertility disorders in dogs [16]. The second species of dog bocavirus (Canine bocavirus, CBoV) was identified in 2011 in the respiratory samples from diseased and healthy dogs [10]. One genotype of CBoV was associated with respiratory disease as it showed higher prevalence in diseased animals than healthy controls [10]. Variants of this CBoV were also detected in fecal, nasal, urine and blood samples collected from dogs in Hong Kong [11].

In this study, an infectious etiology was suspected for a dog with severe hemorrhagic gastroenteritis, necrotizing vasculitis, granulomatous lymphadenitis and anuric renal failure. The clinical and post-mortem workups for infectious causes in this case included negative test results for Canine Parvovirus 2, Canine Enteric Coronavirus, Canine distemper virus, Salmonella, Campylobacter, Clostridium perfringens enterotoxin A gene, Cryptosporidium, and Giardia. Special stains of histologic specimens revealed no detectable bacteria or other known infectious agents. Using deep sequencing, we characterized viral sequences present in the dog’s liver revealing a third species of canine bocavirus described here.

Results and discussion

A sequence of 300 bases showing sequence similarity to bocaviruses (BLASTx E < 10-3) was initially identified by 454 pyrosequencing. The sequence was extended by degenerate PCR targeting conserved bocavirus regions, yielding a ~2.5 kb partial genome sequence. The tissue nucleic acids were further analyzed using the MiSeq Illumina platform generating 16 contigs composed of 133 reads with similarities to bocaviruses (BLASTx E < 10-3), which allowed the amplification of all three ORFs. The virus was highly divergent from the other two known canine bocaviruses, MVC and CBoV, and was provisionally named Canine bocavirus 3 (CnBoV3) [10, 11, 13].

The nonstructural (NS1) protein encoded by ORF1 was 778 aa long, and contained motifs associated with rolling circle replication, helicase and ATPase. The NP1 protein encoded by the middle ORF3 was 194 aa long. The ORF2 encoded capsid proteins VP1 (689 aa) and proteolytically processed VP2 (560 aa). CnBoV3 shared 51%, 57%, 56% aa similarity with the NS1, NP1 and VP1 region of CBoV, respectively, and 49%, 52%, 57% aa similarity with the NS1, NP1 and VP1 regions of MVC. Phylogenetic analysis of the entire VP1 was performed to determine the relationship between CnBoV3 and other bocaviruses. CnBoV3 was phylogenetically distinct from the known dog bocaviruses and only loosely related to California sea lion bocaviruses, CBoV, feline bocavirus and MVC (Figure 1). Phylogenetic trees constructed by NS1 and NP1 region yielded similar topology (data not shown).

Figure 1
figure 1

Phylogenetic tree based on aligned amino acid sequences of full-length VP proteins of representative bocavirus species.

Bocaviruses are believed to replicate through the parvovirus rolling hairpin model, which generate replication intermediates of concatemers with head-to-head or tail-to-tail structure [17]. Recent experimental evidence showed the presence of head-to-tail concatemers or circularized genomes of human bocavirus (HBoV) 1&3 [18, 19] and porcine bocaviruses [20], indicating that some bocaviruses may use a rolling - circle replication model.

Using inverse PCR with primers directed outward from the 5’ and 3’ extremities of the partial genome sequence we were able to amplify head-to-tail sequence of the CnBoV3 non-translated regions (NTR) between the VP termination and the NS1 initiation codons (Figure 2A). The generation of specific PCR products indicated the presence of concatemerized or circular forms of the genome in the liver [18].

Figure 2
figure 2

A) Inverse PCR was performed to obtain the NTR sequence between the VP termination and the NS initiation codon of episomal forms of CnBoV3. Primers (P1 and P2) were situated at VP and NS region respectively. Secondary structures (symbolic in the box) may have impeded the PCR resulting in several products of different length. B) Highly identical ‘rabbit ear’ structure of CnBoV3 and MVC. C) Alignments of NTR sequences obtained by sequencing different inverse PCR products directly (PCR1&2) and subcloning products (colony 1–4). Sequences outside the two arrows were identical. Underlined sequence shows location of “rabbit ear” sequence.

To confirm these results, multiple repeats of the inverse PCR were generated and directly sequenced as well as subcloned into a plasmid vector. PCR amplicons and plasmid inserts were Sanger sequenced using protocol for GC-rich/hairpin sequences. The resulting sequences were aligned and the length of the region between the VP stop and the NS start codon calculated. This region varied in length from 392 to 506 nt (Figure 2C). Mfold analyses of the longest NTR sequence (506 nt) showed that the long palindromic hairpin terminal repeats (e.g. approximately 150 nt at both ends in MVC or bovine parvovirus, BPV) were missing, but a typical “rabbit ear” structure was detected that was nearly identical to a structure in the MVC 5'NTR sequence (Figure 2B). These results suggested that inverse PCR may have been impeded by strong secondary DNA structures (Figure 2A &C, between two red arrows). None of the currently reported NTR regions of bocavirus obtained by inverse PCR contained both complete inverted terminal repeats seen in MVC or BPV [1820].

Conclusions

In this study we genetically characterize a third species of canine bocavirus (CnBoV3), highly divergent from MVC [21] and CBoV [10, 11], in the liver of a dog with severe disease. The presence of three distinct canine bocaviruses, phylogenetically interspersed with viruses from different mammalian hosts, may reflect their origins from different cross-species transmissions. The detection of CnBoV3 in liver tissue indicated that the virus had likely breached the mucosal barrier of the typical sites of bocavirus replication in the respiratory or enteric tracts. The detection of episomal forms by PCR indicated that replication may be occurring in hepatocytes or other liver cell types and that viremia was also likely present although the lack of available blood sample prevented direct testing. We also detected in that animal co-infection with a canine circovirus. Circovirus infection can lead to lymphocytic depletion and immunosuppression in the host [22]. Whether CnBoV3, canine circovirus, or their combination were involved in this dog’s severe symptoms requires further investigation.

Materials and methods

A dog suffering from vomiting and hemorrhagic gastroenteritis was euthanized after a rapid disease course lasting seven days. Necropsy showed the presence of necrotizing vasculitis, granulomatous lymphadenitis and anuric renal failure. Clinical and post-mortem tests for multiple infectious agents of enteric disease were all negative. Liver Tissue was stored at −80°C until further processing. Tissue samples (~25 mg) were homogenized, filtered, and nuclease treated as previously described to enrich for nucleic acids within viral particles [23]. Nucleic acids were then extracted using the QIAamp viral RNA Mini kit (Qiagen), randomly amplified using random RT-PCR with randomized 3’ primers and nucleic acid libraries prepared as previously described [24] for sequencing using the Genome Sequencer FLX Instrument (454 Life Science, Roche). The pyrosequencing reads were sorted, trimmed, assembled and compared to the GenBank non-redundant databases as previously described [9]. Potential viral sequences were identified with translated protein sequence similarity matches (BLASTx to GenBank's NR database with E-value < 10-3) to known viral sequences. The presence of virus protein sequences detected by 454 pyrosequencing was confirmed by PCR and Sanger sequencing. Genome walking and degenerate PCR were used to amplify the viral genome. Enriched viral nucleic acids from the infected tissue was also handled using the ScriptSeq RNA-Seq library preparation kit (Epicentre) and sequenced by MiSeq system (illumina). The resulting near complete genome of CnBoV3 was deposited in GenBank with accession no. KC580640. Phylogenetic analyses based on aligned amino acid sequences from full-length VP proteins were generated by the neighbor joining method in MEGA [25], using amino acid p-distances, with 1,000 bootstrap replicates.