The Tibetan pig is a unique Chinese indigenous pig breed that is found on the Tibetan plateau of China [1]. Due to the natural and harsh environment with an average altitude of over 3,000 m above sea level, few viral infections have been reported in animals in this region until recently. Tibetan pigs graze in the pastures in a free-range system and are usually kept together with other livestock such as yaks (Bos grunniens), sheep, and goats, providing opportunities for viruses to circulate among the livestock. Animals on the Tibetan plateau are also kept in close contact with local humans, which increases the risk of zoonotic transmission. Viral diarrhoea is a common disease in Tibetan pigs that substantially increases the mortality and morbidity of piglets. Currently, only two viruses that can cause diarrhoeal disease in Tibetan pigs have been identified: porcine epidemic diarrhea virus (PEDV) [2, 3] and porcine deltacoronavirus (PDCoV) [4]. The identities of other viruses associated with diarrhoea in Tibetan pigs remain unclear.

Since the advent of high-throughput sequencing and new metagenomic analytical methods, viral metagenomics has shown a great potential for investigating unknown aetiologies and identifying novel viruses [5]. Since 2007, numerous studies on the discovery of new viral communities in humans and animals by virome sequencing have been reported [6]. In recent years, the approach has been successful for analysing the viromes and identifying viruses in serum, plasma, faeces, respiratory secretions, and tissue samples. The faecal virome has been characterized for various animals, and some new viruses associated with diarrhoea have been identified in domestic pigs, including PDCoV [7], swine enteric alphacoronavirus (SeACoV) [8], swine acute diarrhea syndrome coronavirus (SADS-CoV) [9], calicivirus [10], sapelovirus [11], and astrovirus (AstV) [12]. The existence of new viruses associated with diarrhoea in pigs is worthy of continuous investigation. Whether the above viruses are prevalent in Tibetan pigs at high altitude on the Tibetan plateau of China also deserves special attention.

From 2018 to 2019, a total of 146 faecal samples were collected from 10- to 40-day-old Tibetan piglets that had been associated with clinical diarrhoeal disease. These samples were obtained from 16 pig farms in Ganzi Tibetan autonomous prefecture of Sichuan province, which is located in the southeastern margin of the Tibetan plateau. Tibetan pigs are usually fed in a free-ranging system on the plateau. These animals are not vaccinated with any commercial vaccines, including live attenuated vaccines against porcine epidemic diarrhea virus or transmissible gastroenteritis virus. Soon after sampling, 10% (wt/vol) faecal suspensions were prepared using sterile phosphate-buffered saline (PBS). Before nucleic acids were extracted, these samples were subjected to a series of pretreatments as described previously [13]. Total viral RNA was prepared from the samples using a QIAamp Viral RNA Mini Kit (QIAGEN, Germany) according to the manufacturer’s instructions. Then cDNA was generated using SuperScript III reverse transcriptase (RT) (Invitrogen, USA) and random hexamers (Invitrogen, USA) following the supplier’s guidelines. Finally, one mixed pooled sample was sent to BGI Co. LTD (Shenzhen, China) for the library construction and high-throughput TruSeq Illumina sequencing.

Using high-throughput sequencing, a total of 884,471 reads were obtained from one pooled faecal sample of Tibetan pigs (GenBank accession number: SRR9276460). Approximately 7.2% (12,291 reads) of the sequence reads mapping to mammalian-associated viral sequences were recovered from the faecal samples. Using BLASTx software, members of eleven virus families (Fig. 1a) were identified, which, in order of sequence read abundance, were as follows: Smacoviridae (31.09% of all reads), Parvoviridae (21.50%), Genomoviridae (11.77%), Hepeviridae (10.58%), Adenoviridae (10.20%), Picornaviridae (8.30%), Circoviridae (2.82%), Astroviridae (2.33%), Coronaviridae (0.81%), Flaviviridae (0.33%), and Caliciviridae (0.27%). Nineteen distinct viruses in these eleven families were identified (Fig. 1b). In order of sequence read abundance, the viruses were as follows: porcine stool-associated circular virus type 7 (PsaCV-7, 31.09% of all reads), porcine bocavirus (PBoV, 16.80%), gemycircularvirus (11.77%), hepatitis E virus (HEV, 10.58%), porcine adenovirus (10.20%), enterovirus (4.46%), sapelovirus (3.04%), porcine circovirus 2 (PCV-2, 2.82%), porcine parvovirus type 6 (PPV-6, 2.51%), AstV (2.33%), PPV-2 (1.33%), porcine bufavirus (0.86%), PEDV (0.81%), posavirus (0.41%), bovine viral diarrhea virus type 1 (BVDV-1, 0.33%), porcine kobuvirus (0.29%), sapovirus (0.27%), rabovirus (0.08%), and pasivirus (0.02%). Three viruses in the study were demonstrated to be associated with animal diarrhoeal disease, including PEDV [14], porcine circovirus 2 (PCV-2) [15], and bovine viral diarrhea virus type 1 (BVDV-1) [16]. Interestingly, BVDV is frequently detected in cows, pigs, sheep, goats, and other ruminants [17], suggesting potential interspecies transmission of BVDV-1 between cattle and swine. It is noteworthy that HEV is considered to be a zoonotic pathogen, and cross-species transmission from animal reservoirs to humans is believed to be the major route of HEV transmission. In this study, the high sequence read abundance of HEV (10.58%) indicates a high risk of zoonotic transmission between humans and Tibetan pigs. This is in agreement with previous reports [18, 19]. In addition, three novel viruses, including porcine bufavirus, rabovirus, and pasivirus, may also be associated with diarrhoeal disease, which requires further investigation.

Fig. 1
figure 1

Family (a) and species (b) classification and percentage of virus sequences detected from diarrhoeic fecal samples from Tibetan pigs

A total of 10 out of the 146 faecal samples tested positive for the NS1 gene by PCR amplification (6.85% positive) (Table S1). All of the NS1 genes obtained in this study were 1738–1741 nt in length. Sequence alignment using the MegAlign program of DNAstar package version 7.0 (https://www.dnastar.com/) indicated that they shared 95.6%–96.3% nucleotide sequence identity with each other and were 90.7%–99.7% identical to the NS1 sequences of reference strains from pigs; however, they were only 43.9%–60.1% identical to those of reference strains originating from other species, including bats, canines, rats, and humans. An amino acid sequence alignment showed that the amino acid sequence identity among the ten NS1 genes ranged from 94.8% to 100% and that they shared 92.3%–99.3% amino acid sequence identity with other reference strains originating from domestic pigs. Interestingly, the 19GZ211 strain had a deletion of one amino acid at position 522 (Fig. S1), the same aa deletion that was found in the recently identified Henan019 strain in Henan province [20], more than a thousand kilometres away from the location of the Tibetan pig farms sampled in this study.

Phylogenetic analysis indicated that all of the NS1s in this study clustered in the swine-origin group, together with other bufaviruses that were identified in pigs. Notably, except for 18GZ201, which was found to be closely related to a PBuV (61/Austria/2014) identified in Austria in 2014 [21], nine of the ten strains clustered together with viruses from mainland China that are widely distributed in Guangxi, Guangdong, Fujian, Anhui, and Henan provinces [20] (Fig. 2a). The data in the present study show that the PBuVs from Tibetan pigs display abundant genetic diversity.

Fig. 2
figure 2

Phylogenetic tree based on NS1 gene (a) and full-length genomic sequences (b) of the bufavirus strains identified in Tibetan pigs and other reference strains. The strains identified in this study are indicated by red triangles. The reference bufaviruses originating from domestic pigs are indicated by black circles. The phylogenetic trees were constructed by the neighbor-joining method with 1,000 replicates using the MEGA7 program (http://www.megasoftware.net). The scale bar indicates the number of nucleotide substitutions per site.

The full-length genomic sequences of two strains (designated SMU-TP-YN18 and SMU-TP-GZ18) were successfully amplified from samples that were positive for bufavirus. These sequences have been submitted to the GenBank database (accession no. MN868670 and MN868671, respectively), and both genome sequences were 4,189 nucleotides in length. Analysis using the MegAlign program of the DNAstar package version 7.0 (https://www.dnastar.com/) revealed 96.0% nucleotide sequence identity between SMU-TP-YN18 and SMU-TP-GZ18 and 94.9% to 98.3% identity to reference genome sequences of bufaviruses of domestic pigs, but only 52.2% to 63.6% identity to reference strains from other species (Table 1).

Table 1 Genome positions, protein sizes, and nucleotide and amino acid sequence identity values for different regions of the SMU-TP-YN18 genome and SMU-TP-GZ18 compared with bufavirus reference strains

To examine the genetic variation between the two PBuVs, the nucleotide and deduced amino acid (aa) sequences of the NS1, VP1, SHP, and VP2 genes of these two isolates as well as three strains (FJNP2018, Anhui021, and 61/Austria/2014) from domestic pigs and four strains (BJ181, CBuV-88, MAG12-57, SY-2015) from humans, canines, bats, and rats, respectively, were compared. The results showed that NS1, VP1, SHP, and VP2 of SMU-TP-YN18 shared 95.6%–98.9% nucleotide (91.4%–97.4 aa) sequence identity with FJNP2018, a higher percentage than with the other reference strains. Meanwhile, NS1, VP1, SHP, and VP2 of the SMU-TP-GZ18 strain shared 98.6%–99.2% nucleotide (96.0%–99.5 aa) sequence identity with 61/Austria/2014, which was higher than that shared with other six PBuVs (Table 1). These data indicate that the two PBuVs in this study are closely related to PBuVs that originated from domestic pigs.

Further examination of the phylogenetic tree based on whole genome sequences revealed that the SMU-TP-YN18 strain from this study showed a close relationship to the FJNP2018 strain, which was recently identified in Fujian province of China [20]. The SMU-TP-GZ18 strain occupied a separate branch with 94% bootstrap support (Fig. 2b), suggesting a new evolution direction for this strain.

Bufaviruses are newly discovered zoonotic viruses that have been reported in numerous mammals and humans [22]. They are single-stranded non-segmented DNA viruses belonging to the genus Protoparvovirus, subfamily Parvoviridae (http://ictvonline.org). The bufavirus genome is about 4–6 kb in length and encodes a non-structural protein 1 (NS1), a putative structural protein 1 (VP1), a small hypothetical protein (SHP), and structural protein 2 (VP2) [22,23,24]. The virus was first detected in faecal samples from children with acute diarrhoea in Burkina Faso in 2012 [25]. Sporadic cases of diarrhoea with shedding of bufavirus have been reported in several countries, suggesting that the virus might be a possible occasional cause of diarrhoea in humans. To date, bufaviruses have been detected in humans and mammals such as rats, shrews, pigs, dogs, bats, and primates, implying a broad species host tropism with a high risk of interspecies transmission. Therefore, these viruses deserve continuous monitoring and special attention for control.

In conclusion, the results of the present study revealed 19 distinct viruses belonging to 11 viral taxonomic families, extending our knowledge of viral communities associated with porcine diarrhoeal disease for the first time to Tibetan pigs. Sequence comparisons and phylogenetic analysis based on the NS1 gene and whole-genome sequences of PBuVs indicated that the viruses in Tibetan pigs were more genetically diverse than those originating in Chinese domestic pigs. Because bufaviruses are newly discovered zoonotic viruses that can infect various mammals and humans, surveillance of PBuVs is of great significance for veterinary and public health.