RIP mutated ITS genes in populations of Ophiocordyceps sinensis and their implications for molecular systematics

Abstract

Different hypotheses have been proposed to interpret the observed unusual ITS (internal transcribed spacer) sequences in Ophiocordyceps sinensis. The coexistence of diverged ITS paralogs in a single genome was previously shown by amplifying the ITS region from mono-ascospore isolates using specific primers designed for different ITS paralog groups. Among those paralogs, are AT-biased ITS sequences which were hypothesized to result from repeat-induced point mutation (RIP). This is a process that detects and mutates repetitive DNA and frequently leads to epigenetic silencing, and these mutations have been interpreted as pseudogenes. Here we investigate the occurrence and frequency of ITS pseudogenes in populations of O. sinensis using large-scale sampling, and discusses the implications of ITS pseudogenes for fungal phylogenetic and evolutionary studies. Our results demonstrate a wide distribution of ITS pseudogenes amongst different geographic populations, and indicate how ITS pseudogenes can contribute to the reconstruction of the evolutionary history of the species.

Introduction

The nuclear ribosome DNA (nrDNA) is considered a classic example of concerted evolution in most eukaryotes (Nei and Rooney 2005; Ganley and Kobayashi 2007; Stage and Eickbush 2007). As one of the most popular components of nrDNA, the internal transcribed spacer (ITS) generally appears to evolve through concerted evolution, and thus has been widely used in species identifications and phylogenetic studies of diverse organisms (e.g. Buckler and Holtsford 1996; Denman et al. 2000; Hřibová et al. 2011; Rouland-Lefevre et al. 2002). It has also been selected as the universal fungal DNA barcode (Schoch et al. 2012). However, intra-genomic ITS polymorphisms, indicating potential escapes from concerted evolution, have long been reported in fungi (e.g. O’Donnell and Cigelnik 1997; Wang and Yao 2005). As one kind of intra-genomic polymorphisms, ITS pseudogenes are one category of such polymorphisms, and these have been reported in a variety of fungal species, including Colletotrichum graminicola, Cordyceps militaris, Epichloë spp., Laetiporus spp., Leptosphaeria maculans, Neurospora crassa, and Ophiocordyceps sinensis (Lindner and Banik 2011; Li et al. 2013; Li et al. 2017). Pseudogenes are nonfunctional genes that result from multiple mutations of parental active genes, and which are not usually transcribed.

Ophiocordyceps sinensis is a valued traditional Chinese medicine (Pegler et al. 1994), which has attracted increasing scientific attention in recent years and assumed considerable economic significance in some rural economies in the Himalayas (e.g. Shrestha and Bawa 2013; Liang 2018). ITS has been used as an important molecular marker in the identification and genetic analysis of this species (Liu et al. 2001; Kinjo and Zang 2001; Zhang et al. 2009), but dissimilar sequences which could have implications for phylogenetic analyses in this fungus have been obtained. Variations in the ITS among individuals of this species were first identified by Kinjo and Zang (2001), and the polymorphisms have been variously interpreted as genetic divergences (Kinjo and Zang 2001), different genotypes (Zhu et al. 2010), cryptic species (Stensrud et al. 2007), or potentially other species (Xiao et al. 2009).

Li et al. (2013) designed specific primers and amplified different ITS paralogs from mono-ascospore isolates of the same collection, which confirmed the coexistence of different haplotypes in single genomes of this species. Several divergent ITS sequences were highly AT-biased and identified as pseudogenes. A genome defence mechanism, repeat-induced point mutation (RIP), was subsequently shown to have a potential role in creating these pseudogenes (Li et al. 2017). RIPs were first discovered in 5S RNA genes of Neurospora crassa (Selker and Stevens 1985). These could result in duplicated sequences greater than ~ 400 bp (or ~ 1 kb for unlinked duplications) and introduce numerous G:C to A:T point mutations (up to ~ 30% of the G:C pair) during a single passage through the sexual cycle of that fungus (Cambareri et al. 1989; Watters et al. 1999). RIP-mutated sequences are frequent targets for DNA methylation, which can cause gene silencing in Neurospora (Rountree and Selker 1997), and these have been recognized as pseudogenes. While ITS pseudogenes have already been proved to exist in O. sinensis (see above), the distribution and evolutionary history of these ITS pseudogenes within O. sinensis populations remained unknown. This new study was designed to investigate the occurrence of RIP-mutated ITS pseudogenes in different population of O. sinensis using large-scale sampling and to test the utility of these pseudogenes in reconstructing the evolutionary history of the species.

Materials and methods

Taxon sampling and DNA sequencing

A total of 147 individuals were used in this study, including 86, 23, 18, 14 and 4 collected from Qinghai, Tibet, Sichuan, Yunnan and Gansu provinces, respectively, covering nearly the whole collecting regions of Ophiocordyceps sinensis in China (Table S1, Fig. 1). To investigate pseudogene occurrences at a finer scale, more extensive sampling was carried out in Guoluo (Qinghai province), especially in Maqên County, where 64 individuals were collected (Table S1). An isolate of O. emeiensis (2236), one individual of O. lanpingensis, and eight individuals of O. laojunshanensis were used as outgroups (Table S1).

Fig. 1
figure1

Sampling and detection of ITS pseudogenes in populations of Ophiocordyceps sinensis in China (a) and Guoluo, Qinghai (b), respectively. Red spots indicate the dominant ITS haplotype (haplotype 2), blue spots indicate ITS haplotype (haplotype 1) with the epitype designated by Li et al. (2013) and black spots indicate other haplotypes. Purple cycles indicate the detection of ITS pseudogenic paralogs

The individuals collected were dried with silica gel and one strain (2236) was maintained at 4 °C on Potato Dextrose Agar (PDA) slants. The stocks were transferred to new PDA slants and incubated at 25 °C for 20 d. Fresh mycelia were collected and used for DNA extraction. Total genomic DNA was extracted from either dried inner tissues or cultivated fresh mycelia using the modified CTAB method (Yao et al. 1999). The fungal universal primer pair ITS5/ITS4 (White et al. 1990) was used to amplify functional ITS sequences. Pseudogenes were amplified using 10- to 100-times dilutions of PCR products (amplified with ITS5/ITS4) as the templates and GCF/GCR (Li et al. 2013) as the primers. Amplifications were performed in a 25 μl PCR reaction system which contains 12.5 μl 2 × Taq PCR Master Mix (Tiangen Biotech, China), 0.25 μl of each primer (10 μM) and 1 μl DNA template with a thermal cycler (Applied Biosystems, Foster City, CA). The PCR conditions used were: 2 min at 94 °C; 30 cycles of 94 °C for 30 s, 53 °C (60 °C for the primers GCF/GCR) for 30 s and 72 °C for 45 s; and a final extension at 72 °C for 10 min. PCR products were directly sequenced using the same PCR primers on a capillary sequencer (Applied Biosystems 3730 Analyzer, Foster City, CA) by the Beijing Genomics Institute (Beijing, China). Direct sequencing of ITS pseudogenes for several individuals displayed polymorphisms at a few sites; a cloning method was therefore employed for those samples (Table S2). A 50 μl PCR reaction was used for cloning and sequencing, the PCR products were purified with the Gel Extraction Kit (CWbiotech, Beijing, China), cloned into vectors using the pEASY-T1 Simple Cloning Kit (TransGen Biotech, Beijing, China), and then transformed into Trans1-T1 phage resistant chemically competent cells (TransGen Biotech, Beijing, China). At least one positive clone for each individual was sequenced using primers M13F/SR in the vector. The ambiguous base pairs in obtained sequences were manually edited by checking the chromatograms of both directions before submitting to the alignment.

Sequence dataset and haplotype analyses

Nine ITS pseudogenes of O. sinensis, one ITS sequence of O. emeiensis (AJ309347) and three ITS sequences of O. lanpingensis (HQ654775, HQ654776 and HQ654777) were downloaded from GenBank and included in the analyses together with sequences obtained in this study (Table S2). The 18S and 28S nrDNA regions were removed before analyses. Haplotypes of identical sequences were identified and categorized using DAMBE version 4.2.13 (Xia and Xie 2001). The sequence alignment used in this study was deposited in TreeBASE (http://purl.org/phylo/treebase/phylows/study/TB2:S24803).

Length variations, GC contents, sequence divergence levels, and RIP analyses

The lengths and GC contents of the ITS1–5.8S-ITS2 regions were calculated using BioEdit version 7.0.9.0 (Hall 1999). Sequence divergence levels (average p-distance) of functional ITS sequences and pseudogenes were analyzed by MEGA5 (Tamura et al. 2011). Pairwise deletion was used for missing data. RIP analyses were performed using RIPCAL (Hane and Oliver 2008). Sequences with the highest GC content were used as consensus.

ITS pseudogene identification and mutation analyses

The ITS pseudogenes in O. sinensis were characterized by lower GC contents, different secondary structures, and higher minimum free energies of the 5.8S rRNA transcripts compared with their functional counterparts, due to a number of random G:C to A:T transition mutations introduced by RIP (Li et al. 2013, 2017). The G:C to A:T transition mutations in ITS sequences were analyzed and used as the main criteria for ITS pseudogene identification. Sequences amplified with primer pair GCF/GCR were aligned and compared with the functional counterparts (amplified by ITS5/ITS4 from the same individuals). Those sequences in which G:C to A:T mutations occurred compared to functional ones were identified as pseudogenes. C → T, G → A and other mutation types were counted for different ITS regions (ITS1, 5.8S and ITS2) of the identified pseudogenes using functional counterparts from the same individuals as references.

Phylogenetic analyses

Sequences were aligned with ClustalW (Thompson et al. 1994) and the alignments were manually refined with BioEdit. The boundaries of the ITS1, 5.8S, and ITS2 regions were determined according to previously published ITS sequences, and the 18S and 28S regions were removed from the alignments. ITS phylogenies were constructed using Maximum Likelihood (ML) and Bayesian Inference (BI). ML analyses were conducted with RAxML version 7.2.6 (Stamatakis 2006) to obtain the best ML tree as well as to determine bootstrap supports (BS). The best-fit nucleotide substitution model GTR + G was determined using MrModeltest version 2.2 (Nylander 2004) based on Akaike Information Criterion (Akaike 1974). Node supports were assessed with 1000 bootstrap replicates. Bayesian analyses were implemented with MrBayes v3.2.6 (Ronquist et al. 2012) using the same substitution model. Two independent analyses of two parallel runs and four chains were carried out for 5,000,000 generations. Trees were sampled every 1000 generations, the first 20% of the trees being discarded as burn in. A consensus tree and posterior probabilities (PP) were calculated. Ophiocordyceps emeiensis, O. lanpingensis, and O. laojunshanensis were used as outgroup taxa. Different phylogenetic analyses were performed independently for the three ITS datasets, i.e. functional and pseudogenes separately, and the combined dataset.

Results

Amplification, identification and geographic distribution of ITS pseudogenes

A total of 268 ITS sequences were obtained from the 147 individuals of O. sinensis, among which 123 were identified as pseudogenes (Table S2). Of those pseudogenes, 15 were obtained by direct PCR and sequencing, and the others (108) by cloning and sequencing. ITS pseudogenes were detected from ~ 29% (43 out of 147) of the tested individuals and were found to be widely distributed in populations from Qinghai, Tibet, Sichuan, and Gansu provinces, but were not observed in any from Yunnan Province (Fig. 1, Table S2). The occurrence rate of ITS pseudogenes was ~ 24% (18 out of 76) in Guoluo, Qinghai, and ~ 26% (17 out of 66) in Maqên, Guoluo, respectively (Table S1). According to the DAMBE analyses, the 130 ITS pseudogenes (nine from GenBank) represented 57 haplotypes (unique sequences), and the 147 functional genes represented 17 haplotypes. The number of ITS pseudogene haplotypes varied among the 43 individuals in which they were detected, ranging from one (e.g. CS431, CS433, CS437) to eight (CS46) haplotypes per individual (Fig. 2b).

Fig. 2
figure2figure2

Continued

Length variations, GC contents and sequence divergences

The length of the functional ITS sequences was 562 bp in most individuals of O. sinensis. The exceptions were two (CS4 and CS6) collected in Dawu and Xiaojin County, Sichuan Province; in these an 8-bp motif 5′-CGCCCCGG-3′ in ITS1 was found to be missing (Table 1). Regions of 95 bp in ITS1 and 50 bp in ITS2 were not obtained for the ITS pseudogenes of O. sinensis because the PCR primers were located inside the two spacer regions. The lengths of the amplified region of the ITS pseudogenes were generally 417 bp, the same as the corresponding region in functional counterparts (Table S2).

Table 1 ITS1–5.8S-ITS2 Length variation, GC content and p-distance analyses of O. sinensis and related species studied

The GC contents of ITS functional sequences varied amongst O. emeiensis, O. lanpingensis, O. laojunshanensis and O. sinensis, ranging from 59.58 to 59.86% (avg. 59.72%), 59.45 to 60.38% (avg. 60.19%), 62.46 to 62.77% (avg. 62.57%), and 59.79 to 63.70% (avg. 63.13%), respectively (Table 1). The GC contents of the ITS pseudogenes in O. sinensis were lower than those in the functional counterparts, ranging from 44.36 to 61.39% (avg. 51.33%) (Table S2). The significantly lower GC contents resulted from an accumulation of G:C to A:T transition mutations, identified in the RIP analyses.

The average p-distances for O. emeiensis, O. lanpingensis, O. laojunshanensis and ITS functional sequences of O. sinensis were 0.007, 0.009, 0.005, and 0.006, respectively. However, the ITS pseudogenes of O. sinensis had an average p-distance of 0.044, significantly greater than functional genes of the species and the other closely related species examined (Table 1). This indicates that the pseudogenes are more divergent than their functional counterparts.

RIP and mutation analyses

The RIPCAL analyses showed that RIP (characterized by a high rate of G:C to A:T transitions) occurred in all the ITS pseudogenes (Fig. S1). CpA → TpA, TpG → TpA dinucleotides were found to be more frequently mutated than the other dinucleotides (Fig. S1). The numbers of RIP mutations were generally close to each other in nearly all the ITS pseudogenes, and varied in only a few cases (e.g. in some clones of CS454, CS455 and CS46) in which RIP mutations occurred in less than half of the ITS region in those sequences (Fig. S1). The average frequencies of C → T and G → A transition mutations for ITS pseudogenes in O. sinensis were 7.07 and 6.33%, respectively, while the frequency of all the other mutation types accounted for less than 0.6%. Mutations occurred in both the coding (5.8S) and two non-coding spacers (ITS1 and ITS2) (Table S3).

Phylogenetic analyses

A total of 147 functional ITS sequences and 98 pseudogenes of O. sinensis were included in the phylogenetic analyses. Fourteen sequences from three closely related species, i.e. two from O. emeiensis, four from O. lanpingensis, and eight from O. laojunshanensis were used as outgroups. The consensus tree generated from the Bayesian analyses had a similar topology to the best-scoring ML tree (Fig. 2). Analyses using the three different datasets (functional ITS, pseudogenes, and the combined dataset) all supported the monophylies of the three outgroup species with high support values (BS = 98–100%, PP = 1.00, Fig. 2). A sister relationship between O. laojunshanensis and O. sinensis was supported using functional genes and pseudogenes alone (BS = 100%, PP = 1.00, Fig. 2b), but not by the combined dataset (Fig. 2a). Ophiocordyceps laojunshanensis was displayed as the closest relative of O. sinensis, followed by O. lanpingensis and then O. emeiensis. The monophyly of O. sinensis was highly supported (BS = 100%, PP = 1.00) by the pseudogenes, but was only weakly supported using either functional sequences or the combined dataset in both ML and Bayesian analyses (Fig. 2). The monophyly of the ITS pseudogenes was strongly supported by the ML (BS = 100%) and Bayesian (PP = 1.00) analyses using pseudogenes alone or in the combined dataset (Fig. 2).

Discussion

The ITS phylogeny revealed O. laojunshanensis as the closest known relative of O. sinensis, and individuals from northwestern Yunnan were more closely related to that species than those from other areas (Fig. 2). As O. laojunshanensis appears to be confined to northwestern Yunnan and also occurs in mountainous areas at high elevations, O. sinensis is most likely to have originated in northwestern Yunnan by sympatric speciation and not from Lingzhi, Tibet as suggested by Zhang et al. (2009).

The coexistence of ITS functional genes and pseudogenes within single individuals was confirmed either by PCR amplification and sequencing or by cloning and subsequent sequencing. ITS pseudogenes were detected from nearly one-third of the individuals examined, covering a wide geographical range in regions of Qinghai, Tibet, Sichuan, and Gansu Provinces. It was surprising that no pseudogenes were detected from populations in Yunnan Province, even though 14 individuals from there were tested.

The nuclear genome of O. sinensis experienced a large expansion in genome size due to an expanded repeat content compared to other closely related hypocrealean species (Hu et al. 2013; Li et al. 2016a; Xia et al. 2017). Given that RIP was shown to induce rRNA pseudogenes in filamentous ascomycetes (Li et al. 2017), including in the genome of O. sinensis (Hu et al. 2013), it is reasonable to hypothesize that the occurrence of RIP induced ITS pseudogenes accompanied this species’ genome expansion and speciation. This suggests that the ITS pseudogenes mainly arose after, but not prior to, the origin of O. sinensis, and the genomes of O. laojunshanensis and other outgroup taxa have not been found to have expanded in the same way (data not shown). However, evidence from genome analyses of these outgroup taxa together with additional genomes of O. sinensis, especially from northwestern Yunnan Province, are needed to test whether RIP induced ITS pseudogenes really corresponded with the divergence of this species.

The RIP-mediated mechanism of mutation is only known to occur during sexual reproduction (Freitag et al. 2002). As O. sinensis is one of the only known homothallic species in Ophiocordycipitaceae, allowing individual strains to mate with themselves (Bushley et al. 2013; Hu et al. 2013), it is possible that the homothallic life-style may have increased the likelihood and frequency of sex in O. sinensis, leading to increased RIP in this species. It should also be noted that RIP may not be confined to the ITS region but could extend to the entire nrDNA operon (Li et al. 2017) and also might be able to impact other single-copy sequences adjacent to duplicated sequences (Irelan et al. 1994; Li 2013a).

Haplotype analyses revealed that different ITS pseudogenes could exist in single individuals of O. sinensis, that individuals possessing the same functional ITS haplotypes could have the same (e.g. in CS51 and CS559) or different (e.g. in CS12 and CS51) pseudogene ITS haplotypes, and that individuals with different functional haplotypes might share the same pseudogenic ITS haplotypes (as in CS402, CS431 and CS436). Since the divergences in functional copies of the ITS were assumed to be correlated with species diversification in O. sinensis (Jiao 2010; Zhang et al. 2009), our results suggest that ITS pseudogenes could have emerged either before or after a new functional ITS haplotype was generated (by random mutation) and homogenized (through concerted evolution). This pattern could be explained by an RIP mechanism, responsible for nrDNA pseudogene formation, in which mutation occurred randomly. The RIP mutated pseudogenes would not face the same evolutionary constraints as functional genes but could be preserved through generations for extended periods of time (Li et al. 2017). If the pseudogene haplotypes arose by a shared RIP mutation, individuals sharing the same pseudogenic ITS haplotypes should have the same ancestors, and RIP mutated pseudogenes could serve as good candidate markers to reconstruct the evolution, diversification and dispersal history of O. sinensis.

The ITS region has been widely used in species identifications and fungal phylogenies (e.g. James et al. 2006; Stensrud et al. 2007), and is now accepted as the universal DNA barcode to be adopted in all fungal groups (Schoch et al. 2012) and is also being applied to quantify and characterize environmental fungal diversity (e.g. O’Brien et al. 2005; Buée et al. 2009). The occurrence of ITS pseudogenes has implications for both types of application. Their use in phylogenetic reconstruction has been discussed since their discovery, and they have been used as outgroups in phylogenetic analyses (Buckler and Holtsford 1996) when functional counterparts from closely related species were not available (Razafimandimbison et al. 2004). However, ITS pseudogenes may result in long-branch attraction causing phylogenetic errors (Buckler et al. 1997; Wei et al. 2003; Won and Renner 2005) and may not always cluster with the conspecific functional sequences (e.g. Muir et al. 2001). In this study, the ITS sequences from O. sinensis formed a distinct monophyletic clade in ML and BI phylogenetic analyses regardless of whether functional ITS genes, ITS pseudogenes, or the combined dataset were used – although the clade was not well supported in some cases. These results suggest that both functional and pseudogene ITS sequences can accurately distinguish this species from other closely related Ophiocordyceps species. However, if distance based methods such as Neighbour Joining were used, the distinct clustering of functional and pseudogene clades could be disturbed, and sequences from the sister species O. laojunshanensis were merged (Figure S2). The clustering pattern revealed in a distance based phylogeny would largely depend on the RIP mutation number (Li et al. 2017). There are other cases where pseudogenes were not clustered with conspecific functional sequences (e.g. Lindner and Banik 2011), but ITS pseudogenes are not likely be a critical threat to fungal taxonomy, phylogenetics and related applications as they are rarely detected using standard PCR protocols and can be technically identified (Bailey et al. 2003; Hartmann et al. 2001; Muir et al. 2001; Xiao et al. 2010). Nonetheless, the prevalence of ITS pseudogenes in this species suggests that ITS functional sequences rather than pseudogenes should be used in evolutionary phylogenetic studies, and that the recognition of rDNA pseudogenes prior to such analyses is required. Caution is therefore needed when using ITS as a sole basis for fungal species identification in O. sinensis (Kinjo and Zang 2001; Stensrud et al. 2007; Xiao et al. 2009; Zhu et al. 2010). Pseudogenes are of particular concern in relation to studies of fungal community analyses in environmental samples using high-throughput sequencing technologies; divergent rDNA pseudogenes would be unknowingly sequenced and most probably recognized as other species, so leading to overestimates of the fungal species diversity. It is clear that effective methods for rDNA pseudogene identification in sampless are an urgent need for both systematic and environmental mycology.

Ophiocordyceps laojunshanensis clustered with O. sinensis in a seven gene phylogenetic analysis and was thus considered as a synonym by Li (2013b), in contradiction to our study. This discrepancy was most probably attributable to insufficient and imbalanced taxon sampling; ITS sequences of O. laojunshanensis cluster with those of O. sinensis if fewer sequences of the former are used, but can be separated with a higher support value if more sequences are included as revealed by a parallel phylogenetic analysis (data not shown). The two species can in any case be distinguished morphologically by characters of the stromata and perithecia (Chen et al. 2011), especially when mature.

Sequences from type material will facilitate the discrimination of these two species and contribute to fixing the species concept of O. sinensis. We attempted to amplify ITS sequences from two isotypes of the basionym of O. sinensis, Sphaeria sinensis, now preserved in the Kew Fungarium (K(M) 166,697 and K(M) 32,163), but without success. In order to precisely fix the application of the name for the future, we plan to formally designate a sequenced specimen from one of the core regions in which the species is found (CS769, from Maqên, Guoluo, Qinghai; HMAS 281396) as epitype in a forthcoming publication, along with details of the molecular studies attempted on the isotypes, and the formal selection of one of the isotypes as lectotype for the name. Ninety-one individuals, including 1 from Gansu, 74 from Qinghai, 10 from Tibet, 1 from Yunnan, and 5 from Sichuan Provinces share the identical functional ITS sequences (haplotype) with the epitype we are to designate (Fig. 2).

ITS pseudogenes were less frequently observed in individuals from the edge of the Tibetan Plateau than those from the central part of the Plateau (Fig. 1). This observation also supports the hypothesis that ITS pseudogenes arose after, not prior to, the speciation of O. sinensis. We speculate that, if northwestern Yunnan was indeed the centre of origin of O. sinensis, pseudogenes may have accumulated along with the species diversification as it spread across the Plateau.

Conclusions

The first discovery of pseudogenic ITS sequences in Ophiocordyceps sinensis was in 2001. Since that time, different hypotheses have been proposed to explain these, including representing cryptic species, distinct species, or different genotypes. This study proved that ITS functional genes and pseudogenes coexist in single individuals by PCR amplification using mono-ascospore isolates. Our large scale sampling demonstrated that ITS pseudogenes were widely detected simultaneously along with functional ITS genes in many individuals gathered from diverse populations and localities. ITS pseudogenes, whether from mono-ascospore isolates, tissue isolates, or from dried specimens, all clustered in a highly supported clade, together with the functional ITS clade of O. sinensis, suggesting that in addition to functional ITS sequences, ITS pseudogenes can accurately circumscribe this species and so may not pose a great threat to the use of the ITS region as a universal barcode for its identification. The patterns revealed by our phylogenetic analyses support the hypothesis that ITS pseudogenes evolved through RIP mutation throughout the evolutionary history of O. sinensis. They also provide evidence that ITS pseudogenes occur frequently and have a wider distribution in populations of the species than previously appreciated, and are not restricted to certain individuals or production areas.

Availability of data and materials

The datasets analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

BI:

Bayesian Inference

BS:

Bootstrap support

CTAB:

Cetyltrimethylammonium Bromide

ITS:

Internal transcribed spacer

ML:

Maximum likelihood

nrDNA:

Nuclear ribosome DNA

PCR:

Polymerase chain reaction

PDA:

Potato Dextrose Agar

PP:

Posterior probabilities

RIP:

Repeat-induced point mutation

References

  1. Akaike H (1974) A new look at the statistical model identification. IEEE Trans Automatic Control 19:716–723

    Google Scholar 

  2. Bailey CD, Carr TG, Harris SA, Hughes CE (2003) Characterization of angiosperm nrDNA polymorphism, paralogy, and pseudogenes. Mol Phylogenet Evol 29:435–455

    PubMed  CAS  Google Scholar 

  3. Buckler ES, Holtsford TP (1996) Zea systematics: ribosomal ITS evidence. Mol Biol Evol 13:612–622

    PubMed  CAS  Google Scholar 

  4. Buckler ES, Ippolito A, Holtsford TP (1997) The evolution of ribosomal DNA: divergent paralogues and phylogenetic implications. Genetics 145:821–832

    PubMed  CAS  Google Scholar 

  5. Buée M, Reich M, Murat C, Morin E, Nilsson RH, Uroz S, Martin F (2009) 454 pyrosequencing analyses of forest soils reveal an unexpectedly high fungal diversity. New Phytol 184:449–456

    PubMed  Google Scholar 

  6. Bushley KE, LiY WWJ, Wang XL, Jiao L, Spatafora JW, Yao YJ (2013) Isolation of the MAT1-1 mating type idiomorph and evidence for selfing in the Chinese medicinal fungus Ophiocordyceps sinensis. Fungal Biol 117:599–610

    PubMed  CAS  Google Scholar 

  7. Cambareri EB, Jensen BC, Schabtach E, Selker EU (1989) Repeat-induced G-G to A-T mutations in Neurospora. Science 244:1571–1575

    PubMed  CAS  Google Scholar 

  8. Chen JY, Cao YQ, Yang DR, Li MH (2011) A new species of Ophiocordyceps (Clavicipitales, Ascomycota) from southwestern China. Mycotaxon 115:1–4

    Google Scholar 

  9. Denman S, Crous PW, Taylor JE, Kang JC, Pascoe I, Wingfield MJ (2000) An overview of the taxonomic history of Botryosphaeria and a re-evaluation of its anamorphs based on morphology and ITS rDNA phylogeny. Stud Mycol 45:129–140

    Google Scholar 

  10. Freitag M, Williams RL, Kothe GO, Selker EU (2002) A cytosine methyltransferase homologue is essential for repeat-induced point mutation in Neurospora crassa, Proceedings of the National Academy of Sciences of the United States of America, vol 99, pp 8802–8807

    Google Scholar 

  11. Ganley ARD, Kobayashi T (2007) Highly efficient concerted evolution in the ribosomal DNA repeats: total rDNA repeat variation revealed by whole-genome shotgun sequence data. Genome Res 17:184–191

    PubMed  PubMed Central  CAS  Google Scholar 

  12. Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp Ser 41:95–98

    CAS  Google Scholar 

  13. Hane J, Oliver R (2008) RIPCAL: a tool for alignment-based analysis of repeat-induced point mutations in fungal genomic sequences. BMC Bioinform 9:478

    Google Scholar 

  14. Hartmann S, Nason JD, Bhattacharya D (2001) Extensive ribosomal DNA genic variation in the columnar cactus Lophocereus. J Mol Evol 53:124–113

    PubMed  CAS  Google Scholar 

  15. Hřibová E, Čížková J, Christelová P, Taudien S, de Langhe E, Doležel J (2011) The ITS1-5.8S-ITS2 sequence region in the Musaceae: structure, diversity and use in molecular phylogeny. PLoS One 6:e17863

    PubMed  PubMed Central  Google Scholar 

  16. Hu X, Zhang YJ, Xiao GH, Zheng P, Xia YL, Zhang XY, St Leger RJ, Wang CS (2013) Genome survey uncovers the secrets of sex and lifestyle in caterpillar fungus. Chin Sci Bull 58:2846–2854

    CAS  Google Scholar 

  17. Irelan JT, Hagemann AT, Selker EU (1994) High frequency repeat-induced point mutation is not associated with efficient recombination in Neurospora. Genetics 138:1093–1103

    PubMed  PubMed Central  CAS  Google Scholar 

  18. James TY, Kauff F, Schoch CL, Matheny PB, Hofstetter V, Cox CJ, Celio G, Gueidan C, Fraker E, Miadlikowska J, Lumbsch HT, Rauhut A, Reeb V, Arnold AE, Amtoft A, Stajich JE, Hosaka K, Sung GH, Johnson D, O’Rourke B, Crockett M, Binder M, Curtis JM, Slot JC, Wang Z, Wilson AW, Schübler A, Longcore JE, O’Donnell K, Mozley-Standridge S, Porter D, Letcher PM, Powell MJ, Taylor JW, White MM, Griffith GW, Davies DR, Humber RA, Morton JB, Sugiyama J, Rossman AY, Rogers JD, Pfister DH, Hewitt D, Hansen K, Hambleton S, Shoemaker RA, Kohlmeyer J, Volkmann-Kohlmeyer B, Spotts RA, Serdani M, Crous PW, Hughes KW, Matsuura K, Langer E, Langer G, Untereiner WA, Lücking R, Büdel B, Geiser DM, Aptroot A, Diederich P, Schmitt I, Schultz M, Yahr R, Hibbett DS, Lutzoni F, McLaughlin DJ, Spatafora JW, Vilgalys R (2006) Reconstructing the early evolution of Fungi using a six-gene phylogeny. Nature 443:818–822

    CAS  Google Scholar 

  19. Jiao L (2010) Phylogeographic study on Ophiocordyceps sinensis. PhD thesis, Graduate School of Chinese Academy of Sciences, Beijing

    Google Scholar 

  20. Kinjo N, Zang M (2001) Morphological and phylogenetic studies on Cordyceps sinensis distributed in southwestern China. Mycoscience 42:567–574

    Google Scholar 

  21. Li Y (2013a) Evolution of rDNA in Ophicordyceps sinensis and related filamentous ascomycetes. PhD thesis, University of Chinese Academy of Sciences, Beijing

    Google Scholar 

  22. Li Y (2013b) Epitypification, taxonomy and genetic diversity research on Ophiocordyceps sinensis. MSc thesis, Shanxi University, Taiyuan, Shanxi

    Google Scholar 

  23. Li Y, Hsiang T, Yang RH, Hu XD, Wang K, Wang WJ, Wang XL, Jiao L, Yao YJ (2016a) Comparison of different sequencing and assembly strategies for a repeat-rich fungal genome, Ophiocordyceps sinensis. J Microbiol Methods 128:1–6

    PubMed  CAS  Google Scholar 

  24. Li Y, Jiao L, Yao YJ (2013) Non-concerted ITS evolution in fungi, as revealed from the important medicinal fungus Ophiocordyceps sinensis. Mol Phylogenet Evol 68:373–379

    PubMed  Google Scholar 

  25. Li Y, Yang RH, Jiang L, Hu XD, Wu ZJ, Yao YJ (2017) rRNA pseudogenes in filamentous ascomycetes as revealed by genome data. G3-Genes Genomes Genet 7:2695–2703

    CAS  Google Scholar 

  26. Liang S (2018) Missing pieces in the story of a caterpillar fungus – Ophiocordyceps sinensis. IMA Fungus 9:75–77

    Google Scholar 

  27. Lindner D, Banik M (2011) Intra-genomic variation in the ITS rDNA region obscures phylogenetic relationships and inflates estimates of operational taxonomic units in genus Laetiporus. Mycologia 103:731–740

    PubMed  Google Scholar 

  28. Liu ZY, Yao YJ, Liang ZQ, Liu AY, Pegler DN, Chase MW (2001) Molecular evidence for the anamorph–teleomorph connection in Cordyceps sinensis. Mycol Res 105:827–832

    CAS  Google Scholar 

  29. Muir G, Fleming CC, Schlötterer C (2001) Three divergent rDNA clusters predate the species divergence in Quercus petraea (Matt.) Liebl. and Quercus robur L. Mol Biol Evol 18:112–119

    PubMed  CAS  Google Scholar 

  30. Nei M, Rooney AP (2005) Concerted and birth-and-death evolution of multigene families. Annu Rev Genet 39:121–152

    PubMed  PubMed Central  CAS  Google Scholar 

  31. Nylander JAA (2004) MrModeltest 2.2. Program distributed by the author. Evolutionary Biology Centre, Uppsala University, Uppsala. https://github.com/nylander/MrModeltest2

  32. O’Brien HE, Parrent JL, Jackson JA, Moncalvo JM, Vilgalys R (2005) Fungal community analysis by large-scale sequencing of environmental samples. Appl Environ Microbiol 71:5544–5550

    PubMed  PubMed Central  Google Scholar 

  33. O’Donnell K, Cigelnik E (1997) Two divergent intragenomic rDNA ITS2 types within a monophyletic lineage of the fungus Fusarium are nonorthologous. Mol Phylogenet Evol 7:103–116

    PubMed  Google Scholar 

  34. Pegler DN, Yao YJ, Li Y (1994) The Chinese ‘caterpillar fungus’. Mycologist 8:3–5

    Google Scholar 

  35. Razafimandimbison SG, Kellogg EA, Bremer B (2004) Recent origin and phylogenetic utility of divergent ITS putative pseudogenes: a case study from Naucleeae (Rubiaceae). Syst Biol 53:177–192

    PubMed  Google Scholar 

  36. Ronquist F, Teslenko M, van der Mark P, Ayres DL, Darling A, Höhna S, Larget B, Liu L, Suchard MA, Huelsenbeck JP (2012) MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol 61:539–542

    PubMed  PubMed Central  Google Scholar 

  37. Rouland-Lefevre C, Diouf MN, Brauman A, Neyra M (2002) Phylogenetic relationships in Termitomyces (Family Agaricaceae) based on the nucleotide sequence of ITS: a first approach to elucidate the evolutionary history of the symbiosis between fungus-growing termites and their fungi. Mol Phylogenet Evol 22:423–429

    PubMed  CAS  Google Scholar 

  38. Rountree MR, Selker EU (1997) DNA methylation inhibits elongation but not initiation of transcription in Neurospora crassa. Genes Dev 11:2383–2395

    PubMed  PubMed Central  CAS  Google Scholar 

  39. Schoch CL, Seifert KA, Huhndorf S, Robert V, Spouge JL, Levesque CA, Chen W, Consortium FB (2012) Nuclear ribosomal internal transcribed spacer (ITS) region as a universal DNA barcode marker for Fungi. Proc Natl Acad Sci U S A 109:6241–6246

    PubMed  PubMed Central  CAS  Google Scholar 

  40. Selker EU, Stevens JN (1985) DNA methylation at asymmetric sites is associated with numerous transition mutations. Proc Natl Acad Sci U S A 82:8114–8118

    PubMed  PubMed Central  CAS  Google Scholar 

  41. Shrestha UB, Bawa KS (2013) Trade, harvest, and conservation of caterpillar fungus (Ophiocordyceps sinensis) in the Himalayas. Biol Conserv 159:514–520

    Google Scholar 

  42. Stage DE, Eickbush TH (2007) Sequence variation within the rRNA gene loci of 12 Drosophila species. Genome Research 17:1888–1897.

  43. Stamatakis A (2006) RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690

    PubMed  CAS  Google Scholar 

  44. Stensrud Ø, Schumacher T, Shalchian-Tabrizi K, Svegårdena IB, Kauserud H (2007) Accelerated nrDNA evolution and profound AT bias in the medicinal fungus Cordyceps sinensis. Mycol Res 111:409–415

    PubMed  CAS  Google Scholar 

  45. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony method. Mol Biol Evol 28:2731–2739

    PubMed  PubMed Central  CAS  Google Scholar 

  46. Thompson JD, Higgins DG, Gibson TJ (1994) CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position, specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673–4680

    PubMed  PubMed Central  CAS  Google Scholar 

  47. Wang DM, Yao YJ (2005) Intrastrain internal transcribed spacer heterogeneity in Ganoderma species. Can J Microbiol 51:113–121

    PubMed  CAS  Google Scholar 

  48. Watters MK, Randall TA, Margolin BS, Selker EU, Stadler DR (1999) Action of repeat-induced point mutation on both strands of a duplex and on tandem duplications of various sizes in Neurospora. Genetics 153:705–714

    PubMed  PubMed Central  CAS  Google Scholar 

  49. Wei XX, Wang XQ, Hong DY (2003) Marked intragenomic heterogeneity and geographical differentiation of nrDNA ITS in Larix potaninii (Pinaceae). J Mol Evol 57:623–635

    PubMed  CAS  Google Scholar 

  50. White TJ, Bruns T, Lee S, Taylor JW (1990) Amplification and direct sequencing of fungal ribosomal RNA genes for phylogenetics. In: Innis MA, Gelfand DH, Sninsky JJ, White TJ (eds) PCR protocols: a guide to methods and applications. Academic Press, San Diego, pp 315–322

    Google Scholar 

  51. Won H, Renner SS (2005) The internal transcribed spacer of nuclear ribosomal DNA in the gymnosperm Gnetum. Mol Phylogenet Evol 36:581–597

    PubMed  CAS  Google Scholar 

  52. Xia EH, Yang DR, Jiang JJ, Zhang QJ, Liu Y, Liu YL, Zhang Y, Zhang HB, Shi C, Tong Y, Kim C, Chen H, Peng YQ, Yu Y, Zhang W, Eichler EE, Gao LZ (2017) The caterpillar fungus, Ophiocordyceps sinensis, genome provides insights into highland adaptation of fungal pathogenicity. Sci Rep 7:1806

    PubMed  PubMed Central  Google Scholar 

  53. Xia X, Xie Z (2001) DAMBE: data analysis in molecular biology and evolution. J Hered 92:371–373

    PubMed  CAS  Google Scholar 

  54. Xiao LQ, Möller M, Zhu H (2010) High nrDNA ITS polymorphism in the ancient extant seed plant Cycas: incomplete concerted evolution and the origin of pseudogenes. Mol Phylogenet Evol 55:168–177

    PubMed  CAS  Google Scholar 

  55. Xiao W, Yang JL, Zhu P, Cheng KD, He HX, Zhu HX, Wang Q (2009) Nonsupport of species complex hypothesis of Cordyceps sinensis by targeted rDNA ITS sequence analysis. Mycosystema 28:724–730

    CAS  Google Scholar 

  56. Yao YJ, Pegler DN, Chase MW (1999) Application of ITS (nrDNA) sequences in the phylogenetic study of Tyromyces s.l. Mycol Res 103:219–229

    CAS  Google Scholar 

  57. Zhang YJ, Xu LL, Zhang S, Liu XZ, An ZQ, Wang M, Guo YL (2009) Genetic diversity of Ophiocordyceps sinensis, a medicinal fungus endemic to the Tibetan plateau: implications for its evolution and conservation. BMC Evol Biol 9:290

    PubMed  PubMed Central  Google Scholar 

  58. Zhu JS, Gao L, Li XH, Yao YS, Zhao JQ, Zhou YJ, Lu JH (2010) Maturational alteration of oppositely orientated rDNA and differential proliferation of GC and AT-biased genotypes of Ophiocordyceps sinensis and Paecilomyces hepiali in natural Cordyceps sinensis. Am J Biomed Sci 2:217–238

    CAS  Google Scholar 

Download references

Acknowledgements

We thank Fei Gu from Sichuan and Phurbu-Dorji from the Tibet Plateau Institute of Biology for their assistance in collecting specimen of Ophiocordyceps sinensis.

Funding

This work is supported by the National Natural Science Foundation of China (31400018, 30025002), the Ministry of Science and Technology of China (2012CB126308, 2012FY111600, 2013BAD16B01) and the Chinese Academy of Sciences (KSCX2-YW-G-076, KSCX2-YW-G-074-04, KSCX2-SW-101C and the scheme of Introduction of Overseas Outstanding Talents).

Author information

Affiliations

Authors

Contributions

YL, HJW, RHY and KEB collected samples. DNA isolation and ITS amplification were performed by YL, LJ, KW and HJW. YL and YuJY analyzed data. YL wrote the original draft, KEB, DLH and YiJY reviewed and edited the draft and contributed to the discussion. ZJW and YiJY supervised the project. All authors read and approved the manuscript.

Corresponding authors

Correspondence to Zujian Wu or Yi-Jian Yao.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1

. Voucher information and ITS pseudogene amplification. Table S2. Length variation and GC content of ITS functional and pseudogenic sequences used in this study. Table S3. Mutation analyses of ITS pseudogenes.

Additional file 2: Fig. S1

. RIP mutation in ITS sequences of Ophiocordycepssinensis shown as RIPCAL output. Functional sequences with the highest GC content, i.e., CS223 and CS234, were defined as consensus. Black, invariant nucleotide; white, gap; red, CpA ↔ TpA or TpG ↔ TpA mutations; dark blue, CpC ↔ TpC or GpG ↔ GpA mutations; green, CpG ↔ TpG or CpG ↔ CpA mutations; pale blue, CpT ↔ TpT or ApG ↔ ApA mutations.

Additional file 3: Fig. S2

. Neighbor joining phylogenetic tree inferred from the combined dataset of ITS genes and pseudogenes.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Jiang, L., Wang, K. et al. RIP mutated ITS genes in populations of Ophiocordyceps sinensis and their implications for molecular systematics. IMA Fungus 11, 18 (2020). https://doi.org/10.1186/s43008-020-00040-0

Download citation

Keywords

  • Caterpillar fungus
  • Cordyceps sinensis
  • ITS pseudogene
  • Haplotype
  • Phylogeny