The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics

Wang, Aijun; Pang, Linxiu; Wang, Na; Ai, Peng; Yin, Desuo; Li, Shuangcheng; Deng, Qiming; Zhu, Jun; Liang, Yueyang; Zhu, Jianqing; Li, Ping; Zheng, Aiping

doi:10.1038/s41598-018-33752-w

The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics

Article
Open access
Published: 18 October 2018

Volume 8, article number 15413, (2018)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics

Download PDF

Aijun Wang ORCID: orcid.org/0000-0002-2985-5796^1,2,3^na1,
Linxiu Pang¹^na1,
Na Wang¹,
Peng Ai¹,
Desuo Yin⁴,
Shuangcheng Li^1,2,3,
Qiming Deng^1,2,3,
Jun Zhu^1,2,3,
Yueyang Liang^1,2,3,
Jianqing Zhu ORCID: orcid.org/0000-0003-4921-1215¹,
Ping Li^1,2,3 &
…
Aiping Zheng^1,2,3

2890 Accesses
16 Citations
1 Altmetric
Explore all metrics

Abstract

Tilletia horrida is a soil-borne, mononucleate basidiomycete fungus with a biotrophic lifestyle that causes rice kernel smut, a disease that is distributed throughout hybrid rice growing areas worldwide. Here we report on the high-quality genome sequence of T. horrida; it is composed of 23.2 Mb that encode 7,729 predicted genes and 6,973 genes supported by RNA-seq. The genome contains few repetitive elements that account for 8.45% of the total. Evolutionarily, T. horrida lies close to the Ustilago fungi, suggesting grass species as potential hosts, but co-linearity was not observed between T. horrida and the barley smut Ustilago hordei. Genes and functions relevant to pathogenicity were presumed. T. horrida possesses a smaller set of carbohydrate-active enzymes and secondary metabolites, which probably reflect the specific characteristics of its infection and biotrophic lifestyle. Genes that encode secreted proteins and enzymes of secondary metabolism, and genes that are represented in the pathogen-host interaction gene database genes, are highly expressed during early infection; this is consistent with their potential roles in pathogenicity. Furthermore, among the 131 candidate pathogen effectors identified according to their expression patterns and functionality, we validated two that trigger leaf cell death in Nicotiana benthamiana. In summary, we have revealed new molecular mechanisms involved in the evolution, biotrophy, and pathogenesis of T. horrida.

Comparative genomics reveals low levels of inter- and intraspecies diversity in the causal agents of dwarf and common bunt of wheat and hint at conspecificity of Tilletia caries and T. laevis

Article Open access 07 June 2022

Comparative genomics of plant pathogenic Diaporthe species and transcriptomics of Diaporthe caulivora during host infection reveal insights into pathogenic strategies of the genus

Article Open access 03 March 2022

Transcriptome sequencing of Mycosphaerella fijiensis during association with Musa acuminata reveals candidate pathogenicity genes

Article Open access 30 August 2016

Introduction

Rice is an economically important seed plant and the most important cereal food crop in the world; it provides approximately 20% of the world’s dietary energy supply^1,2. Rice kernel smut (RKS), caused by the soil-borne Basidiomycete fungus Tilletia horrida that infects rice floral organs, was first reported in Japan in 1896³. RKS was once categorized as a minor disease with sporadic occurrence in rice-growing areas; however, the increasing demand for rice worldwide keeps driving the extensive planting of high-yielding cultivars and hybrid varieties. In China, the cultivated area of hybrid rice has reached about 1.6 million acres. In order to increase production, the breeding of three line hybrid rice that contains male sterile, maintainer, and restorer lines is undertaken. However, the rising incidence of exerted stigma in rice male sterile lines selected as part of the drive towards higher-yielding hybrid lines, increases the impact of RKS, which affects both the yield and quality of hybrid seed by producing masses of dark powdery teliospores^4,5. This has led to an annual 40% to 60% prevalence of RKS in hybrid rice fields and to a 5–20% decrease in rice yield⁶. Disease incidence as high as 87% and 100% in hybrid rice fields has been reported in Pakistan and China, respectively⁷. RKS is now an increasing threat to rice cultivation in Asia, Oceania, Europe, America and Africa^8,9.

Smuts are multicellular fungi, which exist as dark, thick-walled teliospores that are widespread in soil and the seeds of host plants. The morphological characters of 80 smut genera (4, 200 species) have been described and all species are parasitic on higher plants¹⁰. They infect many economically important hosts including maize, barley, wheat, rice, sugarcane, and forage grasses. The smut fungi have originated from two phylogenetically separate lines; Tilletia has different origins to Ustilago and Sporisorium¹¹.

T. horrida belongs to the Tilletia genus of the basidiomycota Tilletiaceae family. It possesses one nucleus per cell (Fig. 1a) and initiates infection through rice floral organs and the immature kernels, producing powdery dark teliospore balls in the rice kernels during the late phase of infection. Teliospore balls are black and spherical, measuring 25_˜30 × 23_˜30 µm, and possess colorless indentations on their surface (Fig. 1b). The spores can survive for more than 1 year in the soil, and for more than 3 years in rice seed that has been infected at the flower stage⁵. On germination, the teliospores produce a promycelium (Fig. 1c) that display distal verticillated digitations (Fig. 1d). Microspores grow in these verticillated digitations and the appearance of these secondary microspores is linear or curved (Fig. 1e,f). The early infective stage of T. horrida, remains asymptomatic, and the organism grows systemically until masses of dark powdery spores appear on the grains (Fig. 1g,h). Besides infecting Oryza sativa, T. horrida is known to infect wild rice and certain weeds, and this may aid the spread of inocula to healthy rice plants at the flowering stage. Because T. horrida fungi are biotrophic pathogens, their growth on artificial media is slow. Studies on this species have predominantly focused on morphological characteristics and its evolutionary biology. To date, genomic structure and pathogenic mechanisms of T. horrida have not been studied.

Despite the high rice yield losses worldwide caused by T. horrida, there has been a reliance on chemical fungicides to control RKS¹². To data, no investigations have been carried out to identify cultivars that are completely immune to T. horrida. In order to exploit more effective resistant cultivars, a better understanding of the pathogenic mechanisms of T. horrida is needed. In the current study, we examined the possible molecular basis of host-pathogen interactions and the pathogenic mechanisms of T. horrida through the sequencing, assembly, and annotation of the genome. The up-regulation of candidate effector genes encoding secreted proteins suggests a list of candidate virulence factors that may play important roles in pathogenicity. A comparative genomics study with the four smut fungi related to T. horrida, namely Sporisorium reilianum, Sporisorium scitamineum, Ustilago maydis, and Ustilago hordei has also been conducted, which may provide useful insights for improving rice crop yields. In summary, this work has elucidated the interactions between rice and the fungal pathogen, T. horrida.

Results

Genome sequencing and assembly

The genome of the T. horrida strain JY-521 was sequenced using a PacBio RS II sequencing strategy and three single-molecule real-time (SMRT) sequence cells were obtained (Supplementary Table S1). The three SMRT cells were assembled into scaffolds using MinHasf Alignment Process (MHAP) assemble, resulting in longer sequences with a total assembly size of 23.2 Mbp; 84 contigs and 84 scaffolds (no gaps within them) were assembled. The N50 size of the scaffold and contigs was 538,348 bp. The average length of the scaffold and contigs was 276,373.48 bp, and the maximum length was 1,812,755 bp. The mitochondrial genome of T. horrida was assembled as a circular molecule of 98.96 kbp, and the GC content of the genome was 55.67% (Table 1, Supplementary Table S2). The genome assembly statistics of T. horrida and other smut fungal isolates, such as U. hordei, S. scitamineum, S. reilianum, and U. maydis, are shown in Table 1. The results revealed that the genome size of smut species was small, and the size of T. horrida JY-521 was greatest of the five smut fungi tested. In the genome of T. horrida, the average gene length and introns per gene were 2,085.6 bp and 4.14, respectively, these were the largest of the five fungi genomes (Table 1).

Table 1 Genome characteristics of five smut fungi.

Full size table

Genome annotation

A total of 7,729 protein-coding genes were annotated, with an average size per gene of 1,759.17 bp with 6,973 being supported by the RNA-seq data. Most of the coding sequences (CDSs) were between 500 and 2,000 bp in length (4,667) and only a few were larger than 5,000 bp (Supplementary Fig. S1). This coding DNA contained 13,596,636 bp of exon sequence lengths (Table 1). Of these 4,960 (64.17%), 2,995(38.75%), 5541(71.69%), 4,324 (55.95%) and 6,212(80.37%) genes were predicted to have homologies with known functions in the SwissProt, Gene Ontology (GO), Kyoto Encyclopaedia of Genes and Genomes (KEGG), EuKaryotic Orthologous Groups (KOG), and Non-Redundant Proteins (NR) databases, respectively (Supplementary Table S3). With regard to non-coding genes, we identified 155 transfer RNAs, and 41 ribosomal RNA fragments from the assembly (Table 1, Supplementary Table S4).

T. horrida repeat sequence content

Repeat DNA sequences comprise interspersed repetitive and tandem repeats that are an important part of a genome. Tandem repeats comprise microsatellite sequences and minisatellite sequences. Interspersed repetitive sequences, also known as transposable elements (TEs), comprise DNA transposon, long terminal repeat (LTR) reverse elements, and long interspersed repetitive elements. Transposable elements that play an important role in fungal pathogens are predominant in repeat sequences¹³. The T. horrida genome is comprised of 1,958,189 bp DNA transposons and retrotransposons that include 76 families. They accounted for 8.3% of the 23,215,372 bp genome sequences. TEs represented about 64.1% of the repetitive sequences and constituted the majority of these sequences. There were 86.4% LTR retrotransposons in TEs (Fig. 2a, Supplementary Table S5). Among the repetitive elements, gypsy elements comprised 835,814 bp, and accounted for 43.36% of the TEs and 3.6% of the total assembly (Supplementary Table S5). These were the most abundant type of TEs and we compared the repeat sequences of the five smut fungal species genomes; results showed that the ratio of repeat sequences in the T. horrida genome was highest of the five fungi tested (Table 1). This indicated that repetitive sequences may play a more important role in T. horrida during inoculation.

Evolution and comparative genomics

For comparative genomics, six additional RKS strains collected from different geological regions in China (Supplementary Table S6) were subjected to 50× sequencing. A comparison was made of the number of single nucleotide polymorphisms (SNPs) between the JY-521 genome and the six other strains (Supplementary Table S7). The smallest number of SNPs was 69,953 when JY-521 was compared with GZ-102 (collected from Guizhou Province) and the greatest number of SNPs was 124,367 when JY-521 was compared with HN-145 (collected from Hunan Province); these findings demonstrated a low level of sequence variation among the different T. horrida strains. The low nucleotide diversity in the genotypes showed a low level of intraspecific sequence variation, suggesting that variations between T. horrida populations were not obvious.

The phylogenetic tree of T. horrida and 13 other fungal species (11 Basidiomycota and 2 Ascomycota outgroups) was evaluated using a set of highly conserved single-copy genes. The analysis revealed that T. horrida was more closely related to the other four smut fungi than to other species, including the rice pathogen Rhizoctonia solani and the mushroom fungus Laccaris bicolor (Fig. 2c). Furthermore, the five smut fungi T. horrida, U. hordei, S. reilianum, S. scitamineum, and U. maydis had fewer genes than other fungi studied (Supplementary Table S8). The T. horrida genome consisted of 7,729 genes, and of these genes, 1,991 were unique; there were 5,983 families, and of these 1,089 were single-gene families. T. horrida possessed the greatest number of genes among the five smut fungi (Supplementary Table S8).

Synteny analysis between T. horrida and U. hordei found no obvious co-linearity (Supplementary Fig. S2a); however, U. hordei and U. maydis was predominantly co-linear (Supplementary Fig. S2b). This suggests that Tilletia belongs to a phylogenetically separate clade when compared with Ustilago and Sporisorium. A total of 4,410 gene families were shared among all five smut fungi, and there were 2,472 predicted genes in 1,591 gene families that appeared to be unique to T. horrida (Fig. 2b). Among these unique genes, genes of unknown function accounted for 59.66%, 138 genes belonged to unnamed or uncharacterized protein-encoding genes, and the expression of 62 genes was up-regulated during early infection (Supplementary Fig. S3). These unique genes may play important roles in pathogenicity that need to be explored. Gene families that had a large number of members included ATP-binding cassette (ABC) transporter, TPR-like protein, NAD (P)-binding protein, amino acid transporter and the glycoside hydrolase family.

Phylogenetic analysis also demonstrated a high degree of homology between T. horrida and U. hordei, U. maydis, S. scitamineum, S. reilianum, but this was displayed in different branches; T. horrida and U. hordei showed a closer relationship than S. scitamineum, S. reilianum, and U. maydis. The results provided an interpretation of the evolution of living smut fungi and the diversity of their hosts. The four related smut organisms could be used as reference species to map the T. horrida genome. Moreover, the fact that T. horrida, U. maydis, U. hordei, S. scitamineum and S. reilianum share most gene families in common supports the evolutionary relationship between these species (Fig. 2b).

Transcriptome analysis during infection

To detect the key disease-associated genes expressed during the entire infection process, we analyzed the transcriptomes of T. horrida at six time points. We found that 6,973 genes were expressed during the entire infection process. As compared with axenic cultures, there were 120, 62, 29, 64, and 32 genes significantly up-regulated, and 217, 49, 62, 25, and 23 genes were significantly down-regulated at 8, 12, 24, 48, and 72 h post inoculation, respectively (FDR < 0.05 and |log₂ Fold Change| > 1; Supplementary Fig. S4a,b). We analyzed gene expression relation at six stages of infection, and found that the gene expression pattern at 8 h was similar to 12 h post inoculation, while 48 h and 72 h were similar (Supplementary Fig. S5). It was clear that the pattern of gene expression was altered following infection and that the expression of pathogenicity-related genes was associated with the stage of infection.

The numbers of up-regulated genes within different GO categories and terms at different stages of infection were compared (Fig. 3). The three GO categories included “biological process”, “cellular component”, and “molecular function”. GO enrichment analyses revealed correlations between stage of infection and gene expression for a range of GO terms within these categories. The up-regulated genes for the following GO terms had high numbers during the host infection process: GO:0044699 single-organism process, GO:0008152 metabolic process, and GO:0009987 cellular process in biological processes, GO:0003824 catalytic activity, GO:0005488 binding in molecular components, GO:0016020 membrane, GO:0043226 organelle, GO:0005623 cell, and GO:0044464 cell part in cellular functions. Some uniquely enriched genes within the GO classes showed the same expression tendency. Although the direct relationship of these genes to pathogenesis was not confirmed, it was possible to use the transcriptome pattern information to demonstrate trends.

Genes involved in pathogenicity

To successfully degrade plant cell walls and infect a host plant, phytopathogenic fungi secrete carbohydrate-active enzymes (CAZymes) that include glycoside hydrolases (GH), glycosyl transferases (GT), polysaccharide lyases (PL), and cutinases¹⁴. We predicted 1,424 putative CAZymes genes in the T. horrida genome; this is similar to biotrophic pathogenic fungi, such as U.virens; but it is significantly lower than plant hemi-biotrophic fungi, such as M. grisea (Supplementary Table S9, Fig. 3). The putative CAZyme genes comprised 542 families, including 474 GH, 533 GT, 101 carbohydrate esterases (CE), 273 carbohydrate binding modules (CBM) and 43 PL (Supplementary Table S9). Of these, the GH and CBM families were reduced in comparison with M. grisea, particularly the proteins in such families as GH6, GH10, GH74, GH39 and GH43 that are involved in degrading the cellulose, hemi-cellulose, and pectin of plant cell walls (Supplementary Dataset 1)^15,16,17. Among the five smut fungi, the expression levels of CAZyme genes showed a similar trend. However, T. horrida had some conspicuously enriched PL4 (involved in pectin degradation) families (Fig. 4). We found that the number of CAZyme genes differentially expressed in T. horrida was similar to U. hordei; this may be because the rice host of T. horrida is closely related to the barley host of U. hordei. These analyses suggest that they had common polysaccharide degradation machinery.

A putative analysis was undertaken of the expression of genes encoding CAZymes over the six infection stages. There were 960 genes that showed specific transcript expression patterns at different infection stages. Finally, 652, 912, 542, 594, and 185 genes encoding CAZymes were upregulated at 8–72 h after infection (Supplementary Table S10). Genes encoding GH family members involved in the infection process reached maximum expression at 12 h. Genes for cellulose-degrading enzymes and hemicellulose-degrading enzymes reached maximum expression at 12 h, while the genes for pectin-degrading enzymes reached maximum expression at 12 h and 48 h, respectively (Supplementary Table S11). These results explain the degeneration of plant primary cell walls and the middle secondary walls during early infection. Genes of the CAZyme families were differentially expressed during the six stages, though not in a consistent fashion. GH16, GH5, GH43, GH28, GH36, CE8, and PL1 showed the highest expression levels, which suggests that the encoded enzymes were present at higher activity during infection and are associated with pathogenicity (Fig. 5 and Supplementary Table S11). Different gene expression patterns clearly occurred at different infection times, and it appears that the expression of genes encoding degradation-associated enzymes plays an important role during infection.

As a biotrophic pathogenic fungus, T. horrida is expected to possess some pathogen-host interaction (PHI) genes¹⁸. A total of 1,697 putative PHI genes were identified in the T. horrida genome. Transcript analysis was performed to identify differential expression patterns of these 1,697 genes during the six infection stages (0, 8, 12, 24, 48, and 72 h). These results suggest that 1,289 genes were expressed at all stages and mainly encoded cell wall degrading enzymes, proteins related to energy metabolism, and proteins related to membrane transport. Of them, 71 were significantly down-regulated and 64 significantly up-regulated genes after infection; the expression pattern of these genes is shown in Supplementary Fig. S6. The expression of smut_3036, which encodes a protein putatively associated with a toxin, reached a maximum at 12 h. The gene smut_4052 encodes carbohydrate esterase Family 5 proteins that are associated with cell wall degradation. The activity of this protein is probably an important determinant of the virulence of pathogenesis. The functions of these virulence associated genes could be clarified by gene disruption or complementation studies.

Plant pathogenic fungi produce some secondary metabolites that are related to pathogenicity, such as host-selective toxins¹⁹. In total, 77 putative secondary metabolism-related genes and seven gene clusters were identified in the T. horrida genome (see Supplementary Fig. S7); the seven gene clusters comprised three polyketide synthase (PKS) clusters, three nonribosomal peptide synthetase (NRPS) clusters, and one prenyltransferase hybrid cluster; the core gene of every cluster is shown in Supplementary Table S12. These genes are important in the biosynthesis of various secondary metabolites^20,21. Iron is an important element necessary for many essential processes in living organisms. T. horrida acquires iron from host rice cells by synthesizing iron-chelating siderophores. Three NRPS gene clusters may participate in siderophores production; they include gene encoding ferrichrome peptide synthetase (Smut_0071), 4-coumarate-CoA ligase (Smut_0795), and polyketide synthase (Smut_1014) and their expression was up-regulated at 24 h after infection (Supplementary Fig. S7). We identified the 12,639 bp core gene Smut_0071, which participates in the synthesis of ferrichrome siderophore peptide²². In the S. scitamineum genome, SmutADNA4_GLEAN_10002728, which encodes ferrichrome siderophore peptide, has 75% identity with Smut_0071.

Cytochrome P450s (CYPs) play an important role in pathogenesis and in the production of toxins^23,24. Transporters are not only involved in obtaining carbon sources and nitrogen sources from their host plants, but are also involved in toxin and effector secretion²⁵. In total, 19 CYP genes were identified in the T. horrida genome and their expression is shown in Supplementary Fig. S8. In addition, we also found 254 transporter genes in the T. horrida genome, among them comprise 40 ATP-binding cassette (ABC) superfamily transporters genes. Therefore, despite the presence of fewer metabolites, it could be secreted extracellularly.

The T. horrida secretome

Recent research has revealed that the secretion of proteins from pathogens is related to the progression of an infection, especially for biotrophic fungi that have intimate host-fungal interactions²⁶. A total of 597 potentially secreted proteins (7.72% of the proteome) of T. horrida were predicted (Supplementary Table S13)^27,28. T. horrida had a higher secretome than S. reilianum, but lower than U. hordei, U. maydis, and S. scitamineum (Supplementary Table S13). There were 366 genes encoding small (<400 amino acids) secreted proteins (Supplementary Fig. S9), and of these, 131 genes were upregulated at 8 h after infection (Supplementary Fig. S10); these were identified as potential plant effectors. In the maize pathogen U. maydis, it was observed that most of the genes in secreted protein-coding gene clusters were induced simultaneously in infected tissue²⁹. For U. maydis, there were 120 predicted effector genes organized into clusters, suggesting that local duplications might be involved in the expansion of effectors in T. horrida (Fig. 6). Repetitive elements are known to have been crucial factors throughout genome evolution and in the diversification of functional genes. However, a relationship between candidate effectors and TE-rich regions (low GC content) was not demonstrated in the current study (Fig. 7). This suggests that TE-driven evolution might have only a small influence on the interactions of T. horrida and rice.

T. horrida candidate effectors and their validation

Some plant pathogen effectors could impact host cells through their various functions; furthermore, the receptors of pathogen effectors operate as dominant disease susceptibility genes³⁰. The effector proteins from rice sheath blight pathogen and rice blast fungus have previously been studied^31,32; however, no effector of the RKS pathogen has been reported prior to the current research.

To identify novel potential effectors, we selected 131 candidate genes from the 366 genes that encoded secreted proteins and demonstrated upregulated expression during early infection (Supplementary Fig. S10). The potential effector gene expression vectors were transformed into Nicotiana benthamiana using the Agrobacterium-mediated transformation method. Interestingly, genes for two potential secreted effectors, smut_2965 (ribonuclease domain) and smut_5844 caused cell death of N. benthamiana leaf phenotypes after inoculation at 4 d (Supplementary Fig. S11, Fig. 8a,b). The highest level of infection was expressed at 8 h after infection. Furthermore, it is known that protein effector genes from pathogens undergo rapid duplication, diversification, deletions, and mutations³³. We suggest that some proteins active in host cells are delivered by the fungus to trigger a defense response; however, this requires further investigation.

Additionally, in order to determine other potential effectors, 78 significantly up-regulated expressed secreted protein genes (FDR < 0.05 and |log2 Fold Change| > 1) in T. horrida were selected and classified into 16 clustered profiles based on trends observed in gene expression using Short Time-series Expression Miner software (STEM) (Supplementary Fig. S12). The clustered profiles with p ≤ 0.05 were considered as statistically significant. In Supplementary Fig. S12, profiles 18 and 19 show two gene clusters that have the same expression trend with two verified effectors, respectively. In them, there are 14 secreted protein genes as potential effectors (Supplementary Table S14) and their expression patterns recorded at different infection times are shown in Fig. 8c. Interestingly, there are three genes, namely smut_1035, smut_1149, and smut_4856 organized into clusters with smut_2965 in the T. horrida genome (Fig. 6). We speculated that those genes were involved in the regulation of pathogen-host interaction. We also analyzed the homologous genes of verified effectors in five smut pathogens; results showed that there were many effector genes homologous with smut_2965 (Supplementary Table S15), but not with smut_5844. We suspect that smut_5844 is a gene for a new effector protein that only affects rice. Of the two T. horrida effectors, we observed fewer SNP among the six RKS strains (Supplementary Dataset 2), suggesting a low intraspecific genetic diversity among the T. horrida population, even with regard to the effectors.

Novel virulence-associated factors in the transduction signal pathway

In order to infect the host plant, plant pathogenic fungi must make appropriate responses to a variety of host-plant surface environmental receptors. These receptors function via the mitogen-activated protein kinase (MAPK) pathway that communicates a signal from a receptor on the surface of the cell to the nucleus in the cell. G protein-coupled receptors (GPCRs) and G-proteins are important components of the MAPK pathway, which has seven transmembrane domain receptors, constitutes a large family of cell surface receptors and is responsible for transducing extracellular signals into intracellular responses that involve complex intracellular-signaling networks^34,35. GPCRs, which are extremely diverse in sequence and function, are found only in eukaryotes, including yeast, choanoflagellates and animals³⁶.

Here, we identified GPCR-like proteins in 14 fungi. T. horrida had 67 GPCRs, which were grouped into 22 families (Supplementary Dataset 3). Therefore, the number of GPCRs in T. horrida was smilar to U. hordei, which contained 64 GPCRs. GPCRs with amino-terminal extracellular cysteine-rich EGF-like domains (CFEM domain) are required for pathogenicity. In M. grisea, GPCRs with CFEM domain have been reported³⁷. We found that no genes encoding GPCRs with CFEM domains were present in the genome of T. horrida. GPCRs suggest that T. horrida is a biotrophic fungi, which has adapted to a narrow host infection, and may acquire the ability to respond to a variety of environmental cues using different GPCRs that activate conserved intracellular signaling pathways and give new triggers to response systems. In particular, T. horrida contains more GPR124, thyrotropin-releasing hormone and secretagogue genes than other fungi. The identification and characterization of GPCRs will provide insights into the means by which T. horrida communicates with its environment and senses the presence of intracellular signaling.

Moreover, we predicted 24 homologues in the MAPK pathway. Amongst them, Ste3 pheromone receptors (smut_6864 and smut_6863) were predicted, and components of other MAPK pathways were also detected, including Ste20 (smut_7706, smut_7489), Ste11 (smut_5984), Ste7 (smut_4602) and Ste12 (smut_5905). Of these Ste12 controls fungal virulence downstream of the pathogenic MAPK cascade as a master regulator of invasive growth in plant pathogenic fungi³⁸. Determining the function of proteins such as Ste12 in the signal transduction pathway during host infection could elucidate the mechanisms by which fungal pathogens cause disease in plants.

Discussion

T. horrida has developed into a major pathogen that restricts hybrid rice seed production. Here, we sequenced and assembled the draft genome of the highly virulent T. horrida strain JY-521. T. horrida has a genome size of 23.2 Mb, which is larger than that of other genus smut fungal genomes that vary from 18.38–21.15 Mb in size (Table 1). The genome contained 8.3% repeat sequences, most of which were TEs. However, compared with the genome size of the sequenced but not annotated T. horrida strain QB-1, using the NGS method, T. horrida JY-521 had a larger genome size⁶. Importantly, the T. horrida JY-521 genome was assembled used SMRT, which generates 1.9 Mb repetitive sequences that have not been identified in the genome of T. horrida QB-1 using other sequencing techniques⁶.

The sequence of T. horrida will provide useful information on host adaptation and evolutionary relationships in the smut species. A comparison of T. horrida with several smut fungi and some other important plant pathogens showed that smut fungi of the Tilletia genus and Sporisorium, and Ustilago genera were from different evolutionary branches, this suggests that species diversity of smut fungi and mechanisms of pathogen-host interaction differ in fungi from different smut. Benevenuto et al.³⁹ compared the genomics of ten smut fungi isolated from maize, barley, sugarcane, wheat, oats, Zizania latifolia (Manchurian rice), Echinochloa colona (a wild grass), Persicaria sp. (a wild dicot plant), oats, and wheat; they showed that host domestication did not play a dominant role in shaping the evolution of smuts; however, the diversification and gain or loss of effector genes are probably the most important determinants of host specificity.

We compared the genomes of 14 fungi and demonstrated that biotrophic smut species contained many fewer genes than necrotrophic Rhizoctonia solani IA, the pathogen of rice sheath blight³¹. We also compared the number of GH families in biotrophic and hemi-biotrophic fungi. Results indicated that GH6, GH10, GH74, GH39, and GH43, that encode the enzymes of cellulose, hemi-cellulose, and pectin in T. horrida, were smaller than in M. grisea. These results showed that biotrophic T. horrida may minimize the degradation of host cell walls using carbohydrate-active enzymes, whose products are often recognized as endogenous signals to induce plant immunity^29,40. These results also reflect the infection pattern of T. horrida that infects rice stamen filaments where cellulose and pectin are deficient⁴¹.

The effectors could interference with host immune responses to enhance virulence. For example, Pseudomonas syringae delivers over 30 effectors during infection. In order to clarify the function of fungal effectors, N. benthamiana has been extensively studied in attempts to elucidate the function of fungal effectors using agroinfiltration^42,43. Several effectors that induce nonhost cell death were identified in M. oryzae and used in transient expression assays in N. benthamiana using agroinfiltration⁴⁴. Similar studies have been reported in P. sojae⁴⁵. We experimentally demonstrated that two selected putative effectors triggered cell death phenotypes in N. benthamiana (Fig. 8a) and they were regulated during T. horrida infection of rice panicles (Fig. 8c). The results showed that the two identified effectors may play important roles during the interaction of T. horrida and rice. Candidate effector proteins are highly diverse between smut species despite their close evolutionary relationship (Supplementary Dataset 2). These analyses further proved that adaptation to different hosts in smut species may be the result of divergent effector repertoires.

One set of characteristics often associated with effectors is a small size and high content of cysteine residues⁴⁶. The secreted protein genes smut_2965 and smut_5844 were encoded 126 aa and 207 aa, respectively. In addition, plant pathogen effector genes can induce expression during infection⁴⁶. This transcription data and quantitative real time reverse transcription-polymerase chain reaction (qRT-PCR) indicated that these two genes were induced during the early infection stage (Supplementary Fig. S13). Furthermore, through transient expression, we demonstrated that smut_2965 and smut_5844 could induce necrosis phenotypes in N. benthamiana. These results showed that smut_2965 and smut_5844 were effector genes and that they play an important role during T. horrida-rice interaction.

Over all, we reported a complete genome sequence of T. horrida using the SMRT sequencing method, which helped in the identification of repetitive elements. From genome assembly and annotation, we predicted that specific CAZymes, PHI genes, secondary metabolites, GPCR genes, and effectors can successfully help T. horrida to adapt to rice. Our study has also laid the groundwork for future discoveries on this important rice disease.

Materials and Methods

Strain isolates, culture conditions, and genomic DNA and RNA isolation

The RKS samples used for sequencing were collected from different heavily infected rice cultivars grown in different provinces of China. The T. horrida strains JY-521, CN-079, JS-058, GZ-102, SN-92, XJ-121, and HN-145 were separated using the spore suspension method⁴⁷. All seven strains are haploid and were identified based on their mycelial morphology, using nucleus fluorescent staining and analysis of rDNA-ITS (internal transcribed spacer) sequences. The T. horrida strains were transferred into potato sugar agar (PSA) medium and incubated in the dark, with agitation at 200 rpm, at 30 °C for 5 d. Total DNA from fungal hyphae was extracted using the cetyltrimethylammonium bromide (CTAB) method⁴⁸. The ITS rDNA genes were amplified using ITS primers (ITS-F:5′-TCCGTAGGTGAACCTGCGG-3′, ITS-R: 5′-TCCTCCGCTTATTGATATGC-3′). The sequencing of PCR products was undertaken by Sangon Biotech (Shanghai, China). Sequences were Blast-searched against NCBI databases. The nucleus fluorescent staining of T. horrida was performed according to the method of Hamada et al.⁴⁹, and mycelial morphology was used as the standard for the detection and identification of T. horrida Tak⁵⁰. The DNA of strain JY-521 was genome sequenced at Novogene Bioinformatics Technology (Beijing, China).

The strain JY-521 was used to infect rice cultivar 9311 A (which is highly susceptible to RKS) to create the treatment group. Young panicles of field grown rice plants, at the booting stage 3–5 d before heading, were collected during the late afternoon. The young panicles were disinfected with 75% alcohol for 2 min and washed twice with sterile water, and then air-dried for 20 min. Part of the hull of the panicles was cut off to allow the inoculation of individual colonies of T. horrida on PSA. Mycelium of T. horrida from six time points post-inoculation (0, 8, 12, 24, 48, and 72 h) were collected, immediately frozen in liquid nitrogen, and stored at −80 °C. Total RNA of T. horrida was isolated using the Omega Fungal RNA kit method. Dried RNA samples were dissolved in DEPC water. RNA quality was assessed on 1.0% denaturing agarose gels. Total RNA of T. horrida following infection of rice for 0, 8, 12, 24, 48, and 72 h was used for RNA-seq analysis.

Genome sequencing and assembly

The genome of T. horrida strain JY-521 was sequenced using the Single Molecule Real-Time (SMRT) method at Novogene Bioinformatics Technology (Beijing, China). From the DNA, 20 kb insert PacBio RS II DNA sequencing libraries were constructed and three SMRT cells of raw data were generated (Supplementary Table S1). The output was 409,787 reads with an average length of 8,936 bp, resulting in 3,662,098,386 bp. The reads were corrected using the MinHash Alignment Process (MHAP)⁵¹, and the corrected reads were assembled using SOAP denovo^52,53 and Celera Assembler⁵⁴. DNA libraries with 300 bp inserts were constructed for the six other T. horrida strains. These DNA libraries were 150 bp paired-end sequenced using the Illumina HiSeq. 2000 at the Beijing Genomics Institute (BGI) in Shenzhen, China. Data from the Illumina libraries were first trimmed by removing low quality sequences and adapter sequences.

Analysis of repeats

T. horrida repeat sequences were predicted by de novo and homology-based methods. De novo transposon libraries were constructed with the de novo software Piler v1.0⁵⁵ and RepeatModeler (http://repeatmasker.org/RepeatModeler/, version 1.0.7). Transposable elements (TEs) were analyzed using libraries of the de novo method constructed using RepeatMasker (http://repeatmasker.org, version open-4.0.5). Tandem repeats sequences were predicted by Tandem Repeats Finder (TRF)⁵⁶. All the parameters were set as default.

Gene prediction and annotation

To obtain accurate candidate genes, we used three ab initio predictors: SNAP, GeneMark (Version 4.30), and AUGUSTUS to predict protein-coding genes in T. horrida^57,58,59. The translation start sites were predicted using NetStart⁶⁰. The six sets of RNA-seq raw data were aligned to the T. horrida JY-521 genome via TopHat v1.1.4 (ref.⁵⁰), and exon junctions were predicted by aligning with the six RNA-seq libraries. Results of gene prediction were integrated using EuGene v3.6 (ref.¹¹). The function of protein-encoding genes was annotated by BLASTp searches in the SwissProt (2015-07-24), KOG(2015-07-24), KEGG (2015-09-26), and NR (2015-07-24) databases, at the threshold of e-value ≤ 1e-5. GO annotation was accomplished by Blast2GO Pipeline (Version 2.3.5) with NR annotated results and the GO (2015-04-07) database.

SNP identification

The high quality paired-ends read of the other six T. horrida strains were mapped to the assembled genome of T. horrida strain JY-521 using BWA software (version 0.7.12) with the command “mem –k 32 -M”, and BAM alignment files were generated using SAM tools software (version 0.1.19). The SNPs were identified using GATK software (version 3.4-46) with the Unified Genotyper model and were filtered using the Variant Filtration model and options -Window 4, -filter “QD < 4.0 || FS > 60.0 || MQ < 40.0 “, -G_filter “GQ < 20”. The filtered SNPs were annotated using ANNOVAR software (version 2014-07-14).

Comparative genome analysis

The CDSs of gene proteins and CDSs sequences of from the 14 fungal genomes were collected, and compared using BlastP 2.6.0+ with the evalue ≤ 1e-7⁶¹. We removed short, spurious, and nonhomologous hits by setting a bitscore/alignment length filtering threshold of 0.4 and a minimum protein length of 30. Proteins passing this filter were clustered into families using the Markov clustering algorithm implemented by OrthoMCL software v1.4^62,63 with options “–mode 3” -I as 2.0, and 806 single-copy gene families were obtained. The protein sequence of each single-copy family was aligned using MUSCLE v3.8.31, then, in turn, the protein alignment sequence to CDS alignment sequence and individual CDS alignment were concatenated into a string of nucleotide acids for each fugal genome. Regions that contained gaps or were highly divergent were removed from the data set using GBLOCKS software v0.91b⁶⁴ with default parameters. Finally, the phylogenetic tree was built with MEGA v7.0.26⁶⁵ under a neighbor-joining model and 1000 bootstrap replicates.

Transcriptome expression

Total RNA was isolated from the T. horrida that infected rice panicles after 0, 8, 12, 24, 48, and 72 h using an RNA isolation kit according to the manufacturer’s instructions (Aidlab Biotechnologies, Beijing). These RNA were used for cDNA libraries constructed, and each library sequence was generated using a library with a read insert of 280 bp using IlluminaHiseq™ 2500 technology. We used mRNA-Seq for expression analysis. The RNA expression analysis was based on the predicted genes of T. horrida. A comparison map of mRNA reads in relation to the genome was generated by Tophat v2.0.14⁶⁶, and the number of expected fragments of 1 kb of transcript per million fragments sequenced (FPKM) was calculated using Cufflinks⁶⁷. The different expression genes were identified using EdgeR with FDR < 0.05 and |log2FC| > = 1^68,69.

Gene relation to pathogenicity and virulence-associated signaling pathway

For genes encoding proteins of carbohydrate metabolism, we used the Carbohydrate Active Enzymes (CAZy) database with an e-value of less than 1 × e⁻⁵. (2015-10-20) (http://www.cazy.org/). Secondary metabolism genes were predicted by SMURF (http://jcvi.org/smurf/precomputed.php) and NRPS predictor (version 2). Cytochrome P450 genes were identified in the P450 database with an e-value of ≤ 1 × e⁻⁵ (http://drnelson.uthsc.edu/cytochromeP450.html). The PHI-base database (http://www.phibase.org/) included experimentally verified disease-related genes of fungal and bacterial pathogens. So, we searched the T. horrida genome using the pathogen-host interaction database with an e-value of ≤ 1 × e⁻⁵, and pathogen-host interaction genes were identified. ABC transporters in the UniProt database (http://www.uniprot.org/) were identified using BLAST with an e-value of 8 × e⁻¹⁴ and the TCDB with an e-value ≤ 1 × e⁻⁵.

Important elements of the G-protein pathway were identified using BLAST with a threshold e-value of ≤ 1 × e⁻⁵ (http://www.genome.jp/kegg/pathway/sce/sce04011.html). The identification and verification of seven transmembrane (7-TM) helices of GPCR-like proteins was performed using TMPRED (http://www.ch.embnet.org/software/TMPRED_form.html), Phobius, and TMHMM 2.0c (http://phobius.sbc.su.se/, http://www.cbs.dtu.dk/services/TMHMM/). The GPCR-like proteins were classified using the following website: http://fse2013.spms.ntu.edu.sg/~chenxin/PCA_GPCR/.

Secreted proteins

There are several prediction algorithms that may be used in the analysis of secreted proteins of T. horrida. The signal peptide site of potential secreted proteins was predicted by SignalP4.0 (http://www.cbs.dtu.dk/services/SignalP-4.0/) and Phobius (http://phobius.binf.ku.dk/). Transmembrane helices in the proteins were predicted using TMHMM 2.0 (http://www.cbs.dtu.dk/services/TMHMM-2.0/). Proteins located in the mitochondria, as determined by TargetP (http://www.cbs.dtu.dk/services/TargetP/), were removed. The proteins that contained signal peptide cleavage sites and no transmembrane helices were potential secreted proteins.

Candidate effectors and their validation

Putative effectors in the T. horrida secretomes were predicted based on the protein size (≤400 amino-acid residues) and expression during the early infection stage of the rice-fungus interaction^70,71. The functions of putative effector proteins were identified by expressed used transient expression assay in N. benthamiana by agroinfiltration. All DNA manipulations and other procedures, including agarose gel electrophoresis, were performed according to standard protocols.

Expression vector construction and preparation

A total of 131 expression plasmids were constructed with the PMDC32 Expression Vector (Libo Shan, Texas A&M University). RNA was prepared from T. horrida mycelia using a Fungal RNA kit (Omega), and cDNA was synthesized using a Transcriptor First strand cDNA synthesis kit (Roche). All PCR products used for cloning were generated using Trans Start FastPfu Fly DNA Polymerase (TransGen Biotech, Beijing, China). All of the restriction enzymes and ClonExpress enzymes were used following the manufacturer’s instructions (Vazyme Biotech, Nanjing, China). Primers for these assays were designed based on our predicted gene sequences and included a BamHI site and a StuI site used CE Design v1.03. The obtained cDNA of target genes were gel-purified with a gel purification kit (Omega) and cloned into the PMDC32 Expression vector.

Transient protein expression in N. benthamiana

For N. benthamiana leaf transformations, PMDC32 expression vector constructs were transformed into Agrobacterium tumefaciens strain GV3101. Bacterial strains were grown in YEP liquid medium containing 50 mg/mL rifampicin, and 50 mg/mL kanamycin at 28 °C for 16 h. Bacteria were harvested by centrifugation, resuspended in infiltration medium [10 mM MES (pH 5.6), 10 mM MgCl₂, and 150 μM acetosyringone] to an OD 600 at 0.5, and incubated in the dark for 3 h at room temperature before leaf infiltration. For each independent infiltration experiment, each construct was infiltrated on 20 leaves from different plants. The infiltrated plants were incubated in growth chambers under controlled conditions for all following assays. For documentation of cell death, leaves were photographed 4–7 d after infiltration.

Plant growth conditions and infection assay conditions

N. benthamiana plants were housed in a growth chamber under light for 16 h at 23 °C and in the dark for 8 h at 23 °C. Spore suspension was inoculated onto plants at the 4-leaf stage using a syringe. The vector containing the BAX gene inoculant was used as the positive control, and a vector containing the green fluorescent protein (GFP) gene inoculation was used as the negative control. Similar results were obtained in five independent experiments. The leaf cell death phenotypes were observed 4d post infiltration.

qRT-PCR

To further verify the expression trend of key genes, qRT-PCR was performed with a Bio-Rad CFX96 Real-Time PCR System (Foster City, CA, USA), according to the manufacturer’s instructions. The ubiquitin (UBQ) gene was used as an internal control for data normalization. The expression levels of genes were calculated using the 2^−∆∆Ct algorithm. Primers used for qRT-PCR are listed in Supplementary Table S16.

Data Availability Statement

The authors declare that all data of this study are available from the corresponding author upon reasonable request.

References

Sharif, M. K., Butt, M. S., Anjum, F. M. & Khan, S. H. Rice bran: a novel functional ingredient. Crit Rev Food Sci Nutr. 54, 807–816 (2014).
Article CAS Google Scholar
Wu, J. G., Shi, C. & Zhang, X. Estimating the amino acid composition in milled rice by near-infrared reflectance spectroscopy. Field Crop Research 75, 1–7 (2002).
Article Google Scholar
Takahashi, Y. On Ustilago virens Cooke and a new species of Tilletia parasitic on rice plant. Tokyo Bot Mag 10, 16–20 (1896).
Article Google Scholar
Chen, Y. et al. Simple and rapid detection of Tilletia horrida causing rice kernel smut in rice seeds. Scientific Reports 6, 33258 (2016).
Article ADS CAS Google Scholar
Webster, R. K. & Gunnell, P. S. Compendium of Rice Diseases. Mycologia 84, 953 (1992).
Google Scholar
Wang, N. et al. Draft genome sequence of the rice kernel smut Tilletia horrida Strain QB-1. Genome Announc 3, e00621–15 (2015).
PubMed PubMed Central Google Scholar
Biswas, A. Kernel smut disease of rice: current status and future challenges. Environment and Ecology 21, 336–351 (2003).
Google Scholar
Carris, L. M., Castlebury, L. A. & Goates, B. J. Nonsystemic Bunt Fungi - Tilletia indica and T. horrida: A Review of History, Systematics, and Biology. Annual Review of Phytopathology 44, 113–133 (2006).
Article CAS Google Scholar
Brooks, S. A., Anders, M. M. & Yeater, K. M. Effect of Cultural Management Practices on the Severity of False Smut and Kernel Smut of Rice. Plant Disease 93, 1202–1208 (2009).
Article Google Scholar
Rogerson, C. T. Illustrated genera of smut fungi. Brittonia 40, 107 (1988).
Article Google Scholar
Roux, C., Almaraz, T. & Durrieu, G. Phylogeny of some smuts fungi based on ITS [International transcribed spacer] sequence analysis. Comptes rendus de I Académie des Sciences. Series 3, Sciences de la Vie 321, 603–609 (1998).
CAS Google Scholar
Tsuda, M., Sasahara, M., Ohara, T. & Kato, S. Optimal application timing of simeconazole granules for control of rice kernel smut and false smut. Journal of General Plant Pathology 72, 301–304 (2006).
Article CAS Google Scholar
Laurie, J. D. et al. Genome comparison of barley and maize smut fungi reveals targeted loss of RNA silencing components and species-specific presence of transposable elements. Plant Cell 24, 1733–1745 (2012).
Article CAS Google Scholar
Cantarel, B. L. et al. The Carbohydrate-Active EnZymes database (CAZy): an expert resource for glycogenomics. Nucleic Acids Research 2009 37, D233–D238 (2009).
Article CAS Google Scholar
Zhao, Z. T., Liu, H. Q., Wang, C. F. & Xu, J. R. Comparative analysis of fungal genomes reveals different plant cell wall degrading capacity in fungi. BMC Genomics 14, 274 (2013).
Article CAS Google Scholar
Couturier, M. et al. Post-genomic analyses of fungal lignocellulosic biomass degradation reveal the unexpected potential of the plant pathogen Ustilago maydis. BMC Genomics 13, 57 (2012).
Article CAS Google Scholar
Martin, F. et al. Perigord black truffle genome uncovers evolutionary origins and mechanisms of symbiosis. Nature 464, 1033–1038 (2010).
Article ADS CAS Google Scholar
Winnenburg, R. et al. PHI-base update: additions to the pathogen–host interaction database. Nucleic Acids Research 36, D572–D576 (2008).
Article CAS Google Scholar
Keller, N. P., Turner, G. & Bennett, J. W. Fungal secondary metabolism-from biochemistry to genomics. Nature Reviews Microbiology 3, 937–947 (2005).
Article CAS Google Scholar
Khaldi, N. et al. SMURF: Genomic mapping of fungal secondary metabolite clusters. Fungal Genetics and Biology 47, 736–741 (2010).
Article CAS Google Scholar
Rausch, C., Weber, T., Kohlbacher, O., Wohlleben, W. & Huson, D. H. Specificity prediction of adenylation domains in nonribosomal peptide synthetases (NRPS) using transductive support vector machines (TSVMs). Nucleic Acids Research 33, 5799–5808 (2005).
Article CAS Google Scholar
Winterberg, B., Uhlmann, S. & Linne, U. Elucidation of the complete ferrichrome A biosynthetic pathway in Ustilago maydis. Molecular Microbiology 75, 1260–1271 (2010).
Article CAS Google Scholar
Nelson, D. R. Cytochrome P450 and the individuality of species. Archives Biochemistry Biophysics 369, 1–10 (1999).
Article CAS Google Scholar
Que, Y. et al. Genome sequencing of Sporisorium scitamineum provides insights into the pathogenic mechanisms of sugarcane smut. BMC Genomics 15, 996 (2014).
Article Google Scholar
Mueller, O. et al. The secretome of the maize pathogen Ustilago maydis. Fungal Genetics. Biology 45, S63–S70 (2008).
CAS Google Scholar
Emanuelsson, O., Brunak, S., von Heijne, G. & Nielsen, H. Locating proteins in the cell using Target P, Signal P and related tools. Nature. Protocols 2, 953–971 (2007).
Article CAS Google Scholar
Lum, G. & Min, X.J. FunSecKB: the Fungal Secretome KnowledgeBase. Database bar001 (2011).
Ma, L. J. et al. Comparative genomics reveals mobile pathogenicity chromosomes in Fusarium. Nature 464, 367–373 (2010).
Article ADS CAS Google Scholar
Kamper, J. et al. Insights from the genome of the biotrophic fungal plant pathogen Ustilago maydis. Nature 444, 97–101 (2006).
Article ADS Google Scholar
Faris, J. D., Zhang, Z. C., Lu, H. J., Lu, S. W. & Reddy, L. A unique wheat disease resistance-like gene governs effector-triggered susceptibility to necrotrophic pathogens. Proceedings of the National Academy of Sciences of the United States of America 107, 13544–13549 (2010).
Article ADS CAS Google Scholar
Zheng, A. P. et al. The evolution and pathogenic mechanisms of the rice sheath blight pathogen. Nature Communications 4, 1424 (2013).
Article Google Scholar
Yoshida, K. et al. Association genetics reveals three novel avirulence genes from the rice blast fungal pathogen Magnaporthe oryzae. The Plant Cell 21, 1573–1591 (2009).
Article CAS Google Scholar
Oliver, R. P. & Solomon, P. S. New developments in pathogenicity and virulence of necrotrophs. Current Opinion in Plant Biology 13, 415–419 (2010).
Article CAS Google Scholar
Soanes, D. M., Richards, T. A. & Talbot, N. J. Insights from sequencing fungal and oomycete genomes: what can we learn about plant disease and the evolution of pathogenicity? Plant Cell 19, 3318–3326 (2007).
Article CAS Google Scholar
Dean, R. A. et al. The genome sequence of the rice blast fungus Magnaporthe grisea. Nature 434, 980–986 (2005).
Article ADS CAS Google Scholar
Cuomo, C. A. et al. The Fusarium graminearum genome reveals a link between localized polymorphism and pathogen specialization. Science 317, 1400–1402 (2007).
Article ADS CAS Google Scholar
Kulkarni, R. D., Thon, M. R., Pan, H. Q. & Dean, R. A. Novel G-protein-coupled receptor-like proteins in the plant pathogenic fungus Magnaporthe grisea. Genome Biology 6, R24 (2005).
Article Google Scholar
Rispail, N. et al. Comparative genomics of MAP kinase and calcium-calcineurin signalling components in plant and human pathogenic fungi. Fungal Genetics. Biology 46, 287–298 (2009).
CAS Google Scholar
Benevenuto, J., Teixeira-Silva, N. S., Kuramae, E. E., Croll, D. & Monteiro-Vitorello, C. B. Comparative Genomics of Smut Pathogens: Insights From Orphans and Positively Selected Genes Into Host Specialization. Front. Microbiol 9, 660 (2018).
Article Google Scholar
Kemen, E. et al. Gene gain and loss during evolution of obligate parasitism in the white rust pathogen of Arabidopsis thaliana. PLoS Biol. 9, e1001094 (2011).
Article CAS Google Scholar
Tang, Y. X. et al. Elucidation of the infection process of Ustilaginoidea virens (teleomorph: Villosiclava virens) in rice spikelets. Plant Pathol. 62, 1–8 (2013).
Article Google Scholar
Petre, B. et al. Candidate effector proteins of the rust pathogen Melampsora larici-populina target diverse plant cell compartments. Molecular Plant-Microbe Interactions 28, 689–700 (2015).
Article CAS Google Scholar
Win, J., Kamoun, S. & Jones, A. M. E. Purification of effector-target protein complexes via transient expression in Nicotiana benthamiana. Plant Immunity: Methods and Protocols 712, 181–194 (2011).
Article CAS Google Scholar
Chen, S. et al. Identification and characterization of in planta-expressed secreted effector proteins from Magnaporthe oryzae that induce cell death in rice. Mol. Plant-Microbe Interact. 26, 191–202 (2013).
Article CAS Google Scholar
Wang, Q. et al. Transcriptional programming and functional interactions within the Phytophthora sojae RXLR effector repertoire. Plant Cell 23, 2064–2086 (2011).
Article CAS Google Scholar
Stergiopoulos, I. & de Wit, P. J. Fungal effector proteins. Annu.Rev. Phytopathol 47, 233–263 (2009).
Article CAS Google Scholar
Chen, S. J. et al. Factors influencing teliospore germination of Neovossia horrida and screening of sporulation medium of N. horrida. Acta Agriculturae Zhejianggensis 23, 572–576 (2011).
Google Scholar
Rogers, S. O. & Bendich, A. J. Extraction of total cellular DNA from plants, algae and fungi. Plant Molecular Biology Manual D1, 1–8 (1994).
Hamada, S. & Fujita, S. DAPI staining improved for quantitative cytofluorometry. Histochemistry 79, 219–226 (1983).
Article CAS Google Scholar
Wu, P. S., Luo, J. F. & Du, H. Z. Detection and identification of Tilletia horrida Tak. PRC National Standard (2011).
Berlin, K., Koren, S., Chin, C. S., Drake, J. M. & Phillippy, A. M. Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat Biotechnology 33, 623–630 (2015).
Article CAS Google Scholar
Li, R. Q., Li, Y. H., Kristiansen, K. & Wang, J. SOAP: short oligonucleotide alignment program. Bioinformatics 24, 713–714 (2008).
Article CAS Google Scholar
Li, R. et al. De novo assembly of human genomes with massively parallel short read sequencing. Genome Research 20, 265–272 (2009).
Article Google Scholar
Denisov, G. et al. Consensus generation and variant detection by Celera Assembler. Bioinformatics 24, 1035–1040 (2008).
Article CAS Google Scholar
Edgar, R. C. & Myers, E. W. PILER: identification and classification of genomic repeats. Bioinformatics 21, (i152–i158 (2005).
Google Scholar
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res 27, 573–80 (1999).
Article CAS Google Scholar
Korf, I. Gene finding in novel genomes. BMC Bioinformatics 5, 59 (2004).
Article Google Scholar
Stanke, M., Steinkamp, R., Waack, S. & Morgenstern, B. AUGUSTUS: a web server for gene finding in eukaryotes. Nucleic Acids Research 32, W309–W312 (2004).
Article CAS Google Scholar
Ter-Hovhannisyan, V., Lomsadze, A., Chernoff, Y. O. & Borodovsky, M. Gene prediction in novel fungal genomes using an ab initio algorithm with unsupervised training. Genome Research 18, 1979–1990 (2008).
Article CAS Google Scholar
Pedersen, A. G. & Nielsen, H. Neural network prediction of translation initiation sites in eukaryotes: perspectives for EST and genome analysis. Proc Int Conf Intell Syst Mol Biol 5, 226–233 (1997).
CAS PubMed Google Scholar
Altschul, S. F. et al. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Research 25, 3389–3402 (1997).
Article CAS Google Scholar
Stijn van, D. Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht (2000).
Enright, A. J., Van, D. S. & Ouzounis, C. A. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Research 30, 1575–1584 (2002).
Article CAS Google Scholar
Talavera, G. & Castresana, J. Improvement of phylogenies after removing divergent and ambiguously aligned blocks from protein sequence alignments. Systematic Biology 56, 564–577 (2007).
Article CAS Google Scholar
Tamura, K. et al. MEGA5: Molecular Evolutionary Genetics Analysis using Maximum Likelihood, Evolutionary Distance, and Maximum Parsimony Methods. Molecular Biology and Evolution 28, 2731–2739 (2011).
Article CAS Google Scholar
Trapnell, C., Pachter, L. & Salzberg, S. L. TopHat: discovering splice junctions with RNA-Seq. Bioinformatics 25, 1105–1111 (2009).
Article CAS Google Scholar
Trapnell, C. et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nature Biotechnology 28, 511–515 (2010).
Article CAS Google Scholar
Eisen, M. B., Spellman, P. T., Brown, P. O. & Botstein, D. Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences of United States of America 95, 14863–14868 (1998).
Article ADS CAS Google Scholar
Perou, C. M. et al. Distinctive gene expression patterns in human mammary epithelial cells and breast cancers. Proceedings of the National Academy of Sciences of United States of America 96, 9212–9217 (1999).
Article ADS CAS Google Scholar
Win, J. et al. Adaptive evolution has targeted the C-terminal domain of the RXLR effectors of plant pathogenic oomycetes. Plant Cell 19, 2349–2369 (2007).
Article CAS Google Scholar
Tuori, R. P., Wolpert, T. J. & Ciuffetti, L. M. Heterologous expression of functional Ptr ToxA. Molecular Plant-Microbe Interact 13, 456–464 (2000).
Article CAS Google Scholar

Download references

Acknowledgements

This work was supported by the scientific and technological research program of the Chongqing Municipal Education Commission (KJ15012017) and the National Natural Science Foundation (31400130). The authors thank all contributors for their work and would like to thank the anonymous reviewers for their valuable comments and suggestions.

Author information

Aijun Wang, and Linxiu Pan contributed equally.

Authors and Affiliations

Rice Research Institute of Sichuan Agricultural University, Wenjiang, Chengdu, Sichuan, 611130, China
Aijun Wang, Linxiu Pang, Na Wang, Peng Ai, Shuangcheng Li, Qiming Deng, Jun Zhu, Yueyang Liang, Jianqing Zhu, Ping Li & Aiping Zheng
Key laboratory of Sichuan Crop Major Disease, Sichuan Agricultural University, Wenjiang, Chengdu, Sichuan, 611130, China
Aijun Wang, Shuangcheng Li, Qiming Deng, Jun Zhu, Yueyang Liang, Ping Li & Aiping Zheng
Key Laboratory of Southwest Crop Gene Resource and Genetic Improvement of Ministry of Education, Sichuan Agricultural University, Yaan, Sichuan, 611130, China
Aijun Wang, Shuangcheng Li, Qiming Deng, Jun Zhu, Yueyang Liang, Ping Li & Aiping Zheng
Food Crop Research Institute, Hubei Academy of Agricultural Science, Wuhan, Hubei, 611130, China
Desuo Yin

Authors

Aijun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Linxiu Pang
View author publications
You can also search for this author in PubMed Google Scholar
Na Wang
View author publications
You can also search for this author in PubMed Google Scholar
Peng Ai
View author publications
You can also search for this author in PubMed Google Scholar
Desuo Yin
View author publications
You can also search for this author in PubMed Google Scholar
Shuangcheng Li
View author publications
You can also search for this author in PubMed Google Scholar
Qiming Deng
View author publications
You can also search for this author in PubMed Google Scholar
Jun Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Yueyang Liang
View author publications
You can also search for this author in PubMed Google Scholar
Jianqing Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Ping Li
View author publications
You can also search for this author in PubMed Google Scholar
Aiping Zheng
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.W. conceived, designed and carried out the experiments, analyzed the data and draft the manuscript; L.P. and N.W. conceived and designed the experiments; P.A., D.Y., S.L., M.D., J.Z., Y.L., J.Z., P.L. and A.Z. analyzed and interpreted the data. All the authors reviewed the manuscript.

Corresponding author

Correspondence to Aiping Zheng.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

supplementary figure and table

supplementary dataset 1

supplementary dataset 2

supplementary dataset 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Wang, A., Pang, L., Wang, N. et al. The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics. Sci Rep 8, 15413 (2018). https://doi.org/10.1038/s41598-018-33752-w

Download citation

Received: 19 March 2018
Accepted: 04 October 2018
Published: 18 October 2018
DOI: https://doi.org/10.1038/s41598-018-33752-w
Springer Nature Limited

Keywords

This article is cited by

Comparative genomics reveals low levels of inter- and intraspecies diversity in the causal agents of dwarf and common bunt of wheat and hint at conspecificity of Tilletia caries and T. laevis
- Somayyeh Sedaghatjoo
- Bagdevi Mishra
- Wolfgang Maier
IMA Fungus (2022)
Understanding the Rice Fungal Pathogen Tilletia horrida from Multiple Perspectives
- Aijun Wang
- Xinyue Shu
- Ping Li
Rice (2022)
Profiling of monosporidial populations of Tilletia barclayana causing kernel smut of rice
- Anju Bala Sharma
- Babanpreet Singh
- Sarbjit Kaur
Indian Phytopathology (2021)
Culture media promoting sporulation of rice kernel smut fungus Tilletia barclayana
- Li Wang
- Ella Nysetvold
- Xin-Gen Zhou
European Journal of Plant Pathology (2021)
Transcriptome analysis and whole genome re-sequencing provide insights on rice kernel smut (Tilletia horrida) pathogenicity
- Aijun Wang
- Xinyue Shu
- Aiping Zheng
Journal of Plant Pathology (2020)

The pathogenic mechanisms of Tilletia horrida as revealed by comparative and functional genomics

Abstract

Similar content being viewed by others

Introduction

Results

Genome sequencing and assembly

Genome annotation

T. horrida repeat sequence content

Evolution and comparative genomics

Transcriptome analysis during infection

Genes involved in pathogenicity

The T. horrida secretome

T. horrida candidate effectors and their validation

Novel virulence-associated factors in the transduction signal pathway

Discussion

Materials and Methods

Strain isolates, culture conditions, and genomic DNA and RNA isolation

Genome sequencing and assembly

Analysis of repeats

Gene prediction and annotation

SNP identification

Comparative genome analysis

Transcriptome expression

Gene relation to pathogenicity and virulence-associated signaling pathway

Secreted proteins

Candidate effectors and their validation

Expression vector construction and preparation

Transient protein expression in N. benthamiana

Plant growth conditions and infection assay conditions

qRT-PCR

Data Availability Statement

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Search

Navigation