Introduction

Insect olfactory system enables insects to detect and discriminate different odor molecules from their environment. The odorant and pheromone perception in insects are a complex series of process that transfer external stimulus from the environment to behavioral response. Diverse proteins consisting in this signal transduction pathway include odorant binding proteins (OBPs), chemosensory proteins (CSPs), chemosensory receptor (CRs), odorant degrading enzymes (ODEs) and sensory neuron membrane proteins (SNMPs)1,2,3,4,5,6,7.

The gene family encoding CRs in insects mainly divides into three groups: olfactory receptors (ORs), gustatory receptors (GRs) and ionotropic receptors (IRs)8,9,10,11,12. The insect ORs and the GRs were firstly identified in the genomic analysis of Drosophila melanogaster13. Previous studies suggested that the OR and GR families belong to a superfamily based on the transmembrane domain13,14. Both ORs and GRs are seven transmembrane-domain receptors with an inverted transmembrane topology in contrasts to the OR of mammalian14,15,16. ORs are housed targeted in the dendritic membrane of olfactory sensory neurons (OSNs). Most OSNs express a single OR gene, along with odorant receptor co-receptor (Orco)17. Insect OR genes are highly diverse, and their numbers among species are various13,18,19,20,21,22. ORs have been shown to form a voltage-gated ion channel following heterodimerization with Orco, which is an ortholog of the D. melanogaster OR, DmOr83b23,24. ORs in Lepidopteran pheromone system are classified into two major groups, pheromone receptors (PRs) and general odorant receptors according to perception of chemicals25. PRs are thought to be associated with olfactory signal transduction of pheromone compounds in peripheral olfactory reception, and general odorant receptors are considered to detect plant volatiles. PRs were reported in several species20,26,27,28. Also pheromone-pheromone receptor relationships also have been identified. In Bombyx mori, general odorant receptor BmOR19 responds to linalool, furthermore, BmOR45 and BmOR47 respond to benzoic acid, 2-phenylethanol, and benzaldehyde respectively29. While PRs (BmOR1 and BmOR3) respond specifically to bombykol (E10Z12-16: OH) and bombykal (E10Z12-16: Ald) respectively, which are major and minor sex pheromone components in B. mori.30,31. Although we classified receptors on the basis of their responses to chemicals, their functions are still unknown. PRs are more conserved compared with other ORs32,33. Orco is the only highly conserved OR protein among divergent insect species, which suggests the crucial role in chemical recognition34,35. The GRs are generally expressed in gustatory receptor neurons (GRNs) within gustatory organs. Differently, several GRs are usually expressed in one neuron. Most GRs sense non-volatile compounds, including sugar and bitter compounds36. Interestingly, Some GRs expressed in antennae cannot identify volatile odorants, but respond to carbon dioxide37. IRs belongs to a variant subfamily of ionotropic glutamate receptors (iGluRs), involving in detection of chemical signals38,39. Based on the expression location, two subfamily of IRs are identified: conserved antennal IRs and divergent IRs, indicating the function of IRs in olfactory and taste39. In antennal IRs, IR8a and IR25a appear to act as co-receptors and form heteromeric complexes with odor- specific receptors40,41. In contrast with ORs, many IR-expressing neurons expressed two or three IR genes17.

The yellow peach moth Conogethes punctiferalis (Guenée) (Lepidoptera: Crambidae), is a serious polyphagous insect of crop (sunflower, maize, sorghum, cotton, etc.) and fruit (peach, plum, durian, etc.). It’s hard to control by traditional method, due to larvae bored the inside fruit or stem42,43. Few olfactory receptor genes have been reported in this moth44, however more receptor genes investigated allows a better understanding of the molecular basis of olfaction. Identification of the chemosensory genes can lead to comprehensive understanding of how to recognize and locate host for this moth. In the present study, the antennal transcriptome of C. punctiferalis were sequenced on the Illumina HiSeq 4000 platform. In total 83 chemoreceptor genes including 62 ORs, 11 IRs and 10 GRs were identified. The expression levels of ORs, which may play important roles in chemoreception of C. punctiferalis were analyzed by using of quantitative real-time PCR (qRT-PCR). The results assist with the identification of genes involved in C. punctiferalis olfactory, help better understand the gene expression in sex, and provide the molecular basis for further study of pheromone and host volatile recognition.

Results

Illumina sequencing

To identify the chemosensory protein genes from C. punctiferalis, the cDNA from male and female antennae were sequenced using the Illumina HiSeqTM 4000 sequencing platform. A total of 59,375,582 and 57,353,054 raw reads were obtained from female and male antennae, respectively. After removing adaptor sequences, low quality sequences and containing N sequences, 58,053,354 and 55,592280 clean reads were generated from the antennae of female and male raw data, respectively. After assembled, the two datasets result in 112,933 contigs with a mean length of 902 bp and an N50 of 1750 bp. From these datasets, 76,486 unigenes were obtained with a mean length of 727 bp and an N50 of 1499 bp (Table 1). The raw reads of the C. punctiferalis are available from the SRA database (accession number: (SRX 1604701 and SRX 1604705).

Table 1 Summary of assembled contigs and unigenes.

Functional annotation of assembled unigenes

The proportion of unigenes annotated in at least one database of Nr, Nt, KO, SwissProt, PFAM, GO, KOG reach to 40%. Only 4.75% unigenes were annotated in all databases (Table 2). Using BlastX database, 26,114 (34.14%) unigenes were matched to known proteins. For species distribution, the results showed 33.43%, 24.85%, and 1.91% of protein similar with the sequences from B. mori, Danaus plexippus and Papilio xuthus, respectively (Fig. 1). GO annotation was used to classify the function of transcripts according to the GO terms (Fig. 2A). In the biological process terms, cellular, metabolic and single-organism were the highest classified. In the cellular component terms, cell, cell part and organelle were the most abundant. In the molecular function category, the genes expressed in the antennae were mostly related to binding, catalytic activity and transporter activity. KOG annotation was based on the relationship between orthologous genes from different species (Fig. 2B). Using KOG functional annotation, “General function prediction only”, “Signal transduction”, and “Posttranslational modification, protein turnover, chaperones” were three largest groups assigned to. For KEGG annotation, 10,298 unigenes were classified into five groups, Cellular processes, environmental information processing, genetic information processing, metabolism and organismal systems (Fig. 2C). Transport and catabolism, signal transduction, translation, carbohydrate metabolism, endocrine systems were the main pathways in each group, respectively.

Table 2 Summary of annotations of unigenes.
Figure 1
figure 1

Best hits of the BLASTx results.

Figure 2
figure 2

(A) Gene Ontology (GO) analysis for the transcriptomic sequences. (B) euKaryotic Ortholog Groups (KOG) annotation for the unigenes.(C) KEGG pathway annotation of the transcriptome. (A) Cellular processes. (B) Environmental information processing. (C) Genetic information processing. (D) Metabolism. (E) Organismal systems.

Putative chemoreceptor genes

All the unigenes were searched by BLAST to identified chemoreceptor genes. Total 62 OR genes, 11 IR genes and 10 GR genes were identified through bioinformatics analysis. All the chemosensory receptor genes of C. punctiferalis were submitted to the GenBank (accession numbers: KX084452-KX084521 and KX096203-KX096215). Of these genes, 45 unigenes (39 ORs and 6 IRs) showed sequence identity to the previously identified in C. punctiferalis which are already submitted in GenBank. Among the identified ORs, 38 were found to represent full-length open reading frames (ORF) with 4–7 transmembrane domains (Table 3). Half of those sequences were annotated against homologs of Ostrinia furnacalis. As expected, Orco/OR2 was identified with high identity compared with the conserved insect OR2/Orco. The IRs and GRs sequences in C. punctiferalis antennal transcriptome were identified according to the similarity with known insect IRs and GRs. Seven of eleven putative IRs were similar with IRs of O. furnacalis. Six unigenes with full-length ORF, contained 3~4 transmembrane domains. Of GR genes, 3 putative GR genes contained full-length ORF with 6 transmembrane regions, while 7 sequences were partial (Table 4).

Table 3 Odorant receptors in Conogethes punctiferalis antennae.
Table 4 Ionotropic and gustatory receptors in Conogethes punctiferalis antennae.

Phylogenetic tree

Phylogenetic tree of ORs was constructed using the sequences of 162 ORs from B. mori, O. furnacalis and C. punctiferalis (Fig. 3). The hits with a length of less than 200 amino acids were ignored. The OR sequences were clustered into PRs, Orco and other divergent ORs. The 8 amino acid sequences, OR1 and OR3-9 clustered into a clade with pheromone receptors of B. mori and O. furnacalis. Orco/OR2 was highly conserved and clustered with Orco/OR2 family. In the neighbor-joining tree of IRs (Fig. 4), the IRs were clustered into ionotropic glutamate receptors (iGluRs), IR25/IR8a and other divergent IRs. Among the nine IRs in C. punctiferalis, CpunIR25a grouped with the highly conserved IR25a/IR8a. Alignment analysis revealed that the similarity of IR25a was higher than 76.3% in D. melanogaster, B. mori, Tribolium castaneum and C. punctiferalis (Fig. 5). CpunIR4, CpunIR6, CpunIR3, CpunIR7 shared identity with IR76b, IR21a, IR68a, IR41a from D. melanogaster, B. mori and T. castaneum, respectively. Unfortunately, we cannot obtain the sequences belong to iGluRs. In phylogenetic tree of the GRs, GRs from B. mori, Manduca sexta and C. punctiferalis were analyzed (Fig. 6), CpunGR5 and CpunGR4 were respectively classified into BmorGr64a and MexGR6 subgroup, which were known as sugar receptors. CpunGR3 was cluster into a clade of fructose receptors.

Figure 3
figure 3

Phylogenetic relationship of putative olfactory receptors from Conogethes punctiferalis and other insects.

The tree was constructed by MEGA 5.2 program using the neighbor-joining method with the Bootstrapping model by 1000 replication.

Figure 4
figure 4

Phylogenetic relationship of putative ionotropic receptors from Conogethes punctiferalis and other insects.

The tree was constructed by MEGA 5.2 program using the neighbor-joining method with the Bootstrapping model by 1000 replication.

Figure 5
figure 5

Amino acid alignment of the putative IR25a with other insects.

Blue shadings indicate the same sequence among insects. Pink shading indicated amino acids which show 75% identity between sequences. Green shading ink shading indicated amino acids which show 50% identity between sequences.

Figure 6
figure 6

Phylogenetic relationship of putative gustatory receptors from Conogethes punctiferalis and other insect.

The tree was constructed by MEGA 5.2 program using the neighbor-joining method with the Bootstrapping model by 1000 replication.

Expression level of OR genes in female and male antennae

Quantitative real-time PCR (qPCR) was used to analyze the expression levels of 50 OR genes in the antennae of male and female moths. The results indicated that all the tested genes were detected in antennae. Among the OR genes, OR8, OR39, OR48, OR4 and OR30 were highly enriched in female, with 182.6, 93.9, 84.4, 15.6, and 13.5 times to male, respectively. PR subfamily members, OR1, OR3, OR5 and OR7 were highly expressed in male antennae, as well as the odorant receptor OR62 (Fig. 7). Meanwhile, Orco/OR2, OR26, and OR8 had high expression level in female with an FPKM value of 534.09, 50.52, and 43.43, respectively. Orco/OR2, OR3 and OR6 had high expression level in male with an FPKM value of 523.08, 283.65, 170.90, respectively (Table 3). As co-receptor, Orco/OR2 was most abundant in female and male antennae. Because OR4, OR39, OR48 and OR53 were only detected in female antenna, so we cannot obtain the FPKM value (Table 3).

Figure 7
figure 7

Relative expression levels of putative ORs in the female and male moth antennae.

FA: female antennae; MA: female antennae. The expression levels were estimated using delta delta CT method. Standard error for each sample is represented by error bar.

Discussion

With the rapidly development of next generation sequencing technology, a large number of unigenes were identified. RNA-seq effectively increased the depth of sequencing compared to the previous sequence data from cDNA library46,47,48. From the transcriptome of C. punctiferalis, we identified 83 chemoreceptors including 62 ORs, 11 IRs and 10 GRs. In Crambidae family insect, totally 56 ORs were identified in O. furnacalis20, and in other moths, Cao et al. found 47 ORs and 20 IRs in Chilo suppressalis49, suggested that the olfactory related genes we identified were equivalent to other moths. Total number of identified OR genes varied in different orders or species due to sequencing methods and depth, and/or sample preparations. In Lepidopteran, B. mori, totally 66 ORs were reported in genomic data50. In Dipteran, there were 79 candidate OR genes from Anopheles gambiae and in Hemipteran, 170 OR genes from Apis mellifera were annotated18,21. Especially in Nasonia vitripennis, total number of ORs was up to 30151. However, identified OR genes were only obtained from the antennae, ORs expressed in other tissues might be difficult to identify and the physiological states perhaps influence the amount of ORs in the antennae22. It seemed that more OR genes in Hymenopteran insect, especially the parasitoid wasp, might be due to need more receptors were needed to identify and locate the host.

In Lepidopteran insects, the PRs are more conserved and clustered in a subgroup within the OR family9. In our study, OR1 and OR3-9 of C. punctiferalis were clustered with pheromone receptors of B. mori and O. furnacalis, and we inferred that some or all of them might be putative pheromone receptors. But the expression profiles of these sequences showed that not all of them were male-specific. Koenig considered that at least one of the putative PR should exhibit male-specific expression associated with trichoid sensilla9. Recent studies showed that some PR genes expressed in both sexes. The PR1, 2 and 4 identified from the antennae of C. suppressalis have been detected in both male and female antennae49. From the antennal transcriptome of Spodoptera littoralis, four unigenes were enriched in male antennae, only two of which were annotated as putiative PRs and found to be expressed in antennae of both sexes48. In B. mori, Bmor19, Bmor45 and Bmor47 were also reported as female-specific or highly female-biased receptors, which responded to linalool, benzoic acid and benzaldehyde, suggesting the potential roles in detecting oviposition site and male-produced sex pheromone29. In our study, five unigenes were enriched in the female antennae than in the male, which may have provide important function during the host plant seeking.

IRs are derived from ionotropic glutamate receptors and comprise a novel family of chemosensory receptors. In D. melanogaster, 15 of total 66 IR genes were specifically expressed in antennae38 that was similar with our study that only 11 IRs were identified from the antennae transcriptomes of C. punctiferalis. Phylogenetic analysis showed that IRs from C. punctiferalis were not closely related to the iGluRs. Animal iGluRs are highly conserved family of ligand-gated ion channels39, while IRs are extremely divergent exclusive of IR8a and IR25a38. IR25a and IR8a are most similar primary sequence to iGluRs, suggesting that they are the IR genes most similar to the ancestral IRs9,39,52. In our study, CpunIR25a clustered with the highly conserved IR25a from B. mori, D. melanogaster and T. castenum, while in N. vitripennis and Microplitis mediator, two putative IR25a orthologues were identified22,39. Therefore, the congruent function of IR25a orthologues is still not clear. In Lepidoptera insect, IR8a were identified from the antennal transcriptome of Helicoverpa armigera, H. assulta and C. suppressalis26,49. Surprisingly, IR8a was not found in our antennal transcriptome. We speculated that it maybe need more deeply sequencing and increasing the samples, suggesting that IR8a maybe not expressed in our present samples.

In the previous studies, 56 and 77 candidate GR genes were respectively identified from the Drosophila and Acyrthosiphon pisum genome14,53. In B. mori, 65 GR genes were annotated from the silkworm genome54. However, only 10 GRs were identified from the antennal transcriptome of C. punctiferalis. It might be that GRs have a low expression level in the antennae and mainly expressed in gustatory organs (proboscis, legs, wings and genitalis)55,56. GRs have been divided into four classes: non-fructose sugar, bitter, CO2, and fructose receptors9,54. In phylogenetic tree of GRs, the gustatory receptor genes, GR3-5 were clustered into sugar and fructose receptors, suggesting the potential sugar detection function in the antennae of this moth. However, the recent report showed a wide range of non-gustatory sensory functions of GRs57, indicating that GRs may have more divergent functions in insect antennae.

In conclusion, the main purpose of this study was to identify the chemosensory receptor genes involved in the chemoreception. Based on the transcriptomic analysis, 83 CRs were identified from the antennae of C. punctiferalis. Our method was successful in identifying CR genes with low-expressing levels and the results made it possible for further studies on the molecular level and behaviors, providing the possibility to applicate potential target genes for controlling this pest.

Materials and Methods

Insects rearing and antennae collection

C. punctiferalis larvae were collected from the infested sunflower at Langfang Experimental Station of Chinese Academy of Agricultural Sciences, Hebei Province, China. In laboratory, the larvae were raised on fresh corn ear under conditions 27 ± 1 °C, with 70–80% relative humidity (RH) and a photo period of 16:8 h light: dark (L:D). After eclosion, adults were fed with 10% honey solution. The antennae (70 pairs of each sex) from adults within three days after eclosion were separately collected and frozen immediately in liquid nitrogen.

RNA extraction

Total RNAs were separately extracted and purified using Trizol Reagent (Invitrogen, Carlsbad, CA, USA) following the product manuals. The integrity of total RNA was assessed with Agilent 2100 (Agilent Technologies Inc., CA, USA) and the purity was detected on a NanoDrop 2000 spectrophotometer (NanoDrop products, Wilmington, DE, USA). The concentration was accurately measured by Qubit. Illumina sequencing was performed at Novogen Co., Ltd. Beijing, China, The cDNA libraries were sequenced using Illumina HiSeq 4000 system under effective concentration. In order to obtain the high-quality transcriptome sequence data, the raw reads were filtered to eliminate adaptor sequences and low quality reads. Reads with uncertain nucleotides larger than 10% of the fragment sequence were removed. Trinity de novo program with a default k-mer was used to assemble the clean reads45. Redundant sequences were removed to obtain unigenes sequences by means of selecting longest transcript contigs.

Unigenes annotation and classification

To obtain comprehensive information of gene functions, assembled unigenes were searched against Nr (NCBI non-redundant protein sequences), Nt (NCBI nucleotide), Pfam (Protein family), KOG (euKaryotic Ortholog Groups), SwissProt, KEGG (Kyoto Encyclopedia of Genes and Genomes), GO (Gene Ontology) database using BlASTX with an E-value < 10−5. CDS (coding sequence) were predicted in two-step process, the unigenes were firstly aligned using NR and SwissProt protein database to obtain ORF. If the sequences mismatch with the two databases, estscan (version 3.0.3) software was used to predict the ORF to obtain the nucleic acid and amino acid sequence. FPKM (fragments per kilobase pairs per million mapped reads) was used for gene expression analysis.

Identification of chemosensory receptor genes

Homology searches of OR, GR and IR sequence were performed using BLAST (http://blast.ncbi.nlm.nih.gov/blast.cgi), and the ORFs were predicted using ORF finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html). Transmembrane domain was predicted using TMHMM (http://www.cbs.dtu.dk/services/TMHMM/). Phylogenetic tree was constructed by MEGA 5.2 software using the neighbor-joining method with the Bootstrapping model by 1000 replication. The evolutionary distances were computed by using the Poisson correction method. The phylogenetic tree image was further created by Figtree software.

Quantitative real-time PCR

The expression levels of ORs were examined in male and female antennae using qRT-PCR. Total RNA was extracted as described above. First-strand cDNAs were synthetize followed One-Step gDNA removal and cDNA Synthesis kit (TransGen Biotech, China) with oligo dT-primer following the kit manual. The primers for qPCR were designed using Primer premier5.0 program (Table S1). qRT-PCR were performed on ABI 7500 fast real-time PCR system (Applied Biosysterm, USA) in a reaction volume of 20 μL using SYBR Premix Ex Taq II (Tli RNaseH Plus) master mix (Takara-Bio, Shiga, Japan), according to the manufacturers’ instructions. Two-step program were performed as follows, 95 °C for 30 s, followed by 40 cycles of 95 °C for 3 s and 60 °C for 30 s. For each gene, three biological replications were performed with each biological replication measured in three technique replications. Relative quantification was analyzed using the comparative 2−ΔΔCT method, with the housekeeping genes β-actin (accession number JX119014) as the reference gene.

Additional Information

How to cite this article: Ge, X. et al. Identification of putative chemosensory receptor genes from yellow peach moth Conogethes punctiferalis (Guenée) antennae transcriptome. Sci. Rep. 6, 32636; doi: 10.1038/srep32636 (2016).