Genome-wide identification and phylogenetic analysis of plant RNA binding proteins comprising both RNA recognition motifs and contiguous glycine residues
This study focused on the identification and phylogenetic analysis of glycine-rich RNA binding proteins that contain an RNA recognition motif (RRM)-type RNA binding domain in addition to a region with contiguous glycine residues in representative plant species. In higher plants, glycine-rich proteins with an RRM have attracted considerable interest because they respond to environmental cues and play a role in cold tolerance, pathogen defense, flowering time control, and circadian timekeeping. To identify such RRM-containing proteins in plant genomes, we developed an RRM profile based on the known glycine-rich RRM-containing proteins in the reference plant Arabidopsis thaliana. Compared to a search performed with the known RRM_1 profile, this remodeled RRM profile, which omits sequences from non-plant species, reduced the noise when searching plant genomes for RRM proteins. Furthermore, we developed an island scoring function that uses a sliding window approach to identify regions with contiguous glycine residues. This approach tags regions in a protein sequence with a high content of the same amino acid, and repetitive structures score higher. Defining repetitive structures within a fixed sequence length offers a new way to characterize patterns that cannot easily be described as regular expressions. By combining the profile-based domain search for well-conserved regions (the RRM) with a scoring technique for regions with repetitive residues, we identified groups of proteins related to the A. thaliana glycine-rich RNA binding proteins in eight plant species.
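The sliding-window idea behind the island scoring can be illustrated with a minimal sketch. The abstract does not give the actual scoring formula, so everything below is an assumption made for illustration: the function names `island_scores` and `find_islands`, the window length, the threshold, and the rule that the k-th residue in an uninterrupted run adds k points (so a contiguous run outscores the same number of scattered residues).

```python
def island_scores(seq, residue="G", window=20):
    """Score every window of `seq` for one residue type.

    Assumed scoring rule (not taken from the paper): the k-th residue
    in an uninterrupted run adds k points, so a run of length n
    contributes n * (n + 1) / 2 and contiguous stretches score higher
    than scattered residues.
    """
    scores = []
    # A sequence shorter than the window is scored as a single window.
    for start in range(max(1, len(seq) - window + 1)):
        run = 0
        score = 0
        for aa in seq[start:start + window]:
            if aa == residue:
                run += 1
                score += run
            else:
                run = 0  # any other residue interrupts the run
        scores.append(score)
    return scores


def find_islands(seq, residue="G", window=20, threshold=30):
    """Return merged half-open (start, end) intervals covered by
    windows whose score reaches `threshold` (a hypothetical cutoff)."""
    islands = []
    for start, score in enumerate(island_scores(seq, residue, window)):
        if score < threshold:
            continue
        end = min(start + window, len(seq))
        if islands and start <= islands[-1][1]:
            # Overlapping or adjacent windows extend the current island.
            islands[-1] = (islands[-1][0], max(islands[-1][1], end))
        else:
            islands.append((start, end))
    return islands


if __name__ == "__main__":
    # Same glycine content, different arrangement.
    print(island_scores("GGAA", window=4), island_scores("GAGA", window=4))
    print(find_islands("AAAAGGGGGGAAAA", window=6, threshold=21))
```

Under this assumed rule, two sequences with identical glycine content are ranked by how contiguous the glycines are, which is the kind of pattern that is awkward to capture with a single regular expression.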
Keywords: Glycine-rich domains; HMMER biosequence analysis; MUSCLE alignment; Orthology prediction; RNA binding protein; RNA recognition motif; Plant
Arabidopsis thaliana zinc finger-containing glycine-rich RNA binding protein
RNA binding protein
RNA recognition motif
We thank Dr. Florian Peschke for his contribution to writing the manuscript and preparing the figures.
Compliance with ethical standards
This work was supported by the DFG (STA653).
Conflict of interest
Martin Lewinski declares that he has no conflict of interest. Armin Hallmann declares that he has no conflict of interest. Dorothee Staiger declares that she has no conflict of interest.
This article does not contain any studies with animals performed by any of the authors.