RDMAS: a web server for RNA deleterious mutation analysis

Shu, Wenjie; Bo, Xiaochen; Liu, Rujia; Zhao, Dongsheng; Zheng, Zhiqiang; Wang, Shengqi

doi:10.1186/1471-2105-7-404

RDMAS: a web server for RNA deleterious mutation analysis

Software
Open access
Published: 06 September 2006

Volume 7, article number 404, (2006)
Cite this article

Download PDF

You have full access to this open access article

BMC Bioinformatics Aims and scope Submit manuscript

RDMAS: a web server for RNA deleterious mutation analysis

Download PDF

Wenjie Shu^1,3,
Xiaochen Bo¹,
Rujia Liu⁴,
Dongsheng Zhao²,
Zhiqiang Zheng³ &
…
Shengqi Wang¹

6209 Accesses
27 Citations
Explore all metrics

Abstract

Background

The diverse functions of ncRNAs critically depend on their structures. Mutations in ncRNAs disrupting the structures of functional sites are expected to be deleterious. RNA deleterious mutations have attracted wide attentions because some of them in cells result in serious disease, and some others in microbes influence their fitness.

Results

The RDMAS web server we describe here is an online tool for evaluating structural deleteriousness of single nucleotide mutation in RNA genes. Several structure comparison methods have been integrated; sub-optimal structures predicted can be optionally involved to mitigate the uncertainty of secondary structure prediction. With a user-friendly interface, the web application is easy to use. Intuitive illustrations are provided along with the original computational results to facilitate quick analysis.

Conclusion

RDMAS can be used to explore the structure alterations which cause mutations pathogenic, and to predict deleterious mutations which may help to determine the functionally critical regions. RDMAS is freely accessed via http://biosrv1.bmi.ac.cn/rdmas.

RNAissance

A Biologically Meaningful Extension of the Efficient Method for Deleterious Mutations Prediction in RNAs: Insertions and Deletions in Addition to Substitution Mutations

Concepts and Introduction to RNA Bioinformatics

Background

In addition to its central role in information transfer from DNA to protein, RNA performs a remarkable range of functions [1]. Large numbers of noncoding RNA (ncRNA) transcripts are being revealed [2]. Exploring the role and diversity of these numerous ncRNAs now constitutes a main challenge in life science [3]. In a broad sense, the list of functional ncRNAs also includes functional motifs within protein-coding genes, located mostly in the non-translated 5' or 3' regions of messenger RNAs.

Mutations in RNA genes may lead to striking alterations in RNA structures that impair functions, resulting in diseases. Mutations in some RNA regulators have been reported to be associated with neuropsychiatric disorders [4]. Mutations of tRNAs in mitochondria are reported to harbor more than half of all known mitochondrial pathogenic mutations [5]. Some recent researches also show that mutations in microRNA (miRNA) genes and its flanking sequences may contribute to cancer [6–8].

On the other hand, RNA deleterious mutations could be "beneficial" in some situation. The distribution of the recognized ribosomal functional sites and the antibiotic action sites has been found to be clearly correlated with the location of the known deleterious mutations in bacterial rRNAs. Therefore, deleterious mutations in rRNAs can serve as hallmarks of both functionally important ribosomal centers and antibiotic sites [9]. In their study on influenza viruses, Herlocher et al. found a nonsense mutation on PB2 segment which causes much difference in the secondary structure responsible for cold adaptation [10], that implies that viruses with similar deleterious mutations have potential for live vaccines.

In principle, a RNA mutation could be deleterious because it disrupts a functional site involved in catalysis, ligand-binding, or interaction with proteins. Since the functions of the ncRNAs critically depend on their specific structures, nucleotide alterations which result in structure change are expected to be deleterious. From this point of view, structure analysis should help to identify deleterious mutations. Some structure based method for RNA deleterious mutation analysis have been presented [11, 12], which are applicable when few homologs are available. A user friendly Java application named RNAmute for RNA deleterious mutation analysis has also been reported [13, 14].

The RDMAS we describe here is a noncommercial web application for RNA deleterious mutation analysis. Several secondary structure comparison methods have been implemented in RDMAS to evaluate structure deleteriousness of single nucleotide substitution in RNA molecules.

Implementation

Structural dissimilarity metric

There are 3 × N possible single point mutations for a RNA molecule with N nucleotides. The deleteriousness of these mutations is analyzed in RDMAS on the basis of structure difference. The dissimilarity of secondary structures between wild-type and mutant, D(R, R^*), is used to predict the deleteriousness of mutations. Four types of metric are employed, which are:

(i)
Difference between free energy of RNA secondary structures, i.e.D(R, R^*) = |E(R) - E(R^*)|, where E(·) is the free energy computation function.
(ii)
Edit distance between tree or their string representations of RNA secondary structures, i.e.D(R, R^*) = ED(R, R^*), where ED(·) represents the edit distance computation functions. The structure comparisons are implemented using Vienna RNA package [15, 16] based on four different tree representations, including full, homeomorphically irreducible tree (HIT), coarse grained and weighted coarse representation.
(iii)
Difference between topological indices of RNA structures, i.e. D(R, R^*) = |I(R) - I(R^*)|, where I(·) represents the topological index computation functions. Several topological indices defined on the RNA tree graph representation has been presented [12, 17–20]. Suggested by Merris and tested by Barash's group, the Wiener index which has been widely used in computational biochemistry has also been introduced into RNA graph [21, 22] recently. There is an interesting relation [23] between the Wiener number and the Laplacian spectrum of tree graph used in RNAMute. We have also proposed and employed novel topological descriptors defined on Shapiro's coarse grained and weighted coarse grained RNA tree [24] to characterize RNA structures (details will be published elsewhere). The topological indices used in RDMAS are listed in Table 1. Detailed descriptions can be found in the online manual of the web server (Figure 1C).

Table 1 Topological indices used to measure the structural difference between RNAs in RDMAS.

Full size table

(iv)
Base pair distance between dot-bracket representations of RNA structures, i.e. D(R, R^*) = BP(R, R^*), where BP(·) represents the base pair distance computation function.

The secondary structure prediction in RDMAS is implemented using RNAfold and RNAsubopt [25] from the Vienna RNA package [15, 16]. The former is a variation of the Zuker and Stiegler [26, 27] minimum free energy problem that extends McCaskill's algorithm [28] and computes the complete density of states of an RNA sequence at predefined energy resolution, while the latter is for the calculation of all suboptimal structures within a user defined energy range above the MFE. In order to mitigate the uncertainty of the MFE structure, suboptimal structures of mutants within 1 kcal/mol (the default setting of RNAsubopt) above the minimum free energy (MFE) are considered. Three methods are used to estimate the difference between the structures of the wild-type and possible structure set of the mutant Γ^* = { $R_{1}^{*}$ , $R_{2}^{*}$ ,…, $R_{n}^{*}$ }, where $R_{i}^{*}$ represents the i th predicted structure of the mutant. The two extreme values, D'(R, Γ^*) = $\max_{i} {D (R, R_{i}^{*})}$ and D'(R, Γ^*) = $\max_{i} {D (R, R_{i}^{*})}$ are taken for the most optimistic and the most pessimistic estimation, respectively. The synthetic estimation is given by summing the contribution of all structures weighted by their Boltzmann probabilities, which is similar to the methods used in some research [29]. In this case, the deleteriousness is given by $D^{'} (R, Γ^{*}) = \sum_{i = 1}^{n} w_{i} \cdot D (R, R_{i}^{*}) / \sum_{i = 1}^{n} w_{i}$ , where $w_{i} = \exp {- [E (R_{i}^{*}) - E (R_{M F E}^{*})] / k T}$ .

Input and options

With a step-by-step style input interface (Figure 1A), the RDMAS web server is easy to use. The sequence of a RNA molecule can be input either by pasting raw sequence or by uploading sequence file in FASTA format. Multi-FASTA (MFA) format sequence file is also supported to facilitate users. The limit of sequence length is 200 bases for immediate jobs and 2,000 bases for batch jobs, which meets the need of ncRNA analysis in most cases. For batch jobs, a valid email address is required. The analysis scheme is designed to be custom-built for users. The algorithms for computing structure difference and the methods for using the sub-optimal structures can be selected by users.

Output

The intermediate result report page will be refreshed automatically every 5 seconds after immediate jobs submission. The output page (Figure 1B) of an immediate job can be seen within 1 minute. Served as an online interactive analysis interface, all the output result can be viewed as graphic representation or text list by selecting the content item and clicking the "view" button on the output page. For batch jobs, a notification email containing a URL linked to the output page will be sent to the user when the job has been completed. The URL remains valid for 48 hours.

To make the analysis results intuitive, the maximum difference in structures between the wild-type and the possible mutants at each position are extracted into a structural deleteriousness profile and plotted as waveforms (Figure 2B). The structurally important sites can be easily revealed by peaks with high structural deleteriousness on the profile. The list of the structural deleteriousness values (Figure 2D) and the corresponding dot-bracket representations of secondary structures (Figure 2E) can be displayed as plain text on the output page.

The statistical distributions of the deleteriousness value are calculated and illustrated as histograms (Figure 2C), which may facilitate the analysis on RNA mutational robustness.

With a hyperlink located at the bottom of the output page (Figure 1B), the output page offers download of the results as a single packed file in ".gz" format for off-line analysis. In addition to the structural deleteriousness profile and deleteriousness distribution histogram (all in "PNG" image format), the secondary structure illustration of the wild-type and the mutants (all in PostScript format) are also included in the result file. The result file name is in the form "yymmddhhmmss.no", where "yy" is year, "mm" is month, "dd" is day, "hh" is hour, mm is minute, "ss" is second and "no" is serial number.

Results and discussion

Performance of the web server

To test the computational efficiency of RDMAS, 500 random sequences (listed in Additional files) with 10 different lengths were submitted. All types of structure distance measurement are used in these tests. The CUP time of these 500 tests is illustrated in Figure 3.

Case study

By using artificial mutants, some investigations have been done on the sequence and structural requirements for miRNA processing and functions [30–32]. These experimental results have shown that the base-pairing at the base of the precursor stem is critical for miRNA processing, while the internal loops, terminal loops and bulges are proved to be not essential.

To demonstrate how our web application can be helpful to the analysis on deleterious mutations in ncRNAs, the precursor of human miRNA miR-21 (pre-miR-21), a stem-loop of 71 nt, has been analyzed using RDMAS. Figure 2B is the structural deleteriousness profile of pre-miR-21 computed based on the tree edit distance of HIT representation. Figure 2C is the corresponding deleteriousness distribution histogram. The structures of the wild-type and three mutants G5U, A17C and U38A are illustrated in Figure 2A. The structural deleteriousness of possible mutants and the corresponding dot-bracket representations of the structures are listed partly in Figure 2D and Figure 2E.

It is shown that most mutants in pre-miR-21 are not deleterious. The mutations opening the base of the precursor stem lead to marked difference in RNA structure, while the mutations in the terminal loop and bulge seem to be less deleterious. These results are in good accord with the main conclusions drawn in the aforementioned experimental studies.

Future works

Although the suboptimal structures of the mutant can be used in RDMAS, the structural distance measurement using multiple predicted structures is still a challenge to the present methods. Further research is needed to find approaches to measure the structural distance taking suboptimal structures of both the wild-type and the mutant into consideration at the same time.

On the basis of the criteria of conservation and compensatory co-evolution, Kondrashov presented a method using multiple homologous sequences to predict pathogenic mutations in mitochondria encoded human tRNAs [33]. In some other mutation studies on ncRNAs, especially on viral and bacterial RNAs, enough amounts of homologous sequences are also available. Our further research will also focus on developing methods for RNA deleterious mutation analysis using both homologic and structural information.

Conclusion

Compared to single nucleotide mutation analysis in protein-coding gene, research on RNA mutation has been insufficient, both bioinformatics algorithms and applications are needed. Like RNAmute [13], the RDMAS we developed is a non-commercial software for RNA deleterious mutation analysis, and will be helpful both in the researches on the structure-function relationship of ncRNAs (such as functionally critical region identification) and in the RNA-targeted drug design.

Availability and requirements

Project name: RDMAS

Project home page: http://biosrv1.bmi.ac.cn/rdmas

Operating system(s): Linux, Unix (no GUI)

Programming language: C and PHP

Other requirements: Vienna RNA package

License: GPL

Restrictions to use by non-academics: on request

References

Caprara MG, Nilsen TW: RNA: Versatility in form and function. Nat Struct Mol Biol 2000, 7: 831–833. 10.1038/82816
Article CAS Google Scholar
Mattick JS: The Functional Genomics of Noncoding RNA. Science 2005, 309: 1527–1528. 10.1126/science.1117806
Article CAS PubMed Google Scholar
Claverie JM: Fewer Genes, More Noncoding RNA. Science 2005, 309: 1529–1530. 10.1126/science.1116800
Article CAS PubMed Google Scholar
Perkins DO, Jeffries C, Sullivan P: Expanding the 'central dogma': the regulatory role of nonprotein coding genes and implications for the genetic liability to schizophrenia. Mol Psychiatry 2005, 10: 69–78. 10.1038/sj.mp.4001577
Article CAS PubMed Google Scholar
Brandon MC, Lott MT, Nguyen KC, Spolim S, Navathe SB, Baldi P, Wallace DC: MITOMAP: a human mitochondrial genome database--2004 update. Nucl Acids Res 2005, 33: D611-D613. 10.1093/nar/gki079
Article PubMed Central CAS PubMed Google Scholar
Calin GA, Ferracin M, Cimmino A, Di Leva G, Shimizu M, Wojcik SE, Iorio MV, Visone R, Sever NI, Fabbri M, Iuliano R, Palumbo T, Pichiorri F, Roldo C, Garzon R, Sevignani C, Rassenti L, Alder H, Volinia S, Liu C, Kipps TJ, Negrini M, Croce CM: A MicroRNA Signature Associated with Prognosis and Progression in Chronic Lymphocytic Leukemia. The New England Journal of Medicine 2005, 353: 1793–1801. 10.1056/NEJMoa050995
Article CAS PubMed Google Scholar
Eder M, Scherr M: MicroRNA and Lung Cancer. The New England Journal of Medicine 2005, 352: 2446–2448. 10.1056/NEJMcibr051201
Article CAS PubMed Google Scholar
Chen CZ: MicroRNAs as Oncogenes and Tumor Suppressors. The New England Journal of Medicine 2005, 353: 1768–1771. 10.1056/NEJMp058190
Article CAS PubMed Google Scholar
Yassin A, Fredrick K, Mankin AS: Deleterious mutations in small subunit ribosomal RNA identify functional sites and potential targets for antibiotics. PNAS 2005, 102: 16620–16625. 10.1073/pnas.0508444102
Article PubMed Central CAS PubMed Google Scholar
Herlocher ML, Maassab HF, Webster RG: Molecular and Biological Changes in the Cold-Adapted "Master Strain" A/AA/6/60 (H2N2) Influenza Virus. PNAS 1993, 90: 6032–6036. 10.1073/pnas.90.13.6032
Article PubMed Central CAS PubMed Google Scholar
Margalit H, Shapiro BA, Oppenheim AB, Maizel JVJ: Detection of common motifs in RNA secondary structures. Nucleic Acids Res 1989, 17: 4829–4845.
Article PubMed Central CAS PubMed Google Scholar
Barash D: Deleterious mutation prediction in the secondary structure of RNAs. Nucl Acids Res 2003, 31: 6578–6584. 10.1093/nar/gkg872
Article PubMed Central CAS PubMed Google Scholar
Churkin A, Barash D: RNAmute: RNA secondary structure mutation analysis tool. BMC Bioinformatics 2006, 7: 221. 10.1186/1471-2105-7-221
Article PubMed Central PubMed Google Scholar
Churkin A, Barash D: Structural Analysis of Single-Point Mutations Given an RNA Sequence: A Case Study with RNAMute. EURASIP Journal on Applied Signal Processing 2006, 2006: 56246. 10.1155/ASP/2006/56246
Google Scholar
Hofacker IL: Vienna RNA secondary structure server. Nucl Acids Res 2003, 31: 3429–3431. 10.1093/nar/gkg599
Article PubMed Central CAS PubMed Google Scholar
Hofacker IL: Fast folding and comparison of RNA secondary structures. Monatsh Chem 1994, 125: 167–188. 10.1007/BF00818163
Article CAS Google Scholar
Barash D: Second eigenvalue of the Laplacian matrix for predicting RNA conformational switch by mutation. Bioinformatics 2004, 20: 1861–1869. 10.1093/bioinformatics/bth157
Article CAS PubMed Google Scholar
Benedetti G, Morosetti S: A graph-topological approach to recognition of pattern and similarity in RNA secondary structures. Biophys Chem 1996, 59: 179–184. 10.1016/0301-4622(95)00119-0
Article CAS PubMed Google Scholar
Bermudez CI, Daza EE, Andrade E: Characterization and comparison of Escherichia coli transfer RNAs by graph theory based on secondary structure. J Theor Biol 1999, 197: 193–205. 10.1006/jtbi.1998.0866
Article CAS PubMed Google Scholar
Karklin Y, Meraz RF, Holbrook SR: Classification of non-coding RNA using graph representations of secondary structure. Pac Symp Biocomput 2005, 4–15.
Google Scholar
Avihoo A, Barash D: Prediction of Small RNA Conformational Switching Using Fine-Grain Graph Representations and the Wiener Index: 20050/5/16; Haifa, Israel. 2005.
Google Scholar
A.Avihoo, D.Barash.: Shape Similarity Measures for the Design of Small RNA Switches. Biomolecular Structure and Dynamics 2006, 24: 17–24.
Article Google Scholar
Merris R: An edge version of the matrix-tree theorem and the wiener index. Linear and Multilinear Algebra 1989, 25: 291–296.
Article Google Scholar
Shapiro BA: An algorithm for comparing multiple RNA secondary structures. Comput Appl Biosci 1988, 4: 387–393.
CAS PubMed Google Scholar
Wuchty S, Fontana W, Hofacker IL, Schuster P: Complete suboptimal folding of RNA and the stability of secondary structures. Biopolymers 1999, 49: 145–165. 10.1002/(SICI)1097-0282(199902)49:2<145::AID-BIP4>3.0.CO;2-G
Article CAS PubMed Google Scholar
Zuker M: Mfold web server for nucleic acid folding and hybridization prediction. Nucl Acids Res 2003, 31: 3406–3415. 10.1093/nar/gkg595
Article PubMed Central CAS PubMed Google Scholar
Zuker M, Stiegler P: Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucl Acids Res 1981, 9: 133–148.
Article PubMed Central CAS PubMed Google Scholar
McCaskill JS: The equilibrium partition function and base pair binding probabilities for RNA secondary structure. Biopolymers 1990, 29: 1105–1119. 10.1002/bip.360290621
Article CAS PubMed Google Scholar
Kitagawa J, Futamura Y, Yamamoto K: Analysis of the conformational energy landscape of human snRNA with a metric based on tree representation of RNA structures. Nucl Acids Res 2003, 31: 2006–2013. 10.1093/nar/gkg288
Article PubMed Central CAS PubMed Google Scholar
Y L, C A, J H, H C, J K, J Y, J L, P P, O R, S K, VN K: The nuclear RNase III Drosha initiates microRNA processing. Nature 2003, 425: 415–419. 10.1038/nature01957
Article Google Scholar
Zeng Y, Cullen BR: Structural requirements for pre-microRNA binding and nuclear export by Exportin 5. Nucl Acids Res 2004, 32: 4776–4785. 10.1093/nar/gkh824
Article PubMed Central CAS PubMed Google Scholar
Zeng Y, Cullen BR: Sequence requirements for micro RNA processing and function in human cells. RNA 2003, 9: 112–123. 10.1261/rna.2780503
Article PubMed Central CAS PubMed Google Scholar
Kondrashov FA: Prediction of pathogenic mutations in mitochondrially encoded human tRNAs. Hum Mol Genet 2005, 14: 2415–2419. 10.1093/hmg/ddi243
Article CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the Super Biomed Computation Center at Beijing Institute of Health Administration and Medicine Information for providing computing resources. We thank Mingjing Lu (Tsinghua University) for the graphic design of the web interface. This work is supported by a grant from the Special Funds for Major State Basic Research Program of China (973 Program) (No. 2004CB518904).

Author information

Authors and Affiliations

Beijing Institute of Radiation Medicine, Beijing, 100850, China
Wenjie Shu, Xiaochen Bo & Shengqi Wang
Beijing Institute of Health Administration and Medicine Information, Beijing, 100850, China
Dongsheng Zhao
College of Electro-Mechanic and Automation, National University of Defense Technology, Changsha, Hunan, 410073, China
Wenjie Shu & Zhiqiang Zheng
Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Rujia Liu

Authors

Wenjie Shu
View author publications
You can also search for this author in PubMed Google Scholar
Xiaochen Bo
View author publications
You can also search for this author in PubMed Google Scholar
Rujia Liu
View author publications
You can also search for this author in PubMed Google Scholar
Dongsheng Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Zhiqiang Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Shengqi Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Xiaochen Bo or Shengqi Wang.

Additional information

Authors' contributions

WS and XB designed and developed the methodology. WS and RL programmed the web application. DZ and ZZ test the software. XB wrote the manuscript. SW guided the project.

Wenjie Shu, Xiaochen Bo, Rujia Liu contributed equally to this work.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Shu, W., Bo, X., Liu, R. et al. RDMAS: a web server for RNA deleterious mutation analysis. BMC Bioinformatics 7, 404 (2006). https://doi.org/10.1186/1471-2105-7-404

Download citation

Received: 19 April 2006
Accepted: 06 September 2006
Published: 06 September 2006
DOI: https://doi.org/10.1186/1471-2105-7-404

RDMAS: a web server for RNA deleterious mutation analysis