Regular expressions of MS/MS spectra for partial annotation of metabolite features
- 441 Downloads
Partial annotation and characterization of metabolite structures on the basis of data from tandem mass spectrometry (MS/MS) spectra are technical bottlenecks in metabolomics. Novel approaches should be explored for evaluation of spectral similarities among structurally related compounds as well as for description of fragmentation motifs commonly observed in MS/MS spectra.
A regular expression of MS/MS data was developed to search for structurally similar metabolites and to describe spectral motifs for partial annotation and characterization of metabolite structures.
After definition of an MS/MS string as a text representation of an MS/MS spectrum, a regular expression of MS/MS strings involving meta characters, anchors, and quantifiers was introduced. Here it was also demonstrated that spectral motifs can be described by a regular expression to define a common fragmentation pattern observed among structurally related metabolites.
The regular expression was applied to a search for similar MS/MS spectra. Analysis of MassBank data with fragment assignment information (fragment ion and neutral loss matrix, http://metabolomics.jp/wiki/Index:MassBank) suggested that the regular expression of MS/MS spectra can detect spectral similarities among structurally related metabolites. Analysis of MS/MS spectral libraries of Arabidopsis and rice revealed that the metabolite features can be partially annotated or characterized by the spectral motifs and can be assigned the corresponding ontology codes produced by Chemical Entities of Biological Interest (ChEBI).
The MS/MS spectral motifs represent a method for partial annotation or characterization of metabolite features. A regular expression of MS/MS data holds promise for further enrichment of metabolite annotations and for easy sharing of ambiguous annotation data among metabolomic studies.
KeywordsMS/MS spectrum Regular expression Small molecule identification Mass spectral motif Fragmentation
- Mulder, N. J., & Apweiler, R. (2002). Tools and resources for identifying protein families, domains and motifs. Genome Biology, 3, REVIEWS2001Google Scholar