The metabolomics standards initiative (MSI)
In 2005, the Metabolomics Standards Initiative has been formed. An outline and general introduction is provided to inform about the history, structure, working plan and intentions of this initiative. Comments on any of the suggested minimal reporting standards are welcome to be sent to the open email list Msiemail@example.com
KeywordsStandardization Metabolomics Metabonomics Metabolite profiling Databases Ontology
Metabolomic data can be an important source of new knowledge, as long as it can be shared in a way to permit use and re-use. Because the biological milieu can change rapidly, and can be highly responsive to the environment, history and handling of the subject, details of the experiment are critical to interpretation of the data and to its re-use in meta-analyses of data from multiple experiments. There is, therefore, a need for data reporting standards, such as checklists, outlining the minimal information content that should be reported, common syntax, defining the transmission formats that facilitate the exchange of information, and common semantics, adding the interpretive layer to the data (Field and Sansone 2006). Data reporting standards support an unambiguous description of the study and its execution, the subjects, and the biological materials and data derived from them. Standards make information more accessible and enable the extraction of maximum value from data sets, by supporting efficient querying, accurate data analysis, and facilitate data exchange (Quackenbush 2004). At least, reporting standards should be usable for minimum requirements in the peer-reviewing process of journal publications.
Led by the Imperial College in London (UK), a consortium for metabonomic toxicity had started working in 2001 on generating databases of 1H NMR spectra of body fluids from animal models, involving six major pharmaceutical corporations (Lindon et al. 2003). Due to the collaborative nature of this consortium, standards were needed how to exchange data and communicate results, leading to the SMRS initiative (Standard Metabolic Reporting Structure). The SMRS group published their detailed discussion in their forum (SMRS 2005) and in a peer-reviewed paper in 2005 (Lindon et al. 2005). Simultaneously, a plant focused consortium worked on evolving a generic data model to provide a basis for the design of systems for data storage and exchange, called Architecture for Metabolomics (ArMet) (Jenkins et al. 2004). Both efforts were complementary, because ArMet focused on the conceptual framework how to organize metabolomic data and supporting information (called ‘metadata’) whereas the SRMS group initiated detailed recommendations of which parameters and data were necessary to be reported. Following a series of MetaboMeetings (2006) initiated at The European Bioinformatics Institute, participants of both initiatives joined with further interested scientists to form the Metabolomics Standards Initiative (MSI). The MSI took up the work end of 2005 after an inaugural workshop (Castle et al. 2006) hosted by the U.S. National Institutes of Health (NIH) and the Metabolomics Society.
Standardization efforts at the MSI are being carried out by an international community of volunteers who are working to generate broad community consensus around the proposed standards. The reports presented in this issue of Metabolomics are designed as a means to spotlight ongoing work and an important mile stones rather than a mechanism to report final descriptions of the various standards under development. It is hoped that these reports will spark additional discussion, community input, and raise awareness of this effort. Progress on these objectives and dissemination of updated versions of MSI documents are publicly available at the initiative’s public webpage [http://msi-workgroups.sourceforge.net/]. The aim of the reporting standards proposed here is not to prescribe how an experiment is carried out but rather to provide a common mechanism for describing the work so that the data can be made available to others for evaluation, or to support an extension or repeat of the work as desired, or published in a public repository. In metabolomics, more so than in other ‘omics areas’, the collection of data about the subject during and after sample preparation is critical to support interpretation of the results and also to facilitate use of the data in other analyses that may uncover new relationships between biological state and the metabolome.
- 1.Biological context metadata with the following subgroups:
mammalian studies, divided between human and animal studies
cell cultures and microbiology
For all groups there is certain degree of overlap with other existing initiatives in functional genomics, namely the Human Proteome Organization Proteomics Standards Initiative (Taylor et al. 2006) and the Microarray Gene Expression Data Society (Ball and Brazma 2007). Interaction with these and related initiatives and communities is ensured by both participating in workshops and collaborating at the umbrella level under several synergistic projects. These include: (a) MIBBI (2007), a response to the current proliferation of Minimal Information checklists; (b) Functional Genomics Object Model (Jones et al. 2006), to underpin the development of tools, XML formats and database schemata; (c) the Ontology for Biomedical Investigations (Whetzel et al. 2006; OBI 2006), to build a cross-domain ontology as a resource for the annotation of investigations. The aim of such collaborations is to facilitate data integration across biological or cellular domains and instrument equipments, but also to avoid duplication of efforts and to define common standards (Quackenbush 2006).
The MSI acknowledges and appreciates all input into this effort so far, and actively seeks for greater participation (Fiehn et al. 2006) for improving the current stage of its Standards documents. Apart from ensuring a large community consensus by reaching out to a greater number of scientists in academia and industry, these MSI reporting standards are thought to be eventually endorsed and supported by funding agencies, database repositories and scientific journals. Ultimately, it is envisioned that regulatory bodies may adopt these recommendations as a way to standardize reports of experimental data submitted by their diverse constituencies with the goal of simplifying the task of reviewers thus accelerating the process. Indeed, the US FDA has already utilized the early efforts of MSI in the preparation of their draft (and yet to be published) metabolomics best practice document.
This research was supported in part by the Intramural Research Program of the NIH, and NIEHS, and partially supported by a research grant MCB-0520140 from the U.S. National Science Foundation.