TL;DR: An LC/MS-based data analysis approach, XCMS, which incorporates novel nonlinear retention time alignment, matched filtration, peak detection, and peak matching, and is demonstrated using data sets from a previously reported enzyme knockout study and a large-scale study of plasma samples.
Abstract: Metabolite profiling in biomarker discovery, enzyme substrate assignment, drug activity/specificity determination, and basic metabolic research requires new data preprocessing approaches to correlate specific metabolites to their biological origin. Here we introduce an LC/MS-based data analysis approach, XCMS, which incorporates novel nonlinear retention time alignment, matched filtration, peak detection, and peak matching. Without using internal standards, the method dynamically identifies hundreds of endogenous metabolites for use as standards, calculating a nonlinear retention time correction profile for each sample. Following retention time correction, the relative metabolite ion intensities are directly compared to identify changes in specific endogenous metabolites, such as potential biomarkers. The software is demonstrated using data sets from a previously reported enzyme knockout study and a large-scale study of plasma samples. XCMS is freely available under an open-source license at http://metlin...
TL;DR: By performing global metabolite profiling, also known as untargeted metabolomics, new discoveries linking cellular pathways to biological mechanism are being revealed and are shaping the understanding of cell biology, physiology and medicine.
Abstract: Metabolites, the chemical entities that are transformed during metabolism, provide a functional readout of cellular biochemistry. With emerging technologies in mass spectrometry, thousands of metabolites can now be quantitatively measured from minimal amounts of biological material, which has thereby enabled systems-level analyses. By performing global metabolite profiling, also known as untargeted metabolomics, new discoveries linking cellular pathways to biological mechanism are being revealed and are shaping our understanding of cell biology, physiology and medicine.
TL;DR: METLIN includes an annotated list of known metabolite structural information that is easily cross-correlated with its catalogue of high-resolution Fourier transform mass spectrometry (FTMS) spectra, tandem mass spectrumetry (MS/MS) Spectra, and LC/MS data.
Abstract: Endogenous metabolites have gained increasing interest over the past 5 years largely for their implications in diagnostic and pharmaceutical biomarker discovery. METLIN (http://metlin.scripps.edu), a freely accessible web-based data repository, has been developed to assist in a broad array of metabolite research and to facilitate metabolite identification through mass analysis. METLINincludes an annotated list of known metabolite structural information that is easily cross-correlated with its catalogue of high-resolution Fourier transform mass spectrometry (FTMS) spectra, tandem mass spectrometry (MS/MS) spectra, and LC/MS data.
TL;DR: METLIN now features in silico MS/MS data, which has been made possible through the creation of algorithms trained on METLIN's MS/ MS data from both standards and their isotope analogues, being designed to address the characterization of known and unknown molecules.
Abstract: METLIN originated as a database to characterize known metabolites and has since expanded into a technology platform for the identification of known and unknown metabolites and other chemical entities Through this effort it has become a comprehensive resource containing over 1 million molecules including lipids, amino acids, carbohydrates, toxins, small peptides, and natural products, among other classes METLIN’s high-resolution tandem mass spectrometry (MS/MS) database, which plays a key role in the identification process, has data generated from both reference standards and their labeled stable isotope analogues, facilitated by METLIN-guided analysis of isotope-labeled microorganisms The MS/MS data, coupled with the fragment similarity search function, expand the tool’s capabilities into the identification of unknowns Fragment similarity search is performed independent of the precursor mass, relying solely on the fragment ions to identify similar structures within the database Stable isotope data al
TL;DR: The protocol starts from untargeted metabolomic LC-Q-TOF-MS data that have been analyzed with the bioinformatics program XCMS, and it describes a strategy for selecting interesting features as well as performing subsequent targeted tandem MS.
Abstract: Untargeted metabolomics provides a comprehensive platform for identifying metabolites whose levels are altered between two or more populations. By using liquid chromatography quadrupole time-of-flight mass spectrometry (LC-Q-TOF-MS), hundreds to thousands of peaks with a unique m/z ratio and retention time are routinely detected from most biological samples in an untargeted profiling experiment. Each peak, termed a metabolomic feature, can be characterized on the basis of its accurate mass, retention time and tandem mass spectral fragmentation pattern. Here a seven-step protocol is suggested for such a characterization by using the METLIN metabolite database. The protocol starts from untargeted metabolomic LC-Q-TOF-MS data that have been analyzed with the bioinformatics program XCMS, and it describes a strategy for selecting interesting features as well as performing subsequent targeted tandem MS. The seven steps described will require 2-4 h to complete per feature, depending on the compound.