TITLE

Statistical learning of peptide retention behavior in chromatographic separations: a new kernel-based approach for computational proteomics

AUTHOR(S)
Pfeifer, Nico; Leinenbach, Andreas; Huber, Christian G.; Kohlbacher, Oliver
PUB. DATE
January 2007
SOURCE
BMC Bioinformatics;2007 Supplement 2, Vol. 8, p468
SOURCE TYPE
Academic Journal
DOC. TYPE
Article
ABSTRACT
Background: High-throughput peptide and protein identification technologies have benefited tremendously from strategies based on tandem mass spectrometry (MS/MS) in combination with database searching algorithms. A major problem with existing methods lies within the significant number of false positive and false negative annotations. So far, standard algorithms for protein identification do not use the information gained from separation processes usually involved in peptide analysis, such as retention time information, which are readily available from chromatographic separation of the sample. Identification can thus be improved by comparing measured retention times to predicted retention times. Current prediction models are derived from a set of measured test analytes but they usually require large amounts of training data. Results: We introduce a new kernel function which can be applied in combination with support vector machines to a wide range of computational proteomics problems. We show the performance of this new approach by applying it to the prediction of peptide adsorption/elution behavior in strong anion-exchange solid-phase extraction (SAX-SPE) and ion-pair reversed-phase high-performance liquid chromatography (IP-RP-HPLC). Furthermore, the predicted retention times are used to improve spectrum identifications by a p-value-based filtering approach. The approach was tested on a number of different datasets and shows excellent performance while requiring only very small training sets (about 40 peptides instead of thousands). Using the retention time predictor in our retention time filter improves the fraction of correctly identified peptide mass spectra significantly. Conclusion: The proposed kernel function is well-suited for the prediction of chromatographic separation in computational proteomics and requires only a limited amount of training data. The performance of this new method is demonstrated by applying it to peptide retention time prediction in IP-RP-HPLC and prediction of peptide sample fractionation in SAX-SPE. Finally, we incorporate the predicted chromatographic behavior in a p-value based filter to improve peptide identifications based on liquid chromatography-tandem mass spectrometry.
ACCESSION #
35075570

 

Related Articles

  • Multi-Segment Direct Inject nano-ESI-LTQ-FT-ICRMS/ MS For Protein Identification. Jing Chen; Canales, Lorena; Neal, Rachel E. // Proteome Science;2011, Vol. 9 Issue 1, p38 

    Reversed phase high performance liquid chromatography (HPLC) interfaced to electrospray tandem mass spectrometry (MS/MS) is commonly used for the identification of peptides from proteolytically cleaved proteins embedded in a polyacrylamide gel matrix as well as for metabolomics screening. HPLC...

  • Detection of Toxin Proteins from Bacillus thuringiensis Strain 4.0718 by Strategy of 2D-LC-MS/MS. Yang, Qi; Tang, Sijia; Rang, Jie; Zuo, Mingxing; Ding, Xuezhi; Sun, Yunjun; Feng, Pinghui; Xia, Liqiu // Current Microbiology;Apr2015, Vol. 70 Issue 4, p457 

    Bacillus thuringiensis is a kind of insecticidal microorganism which can produce a variety of toxin proteins, it is particularly important to find an effective strategy to identify novel toxin proteins rapidly and comprehensively with the discovery of the wild-type strains. Multi-dimensional...

  • Speeding up tandem mass spectrometry-based database searching by longest common prefix. Chen Zhou; Hao Chi; Le-Heng Wang; You Li; Yan-Jie Wu; Yan Fu; Rui-Xiang Sun; Si-Min He // BMC Bioinformatics;2010, Vol. 11, p577 

    Background: Tandem mass spectrometry-based database searching has become an important technology for peptide and protein identification. One of the key challenges in database searching is the remarkable increase in computational demand, brought about by the expansion of protein databases, semi-...

  • Optimization of Data-Dependent Acquisition Parameters for Coupling High-Speed Separations with LC-MS/MS for Protein Identifications. Johnson, Darryl; Boyes, Barry; Fields, Taylor; Kopkin, Rachel; Orlando, Ron // Journal of Biomolecular Techniques;Jul2013, Vol. 24 Issue 2, p62 

    Recent developments in chromatography, such as ultra-HPLC and superficially porous particles, offer significantly improved peptide separation. The narrow peak widths, often only several seconds, can permit a 15-min liquid chromatography run to have a similar peak capacity as a 60-min run using...

  • Application of high performance liquid chromatography-tandem mass spectrometry for detection of melittin, a characteristic peptide of bee venom. Akhmerova, D.; Stavrianidi, A.; Rodin, I.; Shpigun, O. // Inorganic Materials;Dec2015, Vol. 51 Issue 14, p1431 

    Bee venom is widely used for the manufacturing of cosmetics and pharmaceuticals. A procedure for the detection of melittin, a characteristic component of bee venom, based on high performance liquid chromatography with tandem mass spectrometric detection is reported. Conditions of...

  • Proteome-wide cellular protein concentrations of the human pathogen Leptospira interrogans. Malmström, Johan; Beck, Martin; Schmidt, Alexander; Lange, Vinzenz; Deutsch, Eric W.; Aebersold, Ruedi // Nature;8/6/2009, Vol. 460 Issue 7256, p762 

    Mass-spectrometry-based methods for relative proteome quantification have broadly affected life science research. However, important research directions, particularly those involving mathematical modelling and simulation of biological processes, also critically depend on absolutely quantitative...

  • Comprehensive analysis of the mouse renal cortex using two-dimensional HPLC -- tandem mass spectrometry. Yingxin Zhao; Denner, Larry; Haidacher, Sigmund J.; LeJeune, Wanda S.; Tilton, Ronald G. // Proteome Science;2008, Vol. 6, Special section p1 

    Background: Proteomic methodologies increasingly have been applied to the kidney to map the renal cortical proteome and to identify global changes in renal proteins induced by diseases such as diabetes. While progress has been made in establishing a renal cortical proteome using 1-D or 2-DE and...

  • IDENTIFICATION OF NEW RVX ADDUCT WITH BLOOD PLASMA PROTEINS BY HIGH-PERFORMANCE LIQUID CHROMATOGRAPHY COUPLED WITH TANDEM MASS-SELECTIVE DETECTION IN HIGH RESOLUTION MODE. Prokofieva, D. S.; Krivorotova, N. V.; Goncharov, N. V. // Toxicological Review;Apr2014, Vol. 125 Issue 1, p32 

    Present-day ideas about in vitro and in vivo transformation of V-type oraganophosphorus chemical agent are described. In vitro experiments, a new covalent RVX adduct with blood plasma proteins was identified using HPLC with tandem mass-selective detection in high resolution mode. Data were...

  • Chemical rule-based filtering of MS/MS spectra. Reiz, Beáta; Kertész-Farkas, Attila; Pongor, Sándor; Myers, Michael P. // Bioinformatics;Apr2013, Vol. 29 Issue 7, p925 

    Motivation: Identification of proteins by mass spectrometry–based proteomics requires automated interpretation of peptide tandem mass spectrometry spectra. The effectiveness of peptide identification can be greatly improved by filtering out extraneous noise peaks before the subsequent...

Share

Read the Article

Courtesy of VIRGINIA BEACH PUBLIC LIBRARY AND SYSTEM

Sorry, but this item is not currently available from your library.

Try another library?
Sign out of this library

Other Topics