Computational prediction of proteotypic peptides for quantitative proteomics

Parag Mallick; Markus Schirle; Sharon S. Chen; Mark R. Flory; Hookeun Lee; Daniel Martin; Jeffrey Ranish; Brian Raught; Robert Schmitt; Thilo Werner; Bernhard Kuster; Ruedi Aebersold

doi:10.1038/nbt1275

Computational prediction of proteotypic peptides for quantitative proteomics

Parag Mallick, Markus Schirle, Sharon S. Chen, Mark R. Flory, Hookeun Lee, Daniel Martin, Jeffrey Ranish, Brian Raught, Robert Schmitt, Thilo Werner, Bernhard Kuster, Ruedi Aebersold

Research output: Contribution to journal › Article › peer-review

585 Scopus citations

Abstract

Mass spectrometry-based quantitative proteomics has become an important component of biological and clinical research. Although such analyses typically assume that a protein's peptide fragments are observed with equal likelihood, only a few so-called 'proteotypic' peptides are repeatedly and consistently identified for any given protein present in a mixture. Using >600,000 peptide identifications generated by four proteomic platforms, we empirically identified >16,000 proteotypic peptides for 4,030 distinct yeast proteins. Characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Possible applications of proteotypic peptides include validation of protein identifications, absolute quantification of proteins, annotation of coding sequences in genomes, and characterization of the physical principles governing key elements of mass spectrometric workflows (e.g., digestion, chromatography, ionization and fragmentation).

Original language	English (US)
Pages (from-to)	125-131
Number of pages	7
Journal	Nature biotechnology
Volume	25
Issue number	1
DOIs	https://doi.org/10.1038/nbt1275
State	Published - Jan 5 2007
Externally published	Yes

ASJC Scopus subject areas

Biotechnology
Bioengineering
Applied Microbiology and Biotechnology
Molecular Medicine
Biomedical Engineering

Access to Document

10.1038/nbt1275

Cite this

@article{7df578fc42bb4ea8a5e4a13b4afade69,

title = "Computational prediction of proteotypic peptides for quantitative proteomics",

abstract = "Mass spectrometry-based quantitative proteomics has become an important component of biological and clinical research. Although such analyses typically assume that a protein's peptide fragments are observed with equal likelihood, only a few so-called 'proteotypic' peptides are repeatedly and consistently identified for any given protein present in a mixture. Using >600,000 peptide identifications generated by four proteomic platforms, we empirically identified >16,000 proteotypic peptides for 4,030 distinct yeast proteins. Characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Possible applications of proteotypic peptides include validation of protein identifications, absolute quantification of proteins, annotation of coding sequences in genomes, and characterization of the physical principles governing key elements of mass spectrometric workflows (e.g., digestion, chromatography, ionization and fragmentation).",

author = "Parag Mallick and Markus Schirle and Chen, {Sharon S.} and Flory, {Mark R.} and Hookeun Lee and Daniel Martin and Jeffrey Ranish and Brian Raught and Robert Schmitt and Thilo Werner and Bernhard Kuster and Ruedi Aebersold",

note = "Funding Information: The authors are grateful to Julien Gagneur for fruitful discussions and the Cellzome biochemistry, mass spectrometry and informatics teams for generating and managing data. The work was supported in part with federal funds from the National Heart, Lung, and Blood Institute, National Institutes of Health, under contract N01-HV-28179.",

year = "2007",

month = jan,

day = "5",

doi = "10.1038/nbt1275",

language = "English (US)",

volume = "25",

pages = "125--131",

journal = "Nature biotechnology",

issn = "1087-0156",

publisher = "Nature Publishing Group",

number = "1",

}

TY - JOUR

T1 - Computational prediction of proteotypic peptides for quantitative proteomics

AU - Mallick, Parag

AU - Schirle, Markus

AU - Chen, Sharon S.

AU - Flory, Mark R.

AU - Lee, Hookeun

AU - Martin, Daniel

AU - Ranish, Jeffrey

AU - Raught, Brian

AU - Schmitt, Robert

AU - Werner, Thilo

AU - Kuster, Bernhard

AU - Aebersold, Ruedi

N1 - Funding Information: The authors are grateful to Julien Gagneur for fruitful discussions and the Cellzome biochemistry, mass spectrometry and informatics teams for generating and managing data. The work was supported in part with federal funds from the National Heart, Lung, and Blood Institute, National Institutes of Health, under contract N01-HV-28179.

PY - 2007/1/5

Y1 - 2007/1/5

N2 - Mass spectrometry-based quantitative proteomics has become an important component of biological and clinical research. Although such analyses typically assume that a protein's peptide fragments are observed with equal likelihood, only a few so-called 'proteotypic' peptides are repeatedly and consistently identified for any given protein present in a mixture. Using >600,000 peptide identifications generated by four proteomic platforms, we empirically identified >16,000 proteotypic peptides for 4,030 distinct yeast proteins. Characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Possible applications of proteotypic peptides include validation of protein identifications, absolute quantification of proteins, annotation of coding sequences in genomes, and characterization of the physical principles governing key elements of mass spectrometric workflows (e.g., digestion, chromatography, ionization and fragmentation).

AB - Mass spectrometry-based quantitative proteomics has become an important component of biological and clinical research. Although such analyses typically assume that a protein's peptide fragments are observed with equal likelihood, only a few so-called 'proteotypic' peptides are repeatedly and consistently identified for any given protein present in a mixture. Using >600,000 peptide identifications generated by four proteomic platforms, we empirically identified >16,000 proteotypic peptides for 4,030 distinct yeast proteins. Characteristic physicochemical properties of these peptides were used to develop a computational tool that can predict proteotypic peptides for any protein from any organism, for a given platform, with >85% cumulative accuracy. Possible applications of proteotypic peptides include validation of protein identifications, absolute quantification of proteins, annotation of coding sequences in genomes, and characterization of the physical principles governing key elements of mass spectrometric workflows (e.g., digestion, chromatography, ionization and fragmentation).

UR - http://www.scopus.com/inward/record.url?scp=33846133955&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33846133955&partnerID=8YFLogxK

U2 - 10.1038/nbt1275

DO - 10.1038/nbt1275

M3 - Article

C2 - 17195840

AN - SCOPUS:33846133955

SN - 1087-0156

VL - 25

SP - 125

EP - 131

JO - Nature biotechnology

JF - Nature biotechnology

IS - 1

ER -

Computational prediction of proteotypic peptides for quantitative proteomics

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this