A statistical framework for protein quantitation in bottom-up MS-based proteomics

Yuliya Karpievitch; Jeff Stanley; Thomas Taverner; Jianhua Huang; Joshua N. Adkins; Charles Ansong; Fred Heffron; Thomas O. Metz; Wei Jun Qian; Hyunjin Yoon; Richard D. Smith; Alan R. Dabney

doi:10.1093/bioinformatics/btp362

Research output: Contribution to journal › Article › peer-review

125 Scopus citations

Abstract

Motivation: Quantitative mass spectrometry-based proteomics requires protein-level estimates and associated confidence measures. Challenges include the presence of low-quality or incorrectly identified peptides and informative missingness. Furthermore, models are required for rolling peptide-level information up to the protein level.

Results: We present a statistical model that carefully accounts for informative missingness in peak intensities and allows unbiased, model-based, protein-level estimation and inference. The model is applicable to both label-based and label-free quantitation experiments. We also provide automated, model-based algorithms for filtering of proteins and peptides as well as imputation of missing values. Two LC/MS datasets are used to illustrate the methods. In simulation studies, our methods are shown to achieve substantially more discoveries than standard alternatives.

Original language	English (US)
Pages (from-to)	2028-2034
Number of pages	7
Journal	Bioinformatics
Volume	25
Issue number	16
DOIs	https://doi.org/10.1093/bioinformatics/btp362
State	Published - Aug 2009

ASJC Scopus subject areas

Statistics and Probability
Biochemistry
Molecular Biology
Computer Science Applications
Computational Theory and Mathematics
Computational Mathematics

Cite this

@article{b437b6e4f0de4635a33426e43733d313,

title = "A statistical framework for protein quantitation in bottom-up MS-based proteomics",

abstract = "Motivation: Quantitative mass spectrometry-based proteomics requires protein-level estimates and associated confidence measures. Challenges include the presence of low quality or incorrectly identified peptides and informative missingness. Furthermore, models are required for rolling peptide-level information up to the protein level. Results: We present a statistical model that carefully accounts for informative missingness in peak intensities and allows unbiased, model-based, protein-level estimation and inference. The model is applicable to both label-based and label-free quantitation experiments. We also provide automated, model-based, algorithms for filtering of proteins and peptides as well as imputation of missing values. Two LC/MS datasets are used to illustrate the methods. In simulation studies, our methods are shown to achieve substantially more discoveries than standard alternatives.",

author = "Yuliya Karpievitch and Jeff Stanley and Thomas Taverner and Jianhua Huang and Adkins, {Joshua N.} and Charles Ansong and Fred Heffron and Metz, {Thomas O.} and Qian, {Wei Jun} and Hyunjin Yoon and Smith, {Richard D.} and Dabney, {Alan R.}",

note = "Funding Information: Funding: This work was sponsored by a subcontract from PNNL and by the NIH R25-CA-90301 training grant at TAMU. Additional support was provided by NIH grant DK070146 and by the National Institute of Allergy and Infectious Diseases (NIH/DHHS through interagency agreement Y1-AI-4894-01).",

year = "2009",

month = aug,

doi = "10.1093/bioinformatics/btp362",

language = "English (US)",

volume = "25",

pages = "2028--2034",

journal = "Bioinformatics",

issn = "1367-4803",

publisher = "Oxford University Press",

number = "16",

}

TY - JOUR

T1 - A statistical framework for protein quantitation in bottom-up MS-based proteomics

AU - Karpievitch, Yuliya

AU - Stanley, Jeff

AU - Taverner, Thomas

AU - Huang, Jianhua

AU - Adkins, Joshua N.

AU - Ansong, Charles

AU - Heffron, Fred

AU - Metz, Thomas O.

AU - Qian, Wei Jun

AU - Yoon, Hyunjin

AU - Smith, Richard D.

AU - Dabney, Alan R.

N1 - Funding Information: Funding: This work was sponsored by a subcontract from PNNL and by the NIH R25-CA-90301 training grant at TAMU. Additional support was provided by NIH grant DK070146 and by the National Institute of Allergy and Infectious Diseases (NIH/DHHS through interagency agreement Y1-AI-4894-01).

PY - 2009/8

Y1 - 2009/8

N2 - Motivation: Quantitative mass spectrometry-based proteomics requires protein-level estimates and associated confidence measures. Challenges include the presence of low quality or incorrectly identified peptides and informative missingness. Furthermore, models are required for rolling peptide-level information up to the protein level. Results: We present a statistical model that carefully accounts for informative missingness in peak intensities and allows unbiased, model-based, protein-level estimation and inference. The model is applicable to both label-based and label-free quantitation experiments. We also provide automated, model-based, algorithms for filtering of proteins and peptides as well as imputation of missing values. Two LC/MS datasets are used to illustrate the methods. In simulation studies, our methods are shown to achieve substantially more discoveries than standard alternatives.

UR - http://www.scopus.com/inward/record.url?scp=68549137863&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=68549137863&partnerID=8YFLogxK

U2 - 10.1093/bioinformatics/btp362

DO - 10.1093/bioinformatics/btp362

M3 - Article

C2 - 19535538

AN - SCOPUS:68549137863

SN - 1367-4803

VL - 25

SP - 2028

EP - 2034

JO - Bioinformatics

JF - Bioinformatics

IS - 16

ER -
