Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics

Amol Prakash, Brian Piening, Jeff Whiteaker, Heidi Zhang, Scott A. Shaffer, Daniel Martin, Laura Hohmann, Kelly Cooke, James M. Olson, Stacey Hansen, Mark Flory, Hookeun Lee, Julian Watts, David R. Goodlett, Ruedi Aebersold, Amanda Paulovich, Benno Schwikowski

Research output: Contribution to journalArticle

41 Citations (Scopus)

Abstract

Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.

Original languageEnglish (US)
Pages (from-to)1741-1748
Number of pages8
JournalMolecular and Cellular Proteomics
Volume6
Issue number10
DOIs
StatePublished - Oct 1 2007
Externally publishedYes

Fingerprint

Proteomics
Mass spectrometry
Mass Spectrometry
Biomarkers
Network protocols
Quality Control
Early Diagnosis
Software
Experiments
Quality control
Processing

ASJC Scopus subject areas

  • Biochemistry

Cite this

Prakash, A., Piening, B., Whiteaker, J., Zhang, H., Shaffer, S. A., Martin, D., ... Schwikowski, B. (2007). Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics. Molecular and Cellular Proteomics, 6(10), 1741-1748. https://doi.org/10.1074/mcp.M600470-MCP200

Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics. / Prakash, Amol; Piening, Brian; Whiteaker, Jeff; Zhang, Heidi; Shaffer, Scott A.; Martin, Daniel; Hohmann, Laura; Cooke, Kelly; Olson, James M.; Hansen, Stacey; Flory, Mark; Lee, Hookeun; Watts, Julian; Goodlett, David R.; Aebersold, Ruedi; Paulovich, Amanda; Schwikowski, Benno.

In: Molecular and Cellular Proteomics, Vol. 6, No. 10, 01.10.2007, p. 1741-1748.

Research output: Contribution to journalArticle

Prakash, A, Piening, B, Whiteaker, J, Zhang, H, Shaffer, SA, Martin, D, Hohmann, L, Cooke, K, Olson, JM, Hansen, S, Flory, M, Lee, H, Watts, J, Goodlett, DR, Aebersold, R, Paulovich, A & Schwikowski, B 2007, 'Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics', Molecular and Cellular Proteomics, vol. 6, no. 10, pp. 1741-1748. https://doi.org/10.1074/mcp.M600470-MCP200
Prakash, Amol ; Piening, Brian ; Whiteaker, Jeff ; Zhang, Heidi ; Shaffer, Scott A. ; Martin, Daniel ; Hohmann, Laura ; Cooke, Kelly ; Olson, James M. ; Hansen, Stacey ; Flory, Mark ; Lee, Hookeun ; Watts, Julian ; Goodlett, David R. ; Aebersold, Ruedi ; Paulovich, Amanda ; Schwikowski, Benno. / Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics. In: Molecular and Cellular Proteomics. 2007 ; Vol. 6, No. 10. pp. 1741-1748.
@article{2999ccf195854258996522c19a8fd0ce,
title = "Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics",
abstract = "Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.",
author = "Amol Prakash and Brian Piening and Jeff Whiteaker and Heidi Zhang and Shaffer, {Scott A.} and Daniel Martin and Laura Hohmann and Kelly Cooke and Olson, {James M.} and Stacey Hansen and Mark Flory and Hookeun Lee and Julian Watts and Goodlett, {David R.} and Ruedi Aebersold and Amanda Paulovich and Benno Schwikowski",
year = "2007",
month = "10",
day = "1",
doi = "10.1074/mcp.M600470-MCP200",
language = "English (US)",
volume = "6",
pages = "1741--1748",
journal = "Molecular and Cellular Proteomics",
issn = "1535-9476",
publisher = "American Society for Biochemistry and Molecular Biology Inc.",
number = "10",

}

TY - JOUR

T1 - Assessing bias in experiment design for large scale mass spectrometry-based quantitative proteomics

AU - Prakash, Amol

AU - Piening, Brian

AU - Whiteaker, Jeff

AU - Zhang, Heidi

AU - Shaffer, Scott A.

AU - Martin, Daniel

AU - Hohmann, Laura

AU - Cooke, Kelly

AU - Olson, James M.

AU - Hansen, Stacey

AU - Flory, Mark

AU - Lee, Hookeun

AU - Watts, Julian

AU - Goodlett, David R.

AU - Aebersold, Ruedi

AU - Paulovich, Amanda

AU - Schwikowski, Benno

PY - 2007/10/1

Y1 - 2007/10/1

N2 - Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.

AB - Mass spectrometry-based proteomics holds great promise as a discovery tool for biomarker candidates in the early detection of diseases. Recently much emphasis has been placed upon producing highly reliable data for quantitative profiling for which highly reproducible methodologies are indispensable. The main problems that affect experimental reproducibility stem from variations introduced by sample collection, preparation, and storage protocols and LC-MS settings and conditions. On the basis of a formally precise and quantitative definition of similarity between LC-MS experiments, we have developed Cha-order, a fully automatic software tool that can assess experimental reproducibility of sets of large scale LC-MS experiments. By visualizing the similarity relationships within a set of experiments, this tool can form the basis of systematic quality control and thus help assess the comparability of mass spectrometry data over time, across different laboratories, and between instruments. Applying Chaorder to data from multiple laboratories and a range of instruments, experimental protocols, and sample complexities revealed biases introduced by the sample processing steps, experimental protocols, and instrument choices. Moreover we show that reducing bias by correcting for just a few steps, for example randomizing the run order, does not provide much gain in statistical power for biomarker discovery.

UR - http://www.scopus.com/inward/record.url?scp=35648980489&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=35648980489&partnerID=8YFLogxK

U2 - 10.1074/mcp.M600470-MCP200

DO - 10.1074/mcp.M600470-MCP200

M3 - Article

C2 - 17617667

AN - SCOPUS:35648980489

VL - 6

SP - 1741

EP - 1748

JO - Molecular and Cellular Proteomics

JF - Molecular and Cellular Proteomics

SN - 1535-9476

IS - 10

ER -