Unifying gene expression measures from multiple platforms using factor analysis

Xin Victoria Wang, Roel G W Verhaak, Elizabeth Purdom, Paul Spellman, Terence P. Speed

Research output: Contribution to journalArticle

11 Citations (Scopus)

Abstract

In the Cancer Genome Atlas (TCGA) project, gene expression of the same set of samples is measured multiple times on different microarray platforms. There are two main advantages to combining these measurements. First, we have the opportunity to obtain a more precise and accurate estimate of expression levels than using the individual platforms alone. Second, the combined measure simplifies downstream analysis by eliminating the need to work with three sets of expression measures and to consolidate results from the three platforms. We propose to use factor analysis (FA) to obtain a unified gene expression measure (UE) from multiple platforms. The UE is a weighted average of the three platforms, and is shown to perform well in terms of accuracy and precision. In addition, the FA model produces parameter estimates that allow the assessment of the model fit. The R code is provided in File S2. Gene-level FA measurements for the TCGA data sets are available from http://tcga-data.nci.nih.gov/docs/publications/unified_expression/.

Original languageEnglish (US)
Article numbere17691
JournalPLoS One
Volume6
Issue number3
DOIs
StatePublished - 2011
Externally publishedYes

Fingerprint

Factor analysis
Gene expression
Statistical Factor Analysis
Genes
Atlases
Gene Expression
gene expression
Genome
neoplasms
genome
Microarrays
Publications
Neoplasms
factor analysis
genes
sampling

ASJC Scopus subject areas

  • Agricultural and Biological Sciences(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Medicine(all)

Cite this

Unifying gene expression measures from multiple platforms using factor analysis. / Wang, Xin Victoria; Verhaak, Roel G W; Purdom, Elizabeth; Spellman, Paul; Speed, Terence P.

In: PLoS One, Vol. 6, No. 3, e17691, 2011.

Research output: Contribution to journalArticle

Wang, Xin Victoria ; Verhaak, Roel G W ; Purdom, Elizabeth ; Spellman, Paul ; Speed, Terence P. / Unifying gene expression measures from multiple platforms using factor analysis. In: PLoS One. 2011 ; Vol. 6, No. 3.
@article{24bdd5125dd34d4c894e943cea370fba,
title = "Unifying gene expression measures from multiple platforms using factor analysis",
abstract = "In the Cancer Genome Atlas (TCGA) project, gene expression of the same set of samples is measured multiple times on different microarray platforms. There are two main advantages to combining these measurements. First, we have the opportunity to obtain a more precise and accurate estimate of expression levels than using the individual platforms alone. Second, the combined measure simplifies downstream analysis by eliminating the need to work with three sets of expression measures and to consolidate results from the three platforms. We propose to use factor analysis (FA) to obtain a unified gene expression measure (UE) from multiple platforms. The UE is a weighted average of the three platforms, and is shown to perform well in terms of accuracy and precision. In addition, the FA model produces parameter estimates that allow the assessment of the model fit. The R code is provided in File S2. Gene-level FA measurements for the TCGA data sets are available from http://tcga-data.nci.nih.gov/docs/publications/unified_expression/.",
author = "Wang, {Xin Victoria} and Verhaak, {Roel G W} and Elizabeth Purdom and Paul Spellman and Speed, {Terence P.}",
year = "2011",
doi = "10.1371/journal.pone.0017691",
language = "English (US)",
volume = "6",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "3",

}

TY - JOUR

T1 - Unifying gene expression measures from multiple platforms using factor analysis

AU - Wang, Xin Victoria

AU - Verhaak, Roel G W

AU - Purdom, Elizabeth

AU - Spellman, Paul

AU - Speed, Terence P.

PY - 2011

Y1 - 2011

N2 - In the Cancer Genome Atlas (TCGA) project, gene expression of the same set of samples is measured multiple times on different microarray platforms. There are two main advantages to combining these measurements. First, we have the opportunity to obtain a more precise and accurate estimate of expression levels than using the individual platforms alone. Second, the combined measure simplifies downstream analysis by eliminating the need to work with three sets of expression measures and to consolidate results from the three platforms. We propose to use factor analysis (FA) to obtain a unified gene expression measure (UE) from multiple platforms. The UE is a weighted average of the three platforms, and is shown to perform well in terms of accuracy and precision. In addition, the FA model produces parameter estimates that allow the assessment of the model fit. The R code is provided in File S2. Gene-level FA measurements for the TCGA data sets are available from http://tcga-data.nci.nih.gov/docs/publications/unified_expression/.

AB - In the Cancer Genome Atlas (TCGA) project, gene expression of the same set of samples is measured multiple times on different microarray platforms. There are two main advantages to combining these measurements. First, we have the opportunity to obtain a more precise and accurate estimate of expression levels than using the individual platforms alone. Second, the combined measure simplifies downstream analysis by eliminating the need to work with three sets of expression measures and to consolidate results from the three platforms. We propose to use factor analysis (FA) to obtain a unified gene expression measure (UE) from multiple platforms. The UE is a weighted average of the three platforms, and is shown to perform well in terms of accuracy and precision. In addition, the FA model produces parameter estimates that allow the assessment of the model fit. The R code is provided in File S2. Gene-level FA measurements for the TCGA data sets are available from http://tcga-data.nci.nih.gov/docs/publications/unified_expression/.

UR - http://www.scopus.com/inward/record.url?scp=79952598216&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952598216&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0017691

DO - 10.1371/journal.pone.0017691

M3 - Article

C2 - 21436879

AN - SCOPUS:79952598216

VL - 6

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 3

M1 - e17691

ER -