Unifying gene expression measures from multiple platforms using factor analysis

Xin Victoria Wang, Roel G W Verhaak, Elizabeth Purdom, Paul Spellman, Terence P. Speed

Research output: Contribution to journalArticle

11 Scopus citations

Abstract

In the Cancer Genome Atlas (TCGA) project, gene expression of the same set of samples is measured multiple times on different microarray platforms. There are two main advantages to combining these measurements. First, we have the opportunity to obtain a more precise and accurate estimate of expression levels than using the individual platforms alone. Second, the combined measure simplifies downstream analysis by eliminating the need to work with three sets of expression measures and to consolidate results from the three platforms. We propose to use factor analysis (FA) to obtain a unified gene expression measure (UE) from multiple platforms. The UE is a weighted average of the three platforms, and is shown to perform well in terms of accuracy and precision. In addition, the FA model produces parameter estimates that allow the assessment of the model fit. The R code is provided in File S2. Gene-level FA measurements for the TCGA data sets are available from http://tcga-data.nci.nih.gov/docs/publications/unified_expression/.

Original languageEnglish (US)
Article numbere17691
JournalPLoS One
Volume6
Issue number3
DOIs
Publication statusPublished - 2011
Externally publishedYes

    Fingerprint

ASJC Scopus subject areas

  • Agricultural and Biological Sciences(all)
  • Biochemistry, Genetics and Molecular Biology(all)
  • Medicine(all)

Cite this