Gamma-normal-gamma mixture model for detecting differentially methylated loci in three breast cancer cell lines

Abbas Khalili, Dustin Potter, Pearlly Yan, Lang Li, Joe Gray, Tim Huang, Shili Lin

Research output: Contribution to journal › Article

13 Citations (Scopus)

Abstract

With state-of-the-art microarray technologies now available for whole-genome CpG island (CGI) methylation profiling, there is a need for statistical models specifically geared toward the analysis of such data. In this article, we propose a Gamma-Normal-Gamma (GNG) mixture model for describing three groups of CGI loci (hypomethylated, undifferentiated, and hypermethylated) from a single methylation microarray. The model was applied to study the methylation signatures of three breast cancer cell lines: MCF7, T47D, and MDAMB361. The results are biologically interesting and interpretable, and they highlight the heterogeneous nature of the three cell lines. This heterogeneity supports analyzing each microarray slide individually rather than pooling the slides for a single analysis. Comparisons with fitted densities from the Normal-Uniform (NU) mixture model, proposed in the literature for gene expression analysis, show that the GNG model provides a better goodness of fit. Although the GNG model was proposed in the context of single-slide methylation analysis, it can be readily adapted to analyze multi-slide methylation data as well as other types of microarray data.
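
The three-group structure described in the abstract can be illustrated with a minimal sketch of a gamma-normal-gamma mixture density and the resulting posterior classification of a single locus. The abstract does not give the exact parameterization, so the form used here is an assumption: a normal component for undifferentiated loci, a gamma component on values above an upper cut point for hypermethylated loci, and a mirrored gamma component below a lower cut point for hypomethylated loci. The names `PARAMS`, `cut_lo`, `cut_hi`, and all numeric values are hypothetical and are not fitted values from the article.

```python
import math


def gamma_pdf(y, shape, scale):
    """Gamma density at y; zero for y <= 0."""
    if y <= 0:
        return 0.0
    log_pdf = ((shape - 1) * math.log(y) - y / scale
               - math.lgamma(shape) - shape * math.log(scale))
    return math.exp(log_pdf)


def normal_pdf(x, mean, sd):
    """Normal density at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2 * math.pi))


def gng_components(x, p):
    """Weighted component densities at x: (hypo, undifferentiated, hyper)."""
    # Assumed form: the lower gamma acts on (cut_lo - x), the upper on (x - cut_hi).
    hypo = p["w_hypo"] * gamma_pdf(p["cut_lo"] - x, p["shape_lo"], p["scale_lo"])
    mid = p["w_mid"] * normal_pdf(x, p["mean"], p["sd"])
    hyper = p["w_hyper"] * gamma_pdf(x - p["cut_hi"], p["shape_hi"], p["scale_hi"])
    return hypo, mid, hyper


def gng_density(x, p):
    """Mixture density at x (sum of the weighted components)."""
    return sum(gng_components(x, p))


def classify(x, p):
    """Return the most probable group for x and the three posterior probabilities."""
    labels = ("hypomethylated", "undifferentiated", "hypermethylated")
    parts = gng_components(x, p)
    total = sum(parts)
    posteriors = [c / total for c in parts]
    best = max(range(3), key=lambda k: posteriors[k])
    return labels[best], posteriors


# Hypothetical, illustrative parameters; mixing weights sum to 1.
PARAMS = {
    "w_hypo": 0.1, "w_mid": 0.8, "w_hyper": 0.1,
    "cut_lo": -1.0, "shape_lo": 2.0, "scale_lo": 1.0,
    "mean": 0.0, "sd": 0.5,
    "cut_hi": 1.0, "shape_hi": 2.0, "scale_hi": 1.0,
}
```

Under these illustrative parameters, `classify(0.0, PARAMS)` assigns the locus to the undifferentiated group, while `classify(3.0, PARAMS)` and `classify(-3.0, PARAMS)` assign it to the hypermethylated and hypomethylated groups. In practice the parameters would be estimated separately for each slide, for example with an EM algorithm, which this sketch does not attempt.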

Original language: English (US)
Pages (from-to): 43-54
Number of pages: 12
Journal: Cancer Informatics
Volume: 3
State: Published - 2007
Externally published: Yes

Fingerprint

Methylation
Breast Neoplasms
Cell Line
CpG Islands
Statistical Models
Genome
Technology
Gene Expression

Keywords

  • Breast cancer cell lines
  • CpG islands
  • Methylation/epigenetic signature
  • Microarrays
  • Mixture modeling

ASJC Scopus subject areas

  • Cancer Research
  • Oncology

Cite this

Gamma-normal-gamma mixture model for detecting differentially methylated loci in three breast cancer cell lines. / Khalili, Abbas; Potter, Dustin; Yan, Pearlly; Li, Lang; Gray, Joe; Huang, Tim; Lin, Shili.

In: Cancer Informatics, Vol. 3, 2007, p. 43-54.

@article{a73d412bfbc74561bf27e351c8e02e07,
title = "Gamma-normal-gamma mixture model for detecting differentially methylated loci in three breast cancer cell lines",
keywords = "Breast cancer cell lines, CpG islands, Methylation/epigenetic signature, Microarrays, Mixture modeling",
author = "Abbas Khalili and Dustin Potter and Pearlly Yan and Lang Li and Joe Gray and Tim Huang and Shili Lin",
year = "2007",
language = "English (US)",
volume = "3",
pages = "43--54",
journal = "Cancer Informatics",
issn = "1176-9351",
publisher = "Libertas Academica Ltd.",
}

TY - JOUR

T1 - Gamma-normal-gamma mixture model for detecting differentially methylated loci in three breast cancer cell lines

AU - Khalili, Abbas

AU - Potter, Dustin

AU - Yan, Pearlly

AU - Li, Lang

AU - Gray, Joe

AU - Huang, Tim

AU - Lin, Shili

PY - 2007

Y1 - 2007

KW - Breast cancer cell lines

KW - CpG islands

KW - Methylation/epigenetic signature

KW - Microarrays

KW - Mixture modeling

UR - http://www.scopus.com/inward/record.url?scp=34447557026&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34447557026&partnerID=8YFLogxK

M3 - Article

VL - 3

SP - 43

EP - 54

JO - Cancer Informatics

JF - Cancer Informatics

SN - 1176-9351

ER -