Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data

Gregory J. Hunt; Mark A. Dane; James E. Korkola; Laura M. Heiser; Johann A. Gagnon-Bartsch

doi:10.1080/10618600.2020.1741379

Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data

Gregory J. Hunt, Mark A. Dane, James E. Korkola, Laura M. Heiser, Johann A. Gagnon-Bartsch

Research output: Contribution to journal › Article › peer-review

6 Scopus citations

Abstract

Proper data transformation is an essential part of analysis. Choosing appropriate transformations for variables can enhance visualization, improve efficacy of analytical methods, and increase data interpretability. However, determining appropriate transformations of variables from high-content imaging data poses new challenges. Imaging data produce hundreds of covariates from each of thousands of images in a corpus. Each of these covariates will have a different distribution and needs a potentially different transformation. As such imaging data produce hundreds of covariates, determining an appropriate transformation for each of them is infeasible by hand. In this article, we explore simple, robust, and automatic transformations of high-content image data. A central application of our work is to microenvironment microarray bio-imaging data from the NIH LINCS program. We show that our robust transformations enhance visualization and improve the discovery of substantively relevant latent effects. These transformations enhance analysis of image features individually and also improve data integration approaches when combining together multiple features. We anticipate that the advantages of this work will likely also be realized in the analysis of data from other high-content and highly multiplexed technologies like Cell Painting or Cyclic Immunofluorescence. Software and further analysis can be found at gjhunt.github.io/rr. Supplementary materials for this article are available online.

Original language	English (US)
Pages (from-to)	929-941
Number of pages	13
Journal	Journal of Computational and Graphical Statistics
Volume	29
Issue number	4
DOIs	https://doi.org/10.1080/10618600.2020.1741379
State	Published - 2020

Keywords

Automatic transformation
Data integration
Imaging
Latent variables
PCA
Visualization

ASJC Scopus subject areas

Discrete Mathematics and Combinatorics
Statistics and Probability
Statistics, Probability and Uncertainty

Access to Document

10.1080/10618600.2020.1741379

Cite this

@article{d6d9b1920c0b4c25ac91351f04352f88,

title = "Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data",

abstract = "Proper data transformation is an essential part of analysis. Choosing appropriate transformations for variables can enhance visualization, improve efficacy of analytical methods, and increase data interpretability. However, determining appropriate transformations of variables from high-content imaging data poses new challenges. Imaging data produce hundreds of covariates from each of thousands of images in a corpus. Each of these covariates will have a different distribution and needs a potentially different transformation. As such imaging data produce hundreds of covariates, determining an appropriate transformation for each of them is infeasible by hand. In this article, we explore simple, robust, and automatic transformations of high-content image data. A central application of our work is to microenvironment microarray bio-imaging data from the NIH LINCS program. We show that our robust transformations enhance visualization and improve the discovery of substantively relevant latent effects. These transformations enhance analysis of image features individually and also improve data integration approaches when combining together multiple features. We anticipate that the advantages of this work will likely also be realized in the analysis of data from other high-content and highly multiplexed technologies like Cell Painting or Cyclic Immunofluorescence. Software and further analysis can be found at gjhunt.github.io/rr. Supplementary materials for this article are available online.",

keywords = "Automatic transformation, Data integration, Imaging, Latent variables, PCA, Visualization",

author = "Hunt, {Gregory J.} and Dane, {Mark A.} and Korkola, {James E.} and Heiser, {Laura M.} and Gagnon-Bartsch, {Johann A.}",

note = "Publisher Copyright: {\textcopyright} 2020 American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America.",

year = "2020",

doi = "10.1080/10618600.2020.1741379",

language = "English (US)",

volume = "29",

pages = "929--941",

journal = "Journal of Computational and Graphical Statistics",

issn = "1061-8600",

publisher = "American Statistical Association",

number = "4",

}

TY - JOUR

T1 - Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data

AU - Hunt, Gregory J.

AU - Dane, Mark A.

AU - Korkola, James E.

AU - Heiser, Laura M.

AU - Gagnon-Bartsch, Johann A.

PY - 2020

Y1 - 2020

N2 - Proper data transformation is an essential part of analysis. Choosing appropriate transformations for variables can enhance visualization, improve efficacy of analytical methods, and increase data interpretability. However, determining appropriate transformations of variables from high-content imaging data poses new challenges. Imaging data produce hundreds of covariates from each of thousands of images in a corpus. Each of these covariates will have a different distribution and needs a potentially different transformation. As such imaging data produce hundreds of covariates, determining an appropriate transformation for each of them is infeasible by hand. In this article, we explore simple, robust, and automatic transformations of high-content image data. A central application of our work is to microenvironment microarray bio-imaging data from the NIH LINCS program. We show that our robust transformations enhance visualization and improve the discovery of substantively relevant latent effects. These transformations enhance analysis of image features individually and also improve data integration approaches when combining together multiple features. We anticipate that the advantages of this work will likely also be realized in the analysis of data from other high-content and highly multiplexed technologies like Cell Painting or Cyclic Immunofluorescence. Software and further analysis can be found at gjhunt.github.io/rr. Supplementary materials for this article are available online.

AB - Proper data transformation is an essential part of analysis. Choosing appropriate transformations for variables can enhance visualization, improve efficacy of analytical methods, and increase data interpretability. However, determining appropriate transformations of variables from high-content imaging data poses new challenges. Imaging data produce hundreds of covariates from each of thousands of images in a corpus. Each of these covariates will have a different distribution and needs a potentially different transformation. As such imaging data produce hundreds of covariates, determining an appropriate transformation for each of them is infeasible by hand. In this article, we explore simple, robust, and automatic transformations of high-content image data. A central application of our work is to microenvironment microarray bio-imaging data from the NIH LINCS program. We show that our robust transformations enhance visualization and improve the discovery of substantively relevant latent effects. These transformations enhance analysis of image features individually and also improve data integration approaches when combining together multiple features. We anticipate that the advantages of this work will likely also be realized in the analysis of data from other high-content and highly multiplexed technologies like Cell Painting or Cyclic Immunofluorescence. Software and further analysis can be found at gjhunt.github.io/rr. Supplementary materials for this article are available online.

KW - Automatic transformation

KW - Data integration

KW - Imaging

KW - Latent variables

KW - PCA

KW - Visualization

UR - http://www.scopus.com/inward/record.url?scp=85084978084&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85084978084&partnerID=8YFLogxK

U2 - 10.1080/10618600.2020.1741379

DO - 10.1080/10618600.2020.1741379

M3 - Article

AN - SCOPUS:85084978084

SN - 1061-8600

VL - 29

SP - 929

EP - 941

JO - Journal of Computational and Graphical Statistics

JF - Journal of Computational and Graphical Statistics

IS - 4

ER -

Automatic Transformation and Integration to Improve Visualization and Discovery of Latent Effects in Imaging Data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this