cmIF: A Python Library for Scalable Multiplex Imaging Pipelines

Jennifer Eng; Elmar Bucher; Elliot Gray; Lydia Grace Campbell; Guillaume Thibault; Laura Heiser; Summer Gibbs; Joe W. Gray; Koei Chin; Young Hwan Chang

doi:10.1007/978-3-030-35210-3_3

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

1 Scopus citation

Abstract

Histological staining and analysis of tissue sections are integral to the diagnosis and treatment of many diseases, including cancer. Multiplex imaging technologies (e.g., cyclic immunostaining) have dramatically increased capabilities for assessing prognostic biomarkers in situ, enabling new insights into complex diseases. However, high-resolution, multiplex image data can be terabytes (TB) in size, and traditional pipelines for image analysis are not suited for these rich datasets. While much software development effort goes towards improving image processing tools such as stitching, registration, and segmentation, integration of these tools into a pipeline is often manual, which is highly laborious and error-prone and lacks reproducibility and scalability. Therefore, we developed a Python3 library, cmIF, a free and open-source tool to handle our high-throughput multiplex image processing pipeline. cmIF enables analysis of full-slide pathology tissue sections and tissue microarrays (TMAs), facilitating processing from raw image files through registration, segmentation, feature extraction, manual thresholding, and spatial pattern analysis. Our cmIF library includes functionality for image handling, quality control, metadata extraction, and subtraction of background images (i.e., autofluorescence subtraction). Additionally, it includes a Jupyter notebook for efficient generation and visualization of manual thresholds. Compared to a manual pipeline, use of cmIF reduces errors and cuts the processing time of datasets from weeks to hours, while documenting processing steps for reproducibility. All code is available at https://gitlab.com/engje/cmif. While our library is specific to our pipeline elements, it is a blueprint for the types of functions needed for high-throughput analysis. In the future, we will continue developing this open-source tool and, with input from the wider community, adapt it to a range of multiplex image pipelines.
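Two of the steps named in the abstract, background (autofluorescence) subtraction and manual thresholding, can be illustrated with a minimal NumPy sketch. This is not cmIF's API: the function names `subtract_background` and `positive_cells`, the `scale` parameter, and the toy pixel values are all hypothetical, and the sketch assumes integer-typed images such as 16-bit TIFF data.

```python
import numpy as np


def subtract_background(marker, background, scale=1.0):
    """Subtract a scaled background image from a marker image.

    Hypothetical helper, not part of cmIF. Assumes integer-typed images.
    """
    # Compute in float so the subtraction cannot wrap around in unsigned ints.
    diff = marker.astype(np.float64) - scale * background.astype(np.float64)
    # Clip negatives to zero and cap at the dtype maximum before casting back.
    max_value = np.iinfo(marker.dtype).max
    return np.clip(diff, 0, max_value).astype(marker.dtype)


def positive_cells(mean_intensity, threshold):
    """Label each cell positive when its mean intensity exceeds a manual threshold."""
    return {cell: value > threshold for cell, value in mean_intensity.items()}


# Toy 2x2 images standing in for a marker channel and its background round.
marker = np.array([[100, 500], [50, 1200]], dtype=np.uint16)
background = np.array([[120, 100], [40, 200]], dtype=np.uint16)
corrected = subtract_background(marker, background)

# Toy per-cell mean intensities and a manually chosen threshold.
calls = positive_cells({"cell_1": 5.0, "cell_2": 20.0}, threshold=10.0)
```

Casting to float before subtracting matters here: subtracting unsigned 16-bit arrays directly would wrap negative differences around to large positive values instead of clipping them to zero.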

Original language	English (US)
Title of host publication	Mathematical and Computational Oncology - 1st International Symposium, ISMCO 2019, Proceedings
Editors	George Bebis, Takis Benos, Ken Chen, Katharina Jahn, Ernesto Lima
Publisher	Springer
Pages	37-43
Number of pages	7
ISBN (Print)	9783030352097
DOIs	https://doi.org/10.1007/978-3-030-35210-3_3
State	Published - 2019
Event	1st International Symposium on Mathematical and Computational Oncology, ISMCO 2019 - Lake Tahoe, United States
Duration	Oct 14 2019 → Oct 16 2019

Publication series

Name	Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume	11826 LNCS
ISSN (Print)	0302-9743
ISSN (Electronic)	1611-3349

Conference

Conference	1st International Symposium on Mathematical and Computational Oncology, ISMCO 2019
Country/Territory	United States
City	Lake Tahoe
Period	10/14/19 → 10/16/19

Keywords

High-throughput analytics
Image processing
Multiplex imaging

ASJC Scopus subject areas

Theoretical Computer Science
General Computer Science

Access to Document

10.1007/978-3-030-35210-3_3

Cite this

Eng, J., Bucher, E., Gray, E., Campbell, L. G., Thibault, G., Heiser, L., Gibbs, S., Gray, J. W., Chin, K., & Chang, Y. H. (2019). cmIF: A Python Library for Scalable Multiplex Imaging Pipelines. In G. Bebis, T. Benos, K. Chen, K. Jahn, & E. Lima (Eds.), Mathematical and Computational Oncology - 1st International Symposium, ISMCO 2019, Proceedings (pp. 37-43). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 11826 LNCS). Springer. https://doi.org/10.1007/978-3-030-35210-3_3

@inproceedings{78e65332c4604761bd77513bb8c164c0,

title = "cmIF: A Python Library for Scalable Multiplex Imaging Pipelines",

keywords = "High-throughput analytics, Image processing, Multiplex imaging",

author = "Jennifer Eng and Elmar Bucher and Elliot Gray and Campbell, {Lydia Grace} and Guillaume Thibault and Laura Heiser and Summer Gibbs and Gray, {Joe W.} and Koei Chin and Chang, {Young Hwan}",

note = "Publisher Copyright: {\textcopyright} 2019, Springer Nature Switzerland AG.; 1st International Symposium on Mathematical and Computational Oncology, ISMCO 2019 ; Conference date: 14-10-2019 Through 16-10-2019",

year = "2019",

doi = "10.1007/978-3-030-35210-3_3",

language = "English (US)",

isbn = "9783030352097",

series = "Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)",

publisher = "Springer",

pages = "37--43",

editor = "George Bebis and Takis Benos and Ken Chen and Katharina Jahn and Ernesto Lima",

booktitle = "Mathematical and Computational Oncology - 1st International Symposium, ISMCO 2019, Proceedings",

}

TY - GEN

T1 - cmIF

T2 - 1st International Symposium on Mathematical and Computational Oncology, ISMCO 2019

AU - Eng, Jennifer

AU - Bucher, Elmar

AU - Gray, Elliot

AU - Campbell, Lydia Grace

AU - Thibault, Guillaume

AU - Heiser, Laura

AU - Gibbs, Summer

AU - Gray, Joe W.

AU - Chin, Koei

AU - Chang, Young Hwan

PY - 2019

Y1 - 2019

KW - High-throughput analytics

KW - Image processing

KW - Multiplex imaging

UR - http://www.scopus.com/inward/record.url?scp=85076982229&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85076982229&partnerID=8YFLogxK

U2 - 10.1007/978-3-030-35210-3_3

DO - 10.1007/978-3-030-35210-3_3

M3 - Conference contribution

AN - SCOPUS:85076982229

SN - 9783030352097

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 37

EP - 43

BT - Mathematical and Computational Oncology - 1st International Symposium, ISMCO 2019, Proceedings

A2 - Bebis, George

A2 - Benos, Takis

A2 - Chen, Ken

A2 - Jahn, Katharina

A2 - Lima, Ernesto

PB - Springer

Y2 - 14 October 2019 through 16 October 2019

ER -
