TY - GEN
T1 - cmIF
T2 - 1st International Symposium on Mathematical and Computational Oncology, ISMCO 2019
AU - Eng, Jennifer
AU - Bucher, Elmar
AU - Gray, Elliot
AU - Campbell, Lydia Grace
AU - Thibault, Guillaume
AU - Heiser, Laura
AU - Gibbs, Summer
AU - Gray, Joe W.
AU - Chin, Koei
AU - Chang, Young Hwan
N1 - Funding Information:
Financial Supports. NIH/NCI U54 CA209988, NIH/NCI U2C CA233280, Prospect Creek Foundation, Susan G. Komen Foundation, OHSU Foundation, and Oregon Clinical & Translational Research Institute.
Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Histological staining and analysis of tissue sections is integral to diagnosis and treatment of many diseases, including cancer. Multiplex imaging technologies (e.g., cyclic immunostaining) have dramatically increased capabilities for assessing prognostic biomarkers in situ, enabling new insights into complex diseases. However, high-resolution, multiplex image data can be terabytes (TB) in size, and traditional pipelines for image analysis are not suited for these rich datasets. While much software development effort goes towards improving image processing tools such as stitching, registration, and segmentation; integration of these tools into a pipeline is often manual, which is highly laborious, error-prone and lacks reproducibility and scalability. Therefore, we developed a Python3 library, cmIF, a free and open-source tool to handle our high-throughput multiplex image processing pipeline. cmIF enables analysis of full-slide pathology tissue sections and tissue microarrays (TMAs), facilitating processing from raw image files through registration, segmentation, feature extraction, manual thresholding, and spatial pattern analysis. Our cmIF library includes functionality for image handling, quality control, metadata extraction, and subtraction of background images (i.e., autofluorescence subtraction). Additionally, it includes a Jupyter notebook for efficient generation and visualization of manual thresholds. Compared to a manual pipeline, use of cmIF reduces errors and improves processing time of datasets from weeks to hours, while documenting processing steps for reproducibility. All code is available on https://gitlab.com/engje/cmif. While our library is specific to our pipeline elements, it is a blueprint for types of functions needed for high throughput analysis. In the future, we will continue developing this open-source tool, and with input from the wider community, adapt it to a range of multiplex image pipelines.
AB - Histological staining and analysis of tissue sections is integral to diagnosis and treatment of many diseases, including cancer. Multiplex imaging technologies (e.g., cyclic immunostaining) have dramatically increased capabilities for assessing prognostic biomarkers in situ, enabling new insights into complex diseases. However, high-resolution, multiplex image data can be terabytes (TB) in size, and traditional pipelines for image analysis are not suited for these rich datasets. While much software development effort goes towards improving image processing tools such as stitching, registration, and segmentation; integration of these tools into a pipeline is often manual, which is highly laborious, error-prone and lacks reproducibility and scalability. Therefore, we developed a Python3 library, cmIF, a free and open-source tool to handle our high-throughput multiplex image processing pipeline. cmIF enables analysis of full-slide pathology tissue sections and tissue microarrays (TMAs), facilitating processing from raw image files through registration, segmentation, feature extraction, manual thresholding, and spatial pattern analysis. Our cmIF library includes functionality for image handling, quality control, metadata extraction, and subtraction of background images (i.e., autofluorescence subtraction). Additionally, it includes a Jupyter notebook for efficient generation and visualization of manual thresholds. Compared to a manual pipeline, use of cmIF reduces errors and improves processing time of datasets from weeks to hours, while documenting processing steps for reproducibility. All code is available on https://gitlab.com/engje/cmif. While our library is specific to our pipeline elements, it is a blueprint for types of functions needed for high throughput analysis. In the future, we will continue developing this open-source tool, and with input from the wider community, adapt it to a range of multiplex image pipelines.
KW - High-throughput analytics
KW - Image processing
KW - Multiplex imaging
UR - http://www.scopus.com/inward/record.url?scp=85076982229&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85076982229&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-35210-3_3
DO - 10.1007/978-3-030-35210-3_3
M3 - Conference contribution
AN - SCOPUS:85076982229
SN - 9783030352097
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 37
EP - 43
BT - Mathematical and Computational Oncology - 1st International Symposium, ISMCO 2019, Proceedings
A2 - Bebis, George
A2 - Benos, Takis
A2 - Chen, Ken
A2 - Jahn, Katharina
A2 - Lima, Ernesto
PB - Springer
Y2 - 14 October 2019 through 16 October 2019
ER -