Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions

M. A. Deeley; A. Chen; R. D. Datteri; J. Noble; A. Cmelak; E. Donnelly; A. Malcolm; L. Moretti; J. Jaboin; K. Niermann; Eddy S. Yang; David S. Yu; B. M. Dawant

doi:10.1088/0031-9155/58/12/4071

Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions

M. A. Deeley, A. Chen, R. D. Datteri, J. Noble, A. Cmelak, E. Donnelly, A. Malcolm, L. Moretti, J. Jaboin, K. Niermann, Eddy S. Yang, David S. Yu, B. M. Dawant

Research output: Contribution to journal › Article › peer-review

15 Scopus citations

Abstract

Image segmentation has become a vital and often rate-limiting step in modern radiotherapy treatment planning. In recent years, the pace and scope of algorithm development, and even introduction into the clinic, have far exceeded evaluative studies. In this work we build upon our previous evaluation of a registration driven segmentation algorithm in the context of 8 expert raters and 20 patients who underwent radiotherapy for large space-occupying tumours in the brain. In this work we tested four hypotheses concerning the impact of manual segmentation editing in a randomized single-blinded study. We tested these hypotheses on the normal structures of the brainstem, optic chiasm, eyes and optic nerves using the Dice similarity coefficient, volume, and signed Euclidean distance error to evaluate the impact of editing on inter-rater variance and accuracy. Accuracy analyses relied on two simulated ground truth estimation methods: simultaneous truth and performance level estimation and a novel implementation of probability maps. The experts were presented with automatic, their own, and their peers' segmentations from our previous study to edit. We found, independent of source, editing reduced inter-rater variance while maintaining or improving accuracy and improving efficiency with at least 60% reduction in contouring time. In areas where raters performed poorly contouring from scratch, editing of the automatic segmentations reduced the prevalence of total anatomical miss from approximately 16% to 8% of the total slices contained within the ground truth estimations. These findings suggest that contour editing could be useful for consensus building such as in developing delineation standards, and that both automated methods and even perhaps less sophisticated atlases could improve efficiency, inter-rater variance, and accuracy.

Original language	English (US)
Pages (from-to)	4071-4097
Number of pages	27
Journal	Physics in Medicine and Biology
Volume	58
Issue number	12
DOIs	https://doi.org/10.1088/0031-9155/58/12/4071
State	Published - Jun 21 2013
Externally published	Yes

ASJC Scopus subject areas

Radiological and Ultrasound Technology
Radiology Nuclear Medicine and imaging

Access to Document

10.1088/0031-9155/58/12/4071

Cite this

Deeley, M. A., Chen, A., Datteri, R. D., Noble, J., Cmelak, A., Donnelly, E., Malcolm, A., Moretti, L., Jaboin, J., Niermann, K., Yang, E. S., Yu, D. S., & Dawant, B. M. (2013). Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions. Physics in Medicine and Biology, 58(12), 4071-4097. https://doi.org/10.1088/0031-9155/58/12/4071

Deeley, MA, Chen, A, Datteri, RD, Noble, J, Cmelak, A, Donnelly, E, Malcolm, A, Moretti, L, Jaboin, J, Niermann, K, Yang, ES, Yu, DS & Dawant, BM 2013, 'Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions', Physics in Medicine and Biology, vol. 58, no. 12, pp. 4071-4097. https://doi.org/10.1088/0031-9155/58/12/4071

@article{3d6a195172134e6d95f03b7a83abd1aa,

title = "Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions",

abstract = "Image segmentation has become a vital and often rate-limiting step in modern radiotherapy treatment planning. In recent years, the pace and scope of algorithm development, and even introduction into the clinic, have far exceeded evaluative studies. In this work we build upon our previous evaluation of a registration driven segmentation algorithm in the context of 8 expert raters and 20 patients who underwent radiotherapy for large space-occupying tumours in the brain. In this work we tested four hypotheses concerning the impact of manual segmentation editing in a randomized single-blinded study. We tested these hypotheses on the normal structures of the brainstem, optic chiasm, eyes and optic nerves using the Dice similarity coefficient, volume, and signed Euclidean distance error to evaluate the impact of editing on inter-rater variance and accuracy. Accuracy analyses relied on two simulated ground truth estimation methods: simultaneous truth and performance level estimation and a novel implementation of probability maps. The experts were presented with automatic, their own, and their peers' segmentations from our previous study to edit. We found, independent of source, editing reduced inter-rater variance while maintaining or improving accuracy and improving efficiency with at least 60% reduction in contouring time. In areas where raters performed poorly contouring from scratch, editing of the automatic segmentations reduced the prevalence of total anatomical miss from approximately 16% to 8% of the total slices contained within the ground truth estimations. These findings suggest that contour editing could be useful for consensus building such as in developing delineation standards, and that both automated methods and even perhaps less sophisticated atlases could improve efficiency, inter-rater variance, and accuracy.",

author = "Deeley, {M. A.} and A. Chen and Datteri, {R. D.} and J. Noble and A. Cmelak and E. Donnelly and A. Malcolm and L. Moretti and J. Jaboin and K. Niermann and Yang, {Eddy S.} and Yu, {David S.} and Dawant, {B. M.}",

year = "2013",

month = jun,

day = "21",

doi = "10.1088/0031-9155/58/12/4071",

language = "English (US)",

volume = "58",

pages = "4071--4097",

journal = "Physics in Medicine and Biology",

issn = "0031-9155",

publisher = "IOP Publishing Ltd.",

number = "12",

}

TY - JOUR

T1 - Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions

AU - Deeley, M. A.

AU - Chen, A.

AU - Datteri, R. D.

AU - Noble, J.

AU - Cmelak, A.

AU - Donnelly, E.

AU - Malcolm, A.

AU - Moretti, L.

AU - Jaboin, J.

AU - Niermann, K.

AU - Yang, Eddy S.

AU - Yu, David S.

AU - Dawant, B. M.

PY - 2013/6/21

Y1 - 2013/6/21

N2 - Image segmentation has become a vital and often rate-limiting step in modern radiotherapy treatment planning. In recent years, the pace and scope of algorithm development, and even introduction into the clinic, have far exceeded evaluative studies. In this work we build upon our previous evaluation of a registration driven segmentation algorithm in the context of 8 expert raters and 20 patients who underwent radiotherapy for large space-occupying tumours in the brain. In this work we tested four hypotheses concerning the impact of manual segmentation editing in a randomized single-blinded study. We tested these hypotheses on the normal structures of the brainstem, optic chiasm, eyes and optic nerves using the Dice similarity coefficient, volume, and signed Euclidean distance error to evaluate the impact of editing on inter-rater variance and accuracy. Accuracy analyses relied on two simulated ground truth estimation methods: simultaneous truth and performance level estimation and a novel implementation of probability maps. The experts were presented with automatic, their own, and their peers' segmentations from our previous study to edit. We found, independent of source, editing reduced inter-rater variance while maintaining or improving accuracy and improving efficiency with at least 60% reduction in contouring time. In areas where raters performed poorly contouring from scratch, editing of the automatic segmentations reduced the prevalence of total anatomical miss from approximately 16% to 8% of the total slices contained within the ground truth estimations. These findings suggest that contour editing could be useful for consensus building such as in developing delineation standards, and that both automated methods and even perhaps less sophisticated atlases could improve efficiency, inter-rater variance, and accuracy.

AB - Image segmentation has become a vital and often rate-limiting step in modern radiotherapy treatment planning. In recent years, the pace and scope of algorithm development, and even introduction into the clinic, have far exceeded evaluative studies. In this work we build upon our previous evaluation of a registration driven segmentation algorithm in the context of 8 expert raters and 20 patients who underwent radiotherapy for large space-occupying tumours in the brain. In this work we tested four hypotheses concerning the impact of manual segmentation editing in a randomized single-blinded study. We tested these hypotheses on the normal structures of the brainstem, optic chiasm, eyes and optic nerves using the Dice similarity coefficient, volume, and signed Euclidean distance error to evaluate the impact of editing on inter-rater variance and accuracy. Accuracy analyses relied on two simulated ground truth estimation methods: simultaneous truth and performance level estimation and a novel implementation of probability maps. The experts were presented with automatic, their own, and their peers' segmentations from our previous study to edit. We found, independent of source, editing reduced inter-rater variance while maintaining or improving accuracy and improving efficiency with at least 60% reduction in contouring time. In areas where raters performed poorly contouring from scratch, editing of the automatic segmentations reduced the prevalence of total anatomical miss from approximately 16% to 8% of the total slices contained within the ground truth estimations. These findings suggest that contour editing could be useful for consensus building such as in developing delineation standards, and that both automated methods and even perhaps less sophisticated atlases could improve efficiency, inter-rater variance, and accuracy.

UR - http://www.scopus.com/inward/record.url?scp=84878855480&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84878855480&partnerID=8YFLogxK

U2 - 10.1088/0031-9155/58/12/4071

DO - 10.1088/0031-9155/58/12/4071

M3 - Article

C2 - 23685866

AN - SCOPUS:84878855480

SN - 0031-9155

VL - 58

SP - 4071

EP - 4097

JO - Physics in Medicine and Biology

JF - Physics in Medicine and Biology

IS - 12

ER -

Segmentation editing improves efficiency while reducing inter-expert variation and maintaining accuracy for normal brain tissues in the presence of space-occupying lesions

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this