Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity

for the Imaging and Informatics In Retinopathy Of Prematurity (i-ROP) Research Consortium

doi:10.1136/bjophthalmol-2018-313156

Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity

for the Imaging and Informatics In Retinopathy Of Prematurity (i-ROP) Research Consortium

Ophthalmology

Research output: Contribution to journal › Article › peer-review

115 Scopus citations

Abstract

Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.

Original language	English (US)
Pages (from-to)	580-584
Number of pages	5
Journal	British Journal of Ophthalmology
Volume	103
Issue number	5
DOIs	https://doi.org/10.1136/bjophthalmol-2018-313156
State	Published - May 1 2019

Keywords

child health (paediatrics)
public health
retina
telemedicine

ASJC Scopus subject areas

Ophthalmology
Sensory Systems
Cellular and Molecular Neuroscience

Access to Document

10.1136/bjophthalmol-2018-313156

Cite this

@article{6124650b36804a398998735436327bf7,

title = "Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity",

abstract = "Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.",

keywords = "child health (paediatrics), public health, retina, telemedicine",

author = "{for the Imaging and Informatics In Retinopathy Of Prematurity (i-ROP) Research Consortium} and Redd, {Travis K.} and Campbell, {John Peter} and Brown, {James M.} and Kim, {Sang Jin} and Susan Ostmo and Chan, {Robison Vernon Paul} and Jennifer Dy and Deniz Erdogmus and Stratis Ioannidis and Jayashree Kalpathy-Cramer and Chiang, {Michael F.}",

note = "Publisher Copyright: {\textcopyright} Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.2019.",

year = "2019",

month = may,

day = "1",

doi = "10.1136/bjophthalmol-2018-313156",

language = "English (US)",

volume = "103",

pages = "580--584",

journal = "British Journal of Ophthalmology",

issn = "0007-1161",

publisher = "BMJ Publishing Group",

number = "5",

}

TY - JOUR

T1 - Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity

AU - for the Imaging and Informatics In Retinopathy Of Prematurity (i-ROP) Research Consortium

AU - Redd, Travis K.

AU - Campbell, John Peter

AU - Brown, James M.

AU - Kim, Sang Jin

AU - Ostmo, Susan

AU - Chan, Robison Vernon Paul

AU - Dy, Jennifer

AU - Erdogmus, Deniz

AU - Ioannidis, Stratis

AU - Kalpathy-Cramer, Jayashree

AU - Chiang, Michael F.

N1 - Publisher Copyright: © Author(s) (or their employer(s)) 2019. No commercial re-use. See rights and permissions. Published by BMJ.2019.

PY - 2019/5/1

Y1 - 2019/5/1

N2 - Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.

AB - Background Prior work has demonstrated the near-perfect accuracy of a deep learning retinal image analysis system for diagnosing plus disease in retinopathy of prematurity (ROP). Here we assess the screening potential of this scoring system by determining its ability to detect all components of ROP diagnosis. Methods Clinical examination and fundus photography were performed at seven participating centres. A deep learning system was trained to detect plus disease, generating a quantitative assessment of retinal vascular abnormality (the i-ROP plus score) on a 1-9 scale. Overall ROP disease category was established using a consensus reference standard diagnosis combining clinical and image-based diagnosis. Experts then ranked ordered a second data set of 100 posterior images according to overall ROP severity. Results 4861 examinations from 870 infants were analysed. 155 examinations (3%) had a reference standard diagnosis of type 1 ROP. The i-ROP deep learning (DL) vascular severity score had an area under the receiver operating curve of 0.960 for detecting type 1 ROP. Establishing a threshold i-ROP DL score of 3 conferred 94% sensitivity, 79% specificity, 13% positive predictive value and 99.7% negative predictive value for type 1 ROP. There was strong correlation between expert rank ordering of overall ROP severity and the i-ROP DL vascular severity score (Spearman correlation coefficient=0.93; p<0.0001). Conclusion The i-ROP DL system accurately identifies diagnostic categories and overall disease severity in an automated fashion, after being trained only on posterior pole vascular morphology. These data provide proof of concept that a deep learning screening platform could improve objectivity of ROP diagnosis and accessibility of screening.

KW - child health (paediatrics)

KW - public health

KW - retina

KW - telemedicine

UR - http://www.scopus.com/inward/record.url?scp=85057264647&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057264647&partnerID=8YFLogxK

U2 - 10.1136/bjophthalmol-2018-313156

DO - 10.1136/bjophthalmol-2018-313156

M3 - Article

C2 - 30470715

AN - SCOPUS:85057264647

SN - 0007-1161

VL - 103

SP - 580

EP - 584

JO - British Journal of Ophthalmology

JF - British Journal of Ophthalmology

IS - 5

ER -

Evaluation of a deep learning image assessment system for detecting severe retinopathy of prematurity

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this