TY - JOUR
T1 - Variability of interpretive accuracy among diagnostic mammography facilities
AU - Jackson, Sara L.
AU - Taplin, Stephen H.
AU - Sickles, Edward A.
AU - Abraham, Linn
AU - Barlow, William E.
AU - Carney, Patricia A.
AU - Geller, Berta
AU - Berns, Eric A.
AU - Cutter, Gary R.
AU - Elmore, Joann G.
N1 - Funding Information:
Agency for Healthcare Research and Quality (public health service grant R01 HS-010591 to J.G.E.) and National Cancer Institute (NCI) (grants R01 CA-107623 and K05 CA-104699 to J.G.E.). NCI-funded Breast Cancer Surveillance Consortium cooperative agreement (grants U01CA63740, U01CA86076, U01CA86082, U01CA63736, U01CA70013, U01CA69976, U01CA63731, and U01CA70040 for data collection). Dr Jackson is supported by Dr Elmore’s R01 grant. The collection of cancer incidence data used in this study was supported in part by several state public health departments and cancer registries throughout the United States. For a full description of these sources, please see http://breastscreening.cancer.gov/work/acknowledgement.html.
PY - 2009/6
Y1 - 2009/6
N2 - Background: Interpretive performance of screening mammography varies substantially by facility, but performance of diagnostic interpretation has not been studied. Methods: Facilities performing diagnostic mammography within three registries of the Breast Cancer Surveillance Consortium were surveyed about their structure, organization, and interpretive processes. Performance measurements (false-positive rate, sensitivity, and likelihood of cancer among women referred for biopsy [positive predictive value of biopsy recommendation {PPV2}]) from January 1, 1998, through December 31, 2005, were prospectively measured. Logistic regression and receiver operating characteristic (ROC) curve analyses, adjusted for patient and radiologist characteristics, were used to assess the association between facility characteristics and interpretive performance. All statistical tests were two-sided. Results: Forty-five of the 53 facilities completed a facility survey (85% response rate), and 32 of the 45 facilities performed diagnostic mammography. The analyses included 28,100 diagnostic mammograms performed as an evaluation of a breast problem, and data were available for 118 radiologists who interpreted diagnostic mammograms at the facilities. Performance measurements demonstrated statistically significant interpretive variability among facilities (sensitivity, P = .006; false-positive rate, P < .001; and PPV2, P < .001) in unadjusted analyses. However, after adjustment for patient and radiologist characteristics, only false-positive rate variation remained statistically significant, and facility traits associated with performance measures changed (false-positive rate = 6.5%, 95% confidence interval [CI] = 5.5% to 7.4%; sensitivity = 73.5%, 95% CI = 67.1% to 79.9%; and PPV2 = 33.8%, 95% CI = 29.1% to 38.5%). Facilities reporting that concern about malpractice had moderately or greatly increased diagnostic examination recommendations at the facility had a higher false-positive rate (odds ratio [OR] = 1.48, 95% CI = 1.09 to 2.01) and a non-statistically significantly higher sensitivity (OR = 1.74, 95% CI = 0.94 to 3.23). Facilities offering specialized interventional services had a non-statistically significantly higher false-positive rate (OR = 1.97, 95% CI = 0.94 to 4.1). No characteristics were associated with overall accuracy by ROC curve analyses. Conclusions: Variation in diagnostic mammography interpretation exists across facilities. Failure to adjust for patient characteristics when comparing facility performance could lead to erroneous conclusions. Malpractice concerns are associated with interpretive performance.
UR - http://www.scopus.com/inward/record.url?scp=67449083871&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=67449083871&partnerID=8YFLogxK
U2 - 10.1093/jnci/djp105
DO - 10.1093/jnci/djp105
M3 - Article
C2 - 19470953
AN - SCOPUS:67449083871
SN - 0027-8874
VL - 101
SP - 814
EP - 827
JO - Journal of the National Cancer Institute
JF - Journal of the National Cancer Institute
IS - 11
ER -