Variability in pathologists' interpretations of individual breast biopsy slides: A population perspective

Joann G. Elmore; Heidi D. Nelson; Margaret S. Pepe; Gary M. Longton; Anna N.A. Tosteson; Berta Geller; Tracy Onega; Patricia A. Carney; Sara L. Jackson; Kimberly H. Allison; Donald L. Weaver

doi:10.7326/M15-0964

Variability in pathologists' interpretations of individual breast biopsy slides: A population perspective

Joann G. Elmore, Heidi D. Nelson, Margaret S. Pepe, Gary M. Longton, Anna N.A. Tosteson, Berta Geller, Tracy Onega, Patricia A. Carney, Sara L. Jackson, Kimberly H. Allison, Donald L. Weaver

Research output: Contribution to journal › Article › peer-review

55 Scopus citations

Abstract

Background: The effect of physician diagnostic variability on accuracy at a population level depends on the prevalence of diagnoses. Objective: To estimate how diagnostic variability affects accuracy from the perspective of a U.S. woman aged 50 to 59 years having a breast biopsy. Design: Applied probability using Bayes theorem. Setting: B-Path (Breast Pathology) Study comparing pathologists' interpretations of a single biopsy slide versus a reference consensus interpretation from 3 experts. Participants: 115 practicing pathologists (6900 total interpretations from 240 distinct cases). Measurements: A single representative slide from each of the 240 cases was used to estimate the proportion of biopsies with a diagnosis that would be verified if the same slide were interpreted by a reference group of 3 expert pathologists. Probabilities of confirmation (predictive values) were estimated using B-Path Study results and prevalence of biopsy diagnoses for women aged 50 to 59 years in the Breast Cancer Surveillance Consortium. Results: Overall, if 1 representative slide were used per case, 92.3% (95% CI, 91.4% to 93.1%) of breast biopsy diagnoses would be verified by reference consensus diagnoses, with 4.6% (CI, 3.9% to 5.3%) overinterpreted and 3.2% (CI, 2.7% to 3.6%) underinterpreted. Verification of invasive breast cancer and benign without atypia diagnoses is highly probable; estimated predictive values were 97.7% (CI, 96.5% to 98.7%) and 97.1% (CI, 96.7% to 97.4%), respectively. Verification is less probable for atypia (53.6% overinterpreted and 8.6% underinterpreted) and ductal carcinoma in situ (DCIS) (18.5% overinterpreted and 11.8% underinterpreted). Limitations: Estimates are based on a testing situation with 1 slide used per case and without access to second opinions. Population-adjusted estimates may differ for women from other age groups, unscreened women, or women in different practice settings. Conclusion: This analysis, based on interpretation of a single breast biopsy slide per case, predicts a low likelihood that a diagnosis of atypia or DCIS would be verified by a reference consensus diagnosis. This diagnostic gray zone should be considered in clinical management decisions in patients with these diagnoses.

Original language	English (US)
Pages (from-to)	649-655
Number of pages	7
Journal	Annals of internal medicine
Volume	164
Issue number	10
DOIs	https://doi.org/10.7326/M15-0964
State	Published - May 17 2016

ASJC Scopus subject areas

Internal Medicine

Access to Document

10.7326/M15-0964

Cite this

@article{1c7e4e53ee3941a59bc4911eb8bb5cda,

title = "Variability in pathologists' interpretations of individual breast biopsy slides: A population perspective",

abstract = "Background: The effect of physician diagnostic variability on accuracy at a population level depends on the prevalence of diagnoses. Objective: To estimate how diagnostic variability affects accuracy from the perspective of a U.S. woman aged 50 to 59 years having a breast biopsy. Design: Applied probability using Bayes theorem. Setting: B-Path (Breast Pathology) Study comparing pathologists' interpretations of a single biopsy slide versus a reference consensus interpretation from 3 experts. Participants: 115 practicing pathologists (6900 total interpretations from 240 distinct cases). Measurements: A single representative slide from each of the 240 cases was used to estimate the proportion of biopsies with a diagnosis that would be verified if the same slide were interpreted by a reference group of 3 expert pathologists. Probabilities of confirmation (predictive values) were estimated using B-Path Study results and prevalence of biopsy diagnoses for women aged 50 to 59 years in the Breast Cancer Surveillance Consortium. Results: Overall, if 1 representative slide were used per case, 92.3% (95% CI, 91.4% to 93.1%) of breast biopsy diagnoses would be verified by reference consensus diagnoses, with 4.6% (CI, 3.9% to 5.3%) overinterpreted and 3.2% (CI, 2.7% to 3.6%) underinterpreted. Verification of invasive breast cancer and benign without atypia diagnoses is highly probable; estimated predictive values were 97.7% (CI, 96.5% to 98.7%) and 97.1% (CI, 96.7% to 97.4%), respectively. Verification is less probable for atypia (53.6% overinterpreted and 8.6% underinterpreted) and ductal carcinoma in situ (DCIS) (18.5% overinterpreted and 11.8% underinterpreted). Limitations: Estimates are based on a testing situation with 1 slide used per case and without access to second opinions. Population-adjusted estimates may differ for women from other age groups, unscreened women, or women in different practice settings. Conclusion: This analysis, based on interpretation of a single breast biopsy slide per case, predicts a low likelihood that a diagnosis of atypia or DCIS would be verified by a reference consensus diagnosis. This diagnostic gray zone should be considered in clinical management decisions in patients with these diagnoses.",

author = "Elmore, {Joann G.} and Nelson, {Heidi D.} and Pepe, {Margaret S.} and Longton, {Gary M.} and Tosteson, {Anna N.A.} and Berta Geller and Tracy Onega and Carney, {Patricia A.} and Jackson, {Sara L.} and Allison, {Kimberly H.} and Weaver, {Donald L.}",

note = "Publisher Copyright: {\textcopyright} 2016 American College of Physicians.",

year = "2016",

month = may,

day = "17",

doi = "10.7326/M15-0964",

language = "English (US)",

volume = "164",

pages = "649--655",

journal = "Annals of internal medicine",

issn = "0003-4819",

publisher = "American College of Physicians",

number = "10",

}

TY - JOUR

T1 - Variability in pathologists' interpretations of individual breast biopsy slides

T2 - A population perspective

AU - Elmore, Joann G.

AU - Nelson, Heidi D.

AU - Pepe, Margaret S.

AU - Longton, Gary M.

AU - Tosteson, Anna N.A.

AU - Geller, Berta

AU - Onega, Tracy

AU - Carney, Patricia A.

AU - Jackson, Sara L.

AU - Allison, Kimberly H.

AU - Weaver, Donald L.

PY - 2016/5/17

Y1 - 2016/5/17

N2 - Background: The effect of physician diagnostic variability on accuracy at a population level depends on the prevalence of diagnoses. Objective: To estimate how diagnostic variability affects accuracy from the perspective of a U.S. woman aged 50 to 59 years having a breast biopsy. Design: Applied probability using Bayes theorem. Setting: B-Path (Breast Pathology) Study comparing pathologists' interpretations of a single biopsy slide versus a reference consensus interpretation from 3 experts. Participants: 115 practicing pathologists (6900 total interpretations from 240 distinct cases). Measurements: A single representative slide from each of the 240 cases was used to estimate the proportion of biopsies with a diagnosis that would be verified if the same slide were interpreted by a reference group of 3 expert pathologists. Probabilities of confirmation (predictive values) were estimated using B-Path Study results and prevalence of biopsy diagnoses for women aged 50 to 59 years in the Breast Cancer Surveillance Consortium. Results: Overall, if 1 representative slide were used per case, 92.3% (95% CI, 91.4% to 93.1%) of breast biopsy diagnoses would be verified by reference consensus diagnoses, with 4.6% (CI, 3.9% to 5.3%) overinterpreted and 3.2% (CI, 2.7% to 3.6%) underinterpreted. Verification of invasive breast cancer and benign without atypia diagnoses is highly probable; estimated predictive values were 97.7% (CI, 96.5% to 98.7%) and 97.1% (CI, 96.7% to 97.4%), respectively. Verification is less probable for atypia (53.6% overinterpreted and 8.6% underinterpreted) and ductal carcinoma in situ (DCIS) (18.5% overinterpreted and 11.8% underinterpreted). Limitations: Estimates are based on a testing situation with 1 slide used per case and without access to second opinions. Population-adjusted estimates may differ for women from other age groups, unscreened women, or women in different practice settings. Conclusion: This analysis, based on interpretation of a single breast biopsy slide per case, predicts a low likelihood that a diagnosis of atypia or DCIS would be verified by a reference consensus diagnosis. This diagnostic gray zone should be considered in clinical management decisions in patients with these diagnoses.

AB - Background: The effect of physician diagnostic variability on accuracy at a population level depends on the prevalence of diagnoses. Objective: To estimate how diagnostic variability affects accuracy from the perspective of a U.S. woman aged 50 to 59 years having a breast biopsy. Design: Applied probability using Bayes theorem. Setting: B-Path (Breast Pathology) Study comparing pathologists' interpretations of a single biopsy slide versus a reference consensus interpretation from 3 experts. Participants: 115 practicing pathologists (6900 total interpretations from 240 distinct cases). Measurements: A single representative slide from each of the 240 cases was used to estimate the proportion of biopsies with a diagnosis that would be verified if the same slide were interpreted by a reference group of 3 expert pathologists. Probabilities of confirmation (predictive values) were estimated using B-Path Study results and prevalence of biopsy diagnoses for women aged 50 to 59 years in the Breast Cancer Surveillance Consortium. Results: Overall, if 1 representative slide were used per case, 92.3% (95% CI, 91.4% to 93.1%) of breast biopsy diagnoses would be verified by reference consensus diagnoses, with 4.6% (CI, 3.9% to 5.3%) overinterpreted and 3.2% (CI, 2.7% to 3.6%) underinterpreted. Verification of invasive breast cancer and benign without atypia diagnoses is highly probable; estimated predictive values were 97.7% (CI, 96.5% to 98.7%) and 97.1% (CI, 96.7% to 97.4%), respectively. Verification is less probable for atypia (53.6% overinterpreted and 8.6% underinterpreted) and ductal carcinoma in situ (DCIS) (18.5% overinterpreted and 11.8% underinterpreted). Limitations: Estimates are based on a testing situation with 1 slide used per case and without access to second opinions. Population-adjusted estimates may differ for women from other age groups, unscreened women, or women in different practice settings. Conclusion: This analysis, based on interpretation of a single breast biopsy slide per case, predicts a low likelihood that a diagnosis of atypia or DCIS would be verified by a reference consensus diagnosis. This diagnostic gray zone should be considered in clinical management decisions in patients with these diagnoses.

UR - http://www.scopus.com/inward/record.url?scp=84969920110&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84969920110&partnerID=8YFLogxK

U2 - 10.7326/M15-0964

DO - 10.7326/M15-0964

M3 - Article

C2 - 26999810

AN - SCOPUS:84969920110

SN - 0003-4819

VL - 164

SP - 649

EP - 655

JO - Annals of internal medicine

JF - Annals of internal medicine

IS - 10

ER -

Variability in pathologists' interpretations of individual breast biopsy slides: A population perspective

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this