Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research

Nicole Gray Weiskopf; Chunhua Weng

doi:10.1136/amiajnl-2011-000681

Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research

Nicole Gray Weiskopf, Chunhua Weng

Research output: Contribution to journal › Article › peer-review

701 Scopus citations

Abstract

Objective: To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research. Materials and methods: A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used. Results: Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence. Discussion: Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment. Conclusion: There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.

Original language	English (US)
Pages (from-to)	144-151
Number of pages	8
Journal	Journal of the American Medical Informatics Association
Volume	20
Issue number	1
DOIs	https://doi.org/10.1136/amiajnl-2011-000681
State	Published - 2013
Externally published	Yes

ASJC Scopus subject areas

Health Informatics

Access to Document

10.1136/amiajnl-2011-000681

Cite this

@article{7f20cfdb3e1a4a8498974e86dfea2536,

title = "Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research",

abstract = "Objective: To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research. Materials and methods: A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used. Results: Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence. Discussion: Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment. Conclusion: There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.",

author = "Weiskopf, {Nicole Gray} and Chunhua Weng",

year = "2013",

doi = "10.1136/amiajnl-2011-000681",

language = "English (US)",

volume = "20",

pages = "144--151",

journal = "Journal of the American Medical Informatics Association",

issn = "1067-5027",

publisher = "Oxford University Press",

number = "1",

}

TY - JOUR

T1 - Methods and dimensions of electronic health record data quality assessment

T2 - Enabling reuse for clinical research

AU - Weiskopf, Nicole Gray

AU - Weng, Chunhua

PY - 2013

Y1 - 2013

N2 - Objective: To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research. Materials and methods: A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used. Results: Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence. Discussion: Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment. Conclusion: There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.

AB - Objective: To review the methods and dimensions of data quality assessment in the context of electronic health record (EHR) data reuse for research. Materials and methods: A review of the clinical research literature discussing data quality assessment methodology for EHR data was performed. Using an iterative process, the aspects of data quality being measured were abstracted and categorized, as well as the methods of assessment used. Results: Five dimensions of data quality were identified, which are completeness, correctness, concordance, plausibility, and currency, and seven broad categories of data quality assessment methods: comparison with gold standards, data element agreement, data source agreement, distribution comparison, validity checks, log review, and element presence. Discussion: Examination of the methods by which clinical researchers have investigated the quality and suitability of EHR data for research shows that there are fundamental features of data quality, which may be difficult to measure, as well as proxy dimensions. Researchers interested in the reuse of EHR data for clinical research are recommended to consider the adoption of a consistent taxonomy of EHR data quality, to remain aware of the task-dependence of data quality, to integrate work on data quality assessment from other fields, and to adopt systematic, empirically driven, statistically based methods of data quality assessment. Conclusion: There is currently little consistency or potential generalizability in the methods used to assess EHR data quality. If the reuse of EHR data for clinical research is to become accepted, researchers should adopt validated, systematic methods of EHR data quality assessment.

UR - http://www.scopus.com/inward/record.url?scp=84871882786&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84871882786&partnerID=8YFLogxK

U2 - 10.1136/amiajnl-2011-000681

DO - 10.1136/amiajnl-2011-000681

M3 - Article

C2 - 22733976

AN - SCOPUS:84871882786

SN - 1067-5027

VL - 20

SP - 144

EP - 151

JO - Journal of the American Medical Informatics Association

JF - Journal of the American Medical Informatics Association

IS - 1

ER -

Methods and dimensions of electronic health record data quality assessment: Enabling reuse for clinical research

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this