Defining and measuring completeness of electronic health records for secondary use

Nicole Weiskopf, George Hripcsak, Sushmita Swaminathan, Chunhua Weng

Research output: Contribution to journalArticle

93 Citations (Scopus)

Abstract

We demonstrate the importance of explicit definitions of electronic health record (EHR) data completeness and how different conceptualizations of completeness may impact findings from EHR-derived datasets. This study has important repercussions for researchers and clinicians engaged in the secondary use of EHR data. We describe four prototypical definitions of EHR completeness: documentation, breadth, density, and predictive completeness. Each definition dictates a different approach to the measurement of completeness. These measures were applied to representative data from NewYork-Presbyterian Hospital's clinical data warehouse. We found that according to any definition, the number of complete records in our clinical database is far lower than the nominal total. The proportion that meets criteria for completeness is heavily dependent on the definition of completeness used, and the different definitions generate different subsets of records. We conclude that the concept of completeness in EHR is contextual. We urge data consumers to be explicit in how they define a complete record and transparent about the limitations of their data.

Original languageEnglish (US)
Pages (from-to)830-836
Number of pages7
JournalJournal of Biomedical Informatics
Volume46
Issue number5
DOIs
StatePublished - Oct 2013
Externally publishedYes

Fingerprint

Electronic Health Records
Health
Data warehouses
Documentation
Research Personnel
Databases

Keywords

  • Completeness
  • Data quality
  • Electronic health records
  • Secondary use

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this

Defining and measuring completeness of electronic health records for secondary use. / Weiskopf, Nicole; Hripcsak, George; Swaminathan, Sushmita; Weng, Chunhua.

In: Journal of Biomedical Informatics, Vol. 46, No. 5, 10.2013, p. 830-836.

Research output: Contribution to journalArticle

Weiskopf, Nicole ; Hripcsak, George ; Swaminathan, Sushmita ; Weng, Chunhua. / Defining and measuring completeness of electronic health records for secondary use. In: Journal of Biomedical Informatics. 2013 ; Vol. 46, No. 5. pp. 830-836.
@article{299e6c0efc304a1db27cde6c608b2c46,
title = "Defining and measuring completeness of electronic health records for secondary use",
abstract = "We demonstrate the importance of explicit definitions of electronic health record (EHR) data completeness and how different conceptualizations of completeness may impact findings from EHR-derived datasets. This study has important repercussions for researchers and clinicians engaged in the secondary use of EHR data. We describe four prototypical definitions of EHR completeness: documentation, breadth, density, and predictive completeness. Each definition dictates a different approach to the measurement of completeness. These measures were applied to representative data from NewYork-Presbyterian Hospital's clinical data warehouse. We found that according to any definition, the number of complete records in our clinical database is far lower than the nominal total. The proportion that meets criteria for completeness is heavily dependent on the definition of completeness used, and the different definitions generate different subsets of records. We conclude that the concept of completeness in EHR is contextual. We urge data consumers to be explicit in how they define a complete record and transparent about the limitations of their data.",
keywords = "Completeness, Data quality, Electronic health records, Secondary use",
author = "Nicole Weiskopf and George Hripcsak and Sushmita Swaminathan and Chunhua Weng",
year = "2013",
month = "10",
doi = "10.1016/j.jbi.2013.06.010",
language = "English (US)",
volume = "46",
pages = "830--836",
journal = "Journal of Biomedical Informatics",
issn = "1532-0464",
publisher = "Academic Press Inc.",
number = "5",

}

TY - JOUR

T1 - Defining and measuring completeness of electronic health records for secondary use

AU - Weiskopf, Nicole

AU - Hripcsak, George

AU - Swaminathan, Sushmita

AU - Weng, Chunhua

PY - 2013/10

Y1 - 2013/10

N2 - We demonstrate the importance of explicit definitions of electronic health record (EHR) data completeness and how different conceptualizations of completeness may impact findings from EHR-derived datasets. This study has important repercussions for researchers and clinicians engaged in the secondary use of EHR data. We describe four prototypical definitions of EHR completeness: documentation, breadth, density, and predictive completeness. Each definition dictates a different approach to the measurement of completeness. These measures were applied to representative data from NewYork-Presbyterian Hospital's clinical data warehouse. We found that according to any definition, the number of complete records in our clinical database is far lower than the nominal total. The proportion that meets criteria for completeness is heavily dependent on the definition of completeness used, and the different definitions generate different subsets of records. We conclude that the concept of completeness in EHR is contextual. We urge data consumers to be explicit in how they define a complete record and transparent about the limitations of their data.

AB - We demonstrate the importance of explicit definitions of electronic health record (EHR) data completeness and how different conceptualizations of completeness may impact findings from EHR-derived datasets. This study has important repercussions for researchers and clinicians engaged in the secondary use of EHR data. We describe four prototypical definitions of EHR completeness: documentation, breadth, density, and predictive completeness. Each definition dictates a different approach to the measurement of completeness. These measures were applied to representative data from NewYork-Presbyterian Hospital's clinical data warehouse. We found that according to any definition, the number of complete records in our clinical database is far lower than the nominal total. The proportion that meets criteria for completeness is heavily dependent on the definition of completeness used, and the different definitions generate different subsets of records. We conclude that the concept of completeness in EHR is contextual. We urge data consumers to be explicit in how they define a complete record and transparent about the limitations of their data.

KW - Completeness

KW - Data quality

KW - Electronic health records

KW - Secondary use

UR - http://www.scopus.com/inward/record.url?scp=84883774090&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84883774090&partnerID=8YFLogxK

U2 - 10.1016/j.jbi.2013.06.010

DO - 10.1016/j.jbi.2013.06.010

M3 - Article

VL - 46

SP - 830

EP - 836

JO - Journal of Biomedical Informatics

JF - Journal of Biomedical Informatics

SN - 1532-0464

IS - 5

ER -