Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics.

Hui Cao, Marianthi Markatou, Genevieve B. Melton, Michael F. Chiang, George Hripcsak

Research output: Contribution to journalArticle

42 Scopus citations

Abstract

This paper applies co-occurrence statistics to discover disease-finding associations in a clinical data warehouse. We used two methods, chi2 statistics and the proportion confidence interval (PCI) method, to measure the dependence of pairs of diseases and findings, and then used heuristic cutoff values for association selection. An intrinsic evaluation showed that 94 percent of disease-finding associations obtained by chi2 statistics and 76.8 percent obtained by the PCI method were true associations. The selected associations were used to construct knowledge bases of disease-finding relations (KB-chi2, KB-PCI). An extrinsic evaluation showed that both KB-chi2 and KB-PCI could assist in eliminating clinically non-informative and redundant findings from problem lists generated by our automated problem list summarization system.

Original languageEnglish (US)
Pages (from-to)106-110
Number of pages5
JournalAMIA ... Annual Symposium proceedings / AMIA Symposium. AMIA Symposium
StatePublished - 2005
Externally publishedYes

ASJC Scopus subject areas

  • Medicine(all)

Fingerprint Dive into the research topics of 'Mining a clinical data warehouse to discover disease-finding associations using co-occurrence statistics.'. Together they form a unique fingerprint.

  • Cite this