Reproducibility and responsiveness of health status measures statistics and strategies for evaluation

Richard A. Deyo; Paula Diehr; Donald L. Patrick

doi:10.1016/S0197-2456(05)80019-4

Reproducibility and responsiveness of health status measures statistics and strategies for evaluation

Richard A. Deyo, Paula Diehr, Donald L. Patrick

Research output: Contribution to journal › Article › peer-review

1238 Scopus citations

Abstract

Before being introduced to wide use, health status instruments should be evaluated for reliability and validity. Increasingly, they are also tested for responsiveness to important clinical changes. Although standards exist for assessing these properties, confusion and inconsistency arise because multiple statistics are used for the same property; controversy exists over how to measure responsiveness; many statistics are unavailable on common software programs; strategies for measuring these properties vary; and it is often unclear how to define a clinically important change in patient status. Using data from a clinical trial of therapy for back pain, we demonstrate the calculation of several statistics for measuring reproducibility and responsiveness, and demonstrate relationships among them. Simple computational guides for several statistics are provided. We conclude that reproducibility should generally be quantified with the intraclass correlation coefficient rather than the more common Pearson r. Assessing reproducibility by retest at one-to-two week intervals (rather than a shorter interval) may result in more realistic estimates of the variability to be observed among control subjects in a longitudinal study. Instrument responsiveness should be quantified using indicators of effect size, a modified effect size statistic proposed by Guyatt, or the use of receiver operating characteristic (ROC) curves to describe how well various score changes can distinguish improved from unimproved patients.

Original language	English (US)
Pages (from-to)	S142-S158
Journal	Controlled Clinical Trials
Volume	12
Issue number	4 SUPPL.
DOIs	https://doi.org/10.1016/S0197-2456(05)80019-4
State	Published - Aug 1991
Externally published	Yes

Keywords

Health status
functional status
quality-of-life
questionnaires
responsiveness

ASJC Scopus subject areas

Pharmacology

Access to Document

10.1016/S0197-2456(05)80019-4

Cite this

@article{1e7101c1e9514690b5098952b0cea7ec,

title = "Reproducibility and responsiveness of health status measures statistics and strategies for evaluation",

abstract = "Before being introduced to wide use, health status instruments should be evaluated for reliability and validity. Increasingly, they are also tested for responsiveness to important clinical changes. Although standards exist for assessing these properties, confusion and inconsistency arise because multiple statistics are used for the same property; controversy exists over how to measure responsiveness; many statistics are unavailable on common software programs; strategies for measuring these properties vary; and it is often unclear how to define a clinically important change in patient status. Using data from a clinical trial of therapy for back pain, we demonstrate the calculation of several statistics for measuring reproducibility and responsiveness, and demonstrate relationships among them. Simple computational guides for several statistics are provided. We conclude that reproducibility should generally be quantified with the intraclass correlation coefficient rather than the more common Pearson r. Assessing reproducibility by retest at one-to-two week intervals (rather than a shorter interval) may result in more realistic estimates of the variability to be observed among control subjects in a longitudinal study. Instrument responsiveness should be quantified using indicators of effect size, a modified effect size statistic proposed by Guyatt, or the use of receiver operating characteristic (ROC) curves to describe how well various score changes can distinguish improved from unimproved patients.",

keywords = "Health status, functional status, quality-of-life, questionnaires, responsiveness",

author = "Deyo, {Richard A.} and Paula Diehr and Patrick, {Donald L.}",

note = "Funding Information: This work was supported in part by the Northwest Health Services Research and Development Field Program, Seattle Veterans AdministrationM edical Center, Seattle, Washingtona nd by the University of Washington Center for Health Promotion and Disease Prevention in Older Adults, grant number R48/CCR002181-02 from the Centers for Disease Control. Bruce Psaty, M.D. and Deanna Chew-Freidenberg, PhD, assisted in calculating the ROC curve, using a program provided by Dr. Psaty. Kathy Minotto and Shellie Smith assisted in preparing the manuscript.",

year = "1991",

month = aug,

doi = "10.1016/S0197-2456(05)80019-4",

language = "English (US)",

volume = "12",

pages = "S142--S158",

journal = "Controlled Clinical Trials",

issn = "0197-2456",

publisher = "Elsevier BV",

number = "4 SUPPL.",

}

TY - JOUR

T1 - Reproducibility and responsiveness of health status measures statistics and strategies for evaluation

AU - Deyo, Richard A.

AU - Diehr, Paula

AU - Patrick, Donald L.

N1 - Funding Information: This work was supported in part by the Northwest Health Services Research and Development Field Program, Seattle Veterans AdministrationM edical Center, Seattle, Washingtona nd by the University of Washington Center for Health Promotion and Disease Prevention in Older Adults, grant number R48/CCR002181-02 from the Centers for Disease Control. Bruce Psaty, M.D. and Deanna Chew-Freidenberg, PhD, assisted in calculating the ROC curve, using a program provided by Dr. Psaty. Kathy Minotto and Shellie Smith assisted in preparing the manuscript.

PY - 1991/8

Y1 - 1991/8

N2 - Before being introduced to wide use, health status instruments should be evaluated for reliability and validity. Increasingly, they are also tested for responsiveness to important clinical changes. Although standards exist for assessing these properties, confusion and inconsistency arise because multiple statistics are used for the same property; controversy exists over how to measure responsiveness; many statistics are unavailable on common software programs; strategies for measuring these properties vary; and it is often unclear how to define a clinically important change in patient status. Using data from a clinical trial of therapy for back pain, we demonstrate the calculation of several statistics for measuring reproducibility and responsiveness, and demonstrate relationships among them. Simple computational guides for several statistics are provided. We conclude that reproducibility should generally be quantified with the intraclass correlation coefficient rather than the more common Pearson r. Assessing reproducibility by retest at one-to-two week intervals (rather than a shorter interval) may result in more realistic estimates of the variability to be observed among control subjects in a longitudinal study. Instrument responsiveness should be quantified using indicators of effect size, a modified effect size statistic proposed by Guyatt, or the use of receiver operating characteristic (ROC) curves to describe how well various score changes can distinguish improved from unimproved patients.

AB - Before being introduced to wide use, health status instruments should be evaluated for reliability and validity. Increasingly, they are also tested for responsiveness to important clinical changes. Although standards exist for assessing these properties, confusion and inconsistency arise because multiple statistics are used for the same property; controversy exists over how to measure responsiveness; many statistics are unavailable on common software programs; strategies for measuring these properties vary; and it is often unclear how to define a clinically important change in patient status. Using data from a clinical trial of therapy for back pain, we demonstrate the calculation of several statistics for measuring reproducibility and responsiveness, and demonstrate relationships among them. Simple computational guides for several statistics are provided. We conclude that reproducibility should generally be quantified with the intraclass correlation coefficient rather than the more common Pearson r. Assessing reproducibility by retest at one-to-two week intervals (rather than a shorter interval) may result in more realistic estimates of the variability to be observed among control subjects in a longitudinal study. Instrument responsiveness should be quantified using indicators of effect size, a modified effect size statistic proposed by Guyatt, or the use of receiver operating characteristic (ROC) curves to describe how well various score changes can distinguish improved from unimproved patients.

KW - Health status

KW - functional status

KW - quality-of-life

KW - questionnaires

KW - responsiveness

UR - http://www.scopus.com/inward/record.url?scp=0025851102&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0025851102&partnerID=8YFLogxK

U2 - 10.1016/S0197-2456(05)80019-4

DO - 10.1016/S0197-2456(05)80019-4

M3 - Article

C2 - 1663851

AN - SCOPUS:0025851102

SN - 0197-2456

VL - 12

SP - S142-S158

JO - Controlled Clinical Trials

JF - Controlled Clinical Trials

IS - 4 SUPPL.

ER -

Reproducibility and responsiveness of health status measures statistics and strategies for evaluation

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this