Assessment of clinical performance during simulated crises using both technical and behavioral ratings

David M. Gaba; Steven K. Howard; Brendan Flanagan; Brian E. Smith; Kevin J. Fish; Richard Botney

doi:10.1097/00000542-199807000-00005

Assessment of clinical performance during simulated crises using both technical and behavioral ratings

David M. Gaba, Steven K. Howard, Brendan Flanagan, Brian E. Smith, Kevin J. Fish, Richard Botney

Research output: Contribution to journal › Article › peer-review

421 Scopus citations

Abstract

Background: Techniques are needed to assess anesthesiologists' performance when responding to critical events. Patient simulators allow presentation of similar crisis situations to different clinicians. This study evaluated ratings of performance, and the interrater variability of the ratings, made by multiple independent observers viewing videotapes of simulated crises. Methods: Raters scored the videotapes of 14 different teams that were managing two scenarios: malignant hyperthermia (MH) and cardiac arrest. Technical performance and crisis management behaviors were rated. Technical ratings could range from 0.0 to 1.0 based on scenario-specific checklists of appropriate actions. Ratings of 12 crisis management behaviors were made using a five-point ordinal scale. Several statistical assessments of interrater variability were applied. Results: Technical ratings were high for most teams in both scenarios (0.78 ± 0.08 for MH, 0.83 ± 0.06 for cardiac arrest). Ratings of crisis management behavior varied, with some teams rated as minimally acceptable or poor (28% for MH, 14% for cardiac arrest). The agreement between raters was fair to excellent, depending on the item rated and the statistical test used. Conclusions: Both technical and behavioral performance can be assessed from videotapes of simulations. The behavioral rating system can be improved; one particular difficulty was aggregating a single rating for a behavior that fluctuated over time. These performance assessment tools might be nseful for educational research or for tracking a resident's progress. The rating system needs more refinement before it can be used to assess clinical competence for residency graduation or board certification.

Original language	English (US)
Pages (from-to)	8-18
Number of pages	11
Journal	Anesthesiology
Volume	89
Issue number	1
DOIs	https://doi.org/10.1097/00000542-199807000-00005
State	Published - Jul 1998
Externally published	Yes

Keywords

Evaluation
Performance
Simulation
Teamwork
Testing

ASJC Scopus subject areas

Anesthesiology and Pain Medicine

Access to Document

10.1097/00000542-199807000-00005

Cite this

@article{ac41ef84ac874e6bb3919c63018dd21b,

title = "Assessment of clinical performance during simulated crises using both technical and behavioral ratings",

abstract = "Background: Techniques are needed to assess anesthesiologists' performance when responding to critical events. Patient simulators allow presentation of similar crisis situations to different clinicians. This study evaluated ratings of performance, and the interrater variability of the ratings, made by multiple independent observers viewing videotapes of simulated crises. Methods: Raters scored the videotapes of 14 different teams that were managing two scenarios: malignant hyperthermia (MH) and cardiac arrest. Technical performance and crisis management behaviors were rated. Technical ratings could range from 0.0 to 1.0 based on scenario-specific checklists of appropriate actions. Ratings of 12 crisis management behaviors were made using a five-point ordinal scale. Several statistical assessments of interrater variability were applied. Results: Technical ratings were high for most teams in both scenarios (0.78 ± 0.08 for MH, 0.83 ± 0.06 for cardiac arrest). Ratings of crisis management behavior varied, with some teams rated as minimally acceptable or poor (28% for MH, 14% for cardiac arrest). The agreement between raters was fair to excellent, depending on the item rated and the statistical test used. Conclusions: Both technical and behavioral performance can be assessed from videotapes of simulations. The behavioral rating system can be improved; one particular difficulty was aggregating a single rating for a behavior that fluctuated over time. These performance assessment tools might be nseful for educational research or for tracking a resident's progress. The rating system needs more refinement before it can be used to assess clinical competence for residency graduation or board certification.",

keywords = "Evaluation, Performance, Simulation, Teamwork, Testing",

author = "Gaba, {David M.} and Howard, {Steven K.} and Brendan Flanagan and Smith, {Brian E.} and Fish, {Kevin J.} and Richard Botney",

year = "1998",

month = jul,

doi = "10.1097/00000542-199807000-00005",

language = "English (US)",

volume = "89",

pages = "8--18",

journal = "Anesthesiology",

issn = "0003-3022",

publisher = "Lippincott Williams and Wilkins",

number = "1",

}

TY - JOUR

T1 - Assessment of clinical performance during simulated crises using both technical and behavioral ratings

AU - Gaba, David M.

AU - Howard, Steven K.

AU - Flanagan, Brendan

AU - Smith, Brian E.

AU - Fish, Kevin J.

AU - Botney, Richard

PY - 1998/7

Y1 - 1998/7

N2 - Background: Techniques are needed to assess anesthesiologists' performance when responding to critical events. Patient simulators allow presentation of similar crisis situations to different clinicians. This study evaluated ratings of performance, and the interrater variability of the ratings, made by multiple independent observers viewing videotapes of simulated crises. Methods: Raters scored the videotapes of 14 different teams that were managing two scenarios: malignant hyperthermia (MH) and cardiac arrest. Technical performance and crisis management behaviors were rated. Technical ratings could range from 0.0 to 1.0 based on scenario-specific checklists of appropriate actions. Ratings of 12 crisis management behaviors were made using a five-point ordinal scale. Several statistical assessments of interrater variability were applied. Results: Technical ratings were high for most teams in both scenarios (0.78 ± 0.08 for MH, 0.83 ± 0.06 for cardiac arrest). Ratings of crisis management behavior varied, with some teams rated as minimally acceptable or poor (28% for MH, 14% for cardiac arrest). The agreement between raters was fair to excellent, depending on the item rated and the statistical test used. Conclusions: Both technical and behavioral performance can be assessed from videotapes of simulations. The behavioral rating system can be improved; one particular difficulty was aggregating a single rating for a behavior that fluctuated over time. These performance assessment tools might be nseful for educational research or for tracking a resident's progress. The rating system needs more refinement before it can be used to assess clinical competence for residency graduation or board certification.

AB - Background: Techniques are needed to assess anesthesiologists' performance when responding to critical events. Patient simulators allow presentation of similar crisis situations to different clinicians. This study evaluated ratings of performance, and the interrater variability of the ratings, made by multiple independent observers viewing videotapes of simulated crises. Methods: Raters scored the videotapes of 14 different teams that were managing two scenarios: malignant hyperthermia (MH) and cardiac arrest. Technical performance and crisis management behaviors were rated. Technical ratings could range from 0.0 to 1.0 based on scenario-specific checklists of appropriate actions. Ratings of 12 crisis management behaviors were made using a five-point ordinal scale. Several statistical assessments of interrater variability were applied. Results: Technical ratings were high for most teams in both scenarios (0.78 ± 0.08 for MH, 0.83 ± 0.06 for cardiac arrest). Ratings of crisis management behavior varied, with some teams rated as minimally acceptable or poor (28% for MH, 14% for cardiac arrest). The agreement between raters was fair to excellent, depending on the item rated and the statistical test used. Conclusions: Both technical and behavioral performance can be assessed from videotapes of simulations. The behavioral rating system can be improved; one particular difficulty was aggregating a single rating for a behavior that fluctuated over time. These performance assessment tools might be nseful for educational research or for tracking a resident's progress. The rating system needs more refinement before it can be used to assess clinical competence for residency graduation or board certification.

KW - Evaluation

KW - Performance

KW - Simulation

KW - Teamwork

KW - Testing

UR - http://www.scopus.com/inward/record.url?scp=0031849239&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031849239&partnerID=8YFLogxK

U2 - 10.1097/00000542-199807000-00005

DO - 10.1097/00000542-199807000-00005

M3 - Article

C2 - 9667288

AN - SCOPUS:0031849239

SN - 0003-3022

VL - 89

SP - 8

EP - 18

JO - Anesthesiology

JF - Anesthesiology

IS - 1

ER -

Assessment of clinical performance during simulated crises using both technical and behavioral ratings

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this