Workplace-Based Entrustment Scales for the Core EPAs: A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters

Michael S. Ryan; Asra R. Khan; Yoon Soo Park; Cody Chastain; Carrie Phillipi; Sally A. Santen; Beth A. Barron; Vivian Obeso; Sandra L. Yingling

doi:10.1097/ACM.0000000000004222

Pediatrics

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Purpose: In undergraduate medical education (UME), competency-based medical education has been operationalized through the 13 Core Entrustable Professional Activities for Entering Residency (Core EPAs). Direct observation in the workplace using rigorous, valid, reliable measures is required to inform summative decisions about graduates' readiness for residency. The purpose of this study is to investigate the validity evidence of 2 proposed workplace-based entrustment scales.

Method: The authors of this multisite, randomized, experimental study used structured vignettes and experienced raters to examine validity evidence of the Ottawa scale and the UME supervisory tool (Chen scale) in 2019. The authors used a series of 8 cases (6 developed de novo) depicting learners at preentrustable (less-developed) and entrustable (more-developed) skill levels across 5 Core EPAs. Participants from Core EPA pilot institutions rated learner performance using either the Ottawa or Chen scale. The authors used descriptive statistics and analysis of variance to examine data trends and compare ratings, conducted interrater reliability and generalizability studies to evaluate consistency among participants, and performed a content analysis of narrative comments.

Results: Fifty clinician-educators from 10 institutions participated, yielding 579 discrete EPA assessments. Both Ottawa and Chen scales differentiated between less- and more-developed skill levels (P < .001). The intraclass correlation was good to excellent for all EPAs using Ottawa (range, 0.68-0.91) and fair to excellent using Chen (range, 0.54-0.83). Generalizability analysis revealed substantial variance in ratings attributable to the learner-EPA interaction (59.6% for Ottawa; 48.9% for Chen), suggesting that variability in ratings was appropriately associated with performance on individual EPAs.

Conclusions: In a structured setting, both the Ottawa and Chen scales distinguished between preentrustable and entrustable learners; however, the Ottawa scale demonstrated more desirable characteristics. These findings represent a critical step forward in developing valid, reliable instruments to measure learner progression toward entrustment for the Core EPAs.
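
As an illustration of the interrater reliability statistic reported in the abstract (the intraclass correlation coefficient, ICC), the following is a minimal sketch of one common form, ICC(2,1): two-way random effects, absolute agreement, single rater. The abstract does not state which ICC form the authors used, so that choice is an assumption. The function name `icc2_1` and the ratings matrix are hypothetical and invented for illustration; they are not study data.

```python
def icc2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    ratings: list of rows, one row per rated vignette, one column per rater,
    with no missing cells. This form is an assumed choice, not the study's.
    """
    n = len(ratings)          # number of rated targets (rows)
    k = len(ratings[0])       # number of raters (columns)
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares without replication
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((v - grand) ** 2 for row in ratings for v in row)
    ss_err = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    return (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )


# Hypothetical ratings: 4 vignettes (rows) scored by 3 raters (columns)
# on a 1-5 entrustment scale. Invented values, for illustration only.
example = [
    [2, 2, 3],
    [4, 5, 4],
    [1, 2, 2],
    [5, 4, 5],
]
print(round(icc2_1(example), 2))
```

Values closer to 1 indicate that raters assign similar scores to the same vignette; the "fair", "good", and "excellent" labels in the abstract are interpretive bands applied to this kind of coefficient.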

Original language	English (US)
Pages (from-to)	544-551
Number of pages	8
Journal	Academic Medicine
Volume	97
Issue number	4
DOIs	https://doi.org/10.1097/ACM.0000000000004222
State	Published - Apr 1 2022

ASJC Scopus subject areas

Education

Cite this

Ryan, M. S., Khan, A. R., Park, Y. S., Chastain, C., Phillipi, C., Santen, S. A., Barron, B. A., Obeso, V., & Yingling, S. L. (2022). Workplace-Based Entrustment Scales for the Core EPAs: A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters. Academic Medicine, 97(4), 544-551. https://doi.org/10.1097/ACM.0000000000004222

Ryan, MS, Khan, AR, Park, YS, Chastain, C, Phillipi, C, Santen, SA, Barron, BA, Obeso, V & Yingling, SL 2022, 'Workplace-Based Entrustment Scales for the Core EPAs: A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters', Academic Medicine, vol. 97, no. 4, pp. 544-551. https://doi.org/10.1097/ACM.0000000000004222

@article{9dc33ea6171c479db70a6799d0e3b818,
title = "Workplace-Based Entrustment Scales for the Core EPAs: A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters",
author = "Ryan, {Michael S.} and Khan, {Asra R.} and Park, {Yoon Soo} and Cody Chastain and Carrie Phillipi and Santen, {Sally A.} and Barron, {Beth A.} and Vivian Obeso and Yingling, {Sandra L.}",
note = "Publisher Copyright: {\textcopyright} 2021 by the Association of American Medical Colleges.",
year = "2022",
month = apr,
day = "1",
doi = "10.1097/ACM.0000000000004222",
language = "English (US)",
volume = "97",
pages = "544--551",
journal = "Academic Medicine",
issn = "1040-2446",
publisher = "Lippincott Williams and Wilkins",
number = "4",
}

TY - JOUR
T1 - Workplace-Based Entrustment Scales for the Core EPAs
T2 - A Multisite Comparison of Validity Evidence for Two Proposed Instruments Using Structured Vignettes and Trained Raters
AU - Ryan, Michael S.
AU - Khan, Asra R.
AU - Park, Yoon Soo
AU - Chastain, Cody
AU - Phillipi, Carrie
AU - Santen, Sally A.
AU - Barron, Beth A.
AU - Obeso, Vivian
AU - Yingling, Sandra L.
PY - 2022/4/1
Y1 - 2022/4/1
UR - http://www.scopus.com/inward/record.url?scp=85127974446&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127974446&partnerID=8YFLogxK
U2 - 10.1097/ACM.0000000000004222
DO - 10.1097/ACM.0000000000004222
M3 - Article
C2 - 34192721
AN - SCOPUS:85127974446
SN - 1040-2446
VL - 97
SP - 544
EP - 551
JO - Academic Medicine
JF - Academic Medicine
IS - 4
ER -
