Examining Reliability and Validity of an Online Score (ALiEM AIR) for Rating Free Open Access Medical Education Resources

Teresa Man Yee Chan; Andrew Grock; Michael Paddock; Kulamakan Kulasegaram; Lalena M. Yarris; Michelle Lin

doi:10.1016/j.annemergmed.2016.02.018

Examining Reliability and Validity of an Online Score (ALiEM AIR) for Rating Free Open Access Medical Education Resources

Teresa Man Yee Chan, Andrew Grock, Michael Paddock, Kulamakan Kulasegaram, Lalena M. Yarris, Michelle Lin

Emergency Medicine

Research output: Contribution to journal › Article › peer-review

50 Scopus citations

Abstract

Study objective Since 2014, Academic Life in Emergency Medicine (ALiEM) has used the Approved Instructional Resources (AIR) score to critically appraise online content. The primary goals of this study are to determine the interrater reliability (IRR) of the ALiEM AIR rating score and determine its correlation with expert educator gestalt. We also determine the minimum number of educator-raters needed to achieve acceptable reliability. Methods Eight educators each rated 83 online educational posts with the ALiEM AIR scale. Items include accuracy, usage of evidence-based medicine, referencing, utility, and the Best Evidence in Emergency Medicine rating score. A generalizability study was conducted to determine IRR and rating variance contributions of facets such as rater, blogs, posts, and topic. A randomized selection of 40 blog posts previously rated through ALiEM AIR was then rated again by a blinded group of expert medical educators according to their gestalt. Their gestalt impression was subsequently correlated with the ALiEM AIR score. Results The IRR for the ALiEM AIR rating scale was 0.81 during the 6-month pilot period. Decision studies showed that at least 9 raters were required to achieve this reliability. Spearman correlations between mean AIR score and the mean expert gestalt ratings were 0.40 for recommendation for learners and 0.35 for their colleagues. Conclusion The ALiEM AIR scale is a moderately to highly reliable, 5-question tool when used by medical educators for rating online resources. The score displays a fair correlation with expert educator gestalt in regard to the quality of the resources. The score displays a fair correlation with educator gestalt.

Original language	English (US)
Pages (from-to)	729-735
Number of pages	7
Journal	Annals of emergency medicine
Volume	68
Issue number	6
DOIs	https://doi.org/10.1016/j.annemergmed.2016.02.018
State	Published - Dec 1 2016

ASJC Scopus subject areas

Emergency Medicine

Access to Document

10.1016/j.annemergmed.2016.02.018

Cite this

@article{8037be9605414199bba64f8b0c96ff06,

title = "Examining Reliability and Validity of an Online Score (ALiEM AIR) for Rating Free Open Access Medical Education Resources",

abstract = "Study objective Since 2014, Academic Life in Emergency Medicine (ALiEM) has used the Approved Instructional Resources (AIR) score to critically appraise online content. The primary goals of this study are to determine the interrater reliability (IRR) of the ALiEM AIR rating score and determine its correlation with expert educator gestalt. We also determine the minimum number of educator-raters needed to achieve acceptable reliability. Methods Eight educators each rated 83 online educational posts with the ALiEM AIR scale. Items include accuracy, usage of evidence-based medicine, referencing, utility, and the Best Evidence in Emergency Medicine rating score. A generalizability study was conducted to determine IRR and rating variance contributions of facets such as rater, blogs, posts, and topic. A randomized selection of 40 blog posts previously rated through ALiEM AIR was then rated again by a blinded group of expert medical educators according to their gestalt. Their gestalt impression was subsequently correlated with the ALiEM AIR score. Results The IRR for the ALiEM AIR rating scale was 0.81 during the 6-month pilot period. Decision studies showed that at least 9 raters were required to achieve this reliability. Spearman correlations between mean AIR score and the mean expert gestalt ratings were 0.40 for recommendation for learners and 0.35 for their colleagues. Conclusion The ALiEM AIR scale is a moderately to highly reliable, 5-question tool when used by medical educators for rating online resources. The score displays a fair correlation with expert educator gestalt in regard to the quality of the resources. The score displays a fair correlation with educator gestalt.",

author = "Chan, {Teresa Man Yee} and Andrew Grock and Michael Paddock and Kulamakan Kulasegaram and Yarris, {Lalena M.} and Michelle Lin",

note = "Publisher Copyright: {\textcopyright} 2016 American College of Emergency Physicians",

year = "2016",

month = dec,

day = "1",

doi = "10.1016/j.annemergmed.2016.02.018",

language = "English (US)",

volume = "68",

pages = "729--735",

journal = "Annals of emergency medicine",

issn = "0196-0644",

publisher = "Mosby Inc.",

number = "6",

}

TY - JOUR

T1 - Examining Reliability and Validity of an Online Score (ALiEM AIR) for Rating Free Open Access Medical Education Resources

AU - Chan, Teresa Man Yee

AU - Grock, Andrew

AU - Paddock, Michael

AU - Kulasegaram, Kulamakan

AU - Yarris, Lalena M.

AU - Lin, Michelle

PY - 2016/12/1

Y1 - 2016/12/1

N2 - Study objective Since 2014, Academic Life in Emergency Medicine (ALiEM) has used the Approved Instructional Resources (AIR) score to critically appraise online content. The primary goals of this study are to determine the interrater reliability (IRR) of the ALiEM AIR rating score and determine its correlation with expert educator gestalt. We also determine the minimum number of educator-raters needed to achieve acceptable reliability. Methods Eight educators each rated 83 online educational posts with the ALiEM AIR scale. Items include accuracy, usage of evidence-based medicine, referencing, utility, and the Best Evidence in Emergency Medicine rating score. A generalizability study was conducted to determine IRR and rating variance contributions of facets such as rater, blogs, posts, and topic. A randomized selection of 40 blog posts previously rated through ALiEM AIR was then rated again by a blinded group of expert medical educators according to their gestalt. Their gestalt impression was subsequently correlated with the ALiEM AIR score. Results The IRR for the ALiEM AIR rating scale was 0.81 during the 6-month pilot period. Decision studies showed that at least 9 raters were required to achieve this reliability. Spearman correlations between mean AIR score and the mean expert gestalt ratings were 0.40 for recommendation for learners and 0.35 for their colleagues. Conclusion The ALiEM AIR scale is a moderately to highly reliable, 5-question tool when used by medical educators for rating online resources. The score displays a fair correlation with expert educator gestalt in regard to the quality of the resources. The score displays a fair correlation with educator gestalt.

AB - Study objective Since 2014, Academic Life in Emergency Medicine (ALiEM) has used the Approved Instructional Resources (AIR) score to critically appraise online content. The primary goals of this study are to determine the interrater reliability (IRR) of the ALiEM AIR rating score and determine its correlation with expert educator gestalt. We also determine the minimum number of educator-raters needed to achieve acceptable reliability. Methods Eight educators each rated 83 online educational posts with the ALiEM AIR scale. Items include accuracy, usage of evidence-based medicine, referencing, utility, and the Best Evidence in Emergency Medicine rating score. A generalizability study was conducted to determine IRR and rating variance contributions of facets such as rater, blogs, posts, and topic. A randomized selection of 40 blog posts previously rated through ALiEM AIR was then rated again by a blinded group of expert medical educators according to their gestalt. Their gestalt impression was subsequently correlated with the ALiEM AIR score. Results The IRR for the ALiEM AIR rating scale was 0.81 during the 6-month pilot period. Decision studies showed that at least 9 raters were required to achieve this reliability. Spearman correlations between mean AIR score and the mean expert gestalt ratings were 0.40 for recommendation for learners and 0.35 for their colleagues. Conclusion The ALiEM AIR scale is a moderately to highly reliable, 5-question tool when used by medical educators for rating online resources. The score displays a fair correlation with expert educator gestalt in regard to the quality of the resources. The score displays a fair correlation with educator gestalt.

UR - http://www.scopus.com/inward/record.url?scp=84962208274&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84962208274&partnerID=8YFLogxK

U2 - 10.1016/j.annemergmed.2016.02.018

DO - 10.1016/j.annemergmed.2016.02.018

M3 - Article

C2 - 27033141

AN - SCOPUS:84962208274

SN - 0196-0644

VL - 68

SP - 729

EP - 735

JO - Annals of emergency medicine

JF - Annals of emergency medicine

IS - 6

ER -

Examining Reliability and Validity of an Online Score (ALiEM AIR) for Rating Free Open Access Medical Education Resources

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this