Use of machine learning techniques in the development and refinement of a predictive model for early diagnosis of ankylosing spondylitis

Atulya (Atul) Deodhar, Martin Rozycki, Cody Garges, Oodaye Shukla, Theresa Arndt, Tara Grabowsky, Yujin Park

Research output: Contribution to journalArticle

Abstract

Objective: To develop a predictive mathematical model for the early identification of ankylosing spondylitis (AS) based on the medical and pharmacy claims history of patients with and without AS. Methods: This retrospective study used claims data from Truven databases from January 2006 to September 2015 (Segment 1) and October 2015 to February 2018 (Segment 2). Machine learning identified features differentiating patients with AS from matched controls; selected features were used as inputs in developing Model A/B to identify patients likely to have AS. Model A/B was trained and developed in Segment 1, and patients predicted to have AS in Segment 1 were followed up in Segment 2 to evaluate the predictive capability of Model A/B. Results: Of 228,471 patients in Segment 1 without any history of AS, Model A/B predicted 1923 patients to have AS. Ultimately, 1242 patients received an AS diagnosis in Segment 2; 120 of these were correctly predicted by Model A/B, yielding a positive predictive value (PPV) of 6.24%. The diagnostic accuracy of Model A/B compared favorably with that of a clinical model (PPV, 1.29%) that predicted AS based on spondyloarthritis features described in the Assessment of SpondyloArthritis international Society classification criteria. A simplified linear regression model created to test the operability of Model A/B yielded a lower PPV (2.55%). Conclusions: Model A/B performed better than a clinically based model in predicting a diagnosis of AS among patients in a large claims database; its use may contribute to early recognition of AS and a timely diagnosis.

Original languageEnglish (US)
JournalClinical Rheumatology
DOIs
StatePublished - Jan 1 2019

Fingerprint

Ankylosing Spondylitis
Early Diagnosis
Linear Models
Machine Learning
Databases
Theoretical Models
Retrospective Studies

Keywords

  • Ankylosing spondylitis
  • Machine learning
  • Mathematical model
  • Spondyloarthritis

ASJC Scopus subject areas

  • Rheumatology

Cite this

Use of machine learning techniques in the development and refinement of a predictive model for early diagnosis of ankylosing spondylitis. / Deodhar, Atulya (Atul); Rozycki, Martin; Garges, Cody; Shukla, Oodaye; Arndt, Theresa; Grabowsky, Tara; Park, Yujin.

In: Clinical Rheumatology, 01.01.2019.

Research output: Contribution to journalArticle

@article{a752058669a44feeaefbb45d4b53910f,
title = "Use of machine learning techniques in the development and refinement of a predictive model for early diagnosis of ankylosing spondylitis",
abstract = "Objective: To develop a predictive mathematical model for the early identification of ankylosing spondylitis (AS) based on the medical and pharmacy claims history of patients with and without AS. Methods: This retrospective study used claims data from Truven databases from January 2006 to September 2015 (Segment 1) and October 2015 to February 2018 (Segment 2). Machine learning identified features differentiating patients with AS from matched controls; selected features were used as inputs in developing Model A/B to identify patients likely to have AS. Model A/B was trained and developed in Segment 1, and patients predicted to have AS in Segment 1 were followed up in Segment 2 to evaluate the predictive capability of Model A/B. Results: Of 228,471 patients in Segment 1 without any history of AS, Model A/B predicted 1923 patients to have AS. Ultimately, 1242 patients received an AS diagnosis in Segment 2; 120 of these were correctly predicted by Model A/B, yielding a positive predictive value (PPV) of 6.24{\%}. The diagnostic accuracy of Model A/B compared favorably with that of a clinical model (PPV, 1.29{\%}) that predicted AS based on spondyloarthritis features described in the Assessment of SpondyloArthritis international Society classification criteria. A simplified linear regression model created to test the operability of Model A/B yielded a lower PPV (2.55{\%}). Conclusions: Model A/B performed better than a clinically based model in predicting a diagnosis of AS among patients in a large claims database; its use may contribute to early recognition of AS and a timely diagnosis.",
keywords = "Ankylosing spondylitis, Machine learning, Mathematical model, Spondyloarthritis",
author = "Deodhar, {Atulya (Atul)} and Martin Rozycki and Cody Garges and Oodaye Shukla and Theresa Arndt and Tara Grabowsky and Yujin Park",
year = "2019",
month = "1",
day = "1",
doi = "10.1007/s10067-019-04553-x",
language = "English (US)",
journal = "Clinical Rheumatology",
issn = "0770-3198",
publisher = "Springer London",

}

TY - JOUR

T1 - Use of machine learning techniques in the development and refinement of a predictive model for early diagnosis of ankylosing spondylitis

AU - Deodhar, Atulya (Atul)

AU - Rozycki, Martin

AU - Garges, Cody

AU - Shukla, Oodaye

AU - Arndt, Theresa

AU - Grabowsky, Tara

AU - Park, Yujin

PY - 2019/1/1

Y1 - 2019/1/1

N2 - Objective: To develop a predictive mathematical model for the early identification of ankylosing spondylitis (AS) based on the medical and pharmacy claims history of patients with and without AS. Methods: This retrospective study used claims data from Truven databases from January 2006 to September 2015 (Segment 1) and October 2015 to February 2018 (Segment 2). Machine learning identified features differentiating patients with AS from matched controls; selected features were used as inputs in developing Model A/B to identify patients likely to have AS. Model A/B was trained and developed in Segment 1, and patients predicted to have AS in Segment 1 were followed up in Segment 2 to evaluate the predictive capability of Model A/B. Results: Of 228,471 patients in Segment 1 without any history of AS, Model A/B predicted 1923 patients to have AS. Ultimately, 1242 patients received an AS diagnosis in Segment 2; 120 of these were correctly predicted by Model A/B, yielding a positive predictive value (PPV) of 6.24%. The diagnostic accuracy of Model A/B compared favorably with that of a clinical model (PPV, 1.29%) that predicted AS based on spondyloarthritis features described in the Assessment of SpondyloArthritis international Society classification criteria. A simplified linear regression model created to test the operability of Model A/B yielded a lower PPV (2.55%). Conclusions: Model A/B performed better than a clinically based model in predicting a diagnosis of AS among patients in a large claims database; its use may contribute to early recognition of AS and a timely diagnosis.

AB - Objective: To develop a predictive mathematical model for the early identification of ankylosing spondylitis (AS) based on the medical and pharmacy claims history of patients with and without AS. Methods: This retrospective study used claims data from Truven databases from January 2006 to September 2015 (Segment 1) and October 2015 to February 2018 (Segment 2). Machine learning identified features differentiating patients with AS from matched controls; selected features were used as inputs in developing Model A/B to identify patients likely to have AS. Model A/B was trained and developed in Segment 1, and patients predicted to have AS in Segment 1 were followed up in Segment 2 to evaluate the predictive capability of Model A/B. Results: Of 228,471 patients in Segment 1 without any history of AS, Model A/B predicted 1923 patients to have AS. Ultimately, 1242 patients received an AS diagnosis in Segment 2; 120 of these were correctly predicted by Model A/B, yielding a positive predictive value (PPV) of 6.24%. The diagnostic accuracy of Model A/B compared favorably with that of a clinical model (PPV, 1.29%) that predicted AS based on spondyloarthritis features described in the Assessment of SpondyloArthritis international Society classification criteria. A simplified linear regression model created to test the operability of Model A/B yielded a lower PPV (2.55%). Conclusions: Model A/B performed better than a clinically based model in predicting a diagnosis of AS among patients in a large claims database; its use may contribute to early recognition of AS and a timely diagnosis.

KW - Ankylosing spondylitis

KW - Machine learning

KW - Mathematical model

KW - Spondyloarthritis

UR - http://www.scopus.com/inward/record.url?scp=85065235512&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85065235512&partnerID=8YFLogxK

U2 - 10.1007/s10067-019-04553-x

DO - 10.1007/s10067-019-04553-x

M3 - Article

JO - Clinical Rheumatology

JF - Clinical Rheumatology

SN - 0770-3198

ER -