A similarity-based approach to leverage multi-cohort medical data on the diagnosis and prognosis of Alzheimer's disease

The Alzheimer's Disease Neuroimaging Initiative

Research output: Contribution to journalArticle


Motivation: Heterogeneous diseases such as Alzheimer's disease (AD) manifest a variety of phenotypes among populations. Early diagnosis and effective treatment offer cost benefits. Many studies on biochemical and imaging markers have shown potential promise in improving diagnosis, yet establishing quantitative diagnostic criteria for ancillary tests remains challenging. Results: We have developed a similarity-based approach that matches individuals to subjects with similar conditions. We modeled the disease with a Gaussian process, and tested the method in the Alzheimer's Disease Big Data DREAM Challenge. Ranked the highest among submitted methods, our diagnostic model predicted cognitive impairment scores in an independent dataset test with a correlation score of 0.573. It differentiated AD patients from control subjects with an area under the receiver operating curve of 0.920. Without knowing longitudinal information about subjects, the model predicted patients who are vulnerable to conversion from mild-cognitive impairment to AD through the similarity network. This diagnostic framework can be applied to other diseases with clinical heterogeneity, such as Parkinson's disease.

Original languageEnglish (US)
Article numbergiy085
Issue number7
StatePublished - Jul 1 2018



  • Alzheimer's Disease
  • Gaussian Process Regression
  • Kernel method
  • Machine learning
  • Patient similarity network

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Cite this