Performance of Machine Learning Algorithms for Predicting Progression to Dementia in Memory Clinic Patients

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Key Points

Question

Can machine learning algorithms accurately predict 2-year dementia incidence in memory clinic patients and how do these predictions compare with existing models?

Findings

In this prognostic study of data from 15 307 memory clinic attendees without dementia, machine learning algorithms were superior in their ability to predict incident dementia within 2 years compared with 2 existing predictive models. Machine learning algorithms required only 6 variables to reach an accuracy of at least 90%, and had an area under the receiver operating characteristic curve of 0.89.

Meaning

These findings suggest that machine learning algorithms could be used to accurately predict 2-year dementia risk and may form the basis for a clinical decision-making aid.

Abstract

This prognostic study assesses the ability of novel machine learning algorithms compared with existing risk prediction models to predict dementia incidence within 2 years.

Abstract

Importance

Machine learning algorithms could be used as the basis for clinical decision-making aids to enhance clinical practice.

Objective

To assess the ability of machine learning algorithms to predict dementia incidence within 2 years compared with existing models and determine the optimal analytic approach and number of variables required.

Design, Setting, and Participants

This prognostic study used data from a prospective cohort of 15 307 participants without dementia at baseline to perform a secondary analysis of factors that could be used to predict dementia incidence. Participants attended National Alzheimer Coordinating Center memory clinics across the United States between 2005 and 2015. Analyses were conducted from March to May 2021.

Exposures

258 variables spanning domains of dementia-related clinical measures and risk factors.

Main Outcomes and Measures

The main outcome was incident all-cause dementia diagnosed within 2 years of baseline assessment.

Results

In a sample of 15 307 participants (mean [SD] age, 72.3 [9.8] years; 9129 [60%] women and 6178 [40%] men) without dementia at baseline, 1568 (10%) received a diagnosis of dementia within 2 years of their initial assessment. Compared with 2 existing models for dementia risk prediction (ie, Cardiovascular Risk Factors, Aging, and Incidence of Dementia Risk Score, and the Brief Dementia Screening Indicator), machine learning algorithms were superior in predicting incident all-cause dementia within 2 years. The gradient-boosted trees algorithm had a mean (SD) overall accuracy of 92% (1%), sensitivity of 0.45 (0.05), specificity of 0.97 (0.01), and area under the curve of 0.92 (0.01) using all 258 variables. Analysis of variable importance showed that only 6 variables were required for machine learning algorithms to achieve an accuracy of 91% and area under the curve of at least 0.89. Machine learning algorithms also identified up to 84% of participants who received an initial dementia diagnosis that was subsequently reversed to mild cognitive impairment or cognitively unimpaired, suggesting possible misdiagnosis.

Conclusions and Relevance

These findings suggest that machine learning algorithms could accurately predict incident dementia within 2 years in patients receiving care at memory clinics using only 6 variables. These findings could be used to inform the development and validation of decision-making aids in memory clinics.

Related collections

Most cited references 40

Record: found
Abstract: not found
Article: not found

Random Forests

Leo Breiman (2001)

0 comments Cited 7134 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

Support-vector networks

Corinna Cortes, Vladimir Vapnik (1995)

0 comments Cited 2742 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: not found
Article: not found

An introduction to ROC analysis

Tom Fawcett (2006)

0 comments Cited 1562 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-ta): JAMA Netw Open

Journal ID (iso-abbrev): JAMA Netw Open

Title: JAMA Network Open

Publisher: American Medical Association

ISSN (Electronic): 2574-3805

Publication date (Electronic): 16 December 2021

Publication date Collection: December 2021

Publication date PMC-release: 16 December 2021

Volume: 4

Issue: 12

Electronic Location Identifier: e2136553

Affiliations

[1 ]University of Exeter Medical School, Exeter, United Kingdom

[2 ]Deep Dementia Phenotyping Network, United Kingdom

[3 ]Department of Computer Science, University of Exeter, Exeter, United Kingdom

[4 ]The Alan Turing Institute, London, United Kingdom

Author notes

Article Information

Accepted for Publication: October 4, 2021.

Published: December 16, 2021. doi:10.1001/jamanetworkopen.2021.36553

Corresponding Author: David J. Llewellyn, PhD, University of Exeter Medical School, Veysey Building, College House, St Luke’s Campus, Exeter, Devon EX2 4SG, United Kingdom ( david.llewellyn@ 123456exeter.ac.uk ).

Author Contributions: Dr James had full access to all of the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Concept and design: All authors.

Acquisition, analysis, or interpretation of data: James, Everson, Llewellyn.

Drafting of the manuscript: James, Everson, Llewellyn.

Critical revision of the manuscript for important intellectual content: Ranson, Llewellyn.

Statistical analysis: James, Everson.

Obtained funding: Llewellyn.

Supervision: Everson, Llewellyn.

Conflict of Interest Disclosures: Dr Llewellyn reported receiving personal fees from the National Institute on Aging of the National Institutes of Health, Alzheimer’s Research UK, Mind over Matter Medtech, and SharpTx outside the submitted work. No other disclosures were reported.

Funding/Support: Drs James and Ranson are supported by Alzheimer’s Research UK (ARUK). Dr Llewellyn is supported by the National Institute for Health Research Applied Research Collaboration South West Peninsula, ARUK, National Health and Medical Research Council, JP Moulton Foundation, National Institute on Aging of the National Institutes of Health (NIA/NIH; grant No. RF1AG055654), Alan Turing Institute/Engineering and Physical Sciences Research Council (grant No. EP/N510129/1). The National Alzheimer Coordinating Center database is funded by NIA/NIH (grant No. U01 AG016976). NACC data are contributed by the NIA-funded Aging and Disability Resource Centers (grant No. P30 AG019610, P30 AG013846, P50 AG008702, P50 AG025688, P50 AG047266, P30 AG010133, P50 AG005146, P50 AG005134, P50 AG016574, P50 AG005138, P30 AG008051, P30 AG013854, P30 AG008017, P30 AG010161, P50 AG047366, P30 AG010129, P50 AG016573, P50 AG005131, P50 AG023501, P30 AG035982, P30 AG028383, P30 AG053760, P30 AG010124, P50 AG005133, P50 AG005142, P30 AG012300, P30 AG049638, P50 AG005136, P50 AG033514, P50 AG005681, and P50 AG047270).

Role of the Funder/Sponsor: The funders had no role in the design and conduct of the study; collection, management, analysis, and interpretation of the data; preparation, review, or approval of the manuscript; and decision to submit the manuscript for publication.

Additional Contributions: Fliss Guest, PhD (Department of Computer Science, University of Exeter, Exeter, United Kingdom), assisted with cleaning and preprocessing the data and was not compensated for this work.

Article

Publisher ID: zoi211030

DOI: 10.1001/jamanetworkopen.2021.36553

PMC ID: 8678688

PubMed ID: 34913981

SO-VID: 29391ee4-a92a-4610-8f52-2b98569df09c

License:

This is an open access article distributed under the terms of the CC-BY License.

History

Date received : 9 June 2021

Date accepted : 4 October 2021

Comments

Comment on this article

scite_

Cited by 24

See all cited by

Most referenced authors 1,708

See all reference authors

- Version 1

Performance of Machine Learning Algorithms for Predicting Progression to Dementia in Memory Clinic Patients

Read this article at

Key Points

Question

Findings

Meaning

Abstract

Abstract

Importance

Objective

Design, Setting, and Participants

Exposures

Main Outcomes and Measures

Results

Conclusions and Relevance

Related collections

Annual Reviews AI, Machine Learning, and Society

Most cited references 40

Random Forests

Support-vector networks

An introduction to ROC analysis

Author and article information

Journal

Affiliations

Author notes

Article

History

Categories

Comments

Comment on this article

Similar content 226

Cited by 24

Most referenced authors 1,708