Machine Learning Algorithm Helps Identify Non-Diagnosed Prodromal Alzheimer’s Disease Patients in the General Population

  • Original Research
  • The Journal of Prevention of Alzheimer's Disease

Abstract

Background

Recruiting patients for clinical trials of potential therapies for Alzheimer’s disease (AD) remains a major challenge, with demand for trial participants at an all-time high. The AD treatment R&D pipeline includes around 112 agents. In the United States alone, 150 clinical trials are seeking 70,000 participants. Most people with early cognitive impairment consult primary care providers, who may lack time, diagnostic skills and awareness of local clinical trials. Machine learning and predictive analytics offer promise to boost enrollment by predicting which patients have prodromal AD, and which will go on to develop AD.

Objectives

The authors set out to develop a machine learning predictive model that identifies prodromal AD patients in the general population, to aid early AD detection by primary care physicians and timely referral to expert sites for biomarker confirmation of diagnosis and clinical trial enrollment.

Design

The authors used a classification machine learning algorithm to extract patterns from healthcare claims and prescription data three years before AD diagnosis or AD drug initiation.
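One way to read the three-year offset is as a cut-off on which records may feed the model: only claims dated at least three years before the first AD diagnosis or AD prescription are kept. The sketch below illustrates that reading in plain Python. The 24-month window length, the record layout, the event labels, and all function names are assumptions made for illustration; the abstract does not describe the authors' actual extraction logic.

```python
from datetime import date
from typing import Dict, List, Tuple

OFFSET_YEARS = 3       # offset before the first AD diagnosis/treatment (from the abstract)
LOOKBACK_MONTHS = 24   # ASSUMPTION: window length, borrowed from the 24-month history criterion


def shift_months(d: date, months: int) -> date:
    """Move a date by whole calendar months (day clamped to 28 to stay valid)."""
    total = d.year * 12 + (d.month - 1) + months
    return date(total // 12, total % 12 + 1, min(d.day, 28))


def feature_window(index_date: date) -> Tuple[date, date]:
    """Return (start, end) of the hypothetical feature window for one subject."""
    end = shift_months(index_date, -12 * OFFSET_YEARS)
    start = shift_months(end, -LOOKBACK_MONTHS)
    return start, end


def claims_in_window(claims: List[Dict], index_date: date) -> List[Dict]:
    """Keep only claims dated inside [start, end) of the feature window."""
    start, end = feature_window(index_date)
    return [c for c in claims if start <= c["service_date"] < end]


# Hypothetical subject whose first AD record is dated 2018-07-15.
index_date = date(2018, 7, 15)
claims = [
    {"event": "office_visit", "service_date": date(2014, 1, 10)},    # inside the window
    {"event": "lab_test", "service_date": date(2012, 5, 1)},         # too early
    {"event": "ad_prescription", "service_date": date(2018, 7, 15)}, # the index event itself
]
window = feature_window(index_date)
kept = claims_in_window(claims, index_date)
```

Excluding everything after the cut-off keeps the diagnosis itself, and any work-up immediately preceding it, out of the predictors, which is the point of training on a prodromal-stage signal.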

Setting

The study focused on subjects included within proprietary IQVIA US data assets (claims and prescription databases). Patient information was extracted from January 2010 to July 2018, for cohorts aged between 50 and 85 years.

Participants

A total of 88,298,289 subjects aged between 50 and 85 years were identified. For the positive cohort, 667,288 subjects were identified who had 24 months of medical history and at least one record with AD or AD treatment. For the negative cohort, 3,670,254 patients were selected who had a similar length of medical history and who were matched to positive cohort subjects based on the prevalence rate. The scoring cohort was selected based on availability of recent medical data of 2–5 years and included 72,670,283 subjects between the ages of 50 and 85 years.
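The cohort sizes above fix the class balance of the labelled sample, which matters when reading precision-recall metrics later. The short calculation below uses only the counts reported in this section; the variable names are illustrative.

```python
# Cohort sizes as reported in the abstract.
positive = 667_288     # AD diagnosis or AD treatment on record
negative = 3_670_254   # matched subjects without such a record

negatives_per_positive = negative / positive
positive_share = positive / (positive + negative)

print(f"negatives per positive: {negatives_per_positive:.2f}")    # 5.50
print(f"positive share of labelled sample: {positive_share:.1%}")  # 15.4%
```

If the test split preserves this balance (an assumption; the abstract says only that the split was random), a classifier that ranks subjects at random would be expected to score an area under the precision-recall curve close to the positive share, about 0.15.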

Intervention (if any)

None.

Measurements

A list of clinically relevant and interpretable predictors was generated and extracted from the data sets for each subject, including pharmacological treatments (NDC/product), office and specialist visits (specialty), tests and procedures (HCPCS and CPT), and diagnoses (ICD). The positive cohort was defined as patients who had an AD diagnosis or AD treatment, with a 3-year offset as an estimate for prodromal AD diagnosis. Supervised ML techniques were used to develop algorithms to predict the occurrence of prodromal AD cases. The sample dataset was divided randomly into a training dataset and a test dataset. The classification models were trained and executed in the PySpark framework: LogisticRegression, DecisionTreeClassifier, RandomForestClassifier, and GBTClassifier were trained and evaluated using PySpark’s mllib module. The area under the precision-recall curve (AUCPR) was used to compare the results of the various models.
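To make the comparison metric concrete, the following is a minimal pure-Python sketch of one common way to compute the area under the precision-recall curve, the step-wise "average precision" sum. It is not Spark's implementation, which may interpolate the curve differently, and the function name and toy data are hypothetical.

```python
def average_precision(labels, scores):
    """Step-wise area under the precision-recall curve.

    labels: iterable of 0/1 ground-truth values
    scores: iterable of model scores (higher means more likely positive)
    Tied scores are not grouped; they are taken in sorted order.
    """
    labels = list(labels)
    n_pos = sum(labels)
    if n_pos == 0:
        raise ValueError("need at least one positive label")
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    true_pos = 0
    area = 0.0
    for rank, (_, label) in enumerate(ranked, start=1):
        if label == 1:
            true_pos += 1
            area += true_pos / rank  # precision at the rank where recall increases
    return area / n_pos


# Toy example: positives sit at ranks 1 and 3 of five scored subjects.
toy_labels = [1, 0, 1, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.1]
ap = average_precision(toy_labels, toy_scores)  # (1/1 + 2/3) / 2
```

Unlike the area under the ROC curve, this metric depends on the share of positives in the evaluation data, which makes it a natural choice when positives are the minority class and precision among flagged subjects is what matters.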

Results

The AUCPRs were 0.426, 0.157, 0.436, and 0.440 for LogisticRegression, DecisionTreeClassifier, RandomForestClassifier, and GBTClassifier, respectively, so GBTClassifier (Gradient Boosted Tree) outperformed the other three classifiers. The GBT model identified 222,721 subjects in the prodromal AD stage with 80% precision. Some 76% of identified prodromal AD patients were in the primary care setting.
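Reporting a count of identified subjects "with 80% precision" implies that a score cut-off was chosen to reach a target precision. The abstract does not say how the cut-off was set, so the sketch below shows one plausible procedure under that assumption: on labelled data, find the lowest score threshold at which precision among flagged subjects is still at least the target. The function name and toy data are hypothetical.

```python
def threshold_for_precision(labels, scores, target=0.80):
    """Return (threshold, n_flagged, precision) for the lowest cut-off whose
    precision on the labelled data is >= target, or None if none qualifies."""
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)
    best = None
    true_pos = 0
    for i, (score, label) in enumerate(ranked, start=1):
        true_pos += label
        # Evaluate only at the end of a group of tied scores, since a
        # threshold cannot separate subjects with identical scores.
        if i < len(ranked) and ranked[i][0] == score:
            continue
        precision = true_pos / i
        if precision >= target:
            best = (score, i, precision)
    return best


# Toy labelled test set, already paired with model scores.
toy_labels = [1, 1, 1, 0, 1, 0, 0, 0]
toy_scores = [0.95, 0.90, 0.85, 0.80, 0.70, 0.60, 0.40, 0.20]
cutoff = threshold_for_precision(toy_labels, toy_scores, target=0.80)
```

In the toy data, flagging every subject scoring 0.70 or higher selects five subjects, four of them true positives. Applied to the reported result, and assuming test-set precision carries over to the scoring cohort, 80% precision on 222,721 flagged subjects would correspond to roughly 178,000 true prodromal cases.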

Conclusions

When the developed predictive model was applied to 72,670,283 U.S. residents, it identified 222,721 prodromal AD patients, the majority of whom were in the primary care setting. This could drive major advances in AD research by enabling more accurate and earlier prodromal AD diagnosis at the primary care physician level, which would facilitate timely referral to expert sites for in-depth assessment and potential enrollment in clinical trials.

Author information

Corresponding author

Correspondence to Sam Khinda.

Cite this article

Uspenskaya-Cadoz, O., Alamuri, C., Wang, L. et al. Machine Learning Algorithm Helps Identify Non-Diagnosed Prodromal Alzheimer’s Disease Patients in the General Population. J Prev Alzheimers Dis 6, 185–191 (2019). https://doi.org/10.14283/jpad.2019.10

  • DOI: https://doi.org/10.14283/jpad.2019.10
