ORCID
- Carroll, Camille: 0000-0001-7472-953X
- Luo, Shouqing: 0000-0002-7998-3059
Abstract
As the population ages, neurodegenerative diseases are becoming more prevalent, making it crucial to understand the underlying disease mechanisms and to identify biomarkers that allow for early diagnosis and effective screening for clinical trials. Thanks to advancements in gene expression profiling, it is now possible to search for disease biomarkers on an unprecedented scale. Here we applied a selection of five machine learning (ML) approaches, combined with multiple feature selection methods, to identify blood-based biomarkers for Alzheimer's disease (AD) and Parkinson's disease (PD). Based on ROC AUC performance, one optimal random forest (RF) model was discovered for AD with 159 gene markers (ROC AUC = 0.886), while one optimal RF model was discovered for PD (ROC AUC = 0.743). Additionally, deep learning approaches were applied alongside the traditional ML approaches to evaluate their potential for future work. We demonstrated that convolutional neural networks perform consistently well across both the Alzheimer's (ROC AUC = 0.810) and Parkinson's (ROC AUC = 0.715) datasets, suggesting their potential in gene expression biomarker detection with further tuning of their architecture.
DOI
10.1038/s41598-023-43956-4
Publication Date
2023-10-11
Publication Title
Scientific Reports
Volume
13
Issue
1
ISSN
2045-2322
Organisational Unit
University of Plymouth
Recommended Citation
Kelly, J., Moyeed, R., Carroll, C., Luo, S., & Li, X. (2023) 'Blood biomarker-based classification study for neurodegenerative diseases', Scientific Reports, 13(1). Available at: https://doi.org/10.1038/s41598-023-43956-4