Applying Machine Learning to High-Dimensional Proteomics Datasets for Biomarker Discovery in Neurodegenerative Disorders

Ivarsson, Christoffer
Rosberg, Oscar
Göteborgs universitet/Institutionen för data- och informationsteknikswe
University of Gothenburg/Department of Computer Science and Engineeringeng
2024-10-16T13:37:37Z
2024-10-16T13:37:37Z
2024-10-16
Identifying biomarkers for Alzheimer’s Disease (AD), a progressive neurodegenerative disorder characterized by progressive cognitive decline is crucial for early diagnosis and treatment. This thesis explores proteomic abundances along the AD continuum using lumbar and ventricular cerebrospinal fluid (CSF) samples from patients with idiopathic normal pressure hydrocephalus (iNPH) to identify potential new biomarkers. Our study emphasizes the necessity of treating lumbar and ventricular CSF samples as separate datasets due to their distinct proteomic profiles. Challenges such as handling high-dimensional data with missing values, small sample sizes and class imbalances were addressed through imputation, oversampling and k-fold cross-validation techniques. We discuss the presence and consequence of batch effect, a remnant of the mass spectrometry technique tandem mass tag. Comparative analysis through staging on existing biomarkers highlights the uniqueness of the dataset provided by Sahlgrenska University Hospital. Through machine learning and feature selection techniques, we propose eight protein and nine peptide biomarkers for distinguishing iNPH patients on the pathological AD spectra. One such biomarker shows relevance in both lumbar and ventricular CSF. Despite the study’s limited cohort size, our findings contribute insights into the proteomic analysis of neurodegenerative disorders.sv
https://hdl.handle.net/2077/83684
Technology
Alzheimer’s diseasesv
neurodegenerative disordersv
proteomicssv
mass spectrometrysv
high-dimensional datasv
biomarkerssv
machine learningsv
feature selectionsv
stagingsv
Applying Machine Learning to High-Dimensional Proteomics Datasets for Biomarker Discovery in Neurodegenerative Disorderssv
text
Student essay
H2

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
CSE 24-28 CIO OR.pdf
Size:
3.11 MB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
876 B
Format:
Item-specific license agreed upon to submission
Description:

Collections