-
2D and 3D Deep Learning Models for MRI-based Parkinson's Disease Classification: A Comparative Analysis of Convolutional Kolmogorov-Arnold Networks, Convolutional Neural Networks, and Graph Convolutional Networks
Authors:
Salil B Patel,
Vicky Goh,
James F FitzGerald,
Chrystalina A Antoniades
Abstract:
Parkinson's Disease (PD) diagnosis remains challenging. This study applies Convolutional Kolmogorov-Arnold Networks (ConvKANs), integrating learnable spline-based activation functions into convolutional layers, for PD classification using structural MRI. The first 3D implementation of ConvKANs for medical imaging is presented, comparing their performance to Convolutional Neural Networks (CNNs) and…
▽ More
Parkinson's Disease (PD) diagnosis remains challenging. This study applies Convolutional Kolmogorov-Arnold Networks (ConvKANs), integrating learnable spline-based activation functions into convolutional layers, for PD classification using structural MRI. The first 3D implementation of ConvKANs for medical imaging is presented, comparing their performance to Convolutional Neural Networks (CNNs) and Graph Convolutional Networks (GCNs) across three open-source datasets. Isolated analyses assessed performance within individual datasets, using cross-validation techniques. Holdout analyses evaluated cross-dataset generalizability by training models on two datasets and testing on the third, mirroring real-world clinical scenarios. In isolated analyses, 2D ConvKANs achieved the highest AUC of 0.99 (95% CI: 0.98-0.99) on the PPMI dataset, outperforming 2D CNNs (AUC: 0.97, p = 0.0092). 3D models showed promise, with 3D CNN and 3D ConvKAN reaching an AUC of 0.85 on PPMI. In holdout analyses, 3D ConvKAN demonstrated superior generalization, achieving an AUC of 0.85 on early-stage PD data. GCNs underperformed in 2D but improved in 3D implementations. These findings highlight ConvKANs' potential for PD detection, emphasize the importance of 3D analysis in capturing subtle brain changes, and underscore cross-dataset generalization challenges. This study advances AI-assisted PD diagnosis using structural MRI and emphasizes the need for larger-scale validation.
△ Less
Submitted 26 September, 2024; v1 submitted 24 July, 2024;
originally announced July 2024.
-
Hierarchical Machine Learning Classification of Parkinsonian Disorders using Saccadic Eye Movements: A Development and Validation Study
Authors:
Salil B Patel,
Oliver B Bredemeyer,
James J FitzGerald,
Chrystalina A Antoniades
Abstract:
Discriminating between Parkinson's Disease (PD) and Progressive Supranuclear Palsy (PSP) is difficult due to overlapping symptoms, especially early on. Saccades (rapid conjugate eye movements between fixation points) are affected by both diseases but conventional saccade analyses exhibit group level differences only. We hypothesized analyzing entire saccade raw time series waveforms would permit s…
▽ More
Discriminating between Parkinson's Disease (PD) and Progressive Supranuclear Palsy (PSP) is difficult due to overlapping symptoms, especially early on. Saccades (rapid conjugate eye movements between fixation points) are affected by both diseases but conventional saccade analyses exhibit group level differences only. We hypothesized analyzing entire saccade raw time series waveforms would permit superior individual level discrimination between PD, PSP, and healthy controls (HC). 13,309 saccadic eye movements from 127 participants were analyzed using a novel, calibration-free waveform analysis and hierarchical machine learning framework. Individual saccades were classified based on which trained model could reconstruct each waveform with minimum error, indicating the most likely condition. A hierarchical classifier then predicted overall status (recently diagnosed and medication-naive 'de novo' PD, 'established' PD on antiparkinsonian medication, PSP, and healthy controls) by combining each participant's saccade results. This approach substantially outperformed conventional metrics, achieving high AUROCs distinguishing de novo PD from PSP (0.92-0.97), de novo PD from HC (0.72-0.89), and PSP from HC (0.90-0.95), while the conventional model showed limited performance (AUROC range: 0.45-0.75). This calibration-free waveform analysis sets a new standard for precise saccadic classification of PD, PSP, and HC, increasing potential for clinical adoption, remote monitoring, and screening.
△ Less
Submitted 24 July, 2024; v1 submitted 22 July, 2024;
originally announced July 2024.
-
Global Human-guided Counterfactual Explanations for Molecular Properties via Reinforcement Learning
Authors:
Danqing Wang,
Antonis Antoniades,
Kha-Dinh Luong,
Edwin Zhang,
Mert Kosan,
Jiachen Li,
Ambuj Singh,
William Yang Wang,
Lei Li
Abstract:
Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations…
▽ More
Counterfactual explanations of Graph Neural Networks (GNNs) offer a powerful way to understand data that can naturally be represented by a graph structure. Furthermore, in many domains, it is highly desirable to derive data-driven global explanations or rules that can better explain the high-level properties of the models and data in question. However, evaluating global counterfactual explanations is hard in real-world datasets due to a lack of human-annotated ground truth, which limits their use in areas like molecular sciences. Additionally, the increasing scale of these datasets provides a challenge for random search-based methods. In this paper, we develop a novel global explanation model RLHEX for molecular property prediction. It aligns the counterfactual explanations with human-defined principles, making the explanations more interpretable and easy for experts to evaluate. RLHEX includes a VAE-based graph generator to generate global explanations and an adapter to adjust the latent representation space to human-defined principles. Optimized by Proximal Policy Optimization (PPO), the global explanations produced by RLHEX cover 4.12% more input graphs and reduce the distance between the counterfactual explanation set and the input set by 0.47% on average across three molecular datasets. RLHEX provides a flexible framework to incorporate different human-designed principles into the counterfactual explanation generation process, aligning these explanations with domain expertise. The code and data are released at https://github.com/dqwang122/RLHEX.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Neuroformer: Multimodal and Multitask Generative Pretraining for Brain Data
Authors:
Antonis Antoniades,
Yiyi Yu,
Joseph Canzano,
William Wang,
Spencer LaVere Smith
Abstract:
State-of-the-art systems neuroscience experiments yield large-scale multimodal data, and these data sets require new tools for analysis. Inspired by the success of large pretrained models in vision and language domains, we reframe the analysis of large-scale, cellular-resolution neuronal spiking data into an autoregressive spatiotemporal generation problem. Neuroformer is a multimodal, multitask g…
▽ More
State-of-the-art systems neuroscience experiments yield large-scale multimodal data, and these data sets require new tools for analysis. Inspired by the success of large pretrained models in vision and language domains, we reframe the analysis of large-scale, cellular-resolution neuronal spiking data into an autoregressive spatiotemporal generation problem. Neuroformer is a multimodal, multitask generative pretrained transformer (GPT) model that is specifically designed to handle the intricacies of data in systems neuroscience. It scales linearly with feature size, can process an arbitrary number of modalities, and is adaptable to downstream tasks, such as predicting behavior. We first trained Neuroformer on simulated datasets, and found that it both accurately predicted simulated neuronal circuit activity, and also intrinsically inferred the underlying neural circuit connectivity, including direction. When pretrained to decode neural responses, the model predicted the behavior of a mouse with only few-shot fine-tuning, suggesting that the model begins learning how to do so directly from the neural representations themselves, without any explicit supervision. We used an ablation study to show that joint training on neuronal responses and behavior boosted performance, highlighting the model's ability to associate behavioral and neural representations in an unsupervised manner. These findings show that Neuroformer can analyze neural datasets and their emergent properties, informing the development of models and hypotheses associated with the brain.
△ Less
Submitted 15 March, 2024; v1 submitted 31 October, 2023;
originally announced November 2023.