Skip to main content

Showing 1–50 of 60 results for author: Thompson, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.18209  [pdf, ps, other

    cs.CV cs.AI

    Deep Learning-based Alignment Measurement in Knee Radiographs

    Authors: Zhisen Hu, Dominic Cullen, Peter Thompson, David Johnson, Chang Bian, Aleksei Tiulpin, Timothy Cootes, Claudia Lindner

    Abstract: Radiographic knee alignment (KA) measurement is important for predicting joint health and surgical outcomes after total knee replacement. Traditional methods for KA measurements are manual, time-consuming and require long-leg radiographs. This study proposes a deep learning-based method to measure KA in anteroposterior knee radiographs via automatically localized knee anatomical landmarks. Our met… ▽ More

    Submitted 22 June, 2025; originally announced June 2025.

    Comments: Accepted to MICCAI 2025

  2. arXiv:2505.14917  [pdf, ps, other

    cs.CL

    ConspEmoLLM-v2: A robust and stable model to detect sentiment-transformed conspiracy theories

    Authors: Zhiwei Liu, Paul Thompson, Jiaqi Rong, Sophia Ananiadou

    Abstract: Despite the many benefits of large language models (LLMs), they can also cause harm, e.g., through automatic generation of misinformation, including conspiracy theories. Moreover, LLMs can also ''disguise'' conspiracy theories by altering characteristic textual features, e.g., by transforming their typically strong negative emotions into a more positive tone. Although several studies have proposed… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: work in progress

  3. arXiv:2504.15267  [pdf, other

    cs.CV

    Diffusion Bridge Models for 3D Medical Image Translation

    Authors: Shaorong Zhang, Tamoghna Chattopadhyay, Sophia I. Thomopoulos, Jose-Luis Ambite, Paul M. Thompson, Greg Ver Steeg

    Abstract: Diffusion tensor imaging (DTI) provides crucial insights into the microstructure of the human brain, but it can be time-consuming to acquire compared to more readily available T1-weighted (T1w) magnetic resonance imaging (MRI). To address this challenge, we propose a diffusion bridge model for 3D brain image translation between T1w MRI and DTI modalities. Our model learns to generate high-quality… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  4. arXiv:2411.19617  [pdf, other

    cond-mat.mtrl-sci cs.LG

    Materials Learning Algorithms (MALA): Scalable Machine Learning for Electronic Structure Calculations in Large-Scale Atomistic Simulations

    Authors: Attila Cangi, Lenz Fiedler, Bartosz Brzoza, Karan Shah, Timothy J. Callow, Daniel Kotik, Steve Schmerler, Matthew C. Barry, James M. Goff, Andrew Rohskopf, Dayton J. Vogel, Normand Modine, Aidan P. Thompson, Sivasankaran Rajamanickam

    Abstract: We present the Materials Learning Algorithms (MALA) package, a scalable machine learning framework designed to accelerate density functional theory (DFT) calculations suitable for large-scale atomistic simulations. Using local descriptors of the atomic environment, MALA models efficiently predict key electronic observables, including local density of states, electronic density, density of states,… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

  5. arXiv:2411.09618  [pdf, other

    physics.med-ph cs.LG

    MICCAI-CDMRI 2023 QuantConn Challenge Findings on Achieving Robust Quantitative Connectivity through Harmonized Preprocessing of Diffusion MRI

    Authors: Nancy R. Newlin, Kurt Schilling, Serge Koudoro, Bramsh Qamar Chandio, Praitayini Kanakaraj, Daniel Moyer, Claire E. Kelly, Sila Genc, Jian Chen, Joseph Yuan-Mou Yang, Ye Wu, Yifei He, Jiawei Zhang, Qingrun Zeng, Fan Zhang, Nagesh Adluru, Vishwesh Nath, Sudhir Pathak, Walter Schneider, Anurag Gade, Yogesh Rathi, Tom Hendriks, Anna Vilanova, Maxime Chamberland, Tomasz Pieciak , et al. (11 additional authors not shown)

    Abstract: White matter alterations are increasingly implicated in neurological diseases and their progression. International-scale studies use diffusion-weighted magnetic resonance imaging (DW-MRI) to qualitatively identify changes in white matter microstructure and connectivity. Yet, quantitative analysis of DW-MRI data is hindered by inconsistencies stemming from varying acquisition protocols. There is a… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

    Comments: Accepted for publication at the Journal of Machine Learning for Biomedical Imaging (MELBA) https://melba-journal.org/2024/019

    Journal ref: Machine.Learning.for.Biomedical.Imaging. 2 (2024)

  6. Distributed Harmonization: Federated Clustered Batch Effect Adjustment and Generalization

    Authors: Bao Hoang, Yijiang Pang, Siqi Liang, Liang Zhan, Paul Thompson, Jiayu Zhou

    Abstract: Independent and identically distributed (i.i.d.) data is essential to many data analysis and modeling techniques. In the medical domain, collecting data from multiple sites or institutions is a common strategy that guarantees sufficient clinical diversity, determined by the decentralized nature of medical data. However, data from various sites are easily biased by the local environment or faciliti… ▽ More

    Submitted 7 August, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 11 pages, 7 figures, accepted to KDD2024-ADS

  7. arXiv:2405.13190  [pdf, other

    cs.LG cs.AI

    Interpretable Spatio-Temporal Embedding for Brain Structural-Effective Network with Ordinary Differential Equation

    Authors: Haoteng Tang, Guodong Liu, Siyuan Dai, Kai Ye, Kun Zhao, Wenlu Wang, Carl Yang, Lifang He, Alex Leow, Paul Thompson, Heng Huang, Liang Zhan

    Abstract: The MRI-derived brain network serves as a pivotal instrument in elucidating both the structural and functional aspects of the brain, encompassing the ramifications of diseases and developmental processes. However, prevailing methodologies, often focusing on synchronous BOLD signals from functional MRI (fMRI), may not capture directional influences among brain regions and rarely tackle temporal fun… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  8. ConspEmoLLM: Conspiracy Theory Detection Using an Emotion-Based Large Language Model

    Authors: Zhiwei Liu, Boyang Liu, Paul Thompson, Kailai Yang, Sophia Ananiadou

    Abstract: The internet has brought both benefits and harms to society. A prime example of the latter is misinformation, including conspiracy theories, which flood the web. Recent advances in natural language processing, particularly the emergence of large language models (LLMs), have improved the prospects of accurate misinformation detection. However, most LLM-based approaches to conspiracy theory detectio… ▽ More

    Submitted 12 August, 2024; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: Work in progress

  9. arXiv:2402.01505  [pdf, other

    cs.CL

    Code-Switched Language Identification is Harder Than You Think

    Authors: Laurie Burchell, Alexandra Birch, Robert P. Thompson, Kenneth Heafield

    Abstract: Code switching (CS) is a very common phenomenon in written and spoken communication but one that is handled poorly by many natural language processing applications. Looking to the application of building CS corpora, we explore CS language identification (LID) for corpus building. We make the task more realistic by scaling it to more languages and considering models with simpler architectures for f… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  10. arXiv:2311.11046  [pdf

    q-bio.QM cs.LG q-bio.NC

    Classification of Major Depressive Disorder Using Vertex-Wise Brain Sulcal Depth, Curvature, and Thickness with a Deep and a Shallow Learning Model

    Authors: Roberto Goya-Maldonado, Tracy Erwin-Grabner, Ling-Li Zeng, Christopher R. K. Ching, Andre Aleman, Alyssa R. Amod, Zeynep Basgoze, Francesco Benedetti, Bianca Besteher, Katharina Brosch, Robin Bülow, Romain Colle, Colm G. Connolly, Emmanuelle Corruble, Baptiste Couvy-Duchesne, Kathryn Cullen, Udo Dannlowski, Christopher G. Davey, Annemiek Dols, Jan Ernsting, Jennifer W. Evans, Lukas Fisch, Paola Fuentes-Claramonte, Ali Saffet Gonul, Ian H. Gotlib , et al. (62 additional authors not shown)

    Abstract: Major depressive disorder (MDD) is a complex psychiatric disorder that affects the lives of hundreds of millions of individuals around the globe. Even today, researchers debate if morphological alterations in the brain are linked to MDD, likely due to the heterogeneity of this disorder. The application of deep learning tools to neuroimaging data, capable of capturing complex non-linear patterns, h… ▽ More

    Submitted 24 January, 2025; v1 submitted 18 November, 2023; originally announced November 2023.

    Comments: arXiv admin note: text overlap with arXiv:2206.08122

  11. Emotion Detection for Misinformation: A Review

    Authors: Zhiwei Liu, Tianlin Zhang, Kailai Yang, Paul Thompson, Zeping Yu, Sophia Ananiadou

    Abstract: With the advent of social media, an increasing number of netizens are sharing and reading posts and news online. However, the huge volumes of misinformation (e.g., fake news and rumors) that flood the internet can adversely affect people's lives, and have resulted in the emergence of rumor and fake news detection as a hot research topic. The emotions and sentiments of netizens, as expressed in soc… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: 30 pages, 11 figures

  12. Agent-based models of social behaviour and communication in evacuations: A systematic review

    Authors: Anne Templeton, Hui Xie, Steve Gwynne, Aoife Hunt, Pete Thompson, Gerta Köster

    Abstract: Most modern agent-based evacuation models involve interactions between evacuees. However, the assumed reasons for interactions and portrayal of them may be overly simple. Research from social psychology suggests that people interact and communicate with one another when evacuating and evacuee response is impacted by the way information is communicated. Thus, we conducted a systematic review of age… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Pre-print submitted to Safety Science special issue following the 2023 Pedestrian and Evacuation Dynamics conference

  13. arXiv:2309.07352  [pdf

    q-bio.GN cs.LG eess.IV q-bio.QM

    Tackling the dimensions in imaging genetics with CLUB-PLS

    Authors: Andre Altmann, Ana C Lawry Aguila, Neda Jahanshad, Paul M Thompson, Marco Lorenzi

    Abstract: A major challenge in imaging genetics and similar fields is to link high-dimensional data in one domain, e.g., genetic data, to high dimensional data in a second domain, e.g., brain imaging data. The standard approach in the area are mass univariate analyses across genetic factors and imaging phenotypes. That entails executing one genome-wide association study (GWAS) for each pre-defined imaging m… ▽ More

    Submitted 19 September, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 12 pages, 4 Figures, 2 Tables

  14. arXiv:2309.04651  [pdf

    eess.IV cs.AI cs.CV

    Video and Synthetic MRI Pre-training of 3D Vision Architectures for Neuroimage Analysis

    Authors: Nikhil J. Dhinagar, Amit Singh, Saket Ozarkar, Ketaki Buwa, Sophia I. Thomopoulos, Conor Owens-Walton, Emily Laltoo, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Paul M. Thompson

    Abstract: Transfer learning represents a recent paradigm shift in the way we build artificial intelligence (AI) systems. In contrast to training task-specific models, transfer learning involves pre-training deep learning models on a large corpus of data and minimally fine-tuning them for adaptation to specific tasks. Even so, for 3D medical imaging tasks, we do not know if it is best to pre-train models on… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  15. arXiv:2309.04607  [pdf

    cs.CL cs.AI

    Linking Symptom Inventories using Semantic Textual Similarity

    Authors: Eamonn Kennedy, Shashank Vadlamani, Hannah M Lindsey, Kelly S Peterson, Kristen Dams OConnor, Kenton Murray, Ronak Agarwal, Houshang H Amiri, Raeda K Andersen, Talin Babikian, David A Baron, Erin D Bigler, Karen Caeyenberghs, Lisa Delano-Wood, Seth G Disner, Ekaterina Dobryakova, Blessen C Eapen, Rachel M Edelstein, Carrie Esopenko, Helen M Genova, Elbert Geuze, Naomi J Goodrich-Hunsaker, Jordan Grafman, Asta K Haberg, Cooper B Hodges , et al. (57 additional authors not shown)

    Abstract: An extensive library of symptom inventories has been developed over time to measure clinical symptoms, but this variety has led to several long standing issues. Most notably, results drawn from different settings and studies are not comparable, which limits reproducibility. Here, we present an artificial intelligence (AI) approach using semantic textual similarity (STS) to link symptoms and scores… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

  16. Algebraic Reasoning About Timeliness

    Authors: Seyed Hossein Haeri, Peter W. Thompson, Peter Van Roy, Magne Haveraaen, Neil J. Davies, Mikhail Barash, Kevin Hammond, James Chapman

    Abstract: Designing distributed systems to have predictable performance under high load is difficult because of resource exhaustion, non-linearity, and stochastic behaviour. Timeliness, i.e., delivering results within defined time bounds, is a central aspect of predictable performance. In this paper, we focus on timeliness using the DELTA-Q Systems Development paradigm (DELTA-QSD, developed by PNSol), which… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: In Proceedings ICE 2023, arXiv:2308.08920

    ACM Class: B.8.2; C.4; D.2.4; D.2.8; F.3.2; F.3.1; F.4.1; F.4.3; I.1.1

    Journal ref: EPTCS 383, 2023, pp. 35-54

  17. arXiv:2305.16222  [pdf, ps, other

    eess.IV cs.CV cs.LG q-bio.NC

    Incomplete Multimodal Learning for Complex Brain Disorders Prediction

    Authors: Reza Shirkavand, Liang Zhan, Heng Huang, Li Shen, Paul M. Thompson

    Abstract: Recent advancements in the acquisition of various brain data sources have created new opportunities for integrating multimodal brain data to assist in early detection of complex brain disorders. However, current data integration approaches typically need a complete set of biomedical data modalities, which may not always be feasible, as some modalities are only available in large-scale research coh… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  18. arXiv:2304.00134  [pdf

    physics.med-ph cs.AI

    A Surface-Based Federated Chow Test Model for Integrating APOE Status, Tau Deposition Measure, and Hippocampal Surface Morphometry

    Authors: Jianfeng Wu, Yi Su, Yanxi Chen, Wenhui Zhu, Eric M. Reiman, Richard J. Caselli, Kewei Chen, Paul M. Thompson, Junwen Wang, Yalin Wang

    Abstract: Background: Alzheimer's Disease (AD) is the most common type of age-related dementia, affecting 6.2 million people aged 65 or older according to CDC data. It is commonly agreed that discovering an effective AD diagnosis biomarker could have enormous public health benefits, potentially preventing or delaying up to 40% of dementia cases. Tau neurofibrillary tangles are the primary driver of downstre… ▽ More

    Submitted 31 March, 2023; originally announced April 2023.

  19. arXiv:2303.08224  [pdf

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Few-Shot Classification of Autism Spectrum Disorder using Site-Agnostic Meta-Learning and Brain MRI

    Authors: Nikhil J. Dhinagar, Vignesh Santhalingam, Katherine E. Lawrence, Emily Laltoo, Paul M. Thompson

    Abstract: For machine learning applications in medical imaging, the availability of training data is often limited, which hampers the design of radiological classifiers for subtle conditions such as autism spectrum disorder (ASD). Transfer learning is one method to counter this problem of low training data regimes. Here we explore the use of meta-learning for very low data regimes in the context of having p… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  20. arXiv:2303.08216  [pdf

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Efficiently Training Vision Transformers on Structural MRI Scans for Alzheimer's Disease Detection

    Authors: Nikhil J. Dhinagar, Sophia I. Thomopoulos, Emily Laltoo, Paul M. Thompson

    Abstract: Neuroimaging of large populations is valuable to identify factors that promote or resist brain disease, and to assist diagnosis, subtyping, and prognosis. Data-driven models such as convolutional neural networks (CNNs) have increasingly been applied to brain images to perform diagnostic and prognostic tasks by learning robust features. Vision transformers (ViT) - a new class of deep learning archi… ▽ More

    Submitted 14 March, 2023; originally announced March 2023.

  21. arXiv:2303.01491  [pdf, other

    eess.IV cs.LG q-bio.QM

    Transferring Models Trained on Natural Images to 3D MRI via Position Encoded Slice Models

    Authors: Umang Gupta, Tamoghna Chattopadhyay, Nikhil Dhinagar, Paul M. Thompson, Greg Ver Steeg, The Alzheimer's Disease Neuroimaging Initiative

    Abstract: Transfer learning has remarkably improved computer vision. These advances also promise improvements in neuroimaging, where training set sizes are often small. However, various difficulties arise in directly applying models pretrained on natural images to radiologic images, such as MRIs. In particular, a mismatch in the input space (2D images vs. 3D MRIs) restricts the direct transfer of models, of… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

    Comments: To appear at IEEE International Symposium on Biomedical Imaging 2023 (ISBI 2023). Code is available at https://github.com/umgupta/2d-slice-set-networks

  22. arXiv:2302.13631  [pdf

    eess.IV cs.AI cs.CV cs.LG q-bio.QM

    Curriculum Based Multi-Task Learning for Parkinson's Disease Detection

    Authors: Nikhil J. Dhinagar, Conor Owens-Walton, Emily Laltoo, Christina P. Boyle, Yao-Liang Chen, Philip Cook, Corey McMillan, Chih-Chien Tsai, J-J Wang, Yih-Ru Wu, Ysbrand van der Werf, Paul M. Thompson

    Abstract: There is great interest in developing radiological classifiers for diagnosis, staging, and predictive modeling in progressive diseases such as Parkinson's disease (PD), a neurodegenerative disease that is difficult to detect in its early stages. Here we leverage severity-based meta-data on the stages of disease to define a curriculum for training a deep convolutional neural network (CNN). Typicall… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: Accepted for publication at the 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023

  23. arXiv:2211.05235  [pdf

    physics.med-ph cs.LG

    Improved Prediction of Beta-Amyloid and Tau Burden Using Hippocampal Surface Multivariate Morphometry Statistics and Sparse Coding

    Authors: Jianfeng Wu, Yi Su, Wenhui Zhu, Negar Jalili Mallak, Natasha Lepore, Eric M. Reiman, Richard J. Caselli, Paul M. Thompson, Kewei Chen, Yalin Wang

    Abstract: Background: Beta-amyloid (A$β$) plaques and tau protein tangles in the brain are the defining 'A' and 'T' hallmarks of Alzheimer's disease (AD), and together with structural atrophy detectable on brain magnetic resonance imaging (MRI) scans as one of the neurodegenerative ('N') biomarkers comprise the ''ATN framework'' of AD. Current methods to detect A$β$/tau pathology include cerebrospinal fluid… ▽ More

    Submitted 27 October, 2022; originally announced November 2022.

    Comments: 34 pages, 5 figures, 1 table, accepted by the Journal of Alzheimer's Disease

    MSC Class: 65U05

  24. arXiv:2208.11669  [pdf, other

    cs.LG cs.CR eess.IV q-bio.QM

    Towards Sparsified Federated Neuroimaging Models via Weight Pruning

    Authors: Dimitris Stripelis, Umang Gupta, Nikhil Dhinagar, Greg Ver Steeg, Paul Thompson, José Luis Ambite

    Abstract: Federated training of large deep neural networks can often be restrictive due to the increasing costs of communicating the updates with increasing model sizes. Various model pruning techniques have been designed in centralized settings to reduce inference times. Combining centralized pruning techniques with federated training seems intuitive for reducing communication costs -- by pruning the model… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

    Comments: Accepted to 3rd MICCAI Workshop on Distributed, Collaborative and Federated Learning (DeCaF, 2022)

  25. arXiv:2205.07854  [pdf, other

    cs.LG cs.AI cs.CV eess.IV q-bio.NC

    Functional2Structural: Cross-Modality Brain Networks Representation Learning

    Authors: Haoteng Tang, Xiyao Fu, Lei Guo, Yalin Wang, Scott Mackin, Olusola Ajilore, Alex Leow, Paul Thompson, Heng Huang, Liang Zhan

    Abstract: MRI-based modeling of brain networks has been widely used to understand functional and structural interactions and connections among brain regions, and factors that affect them, such as brain development and disease. Graph mining on brain networks may facilitate the discovery of novel biomarkers for clinical phenotypes and neurodegenerative diseases. Since brain networks derived from functional an… ▽ More

    Submitted 5 May, 2022; originally announced May 2022.

  26. arXiv:2205.05249  [pdf, other

    cs.LG cs.CR cs.CV cs.DC

    Secure & Private Federated Neuroimaging

    Authors: Dimitris Stripelis, Umang Gupta, Hamza Saleem, Nikhil Dhinagar, Tanmay Ghai, Rafael Chrysovalantis Anastasiou, Armaghan Asghar, Greg Ver Steeg, Srivatsan Ravi, Muhammad Naveed, Paul M. Thompson, Jose Luis Ambite

    Abstract: The amount of biomedical data continues to grow rapidly. However, collecting data from multiple sites for joint analysis remains challenging due to security, privacy, and regulatory concerns. To overcome this challenge, we use Federated Learning, which enables distributed training of neural network models over multiple data sources without sharing data. Each site trains the neural network over its… ▽ More

    Submitted 28 August, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

    Comments: 18 pages, 13 figures, 2 tables

    ACM Class: I.2; I.5.1; J.3

  27. arXiv:2110.10709  [pdf

    physics.med-ph cs.LG eess.IV

    Predicting Tau Accumulation in Cerebral Cortex with Multivariate MRI Morphometry Measurements, Sparse Coding, and Correntropy

    Authors: Jianfeng Wu, Wenhui Zhu, Yi Su, Jie Gui, Natasha Lepore, Eric M. Reiman, Richard J. Caselli, Paul M. Thompson, Kewei Chen, Yalin Wang

    Abstract: Biomarker-assisted diagnosis and intervention in Alzheimer's disease (AD) may be the key to prevention breakthroughs. One of the hallmarks of AD is the accumulation of tau plaques in the human brain. However, current methods to detect tau pathology are either invasive (lumbar puncture) or quite costly and not widely available (Tau PET). In our previous work, structural MRI-based hippocampal multiv… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: 10 pages, 5 figures, 17th International Symposium on Medical Information Processing and Analysis

  28. arXiv:2108.03437  [pdf, other

    cs.CR cs.LG

    Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption

    Authors: Dimitris Stripelis, Hamza Saleem, Tanmay Ghai, Nikhil Dhinagar, Umang Gupta, Chrysovalantis Anastasiou, Greg Ver Steeg, Srivatsan Ravi, Muhammad Naveed, Paul M. Thompson, Jose Luis Ambite

    Abstract: Federated learning (FL) enables distributed computation of machine learning models over various disparate, remote data sources, without requiring to transfer any individual data to a centralized location. This results in an improved generalizability of models and efficient scaling of computation as more sources and larger datasets are added to the federation. Nevertheless, recent membership attack… ▽ More

    Submitted 9 November, 2021; v1 submitted 7 August, 2021; originally announced August 2021.

    Comments: 9 pages, 3 figures, 1 algorithm

  29. arXiv:2105.02866  [pdf, other

    q-bio.QM cs.CR cs.LG eess.IV

    Membership Inference Attacks on Deep Regression Models for Neuroimaging

    Authors: Umang Gupta, Dimitris Stripelis, Pradeep K. Lam, Paul M. Thompson, José Luis Ambite, Greg Ver Steeg

    Abstract: Ensuring the privacy of research participants is vital, even more so in healthcare environments. Deep learning approaches to neuroimaging require large datasets, and this often necessitates sharing data between multiple sites, which is antithetical to the privacy objectives. Federated learning is a commonly proposed solution to this problem. It circumvents the need for data sharing by sharing para… ▽ More

    Submitted 3 June, 2021; v1 submitted 6 May, 2021; originally announced May 2021.

    Comments: To appear at Medical Imaging with Deep Learning 2021 (MIDL 2021)

  30. arXiv:2103.12420  [pdf, other

    cs.IR

    HSEarch: semantic search system for workplace accident reports

    Authors: Emrah Inan, Paul Thompson, Tim Yates, Sophia Ananiadou

    Abstract: Semantic search engines, which integrate the output of text mining (TM) methods, can significantly increase the ease and efficiency of finding relevant documents and locating important information within them. We present a novel search engine for the construction industry, HSEarch (http://www.nactem.ac.uk/hse/), which uses TM methods to provide semantically-enhanced, faceted search over a reposito… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

    Comments: Accepted to appear in ECIR 2021

  31. arXiv:2102.10503  [pdf, ps, other

    eess.IV cs.CV

    Predicting Future Cognitive Decline with Hyperbolic Stochastic Coding

    Authors: J. Zhang, Q. Dong, J. Shi, Q. Li, C. M. Stonnington, B. A. Gutman, K. Chen, E. M. Reiman, R. J. Caselli, P. M. Thompson, J. Ye, Y. Wang

    Abstract: Hyperbolic geometry has been successfully applied in modeling brain cortical and subcortical surfaces with general topological structures. However such approaches, similar to other surface based brain morphology analysis methods, usually generate high dimensional features. It limits their statistical power in cognitive decline prediction research, especially in datasets with limited subject number… ▽ More

    Submitted 20 February, 2021; originally announced February 2021.

  32. arXiv:2102.08440  [pdf, other

    cs.LG cs.DC

    Scaling Neuroscience Research using Federated Learning

    Authors: Dimitris Stripelis, Jose Luis Ambite, Pradeep Lam, Paul Thompson

    Abstract: The amount of biomedical data continues to grow rapidly. However, the ability to analyze these data is limited due to privacy and regulatory concerns. Machine learning approaches that require data to be copied to a single location are hampered by the challenges of data sharing. Federated Learning is a promising approach to learn a joint model over data silos. This architecture does not share any s… ▽ More

    Submitted 16 February, 2021; originally announced February 2021.

    Comments: To appear at IEEE International Symposium on Biomedical Imaging 2021 (ISBI 2021)

    MSC Class: 68T07 ACM Class: I.5.4

  33. arXiv:2102.04438  [pdf, other

    eess.IV cs.LG q-bio.QM

    Improved Brain Age Estimation with Slice-based Set Networks

    Authors: Umang Gupta, Pradeep K. Lam, Greg Ver Steeg, Paul M. Thompson

    Abstract: Deep Learning for neuroimaging data is a promising but challenging direction. The high dimensionality of 3D MRI scans makes this endeavor compute and data-intensive. Most conventional 3D neuroimaging methods use 3D-CNN-based architectures with a large number of parameters and require more time and data to train. Recently, 2D-slice-based models have received increasing attention as they have fewer… ▽ More

    Submitted 9 February, 2021; v1 submitted 8 February, 2021; originally announced February 2021.

    Comments: To appear at IEEE International Symposium on Biomedical Imaging 2021 (ISBI 2021). Code is available at https://git.io/JtazG

  34. arXiv:2011.12875  [pdf, other

    cs.DC cs.PF

    Rapid Exploration of Optimization Strategies on Advanced Architectures using TestSNAP and LAMMPS

    Authors: Rahulkumar Gayatri, Stan Moore, Evan Weinberg, Nicholas Lubbers, Sarah Anderson, Jack Deslippe, Danny Perez, Aidan P. Thompson

    Abstract: The exascale race is at an end with the announcement of the Aurora and Frontier machines. This next generation of supercomputers utilize diverse hardware architectures to achieve their compute performance, providing an added onus on the performance portability of applications. An expanding fragmentation of programming models would provide a compounding optimization challenge were it not for the ev… ▽ More

    Submitted 25 November, 2020; originally announced November 2020.

    Comments: Submitted to IPDPS 2021, October 19, 2020

  35. arXiv:2010.04905  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Accelerating Finite-temperature Kohn-Sham Density Functional Theory with Deep Neural Networks

    Authors: J. Austin Ellis, Lenz Fiedler, Gabriel A. Popoola, Normand A. Modine, J. Adam Stephens, Aidan P. Thompson, Attila Cangi, Sivasankaran Rajamanickam

    Abstract: We present a numerical modeling workflow based on machine learning (ML) which reproduces the the total energies produced by Kohn-Sham density functional theory (DFT) at finite electronic temperature to within chemical accuracy at negligible computational cost. Based on deep neural networks, our workflow yields the local density of states (LDOS) for a given atomic configuration. From the LDOS, spat… ▽ More

    Submitted 9 July, 2021; v1 submitted 10 October, 2020; originally announced October 2020.

    Journal ref: Phys. Rev. B 104, 035120 (2021)

  36. arXiv:2007.14787  [pdf, ps, other

    math.AG cs.SC eess.SY math.DS

    Parameter identifiability and input-output equations

    Authors: Alexey Ovchinnikov, Gleb Pogudin, Peter Thompson

    Abstract: Structural parameter identifiability is a property of a differential model with parameters that allows for the parameters to be determined from the model equations in the absence of noise. One of the standard approaches to assessing this problem is via input-output equations and, in particular, characteristic sets of differential ideals. The precise relation between identifiability and input-outpu… ▽ More

    Submitted 27 December, 2020; v1 submitted 27 July, 2020; originally announced July 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1910.03960

  37. arXiv:2007.09777  [pdf, other

    cs.CV

    Deep Representation Learning For Multimodal Brain Networks

    Authors: Wen Zhang, Liang Zhan, Paul Thompson, Yalin Wang

    Abstract: Applying network science approaches to investigate the functions and anatomy of the human brain is prevalent in modern medical imaging analysis. Due to the complex network topology, for an individual brain, mining a discriminative network representation from the multimodal brain networks is non-trivial. The recent success of deep learning techniques on graph-structured data suggests a new way to m… ▽ More

    Submitted 19 July, 2020; originally announced July 2020.

    Comments: 11 pages, 3 figures, MICCAI 2020

  38. arXiv:2006.00139  [pdf, other

    physics.comp-ph cs.CE physics.atom-ph

    Multi-fidelity machine-learning with uncertainty quantification and Bayesian optimization for materials design: Application to ternary random alloys

    Authors: Anh Tran, Julien Tranchida, Tim Wildey, Aidan P. Thompson

    Abstract: We present a scale-bridging approach based on a multi-fidelity (MF) machine-learning (ML) framework leveraging Gaussian processes (GP) to fuse atomistic computational model predictions across multiple levels of fidelity. Through the posterior variance of the MFGP, our framework naturally enables uncertainty quantification, providing estimates of confidence in the predictions. We used Density Funct… ▽ More

    Submitted 5 August, 2020; v1 submitted 29 May, 2020; originally announced June 2020.

  39. arXiv:2006.00115  [pdf, other

    q-bio.QM cs.CV cs.LG eess.IV

    Overview of Scanner Invariant Representations

    Authors: Daniel Moyer, Greg Ver Steeg, Paul M. Thompson

    Abstract: Pooled imaging data from multiple sources is subject to bias from each source. Studies that do not correct for these scanner/site biases at best lose statistical power, and at worst leave spurious correlations in their data. Estimation of the bias effects is non-trivial due to the paucity of data with correspondence across sites, so called "traveling phantom" data, which is expensive to collect. N… ▽ More

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: Accepted as a short paper in MIDL 2020. In accordance with the MIDL 2020 Call for Papers, this short paper is an overview of an already published work arXiv:1904.05375, and was submitted to MIDL in order to allow presentation and discussion at the meeting

    Report number: MIDL/2020/ExtendedAbstract/yqm9RD_XHT

  40. arXiv:1910.03960  [pdf, ps, other

    math.DS cs.SC eess.SY math.AC

    Input-output equations and identifiability of linear ODE models

    Authors: Alexey Ovchinnikov, Gleb Pogudin, Peter Thompson

    Abstract: Structural identifiability is a property of a differential model with parameters that allows for the parameters to be determined from the model equations in the absence of noise. The method of input-output equations is one method for verifying structural identifiability. This method stands out in its importance because the additional insights it provides can be used to analyze and improve models.… ▽ More

    Submitted 27 January, 2022; v1 submitted 9 October, 2019; originally announced October 2019.

    MSC Class: 12H05; 34A55; 92B05; 93C15; 93B25; 93B30

  41. arXiv:1904.06288  [pdf, other

    math.ST cs.LG

    Outlier-robust estimation of a sparse linear model using $\ell_1$-penalized Huber's $M$-estimator

    Authors: Arnak S. Dalalyan, Philip Thompson

    Abstract: We study the problem of estimating a $p$-dimensional $s$-sparse vector in a linear model with Gaussian design and additive noise. In the case where the labels are contaminated by at most $o$ adversarial outliers, we prove that the $\ell_1$-penalized Huber's $M$-estimator based on $n$ samples attains the optimal rate of convergence $(s/n)^{1/2} + (o/n)$, up to a logarithmic factor. For more general… ▽ More

    Submitted 19 November, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: This is a follow up paper of arXiv:1805.08020

  42. arXiv:1904.05375  [pdf, other

    q-bio.QM cs.LG eess.IV stat.AP stat.ML

    Scanner Invariant Representations for Diffusion MRI Harmonization

    Authors: Daniel Moyer, Greg Ver Steeg, Chantal M. W. Tax, Paul M. Thompson

    Abstract: Purpose: In the present work we describe the correction of diffusion-weighted MRI for site and scanner biases using a novel method based on invariant representation. Theory and Methods: Pooled imaging data from multiple sources are subject to variation between the sources. Correcting for these biases has become very important as imaging studies increase in size and multi-site cases become more c… ▽ More

    Submitted 31 January, 2020; v1 submitted 10 April, 2019; originally announced April 2019.

  43. arXiv:1810.08553  [pdf, other

    stat.ML cs.LG q-bio.NC q-bio.QM

    Federated Learning in Distributed Medical Databases: Meta-Analysis of Large-Scale Subcortical Brain Data

    Authors: Santiago Silva, Boris Gutman, Eduardo Romero, Paul M Thompson, Andre Altmann, Marco Lorenzi

    Abstract: At this moment, databanks worldwide contain brain images of previously unimaginable numbers. Combined with developments in data science, these massive data provide the potential to better understand the genetic underpinnings of brain diseases. However, different datasets, which are stored at different institutions, cannot always be shared directly due to privacy and legal concerns, thus limiting t… ▽ More

    Submitted 28 January, 2025; v1 submitted 19 October, 2018; originally announced October 2018.

    Comments: Federated learning, distributed databases, PCA, SVD, meta-analysis, brain disease

  44. arXiv:1806.04634  [pdf, other

    q-bio.QM cs.LG q-bio.TO stat.AP

    Measures of Tractography Convergence

    Authors: Daniel Moyer, Paul M. Thompson, Greg Ver Steeg

    Abstract: In the present work, we use information theory to understand the empirical convergence rate of tractography, a widely-used approach to reconstruct anatomical fiber pathways in the living brain. Based on diffusion MRI data, tractography is the starting point for many methods to study brain connectivity. Of the available methods to perform tractography, most reconstruct a finite set of streamlines,… ▽ More

    Submitted 12 June, 2018; originally announced June 2018.

    Comments: 11 pages

  45. arXiv:1805.01049  [pdf, other

    cs.LG stat.ML

    Large-Scale Unsupervised Deep Representation Learning for Brain Structure

    Authors: Ayush Jaiswal, Dong Guo, Cauligi S. Raghavendra, Paul Thompson

    Abstract: Machine Learning (ML) is increasingly being used for computer aided diagnosis of brain related disorders based on structural magnetic resonance imaging (MRI) data. Most of such work employs biologically and medically meaningful hand-crafted features calculated from different regions of the brain. The construction of such highly specialized features requires a considerable amount of time, manual ov… ▽ More

    Submitted 2 May, 2018; originally announced May 2018.

  46. arXiv:1711.05766  [pdf, other

    cs.CV cs.AI

    Fast Predictive Simple Geodesic Regression

    Authors: Zhipeng Ding, Greg Fleishman, Xiao Yang, Paul Thompson, Roland Kwitt, Marc Niethammer

    Abstract: Deformable image registration and regression are important tasks in medical image analysis. However, they are computationally expensive, especially when analyzing large-scale datasets that contain thousands of images. Hence, cluster computing is typically used, making the approaches dependent on such computational infrastructure. Even larger computational resources are required as study sizes incr… ▽ More

    Submitted 15 November, 2017; originally announced November 2017.

    Comments: 19 pages, 10 figures, 13 tables

  47. arXiv:1709.03645  [pdf, other

    stat.ML cs.LG q-bio.GN

    Identifying Genetic Risk Factors via Sparse Group Lasso with Group Graph Structure

    Authors: Tao Yang, Paul Thompson, Sihai Zhao, Jieping Ye

    Abstract: Genome-wide association studies (GWA studies or GWAS) investigate the relationships between genetic variants such as single-nucleotide polymorphisms (SNPs) and individual traits. Recently, incorporating biological priors together with machine learning methods in GWA studies has attracted increasing attention. However, in real-world, nucleotide-level bio-priors have not been well-studied to date. A… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

  48. arXiv:1708.04789  [pdf, other

    stat.AP cs.CY

    revisit: a Workflow Tool for Data Science

    Authors: Norman Matloff, Reed Davis, Laurel Beckett, Paul Thompson

    Abstract: In recent years there has been widespread concern in the scientific community over a reproducibility crisis. Among the major causes that have been identified is statistical: In many scientific research the statistical analysis (including data preparation) suffers from a lack of transparency and methodological problems, major obstructions to reproducibility. The revisit package aims toward remedyin… ▽ More

    Submitted 16 August, 2017; originally announced August 2017.

  49. arXiv:1706.06031  [pdf, other

    q-bio.NC cs.CV

    Evaluating 35 Methods to Generate Structural Connectomes Using Pairwise Classification

    Authors: Dmitry Petrov, Alexander Ivanov, Joshua Faskowitz, Boris Gutman, Daniel Moyer, Julio Villalon, Neda Jahanshad, Paul Thompson

    Abstract: There is no consensus on how to construct structural brain networks from diffusion MRI. How variations in pre-processing steps affect network reliability and its ability to distinguish subjects remains opaque. In this work, we address this issue by comparing 35 structural connectome-building pipelines. We vary diffusion reconstruction models, tractography algorithms and parcellations. Next, we cla… ▽ More

    Submitted 19 June, 2017; originally announced June 2017.

    Comments: Accepted for MICCAI 2017, 8 pages, 3 figures

  50. arXiv:1705.10312  [pdf

    cs.LG cs.CE stat.AP

    Classification of Major Depressive Disorder via Multi-Site Weighted LASSO Model

    Authors: Dajiang Zhu, Brandalyn C. Riedel, Neda Jahanshad, Nynke A. Groenewold, Dan J. Stein, Ian H. Gotlib, Matthew D. Sacchet, Danai Dima, James H. Cole, Cynthia H. Y. Fu, Henrik Walter, Ilya M. Veer, Thomas Frodl, Lianne Schmaal, Dick J. Veltman, Paul M. Thompson

    Abstract: Large-scale collaborative analysis of brain imaging data, in psychiatry and neu-rology, offers a new source of statistical power to discover features that boost ac-curacy in disease classification, differential diagnosis, and outcome prediction. However, due to data privacy regulations or limited accessibility to large datasets across the world, it is challenging to efficiently integrate distribut… ▽ More

    Submitted 3 June, 2017; v1 submitted 26 May, 2017; originally announced May 2017.

    Comments: Accepted by MICCAI 2017