Search | arXiv e-print repository

A principled framework to assess the information-theoretic fitness of brain functional sub-circuits

Authors: Duy Duong-Tran, Nghi Nguyen, Shizhuo Mu, Jiong Chen, Jingxuan Bao, Frederick Xu, Sumita Garai, Jose Cadena-Pico, Alan David Kaplan, Tianlong Chen, Yize Zhao, Li Shen, Joaquín Goñi

Abstract: In systems and network neuroscience, many common practices in brain connectomic analysis are often not properly scrutinized. One such practice is mapping a predetermined set of sub-circuits, like functional networks (FNs), onto subjects' functional connectomes (FCs) without adequately assessing the information-theoretic appropriateness of the partition. Another practice that goes unchallenged is t… ▽ More In systems and network neuroscience, many common practices in brain connectomic analysis are often not properly scrutinized. One such practice is mapping a predetermined set of sub-circuits, like functional networks (FNs), onto subjects' functional connectomes (FCs) without adequately assessing the information-theoretic appropriateness of the partition. Another practice that goes unchallenged is thresholding weighted FCs to remove spurious connections without justifying the chosen threshold. This paper leverages recent theoretical advances in Stochastic Block Models (SBMs) to formally define and quantify the information-theoretic fitness (e.g., prominence) of a predetermined set of FNs when mapped to individual FCs under different fMRI task conditions. Our framework allows for evaluating any combination of FC granularity, FN partition, and thresholding strategy, thereby optimizing these choices to preserve important topological features of the human brain connectomes. By applying to the Human Connectome Project with Schaefer parcellations at multiple levels of granularity, the framework showed that the common thresholding value of 0.25 was indeed information-theoretically valid for group-average FCs despite its previous lack of justification. Our results pave the way for the proper use of FNs and thresholding methods and provide insights for future research in individualized parcellations. △ Less

Submitted 23 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

arXiv:2303.16644 [pdf, other]

Policy lessons from the Italian pandemic of Covid-19

Authors: José M. Carcione, Jing Ba

Abstract: We analyze the management of the Italian pandemic during the five identified waves. We considered the following problems: (i) The composition of the CTS ("Scientific Technical Committee"), which was composed entirely of doctors, mainly virologists, without mathematical epidemiologists, statisticians, physicists, etc. In fact, a pandemic has a behavior described by mathematical, stochastic and prob… ▽ More We analyze the management of the Italian pandemic during the five identified waves. We considered the following problems: (i) The composition of the CTS ("Scientific Technical Committee"), which was composed entirely of doctors, mainly virologists, without mathematical epidemiologists, statisticians, physicists, etc. In fact, a pandemic has a behavior described by mathematical, stochastic and probabilistic criteria; (ii) Political interference in security measures and media propaganda; (iii) The initial stages of the vaccination campaign, ignoring the age factor, and (iv) The persistence of the pandemic due to the population unvaccinated (anti-vax or "no-vax"), which amounts to about six to seven million people, including 10% of anti-vax doctors. △ Less

Submitted 14 May, 2023; v1 submitted 11 March, 2023; originally announced March 2023.

arXiv:2302.00319 [pdf, other]

Development of deep biological ages aware of morbidity and mortality based on unsupervised and semi-supervised deep learning approaches

Authors: Seong-Eun Moon, Ji Won Yoon, Shinyoung Joo, Yoohyung Kim, Jae Hyun Bae, Seokho Yoon, Haanju Yoo, Young Min Cho

Abstract: Background: While deep learning technology, which has the capability of obtaining latent representations based on large-scale data, can be a potential solution for the discovery of a novel aging biomarker, existing deep learning methods for biological age estimation usually depend on chronological ages and lack of consideration of mortality and morbidity that are the most significant outcomes of a… ▽ More Background: While deep learning technology, which has the capability of obtaining latent representations based on large-scale data, can be a potential solution for the discovery of a novel aging biomarker, existing deep learning methods for biological age estimation usually depend on chronological ages and lack of consideration of mortality and morbidity that are the most significant outcomes of aging. Methods: This paper proposes a novel deep learning model to learn latent representations of biological aging in regard to subjects' morbidity and mortality. The model utilizes health check-up data in addition to morbidity and mortality information to learn the complex relationships between aging and measured clinical attributes. Findings: The proposed model is evaluated on a large dataset of general populations compared with KDM and other learning-based models. Results demonstrate that biological ages obtained by the proposed model have superior discriminability of subjects' morbidity and mortality. △ Less

Submitted 1 February, 2023; originally announced February 2023.

arXiv:2301.10772 [pdf]

Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

Authors: Zhijian Yang, Junhao Wen, Ahmed Abdulkadir, Yuhan Cui, Guray Erus, Elizabeth Mamourian, Randa Melhem, Dhivya Srinivasan, Sindhuja T. Govindarajan, Jiong Chen, Mohamad Habes, Colin L. Masters, Paul Maruff, Jurgen Fripp, Luigi Ferrucci, Marilyn S. Albert, Sterling C. Johnson, John C. Morris, Pamela LaMontagne, Daniel S. Marcus, Tammie L. S. Benzinger, David A. Wolk, Li Shen, Jingxuan Bao, Susan M. Resnick , et al. (3 additional authors not shown)

Abstract: Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limite… ▽ More Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes. △ Less

Submitted 25 January, 2023; originally announced January 2023.

arXiv:2110.11347 [pdf]

Multidimensional representations in late-life depression: convergence in neuroimaging, cognition, clinical symptomatology and genetics

Authors: Junhao Wen, Cynthia H. Y. Fu, Duygu Tosun, Yogasudha Veturi, Zhijian Yang, Ahmed Abdulkadir, Elizabeth Mamourian, Dhivya Srinivasan, Jingxuan Bao, Guray Erus, Haochang Shou, Mohamad Habes, Jimit Doshi, Erdem Varol, Scott R Mackin, Aristeidis Sotiras, Yong Fan, Andrew J. Saykin, Yvette I. Sheline, Li Shen, Marylyn D. Ritchie, David A. Wolk, Marilyn Albert, Susan M. Resnick, Christos Davatzikos

Abstract: Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical sympto… ▽ More Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical symptomatology, and genetic profiles. Multimodal data from a multicentre sample (N=996) were analyzed. A semi-supervised clustering method (HYDRA) was applied to regional grey matter (GM) brain volumes to derive dimensional representations. Two dimensions were identified, which accounted for the LLD-related heterogeneity in voxel-wise GM maps, white matter (WM) fractional anisotropy (FA), neurocognitive functioning, clinical phenotype, and genetics. Dimension one (Dim1) demonstrated relatively preserved brain anatomy without WM disruptions relative to healthy controls. In contrast, dimension two (Dim2) showed widespread brain atrophy and WM integrity disruptions, along with cognitive impairment and higher depression severity. Moreover, one de novo independent genetic variant (rs13120336) was significantly associated with Dim 1 but not with Dim 2. Notably, the two dimensions demonstrated significant SNP-based heritability of 18-27% within the general population (N=12,518 in UKBB). Lastly, in a subset of individuals having longitudinal measurements, Dim2 demonstrated a more rapid longitudinal decrease in GM and brain age, and was more likely to progress to Alzheimers disease, compared to Dim1 (N=1,413 participants and 7,225 scans from ADNI, BLSA, and BIOCARD datasets). △ Less

Submitted 25 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2011.04202 [pdf, other]

Clinical Landscape of COVID-19 Testing: Difficult Choices

Authors: Darshan Gandhi, Sanskruti Landage, Joseph Bae, Sheshank Shankar, Rohan Sukumaran, Parth Patwa, Sethuraman T V, Priyanshi Katiyar, Shailesh Advani, Rohan Iyer, Sunaina Anand, Aryan Mahindra, Rachel Barbar, Abhishek Singh, Ramesh Raskar

Abstract: The coronavirus disease 2019 (COVID-19) pandemic has spread rapidly across the world, leading to enormous amounts of human death and economic loss. Until definitive preventive or curative measures are developed, policies regarding testing, contact tracing, and quarantine remain the best public health tools for curbing viral spread. Testing is a crucial component of these efforts, enabling the iden… ▽ More The coronavirus disease 2019 (COVID-19) pandemic has spread rapidly across the world, leading to enormous amounts of human death and economic loss. Until definitive preventive or curative measures are developed, policies regarding testing, contact tracing, and quarantine remain the best public health tools for curbing viral spread. Testing is a crucial component of these efforts, enabling the identification and isolation of infected individuals. Differences in testing methodologies, time frames, and outcomes can have an impact on their overall efficiency, usability and efficacy. In this early draft, we draw a comparison between the various types of diagnostic tests including PCR, antigen, and home tests in relation to their relative advantages, disadvantages, and use cases. We also look into alternative and unconventional methods. Further, we analyze the short-term and long-term impacts of the virus and its testing on various verticals such as business, government laws, policies, and healthcare. △ Less

Submitted 15 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

Comments: 9 pages, 12 figures

arXiv:2007.08028 [pdf]

Predicting Clinical Outcomes in COVID-19 using Radiomics and Deep Learning on Chest Radiographs: A Multi-Institutional Study

Authors: Joseph Bae, Saarthak Kapse, Gagandeep Singh, Rishabh Gattu, Syed Ali, Neal Shah, Colin Marshall, Jonathan Pierce, Tej Phatak, Amit Gupta, Jeremy Green, Nikhil Madan, Prateek Prasanna

Abstract: We predict mechanical ventilation requirement and mortality using computational modeling of chest radiographs (CXRs) for coronavirus disease 2019 (COVID-19) patients. This two-center, retrospective study analyzed 530 deidentified CXRs from 515 COVID-19 patients treated at Stony Brook University Hospital and Newark Beth Israel Medical Center between March and August 2020. DL and machine learning cl… ▽ More We predict mechanical ventilation requirement and mortality using computational modeling of chest radiographs (CXRs) for coronavirus disease 2019 (COVID-19) patients. This two-center, retrospective study analyzed 530 deidentified CXRs from 515 COVID-19 patients treated at Stony Brook University Hospital and Newark Beth Israel Medical Center between March and August 2020. DL and machine learning classifiers to predict mechanical ventilation requirement and mortality were trained and evaluated using patient CXRs. A novel radiomic embedding framework was also explored for outcome prediction. All results are compared against radiologist grading of CXRs (zone-wise expert severity scores). Radiomic and DL classification models had mAUCs of 0.78+/-0.02 and 0.81+/-0.04, compared with expert scores mAUCs of 0.75+/-0.02 and 0.79+/-0.05 for mechanical ventilation requirement and mortality prediction, respectively. Combined classifiers using both radiomics and expert severity scores resulted in mAUCs of 0.79+/-0.04 and 0.83+/-0.04 for each prediction task, demonstrating improvement over either artificial intelligence or radiologist interpretation alone. Our results also suggest instances where inclusion of radiomic features in DL improves model predictions, something that might be explored in other pathologies. The models proposed in this study and the prognostic information they provide might aid physician decision making and resource allocation during the COVID-19 pandemic. △ Less

Submitted 1 July, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

Comments: Joseph Bae and Saarthak Kapse have contributed equally to this work

ACM Class: J.3; I.2.6

arXiv:2004.03575 [pdf, other]

A simulation of a COVID-19 epidemic based on a deterministic SEIR model

Authors: Jose' M. Carcione, Juan E. Santos, Claudio Bagaini, Jing Ba

Abstract: An epidemic disease caused by a new coronavirus has spread in Northern Italy with a strong contagion rate. We implement an SEIR model to compute the infected population and number of casualties of this epidemic. The example may ideally regard the situation in the Italian Region of Lombardy, where the epidemic started on February 25. We calibrate the model with the number of dead individuals to dat… ▽ More An epidemic disease caused by a new coronavirus has spread in Northern Italy with a strong contagion rate. We implement an SEIR model to compute the infected population and number of casualties of this epidemic. The example may ideally regard the situation in the Italian Region of Lombardy, where the epidemic started on February 25. We calibrate the model with the number of dead individuals to date (May 5, 2020) and constraint the parameters on the basis of values reported in the literature. The peak occurs at day 37 (March 31) approximately, when there is a rapid decrease, with a reproduction ratio R0 = 3 initially, 1.36 at day 22 and 0.8 after day 35, indicating different degrees of lockdown. The predicted death toll is approximately 15600 casualties, with 2.7 million infected individuals at the end of the epidemic. The incubation period providing a better fit of the dead individuals is 4.25 days and the infection period is 4 days, with a fatality rate of 0.00144/day [values based on the reported (official) number of casualties]. The infection fatality rate (IFR) is 0.57 %, and 2.36 % if twice the reported number of casualties is assumed. However, these rates depend on the initially exposed individuals. If approximately nine times more individuals are exposed, there are three times more infected people at the end of the epidemic and IFR = 0.47 %. If we relax these constraints and use a wider range of lower and upper bounds for the incubation and infection periods, we observe that a higher incubation period (13 versus 4.25 days) gives the same IFR (0.6 versus 0.57 %), but nine times more exposed individuals in the first case. Therefore, a precise determination of the fatality rate is subject to the knowledge of the characteristics of the epidemic. △ Less

Submitted 10 May, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

arXiv:1601.05113 [pdf]

Validating non-invasive EEG source imaging using optimal electrode configurations on a representative rat head model

Authors: Pedro A. Valdes-Hernandez, Jihye Bae, Yinchen Song, Akira Sumiyoshi, Eduardo Aubert-Vazquez, Jorge J. Riera

Abstract: The curtain of technical limitations impeding rat multichannel non-invasive electroencephalography (EEG) has risen. Given the importance of this preclinical model, development and validation of EEG source imaging (ESI) is essential. We investigate the validity of well-known human ESI methodologies in rats which individual tissue geometries have been approximated by those extracted from an MRI temp… ▽ More The curtain of technical limitations impeding rat multichannel non-invasive electroencephalography (EEG) has risen. Given the importance of this preclinical model, development and validation of EEG source imaging (ESI) is essential. We investigate the validity of well-known human ESI methodologies in rats which individual tissue geometries have been approximated by those extracted from an MRI template, leading also to imprecision in electrode localizations. With the half and fifth sensitivity volumes we determine both the theoretical minimum electrode separation for non-redundant scalp EEG measurements and the electrode sensitivity resolution, which vary over the scalp because of the head geometry. According to our results, electrodes should be at least ~3-3.5 mm apart for an optimal configuration. The sensitivity resolution is generally worse for electrodes at the boundaries of the scalp measured region, though, by analogy with human montages, concentrates the sensitivity enough to localize sources. Cramér-Rao lower bounds of source localization errors indicate it is theoretically possible to achieve ESI accuracy at the level of anatomical structures, such as the stimulus-specific somatosensory areas, using the template. More validation for this approximation is provided through the comparison between the template and the individual lead field matrices, for several rats. Finally, using well-accepted inverse methods, we demonstrate that somatosensory ESI is not only expected but also allows exploring unknown phenomena related to global sensory integration. Inheriting the advantages and pitfalls of human ESI, rat ESI will boost the understanding of brain pathophysiological mechanisms and the evaluation of ESI methodologies, new pharmacological treatments and ESI-based biomarkers. △ Less

Submitted 25 January, 2016; v1 submitted 19 January, 2016; originally announced January 2016.

Comments: 27 pages, 2 tables and 14 figures

arXiv:1511.05286 [pdf, other]

doi 10.1093/bioinformatics/btw252

Classifying and Segmenting Microscopy Images Using Convolutional Multiple Instance Learning

Authors: Oren Z. Kraus, Lei Jimmy Ba, Brendan Frey

Abstract: Convolutional neural networks (CNN) have achieved state of the art performance on both classification and segmentation tasks. Applying CNNs to microscopy images is challenging due to the lack of datasets labeled at the single cell level. We extend the application of CNNs to microscopy image classification and segmentation using multiple instance learning (MIL). We present the adaptive Noisy-AND MI… ▽ More Convolutional neural networks (CNN) have achieved state of the art performance on both classification and segmentation tasks. Applying CNNs to microscopy images is challenging due to the lack of datasets labeled at the single cell level. We extend the application of CNNs to microscopy image classification and segmentation using multiple instance learning (MIL). We present the adaptive Noisy-AND MIL pooling function, a new MIL operator that is robust to outliers. Combining CNNs with MIL enables training CNNs using full resolution microscopy images with global labels. We base our approach on the similarity between the aggregation function used in MIL and pooling layers used in CNNs. We show that training MIL CNNs end-to-end outperforms several previous methods on both mammalian and yeast microscopy images without requiring any segmentation steps. △ Less

Submitted 17 November, 2015; originally announced November 2015.

Journal ref: Bioinformatics (2016) 32 (12): i52-i59

Showing 1–10 of 10 results for author: Bao, J