Search | arXiv e-print repository

Regulatory DNA sequence Design with Reinforcement Learning

Authors: Zhao Yang, Bing Su, Chuan Cao, Ji-Rong Wen

Abstract: Cis-regulatory elements (CREs), such as promoters and enhancers, are relatively short DNA sequences that directly regulate gene expression. The fitness of CREs, measured by their ability to modulate gene expression, highly depends on the nucleotide sequences, especially specific motifs known as transcription factor binding sites (TFBSs). Designing high-fitness CREs is crucial for therapeutic and b… ▽ More Cis-regulatory elements (CREs), such as promoters and enhancers, are relatively short DNA sequences that directly regulate gene expression. The fitness of CREs, measured by their ability to modulate gene expression, highly depends on the nucleotide sequences, especially specific motifs known as transcription factor binding sites (TFBSs). Designing high-fitness CREs is crucial for therapeutic and bioengineering applications. Current CRE design methods are limited by two major drawbacks: (1) they typically rely on iterative optimization strategies that modify existing sequences and are prone to local optima, and (2) they lack the guidance of biological prior knowledge in sequence optimization. In this paper, we address these limitations by proposing a generative approach that leverages reinforcement learning (RL) to fine-tune a pre-trained autoregressive (AR) model. Our method incorporates data-driven biological priors by deriving computational inference-based rewards that simulate the addition of activator TFBSs and removal of repressor TFBSs, which are then integrated into the RL process. We evaluate our method on promoter design tasks in two yeast media conditions and enhancer design tasks for three human cell types, demonstrating its ability to generate high-fitness CREs while maintaining sequence diversity. The code is available at https://github.com/yangzhao1230/TACO. △ Less

Submitted 10 March, 2025; originally announced March 2025.

arXiv:2502.06274 [pdf, other]

HODDI: A Dataset of High-Order Drug-Drug Interactions for Computational Pharmacovigilance

Authors: Zhaoying Wang, Yingdan Shi, Xiang Liu, Can Chen, Jun Wen, Ren Wang

Abstract: Drug-side effect research is vital for understanding adverse reactions arising in complex multi-drug therapies. However, the scarcity of higher-order datasets that capture the combinatorial effects of multiple drugs severely limits progress in this field. Existing resources such as TWOSIDES primarily focus on pairwise interactions. To fill this critical gap, we introduce HODDI, the first Higher-Or… ▽ More Drug-side effect research is vital for understanding adverse reactions arising in complex multi-drug therapies. However, the scarcity of higher-order datasets that capture the combinatorial effects of multiple drugs severely limits progress in this field. Existing resources such as TWOSIDES primarily focus on pairwise interactions. To fill this critical gap, we introduce HODDI, the first Higher-Order Drug-Drug Interaction Dataset, constructed from U.S. Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) records spanning the past decade, to advance computational pharmacovigilance. HODDI contains 109,744 records involving 2,506 unique drugs and 4,569 unique side effects, specifically curated to capture multi-drug interactions and their collective impact on adverse effects. Comprehensive statistical analyses demonstrate HODDI's extensive coverage and robust analytical metrics, making it a valuable resource for studying higher-order drug relationships. Evaluating HODDI with multiple models, we found that simple Multi-Layer Perceptron (MLP) can outperform graph models, while hypergraph models demonstrate superior performance in capturing complex multi-drug interactions, further validating HODDI's effectiveness. Our findings highlight the inherent value of higher-order information in drug-side effect prediction and position HODDI as a benchmark dataset for advancing research in pharmacovigilance, drug safety, and personalized medicine. The dataset and codes are available at https://github.com/TIML-Group/HODDI. △ Less

Submitted 10 February, 2025; originally announced February 2025.

arXiv:2501.05644 [pdf, ps, other]

Interpretable Enzyme Function Prediction via Residue-Level Detection

Authors: Zhao Yang, Bing Su, Jiahao Chen, Ji-Rong Wen

Abstract: Predicting multiple functions labeled with Enzyme Commission (EC) numbers from the enzyme sequence is of great significance but remains a challenge due to its sparse multi-label classification nature, i.e., each enzyme is typically associated with only a few labels out of more than 6000 possible EC numbers. However, existing machine learning algorithms generally learn a fixed global representation… ▽ More Predicting multiple functions labeled with Enzyme Commission (EC) numbers from the enzyme sequence is of great significance but remains a challenge due to its sparse multi-label classification nature, i.e., each enzyme is typically associated with only a few labels out of more than 6000 possible EC numbers. However, existing machine learning algorithms generally learn a fixed global representation for each enzyme to classify all functions, thereby they lack interpretability and the fine-grained information of some function-specific local residue fragments may be overwhelmed. Here we present an attention-based framework, namely ProtDETR (Protein Detection Transformer), by casting enzyme function prediction as a detection problem. It uses a set of learnable functional queries to adaptatively extract different local representations from the sequence of residue-level features for predicting different EC numbers. ProtDETR not only significantly outperforms existing deep learning-based enzyme function prediction methods, but also provides a new interpretable perspective on automatically detecting different local regions for identifying different functions through cross-attentions between queries and residue-level features. Code is available at https://github.com/yangzhao1230/ProtDETR. △ Less

Submitted 5 June, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

arXiv:2411.00948 [pdf]

Multiplex Imaging Analysis in Pathology: a Comprehensive Review on Analytical Approaches and Digital Toolkits

Authors: Mohamed Omar, Giuseppe Nicolo Fanelli, Fabio Socciarelli, Varun Ullanat, Sreekar Reddy Puchala, James Wen, Alex Chowdhury, Itzel Valencia, Cristian Scatena, Luigi Marchionni, Renato Umeton, Massimo Loda

Abstract: Conventional histopathology has long been essential for disease diagnosis, relying on visual inspection of tissue sections. Immunohistochemistry aids in detecting specific biomarkers but is limited by its single-marker approach, restricting its ability to capture the full tissue environment. The advent of multiplexed imaging technologies, like multiplexed immunofluorescence and spatial transcripto… ▽ More Conventional histopathology has long been essential for disease diagnosis, relying on visual inspection of tissue sections. Immunohistochemistry aids in detecting specific biomarkers but is limited by its single-marker approach, restricting its ability to capture the full tissue environment. The advent of multiplexed imaging technologies, like multiplexed immunofluorescence and spatial transcriptomics, allows for simultaneous visualization of multiple biomarkers in a single section, enhancing morphological data with molecular and spatial information. This provides a more comprehensive view of the tissue microenvironment, cellular interactions, and disease mechanisms - crucial for understanding disease progression, prognosis, and treatment response. However, the extensive data from multiplexed imaging necessitates sophisticated computational methods for preprocessing, segmentation, feature extraction, and spatial analysis. These tools are vital for managing large, multidimensional datasets, converting raw imaging data into actionable insights. By automating labor-intensive tasks and enhancing reproducibility and accuracy, computational tools are pivotal in diagnostics and research. This review explores the current landscape of multiplexed imaging in pathology, detailing workflows and key technologies like PathML, an AI-powered platform that streamlines image analysis, making complex dataset interpretation accessible for clinical and research settings. △ Less

Submitted 1 November, 2024; originally announced November 2024.

Comments: 54 pages (39 manuscript + 14 supplementary), 3 figures (figure 1, 2 and supplementary figure 1), 6 Tables (Table 1, 2, 3 and supplementary table 1,2,3)

arXiv:2401.09517 [pdf]

Dimensional Neuroimaging Endophenotypes: Neurobiological Representations of Disease Heterogeneity Through Machine Learning

Authors: Junhao Wen, Mathilde Antoniades, Zhijian Yang, Gyujoon Hwang, Ioanna Skampardoni, Rongguang Wang, Christos Davatzikos

Abstract: Machine learning has been increasingly used to obtain individualized neuroimaging signatures for disease diagnosis, prognosis, and response to treatment in neuropsychiatric and neurodegenerative disorders. Therefore, it has contributed to a better understanding of disease heterogeneity by identifying disease subtypes that present significant differences in various brain phenotypic measures. In thi… ▽ More Machine learning has been increasingly used to obtain individualized neuroimaging signatures for disease diagnosis, prognosis, and response to treatment in neuropsychiatric and neurodegenerative disorders. Therefore, it has contributed to a better understanding of disease heterogeneity by identifying disease subtypes that present significant differences in various brain phenotypic measures. In this review, we first present a systematic literature overview of studies using machine learning and multimodal MRI to unravel disease heterogeneity in various neuropsychiatric and neurodegenerative disorders, including Alzheimer disease, schizophrenia, major depressive disorder, autism spectrum disorder, multiple sclerosis, as well as their potential in transdiagnostic settings. Subsequently, we summarize relevant machine learning methodologies and discuss an emerging paradigm which we call dimensional neuroimaging endophenotype (DNE). DNE dissects the neurobiological heterogeneity of neuropsychiatric and neurodegenerative disorders into a low dimensional yet informative, quantitative brain phenotypic representation, serving as a robust intermediate phenotype (i.e., endophenotype) largely reflecting underlying genetics and etiology. Finally, we discuss the potential clinical implications of the current findings and envision future research avenues. △ Less

Submitted 17 January, 2024; originally announced January 2024.

arXiv:2305.12618 [pdf, other]

Atomic and Subgraph-aware Bilateral Aggregation for Molecular Representation Learning

Authors: Jiahao Chen, Yurou Liu, Jiangmeng Li, Bing Su, Jirong Wen

Abstract: Molecular representation learning is a crucial task in predicting molecular properties. Molecules are often modeled as graphs where atoms and chemical bonds are represented as nodes and edges, respectively, and Graph Neural Networks (GNNs) have been commonly utilized to predict atom-related properties, such as reactivity and solubility. However, functional groups (subgraphs) are closely related to… ▽ More Molecular representation learning is a crucial task in predicting molecular properties. Molecules are often modeled as graphs where atoms and chemical bonds are represented as nodes and edges, respectively, and Graph Neural Networks (GNNs) have been commonly utilized to predict atom-related properties, such as reactivity and solubility. However, functional groups (subgraphs) are closely related to some chemical properties of molecules, such as efficacy, and metabolic properties, which cannot be solely determined by individual atoms. In this paper, we introduce a new model for molecular representation learning called the Atomic and Subgraph-aware Bilateral Aggregation (ASBA), which addresses the limitations of previous atom-wise and subgraph-wise models by incorporating both types of information. ASBA consists of two branches, one for atom-wise information and the other for subgraph-wise information. Considering existing atom-wise GNNs cannot properly extract invariant subgraph features, we propose a decomposition-polymerization GNN architecture for the subgraph-wise branch. Furthermore, we propose cooperative node-level and graph-level self-supervised learning strategies for ASBA to improve its generalization. Our method offers a more comprehensive way to learn representations for molecular property prediction and has broad potential in drug and material discovery applications. Extensive experiments have demonstrated the effectiveness of our method. △ Less

Submitted 21 May, 2023; originally announced May 2023.

arXiv:2301.10772 [pdf]

Gene-SGAN: a method for discovering disease subtypes with imaging and genetic signatures via multi-view weakly-supervised deep clustering

Authors: Zhijian Yang, Junhao Wen, Ahmed Abdulkadir, Yuhan Cui, Guray Erus, Elizabeth Mamourian, Randa Melhem, Dhivya Srinivasan, Sindhuja T. Govindarajan, Jiong Chen, Mohamad Habes, Colin L. Masters, Paul Maruff, Jurgen Fripp, Luigi Ferrucci, Marilyn S. Albert, Sterling C. Johnson, John C. Morris, Pamela LaMontagne, Daniel S. Marcus, Tammie L. S. Benzinger, David A. Wolk, Li Shen, Jingxuan Bao, Susan M. Resnick , et al. (3 additional authors not shown)

Abstract: Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limite… ▽ More Disease heterogeneity has been a critical challenge for precision diagnosis and treatment, especially in neurologic and neuropsychiatric diseases. Many diseases can display multiple distinct brain phenotypes across individuals, potentially reflecting disease subtypes that can be captured using MRI and machine learning methods. However, biological interpretability and treatment relevance are limited if the derived subtypes are not associated with genetic drivers or susceptibility factors. Herein, we describe Gene-SGAN - a multi-view, weakly-supervised deep clustering method - which dissects disease heterogeneity by jointly considering phenotypic and genetic data, thereby conferring genetic correlations to the disease subtypes and associated endophenotypic signatures. We first validate the generalizability, interpretability, and robustness of Gene-SGAN in semi-synthetic experiments. We then demonstrate its application to real multi-site datasets from 28,858 individuals, deriving subtypes of Alzheimer's disease and brain endophenotypes associated with hypertension, from MRI and SNP data. Derived brain phenotypes displayed significant differences in neuroanatomical patterns, genetic determinants, biological and clinical biomarkers, indicating potentially distinct underlying neuropathologic processes, genetic drivers, and susceptibility factors. Overall, Gene-SGAN is broadly applicable to disease subtyping and endophenotype discovery, and is herein tested on disease-related, genetically-driven neuroimaging phenotypes. △ Less

Submitted 25 January, 2023; originally announced January 2023.

arXiv:2211.06785 [pdf]

In vivo labeling and quantitative imaging of neurons using MRI

Authors: Shana Li, Xiang Xu, Canjun Li, Ziyan Xu, Qiong Ye, Yan Zhang, Chunlei Cang, Jie Wen

Abstract: Mammalian brain is a complex organ that contains billions of neurons. These neurons form various neural circuits that control the perception, cognition, emotion and behavior. Developing in vivo neuronal labeling and imaging techniques is crucial for studying the structure and function of neural circuits. In vivo techniques can provide true physiological information that cannot be provided by ex vi… ▽ More Mammalian brain is a complex organ that contains billions of neurons. These neurons form various neural circuits that control the perception, cognition, emotion and behavior. Developing in vivo neuronal labeling and imaging techniques is crucial for studying the structure and function of neural circuits. In vivo techniques can provide true physiological information that cannot be provided by ex vivo methods. In this study, we describe a new strategy for in vivo neuronal labeling and quantification using MRI. To demonstrate the ability of this new method, we used neurotropic virus to deliver oatp1a1 gene to the target neural circuit. OATP1A1 protein is expressed on the neuronal membrane and can increase the uptake of a specific MRI contrast agent (Gd-EOB-DTPA). By using T1-weighted images for observation, labeled neurons "light up" on MRI. We further use a dynamic-contrast-enhancement based method to obtain measures that provide quantitative information of labeled neurons in vivo. △ Less

Submitted 12 November, 2022; originally announced November 2022.

arXiv:2110.11347 [pdf]

Multidimensional representations in late-life depression: convergence in neuroimaging, cognition, clinical symptomatology and genetics

Authors: Junhao Wen, Cynthia H. Y. Fu, Duygu Tosun, Yogasudha Veturi, Zhijian Yang, Ahmed Abdulkadir, Elizabeth Mamourian, Dhivya Srinivasan, Jingxuan Bao, Guray Erus, Haochang Shou, Mohamad Habes, Jimit Doshi, Erdem Varol, Scott R Mackin, Aristeidis Sotiras, Yong Fan, Andrew J. Saykin, Yvette I. Sheline, Li Shen, Marylyn D. Ritchie, David A. Wolk, Marilyn Albert, Susan M. Resnick, Christos Davatzikos

Abstract: Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical sympto… ▽ More Late-life depression (LLD) is characterized by considerable heterogeneity in clinical manifestation. Unraveling such heterogeneity would aid in elucidating etiological mechanisms and pave the road to precision and individualized medicine. We sought to delineate, cross-sectionally and longitudinally, disease-related heterogeneity in LLD linked to neuroanatomy, cognitive functioning, clinical symptomatology, and genetic profiles. Multimodal data from a multicentre sample (N=996) were analyzed. A semi-supervised clustering method (HYDRA) was applied to regional grey matter (GM) brain volumes to derive dimensional representations. Two dimensions were identified, which accounted for the LLD-related heterogeneity in voxel-wise GM maps, white matter (WM) fractional anisotropy (FA), neurocognitive functioning, clinical phenotype, and genetics. Dimension one (Dim1) demonstrated relatively preserved brain anatomy without WM disruptions relative to healthy controls. In contrast, dimension two (Dim2) showed widespread brain atrophy and WM integrity disruptions, along with cognitive impairment and higher depression severity. Moreover, one de novo independent genetic variant (rs13120336) was significantly associated with Dim 1 but not with Dim 2. Notably, the two dimensions demonstrated significant SNP-based heritability of 18-27% within the general population (N=12,518 in UKBB). Lastly, in a subset of individuals having longitudinal measurements, Dim2 demonstrated a more rapid longitudinal decrease in GM and brain age, and was more likely to progress to Alzheimers disease, compared to Dim1 (N=1,413 participants and 7,225 scans from ADNI, BLSA, and BIOCARD datasets). △ Less

Submitted 25 October, 2021; v1 submitted 20 October, 2021; originally announced October 2021.

arXiv:2107.10256 [pdf]

Clinica: an open source software platform for reproducible clinical neuroscience studies

Authors: Alexandre Routier, Ninon Burgos, Mauricio Díaz, Michael Bacci, Simona Bottani, Omar El-Rifai, Sabrina Fontanella, Pietro Gori, Jérémy Guillon, Alexis Guyot, Ravi Hassanaly, Thomas Jacquemont, Pascal Lu, Arnaud Marcoux, Tristan Moreau, Jorge Samper-González, Marc Teichmann, Elina Thibeau--Sutre, Ghislain Vaillant, Junhao Wen, Adam Wild, Marie-Odile Habert, Stanley Durrleman, Olivier Colliot

Abstract: We present Clinica (www.clinica.run), an open-source software platform designed to make clinical neuroscience studies easier and more reproducible. Clinica aims for researchers to i) spend less time on data management and processing, ii) perform reproducible evaluations of their methods, and iii) easily share data and results within their institution and with external collaborators. The core of Cl… ▽ More We present Clinica (www.clinica.run), an open-source software platform designed to make clinical neuroscience studies easier and more reproducible. Clinica aims for researchers to i) spend less time on data management and processing, ii) perform reproducible evaluations of their methods, and iii) easily share data and results within their institution and with external collaborators. The core of Clinica is a set of automatic pipelines for processing and analysis of multimodal neuroimaging data (currently, T1-weighted MRI, diffusion MRI and PET data), as well as tools for statistics, machine learning and deep learning. It relies on the brain imaging data structure (BIDS) for the organization of raw neuroimaging datasets and on established tools written by the community to build its pipelines. It also provides converters of public neuroimaging datasets to BIDS (currently ADNI, AIBL, OASIS and NIFD). Processed data include image-valued scalar fields (e.g. tissue probability maps), meshes, surface-based scalar fields (e.g. cortical thickness maps) or scalar outputs (e.g. regional averages). These data follow the ClinicA Processed Structure (CAPS) format which shares the same philosophy as BIDS. Consistent organization of raw and processed neuroimaging files facilitates the execution of single pipelines and of sequences of pipelines, as well as the integration of processed data into statistics or machine learning frameworks. The target audience of Clinica is neuroscientists or clinicians conducting clinical neuroscience studies involving multimodal imaging, and researchers developing advanced machine learning algorithms applied to neuroimaging data. △ Less

Submitted 21 July, 2021; originally announced July 2021.

arXiv:2102.12582 [pdf]

Disentangling brain heterogeneity via semi-supervised deep-learning and MRI: dimensional representations of Alzheimer's Disease

Authors: Zhijian Yang, Ilya M. Nasrallah, Haochang Shou, Junhao Wen, Jimit Doshi, Mohamad Habes, Guray Erus, Ahmed Abdulkadir, Susan M. Resnick, David Wolk, Christos Davatzikos

Abstract: Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies;… ▽ More Heterogeneity of brain diseases is a challenge for precision diagnosis/prognosis. We describe and validate Smile-GAN (SeMI-supervised cLustEring-Generative Adversarial Network), a novel semi-supervised deep-clustering method, which dissects neuroanatomical heterogeneity, enabling identification of disease subtypes via their imaging signatures relative to controls. When applied to MRIs (2 studies; 2,832 participants; 8,146 scans) including cognitively normal individuals and those with cognitive impairment and dementia, Smile-GAN identified 4 neurodegenerative patterns/axes: P1, normal anatomy and highest cognitive performance; P2, mild/diffuse atrophy and more prominent executive dysfunction; P3, focal medial temporal atrophy and relatively greater memory impairment; P4, advanced neurodegeneration. Further application to longitudinal data revealed two distinct progression pathways: P1$\rightarrow$P2$\rightarrow$P4 and P1$\rightarrow$P3$\rightarrow$P4. Baseline expression of these patterns predicted the pathway and rate of future neurodegeneration. Pattern expression offered better yet complementary performance in predicting clinical progression, compared to amyloid/tau. These deep-learning derived biomarkers offer promise for precision diagnostics and targeted clinical trial recruitment. △ Less

Submitted 24 February, 2021; originally announced February 2021.

Comments: 37 pages, 11 figures

arXiv:2006.15255 [pdf, other]

Smile-GANs: Semi-supervised clustering via GANs for dissecting brain disease heterogeneity from medical images

Authors: Zhijian Yang, Junhao Wen, Christos Davatzikos

Abstract: Machine learning methods applied to complex biomedical data has enabled the construction of disease signatures of diagnostic/prognostic value. However, less attention has been given to understanding disease heterogeneity. Semi-supervised clustering methods can address this problem by estimating multiple transformations from a (e.g. healthy) control (CN) group to a patient (PT) group, seeking to ca… ▽ More Machine learning methods applied to complex biomedical data has enabled the construction of disease signatures of diagnostic/prognostic value. However, less attention has been given to understanding disease heterogeneity. Semi-supervised clustering methods can address this problem by estimating multiple transformations from a (e.g. healthy) control (CN) group to a patient (PT) group, seeking to capture the heterogeneity of underlying pathlogic processes. Herein, we propose a novel method, Smile-GANs (SeMi-supervIsed cLustEring via GANs), for semi-supervised clustering, and apply it to brain MRI scans. Smile-GANs first learns multiple distinct mappings by generating PT from CN, with each mapping characterizing one relatively distinct pathological pattern. Moreover, a clustering model is trained interactively with mapping functions to assign PT into corresponding subtype memberships. Using relaxed assumptions on PT/CN data distribution and imposing mapping non-linearity, Smile-GANs captures heterogeneous differences in distribution between the CN and PT domains. We first validate Smile-GANs using simulated data, subsequently on real data, by demonstrating its potential in characterizing heterogeneity in Alzheimer's Disease (AD) and its prodromal phases. The model was first trained using baseline MRIs from the ADNI2 database and then applied to longitudinal data from ADNI1 and BLSA. Four robust subtypes with distinct neuroanatomical patterns were discovered: 1) normal brain, 2) diffuse atrophy atypical of AD, 3) focal medial temporal lobe atrophy, 4) typical-AD. Further longitudinal analyses discover two distinct progressive pathways from prodromal to full AD: i) subtypes 1 - 2 - 4, and ii) subtypes 1 - 3 - 4. Although demonstrated on an important biomedical problem, Smile-GANs is general and can find application in many biomedical and other domains. △ Less

Submitted 26 June, 2020; originally announced June 2020.

arXiv:2006.02396 [pdf, other]

How initial distribution affects symmetry breaking induced by panic in ants: experiment and flee-pheromone model

Authors: Geng Li, Weijia Wang, Jiahui Lin, Zhiyang Huang, Jianqiang Liang, Huabo Wu, Jianping Wen, Zengru Di, Bertrand Roehner, Zhangang Han

Abstract: Collective escaping is a ubiquitous phenomenon in animal groups. Symmetry breaking caused by panic escape exhibits a shared feature across species that one exit is used more than the other when agents escaping from a closed space with two symmetrically located exists. Intuitively, one exit will be used more by more individuals close to it, namely there is an asymmetric distribution initially. We u… ▽ More Collective escaping is a ubiquitous phenomenon in animal groups. Symmetry breaking caused by panic escape exhibits a shared feature across species that one exit is used more than the other when agents escaping from a closed space with two symmetrically located exists. Intuitively, one exit will be used more by more individuals close to it, namely there is an asymmetric distribution initially. We used ant groups to investigate how initial distribution of colonies would influence symmetry breaking in collective escaping. Surprisingly, there was no positive correlation between symmetry breaking and the asymmetrically initial distribution, which was quite counter-intuitive. In the experiments, a flee stage was observed and accordingly a flee-pheromone model was introduced to depict this special behavior in the early stage of escaping. Simulation results fitted well with the experiment. Furthermore, the flee stage duration was calibrated quantitatively and the model reproduced the observation demonstrated by our previous work. This paper explicitly distinguished two stages in ant panic escaping for the first time, thus enhancing the understanding in escaping behavior of ant colonies. △ Less

Submitted 3 June, 2020; originally announced June 2020.

arXiv:2001.09400 [pdf]

Fast library-driven approach for implementation of the voxel spread function technique for correcting magnetic field inhomogeneity artifacts

Authors: Jie Wen, Feiyan Zeng, Dmitriy Yablonskiy, Alexander Sukstansky, Ying Liu, Bin Cai, Yong Zhang, Weifu Lv

Abstract: Purpose: Previously-developed Voxel Spread Function (VSF) method (Yablonskiy, et al, MRM, 2013;70:1283) provides means to correct artifacts induced by macroscopic magnetic field inhomogeneities in the images obtained by multi-Gradient-Recalled-Echo (mGRE) techniques. The goal of this study is to develop a library-driven approach for fast VSF implementation. Methods: The VSF approach describes the… ▽ More Purpose: Previously-developed Voxel Spread Function (VSF) method (Yablonskiy, et al, MRM, 2013;70:1283) provides means to correct artifacts induced by macroscopic magnetic field inhomogeneities in the images obtained by multi-Gradient-Recalled-Echo (mGRE) techniques. The goal of this study is to develop a library-driven approach for fast VSF implementation. Methods: The VSF approach describes the contribution of the magnetic field inhomogeneity effects on the mGRE signal decay in terms of the F-function calculated from mGRE phase and magnitude images. A pre-calculated library accounting for a variety of background field gradients caused by magnetic field inhomogeneities was used herein to speed up calculation of the F-function and to generate quantitative R2* maps from the mGRE data collected from two healthy volunteers. Results: As compared with direct calculation of the F-function based on a voxel-wise approach, the new library-driven method substantially reduces computational time from several hours to few minutes, while, at the same time, providing similar accuracy of R2* mapping. Conclusion: The new procedure proposed in this study provides a fast post-processing algorithm that can be incorporated in the quantitative analysis of mGRE data to account for background field inhomogeneity artifacts, thus can facilitate the applications of mGRE-based quantitative techniques in clinical practices. △ Less

Submitted 26 January, 2020; originally announced January 2020.

Comments: 14 pages, 5 figures

arXiv:1902.09059 [pdf, other]

Rapid Circadian Entrainment in Models of Circadian Genes Regulation

Authors: Jiawei Yin, Agung Julius, John T. Wen

Abstract: The light-based minimum-time circadian entrainment problem for mammals, Neurospora, and Drosophila is studied based on the mathematical models of their circadian gene regulation. These models contain high order nonlinear differential equations. Two model simplification methods are applied to these high-order models: the phase response curves (PRC) and the Principal Orthogonal Decomposition (POD).… ▽ More The light-based minimum-time circadian entrainment problem for mammals, Neurospora, and Drosophila is studied based on the mathematical models of their circadian gene regulation. These models contain high order nonlinear differential equations. Two model simplification methods are applied to these high-order models: the phase response curves (PRC) and the Principal Orthogonal Decomposition (POD). The variational calculus and a gradient descent algorithm are applied for solving the optimal light input in the high-order models. As the results of the gradient descent algorithm rely heavily on the initial guesses, we use the optimal control of the PRC and the simplified model to initialize the gradient descent algorithm. In this paper, we present: (1) the application of PRC and direct shooting algorithm on high-order nonlinear models; (2) a general process for solving the minimum-time optimal control problem on high-order models; (3) the impacts of minimum-time optimal light on circadian gene transcription and protein synthesis. △ Less

Submitted 8 March, 2019; v1 submitted 24 February, 2019; originally announced February 2019.

arXiv:1812.11183 [pdf]

Reproducible evaluation of diffusion MRI features for automatic classification of patients with Alzheimers disease

Authors: Junhao Wen, Jorge Samper-Gonzalez, Simona Bottani, Alexandre Routier, Ninon Burgos, Thomas Jacquemont, Sabrina Fontanella, Stanley Durrleman, Stephane Epelbaum, Anne Bertrand, Olivier Colliot

Abstract: Diffusion MRI is the modality of choice to study alterations of white matter. In past years, various works have used diffusion MRI for automatic classification of AD. However, classification performance obtained with different approaches is difficult to compare and these studies are also difficult to reproduce. In the present paper, we first extend a previously proposed framework to diffusion MRI… ▽ More Diffusion MRI is the modality of choice to study alterations of white matter. In past years, various works have used diffusion MRI for automatic classification of AD. However, classification performance obtained with different approaches is difficult to compare and these studies are also difficult to reproduce. In the present paper, we first extend a previously proposed framework to diffusion MRI data for AD classification. Specifically, we add: conversion of diffusion MRI ADNI data into the BIDS standard and pipelines for diffusion MRI preprocessing and feature extraction. We then apply the framework to compare different components. First, FS has a positive impact on classification results: highest balanced accuracy (BA) improved from 0.76 to 0.82 for task CN vs AD. Secondly, voxel-wise features generally gives better performance than regional features. Fractional anisotropy (FA) and mean diffusivity (MD) provided comparable results for voxel-wise features. Moreover, we observe that the poor performance obtained in tasks involving MCI were potentially caused by the small data samples, rather than by the data imbalance. Furthermore, no extensive classification difference exists for different degree of smoothing and registration methods. Besides, we demonstrate that using non-nested validation of FS leads to unreliable and over-optimistic results: 0.05 up to 0.40 relative increase in BA. Lastly, with proper FR and FS, the performance of diffusion MRI features is comparable to that of T1w MRI. All the code of the framework and the experiments are publicly available: general-purpose tools have been integrated into the Clinica software package (www.clinica.run) and the paper-specific code is available at: https://github.com/aramis-lab/AD-ML. △ Less

Submitted 11 June, 2020; v1 submitted 28 December, 2018; originally announced December 2018.

Comments: 51 pages, 5 figure and 6 tables

arXiv:0707.0662 [pdf]

doi 10.1529/biophysj.106.094243

Force unfolding kinetics of RNA using optical tweezers. II. Modeling experiments

Authors: M. Manosas, J. -D. Wen, P. T. X. Li, S. B. Smith, C. Bustamante, I. Tinoco, Jr., F. Ritort

Abstract: By exerting mechanical force it is possible to unfold/refold RNA molecules one at a time. In a small range of forces, an RNA molecule can hop between the folded and the unfolded state with force-dependent kinetic rates. Here, we introduce a mesoscopic model to analyze the hopping kinetics of RNA hairpins in an optical tweezers setup. The model includes different elements of the experimental setu… ▽ More By exerting mechanical force it is possible to unfold/refold RNA molecules one at a time. In a small range of forces, an RNA molecule can hop between the folded and the unfolded state with force-dependent kinetic rates. Here, we introduce a mesoscopic model to analyze the hopping kinetics of RNA hairpins in an optical tweezers setup. The model includes different elements of the experimental setup (beads, handles and RNA sequence) and limitations of the instrument (time lag of the force-feedback mechanism and finite bandwidth of data acquisition). We investigated the influence of the instrument on the measured hopping rates. Results from the model are in good agreement with the experiments reported in the companion article (1). The comparison between theory and experiments allowed us to infer the values of the intrinsic molecular rates of the RNA hairpin alone and to search for the optimal experimental conditions to do the measurements. We conclude that long handles and soft laser traps represent the best conditions to extract rate estimates that are closest to the intrinsic molecular rates. The methodology and rationale presented here can be applied to other experimental setups and other molecules. △ Less

Submitted 4 July, 2007; originally announced July 2007.

Comments: PDF file, 32 pages including 9 figures plus supplementary material

Journal ref: Biophysical Journal, 92 (2007) 3010-3021

arXiv:0707.0580 [pdf]

doi 10.1529/biophysj.106.094052

Force unfolding kinetics of RNA using optical tweezers. I. Effects of experimental variables on measured results

Authors: J. -D. Wen, M. Manosas, P. T. X. Li, S. B. Smith, C. Bustamante, F. Ritort, I. Tinoco Jr

Abstract: Experimental variables of optical tweezers instrumentation that affect RNA folding/unfolding kinetics were investigated. A model RNA hairpin, P5ab, was attached to two micron-sized beads through hybrid RNA/DNA handles; one bead was trapped by dual-beam lasers and the other was held by a micropipette. Several experimental variables were changed while measuring the unfolding/refolding kinetics, in… ▽ More Experimental variables of optical tweezers instrumentation that affect RNA folding/unfolding kinetics were investigated. A model RNA hairpin, P5ab, was attached to two micron-sized beads through hybrid RNA/DNA handles; one bead was trapped by dual-beam lasers and the other was held by a micropipette. Several experimental variables were changed while measuring the unfolding/refolding kinetics, including handle lengths, trap stiffness, and modes of force applied to the molecule. In constant-force mode where the tension applied to the RNA was maintained through feedback control, the measured rate coefficients varied within 40% when the handle lengths were changed by 10 fold (1.1 to 10.2 Kbp); they increased by two- to three-fold when the trap stiffness was lowered to one third (from 0.1 to 0.035 pN/nm). In the passive mode, without feedback control and where the force applied to the RNA varied in response to the end-to-end distance change of the tether, the RNA hopped between a high-force folded-state and a low-force unfolded-state. In this mode, the rates increased up to two-fold with longer handles or softer traps. Overall, the measured rates remained with the same order-of-magnitude over the wide range of conditions studied. In the companion paper (1), we analyze how the measured kinetics parameters differ from the intrinsic molecular rates of the RNA, and thus how to obtain the molecular rates. △ Less

Submitted 4 July, 2007; originally announced July 2007.

Comments: PDF file, 30 pages, 7 figures

Journal ref: Biophysical Journal, 92 (2007) 2996-3009

Showing 1–18 of 18 results for author: Wen, J