Search | arXiv e-print repository

Maximal Speed of Glucose Change Significantly Distinguishes Prediabetes from Diabetes

Authors: Dandan Wang, Xiaoyan Chen, Jingxiang Lin, Teng Zhang, Lianyi Huang, Dongliang Leng, Xiaohua Douglas Zhang, Gang Li

Abstract: Rapid changes in blood glucose levels can have severe and immediate health consequences, leading to the need to develop indices for assessing these rapid changes based on continuous glucose monitoring (CGM) data. We proposed a CGM index, maxSpeed, that represents the maximum speed of glucose change (SGC) in a subject, and conducted a clinical study to investigate this index along with SGC mean (meanSpeed) and SGC standard deviation (sdSpeed), coefficient of variation (CV), standard deviation (SD), glycemic variability percentage (GVP), mean amplitude of glycemic excursions (MAG), mean absolute glucose excursion (MAGE), mean of daily differences (MODD) and continuous overlapping net glycemic action (CONGA). Our study revealed that there exist multiple patterns in distinguishing non-diabetes, prediabetes, type 1 diabetes (T1D) and type 2 diabetes (T2D). First, maxSpeed significantly distinguishes between either of non-diabetes and prediabetes and either of T1D and T2D. Second, meanSpeed, sdSpeed, GVP and MAG significantly distinguish between non-diabetes and either of T1D and T2D. Third, MODD and CONGA of 24 hours significantly distinguish between non-diabetes and either of T1D and T2D, and between T1D and either of prediabetes and T2D. Fourth, SD, MAGE and CONGA of 12 hours significantly distinguish between non-diabetes and either of T1D and T2D, and between T1D and prediabetes. Fifth, CV significantly distinguishes between T1D and either of non-diabetes and T2D.
maxSpeed assesses the rapid change of glucose in a short term, which is important both biologically and clinically because the human body may not tolerate too rapid a change in a short term.
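
To make the speed-based indices concrete, the sketch below computes maxSpeed, meanSpeed and sdSpeed from a single CGM trace. It is only an illustration under stated assumptions: the abstract does not give the exact definition of SGC, so the code assumes SGC is the absolute glucose difference between consecutive readings divided by the elapsed time, and the function names `glucose_speeds` and `speed_indices` are hypothetical, not taken from the paper.

```python
import statistics
from typing import Dict, List, Sequence


def glucose_speeds(times_min: Sequence[float], glucose: Sequence[float]) -> List[float]:
    """Absolute rate of glucose change between consecutive readings.

    ASSUMPTION: SGC = |g[i] - g[i-1]| / (t[i] - t[i-1]); the paper's exact
    definition (units, smoothing, gap handling) may differ.
    """
    if len(times_min) != len(glucose):
        raise ValueError("times and glucose must have the same length")
    if len(glucose) < 2:
        raise ValueError("at least two readings are needed")
    speeds = []
    for i in range(1, len(glucose)):
        dt = times_min[i] - times_min[i - 1]
        if dt <= 0:
            raise ValueError("timestamps must be strictly increasing")
        speeds.append(abs(glucose[i] - glucose[i - 1]) / dt)
    return speeds


def speed_indices(times_min: Sequence[float], glucose: Sequence[float]) -> Dict[str, float]:
    """Summarise the SGC series as maxSpeed, meanSpeed and sdSpeed."""
    speeds = glucose_speeds(times_min, glucose)
    # The sample standard deviation needs at least two speed values.
    sd = statistics.stdev(speeds) if len(speeds) > 1 else 0.0
    return {
        "maxSpeed": max(speeds),
        "meanSpeed": statistics.fmean(speeds),
        "sdSpeed": sd,
    }


if __name__ == "__main__":
    # Made-up readings (minutes, mg/dL) for illustration only.
    print(speed_indices([0, 5, 10, 15], [100, 110, 105, 135]))
```

Under this assumed definition, maxSpeed is driven by the single steepest interval in the trace, whereas meanSpeed and sdSpeed summarise the whole series.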

Submitted 14 June, 2025; originally announced June 2025.

arXiv:2506.12107 [pdf]

Network Pharmacology Reveals HSPA1A/BST2 as Potential Targets of Ci Bai Capsule's Active Compounds Intervening in Leukopenia

Authors: Dingfan Zhang, Congshu Huang, Lei Zhou, Boyang Wang, Wei Zhou, Tiantian Xia, Pan Shen, Shao Li, Yue Gao

Abstract: Background: Radiation-induced leukopenia caused by low-dose exposure is frequently associated with Traditional Chinese Medicine (TCM) syndromes like "blood deficiency" and "fatigue syndrome". Ci Bai Capsule (CB) has been reported to enhance white blood cell levels; however, its mechanisms and bioactive compounds remain unclear. Aim: This study aimed to identify the bioactive compounds group of CB and elucidate its potential mechanisms in radiation-induced leukopenia. Methods: Syndrome-related data were gathered from the SYMMAP and CTD databases. CB's target profile was predicted by DrugCIPHER. Network pharmacology approaches were employed to identify active compounds and related pathways. Experimental validation was conducted through flow cytometry and RNA-sequencing in both ex vivo and in vivo models. Results: A total of 22 pathways related to cellular processes, immune responses, and signal transduction were identified. Five key bioactive compounds (kaempferol-3-glucorhamnoside, syringin, schisandrin, 3-hydroxytyrosol 3-O-glucoside and salidroside) were found to significantly modulate syndrome-related pathways. Optimal dosing of this compound combination enhanced leukocyte counts and splenic immune cell proliferation in irradiated mice.
Transcriptomic analysis revealed that the compounds exert regulatory effects on PP1A, RB, CDK4/6, CDK2, and CDK1, thereby modulating downstream immune and hematopoietic markers such as MNDA, BST2, and HSPA1A. Conclusion: Our findings suggest that CB mitigates radiation-induced leukopenia by enhancing immune and hematopoietic recovery, offering a promising therapeutic approach for managing radiation-related hematological disorders.

Submitted 13 June, 2025; originally announced June 2025.

arXiv:2504.11454 [pdf, ps, other]

Elucidating the Design Space of Multimodal Protein Language Models

Authors: Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang, Dongyu Xue, Fei Ye, Shujian Huang, Zaixiang Zheng, Quanquan Gu

Abstract: Multimodal protein language models (PLMs) integrate sequence and token-based structural information, serving as a powerful foundation for protein modeling, generation, and design. However, the reliance on tokenizing 3D structures into discrete tokens causes substantial loss of fidelity about fine-grained structural details and correlations. In this paper, we systematically elucidate the design space of multimodal PLMs to overcome their limitations. We identify tokenization loss and inaccurate structure token predictions by the PLMs as major bottlenecks. To address these, our proposed design space covers improved generative modeling, structure-aware architectures and representation learning, and data exploration. Our advancements approach finer-grained supervision, demonstrating that token-based multimodal PLMs can achieve robust structural modeling. The effective design methods dramatically improve the structure generation diversity, and notably, folding abilities of our 650M model by reducing the RMSD from 5.52 to 2.36 on PDB testset, even outperforming 3B baselines and on par with the specialized folding models. Project page and code: https://bytedance.github.io/dplm/dplm-2.1/.

Submitted 11 June, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

Comments: ICML 2025 Spotlight; Project Page: https://bytedance.github.io/dplm/dplm-2.1/

arXiv:2504.02488 [pdf, other]

A Behaviour and Disease Model of Testing and Isolation

Authors: Matthew Ryan, Roslyn I. Hickson, Edward M. Hill, Thomas House, Valerie Isham, Dongni Zhang, Mick G. Roberts

Abstract: There has been interest in the interactions between infectious disease dynamics and behaviour for most of the history of mathematical epidemiology. This has included consideration of which mathematical models best capture each phenomenon, as well as their interaction, but typically in a manner that is agnostic to the exact behaviour in question. Here, we investigate interacting behaviour and disease dynamics specifically related to behaviours around testing and isolation. This epidemiological-behavioural interaction is of particular interest as, prospectively, it is well-placed to be informed by real-world data temporally monitoring test results and compliance with testing policy. To carry out our investigation we extend an existing "behaviour and disease" (BaD) model by incorporating the dynamics of symptomatic testing and isolation. We provide a dynamical systems analysis of the ordinary differential equations that define this model, providing theoretical results on its behaviour early in a new outbreak (particularly its basic reproduction number) and endemicity of the system (its steady states and associated stability criteria). We then supplement these findings with a numerical analysis to inform how temporal and cumulative outbreak metrics depend on the model parameter values for epidemic and endemic regimes.
As the presented interdisciplinary modelling approach can accommodate further extensions (including, but not limited to, adding testing capacity, decay in behavioural effects and multiple pathogen variants), we hope that our work will encourage further modelling studies integrating specific measured behaviours and disease dynamics that may reduce the health and economic impacts of future epidemics.

Submitted 3 April, 2025; originally announced April 2025.

Comments: 22 pages, 10 figures

MSC Class: 92D30; 05C80; 34A45

arXiv:2502.17213 [pdf, other]

Deep Learning-Powered Electrical Brain Signals Analysis: Advancing Neurological Diagnostics

Authors: Jiahe Li, Xin Chen, Fanqi Shen, Junru Chen, Yuxin Liu, Daoze Zhang, Zhizhang Yuan, Fang Zhao, Meng Li, Yang Yang

Abstract: Neurological disorders represent significant global health challenges, driving the advancement of brain signal analysis methods. Scalp electroencephalography (EEG) and intracranial electroencephalography (iEEG) are widely used to diagnose and monitor neurological conditions. However, dataset heterogeneity and task variations pose challenges in developing robust deep learning solutions. This review systematically examines recent advances in deep learning approaches for EEG/iEEG-based neurological diagnostics, focusing on applications across 7 neurological conditions using 46 datasets. We explore trends in data utilization, model design, and task-specific adaptations, highlighting the importance of pre-trained multi-task models for scalable, generalizable solutions. To advance research, we propose a standardized benchmark for evaluating models across diverse datasets to enhance reproducibility. This survey emphasizes how recent innovations can transform neurological diagnostics and enable the development of intelligent, adaptable healthcare solutions.

Submitted 24 February, 2025; originally announced February 2025.

arXiv:2501.11291 [pdf]

Multilineage-differentiating stress-enduring cells alleviate neuropathic pain in mice by secreting TGF-b and IL-10

Authors: Yayu Zhao, Ying Fei, Yunyun Cai, Zhongya Wei, Ying Chen, Yuhua Ji, Xue Chen, Dongmei Zhang, Gang Chen

Abstract: Neuropathic pain is a chronic condition characterized by damage to and dysfunction of the peripheral or central nervous system. There are currently no effective treatment options available for neuropathic pain, and existing drugs often provide only temporary relief with potential side effects. Multilineage-differentiating stress-enduring (Muse) cells are characterized by high expansion potential, a stable phenotype and strong immunosuppression. These properties make them attractive candidates for therapeutics for neuropathic pain management. In this study, we conducted a series of experiments to evaluate the effect of Muse cells on neuropathic pain. Muse cells from different species demonstrated analgesic potential by reversing CCI-induced neuropathic pain. Protein profiling revealed a high degree of similarity between Muse cells and BMSCs. The intrathecal injection of Muse cells effectively reduced neuropathic pain in various mouse models, resulting in better analgesic effects than the administration of equivalent low doses of BMSCs. Immunohistochemical analysis and qPCR revealed the ability of Muse cells to inhibit spinal cord neuroinflammation caused by SNI. In addition, Transwell and ELISA revealed that Muse cells migrated through the injured dorsal root ganglion (DRG) via the CCR7-CCL21 chemotactic axis. Furthermore, the secretion of TGF-b and IL-10 by Muse cells was identified as the mechanism underlying their analgesic effect.
The capacity of Muse cells to mitigate neuroinflammation and produce analgesic effects via the modulation of TGF-b and IL-10 underscores their potential as a promising therapeutic approach for the treatment of neuropathic pain.

Submitted 20 January, 2025; originally announced January 2025.

arXiv:2412.19191 [pdf, other]

Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models

Authors: Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye

Abstract: Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-related instruction-tuning dataset including DNA, RNA, proteins, and multi-molecules, designed to bridge the gap between large language models (LLMs) and complex biological sequences-related tasks. This dataset can enhance the versatility of LLMs by integrating diverse biological sequence-based prediction tasks with advanced reasoning capabilities, while maintaining conversational fluency. Additionally, we reveal significant performance limitations in even state-of-the-art LLMs on biological sequence-related multi-omics tasks without specialized pre-training and instruction-tuning. We further develop a strong baseline called ChatMultiOmics with a novel three-stage training pipeline, demonstrating the powerful ability to understand biology by using Biology-Instructions. Biology-Instructions and ChatMultiOmics are publicly available and crucial resources for enabling more effective integration of LLMs with multi-omics sequence analysis.

Submitted 26 December, 2024; originally announced December 2024.

arXiv:2411.14721 [pdf, other]

MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts

Authors: Jiatong Li, Yunqing Liu, Wei Liu, Jingdi Le, Di Zhang, Wenqi Fan, Dongzhan Zhou, Yuqiang Li, Qing Li

Abstract: Molecule discovery is a pivotal research field, impacting everything from the medicines we take to the materials we use. Recently, Large Language Models (LLMs) have been widely adopted in molecule understanding and generation, yet the alignments between molecules and their corresponding captions remain a significant challenge. Previous endeavours often treat the molecule as a general SMILES string or molecular graph, neglecting the fine-grained alignments between the molecular sub-structures and the descriptive textual phrases, which are crucial for accurate and explainable predictions. In this case, we introduce MolReFlect, a novel teacher-student framework designed to contextually perform the molecule-caption alignments in a fine-grained way. Our approach initially leverages a larger teacher LLM to label the detailed alignments by directly extracting critical phrases from molecule captions or SMILES strings and implying them to corresponding sub-structures or characteristics. To refine these alignments, we propose In-Context Selective Reflection, which retrieves previous extraction results as context examples for teacher LLM to reflect and lets a smaller student LLM select from in-context reflection and previous extraction results. Finally, we enhance the learning process of the student LLM through Chain-of-Thought In-Context Molecule Tuning, integrating the fine-grained alignments and the reasoning processes within the Chain-of-Thought format.
Our experimental results demonstrate that MolReFlect enables LLMs like Mistral-7B to significantly outperform the previous baselines, achieving SOTA performance on the ChEBI-20 dataset. This advancement not only enhances the generative capabilities of LLMs in the molecule-caption translation task, but also contributes to a more explainable framework.

Submitted 21 November, 2024; originally announced November 2024.

Comments: 22 pages, 12 figures

arXiv:2411.11668 [pdf, other]

Efficient and Robust Continual Graph Learning for Graph Classification in Biology

Authors: Ding Zhang, Jane Downer, Can Chen, Ren Wang

Abstract: Graph classification is essential for understanding complex biological systems, where molecular structures and interactions are naturally represented as graphs. Traditional graph neural networks (GNNs) perform well on static tasks but struggle in dynamic settings due to catastrophic forgetting. We present Perturbed and Sparsified Continual Graph Learning (PSCGL), a robust and efficient continual graph learning framework for graph data classification, specifically targeting biological datasets. We introduce a perturbed sampling strategy to identify critical data points that contribute to model learning and a motif-based graph sparsification technique to reduce storage needs while maintaining performance. Additionally, our PSCGL framework inherently defends against graph backdoor attacks, which is crucial for applications in sensitive biological contexts. Extensive experiments on biological datasets demonstrate that PSCGL not only retains knowledge across tasks but also enhances the efficiency and robustness of graph classification models in biology.

Submitted 18 November, 2024; originally announced November 2024.

arXiv:2411.08306 [pdf, other]

Evaluating Molecule Synthesizability via Retrosynthetic Planning and Reaction Prediction

Authors: Songtao Liu, Dandan Zhang, Zhengkai Tu, Hanjun Dai, Peng Liu

Abstract: A significant challenge in wet lab experiments with current drug design generative models is the trade-off between pharmacological properties and synthesizability. Molecules predicted to have highly desirable properties are often difficult to synthesize, while those that are easily synthesizable tend to exhibit less favorable properties. As a result, evaluating the synthesizability of molecules in general drug design scenarios remains a significant challenge in the field of drug discovery. The commonly used synthetic accessibility (SA) score aims to evaluate the ease of synthesizing generated molecules, but it falls short of guaranteeing that synthetic routes can actually be found. Inspired by recent advances in top-down synthetic route generation and forward reaction prediction, we propose a new, data-driven metric to evaluate molecule synthesizability. This novel metric leverages the synergistic duality between retrosynthetic planners and reaction predictors, both of which are trained on extensive reaction datasets. To demonstrate the efficacy of our metric, we conduct a comprehensive evaluation of round-trip scores across a range of representative molecule generative models.

Submitted 3 April, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

arXiv:2411.05825 [pdf, other]

SurfGNN: A robust surface-based prediction model with interpretability for coactivation maps of spatial and cortical features

Authors: Zhuoshuo Li, Jiong Zhang, Youbing Zeng, Jiaying Lin, Dan Zhang, Jianjia Zhang, Duan Xu, Hosung Kim, Bingguang Liu, Mengting Liu

Abstract: Current brain surface-based prediction models often overlook the variability of regional attributes at the cortical feature level. While graph neural networks (GNNs) excel at capturing regional differences, they encounter challenges when dealing with complex, high-density graph structures. In this work, we consider the cortical surface mesh as a sparse graph and propose an interpretable prediction model, the Surface Graph Neural Network (SurfGNN). SurfGNN employs topology-sampling learning (TSL) and region-specific learning (RSL) structures to manage individual cortical features at both lower and higher scales of the surface mesh, effectively tackling the challenges posed by the overly abundant mesh nodes and addressing the issue of heterogeneity in cortical regions. Building on this, a novel score-weighted fusion (SWF) method is implemented to merge nodal representations associated with each cortical feature for prediction. We apply our model to a neonatal brain age prediction task using a dataset of harmonized MR images from 481 subjects (503 scans). SurfGNN outperforms all existing state-of-the-art methods, demonstrating an improvement of at least 9.0% and achieving a mean absolute error (MAE) of 0.827±0.056 in postmenstrual weeks. Furthermore, it generates feature-level activation maps, indicating its capability to identify robust regional variations in different morphometric contributions for prediction.

Submitted 5 November, 2024; originally announced November 2024.

Comments: 15 pages, 6 figures

ACM Class: J.3

arXiv:2411.04568 [pdf, other]

Dynamic-Attention-based EEG State Transition Modeling for Emotion Recognition

Authors: Xinke Shen, Runmin Gan, Kaixuan Wang, Shuyi Yang, Qingzhu Zhang, Quanying Liu, Dan Zhang, Sen Song

Abstract: Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architectures have significantly improved the performance of EEG emotion decoding. However, existing methods still fall short of fully capturing the complex spatiotemporal dynamics of neural signals, which are crucial for representing emotion processing. This study proposes a Dynamic-Attention-based EEG State Transition (DAEST) modeling method to characterize EEG spatiotemporal dynamics. The model extracts spatiotemporal components of EEG that represent multiple parallel neural processes and estimates dynamic attention weights on these components to capture transitions in brain states. The model is optimized within a contrastive learning framework for cross-subject emotion recognition. The proposed method achieved state-of-the-art performance on three publicly available datasets: FACED, SEED, and SEED-V. It achieved 75.4% accuracy in the binary classification of positive and negative emotions and 59.3% in nine-class discrete emotion classification on the FACED dataset, 88.1% in the three-class classification of positive, negative, and neutral emotions on the SEED dataset, and 73.6% in five-class discrete emotion classification on the SEED-V dataset. The learned EEG spatiotemporal patterns and dynamic transition properties offer valuable insights into neural dynamics underlying emotion processing.

Submitted 7 November, 2024; originally announced November 2024.

Comments: 14 pages, 6 figures

arXiv:2410.05292 [pdf, other]

CaLMFlow: Volterra Flow Matching using Causal Language Models

Authors: Sizhuang He, Daniel Levine, Ivan Vrkic, Marco Francesco Bressana, David Zhang, Syed Asad Rizvi, Yangtian Zhang, Emanuele Zappala, David van Dijk

Abstract: We introduce CaLMFlow (Causal Language Models for Flow Matching), a novel framework that casts flow matching as a Volterra integral equation (VIE), leveraging the power of large language models (LLMs) for continuous data generation. CaLMFlow enables the direct application of LLMs to learn complex flows by formulating flow matching as a sequence modeling task, bridging discrete language modeling and continuous generative modeling. Our method implements tokenization across space and time, thereby solving a VIE over these domains. This approach enables efficient handling of high-dimensional data and outperforms ODE solver-dependent methods like conditional flow matching (CFM). We demonstrate CaLMFlow's effectiveness on synthetic and real-world data, including single-cell perturbation response prediction, showcasing its ability to incorporate textual context and generalize to unseen conditions. Our results highlight LLM-driven flow matching as a promising paradigm in generative modeling, offering improved scalability, flexibility, and context-awareness.

Submitted 3 October, 2024; originally announced October 2024.

Comments: 10 pages, 9 figures, 7 tables

arXiv:2410.00327 [pdf, other]

EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics

Authors: Chenqing Hua, Yong Liu, Dinghuai Zhang, Odin Zhang, Sitao Luan, Kevin K. Yang, Guy Wolf, Doina Precup, Shuangjia Zheng

Abstract: Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology. Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions, particularly in catalytic processes. To address the challenges, we introduce EnzymeFlow, a generative model that employs flow matching with hierarchical pre-training and enzyme-reaction co-evolution to generate catalytic pockets for specific substrates and catalytic reactions. Additionally, we introduce a large-scale, curated, and validated dataset of enzyme-reaction pairs, specifically designed for the catalytic pocket generation task, comprising a total of 328,192 pairs. By incorporating evolutionary dynamics and reaction-specific adaptations, EnzymeFlow becomes a powerful model for designing enzyme pockets, which is capable of catalyzing a wide range of biochemical reactions. Experiments on the new dataset demonstrate the model's effectiveness in designing high-quality, functional enzyme catalytic pockets, paving the way for advancements in enzyme engineering and synthetic biology. We provide EnzymeFlow code at https://github.com/WillHua127/EnzymeFlow with notebook demonstration at https://github.com/WillHua127/EnzymeFlow/blob/main/enzymeflow_demo.ipynb.

Submitted 30 September, 2024; originally announced October 2024.

arXiv:2407.11622 [pdf, other]

Sideward contact tracing in an epidemic model with mixing groups

Authors: Dongni Zhang, Martina Favero

Abstract: We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the early phase of the epidemic is approximated by a branching process with sibling dependencies. To address the challenges given by the dependencies, we consider sibling groups (individuals who become infected at the same event) as macro-individuals and define a macro-branching process. This allows us to derive an expression for the effective macro-reproduction number which corresponds to the effective individual reproduction number and represents a threshold for the behaviour of the epidemic. Through numerical examples, we show how the reproduction number varies with the distribution of the mixing event size, the mean size, the rate of diagnosis and the tracing probability.

Submitted 26 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

arXiv:2406.09817 [pdf, other]

Efficient and Precise Force Field Optimization for Biomolecules Using DPA-2

Authors: Junhan Chang, Duo Zhang, Yuqing Deng, Hongrui Lin, Zhirong Liu, Linfeng Zhang, Hang Zheng, Xinyan Wang

Abstract: Molecular simulations are essential tools in computational chemistry, enabling the prediction and understanding of molecular interactions and thermodynamic properties of biomolecules. However, traditional force fields face significant challenges in accurately representing novel molecules and complex chemical environments due to the labor-intensive process of manually setting optimization parameters and the high computational cost of quantum mechanical calculations. To overcome these difficulties, we fine-tuned a high-accuracy DPA-2 pre-trained model and applied it to optimize force field parameters on-the-fly, significantly reducing computational costs. Our method combines this fine-tuned DPA-2 model with a node-embedding-based similarity metric, allowing seamless augmentation to new chemical species without manual intervention. We applied this process to the TYK2 inhibitor and PTP1B systems and demonstrated its effectiveness through the improvement of free energy perturbation calculation results. This advancement contributes valuable insights and tools for the computational chemistry community.

Submitted 14 June, 2024; originally announced June 2024.

arXiv:2404.05329 [pdf]

In silico bioactivity prediction of proteins interacting with graphene-based nanomaterials guides rational design of biosensor

Authors: Jing Ye, Minzhi Fan, Xiaoyu Zhang, Shasha Lu, Mengyao Chai, Yunshan Zhang, Xiaoyu Zhao, Shuang Li, Diming Zhang

Abstract: Graphene based nanomaterials have attracted significant attention for their potentials in biomedical and biotechnology applications in recent years, owing to the outstanding physical and chemical properties. However, the interaction mechanism and impact on biological activity of macro and micro biomolecules still require more concerns and further research in order to enhance their applicability in biosensors, etc. Herein, an integrated method has been developed to predict the protein bioactivity performance when interacting with nanomaterials for protein based biosensor. Molecular dynamics simulation and molecular docking technique were consolidated to investigate several nanomaterials C60 fullerene, single walled carbon nanotube, pristine graphene and graphene oxide, and their effect when interacting with protein. The adsorption behavior, secondary structure changes and protein bioactivity changes were simulated, and the results of protein activity simulation were verified in combination with atomic force spectrum, circular dichroism spectrum fluorescence and electrochemical experiments. The best quantification alignment between bioactivity obtained by simulation and experiment measurements was further explored. The two proteins, RNase A and Exonuclease III, were regarded as analysis model for the proof of concept, and the prediction accuracy of protein bioactivity could reach up to 0.98.

Submitted 8 April, 2024; originally announced April 2024.

arXiv:2403.20163 [pdf, other]

Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning

Authors: Duzhen Zhang, Qingyu Wang, Tielin Zhang, Bo Xu

Abstract: The success of Deep Reinforcement Learning (DRL) is largely attributed to utilizing Artificial Neural Networks (ANNs) as function approximators. Recent advances in neuroscience have unveiled that the human brain achieves efficient reward-based learning, at least by integrating spiking neurons with spatial-temporal dynamics and network topologies with biologically-plausible connectivity patterns. This integration process allows spiking neurons to efficiently combine information across and within layers via nonlinear dendritic trees and lateral interactions. The fusion of these two topologies enhances the network's information-processing ability, crucial for grasping intricate perceptions and guiding decision-making procedures. However, ANNs and brain networks differ significantly. ANNs lack intricate dynamical neurons and only feature inter-layer connections, typically achieved by direct linear summation, without intra-layer connections. This limitation leads to constrained network expressivity. To address this, we propose a novel alternative for function approximator, the Biologically-Plausible Topology improved Spiking Actor Network (BPT-SAN), tailored for efficient decision-making in DRL. The BPT-SAN incorporates spiking neurons with intricate spatial-temporal dynamics and introduces intra-layer connections, enhancing spatial-temporal state representation and facilitating more precise biological simulations. Diverging from the conventional direct linear weighted sum, the BPT-SAN models the local nonlinearities of dendritic trees within the inter-layer connections. For the intra-layer connections, the BPT-SAN introduces lateral interactions between adjacent neurons, integrating them into the membrane potential formula to ensure accurate spike firing.

Submitted 29 March, 2024; originally announced March 2024.

Comments: Work in Progress

arXiv:2402.14213 [pdf]

Contrastive Learning of Shared Spatiotemporal EEG Representations Across Individuals for Naturalistic Neuroscience

Authors: Xinke Shen, Lingyi Tao, Xuyang Chen, Sen Song, Quanying Liu, Dan Zhang

Abstract: Neural representations induced by naturalistic stimuli offer insights into how humans respond to stimuli in daily life. Understanding neural mechanisms underlying naturalistic stimuli processing hinges on the precise identification and extraction of the shared neural patterns that are consistently present across individuals. Targeting the Electroencephalogram (EEG) technique, known for its rich spatial and temporal information, this study presents a framework for Contrastive Learning of Shared SpatioTemporal EEG Representations across individuals (CL-SSTER). CL-SSTER utilizes contrastive learning to maximize the similarity of EEG representations across individuals for identical stimuli, contrasting with those for varied stimuli. The network employed spatial and temporal convolutions to simultaneously learn the spatial and temporal patterns inherent in EEG. The versatility of CL-SSTER was demonstrated on three EEG datasets, including a synthetic dataset, a natural speech comprehension EEG dataset, and an emotional video watching EEG dataset. CL-SSTER attained the highest inter-subject correlation (ISC) values compared to the state-of-the-art ISC methods. The latent representations generated by CL-SSTER exhibited reliable spatiotemporal EEG patterns, which can be explained by properties of the naturalistic stimuli. CL-SSTER serves as an interpretable and scalable framework for the identification of inter-subject shared neural representations in naturalistic neuroscience.

Submitted 13 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

Comments: 54 pages, 17 figures

arXiv:2402.13392 [pdf, other]

An SEIR network epidemic model with manual and digital contact tracing allowing delays

Authors: Dongni Zhang, Tom Britton

Abstract: We consider an SEIR epidemic model on a network also allowing random contacts, where recovered individuals could either recover naturally or be diagnosed. Upon diagnosis, manual contact tracing is triggered such that each infected network contact is reported, tested and isolated with some probability and after a random delay. Additionally, digital tracing (based on a tracing app) is triggered if the diagnosed individual is an app-user, and then all of its app-using infectees are immediately notified and isolated. The early phase of the epidemic with manual and/or digital tracing is approximated by different multi-type branching processes, and three respective reproduction numbers are derived. The effectiveness of both contact tracing mechanisms is numerically quantified through the reduction of the reproduction number. This shows that app-using fraction plays an essential role in the overall effectiveness of contact tracing. The relative effectiveness of manual tracing compared to digital tracing increases if: more of the transmission occurs on the network, when the tracing delay is shortened, and when the network degree distribution is heavy-tailed. For realistic values, the combined tracing case can reduce $R_0$ by $20-30\%$, so other preventive measures are needed to reduce the reproduction number down to $1.2-1.4$ for contact tracing to make it successful in avoiding big outbreaks.

Submitted 5 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.
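
The threshold quoted in the abstract above can be sanity-checked with a line of arithmetic. Assuming, as a simplification that is not taken from the paper, that tracing scales the reproduction number by a factor $(1-\rho)$, the traced epidemic is subcritical when $R_0(1-\rho) < 1$, that is, when $R_0 < 1/(1-\rho)$. The function name `max_controllable_r0` below is hypothetical and serves only this illustration.

```python
def max_controllable_r0(reduction: float) -> float:
    """Largest R0 that a proportional reduction can bring down to 1.

    Assumes tracing multiplies R0 by (1 - reduction); this is an
    illustrative simplification, not the model analysed in the paper.
    """
    if not 0.0 <= reduction < 1.0:
        raise ValueError("reduction must lie in [0, 1)")
    return 1.0 / (1.0 - reduction)


# Evaluate the bound for the 20% and 30% reductions quoted in the abstract.
for reduction in (0.20, 0.30):
    bound = max_controllable_r0(reduction)
    print(f"{reduction:.0%} reduction -> R0 must be below {bound:.2f}")
```

Under this assumption the bounds come out at about 1.25 and 1.43, which is broadly in line with the $1.2-1.4$ range stated in the abstract.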

arXiv:2310.08774 [pdf, other]

PhyloGFN: Phylogenetic inference with generative flow networks

Authors: Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

Abstract: Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt the framework of generative flow networks (GFlowNets) to tackle two core problems in phylogenetics: parsimony-based and Bayesian phylogenetic inference. Because GFlowNets are well-suited for sampling complex combinatorial structures, they are a natural choice for exploring and sampling from the multimodal posterior distribution over tree topologies and evolutionary distances. We demonstrate that our amortized posterior sampler, PhyloGFN, produces diverse and high-quality evolutionary hypotheses on real benchmark datasets. PhyloGFN is competitive with prior works in marginal likelihood estimation and achieves a closer fit to the target distribution than state-of-the-art variational inference methods. Our code is available at https://github.com/zmy1116/phylogfn.

Submitted 24 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

arXiv:2307.05628 [pdf, other]

DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

Authors: Daoan Zhang, Weitong Zhang, Yu Zhao, Jianguo Zhang, Bing He, Chenchen Qin, Jianhua Yao

Abstract: Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge. To address this, we propose DNAGPT, a generalized DNA pre-training model trained on over 200 billion base pairs from all mammals. By enhancing the classic GPT model with a binary classification task (DNA sequence order), a numerical regression task (guanine-cytosine content prediction), and a comprehensive token language, DNAGPT can handle versatile DNA analysis tasks while processing both sequence and numerical data. Our evaluation of genomic signal and region recognition, mRNA abundance regression, and artificial genomes generation tasks demonstrates DNAGPT's superior performance compared to existing models designed for specific downstream tasks, benefiting from pre-training using the newly designed model structure.

Submitted 30 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.
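
The guanine-cytosine content mentioned in the abstract above as a regression target is simply the fraction of G and C bases in a sequence. The sketch below only illustrates that quantity. It is not DNAGPT's pipeline, the function name `gc_content` is hypothetical, and ignoring ambiguous symbols such as `N` is an assumption made here.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C among the A/C/G/T bases of a DNA sequence.

    Symbols other than A, C, G and T (for example N) are ignored; that
    convention is an assumption of this sketch.
    """
    seq = sequence.upper()
    counted = sum(seq.count(base) for base in "ACGT")
    if counted == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return (seq.count("G") + seq.count("C")) / counted


# Two of the four bases in ATGC are G or C.
print(gc_content("ATGC"))  # 0.5
```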

arXiv:2305.19544 [pdf, other]

A data-driven analysis on the mediation effect of compartment models between control measures and COVID-19 epidemics

Authors: Dongyan Zhang, Wuyue Yang, Wanqi Wen, Liangrong Peng, Changjingn Zhuge, Liu Hong

Abstract: We make a retrospective review on various control measures taken by 127 countries/territories during the first wave of COVID-19 pandemic until July 7, 2020, and evaluate their impacts on the epidemic dynamics quantitatively. The SEIR-QD model, as a representative for general compartment models, is used to fit the epidemic data, enabling the extraction of crucial model parameters and dynamical features. The mediation effect of the SEIR-QD model is revealed by using the mediation analysis with structure equation modeling for multiple mediators operating in parallel. The inherent impacts of these control policies on the transmission dynamics of COVID-19 epidemics are clarified, and compared with results derived from both multiple linear regression and neural-network-based nonlinear regression. Through this data-driven analysis, the mediation effect of compartment models is confirmed, which provides a better understanding on the intrinsic correlations among the strength of control measures and the dynamical features of COVID-19 epidemics.

Submitted 22 September, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

Comments: 21 pages, 6 figures, 1 tables

MSC Class: 92-10

arXiv:2305.04931 [pdf]

Network pharmacology on the mechanism of Yi Qi Tong Qiao Pill inhibiting allergic rhinitis

Authors: Boyang Wang, DingFan Zhang, Tingyu Zhang, Chayanis Sutcharitchan, Jianlin Hua, Dongfang Hua, Bo Zhang, Shao Li

Abstract: Objective: The purpose of this study is to reveal the mechanism of action of Yi Qi Tong Qiao Pill (YQTQP) in the treatment of allergic rhinitis (AR), as well as establish a paradigm for the researches on traditional Chinese medicine (TCM) from systematic perspective. Methods: Based on the data collected from TCM-related and disease-related databases, target profiles of compounds in YQTQP were calculated through network-based algorithms and holistic targets of YQTQP were constructed. Network target analysis was performed to explore the potential mechanisms of YQTQP in the treatment of AR and the mechanisms were classified into different modules according to their biological functions. Besides, animal and clinical experiments were conducted to validate our findings inferred from Network target analysis. Results: Network target analysis showed that YQTQP targeted 12 main pathways or biological processes related to AR, represented by those related to IL-4, IFN-γ, TNF-α and IL-13. These results could be classified into 3 biological modules, including regulation of immune and inflammation, epithelial barrier disorder and cell adhesion. Finally, a series of experiments composed of animal and clinical experiments, proved our findings and confirmed that YQTQP could improve related symptoms of AR, like permeability of nasal mucosa epithelium. Conclusion: A combination of Network target analysis and the experimental validation indicated that YQTQP was effective in the treatment of AR and might provide a new insight on revealing the mechanism of TCM against diseases.

Submitted 21 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

Comments: 25 pages, 6 figures

MSC Class: None

arXiv:2301.11356 [pdf, other]

The Automated Discovery of Kinetic Rate Models -- Methodological Frameworks

Authors: Miguel Ángel de Carvalho Servia, Ilya Orson Sandoval, Klaus Hellgardt, King Kuok, Hii, Dongda Zhang, Ehecatl Antonio del Rio Chanona

Abstract: The industrialization of catalytic processes requires reliable kinetic models for their design, optimization and control. Mechanistic models require significant domain knowledge, while data-driven and hybrid models lack interpretability. Automated knowledge discovery methods, such as ALAMO (Automated Learning of Algebraic Models for Optimization), SINDy (Sparse Identification of Nonlinear Dynamics), and genetic programming, have gained popularity but suffer from limitations such as needing model structure assumptions, exhibiting poor scalability, and displaying sensitivity to noise. To overcome these challenges, we propose two methodological frameworks, ADoK-S and ADoK-W (Automated Discovery of Kinetic rate models using a Strong/Weak formulation of symbolic regression), for the automated generation of catalytic kinetic models using a robust criterion for model selection. We leverage genetic programming for model generation and a sequential optimization routine for model refinement. The frameworks are tested against three case studies of increasing complexity, demonstrating their ability to retrieve the underlying kinetic rate model with limited noisy data from the catalytic systems, showcasing their potential for chemical reaction engineering applications.

Submitted 2 November, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

arXiv:2211.12869 [pdf, other]

doi 10.1017/apr.2025.15

Epidemic models with digital and manual contact tracing

Authors: Tom Britton, Dongni Zhang

Abstract: We analyze a Markovian SIR epidemic model where individuals either recover naturally or are diagnosed, leading to isolation and potential contact tracing. Our focus is on digital contact tracing via a tracing app, considering both its standalone use and combination with manual tracing. We prove that as the population size $n$ grows large, the epidemic process converges to a limiting process, which, unlike typical epidemic models, is not a branching process due to dependencies created by contact tracing. However, by grouping to-be-traced individuals into macro-individuals, we derive a multi-type branching process interpretation, allowing computation of the reproduction number $R$. This is then converted to an individual reproduction number $R^{(ind)}$, which, contrary to $R$, decays monotonically with the fraction of app-users while both share the same threshold at 1. Finally, we compare digital (only) contact tracing and manual (only) contact tracing, proving that the critical fraction app-users $π_c$ required for $R=1$ is higher than the critical fraction manually contact traced $p_c$ for manual tracing.

Submitted 27 March, 2025; v1 submitted 23 November, 2022; originally announced November 2022.

arXiv:2209.14664 [pdf]

doi 10.1016/j.drudis.2023.103737

Causal inference in drug discovery and development

Authors: Tom Michoel, Jitao David Zhang

Abstract: To discover new drugs is to seek and to prove causality. As an emerging approach leveraging human knowledge and creativity, data, and machine intelligence, causal inference holds the promise of reducing cognitive bias and improving decision making in drug discovery. While it has been applied across the value chain, the concepts and practice of causal inference remain obscure to many practitioners. This article offers a non-technical introduction to causal inference, reviews its recent applications, and discusses opportunities and challenges of adopting the causal language in drug discovery and development.

Submitted 29 September, 2022; originally announced September 2022.

arXiv:2207.03288 [pdf]

Further analysis of metagenomic datasets containing GD and GX pangolin CoVs indicates widespread contamination, undermining pangolin host attribution

Authors: Adrian Jones, Steven E. Massey, Daoyu Zhang, Yuri Deigin, Steven C. Quay

Abstract: The only animals other than bats reported to have been infected with SARS-CoV-2-related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic are pangolins. In early 2020 multiple papers reported the identification of two clades of SARS2r-CoVs, GD and GX, infecting pangolins. However the RNA-Seq datasets supporting pangolin genome assembly were widely contaminated, contained synthetic vectors or were heavily enriched or filtered with little but coronavirus sequences left in the datasets. Here we investigate two pangolin fecal samples sequenced by Li et al. (2021) provided in support of GD PCoV infection of pangolins in Guangdong and find the read distribution consistent with PCR amplicon contamination and SARS-CoV-2 contamination, and further identify the presence of synthetic plasmid sequences. We also build upon our previous work to further analyze the dataset GX/P3B by Lam et al. (2020), which is the only non enriched/heavily filtered pangolin tissue dataset sequenced by Lam et al. (2020). We identify synthetic vectors and confirm human genomic origin samples in the dataset. Finally, we find human mitochondrial sequences in all pangolin organ datasets and mouse and tiger mitochondrial sequences in selected pangolin organ datasets sequenced by Liu et al. (2019). We infer that human and mouse genomic origin sequences were probably sourced from contamination prior to sequencing, while tiger origin sequence contamination may have occurred due to index hopping during sequencing. These observations are problematic for attributing pangolins as SARS2r-CoV hosts in the datasets examined. The forensic methods developed and used here can be applied to examine any third party SRA data sets.

Submitted 11 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

Comments: 46 pages, 32 figures

arXiv:2206.04349 [pdf, other]

doi 10.1016/j.neucom.2020.10.117

Deep radiomic signature with immune cell markers predicts the survival of glioma patients

Authors: Ahmad Chaddad, Paul Daniel Mingli Zhang, Saima Rathore, Paul Sargos, Christian Desrosiers, Tamim Niazi

Abstract: Imaging biomarkers offer a non-invasive way to predict the response of immunotherapy prior to treatment. In this work, we propose a novel type of deep radiomic features (DRFs) computed from a convolutional neural network (CNN), which capture tumor characteristics related to immune cell markers and overall survival. Our study uses four MRI sequences (T1-weighted, T1-weighted post-contrast, T2-weighted and FLAIR) with corresponding immune cell markers of 151 patients with brain tumor. The proposed method extracts a total of 180 DRFs by aggregating the activation maps of a pre-trained 3D-CNN within labeled tumor regions of MRI scans. These features offer a compact, yet powerful representation of regional texture encoding tissue heterogeneity. A comprehensive set of experiments is performed to assess the relationship between the proposed DRFs and immune cell markers, and measure their association with overall survival. Results show a high correlation between DRFs and various markers, as well as significant differences between patients grouped based on these markers. Moreover, combining DRFs, clinical features and immune cell markers as input to a random forest classifier helps discriminate between short and long survival outcomes, with AUC of 72\% and p=2.36$\times$10$^{-5}$. These results demonstrate the usefulness of proposed DRFs as non-invasive biomarker for predicting treatment response in patients with brain tumors.

Submitted 9 June, 2022; originally announced June 2022.

Journal ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375

arXiv:2203.04115 [pdf, other]

Biological Sequence Design with GFlowNets

Authors: Moksh Jain, Emmanuel Bengio, Alex-Hernandez Garcia, Jarrid Rector-Brooks, Bonaventure F. P. Dossou, Chanakya Ekbote, Jie Fu, Tianyu Zhang, Micheal Kilgour, Dinghuai Zhang, Lena Simine, Payel Das, Yoshua Bengio

Abstract: Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key consideration in the ideation phase. In this work, we propose an active learning algorithm leveraging epistemic uncertainty estimation and the recently proposed GFlowNets as a generator of diverse candidate solutions, with the objective to obtain a diverse batch of useful (as defined by some utility function, for example, the predicted anti-microbial activity of a peptide) and informative candidates after each round. We also propose a scheme to incorporate existing labeled datasets of candidates, in addition to a reward function, to speed up learning in GFlowNets. We present empirical results on several biological sequence design tasks, and we find that our method generates more diverse and novel batches with high scoring candidates compared to existing approaches.

Submitted 24 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

Comments: ICML 2022. 15 pages, 3 figures. Code available at: https://github.com/MJ10/BioSeq-GFN-AL. Updated GFP results

arXiv:2110.07220 [pdf, other]

Analysing the Effect of Test-and-Trace Strategy in an SIR Epidemic Model

Authors: Dongni Zhang, Tom Britton

Abstract: Consider a Markovian SIR epidemic model in a homogeneous community. To this model we add a rate at which individuals are tested, and once an infectious individual tests positive it is isolated and each of their contacts are traced and tested independently with some fixed probability. If such a traced individual tests positive it is isolated, and the contact tracing is iterated. This model is analysed using large population approximations, both for the early stage of the epidemic when the "to-be-traced components" of the epidemic behaves like a branching process, and for the main stage of the epidemic where the process of to-be-traced components converges to a deterministic process defined by a system of differential equations. These approximations are used to quantify the effect of testing and of contact tracing on the effective reproduction numbers (for the components as well as for the individuals), the probability of a major outbreak, and the final fraction getting infected. Using numerical illustrations when rates of infection and natural recovery are fixed, it is shown that Test-and-Trace strategy is effective in reducing the reproduction number. Surprisingly, the reproduction number for the branching process of components is not monotonically decreasing in the tracing probability, but the individual reproduction number is conjectured to be monotonic as expected. Further, in the situation where individuals also self-report for testing, the tracing probability is more influential than the screening rate (measured by the fraction infected being screened).

Submitted 5 August, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

arXiv:2110.03372 [pdf, other]

Unifying Likelihood-free Inference with Black-box Optimization and Beyond

Authors: Dinghuai Zhang, Jie Fu, Yoshua Bengio, Aaron Courville

Abstract: Black-box optimization formulations for biological sequence design have drawn recent attention due to their promising potential impact on the pharmaceutical industry. In this work, we propose to unify two seemingly distinct worlds: likelihood-free inference and black-box optimization, under one probabilistic framework. In tandem, we provide a recipe for constructing various sequence design methods based on this framework. We show how previous optimization approaches can be "reinvented" in our framework, and further propose new probabilistic black-box optimization algorithms. Extensive experiments on sequence design application illustrate the benefits of the proposed methodology.

Submitted 8 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

Comments: ICLR 2022 spotlight

arXiv:2109.09559 [pdf]

doi 10.1109/TAFFC.2022.3164516

Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

Authors: Xinke Shen, Xianggen Liu, Xin Hu, Dan Zhang, Sen Song

Abstract: EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Alignment (CLISA) to tackle the cross-subject emotion recognition problem. Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones. Specifically, a convolutional neural network was applied to learn inter-subject aligned spatiotemporal representations from EEG time series in contrastive learning. The aligned representations were subsequently used to extract differential entropy features for emotion classification. CLISA achieved state-of-the-art cross-subject emotion recognition performance on our THU-EP dataset with 80 subjects and the publicly available SEED dataset with 15 subjects. It could generalize to unseen subjects or unseen emotional stimuli in testing. Furthermore, the spatiotemporal representations learned by CLISA could provide insights into the neural mechanisms of human emotion processing.

Submitted 5 April, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

Comments: 23 pages, 13 figures, journal paper. IEEE Transactions on Affective Computing, 2022

arXiv:2109.09112 [pdf]

Nipah virus vector sequences in COVID-19 patient samples sequenced by the Wuhan Institute of Virology

Authors: Steven C. Quay, Daoyu Zhang, Adrian Jones, Yuri Deigin

Abstract: We report the detection of Nipah virus in an infectious clone format, a BSL4-level pathogen and CDC-designated Bioterrorism Agent, in raw RNA-Seq sequencing reads deposited by the Wuhan Institute of Virology (WIV) produced from five December 2019 patients infected with SARS-CoV-2. Research involving Nipah infectious clones has never been reported to have occurred at the WIV. These patient samples have been previously reported to contain reads from several other viruses: Influenza A, Spodoptera frugiperda rhabdovirus and Nipah. Previous authors have interpreted the presence of these virus sequences as indicative of co-infections of the patients in question by these pathogens or laboratory contamination. However, our analysis shows that NiV genes are encapsulated in synthetic vectors, which we infer was for assembly of a NiV infectious clone. In particular, we document the finding of internal N, P-V-W-C and L protein coding sequences as well as coverage of the G and F genes. Furthermore, the format of Hepatitis D virus ribozyme and T7 terminator downstream of the 5-prime end of the NiV sequence is consistent with truncation required at the end of the genome for a full length infectious clone. This indicates that research at WIV was being conducted on an assembled NiV infectious clone. Contamination of patient sequencing reads by an infectious NiV clone of the highly pathogenic Bangladesh strain could indicate a significant breach of BSL-4 protocols.
We call on WIV to explain the purpose of this research on infectious clones of Nipah Virus, the full chronology of this work, and to explain how and at what stage of sample preparation this contamination occurred.

Submitted 19 September, 2021; originally announced September 2021.

Comments: 16 pages, 4 figures, and supplemental materials

arXiv:2108.08163 [pdf]

Analysis of pangolin metagenomic datasets reveals significant contamination, raising concerns for pangolin CoV host attribution

Authors: Adrian Jones, Daoyu Zhang, Yuri Deigin, Steven C. Quay

Abstract: Metagenomic datasets from pangolin tissue specimens have previously yielded SARS-related coronaviruses which show high homology in their receptor binding domain to SARS-CoV-2, suggesting a potential zoonotic source for this feature of the human virus, possibly via recombination (Liu et al. 2019, Lam et al. 2020, Xiao et al. 2020, Liu et al. 2020). Here we re-examine these published datasets. We report that only a few pangolin samples were found to contain coronavirus reads, and even then in low abundance, while other non-pangolin hosted viruses were present in higher abundance. We also discovered extensive contamination with human, rodent, and other mammalian gene sequences, which was a surprising finding. Furthermore, we uncovered a number of pangolin CoV sequences embedded in standard laboratory cloning vectors, which suggests the pangolin specimens could have been contaminated with sequences derived from synthetic biology experiments. Finally, we discover a third pangolin dataset (He et al. 2022) with low levels of SARSr-CoV sequences and unambiguous extensive contamination of several pangolin samples. For these reasons, we find it unlikely that the pangolins in question had a coronavirus infection while alive, and all current versions of the cited papers claiming a zoonotic infection of pangolins with a SARS-r CoV require substantial corrections and should be retracted until such corrections are made.

Submitted 1 March, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

Comments: 55 pages, 15 figures

arXiv:2104.01533 [pdf]

Unexpected novel Merbecovirus discoveries in agricultural sequencing datasets from Wuhan, China

Authors: Daoyu Zhang, Adrian Jones, Yuri Deigin, Karl Sirotkin, Alejandro Sousa

Abstract: In this study we document the unexpected discovery of multiple coronaviruses and a BSL-3 pathogen in agricultural cotton and rice sequencing datasets. In particular, we have identified a novel HKU5-related Merbecovirus in a cotton dataset sequenced by the Huazhong Agricultural University in 2017. We have also found an infectious clone sequence containing a novel HKU4-related Merbecovirus related to MERS coronavirus in a rice dataset sequenced by the Huazhong Agricultural University in early 2020. Another HKU5-related Merbecovirus, as well as Japanese encephalitis virus, were identified in a cotton dataset sequenced by the Huazhong Agricultural University in 2018. An HKU3-related Betacoronavirus was found in a Mus musculus sequencing dataset from the Wuhan Institute of Virology in 2017. Finally, a SARS-WIV1-like Betacoronavirus was found in a rice dataset sequenced by the Fujian Agriculture and Forestry University in 2017. Using the contaminating reads we have extracted from the above datasets, we were able to assemble complete genomes of two novel coronaviruses which we disclose herein. In light of our findings, we raise concerns about biosafety protocol breaches, as indicated by our discovery of multiple dangerous human pathogens in agricultural sequencing laboratories in Wuhan and Fuzhou City, China.

Submitted 6 June, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

Comments: Supplementary information and data can be found in Zenodo datasets doi: 10.5281/zenodo.4660981, doi: 10.5281/zenodo.4620604, doi: 10.5281/zenodo.4399248

arXiv:2102.03910 [pdf]

An open debate on SARS-CoV-2's proximal origin is long overdue

Authors: Rossana Segreto, Yuri Deigin, Kevin McCairn, Alejandro Sousa, Dan Sirotkin, Karl Sirotkin, Jonathan J. Couey, Adrian Jones, Daoyu Zhang

Abstract: There is a near consensus view that SARS-CoV-2 has a natural zoonotic origin; however, several characteristics of SARS-CoV-2 taken together are not easily explained by a natural zoonotic origin hypothesis. These include: a low rate of evolution in the early phase of transmission; the lack of evidence of recombination events; a high pre-existing binding to human ACE2; a novel furin cleavage site insert; a flat glycan binding domain of the spike protein which conflicts with host evasion survival patterns exhibited by other coronaviruses, and high human and mouse peptide mimicry. Initial assumptions against a laboratory origin, by contrast, have remained unsubstantiated. Furthermore, over a year after the initial outbreak in Wuhan, there is still no clear evidence of zoonotic transfer from a bat or intermediate species. Given the immense social and economic impact of this pandemic, identifying the true origin of SARS-CoV-2 is fundamental to preventing future outbreaks. The search for SARS-CoV-2's origin should include an open and unbiased inquiry into a possible laboratory origin.

Submitted 9 February, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

arXiv:2010.15594 [pdf, other]

Shared Space Transfer Learning for analyzing multi-site fMRI data

Authors: Muhammad Yousefnezhad, Alessandro Selvitella, Daoqiang Zhang, Andrew J. Greenshaw, Russell Greiner

Abstract: Multi-voxel pattern analysis (MVPA) learns predictive models from task-based functional magnetic resonance imaging (fMRI) data, for distinguishing when subjects are performing different cognitive tasks -- e.g., watching movies or making decisions. MVPA works best with a well-designed feature set and an adequate sample size. However, most fMRI datasets are noisy, high-dimensional, expensive to collect, and with small sample sizes. Further, training a robust, generalized predictive model that can analyze homogeneous cognitive tasks provided by multi-site fMRI datasets has additional challenges. This paper proposes the Shared Space Transfer Learning (SSTL) as a novel transfer learning (TL) approach that can functionally align homogeneous multi-site fMRI datasets, and so improve the prediction performance in every site. SSTL first extracts a set of common features for all subjects in each site. It then uses TL to map these site-specific features to a site-independent shared space in order to improve the performance of the MVPA. SSTL uses a scalable optimization procedure that works effectively for high-dimensional fMRI datasets. The optimization procedure extracts the common features for each site by using a single-iteration algorithm and maps these site-specific common features to the site-independent shared space. We evaluate the effectiveness of the proposed method for transferring between various cognitive tasks. Our comprehensive experiments validate that SSTL achieves superior performance to other state-of-the-art analysis techniques.

Submitted 24 October, 2020; originally announced October 2020.

Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. The Supplementary Material: https://www.yousefnezhad.com/publications/NeurIPS2020_Paper4157_SuppMat.zip

arXiv:2010.02012 [pdf, other]

doi 10.1007/s12021-020-09494-4

Deep Representational Similarity Learning for analyzing neural signatures in task-based fMRI dataset

Authors: Muhammad Yousefnezhad, Jeffrey Sawalha, Alessandro Selvitella, Daoqiang Zhang

Abstract: Similarity analysis is one of the crucial steps in most fMRI studies. Representational Similarity Analysis (RSA) can measure similarities of neural signatures generated by different cognitive states. This paper develops Deep Representational Similarity Learning (DRSL), a deep extension of RSA that is appropriate for analyzing similarities between various cognitive tasks in fMRI datasets with a large number of subjects, and high-dimensionality -- such as whole-brain images. Unlike the previous methods, DRSL is not limited by a linear transformation or a restricted fixed nonlinear kernel function -- such as Gaussian kernel. DRSL utilizes a multi-layer neural network for mapping neural responses to linear space, where this network can implement a customized nonlinear transformation for each subject separately. Furthermore, utilizing a gradient-based optimization in DRSL can significantly reduce runtime of analysis on large datasets because it uses a batch of samples in each iteration rather than all neural responses to find an optimal solution. Empirical studies on multi-subject fMRI datasets with various tasks -- including visual stimuli, decision making, flavor, and working memory -- confirm that the proposed method achieves superior performance to other state-of-the-art RSA algorithms.

Submitted 28 September, 2020; originally announced October 2020.

Comments: Neuroinformatics

arXiv:2003.05666 [pdf, other]

Rational evaluation of various epidemic models based on the COVID-19 data of China

Authors: Wuyue Yang, Dongyan Zhang, Liangrong Peng, Changjing Zhuge, Liu Hong

Abstract: In this paper, based on the Akaike information criterion, root mean square error and robustness coefficient, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out. With respect to the outbreak data of COVID-19 epidemics in China, we find that before the inflection point, all models fail to make a reliable prediction. The Logistic function consistently underestimates the final epidemic size, while the Gompertz function overestimates it in all cases. Among the statistical inference methods, the sequential Bayesian and time-dependent reproduction number methods are more accurate at the late stage of an epidemic. The transition-like behavior of the exponential growth method, from underestimation to overestimation with respect to the inflection point, might be useful for constructing a more reliable forecast. Compared to ODE-based SIR, SEIR and SEIR-AHQ models, the SEIR-QD and SEIR-PO models generally show a better performance on studying the COVID-19 epidemics, whose success we believe could be attributed to a proper trade-off between model complexity and fitting accuracy. Our findings not only are crucial for the forecast of COVID-19 epidemics, but also may apply to other infectious diseases.

Submitted 14 September, 2021; v1 submitted 12 March, 2020; originally announced March 2020.

Comments: 25 pages, 5 figures, 2 tables

arXiv:2002.06563 [pdf, other]

Epidemic analysis of COVID-19 in China by dynamical modeling

Authors: Liangrong Peng, Wuyue Yang, Dongyan Zhang, Changjing Zhuge, Liu Hong

Abstract: The outbreak of novel coronavirus-caused pneumonia (COVID-19) in Wuhan has attracted worldwide attention. Here, we propose a generalized SEIR model to analyze this epidemic. Based on the public data of National Health Commission of China from Jan. 20th to Feb. 9th, 2020, we reliably estimate key epidemic parameters and make predictions on the inflection point and possible ending time for 5 different regions. According to optimistic estimation, the epidemics in Beijing and Shanghai will end within two weeks, while for most parts of China, including the majority of cities in Hubei province, the epidemic will be brought under control no later than the middle of March. The situation in Wuhan is still very severe, at least based on public data until Feb. 15th. We expect it to end at the beginning of April. Moreover, by inverse inference, we find that the outbreaks of COVID-19 in Mainland China, Hubei province and Wuhan can all be dated back to the end of December 2019, and that the doubling time is around two days at the early stage.

Submitted 25 June, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

Comments: 11 pages, 6 figures, 1 table

arXiv:2001.02894 [pdf, other]

doi 10.1109/TCDS.2020.2965981

Supervised Hyperalignment for multi-subject fMRI data alignment

Authors: Muhammad Yousefnezhad, Alessandro Selvitella, Liangxiu Han, Daoqiang Zhang

Abstract: Hyperalignment has been widely employed in Multivariate Pattern (MVP) analysis to discover the cognitive states in the human brain based on multi-subject functional Magnetic Resonance Imaging (fMRI) datasets. Most of the existing HA methods utilized unsupervised approaches, where they only maximized the correlation between the voxels with the same position in the time series. However, these unsupervised solutions may not be optimum for handling the functional alignment in the supervised MVP problems. This paper proposes a Supervised Hyperalignment (SHA) method to ensure better functional alignment for MVP analysis, where the proposed method provides a supervised shared space that can maximize the correlation among the stimuli belonging to the same category and minimize the correlation between distinct categories of stimuli. Further, SHA employs a generalized optimization solution, which generates the shared space and calculates the mapped features in a single iteration, hence with optimum time and space complexities for large datasets. Experiments on multi-subject datasets demonstrate that the SHA method achieves up to 19% better performance for multi-class problems over the state-of-the-art HA algorithms.

Submitted 9 January, 2020; originally announced January 2020.

Comments: IEEE Transactions on Cognitive and Developmental Systems

arXiv:1910.14292 [pdf]

doi 10.1109/ner.2019.8717029

Preliminary Results on a New Algorithm for Blink Correction Adaptive to Inter- and Intra-Subject Variability

Authors: E. Guttmann-Flury, X. Sheng, D. Zhang, X. Zhu

Abstract: This paper presents a new preprocessing method to correct blinking artifacts in Electroencephalography (EEG) based Brain-Computer Interfaces (BCIs). This Algorithm for Blink Correction (ABC) directly corrects the signal in the time domain without the need for additional Electrooculogram (EOG) electrodes. The main idea is to automatically adapt to the blink's inter- and intra-subject variability by considering the blink's amplitude as a parameter. A simple Minimum Distance to Riemannian Mean (MDRM) is applied as the classification algorithm. Preliminary results on three subjects show a mean classification accuracy increase of 13.7% using ABC.

Submitted 31 October, 2019; originally announced October 2019.

arXiv:1809.04429 [pdf]

Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fMRI Data

Authors: Xiaoliang Sheng, Muhammad Yousefnezhad, Tonglin Xu, Ning Yuan, Daoqiang Zhang

Abstract: Representational Similarity Analysis (RSA) aims to explore similarities between neural activities of different stimuli. Classical RSA techniques employ the inverse of the covariance matrix to explore a linear model between the neural activities and task events. However, calculating the inverse of a large-scale covariance matrix is time-consuming and can reduce the stability and robustness of the final analysis. The problem becomes severe when the number of samples is too large. To address this shortcoming, this paper proposes a novel RSA method called gradient-based RSA (GRSA). Moreover, the proposed method is not restricted to a linear model. In fact, there is a growing interest in finding more effective ways of using multi-subject and whole-brain fMRI data. The Searchlight technique can extend RSA from localized brain regions to the whole brain with a smaller memory footprint in each process. Based on Searchlight, we propose a new method called Spatiotemporal Searchlight GRSA (SSL-GRSA) that generalizes our ROI-based GRSA algorithm to the whole-brain data. Further, our approach can handle some computational challenges while dealing with large-scale, multi-subject fMRI data. Experimental studies on multi-subject datasets confirm that both proposed approaches achieve superior performance to other state-of-the-art RSA algorithms.

Submitted 12 September, 2018; originally announced September 2018.

Comments: Conference: Chinese Conference on Pattern Recognition and Computer Vision 2018 (PRCV18), 23-26/Nov, Guangzhou, China

arXiv:1808.01642 [pdf, other]

Multi-Objective Cognitive Model: a supervised approach for multi-subject fMRI analysis

Authors: Muhammad Yousefnezhad, Daoqiang Zhang

Abstract: In order to decode the human brain, Multivariate Pattern (MVP) classification generates cognitive models by using functional Magnetic Resonance Imaging (fMRI) datasets. As a standard pipeline in the MVP analysis, brain patterns in multi-subject fMRI dataset must be mapped to a shared space and then a classification model is generated by employing the mapped patterns. However, the MVP models may not provide stable performance on a new fMRI dataset because the standard pipeline uses disjoint steps for generating these models. Indeed, each step in the pipeline includes an objective function with an independent optimization approach, where the best solution of each step may not be optimum for the next steps. To tackle this issue, this paper introduces the Multi-Objective Cognitive Model (MOCM) that utilizes an integrated objective function for MVP analysis rather than just using those disjoint steps. To solve the integrated problem, we propose a customized multi-objective optimization approach, where all possible solutions are first generated, and then our method ranks and selects the robust solutions as the final results. Empirical studies confirm that the proposed method can generate superior performance in comparison with other techniques.

Submitted 5 August, 2018; originally announced August 2018.

Comments: Neuroinformatics, Springer

arXiv:1807.02612 [pdf]

Gradient Hyperalignment for multi-subject fMRI data alignment

Authors: Tonglin Xu, Muhammad Yousefnezhad, Daoqiang Zhang

Abstract: Multi-subject fMRI data analysis is an interesting and challenging problem in human brain decoding studies. The inherent anatomical and functional variability across subjects makes it necessary to do both anatomical and functional alignment before classification analysis. Besides, when it comes to big data, time complexity becomes a problem that cannot be ignored. This paper proposes Gradient Hyperalignment (Gradient-HA) as a gradient-based functional alignment method that is suitable for multi-subject fMRI datasets with large amounts of samples and voxels. The advantage of Gradient-HA is that it can solve independence and high dimension problems by using Independent Component Analysis (ICA) and Stochastic Gradient Ascent (SGA). Validation using multi-classification tasks on big data demonstrates that the Gradient-HA method has lower time complexity and better or comparable performance compared with other state-of-the-art functional alignment methods.

Submitted 7 July, 2018; originally announced July 2018.

Comments: 15th Pacific Rim International Conference on Artificial Intelligence (PRICAI 2018), Nanjing, China, August 28-31, 2018

arXiv:1710.03923 [pdf, other]

Deep Hyperalignment

Authors: Muhammad Yousefnezhad, Daoqiang Zhang

Abstract: This paper proposes Deep Hyperalignment (DHA) as a regularized, deep extension, scalable Hyperalignment (HA) method, which is well-suited for applying functional alignment to fMRI datasets with nonlinearity, high-dimensionality (broad ROI), and a large number of subjects. Unlike previous methods, DHA is not limited by a restricted fixed kernel function. Further, it uses a parametric approach, rank-$m$ Singular Value Decomposition (SVD), and stochastic gradient descent for optimization. Therefore, DHA has a suitable time complexity for large datasets, and DHA does not require the training data when it computes the functional alignment for a new subject. Experimental studies on multi-subject fMRI analysis confirm that the DHA method achieves superior performance to other state-of-the-art HA algorithms.

Submitted 11 October, 2017; originally announced October 2017.

Comments: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

arXiv:1710.02113 [pdf, other]

Anatomical Pattern Analysis for decoding visual stimuli in human brains

Authors: Muhammad Yousefnezhad, Daoqiang Zhang

Abstract: Background: A universal unanswered question in neuroscience and machine learning is whether computers can decode the patterns of the human brain. Multi-Voxels Pattern Analysis (MVPA) is a critical tool for addressing this question. However, there are two challenges in the previous MVPA methods, which include decreasing sparsity and noise in the extracted features and increasing the performance of prediction. Methods: To overcome these challenges, this paper proposes Anatomical Pattern Analysis (APA) for decoding visual stimuli in the human brain. This framework develops a novel anatomical feature extraction method and a new imbalance AdaBoost algorithm for binary classification. Further, it utilizes an Error-Correcting Output Codes (ECOC) method for multiclass prediction. APA can automatically detect active regions for each category of the visual stimuli. Moreover, it enables us to combine homogeneous datasets for applying advanced classification. Results and Conclusions: Experimental studies on 4 visual categories (words, consonants, objects and scrambled photos) demonstrate that the proposed approach achieves superior performance to state-of-the-art methods.

Submitted 5 October, 2017; originally announced October 2017.

Comments: Published in Cognitive Computation

arXiv:1708.08991 [pdf]

Targeted and Imaging-guided In Vivo Photodynamic Therapy of Tumors Using Dual-functional, Aggregation-induced Emission Nanoparticles

Authors: Xianhe Sun, Abudureheman Zebibula, Xiaobiao Dong, Gonghui Li, Guanxin Zhang, Deqing Zhang, Jun Qian, Sailing He

Abstract: Dual-functional nanoparticles, with the property of aggregation-induced emission (AIE) and the capability of generating reactive oxygen species, were used to achieve passive/active targeting of tumors. Good contrast in in vivo imaging and obvious therapeutic efficiency were realized with a low dose of AIE nanoparticles as well as a low power density of light, resulting in negligible side effects.

Submitted 22 August, 2017; originally announced August 2017.

Comments: 30 pages, 7 figures

arXiv:1708.06578 [pdf, other]

Cascade and Parallel Convolutional Recurrent Neural Networks on EEG-based Intention Recognition for Brain Computer Interface

Authors: Dalin Zhang, Lina Yao, Xiang Zhang, Sen Wang, Weitong Chen, Robert Boots

Abstract: A Brain-Computer Interface (BCI) is a system that empowers humans to communicate with or control the outside world using brain intentions alone. Electroencephalography (EEG) based BCIs are promising solutions due to their convenient and portable instruments. Motor imagery EEG (MI-EEG) is one of the most widely studied kinds of EEG signal, and it reveals a subject's movement intentions without actual actions. Despite the extensive research on MI-EEG in recent years, it is still challenging to interpret EEG signals effectively due to the substantial noise in EEG signals (e.g., low signal-to-noise ratio and incomplete EEG signals), and difficulties in capturing the inconspicuous relationships between EEG signals and certain brain activities. Most existing works either consider EEG only as chain-like sequences, neglecting complex dependencies between adjacent signals, or perform simple temporal averaging over EEG sequences. In this paper, we introduce both cascade and parallel convolutional recurrent neural network models for precisely identifying human intended movements by effectively learning compositional spatio-temporal representations of raw EEG streams. The proposed models grasp the spatial correlations between physically neighboring EEG signals by converting the chain-like EEG sequences into a 2D mesh-like hierarchy. An LSTM-based recurrent network is able to extract the subtle temporal dependencies of EEG data streams. Extensive experiments on a large-scale MI-EEG dataset (108 subjects, 3,145,160 EEG records) have demonstrated that both models achieve high accuracy near 98.3% and outperform a set of baseline methods and most recent deep learning based EEG recognition models, yielding a significant accuracy increase of 18% in the cross-subject validation scenario.

Submitted 10 June, 2021; v1 submitted 22 August, 2017; originally announced August 2017.

Comments: 8 pages, 5 figures

Showing 1–50 of 54 results for author: Zhang, D