-
Maximal Speed of Glucose Change Significantly Distinguishes Prediabetes from Diabetes
Authors:
Dandan Wang,
Xiaoyan Chen,
Jingxiang Lin,
Teng Zhang,
Lianyi Huang,
Dongliang Leng,
Xiaohua Douglas Zhang,
Gang Li
Abstract:
Rapid changes in blood glucose levels can have severe and immediate health consequences, leading to the need to develop indices for assessing these rapid changes based on continuous glucose monitoring (CGM) data. We proposed a CGM index, maxSpeed, that represents the maximum of speed of glucose change (SGC) in a subject, respectively, and conducted a clinical study to investigate this index along…
▽ More
Rapid changes in blood glucose levels can have severe and immediate health consequences, leading to the need to develop indices for assessing these rapid changes based on continuous glucose monitoring (CGM) data. We proposed a CGM index, maxSpeed, that represents the maximum of speed of glucose change (SGC) in a subject, respectively, and conducted a clinical study to investigate this index along with SGC mean (meanSpeed) and SGC standard deviation (sdSpeed), coefficient of variation (CV), standard deviation (SD), glycemic variability percentage (GVP), mean amplitude of glycemic excursions (MAG), mean absolute glucose excursion (MAGE), mean of daily differences (MODD) and continuous overlapping net glycemic action (CONGA). Our study revealed that, there exist multiple patterns in distinguishing non-diabetes, prediabetes, type 1 diabetes (T1D) and type 2 diabetes (T2D). First, maxSpeed significantly distinguishes between either of non-diabetes and prediabetes and either of T1D and T2D. Second, meanSpeed, sdSpeed, GVP and MAG significantly distinguish between non-diabetes and either of T1D and T2D. Third, MODD and CONGA of 24 hours significantly distinguish between non-diabetes and either of T1D and T2D, between T1D and either of prediabetes and T2D. Fourth, SD, MAGE and CONGA of 12 hours significantly distinguish between non-diabetes and either of T1D and T2D, between T1D and pre-diabetes. Fifth, CV significantly distinguishes between T1D and either of Non-diabetes and T2D. maxSpeed assesses the rapid change of glucose in a short term, which is important both biologically and clinicially because our human body may not tolerate too rapid change in a short term.
△ Less
Submitted 14 June, 2025;
originally announced June 2025.
-
Network Pharmacology Reveals HSPA1A/BST2 as Potential Targets of Ci Bai Capsule's Active Compounds Intervening in Leukopenia
Authors:
Dingfan Zhang,
Congshu Huang,
Lei Zhou,
Boyang Wang,
Wei Zhou,
Tiantian Xia,
Pan Shen,
Shao Li,
Yue Gao
Abstract:
Background: Radiation-induced leukopenia caused by low-dose exposure is frequently associated with Traditional Chinese Medicine (TCM) syndromes like "blood deficiency" and "fatigue syndrome". Ci Bai Capsule (CB) has been reported to enhance white blood cell levels; however, its mechanisms and bioactive compounds remain unclear.Aim: This study aimed to identify the bioactive compounds group of CB a…
▽ More
Background: Radiation-induced leukopenia caused by low-dose exposure is frequently associated with Traditional Chinese Medicine (TCM) syndromes like "blood deficiency" and "fatigue syndrome". Ci Bai Capsule (CB) has been reported to enhance white blood cell levels; however, its mechanisms and bioactive compounds remain unclear.Aim: This study aimed to identify the bioactive compounds group of CB and elucidate its potential mechanisms in radiation-induced leukopenia.Methods: Syndrome-related data were gathered from SYMMAP and CTD database. CB's target profile is predicted by DrugCIPHER. Network pharmacology approaches were employed to identify active compounds and related pathways. Experimental validation was conducted through flow cytometry and RNA-sequencing in both ex vivo and in vivo models.Results: A total of 22 pathways related to cellular processes, immune responses, and signal transduction were identified. Five key bioactive compounds (kaempferol-3-glucorhamnoside, syringin, schisandrin, 3-hydroxytyrosol 3-O-glucoside and salidroside) were found to significantly modulate syndrome-related pathways. Optimal dosing of this compound combination enhanced leukocyte counts and splenic immune cell proliferation in irradiated mice. Transcriptomic analysis revealed that the compounds exert regulatory effects on PP1A, RB, CDK4/6, CDK2, and CDK1, thereby modulating downstream immune and hematopoietic markers such as MNDA, BST2, and HSPA1A.Conclusion: Our findings suggest that CB mitigates radiation-induced leukopenia by enhancing immune and hematopoietic recovery, offering a promising therapeutic approach for managing radiation-related hematological disorders.
△ Less
Submitted 13 June, 2025;
originally announced June 2025.
-
Elucidating the Design Space of Multimodal Protein Language Models
Authors:
Cheng-Yen Hsieh,
Xinyou Wang,
Daiheng Zhang,
Dongyu Xue,
Fei Ye,
Shujian Huang,
Zaixiang Zheng,
Quanquan Gu
Abstract:
Multimodal protein language models (PLMs) integrate sequence and token-based structural information, serving as a powerful foundation for protein modeling, generation, and design. However, the reliance on tokenizing 3D structures into discrete tokens causes substantial loss of fidelity about fine-grained structural details and correlations. In this paper, we systematically elucidate the design spa…
▽ More
Multimodal protein language models (PLMs) integrate sequence and token-based structural information, serving as a powerful foundation for protein modeling, generation, and design. However, the reliance on tokenizing 3D structures into discrete tokens causes substantial loss of fidelity about fine-grained structural details and correlations. In this paper, we systematically elucidate the design space of multimodal PLMs to overcome their limitations. We identify tokenization loss and inaccurate structure token predictions by the PLMs as major bottlenecks. To address these, our proposed design space covers improved generative modeling, structure-aware architectures and representation learning, and data exploration. Our advancements approach finer-grained supervision, demonstrating that token-based multimodal PLMs can achieve robust structural modeling. The effective design methods dramatically improve the structure generation diversity, and notably, folding abilities of our 650M model by reducing the RMSD from 5.52 to 2.36 on PDB testset, even outperforming 3B baselines and on par with the specialized folding models. Project page and code: https://bytedance.github.io/dplm/dplm-2.1/.
△ Less
Submitted 11 June, 2025; v1 submitted 15 April, 2025;
originally announced April 2025.
-
A Behaviour and Disease Model of Testing and Isolation
Authors:
Matthew Ryan,
Roslyn I. Hickson,
Edward M. Hill,
Thomas House,
Valerie Isham,
Dongni Zhang,
Mick G. Roberts
Abstract:
There has been interest in the interactions between infectious disease dynamics and behaviour for most of the history of mathematical epidemiology. This has included consideration of which mathematical models best capture each phenomenon, as well as their interaction, but typically in a manner that is agnostic to the exact behaviour in question. Here, we investigate interacting behaviour and disea…
▽ More
There has been interest in the interactions between infectious disease dynamics and behaviour for most of the history of mathematical epidemiology. This has included consideration of which mathematical models best capture each phenomenon, as well as their interaction, but typically in a manner that is agnostic to the exact behaviour in question. Here, we investigate interacting behaviour and disease dynamics specifically related to behaviours around testing and isolation. This epidemiological-behavioural interaction is of particular interest as, prospectively, it is well-placed to be informed by real-world data temporally monitoring test results and compliance with testing policy. To carry out our investigation we extend an existing "behaviour and disease" (BaD) model by incorporating the dynamics of symptomatic testing and isolation. We provide a dynamical systems analysis of the ordinary differential equations that define this model, providing theoretical results on its behaviour early in a new outbreak (particularly its basic reproduction number) and endemicity of the system (its steady states and associated stability criteria). We then supplement these findings with a numerical analysis to inform how temporal and cumulative outbreak metrics depend on the model parameter values for epidemic and endemic regimes. As the presented interdisciplinary modelling approach can accommodate further extensions (including, but not limited to, adding testing capacity, decay in behavioural effects and multiple pathogen variants), we hope that our work will encourage further modelling studies integrating specific measured behaviours and disease dynamics that may reduce the health and economic impacts of future epidemics.
△ Less
Submitted 3 April, 2025;
originally announced April 2025.
-
Deep Learning-Powered Electrical Brain Signals Analysis: Advancing Neurological Diagnostics
Authors:
Jiahe Li,
Xin Chen,
Fanqi Shen,
Junru Chen,
Yuxin Liu,
Daoze Zhang,
Zhizhang Yuan,
Fang Zhao,
Meng Li,
Yang Yang
Abstract:
Neurological disorders represent significant global health challenges, driving the advancement of brain signal analysis methods. Scalp electroencephalography (EEG) and intracranial electroencephalography (iEEG) are widely used to diagnose and monitor neurological conditions. However, dataset heterogeneity and task variations pose challenges in developing robust deep learning solutions. This review…
▽ More
Neurological disorders represent significant global health challenges, driving the advancement of brain signal analysis methods. Scalp electroencephalography (EEG) and intracranial electroencephalography (iEEG) are widely used to diagnose and monitor neurological conditions. However, dataset heterogeneity and task variations pose challenges in developing robust deep learning solutions. This review systematically examines recent advances in deep learning approaches for EEG/iEEG-based neurological diagnostics, focusing on applications across 7 neurological conditions using 46 datasets. We explore trends in data utilization, model design, and task-specific adaptations, highlighting the importance of pre-trained multi-task models for scalable, generalizable solutions. To advance research, we propose a standardized benchmark for evaluating models across diverse datasets to enhance reproducibility. This survey emphasizes how recent innovations can transform neurological diagnostics and enable the development of intelligent, adaptable healthcare solutions.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Multilineage-differentiating stress-enduring cells alleviate neuropathic pain in mice by secreting TGF-b and IL-10
Authors:
Yayu Zhao,
Ying Fei,
Yunyun Cai,
Zhongya Wei,
Ying Chen,
Yuhua Ji,
Xue Chen,
Dongmei Zhang,
Gang Chen
Abstract:
Neuropathic pain is a chronic condition characterized by damage to and dysfunction of the peripheral or central nervous system. There are currently no effective treatment options available for neuropathic pain, and existing drugs often provide only temporary relief with potential side effects. Multilineage-differentiating stress-enduring (Muse) cells are characterized by high expansion potential,…
▽ More
Neuropathic pain is a chronic condition characterized by damage to and dysfunction of the peripheral or central nervous system. There are currently no effective treatment options available for neuropathic pain, and existing drugs often provide only temporary relief with potential side effects. Multilineage-differentiating stress-enduring (Muse) cells are characterized by high expansion potential, a stable phenotype and strong immunosuppression. These properties make them attractive candidates for therapeutics for neuropathic pain management. In this study, we conducted a series of experiments to evaluate the effect of Muse cells on neuropathic pain. Muse cells from different species demonstrated analgesic potential by reversing CCI-induced neuropathic pain. Protein profiling revealed a high degree of similarity between Muse cells and BMSCs. The intrathecal injection of Muse cells effectively reduced neuropathic pain in various mouse models, resulting in better analgesic effects than the administration of equivalent low doses of BMSCs. Immunohistochemical analysis and qPCR revealed the ability of Muse cells to inhibit spinal cord neuroinflammation caused by SNI. In addition, Transwell and ELISA revealed that Muse cells migrated through the injured dorsal root ganglion (DRG) via the CCR7-CCL21 chemotactic axis. In addition, the secretion of TGF-b and IL-10 by Muse cells was identified as the mechanism underlying the analgesic effect of Muse cells. The capacity of Muse cells to mitigate neuroinflammation and produce analgesic effects via the modulation of TGF-b and IL-10 underscores their potential as promising therapeutic approaches for the treatment of neuropathic pain.
△ Less
Submitted 20 January, 2025;
originally announced January 2025.
-
Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models
Authors:
Haonan He,
Yuchen Ren,
Yining Tang,
Ziyang Xu,
Junxian Li,
Minghao Yang,
Di Zhang,
Dong Yuan,
Tao Chen,
Shufei Zhang,
Yuqiang Li,
Nanqing Dong,
Wanli Ouyang,
Dongzhan Zhou,
Peng Ye
Abstract:
Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-rela…
▽ More
Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-related instruction-tuning dataset including DNA, RNA, proteins, and multi-molecules, designed to bridge the gap between large language models (LLMs) and complex biological sequences-related tasks. This dataset can enhance the versatility of LLMs by integrating diverse biological sequenced-based prediction tasks with advanced reasoning capabilities, while maintaining conversational fluency. Additionally, we reveal significant performance limitations in even state-of-the-art LLMs on biological sequence-related multi-omics tasks without specialized pre-training and instruction-tuning. We further develop a strong baseline called ChatMultiOmics with a novel three-stage training pipeline, demonstrating the powerful ability to understand biology by using Biology-Instructions. Biology-Instructions and ChatMultiOmics are publicly available and crucial resources for enabling more effective integration of LLMs with multi-omics sequence analysis.
△ Less
Submitted 26 December, 2024;
originally announced December 2024.
-
MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts
Authors:
Jiatong Li,
Yunqing Liu,
Wei Liu,
Jingdi Le,
Di Zhang,
Wenqi Fan,
Dongzhan Zhou,
Yuqiang Li,
Qing Li
Abstract:
Molecule discovery is a pivotal research field, impacting everything from the medicines we take to the materials we use. Recently, Large Language Models (LLMs) have been widely adopted in molecule understanding and generation, yet the alignments between molecules and their corresponding captions remain a significant challenge. Previous endeavours often treat the molecule as a general SMILES string…
▽ More
Molecule discovery is a pivotal research field, impacting everything from the medicines we take to the materials we use. Recently, Large Language Models (LLMs) have been widely adopted in molecule understanding and generation, yet the alignments between molecules and their corresponding captions remain a significant challenge. Previous endeavours often treat the molecule as a general SMILES string or molecular graph, neglecting the fine-grained alignments between the molecular sub-structures and the descriptive textual phrases, which are crucial for accurate and explainable predictions. In this case, we introduce MolReFlect, a novel teacher-student framework designed to contextually perform the molecule-caption alignments in a fine-grained way. Our approach initially leverages a larger teacher LLM to label the detailed alignments by directly extracting critical phrases from molecule captions or SMILES strings and implying them to corresponding sub-structures or characteristics. To refine these alignments, we propose In-Context Selective Reflection, which retrieves previous extraction results as context examples for teacher LLM to reflect and lets a smaller student LLM select from in-context reflection and previous extraction results. Finally, we enhance the learning process of the student LLM through Chain-of-Thought In-Context Molecule Tuning, integrating the fine-grained alignments and the reasoning processes within the Chain-of-Thought format. Our experimental results demonstrate that MolReFlect enables LLMs like Mistral-7B to significantly outperform the previous baselines, achieving SOTA performance on the ChEBI-20 dataset. This advancement not only enhances the generative capabilities of LLMs in the molecule-caption translation task, but also contributes to a more explainable framework.
△ Less
Submitted 21 November, 2024;
originally announced November 2024.
-
Efficient and Robust Continual Graph Learning for Graph Classification in Biology
Authors:
Ding Zhang,
Jane Downer,
Can Chen,
Ren Wang
Abstract:
Graph classification is essential for understanding complex biological systems, where molecular structures and interactions are naturally represented as graphs. Traditional graph neural networks (GNNs) perform well on static tasks but struggle in dynamic settings due to catastrophic forgetting. We present Perturbed and Sparsified Continual Graph Learning (PSCGL), a robust and efficient continual g…
▽ More
Graph classification is essential for understanding complex biological systems, where molecular structures and interactions are naturally represented as graphs. Traditional graph neural networks (GNNs) perform well on static tasks but struggle in dynamic settings due to catastrophic forgetting. We present Perturbed and Sparsified Continual Graph Learning (PSCGL), a robust and efficient continual graph learning framework for graph data classification, specifically targeting biological datasets. We introduce a perturbed sampling strategy to identify critical data points that contribute to model learning and a motif-based graph sparsification technique to reduce storage needs while maintaining performance. Additionally, our PSCGL framework inherently defends against graph backdoor attacks, which is crucial for applications in sensitive biological contexts. Extensive experiments on biological datasets demonstrate that PSCGL not only retains knowledge across tasks but also enhances the efficiency and robustness of graph classification models in biology.
△ Less
Submitted 18 November, 2024;
originally announced November 2024.
-
Evaluating Molecule Synthesizability via Retrosynthetic Planning and Reaction Prediction
Authors:
Songtao Liu,
Dandan Zhang,
Zhengkai Tu,
Hanjun Dai,
Peng Liu
Abstract:
A significant challenge in wet lab experiments with current drug design generative models is the trade-off between pharmacological properties and synthesizability. Molecules predicted to have highly desirable properties are often difficult to synthesize, while those that are easily synthesizable tend to exhibit less favorable properties. As a result, evaluating the synthesizability of molecules in…
▽ More
A significant challenge in wet lab experiments with current drug design generative models is the trade-off between pharmacological properties and synthesizability. Molecules predicted to have highly desirable properties are often difficult to synthesize, while those that are easily synthesizable tend to exhibit less favorable properties. As a result, evaluating the synthesizability of molecules in general drug design scenarios remains a significant challenge in the field of drug discovery. The commonly used synthetic accessibility (SA) score aims to evaluate the ease of synthesizing generated molecules, but it falls short of guaranteeing that synthetic routes can actually be found. Inspired by recent advances in top-down synthetic route generation and forward reaction prediction, we propose a new, data-driven metric to evaluate molecule synthesizability. This novel metric leverages the synergistic duality between retrosynthetic planners and reaction predictors, both of which are trained on extensive reaction datasets. To demonstrate the efficacy of our metric, we conduct a comprehensive evaluation of round-trip scores across a range of representative molecule generative models.
△ Less
Submitted 3 April, 2025; v1 submitted 12 November, 2024;
originally announced November 2024.
-
SurfGNN: A robust surface-based prediction model with interpretability for coactivation maps of spatial and cortical features
Authors:
Zhuoshuo Li,
Jiong Zhang,
Youbing Zeng,
Jiaying Lin,
Dan Zhang,
Jianjia Zhang,
Duan Xu,
Hosung Kim,
Bingguang Liu,
Mengting Liu
Abstract:
Current brain surface-based prediction models often overlook the variability of regional attributes at the cortical feature level. While graph neural networks (GNNs) excel at capturing regional differences, they encounter challenges when dealing with complex, high-density graph structures. In this work, we consider the cortical surface mesh as a sparse graph and propose an interpretable prediction…
▽ More
Current brain surface-based prediction models often overlook the variability of regional attributes at the cortical feature level. While graph neural networks (GNNs) excel at capturing regional differences, they encounter challenges when dealing with complex, high-density graph structures. In this work, we consider the cortical surface mesh as a sparse graph and propose an interpretable prediction model-Surface Graph Neural Network (SurfGNN). SurfGNN employs topology-sampling learning (TSL) and region-specific learning (RSL) structures to manage individual cortical features at both lower and higher scales of the surface mesh, effectively tackling the challenges posed by the overly abundant mesh nodes and addressing the issue of heterogeneity in cortical regions. Building on this, a novel score-weighted fusion (SWF) method is implemented to merge nodal representations associated with each cortical feature for prediction. We apply our model to a neonatal brain age prediction task using a dataset of harmonized MR images from 481 subjects (503 scans). SurfGNN outperforms all existing state-of-the-art methods, demonstrating an improvement of at least 9.0% and achieving a mean absolute error (MAE) of 0.827+0.056 in postmenstrual weeks. Furthermore, it generates feature-level activation maps, indicating its capability to identify robust regional variations in different morphometric contributions for prediction.
△ Less
Submitted 5 November, 2024;
originally announced November 2024.
-
Dynamic-Attention-based EEG State Transition Modeling for Emotion Recognition
Authors:
Xinke Shen,
Runmin Gan,
Kaixuan Wang,
Shuyi Yang,
Qingzhu Zhang,
Quanying Liu,
Dan Zhang,
Sen Song
Abstract:
Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architectures have significantly improved the performance of EEG emotion decoding. However, existing methods still fall short of fully capturing the complex s…
▽ More
Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architectures have significantly improved the performance of EEG emotion decoding. However, existing methods still fall short of fully capturing the complex spatiotemporal dynamics of neural signals, which are crucial for representing emotion processing. This study proposes a Dynamic-Attention-based EEG State Transition (DAEST) modeling method to characterize EEG spatiotemporal dynamics. The model extracts spatiotemporal components of EEG that represent multiple parallel neural processes and estimates dynamic attention weights on these components to capture transitions in brain states. The model is optimized within a contrastive learning framework for cross-subject emotion recognition. The proposed method achieved state-of-the-art performance on three publicly available datasets: FACED, SEED, and SEED-V. It achieved 75.4% accuracy in the binary classification of positive and negative emotions and 59.3% in nine-class discrete emotion classification on the FACED dataset, 88.1% in the three-class classification of positive, negative, and neutral emotions on the SEED dataset, and 73.6% in five-class discrete emotion classification on the SEED-V dataset. The learned EEG spatiotemporal patterns and dynamic transition properties offer valuable insights into neural dynamics underlying emotion processing.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
CaLMFlow: Volterra Flow Matching using Causal Language Models
Authors:
Sizhuang He,
Daniel Levine,
Ivan Vrkic,
Marco Francesco Bressana,
David Zhang,
Syed Asad Rizvi,
Yangtian Zhang,
Emanuele Zappala,
David van Dijk
Abstract:
We introduce CaLMFlow (Causal Language Models for Flow Matching), a novel framework that casts flow matching as a Volterra integral equation (VIE), leveraging the power of large language models (LLMs) for continuous data generation. CaLMFlow enables the direct application of LLMs to learn complex flows by formulating flow matching as a sequence modeling task, bridging discrete language modeling an…
▽ More
We introduce CaLMFlow (Causal Language Models for Flow Matching), a novel framework that casts flow matching as a Volterra integral equation (VIE), leveraging the power of large language models (LLMs) for continuous data generation. CaLMFlow enables the direct application of LLMs to learn complex flows by formulating flow matching as a sequence modeling task, bridging discrete language modeling and continuous generative modeling. Our method implements tokenization across space and time, thereby solving a VIE over these domains. This approach enables efficient handling of high-dimensional data and outperforms ODE solver-dependent methods like conditional flow matching (CFM). We demonstrate CaLMFlow's effectiveness on synthetic and real-world data, including single-cell perturbation response prediction, showcasing its ability to incorporate textual context and generalize to unseen conditions. Our results highlight LLM-driven flow matching as a promising paradigm in generative modeling, offering improved scalability, flexibility, and context-awareness.
△ Less
Submitted 3 October, 2024;
originally announced October 2024.
-
EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics
Authors:
Chenqing Hua,
Yong Liu,
Dinghuai Zhang,
Odin Zhang,
Sitao Luan,
Kevin K. Yang,
Guy Wolf,
Doina Precup,
Shuangjia Zheng
Abstract:
Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology. Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions, particularly in catalytic processes. To address the challenges, we introduce EnzymeFlow, a generativ…
▽ More
Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology. Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions, particularly in catalytic processes. To address the challenges, we introduce EnzymeFlow, a generative model that employs flow matching with hierarchical pre-training and enzyme-reaction co-evolution to generate catalytic pockets for specific substrates and catalytic reactions. Additionally, we introduce a large-scale, curated, and validated dataset of enzyme-reaction pairs, specifically designed for the catalytic pocket generation task, comprising a total of $328,192$ pairs. By incorporating evolutionary dynamics and reaction-specific adaptations, EnzymeFlow becomes a powerful model for designing enzyme pockets, which is capable of catalyzing a wide range of biochemical reactions. Experiments on the new dataset demonstrate the model's effectiveness in designing high-quality, functional enzyme catalytic pockets, paving the way for advancements in enzyme engineering and synthetic biology. We provide EnzymeFlow code at https://github.com/WillHua127/EnzymeFlow with notebook demonstration at https://github.com/WillHua127/EnzymeFlow/blob/main/enzymeflow_demo.ipynb.
△ Less
Submitted 30 September, 2024;
originally announced October 2024.
-
Sideward contact tracing in an epidemic model with mixing groups
Authors:
Dongni Zhang,
Martina Favero
Abstract:
We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the…
▽ More
We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the early phase of the epidemic is approximated by a branching process with sibling dependencies. To address the challenges given by the dependencies, we consider sibling groups (individuals who become infected at the same event) as macro-individuals and define a macro-branching process. This allows us to derive an expression for the effective macro-reproduction number which corresponds to the effective individual reproduction number and represents a threshold for the behaviour of the epidemic. Through numerical examples, we show how the reproduction number varies with the distribution of the mixing event size, the mean size, the rate of diagnosis and the tracing probability.
△ Less
Submitted 26 March, 2025; v1 submitted 16 July, 2024;
originally announced July 2024.
-
Efficient and Precise Force Field Optimization for Biomolecules Using DPA-2
Authors:
Junhan Chang,
Duo Zhang,
Yuqing Deng,
Hongrui Lin,
Zhirong Liu,
Linfeng Zhang,
Hang Zheng,
Xinyan Wang
Abstract:
Molecular simulations are essential tools in computational chemistry, enabling the prediction and understanding of molecular interactions and thermodynamic properties of biomolecules. However, traditional force fields face significant challenges in accurately representing novel molecules and complex chemical environments due to the labor-intensive process of manually setting optimization parameter…
▽ More
Molecular simulations are essential tools in computational chemistry, enabling the prediction and understanding of molecular interactions and thermodynamic properties of biomolecules. However, traditional force fields face significant challenges in accurately representing novel molecules and complex chemical environments due to the labor-intensive process of manually setting optimization parameters and the high computational cost of quantum mechanical calculations. To overcome these difficulties, we fine-tuned a high-accuracy DPA-2 pre-trained model and applied it to optimize force field parameters on-the-fly, significantly reducing computational costs. Our method combines this fine-tuned DPA-2 model with a node-embedding-based similarity metric, allowing seamless augmentation to new chemical species without manual intervention. We applied this process to the TYK2 inhibitor and PTP1B systems and demonstrated its effectiveness through the improvement of free energy perturbation calculation results. This advancement contributes valuable insights and tools for the computational chemistry community.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
In silico bioactivity prediction of proteins interacting with graphene-based nanomaterials guides rational design of biosensor
Authors:
Jing Ye,
Minzhi Fan,
Xiaoyu Zhang,
Shasha Lu,
Mengyao Chai,
Yunshan Zhang,
Xiaoyu Zhao,
Shuang Li,
Diming Zhang
Abstract:
Graphene based nanomaterials have attracted significant attention for their potentials in biomedical and biotechnology applications in recent years, owing to the outstanding physical and chemical properties. However, the interaction mechanism and impact on biological activity of macro and micro biomolecules still require more concerns and further research in order to enhance their applicability in…
▽ More
Graphene based nanomaterials have attracted significant attention for their potentials in biomedical and biotechnology applications in recent years, owing to the outstanding physical and chemical properties. However, the interaction mechanism and impact on biological activity of macro and micro biomolecules still require more concerns and further research in order to enhance their applicability in biosensors, etc. Herein, an integrated method has been developed to predict the protein bioactivity performance when interacting with nanomaterials for protein based biosensor. Molecular dynamics simulation and molecular docking technique were consolidated to investigate several nanomaterials C60 fullerene, single walled carbon nanotube, pristine graphene and graphene oxide, and their effect when interacting with protein. The adsorption behavior, secondary structure changes and protein bioactivity changes were simulated, and the results of protein activity simulation were verified in combination with atomic force spectrum, circular dichroism spectrum fluorescence and electrochemical experiments. The best quantification alignment between bioactivity obtained by simulation and experiment measurements was further explored. The two proteins, RNase A and Exonuclease III, were regarded as analysis model for the proof of concept, and the prediction accuracy of protein bioactivty could reach up to 0.98.
△ Less
Submitted 8 April, 2024;
originally announced April 2024.
-
Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning
Authors:
Duzhen Zhang,
Qingyu Wang,
Tielin Zhang,
Bo Xu
Abstract:
The success of Deep Reinforcement Learning (DRL) is largely attributed to utilizing Artificial Neural Networks (ANNs) as function approximators. Recent advances in neuroscience have unveiled that the human brain achieves efficient reward-based learning, at least by integrating spiking neurons with spatial-temporal dynamics and network topologies with biologically-plausible connectivity patterns. T…
▽ More
The success of Deep Reinforcement Learning (DRL) is largely attributed to utilizing Artificial Neural Networks (ANNs) as function approximators. Recent advances in neuroscience have unveiled that the human brain achieves efficient reward-based learning, at least by integrating spiking neurons with spatial-temporal dynamics and network topologies with biologically-plausible connectivity patterns. This integration process allows spiking neurons to efficiently combine information across and within layers via nonlinear dendritic trees and lateral interactions. The fusion of these two topologies enhances the network's information-processing ability, crucial for grasping intricate perceptions and guiding decision-making procedures. However, ANNs and brain networks differ significantly. ANNs lack intricate dynamical neurons and only feature inter-layer connections, typically achieved by direct linear summation, without intra-layer connections. This limitation leads to constrained network expressivity. To address this, we propose a novel alternative for function approximator, the Biologically-Plausible Topology improved Spiking Actor Network (BPT-SAN), tailored for efficient decision-making in DRL. The BPT-SAN incorporates spiking neurons with intricate spatial-temporal dynamics and introduces intra-layer connections, enhancing spatial-temporal state representation and facilitating more precise biological simulations. Diverging from the conventional direct linear weighted sum, the BPT-SAN models the local nonlinearities of dendritic trees within the inter-layer connections. For the intra-layer connections, the BPT-SAN introduces lateral interactions between adjacent neurons, integrating them into the membrane potential formula to ensure accurate spike firing.
△ Less
Submitted 29 March, 2024;
originally announced March 2024.
-
Contrastive Learning of Shared Spatiotemporal EEG Representations Across Individuals for Naturalistic Neuroscience
Authors:
Xinke Shen,
Lingyi Tao,
Xuyang Chen,
Sen Song,
Quanying Liu,
Dan Zhang
Abstract:
Neural representations induced by naturalistic stimuli offer insights into how humans respond to stimuli in daily life. Understanding neural mechanisms underlying naturalistic stimuli processing hinges on the precise identification and extraction of the shared neural patterns that are consistently present across individuals. Targeting the Electroencephalogram (EEG) technique, known for its rich sp…
▽ More
Neural representations induced by naturalistic stimuli offer insights into how humans respond to stimuli in daily life. Understanding neural mechanisms underlying naturalistic stimuli processing hinges on the precise identification and extraction of the shared neural patterns that are consistently present across individuals. Targeting the Electroencephalogram (EEG) technique, known for its rich spatial and temporal information, this study presents a framework for Contrastive Learning of Shared SpatioTemporal EEG Representations across individuals (CL-SSTER). CL-SSTER utilizes contrastive learning to maximize the similarity of EEG representations across individuals for identical stimuli, contrasting with those for varied stimuli. The network employed spatial and temporal convolutions to simultaneously learn the spatial and temporal patterns inherent in EEG. The versatility of CL-SSTER was demonstrated on three EEG datasets, including a synthetic dataset, a natural speech comprehension EEG dataset, and an emotional video watching EEG dataset. CL-SSTER attained the highest inter-subject correlation (ISC) values compared to the state-of-the-art ISC methods. The latent representations generated by CL-SSTER exhibited reliable spatiotemporal EEG patterns, which can be explained by properties of the naturalistic stimuli. CL-SSTER serves as an interpretable and scalable framework for the identification of inter-subject shared neural representations in naturalistic neuroscience.
△ Less
Submitted 13 July, 2024; v1 submitted 21 February, 2024;
originally announced February 2024.
-
An SEIR network epidemic model with manual and digital contact tracing allowing delays
Authors:
Dongni Zhang,
Tom Britton
Abstract:
We consider an SEIR epidemic model on a network also allowing random contacts, where recovered individuals could either recover naturally or be diagnosed. Upon diagnosis, manual contact tracing is triggered such that each infected network contact is reported, tested and isolated with some probability and after a random delay. Additionally, digital tracing (based on a tracing app) is triggered if t…
▽ More
We consider an SEIR epidemic model on a network also allowing random contacts, where recovered individuals could either recover naturally or be diagnosed. Upon diagnosis, manual contact tracing is triggered such that each infected network contact is reported, tested and isolated with some probability and after a random delay. Additionally, digital tracing (based on a tracing app) is triggered if the diagnosed individual is an app-user, and then all of its app-using infectees are immediately notified and isolated. The early phase of the epidemic with manual and/or digital tracing is approximated by different multi-type branching processes, and three respective reproduction numbers are derived. The effectiveness of both contact tracing mechanisms is numerically quantified through the reduction of the reproduction number. This shows that app-using fraction plays an essential role in the overall effectiveness of contact tracing. The relative effectiveness of manual tracing compared to digital tracing increases if: more of the transmission occurs on the network, when the tracing delay is shortened, and when the network degree distribution is heavy-tailed. For realistic values, the combined tracing case can reduce $R_0$ by $20-30\%$, so other preventive measures are needed to reduce the reproduction number down to $1.2-1.4$ for contact tracing to make it successful in avoiding big outbreaks.
△ Less
Submitted 5 June, 2024; v1 submitted 20 February, 2024;
originally announced February 2024.
-
PhyloGFN: Phylogenetic inference with generative flow networks
Authors:
Mingyang Zhou,
Zichao Yan,
Elliot Layne,
Nikolay Malkin,
Dinghuai Zhang,
Moksh Jain,
Mathieu Blanchette,
Yoshua Bengio
Abstract:
Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt…
▽ More
Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt the framework of generative flow networks (GFlowNets) to tackle two core problems in phylogenetics: parsimony-based and Bayesian phylogenetic inference. Because GFlowNets are well-suited for sampling complex combinatorial structures, they are a natural choice for exploring and sampling from the multimodal posterior distribution over tree topologies and evolutionary distances. We demonstrate that our amortized posterior sampler, PhyloGFN, produces diverse and high-quality evolutionary hypotheses on real benchmark datasets. PhyloGFN is competitive with prior works in marginal likelihood estimation and achieves a closer fit to the target distribution than state-of-the-art variational inference methods. Our code is available at https://github.com/zmy1116/phylogfn.
△ Less
Submitted 24 March, 2024; v1 submitted 12 October, 2023;
originally announced October 2023.
-
DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks
Authors:
Daoan Zhang,
Weitong Zhang,
Yu Zhao,
Jianguo Zhang,
Bing He,
Chenchen Qin,
Jianhua Yao
Abstract:
Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge. To address this, we propose DNAGPT, a generalized DNA pre-training model trained on over 200 billion base pairs from all mammals. By enhancing the classic GPT model with a binary classification task (DNA sequence order), a…
▽ More
Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge. To address this, we propose DNAGPT, a generalized DNA pre-training model trained on over 200 billion base pairs from all mammals. By enhancing the classic GPT model with a binary classification task (DNA sequence order), a numerical regression task (guanine-cytosine content prediction), and a comprehensive token language, DNAGPT can handle versatile DNA analysis tasks while processing both sequence and numerical data. Our evaluation of genomic signal and region recognition, mRNA abundance regression, and artificial genomes generation tasks demonstrates DNAGPT's superior performance compared to existing models designed for specific downstream tasks, benefiting from pre-training using the newly designed model structure.
△ Less
Submitted 30 August, 2023; v1 submitted 11 July, 2023;
originally announced July 2023.
-
A data-driven analysis on the mediation effect of compartment models between control measures and COVID-19 epidemics
Authors:
Dongyan Zhang,
Wuyue Yang,
Wanqi Wen,
Liangrong Peng,
Changjingn Zhuge,
Liu Hong
Abstract:
We make a retrospective review on various control measures taken by 127 countries/territories during the first wave of COVID-19 pandemic until July 7, 2020, and evaluate their impacts on the epidemic dynamics quantitatively. The SEIR-QD model, as a representative for general compartment models, is used to fit the epidemic data, enabling the extraction of crucial model parameters and dynamical feat…
▽ More
We make a retrospective review on various control measures taken by 127 countries/territories during the first wave of COVID-19 pandemic until July 7, 2020, and evaluate their impacts on the epidemic dynamics quantitatively. The SEIR-QD model, as a representative for general compartment models, is used to fit the epidemic data, enabling the extraction of crucial model parameters and dynamical features. The mediation effect of the SEIR-QD model is revealed by using the mediation analysis with structure equation modeling for multiple mediators operating in parallel. The inherent impacts of these control policies on the transmission dynamics of COVID-19 epidemics are clarified, and compared with results derived from both multiple linear regression and neural-network-based nonlinear regression. Through this data-driven analysis, the mediation effect of compartment models is confirmed, which provides a better understanding on the intrinsic correlations among the strength of control measures and the dynamical features of COVID-19 epidemics.
△ Less
Submitted 22 September, 2023; v1 submitted 31 May, 2023;
originally announced May 2023.
-
Network pharmacology on the mechanism of Yi Qi Tong Qiao Pill inhibiting allergic rhinitis
Authors:
Boyang Wang,
DingFan Zhang,
Tingyu Zhang,
Chayanis Sutcharitchan,
Jianlin Hua,
Dongfang Hua,
Bo Zhang,
Shao Li
Abstract:
Objective: The purpose of this study is to reveal the mechanism of action of Yi Qi Tong Qiao Pill (YQTQP) in the treatment of allergic rhinitis (AR), as well as establish a paradigm for the researches on traditional Chinese medicine (TCM) from systematic perspective. Methods: Based on the data collected from TCM-related and disease-related databases, target profiles of compounds in YQTQP were calc…
▽ More
Objective: The purpose of this study is to reveal the mechanism of action of Yi Qi Tong Qiao Pill (YQTQP) in the treatment of allergic rhinitis (AR), as well as establish a paradigm for the researches on traditional Chinese medicine (TCM) from systematic perspective. Methods: Based on the data collected from TCM-related and disease-related databases, target profiles of compounds in YQTQP were calculated through network-based algorithms and holistic targets of TQTQP was constructed. Network target analysis was performed to explore the potential mechanisms of YQTQP in the treatment of AR and the mechanisms were classified into different modules according to their biological functions. Besides, animal and clinical experiments were conducted to validate our findings inferred from Network target analysis. Results: Network target analysis showed that YQTQP targeted 12 main pathways or biological processes related to AR, represented by those related to IL-4, IFN-γ, TNF-α and IL-13. These results could be classified into 3 biological modules, including regulation of immune and inflammation, epithelial barrier disorder and cell adhesion. Finally, a series of experiments composed of animal and clinical experiments, proved our findings and confirmed that YQTQP could improve related symptoms of AR, like permeability of nasal mucosa epithelium. Conclusion: A combination of Network target analysis and the experimental validation indicated that YQTQP was effective in the treatment of AR and might provide a new insight on revealing the mechanism of TCM against diseases.
△ Less
Submitted 21 May, 2023; v1 submitted 6 May, 2023;
originally announced May 2023.
-
The Automated Discovery of Kinetic Rate Models -- Methodological Frameworks
Authors:
Miguel Ángel de Carvalho Servia,
Ilya Orson Sandoval,
Klaus Hellgardt,
King Kuok,
Hii,
Dongda Zhang,
Ehecatl Antonio del Rio Chanona
Abstract:
The industrialization of catalytic processes requires reliable kinetic models for their design, optimization and control. Mechanistic models require significant domain knowledge, while data-driven and hybrid models lack interpretability. Automated knowledge discovery methods, such as ALAMO (Automated Learning of Algebraic Models for Optimization), SINDy (Sparse Identification of Nonlinear Dynamics…
▽ More
The industrialization of catalytic processes requires reliable kinetic models for their design, optimization and control. Mechanistic models require significant domain knowledge, while data-driven and hybrid models lack interpretability. Automated knowledge discovery methods, such as ALAMO (Automated Learning of Algebraic Models for Optimization), SINDy (Sparse Identification of Nonlinear Dynamics), and genetic programming, have gained popularity but suffer from limitations such as needing model structure assumptions, exhibiting poor scalability, and displaying sensitivity to noise. To overcome these challenges, we propose two methodological frameworks, ADoK-S and ADoK-W (Automated Discovery of Kinetic rate models using a Strong/Weak formulation of symbolic regression), for the automated generation of catalytic kinetic models using a robust criterion for model selection. We leverage genetic programming for model generation and a sequential optimization routine for model refinement. The frameworks are tested against three case studies of increasing complexity, demonstrating their ability to retrieve the underlying kinetic rate model with limited noisy data from the catalytic systems, showcasing their potential for chemical reaction engineering applications.
△ Less
Submitted 2 November, 2023; v1 submitted 26 January, 2023;
originally announced January 2023.
-
Epidemic models with digital and manual contact tracing
Authors:
Tom Britton,
Dongni Zhang
Abstract:
We analyze a Markovian SIR epidemic model where individuals either recover naturally or are diagnosed, leading to isolation and potential contact tracing. Our focus is on digital contact tracing via a tracing app, considering both its standalone use and combination with manual tracing. We prove that as the population size $n$ grows large, the epidemic process converges to a limiting process, which…
▽ More
We analyze a Markovian SIR epidemic model where individuals either recover naturally or are diagnosed, leading to isolation and potential contact tracing. Our focus is on digital contact tracing via a tracing app, considering both its standalone use and combination with manual tracing. We prove that as the population size $n$ grows large, the epidemic process converges to a limiting process, which, unlike typical epidemic models, is not a branching process due to dependencies created by contact tracing. However, by grouping to-be-traced individuals into macro-individuals, we derive a multi-type branching process interpretation, allowing computation of the reproduction number $R$. This is then converted to an individual reproduction number $R^{(ind)}$, which, contrary to $R$, decays monotonically with the fraction of app-users while both share the same threshold at 1. Finally, we compare digital (only) contact tracing and manual (only) contact tracing, proving that the critical fraction app-users $π_c$ required for $R=1$ is higher than the critical fraction manually contact traced $p_c$ for manual tracing.
△ Less
Submitted 27 March, 2025; v1 submitted 23 November, 2022;
originally announced November 2022.
-
Causal inference in drug discovery and development
Authors:
Tom Michoel,
Jitao David Zhang
Abstract:
To discover new drugs is to seek and to prove causality. As an emerging approach leveraging human knowledge and creativity, data, and machine intelligence, causal inference holds the promise of reducing cognitive bias and improving decision making in drug discovery. While it has been applied across the value chain, the concepts and practice of causal inference remain obscure to many practitioners.…
▽ More
To discover new drugs is to seek and to prove causality. As an emerging approach leveraging human knowledge and creativity, data, and machine intelligence, causal inference holds the promise of reducing cognitive bias and improving decision making in drug discovery. While it has been applied across the value chain, the concepts and practice of causal inference remain obscure to many practitioners. This article offers a non-technical introduction to causal inference, reviews its recent applications, and discusses opportunities and challenges of adopting the causal language in drug discovery and development.
△ Less
Submitted 29 September, 2022;
originally announced September 2022.
-
Further analysis of metagenomic datasets containing GD and GX pangolin CoVs indicates widespread contamination, undermining pangolin host attribution
Authors:
Adrian Jones,
Steven E. Massey,
Daoyu Zhang,
Yuri Deigin,
Steven C. Quay
Abstract:
The only animals other than bats reported to have been infected with SARS-CoV-2-related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic are pangolins. In early 2020 multiple papers reported the identification of two clades of SARS2r-CoVs, GD and GX, infecting pangolins. However the RNA-Seq datasets supporting pangolin genome assembly were widely contaminated, contained synthetic vectors…
▽ More
The only animals other than bats reported to have been infected with SARS-CoV-2-related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic are pangolins. In early 2020 multiple papers reported the identification of two clades of SARS2r-CoVs, GD and GX, infecting pangolins. However the RNA-Seq datasets supporting pangolin genome assembly were widely contaminated, contained synthetic vectors or were heavily enriched or filtered with little but coronavirus sequences left in the datasets. Here we investigate two pangolin fecal samples sequenced by Li et al. (2021) provided in support of GD PCoV infection of pangolins in Guangdong and find the read distribution consistent with PCR amplicon contamination and SARS-CoV-2 contamination, and further identify the presence of synthetic plasmid sequences. We also build upon our previous work to further analyze the dataset GX/P3B by Lam et al. (2020), which is the only non enriched/heavily filtered pangolin tissue dataset sequenced by Lam et al. (2020). We identify synthetic vectors and confirm human genomic origin samples in the dataset. Finally, we find human mitochondrial sequences in all pangolin organ datasets and mouse and tiger mitochondrial sequences in selected pangolin organ datasets sequenced by Liu et al. (2019). We infer that human and mouse genomic origin sequences were probably sourced from contamination prior to sequencing, while tiger origin sequence contamination may have occurred due to index hopping during sequencing. These observations are problematic for attributing pangolins as SARS2r-CoV hosts in the datasets examined. The forensic methods developed and used here can be applied to examine any third party SRA data sets.
△ Less
Submitted 11 July, 2022; v1 submitted 7 July, 2022;
originally announced July 2022.
-
Deep radiomic signature with immune cell markers predicts the survival of glioma patients
Authors:
Ahmad Chaddad,
Paul Daniel Mingli Zhang,
Saima Rathore,
Paul Sargos,
Christian Desrosiers,
Tamim Niazi
Abstract:
Imaging biomarkers offer a non-invasive way to predict the response of immunotherapy prior to treatment. In this work, we propose a novel type of deep radiomic features (DRFs) computed from a convolutional neural network (CNN), which capture tumor characteristics related to immune cell markers and overall survival. Our study uses four MRI sequences (T1-weighted, T1-weighted post-contrast, T2-weigh…
▽ More
Imaging biomarkers offer a non-invasive way to predict the response of immunotherapy prior to treatment. In this work, we propose a novel type of deep radiomic features (DRFs) computed from a convolutional neural network (CNN), which capture tumor characteristics related to immune cell markers and overall survival. Our study uses four MRI sequences (T1-weighted, T1-weighted post-contrast, T2-weighted and FLAIR) with corresponding immune cell markers of 151 patients with brain tumor. The proposed method extracts a total of 180 DRFs by aggregating the activation maps of a pre-trained 3D-CNN within labeled tumor regions of MRI scans. These features offer a compact, yet powerful representation of regional texture encoding tissue heterogeneity. A comprehensive set of experiments is performed to assess the relationship between the proposed DRFs and immune cell markers, and measure their association with overall survival. Results show a high correlation between DRFs and various markers, as well as significant differences between patients grouped based on these markers. Moreover, combining DRFs, clinical features and immune cell markers as input to a random forest classifier helps discriminate between short and long survival outcomes, with AUC of 72\% and p=2.36$\times$10$^{-5}$. These results demonstrate the usefulness of proposed DRFs as non-invasive biomarker for predicting treatment response in patients with brain tumors.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
Biological Sequence Design with GFlowNets
Authors:
Moksh Jain,
Emmanuel Bengio,
Alex-Hernandez Garcia,
Jarrid Rector-Brooks,
Bonaventure F. P. Dossou,
Chanakya Ekbote,
Jie Fu,
Tianyu Zhang,
Micheal Kilgour,
Dinghuai Zhang,
Lena Simine,
Payel Das,
Yoshua Bengio
Abstract:
Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key con…
▽ More
Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key consideration in the ideation phase. In this work, we propose an active learning algorithm leveraging epistemic uncertainty estimation and the recently proposed GFlowNets as a generator of diverse candidate solutions, with the objective to obtain a diverse batch of useful (as defined by some utility function, for example, the predicted anti-microbial activity of a peptide) and informative candidates after each round. We also propose a scheme to incorporate existing labeled datasets of candidates, in addition to a reward function, to speed up learning in GFlowNets. We present empirical results on several biological sequence design tasks, and we find that our method generates more diverse and novel batches with high scoring candidates compared to existing approaches.
△ Less
Submitted 24 May, 2023; v1 submitted 2 March, 2022;
originally announced March 2022.
-
Analysing the Effect of Test-and-Trace Strategy in an SIR Epidemic Model
Authors:
Dongni Zhang,
Tom Britton
Abstract:
Consider a Markovian SIR epidemic model in a homogeneous community. To this model we add a rate at which individuals are tested, and once an infectious individual tests positive it is isolated and each of their contacts are traced and tested independently with some fixed probability. If such a traced individual tests positive it is isolated, and the contact tracing is iterated. This model is analy…
▽ More
Consider a Markovian SIR epidemic model in a homogeneous community. To this model we add a rate at which individuals are tested, and once an infectious individual tests positive it is isolated and each of their contacts are traced and tested independently with some fixed probability. If such a traced individual tests positive it is isolated, and the contact tracing is iterated. This model is analysed using large population approximations, both for the early stage of the epidemic when the "to-be-traced components" of the epidemic behaves like a branching process, and for the main stage of the epidemic where the process of to-be-traced components converges to a deterministic process defined by a system of differential equations. These approximations are used to quantify the effect of testing and of contact tracing on the effective reproduction numbers (for the components as well as for the individuals), the probability of a major outbreak, and the final fraction getting infected. Using numerical illustrations when rates of infection and natural recovery are fixed, it is shown that Test-and-Trace strategy is effective in reducing the reproduction number. Surprisingly, the reproduction number for the branching process of components is not monotonically decreasing in the tracing probability, but the individual reproduction number is conjectured to be monotonic as expected. Further, in the situation where individuals also self-report for testing, the tracing probability is more influential than the screening rate (measured by the fraction infected being screened).
△ Less
Submitted 5 August, 2022; v1 submitted 14 October, 2021;
originally announced October 2021.
-
Unifying Likelihood-free Inference with Black-box Optimization and Beyond
Authors:
Dinghuai Zhang,
Jie Fu,
Yoshua Bengio,
Aaron Courville
Abstract:
Black-box optimization formulations for biological sequence design have drawn recent attention due to their promising potential impact on the pharmaceutical industry. In this work, we propose to unify two seemingly distinct worlds: likelihood-free inference and black-box optimization, under one probabilistic framework. In tandem, we provide a recipe for constructing various sequence design methods…
▽ More
Black-box optimization formulations for biological sequence design have drawn recent attention due to their promising potential impact on the pharmaceutical industry. In this work, we propose to unify two seemingly distinct worlds: likelihood-free inference and black-box optimization, under one probabilistic framework. In tandem, we provide a recipe for constructing various sequence design methods based on this framework. We show how previous optimization approaches can be "reinvented" in our framework, and further propose new probabilistic black-box optimization algorithms. Extensive experiments on sequence design application illustrate the benefits of the proposed methodology.
△ Less
Submitted 8 February, 2022; v1 submitted 5 October, 2021;
originally announced October 2021.
-
Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition
Authors:
Xinke Shen,
Xianggen Liu,
Xin Hu,
Dan Zhang,
Sen Song
Abstract:
EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Al…
▽ More
EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Alignment (CLISA) to tackle the cross-subject emotion recognition problem. Contrastive learning was employed to minimize the inter-subject differences by maximizing the similarity in EEG signal representations across subjects when they received the same emotional stimuli in contrast to different ones. Specifically, a convolutional neural network was applied to learn inter-subject aligned spatiotemporal representations from EEG time series in contrastive learning. The aligned representations were subsequently used to extract differential entropy features for emotion classification. CLISA achieved state-of-the-art cross-subject emotion recognition performance on our THU-EP dataset with 80 subjects and the publicly available SEED dataset with 15 subjects. It could generalize to unseen subjects or unseen emotional stimuli in testing. Furthermore, the spatiotemporal representations learned by CLISA could provide insights into the neural mechanisms of human emotion processing.
△ Less
Submitted 5 April, 2022; v1 submitted 20 September, 2021;
originally announced September 2021.
-
Nipah virus vector sequences in COVID-19 patient samples sequenced by the Wuhan Institute of Virology
Authors:
Steven C. Quay,
Daoyu Zhang,
Adrian Jones,
Yuri Deigin
Abstract:
We report the detection of Nipah virus in an infectious clone format, a BSL4-level pathogen and CDC-designated Bioterrorism Agent, in raw RNA-Seq sequencing reads deposited by the Wuhan Institute of Virology (WIV) produced from five December 2019 patients infected with SARS-CoV-2. Research involving Nipah infectious clones has never been reported to have occured at the WIV. These patient samples h…
▽ More
We report the detection of Nipah virus in an infectious clone format, a BSL4-level pathogen and CDC-designated Bioterrorism Agent, in raw RNA-Seq sequencing reads deposited by the Wuhan Institute of Virology (WIV) produced from five December 2019 patients infected with SARS-CoV-2. Research involving Nipah infectious clones has never been reported to have occured at the WIV. These patient samples have been previously reported to contain reads from several other viruses: Influenza A, Spodoptera frugiperda rhabdovirus and Nipah. Previous authors have interpreted the presence of these virus sequences as indicative of co-infections of the patients in question by these pathogens or laboratory contamination. However, our analysis shows that NiV genes are encapsulated in synthetic vectors, which we infer was for assembly of a NiV infectious clone. In particular, we document the finding of internal N, P-V-W-C and L protein coding sequences as well as coverage of the G and F genes. Furthermore, the format of Hepatitis D virus ribozyme and T7 terminator downstream of the 5-prime end of the NiV sequence is consistent with truncation required at the end of the genome for a full length infectious clone. This indicates that research at WIV was being conducted on an assembled NiV infectious clone. Contamination of patient sequencing reads by an infectious NiV clone of the highly pathogenic Bangladesh strain could indicate a significant breach of BSL-4 protocols. We call on WIV to explain the purpose of this research on infectious clones of Nipah Virus, the full chronology of this work, and to explain how and at what stage of sample preparation this contamination occurred.
△ Less
Submitted 19 September, 2021;
originally announced September 2021.
-
Analysis of pangolin metagenomic datasets reveals significant contamination, raising concerns for pangolin CoV host attribution
Authors:
Adrian Jones,
Daoyu Zhang,
Yuri Deigin,
Steven C. Quay
Abstract:
Metagenomic datasets from pangolin tissue specimens have previously yielded SARS-related coronaviruses which show high homology in their receptor binding domain to SARS-CoV-2, suggesting a potential zoonotic source for this feature of the human virus, possibly via recombination (Liu et al. 2019, Lam et al. 2020, Xiao et al. 2020, Liu et al. 2020). Here we re-examine these published datasets. We re…
▽ More
Metagenomic datasets from pangolin tissue specimens have previously yielded SARS-related coronaviruses which show high homology in their receptor binding domain to SARS-CoV-2, suggesting a potential zoonotic source for this feature of the human virus, possibly via recombination (Liu et al. 2019, Lam et al. 2020, Xiao et al. 2020, Liu et al. 2020). Here we re-examine these published datasets. We report that only a few pangolin samples were found to contain coronavirus reads, and even then in low abundance, while other non-pangolin hosted viruses were present in higher abundance. We also discovered extensive contamination with human, rodent, and other mammalian gene sequences, which was a surprising finding. Furthermore, we uncovered a number of pangolin CoV sequences embedded in standard laboratory cloning vectors, which suggests the pangolin specimens could have been contaminated with sequences derived from synthetic biology experiments. Finally, we discover a third pangolin dataset (He et al. 2022) with low levels of SARSr-CoV sequences and unambiguous extensive contamination of several pangolin samples. For these reasons, we find it unlikely that the pangolins in question had a coronavirus infection while alive, and all current versions of the cited papers claiming a zoonotic infection of pangolins with a SARS-r CoV require substantial corrections and should be retracted until such corrections are made.
△ Less
Submitted 1 March, 2022; v1 submitted 18 August, 2021;
originally announced August 2021.
-
Unexpected novel Merbecovirus discoveries in agricultural sequencing datasets from Wuhan, China
Authors:
Daoyu Zhang,
Adrian Jones,
Yuri Deigin,
Karl Sirotkin,
Alejandro Sousa
Abstract:
In this study we document the unexpected discovery of multiple coronaviruses and a BSL-3 pathogen in agricultural cotton and rice sequencing datasets. In particular, we have identified a novel HKU5-related Merbecovirus in a cotton dataset sequenced by the Huazhong Agricultural University in 2017. We have also found an infectious clone sequence containing a novel HKU4-related Merbecovirus related t…
▽ More
In this study we document the unexpected discovery of multiple coronaviruses and a BSL-3 pathogen in agricultural cotton and rice sequencing datasets. In particular, we have identified a novel HKU5-related Merbecovirus in a cotton dataset sequenced by the Huazhong Agricultural University in 2017. We have also found an infectious clone sequence containing a novel HKU4-related Merbecovirus related to MERS coronavirus in a rice dataset sequenced by the Huazhong Agricultural University in early 2020. Another HKU5-related Merbecovirus, as well as Japanese encephalitis virus, were identified in a cotton dataset sequenced by the Huazhong Agricultural University in 2018. An HKU3-related Betacoronavirus was found in a Mus musculus sequencing dataset from the Wuhan Institute of Virology in 2017. Finally, a SARS-WIV1-like Betacoronavirus was found in a rice dataset sequenced by the Fujian Agriculture and Forestry University in 2017. Using the contaminating reads we have extracted from the above datasets, we were able to assemble complete genomes of two novel coronaviruses which we disclose herein. In light of our findings, we raise concerns about biosafety protocol breaches, as indicated by our discovery of multiple dangerous human pathogens in agricultural sequencing laboratories in Wuhan and Fouzou City, China.
△ Less
Submitted 6 June, 2021; v1 submitted 3 April, 2021;
originally announced April 2021.
-
An open debate on SARS-CoV-2's proximal origin is long overdue
Authors:
Rossana Segreto,
Yuri Deigin,
Kevin McCairn,
Alejandro Sousa,
Dan Sirotkin,
Karl Sirotkin,
Jonathan J. Couey,
Adrian Jones,
Daoyu Zhang
Abstract:
There is a near consensus view that SARS-CoV-2 has a natural zoonotic origin; however, several characteristics of SARS-CoV-2 taken together are not easily explained by a natural zoonotic origin hypothesis. These include: a low rate of evolution in the early phase of transmission; the lack of evidence of recombination events; a high pre-existing binding to human ACE2; a novel furin cleavage site in…
▽ More
There is a near consensus view that SARS-CoV-2 has a natural zoonotic origin; however, several characteristics of SARS-CoV-2 taken together are not easily explained by a natural zoonotic origin hypothesis. These include: a low rate of evolution in the early phase of transmission; the lack of evidence of recombination events; a high pre-existing binding to human ACE2; a novel furin cleavage site insert; a flat glycan binding domain of the spike protein which conflicts with host evasion survival patterns exhibited by other coronaviruses, and high human and mouse peptide mimicry. Initial assumptions against a laboratory origin, by contrast, have remained unsubstantiated. Furthermore, over a year after the initial outbreak in Wuhan, there is still no clear evidence of zoonotic transfer from a bat or intermediate species. Given the immense social and economic impact of this pandemic, identifying the true origin of SARS-CoV-2 is fundamental to preventing future outbreaks. The search for SARS-CoV-2's origin should include an open and unbiased inquiry into a possible laboratory origin.
△ Less
Submitted 9 February, 2021; v1 submitted 7 February, 2021;
originally announced February 2021.
-
Shared Space Transfer Learning for analyzing multi-site fMRI data
Authors:
Muhammad Yousefnezhad,
Alessandro Selvitella,
Daoqiang Zhang,
Andrew J. Greenshaw,
Russell Greiner
Abstract:
Multi-voxel pattern analysis (MVPA) learns predictive models from task-based functional magnetic resonance imaging (fMRI) data, for distinguishing when subjects are performing different cognitive tasks -- e.g., watching movies or making decisions. MVPA works best with a well-designed feature set and an adequate sample size. However, most fMRI datasets are noisy, high-dimensional, expensive to coll…
▽ More
Multi-voxel pattern analysis (MVPA) learns predictive models from task-based functional magnetic resonance imaging (fMRI) data, for distinguishing when subjects are performing different cognitive tasks -- e.g., watching movies or making decisions. MVPA works best with a well-designed feature set and an adequate sample size. However, most fMRI datasets are noisy, high-dimensional, expensive to collect, and with small sample sizes. Further, training a robust, generalized predictive model that can analyze homogeneous cognitive tasks provided by multi-site fMRI datasets has additional challenges. This paper proposes the Shared Space Transfer Learning (SSTL) as a novel transfer learning (TL) approach that can functionally align homogeneous multi-site fMRI datasets, and so improve the prediction performance in every site. SSTL first extracts a set of common features for all subjects in each site. It then uses TL to map these site-specific features to a site-independent shared space in order to improve the performance of the MVPA. SSTL uses a scalable optimization procedure that works effectively for high-dimensional fMRI datasets. The optimization procedure extracts the common features for each site by using a single-iteration algorithm and maps these site-specific common features to the site-independent shared space. We evaluate the effectiveness of the proposed method for transferring between various cognitive tasks. Our comprehensive experiments validate that SSTL achieves superior performance to other state-of-the-art analysis techniques.
△ Less
Submitted 24 October, 2020;
originally announced October 2020.
-
Deep Representational Similarity Learning for analyzing neural signatures in task-based fMRI dataset
Authors:
Muhammad Yousefnezhad,
Jeffrey Sawalha,
Alessandro Selvitella,
Daoqiang Zhang
Abstract:
Similarity analysis is one of the crucial steps in most fMRI studies. Representational Similarity Analysis (RSA) can measure similarities of neural signatures generated by different cognitive states. This paper develops Deep Representational Similarity Learning (DRSL), a deep extension of RSA that is appropriate for analyzing similarities between various cognitive tasks in fMRI datasets with a lar…
▽ More
Similarity analysis is one of the crucial steps in most fMRI studies. Representational Similarity Analysis (RSA) can measure similarities of neural signatures generated by different cognitive states. This paper develops Deep Representational Similarity Learning (DRSL), a deep extension of RSA that is appropriate for analyzing similarities between various cognitive tasks in fMRI datasets with a large number of subjects, and high-dimensionality -- such as whole-brain images. Unlike the previous methods, DRSL is not limited by a linear transformation or a restricted fixed nonlinear kernel function -- such as Gaussian kernel. DRSL utilizes a multi-layer neural network for mapping neural responses to linear space, where this network can implement a customized nonlinear transformation for each subject separately. Furthermore, utilizing a gradient-based optimization in DRSL can significantly reduce runtime of analysis on large datasets because it uses a batch of samples in each iteration rather than all neural responses to find an optimal solution. Empirical studies on multi-subject fMRI datasets with various tasks -- including visual stimuli, decision making, flavor, and working memory -- confirm that the proposed method achieves superior performance to other state-of-the-art RSA algorithms.
△ Less
Submitted 28 September, 2020;
originally announced October 2020.
-
Rational evaluation of various epidemic models based on the COVID-19 data of China
Authors:
Wuyue Yang,
Dongyan Zhang,
Liangrong Peng,
Changjing Zhuge,
Liu Hong
Abstract:
In this paper, based on the Akaike information criterion, root mean square error and robustness coefficient, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out. With respect to the outbreak data of COVID-19 epidemics in China, we find that before t…
▽ More
In this paper, based on the Akaike information criterion, root mean square error and robustness coefficient, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out. With respect to the outbreak data of COVID-19 epidemics in China, we find that before the inflection point, all models fail to make a reliable prediction. The Logistic function consistently underestimates the final epidemic size, while the Gompertz's function makes an overestimation in all cases. Towards statistical inference methods, the methods of sequential Bayesian and time-dependent reproduction number are more accurate at the late stage of an epidemic. And the transition-like behavior of exponential growth method from underestimation to overestimation with respect to the inflection point might be useful for constructing a more reliable forecast. Compared to ODE-based SIR, SEIR and SEIR-AHQ models, the SEIR-QD and SEIR-PO models generally show a better performance on studying the COVID-19 epidemics, whose success we believe could be attributed to a proper trade-off between model complexity and fitting accuracy. Our findings not only are crucial for the forecast of COVID-19 epidemics, but also may apply to other infectious diseases.
△ Less
Submitted 14 September, 2021; v1 submitted 12 March, 2020;
originally announced March 2020.
-
Epidemic analysis of COVID-19 in China by dynamical modeling
Authors:
Liangrong Peng,
Wuyue Yang,
Dongyan Zhang,
Changjing Zhuge,
Liu Hong
Abstract:
The outbreak of novel coronavirus-caused pneumonia (COVID-19) in Wuhan has attracted worldwide attention. Here, we propose a generalized SEIR model to analyze this epidemic. Based on the public data of National Health Commission of China from Jan. 20th to Feb. 9th, 2020, we reliably estimate key epidemic parameters and make predictions on the inflection point and possible ending time for 5 differe…
▽ More
The outbreak of novel coronavirus-caused pneumonia (COVID-19) in Wuhan has attracted worldwide attention. Here, we propose a generalized SEIR model to analyze this epidemic. Based on the public data of National Health Commission of China from Jan. 20th to Feb. 9th, 2020, we reliably estimate key epidemic parameters and make predictions on the inflection point and possible ending time for 5 different regions. According to optimistic estimation, the epidemics in Beijing and Shanghai will end soon within two weeks, while for most part of China, including the majority of cities in Hubei province, the success of anti-epidemic will be no later than the middle of March. The situation in Wuhan is still very severe, at least based on public data until Feb. 15th. We expect it will end up at the beginning of April. Moreover, by inverse inference, we find the outbreak of COVID-19 in Mainland, Hubei province and Wuhan all can be dated back to the end of December 2019, and the doubling time is around two days at the early stage.
△ Less
Submitted 25 June, 2020; v1 submitted 16 February, 2020;
originally announced February 2020.
-
Supervised Hyperalignment for multi-subject fMRI data alignment
Authors:
Muhammad Yousefnezhad,
Alessandro Selvitella,
Liangxiu Han,
Daoqiang Zhang
Abstract:
Hyperalignment has been widely employed in Multivariate Pattern (MVP) analysis to discover the cognitive states in the human brains based on multi-subject functional Magnetic Resonance Imaging (fMRI) datasets. Most of the existing HA methods utilized unsupervised approaches, where they only maximized the correlation between the voxels with the same position in the time series. However, these unsup…
▽ More
Hyperalignment has been widely employed in Multivariate Pattern (MVP) analysis to discover the cognitive states in the human brains based on multi-subject functional Magnetic Resonance Imaging (fMRI) datasets. Most of the existing HA methods utilized unsupervised approaches, where they only maximized the correlation between the voxels with the same position in the time series. However, these unsupervised solutions may not be optimum for handling the functional alignment in the supervised MVP problems. This paper proposes a Supervised Hyperalignment (SHA) method to ensure better functional alignment for MVP analysis, where the proposed method provides a supervised shared space that can maximize the correlation among the stimuli belonging to the same category and minimize the correlation between distinct categories of stimuli. Further, SHA employs a generalized optimization solution, which generates the shared space and calculates the mapped features in a single iteration, hence with optimum time and space complexities for large datasets. Experiments on multi-subject datasets demonstrate that SHA method achieves up to 19% better performance for multi-class problems over the state-of-the-art HA algorithms.
△ Less
Submitted 9 January, 2020;
originally announced January 2020.
-
Preliminary Results on a New Algorithm for Blink Correction Adaptive to Inter- and Intra-Subject Variability
Authors:
E. Guttmann-Flury,
X. Sheng,
D. Zhang,
X. Zhu
Abstract:
This paper presents a new preprocessing method to correct blinking artifacts in Electroencephalography (EEG) based Brain-Computer Interfaces (BCIs). This Algorithm for Blink Correction (ABC) directly corrects the signal in the time domain without the need for additional Electrooculogram (EOG) electrodes. The main idea is to automatically adapt to the blink's inter- and intra-subject variability by…
▽ More
This paper presents a new preprocessing method to correct blinking artifacts in Electroencephalography (EEG) based Brain-Computer Interfaces (BCIs). This Algorithm for Blink Correction (ABC) directly corrects the signal in the time domain without the need for additional Electrooculogram (EOG) electrodes. The main idea is to automatically adapt to the blink's inter- and intra-subject variability by considering the blink's amplitude as a parameter. A simple Minimum Distance to Riemannian Mean (MDRM) is applied as the classification algorithm. Preliminary results on three subjects show a mean classification accuracy increase of 13.7% using ABC.
△ Less
Submitted 31 October, 2019;
originally announced October 2019.
-
Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fMRI Data
Authors:
Xiaoliang Sheng,
Muhammad Yousefnezhad,
Tonglin Xu,
Ning Yuan,
Daoqiang Zhang
Abstract:
Representational Similarity Analysis (RSA) aims to explore similarities between neural activities of different stimuli. Classical RSA techniques employ the inverse of the covariance matrix to explore a linear model between the neural activities and task events. However, calculating the inverse of a large-scale covariance matrix is time-consuming and can reduce the stability and robustness of the f…
▽ More
Representational Similarity Analysis (RSA) aims to explore similarities between neural activities of different stimuli. Classical RSA techniques employ the inverse of the covariance matrix to explore a linear model between the neural activities and task events. However, calculating the inverse of a large-scale covariance matrix is time-consuming and can reduce the stability and robustness of the final analysis. Notably, it becomes severe when the number of samples is too large. For facing this shortcoming, this paper proposes a novel RSA method called gradient-based RSA (GRSA). Moreover, the proposed method is not restricted to a linear model. In fact, there is a growing interest in finding more effective ways of using multi-subject and whole-brain fMRI data. Searchlight technique can extend RSA from the localized brain regions to the whole-brain regions with smaller memory footprint in each process. Based on Searchlight, we propose a new method called Spatiotemporal Searchlight GRSA (SSL-GRSA) that generalizes our ROI-based GRSA algorithm to the whole-brain data. Further, our approach can handle some computational challenges while dealing with large-scale, multi-subject fMRI data. Experimental studies on multi-subject datasets confirm that both proposed approaches achieve superior performance to other state-of-the-art RSA algorithms.
△ Less
Submitted 12 September, 2018;
originally announced September 2018.
-
Multi-Objective Cognitive Model: a supervised approach for multi-subject fMRI analysis
Authors:
Muhammad Yousefnezhad,
Daoqiang Zhang
Abstract:
In order to decode the human brain, Multivariate Pattern (MVP) classification generates cognitive models by using functional Magnetic Resonance Imaging (fMRI) datasets. As a standard pipeline in the MVP analysis, brain patterns in multi-subject fMRI dataset must be mapped to a shared space and then a classification model is generated by employing the mapped patterns. However, the MVP models may no…
▽ More
In order to decode the human brain, Multivariate Pattern (MVP) classification generates cognitive models by using functional Magnetic Resonance Imaging (fMRI) datasets. As a standard pipeline in the MVP analysis, brain patterns in multi-subject fMRI dataset must be mapped to a shared space and then a classification model is generated by employing the mapped patterns. However, the MVP models may not provide stable performance on a new fMRI dataset because the standard pipeline uses disjoint steps for generating these models. Indeed, each step in the pipeline includes an objective function with independent optimization approach, where the best solution of each step may not be optimum for the next steps. For tackling the mentioned issue, this paper introduces the Multi-Objective Cognitive Model (MOCM) that utilizes an integrated objective function for MVP analysis rather than just using those disjoint steps. For solving the integrated problem, we proposed a customized multi-objective optimization approach, where all possible solutions are firstly generated, and then our method ranks and selects the robust solutions as the final results. Empirical studies confirm that the proposed method can generate superior performance in comparison with other techniques.
△ Less
Submitted 5 August, 2018;
originally announced August 2018.
-
Gradient Hyperalignment for multi-subject fMRI data alignment
Authors:
Tonglin Xu,
Muhammad Yousefnezhad,
Daoqiang Zhang
Abstract:
Multi-subject fMRI data analysis is an interesting and challenging problem in human brain decoding studies. The inherent anatomical and functional variability across subjects make it necessary to do both anatomical and functional alignment before classification analysis. Besides, when it comes to big data, time complexity becomes a problem that cannot be ignored. This paper proposes Gradient Hyper…
▽ More
Multi-subject fMRI data analysis is an interesting and challenging problem in human brain decoding studies. The inherent anatomical and functional variability across subjects make it necessary to do both anatomical and functional alignment before classification analysis. Besides, when it comes to big data, time complexity becomes a problem that cannot be ignored. This paper proposes Gradient Hyperalignment (Gradient-HA) as a gradient-based functional alignment method that is suitable for multi-subject fMRI datasets with large amounts of samples and voxels. The advantage of Gradient-HA is that it can solve independence and high dimension problems by using Independent Component Analysis (ICA) and Stochastic Gradient Ascent (SGA). Validation using multi-classification tasks on big data demonstrates that Gradient-HA method has less time complexity and better or comparable performance compared with other state-of-the-art functional alignment methods.
△ Less
Submitted 7 July, 2018;
originally announced July 2018.
-
Deep Hyperalignment
Authors:
Muhammad Yousefnezhad,
Daoqiang Zhang
Abstract:
This paper proposes Deep Hyperalignment (DHA) as a regularized, deep extension, scalable Hyperalignment (HA) method, which is well-suited for applying functional alignment to fMRI datasets with nonlinearity, high-dimensionality (broad ROI), and a large number of subjects. Unlink previous methods, DHA is not limited by a restricted fixed kernel function. Further, it uses a parametric approach, rank…
▽ More
This paper proposes Deep Hyperalignment (DHA) as a regularized, deep extension, scalable Hyperalignment (HA) method, which is well-suited for applying functional alignment to fMRI datasets with nonlinearity, high-dimensionality (broad ROI), and a large number of subjects. Unlink previous methods, DHA is not limited by a restricted fixed kernel function. Further, it uses a parametric approach, rank-$m$ Singular Value Decomposition (SVD), and stochastic gradient descent for optimization. Therefore, DHA has a suitable time complexity for large datasets, and DHA does not require the training data when it computes the functional alignment for a new subject. Experimental studies on multi-subject fMRI analysis confirm that the DHA method achieves superior performance to other state-of-the-art HA algorithms.
△ Less
Submitted 11 October, 2017;
originally announced October 2017.
-
Anatomical Pattern Analysis for decoding visual stimuli in human brains
Authors:
Muhammad Yousefnezhad,
Daoqiang Zhang
Abstract:
Background: A universal unanswered question in neuroscience and machine learning is whether computers can decode the patterns of the human brain. Multi-Voxels Pattern Analysis (MVPA) is a critical tool for addressing this question. However, there are two challenges in the previous MVPA methods, which include decreasing sparsity and noise in the extracted features and increasing the performance of…
▽ More
Background: A universal unanswered question in neuroscience and machine learning is whether computers can decode the patterns of the human brain. Multi-Voxels Pattern Analysis (MVPA) is a critical tool for addressing this question. However, there are two challenges in the previous MVPA methods, which include decreasing sparsity and noise in the extracted features and increasing the performance of prediction.
Methods: In overcoming mentioned challenges, this paper proposes Anatomical Pattern Analysis (APA) for decoding visual stimuli in the human brain. This framework develops a novel anatomical feature extraction method and a new imbalance AdaBoost algorithm for binary classification. Further, it utilizes an Error-Correcting Output Codes (ECOC) method for multiclass prediction. APA can automatically detect active regions for each category of the visual stimuli. Moreover, it enables us to combine homogeneous datasets for applying advanced classification.
Results and Conclusions: Experimental studies on 4 visual categories (words, consonants, objects and scrambled photos) demonstrate that the proposed approach achieves superior performance to state-of-the-art methods.
△ Less
Submitted 5 October, 2017;
originally announced October 2017.
-
Targeted and Imaging-guided In Vivo Photodynamic Therapy of Tumors Using Dual-functional, Aggregation-induced Emission Nanoparticles
Authors:
Xianhe Sun,
Abudureheman zebibula,
Xiaobiao Dong,
Gonghui Li,
Guanxin Zhang,
Deqing Zhang,
Jun Qian,
Sailing He
Abstract:
Dual-functional nanoparticles, with the property of aggregation-induced emission and the capability of reactive oxygen species, were used to achieve passive/active targeting of tumor. Good contrast in in vivo imaging and obvious therapeutic efficiency were realized with a low dose of AIE nanoparticles as well as a low power density of light, resulting in negligible side effects.
Dual-functional nanoparticles, with the property of aggregation-induced emission and the capability of reactive oxygen species, were used to achieve passive/active targeting of tumor. Good contrast in in vivo imaging and obvious therapeutic efficiency were realized with a low dose of AIE nanoparticles as well as a low power density of light, resulting in negligible side effects.
△ Less
Submitted 22 August, 2017;
originally announced August 2017.
-
Cascade and Parallel Convolutional Recurrent Neural Networks on EEG-based Intention Recognition for Brain Computer Interface
Authors:
Dalin Zhang,
Lina Yao,
Xiang Zhang,
Sen Wang,
Weitong Chen,
Robert Boots
Abstract:
Brain-Computer Interface (BCI) is a system empowering humans to communicate with or control the outside world with exclusively brain intentions. Electroencephalography (EEG) based BCIs are promising solutions due to their convenient and portable instruments. Motor imagery EEG (MI-EEG) is a kind of most widely focused EEG signals, which reveals a subjects movement intentions without actual actions.…
▽ More
Brain-Computer Interface (BCI) is a system empowering humans to communicate with or control the outside world with exclusively brain intentions. Electroencephalography (EEG) based BCIs are promising solutions due to their convenient and portable instruments. Motor imagery EEG (MI-EEG) is a kind of most widely focused EEG signals, which reveals a subjects movement intentions without actual actions. Despite the extensive research of MI-EEG in recent years, it is still challenging to interpret EEG signals effectively due to the massive noises in EEG signals (e.g., low signal noise ratio and incomplete EEG signals), and difficulties in capturing the inconspicuous relationships between EEG signals and certain brain activities. Most existing works either only consider EEG as chain-like sequences neglecting complex dependencies between adjacent signals or performing simple temporal averaging over EEG sequences. In this paper, we introduce both cascade and parallel convolutional recurrent neural network models for precisely identifying human intended movements by effectively learning compositional spatio-temporal representations of raw EEG streams. The proposed models grasp the spatial correlations between physically neighboring EEG signals by converting the chain like EEG sequences into a 2D mesh like hierarchy. An LSTM based recurrent network is able to extract the subtle temporal dependencies of EEG data streams. Extensive experiments on a large-scale MI-EEG dataset (108 subjects, 3,145,160 EEG records) have demonstrated that both models achieve high accuracy near 98.3% and outperform a set of baseline methods and most recent deep learning based EEG recognition models, yielding a significant accuracy increase of 18% in the cross-subject validation scenario.
△ Less
Submitted 10 June, 2021; v1 submitted 22 August, 2017;
originally announced August 2017.