Skip to main content

Showing 1–50 of 54 results for author: Zhang, D

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2506.12567  [pdf

    q-bio.QM

    Maximal Speed of Glucose Change Significantly Distinguishes Prediabetes from Diabetes

    Authors: Dandan Wang, Xiaoyan Chen, Jingxiang Lin, Teng Zhang, Lianyi Huang, Dongliang Leng, Xiaohua Douglas Zhang, Gang Li

    Abstract: Rapid changes in blood glucose levels can have severe and immediate health consequences, leading to the need to develop indices for assessing these rapid changes based on continuous glucose monitoring (CGM) data. We proposed a CGM index, maxSpeed, that represents the maximum of speed of glucose change (SGC) in a subject, respectively, and conducted a clinical study to investigate this index along… ▽ More

    Submitted 14 June, 2025; originally announced June 2025.

  2. arXiv:2506.12107  [pdf

    q-bio.MN

    Network Pharmacology Reveals HSPA1A/BST2 as Potential Targets of Ci Bai Capsule's Active Compounds Intervening in Leukopenia

    Authors: Dingfan Zhang, Congshu Huang, Lei Zhou, Boyang Wang, Wei Zhou, Tiantian Xia, Pan Shen, Shao Li, Yue Gao

    Abstract: Background: Radiation-induced leukopenia caused by low-dose exposure is frequently associated with Traditional Chinese Medicine (TCM) syndromes like "blood deficiency" and "fatigue syndrome". Ci Bai Capsule (CB) has been reported to enhance white blood cell levels; however, its mechanisms and bioactive compounds remain unclear.Aim: This study aimed to identify the bioactive compounds group of CB a… ▽ More

    Submitted 13 June, 2025; originally announced June 2025.

  3. arXiv:2504.11454  [pdf, ps, other

    cs.LG cs.AI q-bio.QM

    Elucidating the Design Space of Multimodal Protein Language Models

    Authors: Cheng-Yen Hsieh, Xinyou Wang, Daiheng Zhang, Dongyu Xue, Fei Ye, Shujian Huang, Zaixiang Zheng, Quanquan Gu

    Abstract: Multimodal protein language models (PLMs) integrate sequence and token-based structural information, serving as a powerful foundation for protein modeling, generation, and design. However, the reliance on tokenizing 3D structures into discrete tokens causes substantial loss of fidelity about fine-grained structural details and correlations. In this paper, we systematically elucidate the design spa… ▽ More

    Submitted 11 June, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

    Comments: ICML 2025 Spotlight; Project Page: https://bytedance.github.io/dplm/dplm-2.1/

  4. arXiv:2504.02488  [pdf, other

    q-bio.PE physics.soc-ph

    A Behaviour and Disease Model of Testing and Isolation

    Authors: Matthew Ryan, Roslyn I. Hickson, Edward M. Hill, Thomas House, Valerie Isham, Dongni Zhang, Mick G. Roberts

    Abstract: There has been interest in the interactions between infectious disease dynamics and behaviour for most of the history of mathematical epidemiology. This has included consideration of which mathematical models best capture each phenomenon, as well as their interaction, but typically in a manner that is agnostic to the exact behaviour in question. Here, we investigate interacting behaviour and disea… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 22 pages, 10 figures

    MSC Class: 92D30; 05C80; 34A45

  5. arXiv:2502.17213  [pdf, other

    q-bio.NC cs.AI cs.LG eess.SP

    Deep Learning-Powered Electrical Brain Signals Analysis: Advancing Neurological Diagnostics

    Authors: Jiahe Li, Xin Chen, Fanqi Shen, Junru Chen, Yuxin Liu, Daoze Zhang, Zhizhang Yuan, Fang Zhao, Meng Li, Yang Yang

    Abstract: Neurological disorders represent significant global health challenges, driving the advancement of brain signal analysis methods. Scalp electroencephalography (EEG) and intracranial electroencephalography (iEEG) are widely used to diagnose and monitor neurological conditions. However, dataset heterogeneity and task variations pose challenges in developing robust deep learning solutions. This review… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

  6. arXiv:2501.11291  [pdf

    q-bio.TO q-bio.QM

    Multilineage-differentiating stress-enduring cells alleviate neuropathic pain in mice by secreting TGF-b and IL-10

    Authors: Yayu Zhao, Ying Fei, Yunyun Cai, Zhongya Wei, Ying Chen, Yuhua Ji, Xue Chen, Dongmei Zhang, Gang Chen

    Abstract: Neuropathic pain is a chronic condition characterized by damage to and dysfunction of the peripheral or central nervous system. There are currently no effective treatment options available for neuropathic pain, and existing drugs often provide only temporary relief with potential side effects. Multilineage-differentiating stress-enduring (Muse) cells are characterized by high expansion potential,… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

  7. arXiv:2412.19191  [pdf, other

    q-bio.BM cs.AI cs.LG

    Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models

    Authors: Haonan He, Yuchen Ren, Yining Tang, Ziyang Xu, Junxian Li, Minghao Yang, Di Zhang, Dong Yuan, Tao Chen, Shufei Zhang, Yuqiang Li, Nanqing Dong, Wanli Ouyang, Dongzhan Zhou, Peng Ye

    Abstract: Large language models have already demonstrated their formidable capabilities in general domains, ushering in a revolutionary transformation. However, exploring and exploiting the extensive knowledge of these models to comprehend multi-omics biology remains underexplored. To fill this research gap, we first introduce Biology-Instructions, the first large-scale multi-omics biological sequences-rela… ▽ More

    Submitted 26 December, 2024; originally announced December 2024.

  8. arXiv:2411.14721  [pdf, other

    cs.CL cs.LG q-bio.QM

    MolReFlect: Towards In-Context Fine-grained Alignments between Molecules and Texts

    Authors: Jiatong Li, Yunqing Liu, Wei Liu, Jingdi Le, Di Zhang, Wenqi Fan, Dongzhan Zhou, Yuqiang Li, Qing Li

    Abstract: Molecule discovery is a pivotal research field, impacting everything from the medicines we take to the materials we use. Recently, Large Language Models (LLMs) have been widely adopted in molecule understanding and generation, yet the alignments between molecules and their corresponding captions remain a significant challenge. Previous endeavours often treat the molecule as a general SMILES string… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 22 pages, 12 figures

  9. arXiv:2411.11668  [pdf, other

    cs.LG q-bio.QM

    Efficient and Robust Continual Graph Learning for Graph Classification in Biology

    Authors: Ding Zhang, Jane Downer, Can Chen, Ren Wang

    Abstract: Graph classification is essential for understanding complex biological systems, where molecular structures and interactions are naturally represented as graphs. Traditional graph neural networks (GNNs) perform well on static tasks but struggle in dynamic settings due to catastrophic forgetting. We present Perturbed and Sparsified Continual Graph Learning (PSCGL), a robust and efficient continual g… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  10. arXiv:2411.08306  [pdf, other

    cs.LG q-bio.QM

    Evaluating Molecule Synthesizability via Retrosynthetic Planning and Reaction Prediction

    Authors: Songtao Liu, Dandan Zhang, Zhengkai Tu, Hanjun Dai, Peng Liu

    Abstract: A significant challenge in wet lab experiments with current drug design generative models is the trade-off between pharmacological properties and synthesizability. Molecules predicted to have highly desirable properties are often difficult to synthesize, while those that are easily synthesizable tend to exhibit less favorable properties. As a result, evaluating the synthesizability of molecules in… ▽ More

    Submitted 3 April, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

  11. arXiv:2411.05825  [pdf, other

    q-bio.NC cs.AI cs.CV

    SurfGNN: A robust surface-based prediction model with interpretability for coactivation maps of spatial and cortical features

    Authors: Zhuoshuo Li, Jiong Zhang, Youbing Zeng, Jiaying Lin, Dan Zhang, Jianjia Zhang, Duan Xu, Hosung Kim, Bingguang Liu, Mengting Liu

    Abstract: Current brain surface-based prediction models often overlook the variability of regional attributes at the cortical feature level. While graph neural networks (GNNs) excel at capturing regional differences, they encounter challenges when dealing with complex, high-density graph structures. In this work, we consider the cortical surface mesh as a sparse graph and propose an interpretable prediction… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 15 pages, 6 figures

    ACM Class: J.3

  12. arXiv:2411.04568  [pdf, other

    cs.HC eess.SP q-bio.NC

    Dynamic-Attention-based EEG State Transition Modeling for Emotion Recognition

    Authors: Xinke Shen, Runmin Gan, Kaixuan Wang, Shuyi Yang, Qingzhu Zhang, Quanying Liu, Dan Zhang, Sen Song

    Abstract: Electroencephalogram (EEG)-based emotion decoding can objectively quantify people's emotional state and has broad application prospects in human-computer interaction and early detection of emotional disorders. Recently emerging deep learning architectures have significantly improved the performance of EEG emotion decoding. However, existing methods still fall short of fully capturing the complex s… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 14 pages, 6 figures

  13. arXiv:2410.05292  [pdf, other

    cs.LG cs.AI q-bio.QM

    CaLMFlow: Volterra Flow Matching using Causal Language Models

    Authors: Sizhuang He, Daniel Levine, Ivan Vrkic, Marco Francesco Bressana, David Zhang, Syed Asad Rizvi, Yangtian Zhang, Emanuele Zappala, David van Dijk

    Abstract: We introduce CaLMFlow (Causal Language Models for Flow Matching), a novel framework that casts flow matching as a Volterra integral equation (VIE), leveraging the power of large language models (LLMs) for continuous data generation. CaLMFlow enables the direct application of LLMs to learn complex flows by formulating flow matching as a sequence modeling task, bridging discrete language modeling an… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 10 pages, 9 figures, 7 tables

  14. arXiv:2410.00327  [pdf, other

    cs.LG cs.AI cs.CE q-bio.QM

    EnzymeFlow: Generating Reaction-specific Enzyme Catalytic Pockets through Flow Matching and Co-Evolutionary Dynamics

    Authors: Chenqing Hua, Yong Liu, Dinghuai Zhang, Odin Zhang, Sitao Luan, Kevin K. Yang, Guy Wolf, Doina Precup, Shuangjia Zheng

    Abstract: Enzyme design is a critical area in biotechnology, with applications ranging from drug development to synthetic biology. Traditional methods for enzyme function prediction or protein binding pocket design often fall short in capturing the dynamic and complex nature of enzyme-substrate interactions, particularly in catalytic processes. To address the challenges, we introduce EnzymeFlow, a generativ… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

  15. arXiv:2407.11622  [pdf, other

    q-bio.PE math.PR

    Sideward contact tracing in an epidemic model with mixing groups

    Authors: Dongni Zhang, Martina Favero

    Abstract: We consider a stochastic epidemic model with sideward contact tracing. We assume that infection is driven by interactions within mixing events (gatherings of two or more individuals). Once an infective is diagnosed, each individual who was infected at the same event as the diagnosed individual is contact traced with some given probability. Assuming few initial infectives in a large population, the… ▽ More

    Submitted 26 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

  16. arXiv:2406.09817  [pdf, other

    physics.chem-ph q-bio.BM

    Efficient and Precise Force Field Optimization for Biomolecules Using DPA-2

    Authors: Junhan Chang, Duo Zhang, Yuqing Deng, Hongrui Lin, Zhirong Liu, Linfeng Zhang, Hang Zheng, Xinyan Wang

    Abstract: Molecular simulations are essential tools in computational chemistry, enabling the prediction and understanding of molecular interactions and thermodynamic properties of biomolecules. However, traditional force fields face significant challenges in accurately representing novel molecules and complex chemical environments due to the labor-intensive process of manually setting optimization parameter… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  17. arXiv:2404.05329  [pdf

    q-bio.BM

    In silico bioactivity prediction of proteins interacting with graphene-based nanomaterials guides rational design of biosensor

    Authors: Jing Ye, Minzhi Fan, Xiaoyu Zhang, Shasha Lu, Mengyao Chai, Yunshan Zhang, Xiaoyu Zhao, Shuang Li, Diming Zhang

    Abstract: Graphene based nanomaterials have attracted significant attention for their potentials in biomedical and biotechnology applications in recent years, owing to the outstanding physical and chemical properties. However, the interaction mechanism and impact on biological activity of macro and micro biomolecules still require more concerns and further research in order to enhance their applicability in… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  18. arXiv:2403.20163  [pdf, other

    cs.NE q-bio.NC

    Biologically-Plausible Topology Improved Spiking Actor Network for Efficient Deep Reinforcement Learning

    Authors: Duzhen Zhang, Qingyu Wang, Tielin Zhang, Bo Xu

    Abstract: The success of Deep Reinforcement Learning (DRL) is largely attributed to utilizing Artificial Neural Networks (ANNs) as function approximators. Recent advances in neuroscience have unveiled that the human brain achieves efficient reward-based learning, at least by integrating spiking neurons with spatial-temporal dynamics and network topologies with biologically-plausible connectivity patterns. T… ▽ More

    Submitted 29 March, 2024; originally announced March 2024.

    Comments: Work in Progress

  19. arXiv:2402.14213  [pdf

    q-bio.NC cs.LG eess.SP

    Contrastive Learning of Shared Spatiotemporal EEG Representations Across Individuals for Naturalistic Neuroscience

    Authors: Xinke Shen, Lingyi Tao, Xuyang Chen, Sen Song, Quanying Liu, Dan Zhang

    Abstract: Neural representations induced by naturalistic stimuli offer insights into how humans respond to stimuli in daily life. Understanding neural mechanisms underlying naturalistic stimuli processing hinges on the precise identification and extraction of the shared neural patterns that are consistently present across individuals. Targeting the Electroencephalogram (EEG) technique, known for its rich sp… ▽ More

    Submitted 13 July, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: 54 pages, 17 figures

  20. arXiv:2402.13392  [pdf, other

    physics.soc-ph math.PR q-bio.PE

    An SEIR network epidemic model with manual and digital contact tracing allowing delays

    Authors: Dongni Zhang, Tom Britton

    Abstract: We consider an SEIR epidemic model on a network also allowing random contacts, where recovered individuals could either recover naturally or be diagnosed. Upon diagnosis, manual contact tracing is triggered such that each infected network contact is reported, tested and isolated with some probability and after a random delay. Additionally, digital tracing (based on a tracing app) is triggered if t… ▽ More

    Submitted 5 June, 2024; v1 submitted 20 February, 2024; originally announced February 2024.

  21. arXiv:2310.08774  [pdf, other

    q-bio.PE cs.LG stat.ML

    PhyloGFN: Phylogenetic inference with generative flow networks

    Authors: Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

    Abstract: Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt… ▽ More

    Submitted 24 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  22. arXiv:2307.05628  [pdf, other

    q-bio.GN cs.LG

    DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

    Authors: Daoan Zhang, Weitong Zhang, Yu Zhao, Jianguo Zhang, Bing He, Chenchen Qin, Jianhua Yao

    Abstract: Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge. To address this, we propose DNAGPT, a generalized DNA pre-training model trained on over 200 billion base pairs from all mammals. By enhancing the classic GPT model with a binary classification task (DNA sequence order), a… ▽ More

    Submitted 30 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  23. arXiv:2305.19544  [pdf, other

    q-bio.PE physics.soc-ph

    A data-driven analysis on the mediation effect of compartment models between control measures and COVID-19 epidemics

    Authors: Dongyan Zhang, Wuyue Yang, Wanqi Wen, Liangrong Peng, Changjingn Zhuge, Liu Hong

    Abstract: We make a retrospective review on various control measures taken by 127 countries/territories during the first wave of COVID-19 pandemic until July 7, 2020, and evaluate their impacts on the epidemic dynamics quantitatively. The SEIR-QD model, as a representative for general compartment models, is used to fit the epidemic data, enabling the extraction of crucial model parameters and dynamical feat… ▽ More

    Submitted 22 September, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: 21 pages, 6 figures, 1 tables

    MSC Class: 92-10

  24. arXiv:2305.04931  [pdf

    q-bio.QM

    Network pharmacology on the mechanism of Yi Qi Tong Qiao Pill inhibiting allergic rhinitis

    Authors: Boyang Wang, DingFan Zhang, Tingyu Zhang, Chayanis Sutcharitchan, Jianlin Hua, Dongfang Hua, Bo Zhang, Shao Li

    Abstract: Objective: The purpose of this study is to reveal the mechanism of action of Yi Qi Tong Qiao Pill (YQTQP) in the treatment of allergic rhinitis (AR), as well as establish a paradigm for the researches on traditional Chinese medicine (TCM) from systematic perspective. Methods: Based on the data collected from TCM-related and disease-related databases, target profiles of compounds in YQTQP were calc… ▽ More

    Submitted 21 May, 2023; v1 submitted 6 May, 2023; originally announced May 2023.

    Comments: 25 pages, 6 figures

    MSC Class: None

  25. arXiv:2301.11356  [pdf, other

    cs.SC cs.CE q-bio.QM

    The Automated Discovery of Kinetic Rate Models -- Methodological Frameworks

    Authors: Miguel Ángel de Carvalho Servia, Ilya Orson Sandoval, Klaus Hellgardt, King Kuok, Hii, Dongda Zhang, Ehecatl Antonio del Rio Chanona

    Abstract: The industrialization of catalytic processes requires reliable kinetic models for their design, optimization and control. Mechanistic models require significant domain knowledge, while data-driven and hybrid models lack interpretability. Automated knowledge discovery methods, such as ALAMO (Automated Learning of Algebraic Models for Optimization), SINDy (Sparse Identification of Nonlinear Dynamics… ▽ More

    Submitted 2 November, 2023; v1 submitted 26 January, 2023; originally announced January 2023.

  26. arXiv:2211.12869  [pdf, other

    math.PR q-bio.PE

    Epidemic models with digital and manual contact tracing

    Authors: Tom Britton, Dongni Zhang

    Abstract: We analyze a Markovian SIR epidemic model where individuals either recover naturally or are diagnosed, leading to isolation and potential contact tracing. Our focus is on digital contact tracing via a tracing app, considering both its standalone use and combination with manual tracing. We prove that as the population size $n$ grows large, the epidemic process converges to a limiting process, which… ▽ More

    Submitted 27 March, 2025; v1 submitted 23 November, 2022; originally announced November 2022.

  27. arXiv:2209.14664  [pdf

    q-bio.QM cs.LG stat.AP

    Causal inference in drug discovery and development

    Authors: Tom Michoel, Jitao David Zhang

    Abstract: To discover new drugs is to seek and to prove causality. As an emerging approach leveraging human knowledge and creativity, data, and machine intelligence, causal inference holds the promise of reducing cognitive bias and improving decision making in drug discovery. While it has been applied across the value chain, the concepts and practice of causal inference remain obscure to many practitioners.… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  28. arXiv:2207.03288  [pdf

    q-bio.GN

    Further analysis of metagenomic datasets containing GD and GX pangolin CoVs indicates widespread contamination, undermining pangolin host attribution

    Authors: Adrian Jones, Steven E. Massey, Daoyu Zhang, Yuri Deigin, Steven C. Quay

    Abstract: The only animals other than bats reported to have been infected with SARS-CoV-2-related coronaviruses (SARS2r-CoVs) prior to the COVID-19 pandemic are pangolins. In early 2020 multiple papers reported the identification of two clades of SARS2r-CoVs, GD and GX, infecting pangolins. However the RNA-Seq datasets supporting pangolin genome assembly were widely contaminated, contained synthetic vectors… ▽ More

    Submitted 11 July, 2022; v1 submitted 7 July, 2022; originally announced July 2022.

    Comments: 46 pages, 32 figures

  29. arXiv:2206.04349  [pdf, other

    cs.CV cs.AI q-bio.GN q-bio.QM stat.ME

    Deep radiomic signature with immune cell markers predicts the survival of glioma patients

    Authors: Ahmad Chaddad, Paul Daniel Mingli Zhang, Saima Rathore, Paul Sargos, Christian Desrosiers, Tamim Niazi

    Abstract: Imaging biomarkers offer a non-invasive way to predict the response of immunotherapy prior to treatment. In this work, we propose a novel type of deep radiomic features (DRFs) computed from a convolutional neural network (CNN), which capture tumor characteristics related to immune cell markers and overall survival. Our study uses four MRI sequences (T1-weighted, T1-weighted post-contrast, T2-weigh… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

    Journal ref: Neurocomputing, Volume 469, 16 January 2022, Pages 366-375

  30. arXiv:2203.04115  [pdf, other

    q-bio.BM cs.LG

    Biological Sequence Design with GFlowNets

    Authors: Moksh Jain, Emmanuel Bengio, Alex-Hernandez Garcia, Jarrid Rector-Brooks, Bonaventure F. P. Dossou, Chanakya Ekbote, Jie Fu, Tianyu Zhang, Micheal Kilgour, Dinghuai Zhang, Lena Simine, Payel Das, Yoshua Bengio

    Abstract: Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key con… ▽ More

    Submitted 24 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: ICML 2022. 15 pages, 3 figures. Code available at: https://github.com/MJ10/BioSeq-GFN-AL. Updated GFP results

  31. arXiv:2110.07220  [pdf, other

    q-bio.PE math.PR

    Analysing the Effect of Test-and-Trace Strategy in an SIR Epidemic Model

    Authors: Dongni Zhang, Tom Britton

    Abstract: Consider a Markovian SIR epidemic model in a homogeneous community. To this model we add a rate at which individuals are tested, and once an infectious individual tests positive it is isolated and each of their contacts are traced and tested independently with some fixed probability. If such a traced individual tests positive it is isolated, and the contact tracing is iterated. This model is analy… ▽ More

    Submitted 5 August, 2022; v1 submitted 14 October, 2021; originally announced October 2021.

  32. arXiv:2110.03372  [pdf, other

    cs.LG cs.AI q-bio.BM stat.ME stat.ML

    Unifying Likelihood-free Inference with Black-box Optimization and Beyond

    Authors: Dinghuai Zhang, Jie Fu, Yoshua Bengio, Aaron Courville

    Abstract: Black-box optimization formulations for biological sequence design have drawn recent attention due to their promising potential impact on the pharmaceutical industry. In this work, we propose to unify two seemingly distinct worlds: likelihood-free inference and black-box optimization, under one probabilistic framework. In tandem, we provide a recipe for constructing various sequence design methods… ▽ More

    Submitted 8 February, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 spotlight

  33. arXiv:2109.09559  [pdf

    cs.HC cs.LG eess.SP q-bio.NC

    Contrastive Learning of Subject-Invariant EEG Representations for Cross-Subject Emotion Recognition

    Authors: Xinke Shen, Xianggen Liu, Xin Hu, Dan Zhang, Sen Song

    Abstract: EEG signals have been reported to be informative and reliable for emotion recognition in recent years. However, the inter-subject variability of emotion-related EEG signals still poses a great challenge for the practical applications of EEG-based emotion recognition. Inspired by recent neuroscience studies on inter-subject correlation, we proposed a Contrastive Learning method for Inter-Subject Al… ▽ More

    Submitted 5 April, 2022; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: 23 pages, 13 figures, journal paper. IEEE Transactions on Affective Computing, 2022

  34. arXiv:2109.09112  [pdf

    q-bio.GN q-bio.PE

    Nipah virus vector sequences in COVID-19 patient samples sequenced by the Wuhan Institute of Virology

    Authors: Steven C. Quay, Daoyu Zhang, Adrian Jones, Yuri Deigin

    Abstract: We report the detection of Nipah virus in an infectious clone format, a BSL4-level pathogen and CDC-designated Bioterrorism Agent, in raw RNA-Seq sequencing reads deposited by the Wuhan Institute of Virology (WIV) produced from five December 2019 patients infected with SARS-CoV-2. Research involving Nipah infectious clones has never been reported to have occured at the WIV. These patient samples h… ▽ More

    Submitted 19 September, 2021; originally announced September 2021.

    Comments: 16 pages, 4 figures, and supplemental materials

  35. arXiv:2108.08163  [pdf

    q-bio.GN

    Analysis of pangolin metagenomic datasets reveals significant contamination, raising concerns for pangolin CoV host attribution

    Authors: Adrian Jones, Daoyu Zhang, Yuri Deigin, Steven C. Quay

    Abstract: Metagenomic datasets from pangolin tissue specimens have previously yielded SARS-related coronaviruses which show high homology in their receptor binding domain to SARS-CoV-2, suggesting a potential zoonotic source for this feature of the human virus, possibly via recombination (Liu et al. 2019, Lam et al. 2020, Xiao et al. 2020, Liu et al. 2020). Here we re-examine these published datasets. We re… ▽ More

    Submitted 1 March, 2022; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: 55 pages, 15 figures

  36. arXiv:2104.01533  [pdf

    q-bio.GN

    Unexpected novel Merbecovirus discoveries in agricultural sequencing datasets from Wuhan, China

    Authors: Daoyu Zhang, Adrian Jones, Yuri Deigin, Karl Sirotkin, Alejandro Sousa

    Abstract: In this study we document the unexpected discovery of multiple coronaviruses and a BSL-3 pathogen in agricultural cotton and rice sequencing datasets. In particular, we have identified a novel HKU5-related Merbecovirus in a cotton dataset sequenced by the Huazhong Agricultural University in 2017. We have also found an infectious clone sequence containing a novel HKU4-related Merbecovirus related t… ▽ More

    Submitted 6 June, 2021; v1 submitted 3 April, 2021; originally announced April 2021.

    Comments: Supplementary information and data can be found in Zenodo datasets doi: 10.5281/zenodo.4660981, doi: 10.5281/zenodo.4620604, doi: 10.5281/zenodo.4399248

  37. arXiv:2102.03910  [pdf

    q-bio.PE q-bio.GN

    An open debate on SARS-CoV-2's proximal origin is long overdue

    Authors: Rossana Segreto, Yuri Deigin, Kevin McCairn, Alejandro Sousa, Dan Sirotkin, Karl Sirotkin, Jonathan J. Couey, Adrian Jones, Daoyu Zhang

    Abstract: There is a near consensus view that SARS-CoV-2 has a natural zoonotic origin; however, several characteristics of SARS-CoV-2 taken together are not easily explained by a natural zoonotic origin hypothesis. These include: a low rate of evolution in the early phase of transmission; the lack of evidence of recombination events; a high pre-existing binding to human ACE2; a novel furin cleavage site in… ▽ More

    Submitted 9 February, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

  38. arXiv:2010.15594  [pdf, other

    cs.LG cs.AI eess.IV math.FA q-bio.NC

    Shared Space Transfer Learning for analyzing multi-site fMRI data

    Authors: Muhammad Yousefnezhad, Alessandro Selvitella, Daoqiang Zhang, Andrew J. Greenshaw, Russell Greiner

    Abstract: Multi-voxel pattern analysis (MVPA) learns predictive models from task-based functional magnetic resonance imaging (fMRI) data, for distinguishing when subjects are performing different cognitive tasks -- e.g., watching movies or making decisions. MVPA works best with a well-designed feature set and an adequate sample size. However, most fMRI datasets are noisy, high-dimensional, expensive to coll… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: 34th Conference on Neural Information Processing Systems (NeurIPS 2020), Vancouver, Canada. The Supplementary Material: https://www.yousefnezhad.com/publications/NeurIPS2020_Paper4157_SuppMat.zip

  39. arXiv:2010.02012  [pdf, other

    eess.IV cs.AI cs.LG q-bio.NC

    Deep Representational Similarity Learning for analyzing neural signatures in task-based fMRI dataset

    Authors: Muhammad Yousefnezhad, Jeffrey Sawalha, Alessandro Selvitella, Daoqiang Zhang

    Abstract: Similarity analysis is one of the crucial steps in most fMRI studies. Representational Similarity Analysis (RSA) can measure similarities of neural signatures generated by different cognitive states. This paper develops Deep Representational Similarity Learning (DRSL), a deep extension of RSA that is appropriate for analyzing similarities between various cognitive tasks in fMRI datasets with a lar… ▽ More

    Submitted 28 September, 2020; originally announced October 2020.

    Comments: Neuroinformatics

  40. arXiv:2003.05666  [pdf, other

    q-bio.PE math.DS q-bio.QM

    Rational evaluation of various epidemic models based on the COVID-19 data of China

    Authors: Wuyue Yang, Dongyan Zhang, Liangrong Peng, Changjing Zhuge, Liu Hong

    Abstract: In this paper, based on the Akaike information criterion, root mean square error and robustness coefficient, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out. With respect to the outbreak data of COVID-19 epidemics in China, we find that before t… ▽ More

    Submitted 14 September, 2021; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: 25 pages, 5 figures, 2 tables

  41. arXiv:2002.06563  [pdf, other

    q-bio.PE

    Epidemic analysis of COVID-19 in China by dynamical modeling

    Authors: Liangrong Peng, Wuyue Yang, Dongyan Zhang, Changjing Zhuge, Liu Hong

    Abstract: The outbreak of novel coronavirus-caused pneumonia (COVID-19) in Wuhan has attracted worldwide attention. Here, we propose a generalized SEIR model to analyze this epidemic. Based on the public data of National Health Commission of China from Jan. 20th to Feb. 9th, 2020, we reliably estimate key epidemic parameters and make predictions on the inflection point and possible ending time for 5 differe… ▽ More

    Submitted 25 June, 2020; v1 submitted 16 February, 2020; originally announced February 2020.

    Comments: 11 pages, 6 figures, 1 table

  42. arXiv:2001.02894  [pdf, other

    stat.ML cs.LG q-bio.NC

    Supervised Hyperalignment for multi-subject fMRI data alignment

    Authors: Muhammad Yousefnezhad, Alessandro Selvitella, Liangxiu Han, Daoqiang Zhang

    Abstract: Hyperalignment has been widely employed in Multivariate Pattern (MVP) analysis to discover the cognitive states in the human brains based on multi-subject functional Magnetic Resonance Imaging (fMRI) datasets. Most of the existing HA methods utilized unsupervised approaches, where they only maximized the correlation between the voxels with the same position in the time series. However, these unsup… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: IEEE Transactions on Cognitive and Developmental Systems

  43. Preliminary Results on a New Algorithm for Blink Correction Adaptive to Inter- and Intra-Subject Variability

    Authors: E. Guttmann-Flury, X. Sheng, D. Zhang, X. Zhu

    Abstract: This paper presents a new preprocessing method to correct blinking artifacts in Electroencephalography (EEG) based Brain-Computer Interfaces (BCIs). This Algorithm for Blink Correction (ABC) directly corrects the signal in the time domain without the need for additional Electrooculogram (EOG) electrodes. The main idea is to automatically adapt to the blink's inter- and intra-subject variability by… ▽ More

    Submitted 31 October, 2019; originally announced October 2019.

  44. arXiv:1809.04429  [pdf

    q-bio.NC cs.LG stat.ML

    Gradient-based Representational Similarity Analysis with Searchlight for Analyzing fMRI Data

    Authors: Xiaoliang Sheng, Muhammad Yousefnezhad, Tonglin Xu, Ning Yuan, Daoqiang Zhang

    Abstract: Representational Similarity Analysis (RSA) aims to explore similarities between neural activities of different stimuli. Classical RSA techniques employ the inverse of the covariance matrix to explore a linear model between the neural activities and task events. However, calculating the inverse of a large-scale covariance matrix is time-consuming and can reduce the stability and robustness of the f… ▽ More

    Submitted 12 September, 2018; originally announced September 2018.

    Comments: Conference: Chinese Conference on Pattern Recognition and Computer Vision 2018 (PRCV18), 23-26/Nov, Guangzhou, China

  45. arXiv:1808.01642  [pdf, other

    stat.ML cs.CV cs.LG math.OC q-bio.NC

    Multi-Objective Cognitive Model: a supervised approach for multi-subject fMRI analysis

    Authors: Muhammad Yousefnezhad, Daoqiang Zhang

    Abstract: In order to decode the human brain, Multivariate Pattern (MVP) classification generates cognitive models by using functional Magnetic Resonance Imaging (fMRI) datasets. As a standard pipeline in the MVP analysis, brain patterns in multi-subject fMRI dataset must be mapped to a shared space and then a classification model is generated by employing the mapped patterns. However, the MVP models may no… ▽ More

    Submitted 5 August, 2018; originally announced August 2018.

    Comments: Neuroinformatics, Springer

  46. arXiv:1807.02612  [pdf

    cs.LG q-bio.NC stat.ML

    Gradient Hyperalignment for multi-subject fMRI data alignment

    Authors: Tonglin Xu, Muhammad Yousefnezhad, Daoqiang Zhang

    Abstract: Multi-subject fMRI data analysis is an interesting and challenging problem in human brain decoding studies. The inherent anatomical and functional variability across subjects make it necessary to do both anatomical and functional alignment before classification analysis. Besides, when it comes to big data, time complexity becomes a problem that cannot be ignored. This paper proposes Gradient Hyper… ▽ More

    Submitted 7 July, 2018; originally announced July 2018.

    Comments: 15th Pacific Rim International Conference on Artificial Intelligence (PRICAI 2018), Nanjing, China, August 28-31, 2018

  47. arXiv:1710.03923  [pdf, other

    q-bio.NC cs.CV stat.ML

    Deep Hyperalignment

    Authors: Muhammad Yousefnezhad, Daoqiang Zhang

    Abstract: This paper proposes Deep Hyperalignment (DHA) as a regularized, deep extension, scalable Hyperalignment (HA) method, which is well-suited for applying functional alignment to fMRI datasets with nonlinearity, high-dimensionality (broad ROI), and a large number of subjects. Unlink previous methods, DHA is not limited by a restricted fixed kernel function. Further, it uses a parametric approach, rank… ▽ More

    Submitted 11 October, 2017; originally announced October 2017.

    Comments: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA

  48. arXiv:1710.02113  [pdf, other

    stat.ML cs.CV q-bio.NC

    Anatomical Pattern Analysis for decoding visual stimuli in human brains

    Authors: Muhammad Yousefnezhad, Daoqiang Zhang

    Abstract: Background: A universal unanswered question in neuroscience and machine learning is whether computers can decode the patterns of the human brain. Multi-Voxels Pattern Analysis (MVPA) is a critical tool for addressing this question. However, there are two challenges in the previous MVPA methods, which include decreasing sparsity and noise in the extracted features and increasing the performance of… ▽ More

    Submitted 5 October, 2017; originally announced October 2017.

    Comments: Published in Cognitive Computation

  49. arXiv:1708.08991  [pdf

    q-bio.TO physics.bio-ph physics.med-ph physics.optics

    Targeted and Imaging-guided In Vivo Photodynamic Therapy of Tumors Using Dual-functional, Aggregation-induced Emission Nanoparticles

    Authors: Xianhe Sun, Abudureheman zebibula, Xiaobiao Dong, Gonghui Li, Guanxin Zhang, Deqing Zhang, Jun Qian, Sailing He

    Abstract: Dual-functional nanoparticles, with the property of aggregation-induced emission and the capability of reactive oxygen species, were used to achieve passive/active targeting of tumor. Good contrast in in vivo imaging and obvious therapeutic efficiency were realized with a low dose of AIE nanoparticles as well as a low power density of light, resulting in negligible side effects.

    Submitted 22 August, 2017; originally announced August 2017.

    Comments: 30 pages, 7 figures

  50. arXiv:1708.06578  [pdf, other

    cs.HC q-bio.NC

    Cascade and Parallel Convolutional Recurrent Neural Networks on EEG-based Intention Recognition for Brain Computer Interface

    Authors: Dalin Zhang, Lina Yao, Xiang Zhang, Sen Wang, Weitong Chen, Robert Boots

    Abstract: Brain-Computer Interface (BCI) is a system empowering humans to communicate with or control the outside world with exclusively brain intentions. Electroencephalography (EEG) based BCIs are promising solutions due to their convenient and portable instruments. Motor imagery EEG (MI-EEG) is a kind of most widely focused EEG signals, which reveals a subjects movement intentions without actual actions.… ▽ More

    Submitted 10 June, 2021; v1 submitted 22 August, 2017; originally announced August 2017.

    Comments: 8 pages, 5 figures