
Showing 1–26 of 26 results for author: Guan, J

Searching in archive q-bio.
  1. arXiv:2504.11911  [pdf, ps, other]

    nlin.AO q-bio.PE

    Higher-order evolutionary dynamics with game transitions

    Authors: Yi-Duo Chen, Zhi-Xi Wu, Jian-Yue Guan

    Abstract: Higher-order interactions are prevalent in real-world complex systems and exert unique influences on system evolution that cannot be captured by pairwise interactions. We incorporate game transitions into the higher-order prisoner's dilemma game model, where these transitions consistently promote cooperation. Moreover, in systems with game transitions, the fraction of higher-order interactions has…

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 10 pages, 9 figures

  2. arXiv:2504.05790  [pdf, other]

    q-bio.GN

    ViralQC: A Tool for Assessing Completeness and Contamination of Predicted Viral Contigs

    Authors: Cheng Peng, Jiayu Shang, Jiaojiao Guan, Yanni Sun

    Abstract: Motivation: Viruses represent the most abundant biological entities on the planet and play vital roles in diverse ecosystems. Cataloging viruses across various environments is essential for understanding their properties and functions. Metagenomic sequencing has emerged as the most comprehensive method for virus discovery, enabling the sequencing of all genetic materials, including viruses, from h…

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 16 pages, 9 figures

  3. arXiv:2503.03989  [pdf, other]

    q-bio.BM cs.LG

    Integrating Protein Dynamics into Structure-Based Drug Design via Full-Atom Stochastic Flows

    Authors: Xiangxin Zhou, Yi Xiao, Haowei Lin, Xinheng He, Jiaqi Guan, Yang Wang, Qiang Liu, Feng Zhou, Liang Wang, Jianzhu Ma

    Abstract: The dynamic nature of proteins, influenced by ligand interactions, is essential for comprehending protein function and progressing drug discovery. Traditional structure-based drug design (SBDD) approaches typically target binding sites with rigid structures, limiting their practical application in drug development. While molecular dynamics simulation can theoretically capture all the biologically…

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Accepted to ICLR 2025

  4. arXiv:2501.15472  [pdf, other]

    q-bio.GN

    GiantHunter: Accurate detection of giant virus in metagenomic data using reinforcement-learning and Monte Carlo tree search

    Authors: Fuchuan Qu, Cheng Peng, Jiaojiao Guan, Donglin Wang, Yanni Sun, Jiayu Shang

    Abstract: Motivation: Nucleocytoplasmic large DNA viruses (NCLDVs) are notable for their large genomes and extensive gene repertoires, which contribute to their widespread environmental presence and critical roles in processes such as host metabolic reprogramming and nutrient cycling. Metagenomic sequencing has emerged as a powerful tool for uncovering novel NCLDVs in environmental samples. However, identif…

    Submitted 26 January, 2025; originally announced January 2025.

    Comments: 15 pages, 7 figures

  5. arXiv:2501.15055  [pdf, other]

    q-bio.BM cs.AI

    Group Ligands Docking to Protein Pockets

    Authors: Jiaqi Guan, Jiahan Li, Xiangxin Zhou, Xingang Peng, Sheng Wang, Yunan Luo, Jian Peng, Jianzhu Ma

    Abstract: Molecular docking is a key task in computational biology that has attracted increasing interest from the machine learning community. While existing methods have achieved success, they generally treat each protein-ligand pair in isolation. Inspired by the biochemical observation that ligands binding to the same target protein tend to adopt similar poses, we propose GroupBind, a novel molec…

    Submitted 24 January, 2025; originally announced January 2025.

    Comments: 18 pages, published in ICLR 2025

  6. arXiv:2412.06847  [pdf, other]

    q-bio.QM cs.AI cs.LG

    M$^{3}$-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and Discovery

    Authors: Siyuan Guo, Lexuan Wang, Chang Jin, Jinxian Wang, Han Peng, Huayang Shi, Wengen Li, Jihong Guan, Shuigeng Zhou

    Abstract: This paper introduces M$^{3}$-20M, a large-scale Multi-Modal Molecule dataset that contains over 20 million molecules, with the data mainly being integrated from existing databases and partially generated by large language models. Designed to support AI-driven drug design and discovery, M$^{3}$-20M has 71 times more molecules than the largest existing dataset, providing an unpreced…

    Submitted 16 March, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

  7. arXiv:2411.18463  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension

    Authors: Jiahan Li, Tong Chen, Shitong Luo, Chaoran Cheng, Jiaqi Guan, Ruihan Guo, Sheng Wang, Ge Liu, Jian Peng, Jianzhu Ma

    Abstract: Peptides, short chains of amino acids, interact with target proteins, making them a unique class of protein-based therapeutics for treating human diseases. Recently, deep generative models have shown great promise in peptide generation. However, several challenges remain in designing effective peptide binders. First, not all residues contribute equally to peptide-target interactions. Second, the g…

    Submitted 20 May, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: Published as a conference paper at ICLR 2025

  8. arXiv:2410.20688  [pdf, other]

    cs.LG q-bio.BM

    Reprogramming Pretrained Target-Specific Diffusion Models for Dual-Target Drug Design

    Authors: Xiangxin Zhou, Jiaqi Guan, Yijia Zhang, Xingang Peng, Liang Wang, Jianzhu Ma

    Abstract: Dual-target therapeutic strategies have become a compelling approach and attracted significant attention due to various benefits, such as their potential in overcoming drug resistance in cancer therapy. Considering the tremendous success that deep generative models have achieved in structure-based drug design in recent years, we formulate dual-target drug design as a generative task and curate a n…

    Submitted 26 November, 2024; v1 submitted 27 October, 2024; originally announced October 2024.

    Comments: Accepted to NeurIPS 2024

  9. arXiv:2408.06402  [pdf, other]

    q-bio.QM cs.AI cs.LG

    PhaGO: Protein function annotation for bacteriophages by integrating the genomic context

    Authors: Jiaojiao Guan, Yongxin Ji, Cheng Peng, Wei Zou, Xubo Tang, Jiayu Shang, Yanni Sun

    Abstract: Bacteriophages are viruses that target bacteria, playing a crucial role in microbial ecology. Phage proteins are important in understanding phage biology, such as virus infection, replication, and evolution. Although a large number of new phages have been identified via metagenomic sequencing, many of them have limited protein function annotation. Accurate function annotation of phage proteins pre…

    Submitted 17 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 17 pages, 6 figures

  10. Coevolutionary game dynamics with localized environmental resource feedback

    Authors: Yi-Duo Chen, Jian-Yue Guan, Zhi-Xi Wu

    Abstract: Dynamic environments shape diverse dynamics in evolutionary game systems. We introduce spatial heterogeneity of resources into the prisoner's dilemma game model to explore coevolutionary game dynamics with environmental feedback. The availability of resources significantly affects the survival competitiveness of surrounding individuals. Feedback between individuals' strategies and the resources th…

    Submitted 14 February, 2025; v1 submitted 25 July, 2024; originally announced July 2024.

    Journal ref: Y.-D. Chen, J.-Y. Guan and Z.-X. Wu, Coevolutionary game dynamics with localized environmental resource feedback, Phys. Rev. E, 111, 024305 (2025)

  11. arXiv:2405.11735  [pdf, other]

    q-bio.GN

    Accurate and efficient protein embedding using multi-teacher distillation learning

    Authors: Jiayu Shang, Cheng Peng, Yongxin Ji, Jiaojiao Guan, Dehan Cai, Xubo Tang, Yanni Sun

    Abstract: Motivation: Protein embedding, which represents proteins as numerical vectors, is a crucial step in various learning-based protein annotation/classification problems, including gene ontology prediction, protein-protein interaction prediction, and protein structure prediction. However, existing protein embedding methods are often computationally expensive due to their large number of parameters, wh…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: 3 pages; 1 figure

  12. arXiv:2403.07902  [pdf, other]

    q-bio.BM cs.LG

    DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design

    Authors: Jiaqi Guan, Xiangxin Zhou, Yuwei Yang, Yu Bao, Jian Peng, Jianzhu Ma, Qiang Liu, Liang Wang, Quanquan Gu

    Abstract: Designing 3D ligands within a target binding site is a fundamental task in drug discovery. Existing structure-based drug design methods treat all ligand atoms equally, which ignores the different roles of atoms in the ligand for drug design and can be less efficient for exploring the large drug-like molecule space. In this paper, inspired by the convention in pharmaceutical practice, we decompose the…

    Submitted 26 February, 2024; originally announced March 2024.

    Comments: Accepted to ICML 2023

  13. arXiv:2312.16855  [pdf, other]

    cs.LG q-bio.BM

    Molecular Property Prediction Based on Graph Structure Learning

    Authors: Bangyi Zhao, Weixia Xu, Jihong Guan, Shuigeng Zhou

    Abstract: Molecular property prediction (MPP) is a fundamental but challenging task in the computer-aided drug discovery process. More and more recent works employ different graph-based models for MPP, which have made considerable progress in improving prediction performance. However, current models often ignore relationships between molecules, which could also be helpful for MPP. For this sake, in this pap…

    Submitted 28 December, 2023; originally announced December 2023.

  14. arXiv:2312.16600  [pdf, other]

    q-bio.GN cs.AI cs.LG

    scRNA-seq Data Clustering by Cluster-aware Iterative Contrastive Learning

    Authors: Weikang Jiang, Jinxian Wang, Jihong Guan, Shuigeng Zhou

    Abstract: Single-cell RNA sequencing (scRNA-seq) enables researchers to analyze gene expression at the single-cell level. One important task in scRNA-seq data analysis is unsupervised clustering, which helps identify distinct cell types, laying down the foundation for other downstream analysis tasks. In this paper, we propose a novel method called Cluster-aware Iterative Contrastive Learning (CICL for short) for…

    Submitted 27 December, 2023; originally announced December 2023.

  15. arXiv:2310.04463  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Diffusing on Two Levels and Optimizing for Multiple Properties: A Novel Approach to Generating Molecules with Desirable Properties

    Authors: Siyuan Guo, Jihong Guan, Shuigeng Zhou

    Abstract: In the past decade, Artificial Intelligence driven drug design and discovery has been a hot research topic, where an important branch is molecule generation by generative models, from GAN-based models and VAE-based models to the latest diffusion-based models. However, most existing models pursue only the basic properties like validity and uniqueness of the generated molecules; a few go further to…

    Submitted 5 October, 2023; originally announced October 2023.

  16. arXiv:2305.07508  [pdf, other]

    q-bio.BM cs.LG q-bio.QM

    MolDiff: Addressing the Atom-Bond Inconsistency Problem in 3D Molecule Diffusion Generation

    Authors: Xingang Peng, Jiaqi Guan, Qiang Liu, Jianzhu Ma

    Abstract: Deep generative models have recently achieved superior performance in 3D molecule generation. Most of them first generate atoms and then add chemical bonds based on the generated atoms in a post-processing manner. However, there might be no corresponding bond solution for the temporally generated atoms as their locations are generated without considering potential bonds. We define this problem as…

    Submitted 11 May, 2023; originally announced May 2023.

  17. arXiv:2303.06902  [pdf, other]

    q-bio.BM cs.LG

    Molecular Property Prediction by Semantic-invariant Contrastive Learning

    Authors: Ziqiao Zhang, Ailin Xie, Jihong Guan, Shuigeng Zhou

    Abstract: Contrastive learning has been widely used as a pretext task for self-supervised pre-trained molecular representation learning models in AI-aided drug design and discovery. However, existing methods that generate molecular views by noise-adding operations for contrastive learning may face the semantic inconsistency problem, which leads to false positive pairs and consequently poor prediction perform…

    Submitted 13 March, 2023; originally announced March 2023.

  18. arXiv:2303.05856  [pdf, other]

    nlin.AO q-bio.CB

    Intercellular competitive growth dynamics with microenvironmental feedback

    Authors: De-Ming Liu, Zhi-Xi Wu, Jian-Yue Guan

    Abstract: Normal life activities between cells rely crucially on the homeostasis of the cellular microenvironment, but aging and cancer will upset this balance. In this paper, we introduce the microenvironmental feedback mechanism to the growth dynamics of multicellular organisms, which changes the cellular competitive ability, and thereby regulates the growth of multicellular organisms. We show that the pr…

    Submitted 8 May, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  19. arXiv:2303.03543  [pdf, other]

    q-bio.BM cs.LG

    3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction

    Authors: Jiaqi Guan, Wesley Wei Qian, Xingang Peng, Yufeng Su, Jian Peng, Jianzhu Ma

    Abstract: Rich data and powerful machine learning models allow us to design drugs for a specific protein target in silico. Recently, the inclusion of 3D structures during targeted drug design shows superior performance to other target-free models as the atomic interaction in the 3D space is explicitly modeled. However, current 3D target-aware models either rely on the voxelized atom densities or th…

    Submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  20. arXiv:2205.07249  [pdf, other]

    cs.LG q-bio.BM

    Pocket2Mol: Efficient Molecular Sampling Based on 3D Protein Pockets

    Authors: Xingang Peng, Shitong Luo, Jiaqi Guan, Qi Xie, Jian Peng, Jianzhu Ma

    Abstract: Deep generative models have achieved tremendous success in designing novel drug molecules in recent years. A new thread of work has shown great potential in advancing the specificity and success rate of in silico drug design by considering the structure of protein pockets. This setting poses fundamental computational challenges in sampling new chemical compounds that could satisfy multiple g…

    Submitted 15 May, 2022; originally announced May 2022.

    Comments: ICML 2022 accepted

  21. arXiv:2203.10446  [pdf, other]

    q-bio.BM cs.LG

    A 3D Generative Model for Structure-Based Drug Design

    Authors: Shitong Luo, Jiaqi Guan, Jianzhu Ma, Jian Peng

    Abstract: We study a fundamental problem in structure-based drug design -- generating molecules that bind to specific protein binding sites. While we have witnessed the great success of deep generative models in drug design, the existing methods are mostly string-based or graph-based. They are limited by the lack of spatial information and thus unable to be applied to structure-based design tasks. Particula…

    Submitted 12 November, 2022; v1 submitted 19 March, 2022; originally announced March 2022.

    Comments: Accepted to NeurIPS 2021

  22. arXiv:2201.13299  [pdf, other]

    q-bio.BM cs.AI cs.LG

    Orientation-Aware Graph Neural Networks for Protein Structure Representation Learning

    Authors: Jiahan Li, Shitong Luo, Congyue Deng, Chaoran Cheng, Jiaqi Guan, Leonidas Guibas, Jian Peng, Jianzhu Ma

    Abstract: By folding into particular 3D structures, proteins play a key role in living beings. To learn meaningful representation from a protein structure for downstream tasks, not only the global backbone topology but the local fine-grained orientational relations between amino acids should also be considered. In this work, we propose the Orientation-Aware Graph Neural Networks (OAGNNs) to better sense the…

    Submitted 4 February, 2025; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: Accepted in RECOMB 2025

  23. arXiv:2111.10011  [pdf, other]

    physics.soc-ph q-bio.PE

    Game-environment feedback dynamics for voluntary prisoner's dilemma games

    Authors: Bin-Quan Li, Cong Liu, Zhi-Xi Wu, Jian-Yue Guan

    Abstract: Recently, eco-evolutionary game theory, which describes the coupled dynamics of strategies and environment, has attracted great attention. At the same time, most of the current work is focused on the classic two-player two-strategy game. In this work, we study multi-strategy eco-evolutionary game theory, which is an extension of the framework. For simplicity, we'll focus on the voluntary partici…

    Submitted 18 November, 2021; originally announced November 2021.

  24. arXiv:1312.0691  [pdf, other]

    physics.soc-ph q-bio.PE

    Behavior of susceptible-vaccinated--infected--recovered epidemics with diversity in the infection rate of the individuals

    Authors: Chao-Ran Cai, Zhi-Xi Wu, Jian-Yue Guan

    Abstract: We study a susceptible-vaccinated--infected--recovered (SVIR) epidemic-spreading model with diversity of infection rate of the individuals. By means of analytical arguments as well as extensive computer simulations, we demonstrate that the heterogeneity in infection rate can either impede or accelerate the epidemic spreading, depending on the number of vaccinated individuals introduced in the…

    Submitted 3 December, 2013; v1 submitted 2 December, 2013; originally announced December 2013.

    Comments: 9 pages, many figures

    Journal ref: Phys. Rev. E 88, 062805 (2013)

  25. arXiv:1306.0505  [pdf]

    physics.data-an physics.bio-ph q-bio.QM

    Diagnosing Heterogeneous Dynamics in Single Molecule/Particle Trajectories with Multiscale Wavelets

    Authors: Kejia Chen, Bo Wang, Juan Guan, Steve Granick

    Abstract: We describe a simple automated method to extract and quantify transient heterogeneous dynamical changes from large datasets generated in single molecule/particle tracking experiments. Based on wavelet transform, the method transforms raw data to locally match dynamics of interest. This is accomplished using statistically adaptive universal thresholding, whose advantage is to avoid a single arbitra…

    Submitted 3 June, 2013; originally announced June 2013.

  26. arXiv:0903.0924  [pdf, ps, other]

    q-bio.PE physics.soc-ph

    Epidemic spreading with nonlinear infectivity in weighted scale-free networks

    Authors: Xiangwei Chu, Zhongzhi Zhang, Jihong Guan, Shuigeng Zhou

    Abstract: In this paper, we investigate epidemic spreading for the SIR model in weighted scale-free networks with nonlinear infectivity, where the transmission rate in our analytical model is weighted. Concretely, we introduce the infectivity exponent $α$ and the weight exponent $β$ into the analytical SIR model, then examine the combined effects of $α$ and $β$ on the epidemic threshold and phase trans…

    Submitted 5 March, 2009; originally announced March 2009.

    Comments: 17 pages, 12 figures