Skip to main content

Showing 1–50 of 102 results for author: Wu, J

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2507.02231  [pdf

    q-bio.BM

    Downregulation of aquaporin 3 promotes hyperosmolarity-induced apoptosis of nucleus pulposus cells through PI3K/Akt/mTOR pathway suppression

    Authors: Yuan Sang, Huiqing Zhao, Jiajun Wu, Ting Zhang, Wenbin Xu, Hui Yao, Kaihua Liu, Chang Liu, Junbin Zhang, Ping Li, Depeng Wu, Yichun Xu, Jianying Zhang, Gang Hou

    Abstract: Hyperosmolarity is a key contributor to nucleus pulposus cell (NPC) apoptosis during intervertebral disc degeneration (IVDD). Aquaporin 3 (AQP3), a membrane channel protein, regulates cellular osmotic balance by transporting water and osmolytes. Although AQP3 downregulation is associated with disc degeneration, its role in apoptosis under hyperosmotic conditions remains unclear. Here, we demonstra… ▽ More

    Submitted 2 July, 2025; originally announced July 2025.

  2. arXiv:2506.23075  [pdf, ps, other

    cs.HC cs.LG eess.SP q-bio.NC

    CSBrain: A Cross-scale Spatiotemporal Brain Foundation Model for EEG Decoding

    Authors: Yuchen Zhou, Jiamin Wu, Zichen Ren, Zhouheng Yao, Weiheng Lu, Kunyu Peng, Qihao Zheng, Chunfeng Song, Wanli Ouyang, Chao Gou

    Abstract: Understanding and decoding brain activity from electroencephalography (EEG) signals is a fundamental challenge in neuroscience and AI, with applications in cognition, emotion recognition, diagnosis, and brain-computer interfaces. While recent EEG foundation models advance generalized decoding via unified architectures and large-scale pretraining, they adopt a scale-agnostic dense modeling paradigm… ▽ More

    Submitted 28 June, 2025; originally announced June 2025.

  3. arXiv:2506.17310  [pdf, ps, other

    q-bio.NC cs.CL cs.NE

    PaceLLM: Brain-Inspired Large Language Models for Long-Context Understanding

    Authors: Kangcong Li, Peng Ye, Chongjun Tu, Lin Zhang, Chunfeng Song, Jiamin Wu, Tao Yang, Qihao Zheng, Tao Chen

    Abstract: While Large Language Models (LLMs) demonstrate strong performance across domains, their long-context capabilities are limited by transient neural activations causing information decay and unstructured feed-forward network (FFN) weights leading to semantic fragmentation. Inspired by the brain's working memory and cortical modularity, we propose PaceLLM, featuring two innovations: (1) a Persistent A… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

  4. arXiv:2506.07553  [pdf, ps, other

    cs.AI q-bio.QM

    GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

    Authors: Jingchao Wang, Haote Yang, Jiang Wu, Yifan He, Xingjian Wei, Yinfan Wang, Chengjin Liu, Lingli Ge, Lijun Wu, Bin Wang, Dahua Lin, Conghui He

    Abstract: Optical Chemical Structure Recognition (OCSR) is crucial for digitizing chemical knowledge by converting molecular images into machine-readable formats. While recent vision-language models (VLMs) have shown potential in this task, their image-captioning approach often struggles with complex molecular structures and inconsistent annotations. To overcome these challenges, we introduce GTR-Mol-VLM, a… ▽ More

    Submitted 9 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  5. arXiv:2506.05443  [pdf

    cs.LG cs.AI q-bio.GN

    UniPTMs: The First Unified Multi-type PTM Site Prediction Model via Master-Slave Architecture-Based Multi-Stage Fusion Strategy and Hierarchical Contrastive Loss

    Authors: Yiyu Lin, Yan Wang, You Zhou, Xinye Ni, Jiahui Wu, Sen Yang

    Abstract: As a core mechanism of epigenetic regulation in eukaryotes, protein post-translational modifications (PTMs) require precise prediction to decipher dynamic life activity networks. To address the limitations of existing deep learning models in cross-modal feature fusion, domain generalization, and architectural optimization, this study proposes UniPTMs: the first unified framework for multi-type PTM… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  6. arXiv:2505.15453  [pdf, other

    q-bio.NC

    A dynamical memory with only one spiking neuron

    Authors: Damien Depannemaecker, Adrien d'Hollande, Jiaming Wu, Marcelo J. Rozenberg

    Abstract: Common wisdom indicates that to implement a Dynamical Memory with spiking neurons two ingredients are necessary: recurrence and a neuron population. Here we shall show that the second requirement is not needed. We shall demonstrate that under very general assumptions a single recursive spiking neuron can realize a robust model of a dynamical memory. We demonstrate the implementation of a dynamical… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 17 pages, 9 figures

  7. arXiv:2505.08581  [pdf, other

    cs.CV eess.IV q-bio.TO

    ReSurgSAM2: Referring Segment Anything in Surgical Video via Credible Long-term Tracking

    Authors: Haofeng Liu, Mingqi Gao, Xuxiao Luo, Ziyue Wang, Guanyi Qin, Junde Wu, Yueming Jin

    Abstract: Surgical scene segmentation is critical in computer-assisted surgery and is vital for enhancing surgical quality and patient outcomes. Recently, referring surgical segmentation is emerging, given its advantage of providing surgeons with an interactive experience to segment the target object. However, existing methods are limited by low efficiency and short-term tracking, hindering their applicabil… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: Early accepted by MICCAI 2025

  8. arXiv:2505.03121  [pdf

    q-bio.BM

    AutoLoop: a novel autoregressive deep learning method for protein loop prediction with high accuracy

    Authors: Tianyue Wang, Xujun Zhang, Langcheng Wang, Odin Zhang, Jike Wang, Ercheng Wang, Jialu Wu, Renling Hu, Jingxuan Ge, Shimeng Li, Qun Su, Jiajun Yu, Chang-Yu Hsieh, Tingjun Hou, Yu Kang

    Abstract: Protein structure prediction is a critical and longstanding challenge in biology, garnering widespread interest due to its significance in understanding biological processes. A particular area of focus is the prediction of missing loops in proteins, which are vital in determining protein function and activity. To address this challenge, we propose AutoLoop, a novel computational model designed to… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 34 pages, 7 figures

  9. arXiv:2504.10983  [pdf, other

    cs.LG cs.AI q-bio.BM

    ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings

    Authors: Zitai Kong, Yiheng Zhu, Yinlong Xu, Hanjing Zhou, Mingzhe Yin, Jialu Wu, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jian Wu

    Abstract: The design of protein sequences with desired functionalities is a fundamental task in protein engineering. Deep generative methods, such as autoregressive models and diffusion models, have greatly accelerated the discovery of novel protein sequences. However, these methods mainly focus on local or shallow residual semantics and suffer from low inference efficiency, large modeling space and high tr… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  10. arXiv:2504.10525  [pdf

    q-bio.QM cs.CL cs.IR

    BioChemInsight: An Open-Source Toolkit for Automated Identification and Recognition of Optical Chemical Structures and Activity Data in Scientific Publications

    Authors: Zhe Wang, Fangtian Fu, Wei Zhang, Lige Yan, Yan Meng, Jianping Wu, Hui Wu, Gang Xu, Si Chen

    Abstract: Automated extraction of chemical structures and their bioactivity data is crucial for accelerating drug discovery and enabling data-driven pharmaceutical research. Existing optical chemical structure recognition (OCSR) tools fail to autonomously associate molecular structures with their bioactivity profiles, creating a critical bottleneck in structure-activity relationship (SAR) analysis. Here, we… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

    Comments: 20 pages, 7 figures

  11. arXiv:2503.17738  [pdf

    q-bio.CB

    Tumor-associated CD19$^+$ macrophages induce immunosuppressive microenvironment in hepatocellular carcinoma

    Authors: Junli Wang, Wanyue Cao, Jinyan Huang, Yu Zhou, Rujia Zheng, Yu Lou, Jiaqi Yang, Jianghui Tang, Mao Ye, Zhengtao Hong, Jiangchao Wu, Haonan Ding, Yuquan Zhang, Jianpeng Sheng, Xinjiang Lu, Pinglong Xu, Xiongbin Lu, Xueli Bai, Tingbo Liang, Qi Zhang

    Abstract: Tumor-associated macrophages are a key component that contributes to the immunosuppressive microenvironment in human cancers. However, therapeutic targeting of macrophages has been a challenge in clinic due to the limited understanding of their heterogeneous subpopulations and distinct functions. Here, we identify a unique and clinically relevant CD19$^+$ subpopulation of macrophages that is enric… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 7 figures

  12. arXiv:2503.04362  [pdf, other

    cs.LG cs.AI q-bio.BM

    A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery

    Authors: Yiheng Zhu, Mingyang Li, Junlong Liu, Kun Fu, Jiansheng Wu, Qiuyi Li, Mingze Yin, Jieping Ye, Jian Wu, Zheng Wang

    Abstract: Structure-based drug discovery (SBDD) is a systematic scientific process that develops new drugs by leveraging the detailed physical structure of the target protein. Recent advancements in pre-trained models for biomolecules have demonstrated remarkable success across various biochemical applications, including drug discovery and protein engineering. However, in most approaches, the pre-trained mo… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  13. arXiv:2503.03783  [pdf, other

    q-bio.TO cs.AI cs.ET cs.HC cs.LG

    Passive Heart Rate Monitoring During Smartphone Use in Everyday Life

    Authors: Shun Liao, Paolo Di Achille, Jiang Wu, Silviu Borac, Jonathan Wang, Xin Liu, Eric Teasley, Lawrence Cai, Yuzhe Yang, Yun Liu, Daniel McDuff, Hao-Wei Su, Brent Winslow, Anupam Pathak, Shwetak Patel, James A. Taylor, Jameson K. Rogers, Ming-Zher Poh

    Abstract: Resting heart rate (RHR) is an important biomarker of cardiovascular health and mortality, but tracking it longitudinally generally requires a wearable device, limiting its availability. We present PHRM, a deep learning system for passive heart rate (HR) and RHR measurements during everyday smartphone use, using facial video-based photoplethysmography. Our system was developed using 225,773 videos… ▽ More

    Submitted 21 March, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: Updated author list

  14. arXiv:2502.19391  [pdf, other

    q-bio.BM cs.LG

    Towards More Accurate Full-Atom Antibody Co-Design

    Authors: Jiayang Wu, Xingyi Zhang, Xiangyu Dong, Kun Xie, Ziqi Liu, Wensheng Gan, Sibo Wang, Le Song

    Abstract: Antibody co-design represents a critical frontier in drug development, where accurate prediction of both 1D sequence and 3D structure of complementarity-determining regions (CDRs) is essential for targeting specific epitopes. Despite recent advances in equivariant graph neural networks for antibody design, current approaches often fall short in capturing the intricate interactions that govern anti… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  15. arXiv:2502.08975  [pdf, other

    cs.LG q-bio.BM

    Graph-structured Small Molecule Drug Discovery Through Deep Learning: Progress, Challenges, and Opportunities

    Authors: Kun Li, Yida Xiong, Hongzhi Zhang, Xiantao Cai, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Due to their excellent drug-like and pharmacokinetic properties, small molecule drugs are widely used to treat various diseases, making them a critical component of drug discovery. In recent years, with the rapid development of deep learning (DL) techniques, DL-based small molecule drug discovery methods have achieved excellent performance in prediction accuracy, speed, and complex molecular relat… ▽ More

    Submitted 14 May, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: 10 pages, 1 figures, 8 tables

  16. arXiv:2502.07297  [pdf, other

    cs.LG q-bio.QM

    Generation of Drug-Induced Cardiac Reactions towards Virtual Clinical Trials

    Authors: Qian Shao, Bang Du, Zepeng Li, Qiyuan Chen, Hongxia Xu, Jimeng Sun, Jian Wu, Jintai Chen

    Abstract: Clinical trials remain critical in cardiac drug development but face high failure rates due to efficacy limitations and safety risks, incurring substantial costs. In-silico trial methodologies, particularly generative models simulating drug-induced electrocardiogram (ECG) alterations, offer a potential solution to mitigate these challenges. While existing models show progress in ECG synthesis, the… ▽ More

    Submitted 18 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: Under review

  17. arXiv:2502.00934  [pdf

    q-bio.PE q-bio.QM

    Optimizing Global Genomic Surveillance for Early Detection of Emerging SARS-CoV-2 Variants

    Authors: Haogao Gu, Jifan Li, Wanying Sun, Mengting Li, Kathy Leung, Joseph T. Wu, Hsiang-Yu Yuan, Maggie H. Wang, Bingyi Yang, Matthew R. McKay, Ning Ning, Leo L. M. Poon

    Abstract: Background: Global viral threats underscore the need for effective genomic surveillance, but high costs and uneven resource distribution hamper its implementation. Targeting surveillance to international travelers in major travel hubs may offer a more efficient strategy for the early detection of SARS-CoV-2 variants. Methods: We developed and calibrated a multiple-strain metapopulation model of… ▽ More

    Submitted 13 February, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

  18. arXiv:2501.15799  [pdf, other

    q-bio.BM cs.LG cs.NE

    Can Molecular Evolution Mechanism Enhance Molecular Representation?

    Authors: Kun Li, Longtao Hu, Xiantao Cai, Jia Wu, Wenbin Hu

    Abstract: Molecular evolution is the process of simulating the natural evolution of molecules in chemical space to explore potential molecular structures and properties. The relationships between similar molecules are often described through transformations such as adding, deleting, and modifying atoms and chemical bonds, reflecting specific evolutionary paths. Existing molecular representation methods main… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

    Comments: 9 pages, 6 figures, 5 tables

  19. arXiv:2501.05099  [pdf, other

    physics.soc-ph q-bio.NC q-bio.QM

    Recovery of activation propagation and self-sustained oscillation abilities in stroke brain networks

    Authors: Yingpeng Liu, Jiao Wu, Kesheng Xu, Muhua Zheng

    Abstract: Healthy brain networks usually show highly efficient information communication and self-sustained oscillation abilities. However, how the brain network structure affects these dynamics after an injury (stroke) is not very clear. The recovery of structure and dynamics of stroke brain networks over time is still not known precisely. Based on the analysis of a large number of strokes' brain network d… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 20 pages, 13 figures

  20. arXiv:2412.20014  [pdf, other

    cs.LG cs.AI q-bio.BM

    ProtCLIP: Function-Informed Protein Multi-Modal Learning

    Authors: Hanjing Zhou, Mingze Yin, Wei Wu, Mingyang Li, Kun Fu, Jintai Chen, Jian Wu, Zheng Wang

    Abstract: Multi-modality pre-training paradigm that aligns protein sequences and biological descriptions has learned general protein representations and achieved promising performance in various downstream applications. However, these works were still unable to replicate the extraordinary success of language-supervised visual foundation models due to the ineffective usage of aligned protein-text paired data… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

    Journal ref: AAAI 2025

  21. arXiv:2411.17331  [pdf, other

    math.GT q-bio.BM

    Multiscale Jones Polynomial and Persistent Jones Polynomial for Knot Data Analysis

    Authors: Ruzhi Song, Fengling Li, Jie Wu, Fengchun Lei, Guo-Wei Wei

    Abstract: Many structures in science, engineering, and art can be viewed as curves in 3-space. The entanglement of these curves plays a crucial role in determining the functionality and physical properties of materials. Many concepts in knot theory provide theoretical tools to explore the complexity and entanglement of curves in 3-space. However, classical knot theory primarily focuses on global topological… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: 27 pages, 9 figures

    MSC Class: 57K14; 92C10

  22. arXiv:2411.15215  [pdf, other

    cs.LG cs.AI q-bio.BM

    S$^2$ALM: Sequence-Structure Pre-trained Large Language Model for Comprehensive Antibody Representation Learning

    Authors: Mingze Yin, Hanjing Zhou, Jialu Wu, Yiheng Zhu, Yuxuan Zhan, Zitai Kong, Hongxia Xu, Chang-Yu Hsieh, Jintai Chen, Tingjun Hou, Jian Wu

    Abstract: Antibodies safeguard our health through their precise and potent binding to specific antigens, demonstrating promising therapeutic efficacy in the treatment of numerous diseases, including COVID-19. Recent advancements in biomedical language models have shown the great potential to interpret complex biological structures and functions. However, existing antibody specific models have a notable limi… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  23. arXiv:2411.02120  [pdf, other

    cs.LG cs.AI q-bio.BM

    Bridge-IF: Learning Inverse Protein Folding with Markov Bridges

    Authors: Yiheng Zhu, Jialu Wu, Qiuyi Li, Jiahuan Yan, Mingze Yin, Wei Wu, Mingyang Li, Jieping Ye, Zheng Wang, Jian Wu

    Abstract: Inverse protein folding is a fundamental task in computational protein design, which aims to design protein sequences that fold into the desired backbone structures. While the development of machine learning algorithms for this task has seen significant success, the prevailing approaches, which predominantly employ a discriminative formulation, frequently encounter the error accumulation issue and… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  24. arXiv:2410.21069  [pdf

    cs.LG cs.AI q-bio.BM

    EMOCPD: Efficient Attention-based Models for Computational Protein Design Using Amino Acid Microenvironment

    Authors: Xiaoqi Ling, Cheng Cai, Demin Kong, Zhisheng Wei, Jing Wu, Lei Wang, Zhaohong Deng

    Abstract: Computational protein design (CPD) refers to the use of computational methods to design proteins. Traditional methods relying on energy functions and heuristic algorithms for sequence design are inefficient and do not meet the demands of the big data era in biomolecules, with their accuracy limited by the energy functions and search algorithms. Existing deep learning methods are constrained by the… ▽ More

    Submitted 29 October, 2024; v1 submitted 28 October, 2024; originally announced October 2024.

  25. arXiv:2410.06232  [pdf, other

    q-bio.NC cs.AI cs.LG cs.NE

    Range, not Independence, Drives Modularity in Biologically Inspired Representations

    Authors: Will Dorrell, Kyle Hsu, Luke Hollingsworth, Jin Hwa Lee, Jiajun Wu, Chelsea Finn, Peter E Latham, Tim EJ Behrens, James CR Whittington

    Abstract: Why do biological and artificial neurons sometimes modularise, each encoding a single meaningful variable, and sometimes entangle their representation of many variables? In this work, we develop a theory of when biologically inspired networks -- those that are nonnegative and energy efficient -- modularise their representation of source variables (sources). We derive necessary and sufficient condi… ▽ More

    Submitted 11 April, 2025; v1 submitted 8 October, 2024; originally announced October 2024.

    Comments: 37 pages, 12 figures. WD and KH contributed equally; LH and JHL contributed equally

  26. arXiv:2410.04815  [pdf, other

    q-bio.PE cs.AI

    A Review of BioTree Construction in the Context of Information Fusion: Priors, Methods, Applications and Trends

    Authors: Zelin Zang, Yongjie Xu, Chenrui Duan, Yue Yuan, Jinlin Wu, Zhen Lei, Stan Z. Li

    Abstract: Biological tree (BioTree) analysis is a foundational tool in biology, enabling the exploration of evolutionary and differentiation relationships among organisms, genes, and cells. Traditional tree construction methods, while instrumental in early research, face significant challenges in handling the growing complexity and scale of modern biological data, particularly in integrating multimodal data… ▽ More

    Submitted 15 February, 2025; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: 115 pages, 15 figures

  27. arXiv:2410.03803  [pdf, other

    cs.LG cs.AI physics.chem-ph q-bio.BM

    Text-guided Diffusion Model for 3D Molecule Generation

    Authors: Yanchen Luo, Junfeng Fang, Sihang Li, Zhiyuan Liu, Jiancan Wu, An Zhang, Wenjie Du, Xiang Wang

    Abstract: The de novo generation of molecules with targeted properties is crucial in biology, chemistry, and drug discovery. Current generative models are limited to using single property values as conditions, struggling with complex customizations described in detailed human language. To address this, we propose the text guidance instead, and introduce TextSMOG, a new Text-guided Small Molecule Generation… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

  28. arXiv:2409.11174  [pdf, other

    q-bio.NC cs.AI

    Identifying Influential nodes in Brain Networks via Self-Supervised Graph-Transformer

    Authors: Yanqing Kang, Di Zhu, Haiyang Zhang, Enze Shi, Sigang Yu, Jinru Wu, Xuhui Wang, Xuan Liu, Geng Chen, Xi Jiang, Tuo Zhang, Shu Zhang

    Abstract: Studying influential nodes (I-nodes) in brain networks is of great significance in the field of brain imaging. Most existing studies consider brain connectivity hubs as I-nodes. However, this approach relies heavily on prior knowledge from graph theory, which may overlook the intrinsic characteristics of the brain network, especially when its architecture is not fully understood. In contrast, self… ▽ More

    Submitted 17 September, 2024; originally announced September 2024.

  29. arXiv:2408.15999  [pdf

    q-bio.QM cs.LG

    Q-MRS: A Deep Learning Framework for Quantitative Magnetic Resonance Spectra Analysis

    Authors: Christopher J. Wu, Lawrence S. Kegeles, Jia Guo

    Abstract: Magnetic resonance spectroscopy (MRS) is an established technique for studying tissue metabolism, particularly in central nervous system disorders. While powerful and versatile, MRS is often limited by challenges associated with data quality, processing, and quantification. Existing MRS quantification methods face difficulties in balancing model complexity and reproducibility during spectral model… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 8 pages, 4 figures, and 3 tables for the main body; 9 pages, 4 figures, and 3 tables for the supplementary material

  30. arXiv:2408.11356  [pdf, other

    cs.AI cs.LG q-bio.BM

    One-step Structure Prediction and Screening for Protein-Ligand Complexes using Multi-Task Geometric Deep Learning

    Authors: Kelei He, Tiejun Dong, Jinhui Wu, Junfeng Zhang

    Abstract: Understanding the structure of the protein-ligand complex is crucial to drug development. Existing virtual structure measurement and screening methods are dominated by docking and its derived methods combined with deep learning. However, the sampling and scoring methodology have largely restricted the accuracy and efficiency. Here, we show that these two fundamental tasks can be accurately tackled… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  31. arXiv:2408.09106  [pdf, other

    q-bio.BM cs.AI

    Fragment-Masked Diffusion for Molecular Optimization

    Authors: Kun Li, Xiantao Cai, Jia Wu, Shirui Pan, Huiting Xu, Bo Du, Wenbin Hu

    Abstract: Molecular optimization is a crucial aspect of drug discovery, aimed at refining molecular structures to enhance drug efficacy and minimize side effects, ultimately accelerating the overall drug development process. Many molecular optimization methods have been proposed, significantly advancing drug discovery. These methods primarily on understanding the specific drug target structures or their hyp… ▽ More

    Submitted 14 May, 2025; v1 submitted 17 August, 2024; originally announced August 2024.

    Comments: 12 pages, 9 figures, 4 tables

  32. arXiv:2407.04055  [pdf, other

    q-bio.QM cs.AI cs.LG

    Benchmark on Drug Target Interaction Modeling from a Structure Perspective

    Authors: Xinnan Zhang, Jialin Wu, Junyi Xie, Tianlong Chen, Kaixiong Zhou

    Abstract: The prediction modeling of drug-target interactions is crucial to drug discovery and design, which has seen rapid advancements owing to deep learning technologies. Recently developed methods, such as those based on graph neural networks (GNNs) and Transformers, demonstrate exceptional performance across various datasets by effectively extracting structural information. However, the benchmarking of… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Submitted to NIPS 2024 Dataset and Benchmark

  33. arXiv:2405.14545  [pdf, other

    q-bio.BM cs.LG

    A Cross-Field Fusion Strategy for Drug-Target Interaction Prediction

    Authors: Hongzhi Zhang, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug-target interaction (DTI) prediction is a critical component of the drug discovery process. In the drug development engineering field, predicting novel drug-target interactions is extremely crucial.However, although existing methods have achieved high accuracy levels in predicting known drugs and drug targets, they fail to utilize global protein information during DTI prediction. This leads to… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  34. arXiv:2405.14536  [pdf, other

    q-bio.MN cs.AI cs.LG

    Regressor-free Molecule Generation to Support Drug Response Prediction

    Authors: Kun Li, Xiuwen Gong, Shirui Pan, Jia Wu, Bo Du, Wenbin Hu

    Abstract: Drug response prediction (DRP) is a crucial phase in drug discovery, and the most important metric for its evaluation is the IC50 score. DRP results are heavily dependent on the quality of the generated molecules. Existing molecule generation methods typically employ classifier-based guidance, enabling sampling within the IC50 classification range. However, these methods fail to ensure the samplin… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 22 pages, 7 figures, 9 tables,

  35. arXiv:2404.16357  [pdf, other

    q-bio.NC eess.SY

    Reverse engineering the brain input: Network control theory to identify cognitive task-related control nodes

    Authors: Zhichao Liang, Yinuo Zhang, Jushen Wu, Quanying Liu

    Abstract: The human brain receives complex inputs when performing cognitive tasks, which range from external inputs via the senses to internal inputs from other brain regions. However, the explicit inputs to the brain during a cognitive task remain unclear. Here, we present an input identification framework for reverse engineering the control nodes and the corresponding inputs to the brain. The framework is… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  36. arXiv:2403.03089  [pdf, other

    q-bio.QM cs.AI cs.LG

    VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism

    Authors: Jiawei Wu, Mingyuan Yan, Dianbo Liu

    Abstract: The pursuit of optimizing cancer therapies is significantly advanced by the accurate prediction of drug synergy. Traditional methods, such as clinical trials, are reliable yet encumbered by extensive time and financial demands. The emergence of high-throughput screening and computational innovations has heralded a shift towards more efficient methodologies for exploring drug interactions. In this… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  37. arXiv:2402.17997  [pdf

    q-bio.BM

    StaPep: an open-source tool for the structure prediction and feature extraction of hydrocarbon-stapled peptides

    Authors: Zhe Wang, Jianping Wu, Mengjun Zheng, Chenchen Geng, Borui Zhen, Wei Zhang, Hui Wu, Zhengyang Xu, Gang Xu, Si Chen, Xiang Li

    Abstract: Many tools exist for extracting structural and physiochemical descriptors from linear peptides to predict their properties, but similar tools for hydrocarbon-stapled peptides are lacking.Here, we present StaPep, a Python-based toolkit designed for generating 2D/3D structures and calculating 21 distinct features for hydrocarbon-stapled peptides.The current version supports hydrocarbon-stapled pepti… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: 26 pages, 6 figures

  38. arXiv:2402.10516  [pdf, other

    q-bio.BM cs.AI cs.LG

    Generative AI for Controllable Protein Sequence Design: A Survey

    Authors: Yiheng Zhu, Zitai Kong, Jialu Wu, Weize Liu, Yuqiang Han, Mingze Yin, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou

    Abstract: The design of novel protein sequences with targeted functionalities underpins a central theme in protein engineering, impacting diverse fields such as drug discovery and enzymatic engineering. However, navigating this vast combinatorial search space remains a severe challenge due to time and financial constraints. This scenario is rapidly evolving as the transformative advancements in AI, particul… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

    Comments: 9 pages

  39. arXiv:2402.02164  [pdf

    cs.AI q-bio.BM

    Hierarchical Structure Enhances the Convergence and Generalizability of Linear Molecular Representation

    Authors: Juan-Ni Wu, Tong Wang, Li-Juan Tang, Hai-Long Wu, Ru-Qin Yu

    Abstract: Language models demonstrate fundamental abilities in syntax, semantics, and reasoning, though their performance often depends significantly on the inputs they process. This study introduces TSIS (Simplified TSID) and its variants:TSISD (TSIS with Depth-First Search), TSISO (TSIS in Order), and TSISR (TSIS in Random), as integral components of the t-SMILES framework. These additions complete the fr… ▽ More

    Submitted 18 November, 2024; v1 submitted 3 February, 2024; originally announced February 2024.

    Comments: 26pages, 6 figures

  40. arXiv:2401.09500  [pdf, other

    q-bio.NC cs.LG cs.NE

    MorphGrower: A Synchronized Layer-by-layer Growing Approach for Plausible Neuronal Morphology Generation

    Authors: Nianzu Yang, Kaipeng Zeng, Haotian Lu, Yexin Wu, Zexin Yuan, Danni Chen, Shengdian Jiang, Jiaxiang Wu, Yimin Wang, Junchi Yan

    Abstract: Neuronal morphology is essential for studying brain functioning and understanding neurodegenerative disorders. As acquiring real-world morphology data is expensive, computational approaches for morphology generation have been studied. Traditional methods heavily rely on expert-set rules and parameter tuning, making it difficult to generalize across different types of morphologies. Recently, MorphV… ▽ More

    Submitted 27 May, 2024; v1 submitted 17 January, 2024; originally announced January 2024.

  41. arXiv:2311.12834  [pdf, other

    math.GT q-bio.BM

    Knot data analysis using multiscale Gauss link integral

    Authors: Li Shen, Hongsong Feng, Fengling Li, Fengchun Lei, Jie Wu, Guo-Wei Wei

    Abstract: In the past decade, topological data analysis (TDA) has emerged as a powerful approach in data science. The main technique in TDA is persistent homology, which tracks topological invariants over the filtration of point cloud data using algebraic topology. Although knot theory and related subjects are a focus of study in mathematics, their success in practical applications is quite limited due to t… ▽ More

    Submitted 2 October, 2023; originally announced November 2023.

  42. arXiv:2311.02798  [pdf, other

    cs.LG physics.chem-ph q-bio.QM

    Multi-channel learning for integrating structural hierarchies into context-dependent molecular representation

    Authors: Yue Wan, Jialu Wu, Tingjun Hou, Chang-Yu Hsieh, Xiaowei Jia

    Abstract: Reliable molecular property prediction is essential for various scientific endeavors and industrial applications, such as drug discovery. However, the data scarcity, combined with the highly non-linear causal relationships between physicochemical and biological properties and conventional molecular featurization schemes, complicates the development of robust molecular machine learning models. Self… ▽ More

    Submitted 12 January, 2025; v1 submitted 5 November, 2023; originally announced November 2023.

    Journal ref: Nat Commun 16, 413 (2025)

  43. arXiv:2310.08735  [pdf, other

    q-bio.PE cond-mat.stat-mech physics.bio-ph

    Noise driven phase transitions in eco-evolutionary systems

    Authors: Jim Wu, David J. Schwab, Trevor GrandPre

    Abstract: In complex ecosystems such as microbial communities, there is constant ecological and evolutionary feedback between the residing species and the environment occurring on concurrent timescales. Species respond and adapt to their surroundings by modifying their phenotypic traits, which in turn alters their environment and the resources available. To study this interplay between ecological and evolut… ▽ More

    Submitted 16 October, 2023; v1 submitted 12 October, 2023; originally announced October 2023.

  44. arXiv:2309.16684  [pdf, other

    q-bio.BM cs.LG physics.chem-ph

    Leveraging Side Information for Ligand Conformation Generation using Diffusion-Based Approaches

    Authors: Jiamin Wu, He Cao, Yuan Yao

    Abstract: Ligand molecule conformation generation is a critical challenge in drug discovery. Deep learning models have been developed to tackle this problem, particularly through the use of generative models in recent years. However, these models often generate conformations that lack meaningful structure and randomness due to the absence of essential side information. Examples of such side information incl… ▽ More

    Submitted 2 August, 2023; originally announced September 2023.

  45. arXiv:2305.05086  [pdf

    q-bio.PE cs.RO

    Mechanical Evidence for the Phylogenetic Origin of the Red Panda's False Thumb as an Adaptation to Arboreal Locomotion

    Authors: Braden Barnett, Yiqi Lyu, Kyle Pichney, Brian Sun, Jixiao Wu

    Abstract: We constructed a modular, biomimetic red panda paw with which to experimentally investigate the evolutionary reason for the existence of the false thumbs of red pandas. These thumbs were once believed to have shared a common origin with the similar false thumbs of giant pandas; however, the discovery of a carnivorous fossil ancestor of the red panda that had false thumbs implies that the red panda… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: 14 pages, 10 figures

  46. arXiv:2302.00545  [pdf, other

    cs.CV q-bio.NC

    An Out-of-Domain Synapse Detection Challenge for Microwasp Brain Connectomes

    Authors: Jingpeng Wu, Yicong Li, Nishika Gupta, Kazunori Shinomiya, Pat Gunn, Alexey Polilov, Hanspeter Pfister, Dmitri Chklovskii, Donglai Wei

    Abstract: The size of image stacks in connectomics studies now reaches the terabyte and often petabyte scales with a great diversity of appearance across brain regions and samples. However, manual annotation of neural structures, e.g., synapses, is time-consuming, which leads to limited training data often smaller than 0.001\% of the test data in size. Domain adaptation and generalization approaches were pr… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  47. arXiv:2301.03424  [pdf, other

    q-bio.BM cs.AI cs.LG

    An open unified deep graph learning framework for discovering drug leads

    Authors: Yueming Yin, Haifeng Hu, Zhen Yang, Jitao Yang, Chun Ye, Jiansheng Wu, Wilson Wen Bin Goh

    Abstract: Computational discovery of ideal lead compounds is a critical process for modern drug discovery. It comprises multiple stages: hit screening, molecular property prediction, and molecule optimization. Current efforts are disparate, involving the establishment of models for each stage, followed by multi-stage multi-model integration. However, this is non-ideal, as clumsy integration of incompatible… ▽ More

    Submitted 20 January, 2023; v1 submitted 5 December, 2022; originally announced January 2023.

    Comments: This article is used as the preliminary studies for the application of Lee Kuan Yew Postdoctoral Fellowship (LKYPDF) 2023 in Singapore. All rights reserved

  48. arXiv:2301.01829  [pdf

    cs.LG cs.AI q-bio.BM

    t-SMILES: A Scalable Fragment-based Molecular Representation Framework for De Novo Molecule Generation

    Authors: Juan-Ni Wu, Tong Wang, Yue Chen, Li-Juan Tang, Hai-Long Wu, Ru-Qin Yu

    Abstract: Effective representation of molecules is a crucial factor affecting the performance of artificial intelligence models. This study introduces a flexible, fragment-based, multiscale molecular representation framework called t-SMILES (tree-based SMILES) with three code algorithms: TSSA, TSDY and TSID. It describes molecules using SMILES-type strings obtained by performing a breadth-first search on a… ▽ More

    Submitted 20 May, 2024; v1 submitted 4 January, 2023; originally announced January 2023.

  49. arXiv:2210.17401  [pdf, other

    q-bio.BM cs.AI cs.LG

    Towards a Better Model with Dual Transformer for Drug Response Prediction

    Authors: Kun Li, Jia Wu, Bo Du, Sergey V. Petoukhov, Huiting Xu, Zheman Xiao, Wenbin Hu

    Abstract: GNN-based methods have achieved excellent results as a mainstream task in drug response prediction tasks in recent years. Traditional GNN methods use only the atoms in a drug molecule as nodes to obtain the representation of the molecular graph through node information passing, whereas the method using the transformer can only extract information about the nodes. However, the covalent bonding and… ▽ More

    Submitted 10 December, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 28 pages, 4 figures, 5 tables

  50. arXiv:2210.04552  [pdf

    physics.ed-ph q-bio.NC

    Intrinsic motivation, Need for cognition, Grit, Growth Mindset and Academic Achievement in High School Students: Latent Profiles and Its Predictive Effects

    Authors: Jun Wu, Shuoli Qi, Yueshan Zhong

    Abstract: Recent efforts to identify non-cognitive predictors of academic achievement have especially focused on self-constructs, whose measurement is concerned with a specific domain (e.g., mathematics). However, other important factors, such as character and motivation, have received less attention. Additionally, the predictive accuracy of non-cognitive factors lacks evidence from subjects including Engli… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 24 pages, 2 tables, 2 figures

    MSC Class: 62P15 (Primary) ACM Class: G.2