Skip to main content

Showing 1–50 of 247 results for author: Wang, J

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2507.02379  [pdf

    cs.AI q-bio.BM

    An AI-native experimental laboratory for autonomous biomolecular engineering

    Authors: Mingyu Wu, Zhaoguo Wang, Jiabin Wang, Zhiyuan Dong, Jingkai Yang, Qingting Li, Tianyu Huang, Lei Zhao, Mingqiang Li, Fei Wang, Chunhai Fan, Haibo Chen

    Abstract: Autonomous scientific research, capable of independently conducting complex experiments and serving non-specialists, represents a long-held aspiration. Achieving it requires a fundamental paradigm shift driven by artificial intelligence (AI). While autonomous experimental systems are emerging, they remain confined to areas featuring singular objectives and well-defined, simple experimental workflo… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

  2. arXiv:2506.16921  [pdf, ps, other

    q-bio.QM

    EHCube4P: Learning Epistatic Patterns Through Hypercube Graph Convolution Neural Network for Protein Fitness Function Estimation

    Authors: Muhammad Daud, Philippe Charton, Cedric Damour, Jingbo Wang, Frederic Cadet

    Abstract: Understanding the relationship between protein sequences and their functions is fundamental to protein engineering, but this task is hindered by the combinatorially vast sequence space and the experimental noise inherent in fitness measurements. In this study, we present a novel framework that models the sequence landscape as a hypercube $H(k,2)$ and integrates wavelet-based signal denoising with… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

    Comments: 12 pages, 4 figures, 1 table

  3. arXiv:2506.15190  [pdf, ps, other

    cs.LG q-bio.NC

    Learning Task-Agnostic Skill Bases to Uncover Motor Primitives in Animal Behaviors

    Authors: Jiyi Wang, Jingyang Ke, Bo Dai, Anqi Wu

    Abstract: Animals flexibly recombine a finite set of core motor primitives to meet diverse task demands, but existing behavior-segmentation methods oversimplify this process by imposing discrete syllables under restrictive generative assumptions. To reflect the animal behavior generation procedure, we introduce skill-based imitation learning (SKIL) for behavior understanding, a reinforcement learning-based… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 9 pages and 4 figures for the main text

  4. arXiv:2506.10271  [pdf, ps, other

    q-bio.QM cs.LG q-bio.GN

    Predicting function of evolutionarily implausible DNA sequences

    Authors: Shiyu Jiang, Xuyin Liu, Zitong Jerry Wang

    Abstract: Genomic language models (gLMs) show potential for generating novel, functional DNA sequences for synthetic biology, but doing so requires them to learn not just evolutionary plausibility, but also sequence-to-function relationships. We introduce a set of prediction tasks called Nullsettes, which assesses a model's ability to predict loss-of-function mutations created by translocating key control e… ▽ More

    Submitted 4 July, 2025; v1 submitted 11 June, 2025; originally announced June 2025.

    Comments: 13 pages, 6 figures, accepted to ICML 2025 Generative AI and Biology Workshop

  5. arXiv:2506.07553  [pdf, ps, other

    cs.AI q-bio.QM

    GTR-CoT: Graph Traversal as Visual Chain of Thought for Molecular Structure Recognition

    Authors: Jingchao Wang, Haote Yang, Jiang Wu, Yifan He, Xingjian Wei, Yinfan Wang, Chengjin Liu, Lingli Ge, Lijun Wu, Bin Wang, Dahua Lin, Conghui He

    Abstract: Optical Chemical Structure Recognition (OCSR) is crucial for digitizing chemical knowledge by converting molecular images into machine-readable formats. While recent vision-language models (VLMs) have shown potential in this task, their image-captioning approach often struggles with complex molecular structures and inconsistent annotations. To overcome these challenges, we introduce GTR-Mol-VLM, a… ▽ More

    Submitted 9 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

  6. arXiv:2506.06915  [pdf

    q-bio.BM cs.LG

    Graph Neural Networks in Modern AI-aided Drug Discovery

    Authors: Odin Zhang, Haitao Lin, Xujun Zhang, Xiaorui Wang, Zhenxing Wu, Qing Ye, Weibo Zhao, Jike Wang, Kejun Ying, Yu Kang, Chang-yu Hsieh, Tingjun Hou

    Abstract: Graph neural networks (GNNs), as topology/structure-aware models within deep learning, have emerged as powerful tools for AI-aided drug discovery (AIDD). By directly operating on molecular graphs, GNNs offer an intuitive and expressive framework for learning the complex topological and geometric features of drug-like molecules, cementing their role in modern molecular modeling. This review provide… ▽ More

    Submitted 7 June, 2025; originally announced June 2025.

  7. arXiv:2506.05768  [pdf, ps, other

    cs.LG q-bio.BM

    AANet: Virtual Screening under Structural Uncertainty via Alignment and Aggregation

    Authors: Wenyu Zhu, Jianhui Wang, Bowen Gao, Yinjun Jia, Haichuan Tan, Ya-Qin Zhang, Wei-Ying Ma, Yanyan Lan

    Abstract: Virtual screening (VS) is a critical component of modern drug discovery, yet most existing methods--whether physics-based or deep learning-based--are developed around holo protein structures with known ligand-bound pockets. Consequently, their performance degrades significantly on apo or predicted structures such as those from AlphaFold2, which are more representative of real-world early-stage dru… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  8. arXiv:2506.01456  [pdf

    q-bio.GN cs.AI cs.LG q-bio.NC

    GenDMR: A dynamic multimodal role-swapping network for identifying risk gene phenotypes

    Authors: Lina Qin, Cheng Zhu, Chuqi Zhou, Yukun Huang, Jiayi Zhu, Ping Liang, Jinju Wang, Yixing Huang, Cheng Luo, Dezhong Yao, Ying Tan

    Abstract: Recent studies have shown that integrating multimodal data fusion techniques for imaging and genetic features is beneficial for the etiological analysis and predictive diagnosis of Alzheimer's disease (AD). However, there are several critical flaws in current deep learning methods. Firstly, there has been insufficient discussion and exploration regarding the selection and encoding of genetic infor… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: 31 pages, 9 figures

  9. arXiv:2505.09664  [pdf, other

    q-bio.MN q-bio.QM

    KINDLE: Knowledge-Guided Distillation for Prior-Free Gene Regulatory Network Inference

    Authors: Rui Peng, Yuchen Lu, Qichen Sun, Yuxing Lu, Chi Zhang, Ziru Liu, Jinzhuo Wang

    Abstract: Gene regulatory network (GRN) inference serves as a cornerstone for deciphering cellular decision-making processes. Early approaches rely exclusively on gene expression data, thus their predictive power remain fundamentally constrained by the vast combinatorial space of potential gene-gene interactions. Subsequent methods integrate prior knowledge to mitigate this challenge by restricting the solu… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

  10. arXiv:2505.09656  [pdf, other

    q-bio.QM

    VIGIL: Vision-Language Guided Multiple Instance Learning Framework for Ulcerative Colitis Histological Healing Prediction

    Authors: Zhengxuan Qiu, Bo Peng, Xiaoying Tang, Jiankun Wang, Qin Guo

    Abstract: Objective: Ulcerative colitis (UC), characterized by chronic inflammation with alternating remission-relapse cycles, requires precise histological healing (HH) evaluation to improve clinical outcomes. To overcome the limitations of annotation-intensive deep learning methods and suboptimal multi-instance learning (MIL) in HH prediction, we propose VIGIL, the first vision-language guided MIL framewo… ▽ More

    Submitted 13 May, 2025; originally announced May 2025.

  11. arXiv:2505.03121  [pdf

    q-bio.BM

    AutoLoop: a novel autoregressive deep learning method for protein loop prediction with high accuracy

    Authors: Tianyue Wang, Xujun Zhang, Langcheng Wang, Odin Zhang, Jike Wang, Ercheng Wang, Jialu Wu, Renling Hu, Jingxuan Ge, Shimeng Li, Qun Su, Jiajun Yu, Chang-Yu Hsieh, Tingjun Hou, Yu Kang

    Abstract: Protein structure prediction is a critical and longstanding challenge in biology, garnering widespread interest due to its significance in understanding biological processes. A particular area of focus is the prediction of missing loops in proteins, which are vital in determining protein function and activity. To address this challenge, we propose AutoLoop, a novel computational model designed to… ▽ More

    Submitted 5 May, 2025; originally announced May 2025.

    Comments: 34 pages, 7 figures

  12. arXiv:2504.17162  [pdf

    cs.CV cs.AI q-bio.GN q-bio.SC

    A Comprehensive Review on RNA Subcellular Localization Prediction

    Authors: Cece Zhang, Xuehuan Zhu, Nick Peterson, Jieqiong Wang, Shibiao Wan

    Abstract: The subcellular localization of RNAs, including long non-coding RNAs (lncRNAs), messenger RNAs (mRNAs), microRNAs (miRNAs) and other smaller RNAs, plays a critical role in determining their biological functions. For instance, lncRNAs are predominantly associated with chromatin and act as regulators of gene transcription and chromatin structure, while mRNAs are distributed across the nucleus and cy… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  13. arXiv:2504.07881  [pdf

    q-bio.GN

    An LLM-Driven Multi-Agent Debate System for Mendelian Diseases

    Authors: Xinyang Zhou, Yongyong Ren, Qianqian Zhao, Daoyi Huang, Xinbo Wang, Tingting Zhao, Zhixing Zhu, Wenyuan He, Shuyuan Li, Yan Xu, Yu Sun, Yongguo Yu, Shengnan Wu, Jian Wang, Guangjun Yu, Dake He, Bo Ban, Hui Lu

    Abstract: Accurate diagnosis of Mendelian diseases is crucial for precision therapy and assistance in preimplantation genetic diagnosis. However, existing methods often fall short of clinical standards or depend on extensive datasets to build pretrained machine learning models. To address this, we introduce an innovative LLM-Driven multi-agent debate system (MD2GPS) with natural language explanations of the… ▽ More

    Submitted 11 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

    Comments: 21 pages, 5 figures, 1 table

  14. arXiv:2503.21788  [pdf, other

    q-bio.BM cs.LG

    PharMolixFM: All-Atom Foundation Models for Molecular Modeling and Generation

    Authors: Yizhen Luo, Jiashuo Wang, Siqi Fan, Zaiqing Nie

    Abstract: Structural biology relies on accurate three-dimensional biomolecular structures to advance our understanding of biological functions, disease mechanisms, and therapeutics. While recent advances in deep learning have enabled the development of all-atom foundation models for molecular modeling and generation, existing approaches face challenges in generalization due to the multi-modal nature of atom… ▽ More

    Submitted 31 March, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  15. arXiv:2503.17738  [pdf

    q-bio.CB

    Tumor-associated CD19$^+$ macrophages induce immunosuppressive microenvironment in hepatocellular carcinoma

    Authors: Junli Wang, Wanyue Cao, Jinyan Huang, Yu Zhou, Rujia Zheng, Yu Lou, Jiaqi Yang, Jianghui Tang, Mao Ye, Zhengtao Hong, Jiangchao Wu, Haonan Ding, Yuquan Zhang, Jianpeng Sheng, Xinjiang Lu, Pinglong Xu, Xiongbin Lu, Xueli Bai, Tingbo Liang, Qi Zhang

    Abstract: Tumor-associated macrophages are a key component that contributes to the immunosuppressive microenvironment in human cancers. However, therapeutic targeting of macrophages has been a challenge in clinic due to the limited understanding of their heterogeneous subpopulations and distinct functions. Here, we identify a unique and clinically relevant CD19$^+$ subpopulation of macrophages that is enric… ▽ More

    Submitted 22 March, 2025; originally announced March 2025.

    Comments: 7 figures

  16. arXiv:2503.16582  [pdf

    cs.LG cs.AI q-bio.GN

    Machine Learning-Based Genomic Linguistic Analysis (Gene Sequence Feature Learning): A Case Study on Predicting Heavy Metal Response Genes in Rice

    Authors: Ruiqi Yang, Jianxu Wang, Wei Yuan, Xun Wang, Mei Li

    Abstract: This study explores the application of machine learning-based genetic linguistics for identifying heavy metal response genes in rice (Oryza sativa). By integrating convolutional neural networks and random forest algorithms, we developed a hybrid model capable of extracting and learning meaningful features from gene sequences, such as k-mer frequencies and physicochemical properties. The model was… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  17. arXiv:2503.13522  [pdf, ps, other

    q-bio.BM cs.AI cs.LG

    Advanced Deep Learning Methods for Protein Structure Prediction and Design

    Authors: Yichao Zhang, Ningyuan Deng, Xinyuan Song, Ziqian Bi, Tianyang Wang, Zheyu Yao, Keyu Chen, Ming Li, Qian Niu, Junyu Liu, Benji Peng, Sen Zhang, Ming Liu, Li Zhang, Xuanhe Pan, Jinlang Wang, Pohsun Feng, Yizhu Wen, Lawrence KQ Yan, Hongming Tseng, Yan Zhong, Yunze Wang, Ziyuan Qin, Bowen Jing, Junjie Yang , et al. (3 additional authors not shown)

    Abstract: After AlphaFold won the Nobel Prize, protein prediction with deep learning once again became a hot topic. We comprehensively explore advanced deep learning methods applied to protein structure prediction and design. It begins by examining recent innovations in prediction architectures, with detailed discussions on improvements such as diffusion based frameworks and novel pairwise attention modules… ▽ More

    Submitted 29 March, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  18. arXiv:2503.13465  [pdf, ps, other

    eess.SP cs.AI cs.LG q-bio.NC

    A novel Fourier Adjacency Transformer for advanced EEG emotion recognition

    Authors: Jinfeng Wang, Yanhao Huang, Sifan Song, Boqian Wang, Jionglong Su, Jiaman Ding

    Abstract: EEG emotion recognition faces significant hurdles due to noise interference, signal nonstationarity, and the inherent complexity of brain activity which make accurately emotion classification. In this study, we present the Fourier Adjacency Transformer, a novel framework that seamlessly integrates Fourier-based periodic analysis with graph-driven structural modeling. Our method first leverages nov… ▽ More

    Submitted 27 February, 2025; originally announced March 2025.

  19. arXiv:2503.10195  [pdf, other

    cs.CV cs.NE q-bio.NC

    ST-FlowNet: An Efficient Spiking Neural Network for Event-Based Optical Flow Estimation

    Authors: Hongze Sun, Jun Wang, Wuque Cai, Duo Chen, Qianqian Liao, Jiayi He, Yan Cui, Dezhong Yao, Daqing Guo

    Abstract: Spiking Neural Networks (SNNs) have emerged as a promising tool for event-based optical flow estimation tasks due to their ability to leverage spatio-temporal information and low-power capabilities. However, the performance of SNN models is often constrained, limiting their application in real-world scenarios. In this work, we address this gap by proposing a novel neural network architecture, ST-F… ▽ More

    Submitted 27 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: 13 pages, 6 figures, 6 tables; This work has been submitted to Neural Networks for possible publication

  20. arXiv:2503.03783  [pdf, other

    q-bio.TO cs.AI cs.ET cs.HC cs.LG

    Passive Heart Rate Monitoring During Smartphone Use in Everyday Life

    Authors: Shun Liao, Paolo Di Achille, Jiang Wu, Silviu Borac, Jonathan Wang, Xin Liu, Eric Teasley, Lawrence Cai, Yuzhe Yang, Yun Liu, Daniel McDuff, Hao-Wei Su, Brent Winslow, Anupam Pathak, Shwetak Patel, James A. Taylor, Jameson K. Rogers, Ming-Zher Poh

    Abstract: Resting heart rate (RHR) is an important biomarker of cardiovascular health and mortality, but tracking it longitudinally generally requires a wearable device, limiting its availability. We present PHRM, a deep learning system for passive heart rate (HR) and RHR measurements during everyday smartphone use, using facial video-based photoplethysmography. Our system was developed using 225,773 videos… ▽ More

    Submitted 21 March, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: Updated author list

  21. Weighted Combination and Singular Spectrum Analysis Based Remote Photoplethysmography Pulse Extraction in Low-light Environments

    Authors: Lin Xi, Xingming Wu, Weihai Chen, Jianhua Wang, Changchen Zhao

    Abstract: Camera-based vital signs monitoring in recent years has attracted more and more researchers and the results are promising. However, a few research works focus on heart rate extraction under extremely low illumination environments. In this paper, we propose a novel framework for remote heart rate estimation under low-light conditions. This method uses singular spectrum analysis (SSA) to decompose t… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 14 pages, 8 figures; Published at Medical Engineering & Physics (MEP)

  22. arXiv:2502.20275  [pdf, other

    q-bio.QM

    How cancer emerges: Data-driven universal insights into tumorigenesis via hallmark networks

    Authors: Jiahe Wang, Yan Wu, Yuke Hou, Yang Li, Dachuan Xu, Changjing Zhuge, Yue Han

    Abstract: Cancer is a complex disease driven by dynamic regulatory shifts that cannot be fully captured by individual molecular profiling. We employ a data-driven approach to construct a coarse-grained dynamic network model based on hallmark interactions, integrating stochastic differential equations with gene regulatory network data to explore key macroscopic dynamic changes in tumorigenesis. Our analysis… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  23. arXiv:2502.16446  [pdf, other

    cs.LG cs.AI q-bio.BM

    Auxiliary Discrminator Sequence Generative Adversarial Networks (ADSeqGAN) for Few Sample Molecule Generation

    Authors: Haocheng Tang, Jing Long, Junmei Wang

    Abstract: In this work, we introduce Auxiliary Discriminator Sequence Generative Adversarial Networks (ADSeqGAN), a novel approach for molecular generation in small-sample datasets. Traditional generative models often struggle with limited training data, particularly in drug discovery, where molecular datasets for specific therapeutic targets, such as nucleic acids binders and central nervous system (CNS) d… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  24. arXiv:2502.10425  [pdf, other

    q-bio.NC cs.AI cs.NE

    Neuron Platonic Intrinsic Representation From Dynamics Using Contrastive Learning

    Authors: Wei Wu, Can Liao, Zizhen Deng, Zhengrui Guo, Jinzhuo Wang

    Abstract: The Platonic Representation Hypothesis suggests a universal, modality-independent reality representation behind different data modalities. Inspired by this, we view each neuron as a system and detect its multi-segment activity data under various peripheral conditions. We assume there's a time-invariant representation for the same neuron, reflecting its intrinsic properties like molecular profiles,… ▽ More

    Submitted 18 February, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: Accepted by ICLR'2025

  25. arXiv:2502.06881  [pdf, other

    q-bio.BM

    A Comprehensive Review of Protein Language Models

    Authors: Lei Wang, Xudong Li, Han Zhang, Jinyi Wang, Dingkang Jiang, Zhidong Xue, Yan Wang

    Abstract: At the intersection of the rapidly growing biological data landscape and advancements in Natural Language Processing (NLP), protein language models (PLMs) have emerged as a transformative force in modern research. These models have achieved remarkable progress, highlighting the need for timely and comprehensive overviews. However, much of the existing literature focuses narrowly on specific domain… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  26. arXiv:2502.01430  [pdf, other

    cs.LG q-bio.QM

    Molecular Odor Prediction Based on Multi-Feature Graph Attention Networks

    Authors: HongXin Xie, JianDe Sun, Yi Shao, Shuai Li, Sujuan Hou, YuLong Sun, Jian Wang

    Abstract: Olfactory perception plays a critical role in both human and organismal interactions, yet understanding of its underlying mechanisms and influencing factors remain insufficient. Molecular structures influence odor perception through intricate biochemical interactions, and accurately quantifying structure-odor relationships presents significant challenges. The Quantitative Structure-Odor Relationsh… ▽ More

    Submitted 3 February, 2025; originally announced February 2025.

  27. arXiv:2501.08363  [pdf

    q-bio.GN

    TopoLa: A Universal Framework to Enhance Cell Representations for Single-cell and Spatial Omics through Topology-encoded Latent Hyperbolic Geometry

    Authors: Kai Zheng, Shaokai Wang, Yunpei Xu, Qiming Lei, Qichang Zhao, Xiao Liang, Qilong Feng, Yaohang Li, Min Li, Jinhui Xu, Jianxin Wang

    Abstract: Recent advances in cellular research demonstrate that scRNA-seq characterizes cellular heterogeneity, while spatial transcriptomics reveals the spatial distribution of gene expression. Cell representation is the fundamental issue in the two fields. Here, we propose Topology-encoded Latent Hyperbolic Geometry (TopoLa), a computational framework enhancing cell representations by capturing fine-grain… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 116 pages,53 figures

  28. arXiv:2501.02176  [pdf

    q-bio.QM cs.LG

    Molecule-dynamic-based Aging Clock and Aging Roadmap Forecast with Sundial

    Authors: Wei Wu, Zizhen Deng, Chi Zhang, Can Liao, Jinzhuo Wang

    Abstract: Addressing the unavoidable bias inherent in supervised aging clocks, we introduce Sundial, a novel framework that models molecular dynamics through a diffusion field, capturing both the population-level aging process and the individual-level relative aging order. Sundial enables unbiasedestimation of biological age and the forecast of aging roadmap. Fasteraging individuals from Sundial exhibit a h… ▽ More

    Submitted 3 January, 2025; originally announced January 2025.

  29. arXiv:2501.01462  [pdf

    cs.LG cs.AI q-bio.GN

    Pan-infection Foundation Framework Enables Multiple Pathogen Prediction

    Authors: Lingrui Zhang, Haonan Wu, Nana Jin, Chenqing Zheng, Jize Xie, Qitai Cai, Jun Wang, Qin Cao, Xubin Zheng, Jiankun Wang, Lixin Cheng

    Abstract: Host-response-based diagnostics can improve the accuracy of diagnosing bacterial and viral infections, thereby reducing inappropriate antibiotic prescriptions. However, the existing cohorts with limited sample size and coarse infections types are unable to support the exploration of an accurate and generalizable diagnostic model. Here, we curate the largest infection host-response transcriptome da… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: 15 pages, 8 figures

  30. arXiv:2412.19831  [pdf, ps, other

    q-bio.PE

    Leslie Population Models in Predator-prey and Competitive populations: theory and applications by machine learning

    Authors: Pico Gilman, Steven J. Miller, Daeyoung Son, Saad Waheed, Janine Wang

    Abstract: We introduce a new predator-prey model by replacing the growth and predation constant by a square matrix, and the population density as a population vector. The classical Lotka-Volterra model describes a population that either modulates or converges. Stability analysis of such models have been extensively studied by the works of Merdan (https://doi.org/10.1016/j.chaos.2007.06.062). The new model a… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

  31. A Fluid-Structure Interaction Model of the Zebrafish Aortic Valve

    Authors: Alexander D. Kaiser, Jing Wang, Aaron L. Brown, Enbo Zhu, Tzung Hsiai, Alison L. Marsden

    Abstract: The zebrafish is a valuable model organism for studying cardiac development and diseases due to its many shared aspects of genetics and anatomy with humans and ease of experimental manipulations. Computational fluid-structure interaction (FSI) simulations are an efficient and highly controllable means to study the function of cardiac valves in development and diseases. Due to their small scales, l… ▽ More

    Submitted 19 June, 2025; v1 submitted 23 December, 2024; originally announced December 2024.

    MSC Class: 92C35 (Primary); 92C10; 76Z05 (Secondary) ACM Class: J.3.1

  32. arXiv:2412.17043  [pdf, other

    q-bio.NC physics.bio-ph

    Optimal signal transmission and timescale diversity in a model of human brain operating near criticality

    Authors: Yang Qi, Jiexiang Wang, Weiyang Ding, Gustavo Deco, Viktor Jirsa, Wenlian Lu, Jianfeng Feng

    Abstract: Cortical neurons exhibit a hierarchy of timescales across brain regions in response to input stimuli, which is thought to be crucial for information processing of different temporal scales. Modeling studies suggest that both intra-regional circuit dynamics as well as cross-regional connectome may contribute to this timescale diversity. Equally important to diverse timescales is the ability to tran… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  33. arXiv:2412.16220  [pdf, other

    q-bio.QM cs.AI cs.LG

    Cross-Attention Graph Neural Networks for Inferring Gene Regulatory Networks with Skewed Degree Distribution

    Authors: Jiaqi Xiong, Nan Yin, Shiyang Liang, Haoyang Li, Yingxu Wang, Duo Ai, Fang Pan, Jingjie Wang

    Abstract: Inferencing Gene Regulatory Networks (GRNs) from gene expression data is a pivotal challenge in systems biology, and several innovative computational methods have been introduced. However, most of these studies have not considered the skewed degree distribution of genes. Specifically, some genes may regulate multiple target genes while some genes may be regulated by multiple regulator genes. Such… ▽ More

    Submitted 9 January, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: 11 pages, 6 figures,1 tabels

  34. arXiv:2412.10659  [pdf, other

    cs.CV cs.LG q-bio.QM

    MEATRD: Multimodal Anomalous Tissue Region Detection Enhanced with Spatial Transcriptomics

    Authors: Kaichen Xu, Qilong Wu, Yan Lu, Yinan Zheng, Wenlin Li, Xingjie Tang, Jun Wang, Xiaobo Sun

    Abstract: The detection of anomalous tissue regions (ATRs) within affected tissues is crucial in clinical diagnosis and pathological studies. Conventional automated ATR detection methods, primarily based on histology images alone, falter in cases where ATRs and normal tissues have subtle visual differences. The recent spatial transcriptomics (ST) technology profiles gene expressions across tissue regions, o… ▽ More

    Submitted 13 December, 2024; originally announced December 2024.

    Comments: AAAI 2025. Code: https://github.com/wqlzuel/MEATRD

  35. arXiv:2412.07236  [pdf, other

    eess.SP cs.AI cs.LG q-bio.NC

    CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding

    Authors: Jiquan Wang, Sha Zhao, Zhiling Luo, Yangxuan Zhou, Haiteng Jiang, Shijian Li, Tao Li, Gang Pan

    Abstract: Electroencephalography (EEG) is a non-invasive technique to measure and record brain electrical activity, widely used in various BCI and healthcare applications. Early EEG decoding methods rely on supervised learning, limited by specific tasks and datasets, hindering model performance and generalizability. With the success of large language models, there is a growing body of studies focusing on EE… ▽ More

    Submitted 13 April, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: Accepted by The Thirteenth International Conference on Learning Representations (ICLR 2025)

  36. arXiv:2412.07136  [pdf

    cs.CV q-bio.QM

    A multimodal ensemble approach for clear cell renal cell carcinoma treatment outcome prediction

    Authors: Meixu Chen, Kai Wang, Payal Kapur, James Brugarolas, Raquibul Hannan, Jing Wang

    Abstract: Purpose: A reliable cancer prognosis model for clear cell renal cell carcinoma (ccRCC) can enhance personalized treatment. We developed a multi-modal ensemble model (MMEM) that integrates pretreatment clinical data, multi-omics data, and histopathology whole slide image (WSI) data to predict overall survival (OS) and disease-free survival (DFS) for ccRCC patients. Methods: We analyzed 226 patients… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: 10 pages, 3 figures, 4 tables

  37. arXiv:2412.06847  [pdf, other

    q-bio.QM cs.AI cs.LG

    M$^{3}$-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and Discovery

    Authors: Siyuan Guo, Lexuan Wang, Chang Jin, Jinxian Wang, Han Peng, Huayang Shi, Wengen Li, Jihong Guan, Shuigeng Zhou

    Abstract: This paper introduces M$^{3}$-20M, a large-scale Multi-Modal Molecule dataset that contains over 20 million molecules, with the data mainly being integrated from existing databases and partially generated by large language models. Designed to support AI-driven drug design and discovery, M$^{3}$-20M is 71 times more in the number of molecules than the largest existing dataset, providing an unpreced… ▽ More

    Submitted 16 March, 2025; v1 submitted 7 December, 2024; originally announced December 2024.

  38. arXiv:2411.17206  [pdf, other

    q-bio.NC

    Energy Consumption Optimization, Response Time Differences and Indicators in Cortical Working Memory Revealed by Nonequilibrium

    Authors: Xiaochen Wang, Yuxuan Wu, Feng Zhang, Jin Wang

    Abstract: The neocortex, a complex system driving multi-region interactions, remains a core puzzle in neuroscience. Despite quantitative insights across brain scales, understanding the mechanisms underlying neural activities is challenging. Advances from Hopfield networks to large-scale cortical models have deepened neural network theory, yet these models often fall short of capturing global brain functions… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

  39. arXiv:2411.14743  [pdf, other

    cs.CV cs.AI q-bio.QM

    FOCUS: Knowledge-enhanced Adaptive Visual Compression for Few-shot Whole Slide Image Classification

    Authors: Zhengrui Guo, Conghao Xiong, Jiabo Ma, Qichen Sun, Lishuang Feng, Jinzhuo Wang, Hao Chen

    Abstract: Few-shot learning presents a critical solution for cancer diagnosis in computational pathology (CPath), addressing fundamental limitations in data availability, particularly the scarcity of expert annotations and patient privacy constraints. A key challenge in this paradigm stems from the inherent disparity between the limited training set of whole slide images (WSIs) and the enormous number of co… ▽ More

    Submitted 20 March, 2025; v1 submitted 22 November, 2024; originally announced November 2024.

    Comments: Accepted by CVPR'2025

  40. arXiv:2411.05244  [pdf

    q-bio.QM

    Nonperfused Retinal Capillaries -- A New Method Developed on OCT and OCTA

    Authors: Min Gao, Yukun Guo, Tristan T. Hormel, Jie Wang, Elizabeth White, Dong-Wouk Park, Thomas S. Hwang, Steven T. Bailey, Yali Jia

    Abstract: To develop a new method to quantify nonperfused retinal capillaries (NPCs) by using co-registered optical coherence tomography (OCT) and OCT angiography (OCTA), and to evaluate NPCs in eyes with age-related macular degeneration (AMD) and diabetic retinopathy (DR). Multiple consecutive 3x3-mm OCT/OCTA scans were obtained using a commercial device (Solix; Visionix/Optovue, Inc., California, USA). We… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  41. arXiv:2410.20132  [pdf, ps, other

    eess.SP cs.AI cs.LG q-bio.BM

    On-Site Precise Screening of SARS-CoV-2 Systems Using a Channel-Wise Attention-Based PLS-1D-CNN Model with Limited Infrared Signatures

    Authors: Wenwen Zhang, Zhouzhuo Tang, Yingmei Feng, Xia Yu, Qi Jie Wang, Zhiping Lin

    Abstract: During the early stages of respiratory virus outbreaks, such as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the efficient utilize of limited nasopharyngeal swabs for rapid and accurate screening is crucial for public health. In this study, we present a methodology that integrates attenuated total reflection-Fourier transform infrared spectroscopy (ATR-FTIR) with the adaptive iter… ▽ More

    Submitted 26 October, 2024; originally announced October 2024.

  42. arXiv:2410.17620  [pdf

    q-bio.NC physics.bio-ph

    Holistic structure of neural pathways underlies brain perceptual rivalry: Physical mechanism of auditory stream segregation

    Authors: Yuxuan Wu, Jinling Gao, Xiaona Fang, Jin Wang

    Abstract: Brain perceptual rivalry, exemplified by auditory stream segregation of competing tones (A_, B__, ABA_), serves as a core mechanism of brain perception formation. While increasingly recognized as determining by neural connections rather than specific neural groups, the mechanism of brain perception remains uncertain. We demonstrate that auditory stream segregation arises from the topological struc… ▽ More

    Submitted 7 March, 2025; v1 submitted 23 October, 2024; originally announced October 2024.

    Comments: 26 pages, 8 figures

  43. arXiv:2410.13872  [pdf, other

    cs.NE cs.LG q-bio.NC

    BLEND: Behavior-guided Neural Population Dynamics Modeling via Privileged Knowledge Distillation

    Authors: Zhengrui Guo, Fangxu Zhou, Wei Wu, Qichen Sun, Lishuang Feng, Jinzhuo Wang, Hao Chen

    Abstract: Modeling the nonlinear dynamics of neuronal populations represents a key pursuit in computational neuroscience. Recent research has increasingly focused on jointly modeling neural activity and behavior to unravel their interconnections. Despite significant efforts, these approaches often necessitate either intricate model designs or oversimplified assumptions. Given the frequent absence of perfect… ▽ More

    Submitted 6 February, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Accepted by ICLR'2025

  44. arXiv:2410.07919  [pdf, other

    cs.CL q-bio.BM

    InstructBioMol: Advancing Biomolecule Understanding and Design Following Human Instructions

    Authors: Xiang Zhuang, Keyan Ding, Tianwen Lyu, Yinuo Jiang, Xiaotong Li, Zhuoyi Xiang, Zeyuan Wang, Ming Qin, Kehua Feng, Jike Wang, Qiang Zhang, Huajun Chen

    Abstract: Understanding and designing biomolecules, such as proteins and small molecules, is central to advancing drug discovery, synthetic biology, and enzyme engineering. Recent breakthroughs in Artificial Intelligence (AI) have revolutionized biomolecular research, achieving remarkable accuracy in biomolecular prediction and design. However, a critical gap remains between AI's computational power and res… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  45. arXiv:2410.04351  [pdf

    q-bio.TO

    An asymmetric surface coating strategy for promotes rapid endothelialization in the rabbit carotid artery

    Authors: Lili Tan, Zhiyi Ye, Suhua Yu, Jinxuan Wang, Chenxi Ouyang, Zhengcai Zhang, Robert Guidoin, Guixue Wang

    Abstract: Studying surface modification has long been a key area for enhancing the effects of vascular stents after surgery. The study aimed to develop an asymmetric drug-eluting stent (ADES) with differential drug loading on its inner and outer surfaces, hypothesizing that this design would enhance drug delivery efficacy for percutaneous coronary interventions (PCIs) compared to uniformly coated drug-eluti… ▽ More

    Submitted 6 October, 2024; originally announced October 2024.

    Comments: 24 pages,7 figures, 1 table

  46. arXiv:2409.08022  [pdf, other

    q-bio.BM

    De novo design of high-affinity protein binders with AlphaProteo

    Authors: Vinicius Zambaldi, David La, Alexander E. Chu, Harshnira Patani, Amy E. Danson, Tristan O. C. Kwan, Thomas Frerix, Rosalia G. Schneider, David Saxton, Ashok Thillaisundaram, Zachary Wu, Isabel Moraes, Oskar Lange, Eliseo Papa, Gabriella Stanton, Victor Martin, Sukhdeep Singh, Lai H. Wong, Russ Bates, Simon A. Kohl, Josh Abramson, Andrew W. Senior, Yilmaz Alguel, Mary Y. Wu, Irene M. Aspalter , et al. (7 additional authors not shown)

    Abstract: Computational design of protein-binding proteins is a fundamental capability with broad utility in biomedical research and biotechnology. Recent methods have made strides against some target proteins, but on-demand creation of high-affinity binders without multiple rounds of experimental testing remains an unsolved challenge. This technical report introduces AlphaProteo, a family of machine learni… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 45 pages, 17 figures

  47. arXiv:2408.12413  [pdf, other

    q-bio.BM cs.AI

    Dynamic PDB: A New Dataset and a SE(3) Model Extension by Integrating Dynamic Behaviors and Physical Properties in Protein Structures

    Authors: Ce Liu, Jun Wang, Zhiqiang Cai, Yingxu Wang, Huizhen Kuang, Kaihui Cheng, Liwei Zhang, Qingkun Su, Yining Tang, Fenglei Cao, Limei Han, Siyu Zhu, Yuan Qi

    Abstract: Despite significant progress in static protein structure collection and prediction, the dynamic behavior of proteins, one of their most vital characteristics, has been largely overlooked in prior research. This oversight can be attributed to the limited availability, diversity, and heterogeneity of dynamic protein datasets. To address this gap, we propose to enhance existing prestigious static 3D… ▽ More

    Submitted 18 September, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

  48. arXiv:2407.12296  [pdf

    q-bio.BM

    Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model

    Authors: Jike Wang, Jianwen Feng, Yu Kang, Peichen Pan, Jingxuan Ge, Yan Wang, Mingyang Wang, Zhenxing Wu, Xingcai Zhang, Jiameng Yu, Xujun Zhang, Tianyue Wang, Lirong Wen, Guangning Yan, Yafeng Deng, Hui Shi, Chang-Yu Hsieh, Zhihui Jiang, Tingjun Hou

    Abstract: Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach for swiftly designing novel antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram… ▽ More

    Submitted 2 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 43 pages, 6 figures, 5 tables. Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

  49. arXiv:2407.09450  [pdf, other

    cs.AI cs.CL cs.LG q-bio.NC

    Human-like Episodic Memory for Infinite Context LLMs

    Authors: Zafeirios Fountas, Martin A Benfeghoul, Adnan Oomerjee, Fenia Christopoulou, Gerasimos Lampouras, Haitham Bou-Ammar, Jun Wang

    Abstract: Large language models (LLMs) have shown remarkable capabilities, but still struggle with processing extensive contexts, limiting their ability to maintain coherence and accuracy over long sequences. In contrast, the human brain excels at organising and retrieving episodic experiences across vast temporal scales, spanning a lifetime. In this work, we introduce EM-LLM, a novel approach that integrat… ▽ More

    Submitted 25 October, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  50. arXiv:2407.07930  [pdf

    q-bio.BM cs.LG

    Token-Mol 1.0: Tokenized drug design with large language model

    Authors: Jike Wang, Rui Qin, Mingyang Wang, Meijing Fang, Yangyang Zhang, Yuchen Zhu, Qun Su, Qiaolin Gou, Chao Shen, Odin Zhang, Zhenxing Wu, Dejun Jiang, Xujun Zhang, Huifeng Zhao, Xiaozhe Wan, Zhourui Wu, Liwei Liu, Yu Kang, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Significant interests have recently risen in leveraging sequence-based large language models (LLMs) for drug design. However, most current applications of LLMs in drug discovery lack the ability to comprehend three-dimensional (3D) structures, thereby limiting their effectiveness in tasks that explicitly involve molecular conformations. In this study, we introduced Token-Mol, a token-only 3D drug… ▽ More

    Submitted 19 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.