Showing 1–50 of 69 results for author: Wang, M

Searching in archive q-bio.
  1. arXiv:2507.02450  [pdf, ps, other]

    q-bio.NC

    Network structural change point detection and reconstruction for balanced neuronal networks

    Authors: Kai Chen, Mingzhang Wang, Songting Li, Douglas Zhou

    Abstract: Understanding brain dynamics and functions critically depends on knowledge of the network connectivity among neurons. However, the complexity of brain structural connectivity, coupled with continuous modifications driven by synaptic plasticity, makes its direct experimental measurement particularly challenging. Conventional connectivity inference methods based on neuronal recordings often assumes…

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 22 pages, 5 figures

  2. arXiv:2507.02004  [pdf, ps, other]

    cs.AI cs.CL q-bio.BM

    STELLA: Self-Evolving LLM Agent for Biomedical Research

    Authors: Ruofan Jin, Zaixi Zhang, Mengdi Wang, Le Cong

    Abstract: The rapid growth of biomedical data, tools, and literature has created a fragmented research landscape that outpaces human expertise. While AI agents offer a solution, they typically rely on static, manually curated toolsets, limiting their ability to adapt and scale. Here, we introduce STELLA, a self-evolving AI agent designed to overcome these limitations. STELLA employs a multi-agent architectu…

    Submitted 1 July, 2025; originally announced July 2025.

  3. arXiv:2505.23839  [pdf, other]

    cs.CR q-bio.GN

    GeneBreaker: Jailbreak Attacks against DNA Language Models with Pathogenicity Guidance

    Authors: Zaixi Zhang, Zhenghong Zhou, Ruofan Jin, Le Cong, Mengdi Wang

    Abstract: DNA, encoding genetic instructions for almost all living organisms, fuels groundbreaking advances in genomics and synthetic biology. Recently, DNA Foundation Models have achieved success in designing synthetic functional DNA sequences, even whole genomes, but their susceptibility to jailbreaking remains underexplored, leading to potential concern of generating harmful sequences such as pathogens o…

    Submitted 28 May, 2025; originally announced May 2025.

  4. arXiv:2505.22250  [pdf]

    cs.CV q-bio.QM

    YH-MINER: Multimodal Intelligent System for Natural Ecological Reef Metric Extraction

    Authors: Mingzhuang Wang, Yvyang Li, Xiyang Zhang, Fei Tan, Qi Shi, Guotao Zhang, Siqi Chen, Yufei Liu, Lei Lei, Ming Zhou, Qiang Lin, Hongqiang Yang

    Abstract: Coral reefs, crucial for sustaining marine biodiversity and ecological processes (e.g., nutrient cycling, habitat provision), face escalating threats, underscoring the need for efficient monitoring. Coral reef ecological monitoring faces dual challenges of low efficiency in manual analysis and insufficient segmentation accuracy in complex underwater scenarios. This study develops the YH-MINER syst…

    Submitted 29 May, 2025; v1 submitted 28 May, 2025; originally announced May 2025.

  5. arXiv:2505.17922  [pdf, ps, other]

    q-bio.QM

    Bayesian ensemble learning for predicting health outcomes of multipollutant mixtures

    Authors: Yu-Chien Ning, Xin Zhou, Francine Laden, Molin Wang

    Abstract: We introduce the SoftBart approach from Bayesian ensemble learning to estimate the relationship between multipollutant mixtures and health on chronic exposures in epidemiology research. This approach offers several key advantages over existing methods: (1) it is computationally efficient and well-suited for analyzing large datasets; (2) it is flexible in estimating various correlated nonlinear fun…

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: 13 pages, 6 figures

  6. arXiv:2505.09883  [pdf, other]

    q-bio.GN

    DeepPlantCRE: A Transformer-CNN Hybrid Framework for Plant Gene Expression Modeling and Cross-Species Generalization

    Authors: Yingjun Wu, Jingyun Huang, Liang Ming, Pengcheng Deng, Maojun Wang, Zeyu Zhang

    Abstract: The investigation of plant transcriptional regulation constitutes a fundamental basis for crop breeding, where cis-regulatory elements (CREs), as the key factor determining gene expression, have become the focus of crop genetic improvement research. Deep learning techniques, leveraging their exceptional capacity for high-dimensional feature extraction and nonlinear regulatory relationship modeling…

    Submitted 14 May, 2025; originally announced May 2025.

  7. arXiv:2505.09873  [pdf, other]

    q-bio.GN

    Deep Learning and Explainable AI: New Pathways to Genetic Insights

    Authors: Chenyu Wang, Chaoying Zuo, Zihan Su, Yuhang Xing, Lu Li, Maojun Wang, Zeyu Zhang

    Abstract: Deep learning-based AI models have been extensively applied in genomics, achieving remarkable success across diverse applications. As these models gain prominence, there exists an urgent need for interpretability methods to establish trustworthiness in model-driven decisions. For genetic researchers, interpretable insights derived from these models hold significant value in providing novel perspec…

    Submitted 14 May, 2025; originally announced May 2025.

  8. arXiv:2505.01700  [pdf, other]

    cs.LG q-bio.QM

    PoseX: AI Defeats Physics Approaches on Protein-Ligand Cross Docking

    Authors: Yize Jiang, Xinze Li, Yuanyuan Zhang, Jin Han, Youjun Xu, Ayush Pandit, Zaixi Zhang, Mengdi Wang, Mengyang Wang, Chong Liu, Guang Yang, Yejin Choi, Wu-Jun Li, Tianfan Fu, Fang Wu, Junhong Liu

    Abstract: Existing protein-ligand docking studies typically focus on the self-docking scenario, which is less practical in real applications. Moreover, some studies involve heavy frameworks requiring extensive training, posing challenges for convenient and efficient assessment of docking methods. To fill these gaps, we design PoseX, an open-source benchmark to evaluate both self-docking and cross-docking, e…

    Submitted 21 May, 2025; v1 submitted 3 May, 2025; originally announced May 2025.

  9. arXiv:2503.20179  [pdf, other]

    cs.CL cs.IR q-bio.QM

    ProtoBERT-LoRA: Parameter-Efficient Prototypical Finetuning for Immunotherapy Study Identification

    Authors: Shijia Zhang, Xiyu Ding, Kai Ding, Jacob Zhang, Kevin Galinsky, Mengrui Wang, Ryan P. Mayers, Zheyu Wang, Hadi Kharrazi

    Abstract: Identifying immune checkpoint inhibitor (ICI) studies in genomic repositories like Gene Expression Omnibus (GEO) is vital for cancer research yet remains challenging due to semantic ambiguity, extreme class imbalance, and limited labeled data in low-resource settings. We present ProtoBERT-LoRA, a hybrid framework that combines PubMedBERT with prototypical networks and Low-Rank Adaptation (LoRA) fo…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: Submitted to AMIA 2025 Annual Symposium

  10. arXiv:2502.00934  [pdf]

    q-bio.PE q-bio.QM

    Optimizing Global Genomic Surveillance for Early Detection of Emerging SARS-CoV-2 Variants

    Authors: Haogao Gu, Jifan Li, Wanying Sun, Mengting Li, Kathy Leung, Joseph T. Wu, Hsiang-Yu Yuan, Maggie H. Wang, Bingyi Yang, Matthew R. McKay, Ning Ning, Leo L. M. Poon

    Abstract: Background: Global viral threats underscore the need for effective genomic surveillance, but high costs and uneven resource distribution hamper its implementation. Targeting surveillance to international travelers in major travel hubs may offer a more efficient strategy for the early detection of SARS-CoV-2 variants. Methods: We developed and calibrated a multiple-strain metapopulation model of…

    Submitted 13 February, 2025; v1 submitted 2 February, 2025; originally announced February 2025.

  11. arXiv:2412.16483  [pdf, other]

    cs.LG physics.chem-ph q-bio.BM

    MOL-Mamba: Enhancing Molecular Representation with Structural & Electronic Insights

    Authors: Jingjing Hu, Dan Guo, Zhan Si, Deguang Liu, Yunfeng Diao, Jing Zhang, Jinxing Zhou, Meng Wang

    Abstract: Molecular representation learning plays a crucial role in various downstream tasks, such as molecular property prediction and drug design. To accurately represent molecules, Graph Neural Networks (GNNs) and Graph Transformers (GTs) have shown potential in the realm of self-supervised pretraining. However, existing approaches often overlook the relationship between molecular structure and electroni…

    Submitted 5 February, 2025; v1 submitted 20 December, 2024; originally announced December 2024.

    Comments: Accepted by AAAI2025

  12. arXiv:2412.12229  [pdf]

    q-bio.NC

    Efficacy of Temporal Interference Electrical Stimulation for Spinal Cord Injury Rehabilitation: A Case Series

    Authors: Ruidong Cheng, Yuling Shao, Xi Li, Li Zhang, Zehao Sheng, Chenyang Li, Xu Xie, Huilin Mou, Weidong Chen, Shaomin Zhang, Yuchen Xu, Minmin Wang

    Abstract: Spinal cord injury (SCI) is a debilitating condition that often results in significant motor and sensory deficits, impacting the quality of life. Current rehabilitation methods, including physical therapy and electrical stimulation, offer variable outcomes and often require invasive procedures. Temporal interference (TI) stimulation has emerged as a novel, non-invasive neuromodulation technique ca…

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 19 pages, 1 table

  13. arXiv:2410.20354  [pdf, other]

    cs.CR cs.LG q-bio.BM

    FoldMark: Protecting Protein Generative Models with Watermarking

    Authors: Zaixi Zhang, Ruofan Jin, Kaidi Fu, Le Cong, Marinka Zitnik, Mengdi Wang

    Abstract: Protein structure is key to understanding protein function and is essential for progress in bioengineering, drug discovery, and molecular biology. Recently, with the incorporation of generative AI, the power and accuracy of computational protein structure prediction/design have been improved significantly. However, ethical concerns such as copyright protection and harmful content generation (biose…

    Submitted 11 November, 2024; v1 submitted 27 October, 2024; originally announced October 2024.

  14. arXiv:2409.19645  [pdf, other]

    q-bio.BM

    FlexSBDD: Structure-Based Drug Design with Flexible Protein Modeling

    Authors: Zaixi Zhang, Mengdi Wang, Qi Liu

    Abstract: Structure-based drug design (SBDD), which aims to generate 3D ligand molecules binding to target proteins, is a fundamental task in drug discovery. Existing SBDD methods typically treat protein as rigid and neglect protein structural change when binding with ligand molecules, leading to a big gap with real-world scenarios and inferior generation qualities (e.g., many steric clashes). To bridge the…

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: Accepted by NeurIPS 2024

  15. arXiv:2409.09828  [pdf, other]

    cs.LG cs.AI q-bio.QM

    Latent Diffusion Models for Controllable RNA Sequence Generation

    Authors: Kaixuan Huang, Yukang Yang, Kaidi Fu, Yanyi Chu, Le Cong, Mengdi Wang

    Abstract: This work presents RNAdiffusion, a latent diffusion model for generating and optimizing discrete RNA sequences of variable lengths. RNA is a key intermediary between DNA and protein, exhibiting high sequence diversity and complex three-dimensional structures to support a wide range of functions. We utilize pretrained BERT-type models to encode raw RNA sequences into token-level, biologically meani…

    Submitted 2 October, 2024; v1 submitted 15 September, 2024; originally announced September 2024.

  16. arXiv:2408.05224  [pdf, ps, other]

    q-bio.BM math.OC

    Optimal Strategy for Stabilizing Protein Folding Intermediates

    Authors: Mengshou Wang, Liangrong Pengb, Baoguo Jia, Liu Hong

    Abstract: To manipulate the protein population at certain functional state through chemical stabilizers is crucial for protein-related studies. It not only plays a key role in protein structure analysis and protein folding kinetics, but also affects protein functionality to a large extent and thus has wide applications in medicine, food industry, etc. However, due to concerns about side effects or financial…

    Submitted 28 July, 2024; originally announced August 2024.

    Comments: 19 pages, 5 figures, 2 tables

    MSC Class: 34Hxx; 92Cxx

  17. arXiv:2407.20978  [pdf]

    q-bio.GN

    Are gene-by-environment interactions leveraged in multi-modality neural networks for breast cancer prediction?

    Authors: Monica Isgut, Andrew Hornback, Yunan Luo, Asma Khimani, Neha Jain, May D. Wang

    Abstract: Polygenic risk scores (PRSs) can significantly enhance breast cancer risk prediction when combined with clinical risk factor data. While many studies have explored the value-add of PRSs, little is known about the potential impact of gene-by-gene or gene-by-environment interactions towards enhancing the risk discrimination capabilities of multi-modal models combining PRSs with clinical data. In thi…

    Submitted 30 July, 2024; originally announced July 2024.

  18. arXiv:2407.12296  [pdf]

    q-bio.BM

    Discovery of novel antimicrobial peptides with notable antibacterial potency by a LLM-based foundation model

    Authors: Jike Wang, Jianwen Feng, Yu Kang, Peichen Pan, Jingxuan Ge, Yan Wang, Mingyang Wang, Zhenxing Wu, Xingcai Zhang, Jiameng Yu, Xujun Zhang, Tianyue Wang, Lirong Wen, Guangning Yan, Yafeng Deng, Hui Shi, Chang-Yu Hsieh, Zhihui Jiang, Tingjun Hou

    Abstract: Large language models (LLMs) have shown remarkable advancements in chemistry and biomedical research, acting as versatile foundation models for various tasks. We introduce AMP-Designer, an LLM-based approach for swiftly designing novel antimicrobial peptides (AMPs) with desired properties. Within 11 days, AMP-Designer achieved the de novo design of 18 AMPs with broad-spectrum activity against Gram…

    Submitted 2 March, 2025; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 43 pages, 6 figures, 5 tables. Due to the limitation "The abstract field cannot be longer than 1,920 characters", the abstract appearing here is slightly shorter than that in the PDF file

  19. arXiv:2407.07930  [pdf]

    q-bio.BM cs.LG

    Token-Mol 1.0: Tokenized drug design with large language model

    Authors: Jike Wang, Rui Qin, Mingyang Wang, Meijing Fang, Yangyang Zhang, Yuchen Zhu, Qun Su, Qiaolin Gou, Chao Shen, Odin Zhang, Zhenxing Wu, Dejun Jiang, Xujun Zhang, Huifeng Zhao, Xiaozhe Wan, Zhourui Wu, Liwei Liu, Yu Kang, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Significant interests have recently risen in leveraging sequence-based large language models (LLMs) for drug design. However, most current applications of LLMs in drug discovery lack the ability to comprehend three-dimensional (3D) structures, thereby limiting their effectiveness in tasks that explicitly involve molecular conformations. In this study, we introduced Token-Mol, a token-only 3D drug…

    Submitted 19 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

  20. arXiv:2407.07357  [pdf, ps, other]

    cs.LG q-bio.MN

    A deep graph model for the signed interaction prediction in biological network

    Authors: Shuyi Jin, Mengji Zhang, Meijie Wang, Lun Yu

    Abstract: Predicting signed interactions in biological networks is crucial for understanding drug mechanisms and facilitating drug repurposing. While deep graph models have demonstrated success in modeling complex biological systems, existing approaches often fail to distinguish between positive and negative interactions, limiting their utility for precise pharmacological predictions. In this study, we prop…

    Submitted 17 March, 2025; v1 submitted 10 July, 2024; originally announced July 2024.

  21. arXiv:2406.07662  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG q-bio.NC

    Progress Towards Decoding Visual Imagery via fNIRS

    Authors: Michel Adamic, Wellington Avelino, Anna Brandenberger, Bryan Chiang, Hunter Davis, Stephen Fay, Andrew Gregory, Aayush Gupta, Raphael Hotter, Grace Jiang, Fiona Leng, Stephen Polcyn, Thomas Ribeiro, Paul Scotti, Michelle Wang, Marley Xiong, Jonathan Xu

    Abstract: We demonstrate the possibility of reconstructing images from fNIRS brain activity and start building a prototype to match the required specs. By training an image reconstruction model on downsampled fMRI data, we discovered that cm-scale spatial resolution is sufficient for image generation. We obtained 71% retrieval accuracy with 1-cm resolution, compared to 93% on the full-resolution fMRI, and 2…

    Submitted 22 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  22. arXiv:2405.15158  [pdf, other]

    q-bio.BM cs.LG

    ProtFAD: Introducing function-aware domains as implicit modality towards protein function prediction

    Authors: Mingqing Wang, Zhiwei Nie, Yonghong He, Athanasios V. Vasilakos, Zhixiang Ren

    Abstract: Protein function prediction is currently achieved by encoding its sequence or structure, where the sequence-to-function transcendence and high-quality structural data scarcity lead to obvious performance bottlenecks. Protein domains are "building blocks" of proteins that are functionally independent, and their combinations determine the diverse biological functions. However, most existing studies…

    Submitted 2 December, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 17 pages, 7 figures, 5 tables

  23. arXiv:2405.12144  [pdf]

    q-bio.NC

    Alterations of electrocortical activity during hand movements induced by motor cortex glioma

    Authors: Yihan Wu, Tao Chang, Siliang Chen, Xiaodong Niu, Yu Li, Yuan Fang, Lei Yang, Yixuan Zong, Yaoxin Yang, Yuehua Li, Mengsong Wang, Wen Yang, Yixuan Wu, Chen Fu, Xia Fang, Yuxin Quan, Xilin Peng, Qiang Sun, Marc M. Van Hulle, Yanhui Liu, Ning Jiang, Dario Farina, Yuan Yang, Jiayuan He, Qing Mao

    Abstract: Glioma cells can reshape functional neuronal networks by hijacking neuronal synapses, leading to partial or complete neurological dysfunction. These mechanisms have been previously explored for language functions. However, the impact of glioma on sensorimotor functions is still unknown. Therefore, we recruited a control group of patients with unaffected motor cortex and a group of patients with gl…

    Submitted 20 May, 2024; originally announced May 2024.

  24. MicroBundlePillarTrack: A Python package for automated segmentation, tracking, and analysis of pillar deflection in cardiac microbundles

    Authors: Hiba Kobeissi, Xining Gao, Samuel J. DePalma, Jourdan K. Ewoldt, Miranda C. Wang, Shoshana L. Das, Javiera Jilberto, David Nordsletten, Brendon M. Baker, Christopher S. Chen, Emma Lejeune

    Abstract: Movies of human induced pluripotent stem cell (hiPSC)-derived engineered cardiac tissue (microbundles) contain abundant information about structural and functional maturity. However, extracting these data in a reproducible and high-throughput manner remains a major challenge. Furthermore, it is not straightforward to make direct quantitative comparisons across the multiple in vitro experimental pl…

    Submitted 15 August, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

    Comments: 8 main pages, 1 main figure, Supplementary Information included. microPublication Biology (2024)

    MSC Class: 92F05; 74A05

    ACM Class: J.2; J.3

  25. arXiv:2404.18443  [pdf, other]

    cs.CL cs.AI cs.IR q-bio.QM

    BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

    Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang

    Abstract: Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by ins…

    Submitted 3 October, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted to EMNLP 2024. The model and data are uploaded to https://github.com/ritaranx/BMRetriever

    Journal ref: EMNLP 2024

  26. arXiv:2404.18021  [pdf, other]

    cs.AI cs.CL cs.HC q-bio.QM

    CRISPR-GPT: An LLM Agent for Automated Design of Gene-Editing Experiments

    Authors: Kaixuan Huang, Yuanhao Qu, Henry Cousins, William A. Johnson, Di Yin, Mihir Shah, Denny Zhou, Russ Altman, Mengdi Wang, Le Cong

    Abstract: The introduction of genome engineering technology has transformed biomedical research, making it possible to make precise changes to genetic information. However, creating an efficient gene-editing system requires a deep understanding of CRISPR technology, and the complex experimental systems under investigation. While Large Language Models (LLMs) have shown promise in various tasks, they often la…

    Submitted 27 April, 2024; originally announced April 2024.

  27. arXiv:2404.02924  [pdf, other]

    q-bio.PE

    Accounting for contact network uncertainty in epidemic inferences

    Authors: Maxwell H. Wang, Jukka-Pekka Onnela

    Abstract: When modeling the dynamics of infectious disease, the incorporation of contact network information allows for the capture of the non-randomness and heterogeneity of realistic contact patterns. Oftentimes, it is assumed that the underlying contact pattern is known with perfect certainty. However, in realistic settings, the observed data often serves as an imperfect proxy of the actual contact patte…

    Submitted 15 April, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: 27 pages, 7 figures

  28. arXiv:2404.00014  [pdf]

    physics.chem-ph cs.AI q-bio.BM

    Deep Geometry Handling and Fragment-wise Molecular 3D Graph Generation

    Authors: Odin Zhang, Yufei Huang, Shichen Cheng, Mengyao Yu, Xujun Zhang, Haitao Lin, Yundian Zeng, Mingyang Wang, Zhenxing Wu, Huifeng Zhao, Zaixi Zhang, Chenqing Hua, Yu Kang, Sunliang Cui, Peichen Pan, Chang-Yu Hsieh, Tingjun Hou

    Abstract: Most earlier 3D structure-based molecular generation approaches follow an atom-wise paradigm, incrementally adding atoms to a partially built molecular fragment within protein pockets. These methods, while effective in designing tightly bound ligands, often overlook other essential properties such as synthesizability. The fragment-wise generation paradigm offers a promising solution. However, a co…

    Submitted 15 March, 2024; originally announced April 2024.

  29. arXiv:2403.00815  [pdf, other]

    cs.CL cs.AI cs.IR q-bio.OT

    RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

    Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May D. Wang, Joyce C. Ho, Carl Yang

    Abstract: We present RAM-EHR, a Retrieval AugMentation pipeline to improve clinical predictions on Electronic Health Records (EHRs). RAM-EHR first collects multiple knowledge sources, converts them into text format, and uses dense retrieval to obtain information related to medical concepts. This strategy addresses the difficulties associated with complex names for the concepts. RAM-EHR then augments the loc…

    Submitted 26 July, 2024; v1 submitted 25 February, 2024; originally announced March 2024.

    Comments: ACL 2024 (Oral)

    Journal ref: ACL 2024

  30. arXiv:2401.06173  [pdf, other]

    q-bio.BM cs.LG

    Tree Search-Based Evolutionary Bandits for Protein Sequence Optimization

    Authors: Jiahao Qiu, Hui Yuan, Jinghong Zhang, Wentao Chen, Huazheng Wang, Mengdi Wang

    Abstract: While modern biotechnologies allow synthesizing new proteins and function measurements at scale, efficiently exploring a protein sequence space and engineering it remains a daunting task due to the vast sequence space of any given protein. Protein engineering is typically conducted through an iterative process of adding mutations to the wild-type or lead sequences, recombination of mutations, and…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: AAAI 2024

  31. arXiv:2401.04246  [pdf, other]

    cs.LG q-bio.BM

    Scalable Normalizing Flows Enable Boltzmann Generators for Macromolecules

    Authors: Joseph C. Kim, David Bloore, Karan Kapoor, Jun Feng, Ming-Hong Hao, Mengdi Wang

    Abstract: The Boltzmann distribution of a protein provides a roadmap to all of its functional states. Normalizing flows are a promising tool for modeling this distribution, but current methods are intractable for typical pharmacological targets; they become computationally intractable due to the size of the system, heterogeneity of intra-molecular potential energy, and long-range interactions. To remedy the…

    Submitted 8 January, 2024; originally announced January 2024.

  32. arXiv:2312.12989  [pdf, other]

    cs.LG cs.CL q-bio.QM

    Benchmarking and Analyzing In-context Learning, Fine-tuning and Supervised Learning for Biomedical Knowledge Curation: a focused study on chemical entities of biological interest

    Authors: Emily Groves, Minhong Wang, Yusuf Abdulle, Holger Kunz, Jason Hoelscher-Obermaier, Ronin Wu, Honghan Wu

    Abstract: Automated knowledge curation for biomedical ontologies is key to ensure that they remain comprehensive, high-quality and up-to-date. In the era of foundational language models, this study compares and analyzes three NLP paradigms for curation tasks: in-context learning (ICL), fine-tuning (FT), and supervised learning (ML). Using the Chemical Entities of Biological Interest (ChEBI) database as a mo…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: 26 pages, 5 figures, 14 tables

  33. arXiv:2311.04238  [pdf, other]

    q-bio.PE

    Flexible Bayesian Inference on Partially Observed Epidemics

    Authors: Maxwell H. Wang, Jukka-Pekka Onnela

    Abstract: Individual-based models of contagious processes are useful for predicting epidemic trajectories and informing intervention strategies. In such models, the incorporation of contact network information can capture the non-randomness and heterogeneity of realistic contact dynamics. In this paper, we consider Bayesian inference on the spreading parameters of an SIR contagion on a known, static network…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 27 pages, 7 figures

  34. arXiv:2308.01241  [pdf, other]

    cs.NE q-bio.NC

    Digital Twin Brain: a simulation and assimilation platform for whole human brain

    Authors: Wenlian Lu, Longbin Zeng, Xin Du, Wenyong Zhang, Shitong Xiang, Huarui Wang, Jiexiang Wang, Mingda Ji, Yubo Hou, Minglong Wang, Yuhao Liu, Zhongyu Chen, Qibao Zheng, Ningsheng Xu, Jianfeng Feng

    Abstract: In this work, we present a computing platform named digital twin brain (DTB) that can simulate spiking neuronal networks of the whole human brain scale and more importantly, a personalized biological brain structure. In comparison to most brain simulations with a homogeneous global structure, we highlight that the sparseness, couplingness and heterogeneity in the sMRI, DTI and PET data of the brai…

    Submitted 2 August, 2023; originally announced August 2023.

    Comments: 12 pages, 11 figures

  35. arXiv:2306.11768  [pdf, other]

    q-bio.QM cs.CE cs.LG

    Geometric Deep Learning for Structure-Based Drug Design: A Survey

    Authors: Zaixi Zhang, Jiaxian Yan, Yining Huang, Qi Liu, Enhong Chen, Mengdi Wang, Marinka Zitnik

    Abstract: Structure-based drug design (SBDD) leverages the three-dimensional geometry of proteins to identify potential drug candidates. Traditional approaches, rooted in physicochemical modeling and domain expertise, are often resource-intensive. Recent advancements in geometric deep learning, which effectively integrate and process 3D geometric data, alongside breakthroughs in accurate protein structure p…

    Submitted 15 November, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 28 pages, under review

  36. arXiv:2212.06394  [pdf]

    q-bio.NC

    Tangent functional connectomes uncover more unique phenotypic traits

    Authors: Kausar Abbas, Mintao Liu, Michael Wang, Duy Duong-Tran, Uttara Tipnis, Enrico Amico, Alan D. Kaplan, Mario Dzemidzic, David Kareken, Beau M. Ances, Jaroslaw Harezlak, Joaquín Goñi

    Abstract: Functional connectomes (FCs) contain pairwise estimations of functional couplings based on pairs of brain regions activity. FCs are commonly represented as correlation matrices that are symmetric positive definite (SPD) lying on or inside the SPD manifold. Since the geometry on the SPD manifold is non-Euclidean, the inter-related entries of FCs undermine the use of Euclidean-based distances. By pr…

    Submitted 9 June, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: 31 pages, 10 figures, 2 tables

  37. arXiv:2211.05658  [pdf, other]

    q-bio.QM cs.NE q-bio.NC

    Multi-objective optimization via evolutionary algorithm (MOVEA) for high-definition transcranial electrical stimulation of the human brain

    Authors: Mo Wang, Kexin Lou, Zeming Liu, Pengfei Wei, Quanying Liu

    Abstract: Designing a transcranial electrical stimulation (TES) strategy requires considering multiple objectives, such as intensity in the target area, focality, stimulation depth, and avoidance zone, which are often mutually exclusive. A computational framework for optimizing different strategies and comparing trade-offs between these objectives is currently lacking. In this paper, we propose a general fr…

    Submitted 3 April, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

    Journal ref: NeuroImage, Volume 280, 2020

  38. arXiv:2210.05713  [pdf, other]

    q-bio.NC cs.NE eess.SP

    Explainable fMRI-based Brain Decoding via Spatial Temporal-pyramid Graph Convolutional Network

    Authors: Ziyuan Ye, Youzhi Qu, Zhichao Liang, Mo Wang, Quanying Liu

    Abstract: Brain decoding, aiming to identify the brain states using neural activity, is important for cognitive neuroscience and neural engineering. However, existing machine learning methods for fMRI-based brain decoding either suffer from low classification performance or poor explainability. Here, we address this issue by proposing a biologically inspired architecture, Spatial Temporal-pyramid Graph Conv…

    Submitted 8 October, 2022; originally announced October 2022.

  39. arXiv:2208.04314  [pdf]

    q-bio.QM cs.LG

    TripHLApan: predicting HLA molecules binding peptides based on triple coding matrix and transfer learning

    Authors: Meng Wang, Chuqi Lei, Jianxin Wang, Yaohang Li, Min Li

    Abstract: Human leukocyte antigen (HLA) is an important molecule family in the field of human immunity, which recognizes foreign threats and triggers immune responses by presenting peptides to T cells. In recent years, the synthesis of tumor vaccines to induce specific immune responses has become the forefront of cancer treatment. Computationally modeling the binding patterns between peptide and HLA can gre…

    Submitted 6 August, 2022; originally announced August 2022.

    Comments: 25 pages, 7 figures

  40. arXiv:2206.12240  [pdf, other]

    q-bio.BM cs.LG

    PSP: Million-level Protein Sequence Dataset for Protein Structure Prediction

    Authors: Sirui Liu, Jun Zhang, Haotian Chu, Min Wang, Boxin Xue, Ningxi Ni, Jialiang Yu, Yuhao Xie, Zhenyu Chen, Mengyun Chen, Yuan Liu, Piya Patra, Fan Xu, Jie Chen, Zidong Wang, Lijiang Yang, Fan Yu, Lei Chen, Yi Qin Gao

    Abstract: Proteins are essential components of human life, and their structures are important for function and mechanism analysis. Recent work has shown the potential of AI-driven methods for protein structure prediction. However, the development of new models is restricted by the lack of datasets and benchmark training procedures. To the best of our knowledge, the existing open source datasets are far less to…

    Submitted 24 June, 2022; originally announced June 2022.

  41. arXiv:2109.00123  [pdf, ps, other]

    q-bio.TO physics.bio-ph

    Regulatory Feedback Effects on Tissue Growth Dynamics in a Two-Stage Cell Lineage Model

    Authors: Mao-Xiang Wang, Arthur Lander, Pik-Yin Lai

    Abstract: Identifying the mechanism of intercellular feedback regulation is critical for the basic understanding of tissue growth control in organisms. In this paper, we analyze a tissue growth model consisting of a single lineage of two cell types regulated by negative feedback signalling molecules that undergo spatial diffusion. By deriving the fixed points for the uniform steady states and carrying out l…

    Submitted 31 August, 2021; originally announced September 2021.

    Comments: to be published in Physical Review E

  42. arXiv:2104.10878  [pdf, other]

    stat.AP q-bio.PE

    Comparing regional and provincial-wide COVID-19 models with physical distancing in British Columbia

    Authors: Geoffrey McGregor, Jennifer Tippett, Andy T. S. Wan, Mengxiao Wang, Samuel W. K. Wong

    Abstract: We study the effects of physical distancing measures on the spread of COVID-19 in regional areas within British Columbia, using the reported cases of the five provincial Health Authorities. Building on the Bayesian epidemiological model of Anderson et al. (2020), we propose a hierarchical regional Bayesian model with time-varying regional parameters from March to December of 2020. In the absen…

    Submitted 13 November, 2021; v1 submitted 22 April, 2021; originally announced April 2021.

    Comments: 35 pages, 16 figures

    Journal ref: AIMS Mathematics, 2022, 7(4): 6743-6778

  43. arXiv:2104.01474  [pdf, other]

    q-bio.NC

    Thalamocortical contribution to solving credit assignment in neural systems

    Authors: Mien Brabeeba Wang, Michael M. Halassa

    Abstract: Animal brains evolved to optimize behavior in dynamically changing environments, selecting actions that maximize future rewards. A large body of experimental work indicates that such optimization changes the wiring of neural circuits, appropriately mapping environmental input onto behavioral outputs. A major unsolved scientific question is how optimal wiring adjustments, which must target the conn…

    Submitted 3 April, 2021; originally announced April 2021.

  44. arXiv:2103.00399  [pdf]

    q-bio.BM

    Hydrophobic interaction determines docking affinity of SARS CoV 2 variants with antibodies

    Authors: Jiacheng Li, Chengyu Hou, Menghao Wang, Chencheng Liao, Shuai Guo, Liping Shi, Xiaoliang Ma, Hongchi Zhang, Shenda Jiang, Bing Zheng, Lin Ye, Lin Yang, Xiaodong He

    Abstract: Preliminary epidemiologic, phylogenetic and clinical findings suggest that several novel severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) variants have increased transmissibility and decreased efficacy of several existing vaccines. Four mutations in the receptor-binding domain (RBD) of the spike protein are reported to contribute to increased transmission. Understanding physical m…

    Submitted 28 February, 2021; originally announced March 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2008.11883

  45. arXiv:2102.13276  [pdf, other]

    stat.ML cs.LG q-bio.PE

    Spectral Top-Down Recovery of Latent Tree Models

    Authors: Yariv Aizenbud, Ariel Jaffe, Meng Wang, Amber Hu, Noah Amsel, Boaz Nadler, Joseph T. Chang, Yuval Kluger

    Abstract: Modeling the distribution of high dimensional data by a latent tree graphical model is a prevalent approach in multiple scientific domains. A common task is to infer the underlying tree structure, given only observations of its terminal nodes. Many algorithms for tree recovery are computationally intensive, which limits their applicability to trees of moderate size. For large trees, a common appro…

    Submitted 7 December, 2021; v1 submitted 25 February, 2021; originally announced February 2021.

  46. arXiv:2102.05440  [pdf]

    physics.bio-ph q-bio.BM

    Protein corona critically affects the bio-behaviors of SARS-CoV-2

    Authors: Yue-wen Yin, Yan-jing Sheng, Min Wang, Song-di Ni, Hong-ming Ding, Yu-qiang Ma

    Abstract: The outbreak of the coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has become a worldwide public health crisis. When SARS-CoV-2 enters the biological fluids in the human body, different types of biomolecules (in particular proteins) may adsorb on its surface and alter its infection ability. Although great efforts have recently been de…

    Submitted 10 February, 2021; originally announced February 2021.

    Comments: 18 pages, 7 figures

  47. arXiv:2005.14669  [pdf, other]

    q-bio.BM q-bio.QM

    Mutations strengthened SARS-CoV-2 infectivity

    Authors: Jiahui Chen, Rui Wang, Menglun Wang, Guo-Wei Wei

    Abstract: Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) infectivity is a major concern in coronavirus disease 2019 (COVID-19) prevention and economic reopening. However, rigorous determination of SARS-CoV-2 infectivity is essentially impossible owing to its continuous evolution with over 13752 single nucleotide polymorphism (SNP) variants in six different subtypes. We develop an advanced mac…

    Submitted 27 May, 2020; originally announced May 2020.

    Comments: 24 pages, 2 tables and 19 figures

  48. arXiv:2005.11935  [pdf]

    q-bio.QM cs.HC

    A Novel Approach of using AR and Smart Surgical Glasses Supported Trauma Care

    Authors: Anurag Lal, Ming-Hsien Hu, Pei-Yuan Lee, Min Liang Wang

    Abstract: BACKGROUND: Augmented reality (AR) is gaining popularity in various fields such as computer gaming and medical education. However, there are still few applications in real surgeries. Orthopedic surgical applications are currently limited and underdeveloped. - METHODS: The clinical validation was prepared with the currently available AR equipment and software. A total of 1 Vertebroplasty, 2 ORIF Pelvi…

    Submitted 25 May, 2020; originally announced May 2020.

    Comments: 10 pages, 9 Figures, Conference. arXiv admin note: text overlap with arXiv:1801.01560 by other authors

  49. arXiv:2002.07096  [pdf]

    physics.med-ph q-bio.PE

    Visual Data Analysis and Simulation Prediction for COVID-19

    Authors: Baoquan Chen, Mingyi Shi, Xingyu Ni, Liangwang Ruan, Hongda Jiang, Heyuan Yao, Mengdi Wang, Zhenhua Song, Qiang Zhou, Tong Ge

    Abstract: The COVID-19 (formerly, 2019-nCoV) epidemic has become a global health emergency, and WHO has declared it a PHEIC. China has been hit hardest since the outbreak of the virus, which some experts date as far back as late November. It was not until January 23rd that the Wuhan government finally recognized the severity of the epidemic and took a drastic measure to curtail the virus spread b…

    Submitted 6 March, 2020; v1 submitted 14 February, 2020; originally announced February 2020.

    Comments: 19 pages, 21 figures, revised English version and originally Chinese version

  50. arXiv:1911.03839  [pdf, ps, other]

    q-bio.QM cs.CY cs.LG stat.ML

    In Vitro Fertilization (IVF) Cumulative Pregnancy Rate Prediction from Basic Patient Characteristics

    Authors: Bo Zhang, Yuqi Cui, Meng Wang, Jingjing Li, Lei Jin, Dongrui Wu

    Abstract: Tens of millions of women suffer from infertility worldwide each year. In vitro fertilization (IVF) is the best choice for many such patients. However, IVF is expensive, time-consuming, and both physically and emotionally demanding. The first question that a patient usually asks before the IVF is how likely she will conceive, given her basic medical examination information. This paper proposes thr…

    Submitted 9 November, 2019; originally announced November 2019.