Skip to main content

Showing 1–26 of 26 results for author: Xie, L

Searching in archive q-bio. Search in all archives.
.
  1. arXiv:2411.03743  [pdf, other

    cs.AI q-bio.QM

    Automating Exploratory Proteomics Research via Language Models

    Authors: Ning Ding, Shang Qu, Linhai Xie, Yifei Li, Zaoqu Liu, Kaiyan Zhang, Yibai Xiong, Yuxin Zuo, Zhangren Chen, Ermo Hua, Xingtai Lv, Youbang Sun, Yang Li, Dong Li, Fuchu He, Bowen Zhou

    Abstract: With the development of artificial intelligence, its contribution to science is evolving from simulating a complex problem to automating entire research processes and producing novel discoveries. Achieving this advancement requires both specialized general models grounded in real-world scientific data and iterative, exploratory frameworks that mirror human scientific methodologies. In this paper,… ▽ More

    Submitted 6 November, 2024; originally announced November 2024.

  2. arXiv:2411.00054  [pdf, other

    q-bio.GN cs.AI cs.LG

    eDOC: Explainable Decoding Out-of-domain Cell Types with Evidential Learning

    Authors: Chaochen Wu, Meiyun Zuo, Lei Xie

    Abstract: Single-cell RNA-seq (scRNA-seq) technology is a powerful tool for unraveling the complexity of biological systems. One of essential and fundamental tasks in scRNA-seq data analysis is Cell Type Annotation (CTA). In spite of tremendous efforts in developing machine learning methods for this problem, several challenges remains. They include identifying Out-of-Domain (OOD) cell types, quantifying the… ▽ More

    Submitted 30 October, 2024; originally announced November 2024.

    Comments: under review

  3. arXiv:2410.05278  [pdf, other

    q-bio.BM cs.AI cs.LG cs.NE

    Dumpling GNN: Hybrid GNN Enables Better ADC Payload Activity Prediction Based on Chemical Structure

    Authors: Shengjie Xu, Lingxi Xie

    Abstract: Antibody-drug conjugates (ADCs) have emerged as a promising class of targeted cancer therapeutics, but the design and optimization of their cytotoxic payloads remain challenging. This study introduces DumplingGNN, a novel hybrid Graph Neural Network architecture specifically designed for predicting ADC payload activity based on chemical structure. By integrating Message Passing Neural Networks (MP… ▽ More

    Submitted 23 September, 2024; originally announced October 2024.

  4. arXiv:2312.17495  [pdf

    cs.LG physics.bio-ph q-bio.BM

    Integrating Chemical Language and Molecular Graph in Multimodal Fused Deep Learning for Drug Property Prediction

    Authors: Xiaohua Lu, Liangxu Xie, Lei Xu, Rongzhi Mao, Shan Chang, Xiaojun Xu

    Abstract: Accurately predicting molecular properties is a challenging but essential task in drug discovery. Recently, many mono-modal deep learning methods have been successfully applied to molecular property prediction. However, the inherent limitation of mono-modal learning arises from relying solely on one modality of molecular representation, which restricts a comprehensive understanding of drug molecul… ▽ More

    Submitted 12 September, 2024; v1 submitted 29 December, 2023; originally announced December 2023.

  5. arXiv:2312.03016  [pdf, other

    q-bio.QM cs.CL cs.LG

    Protein Language Model-Powered 3D Ligand Binding Site Prediction from Protein Sequence

    Authors: Shuo Zhang, Lei Xie

    Abstract: Prediction of ligand binding sites of proteins is a fundamental and important task for understanding the function of proteins and screening potential drugs. Most existing methods require experimentally determined protein holo-structures as input. However, such structures can be unavailable on novel or less-studied proteins. To tackle this limitation, we propose LaMPSite, which only takes protein s… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted by the AI for Science (AI4Science) Workshop and the New Frontiers of AI for Drug Discovery and Development (AI4D3) Workshop at NeurIPS 2023

  6. A Universal Framework for Accurate and Efficient Geometric Deep Learning of Molecular Systems

    Authors: Shuo Zhang, Yang Liu, Lei Xie

    Abstract: Molecular sciences address a wide range of problems involving molecules of different types and sizes and their complexes. Recently, geometric deep learning, especially Graph Neural Networks, has shown promising performance in molecular science applications. However, most existing works often impose targeted inductive biases to a specific molecular system, and are inefficient when applied to macrom… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: Published in Scientific Reports (DOI: 10.1038/s41598-023-46382-8)

    Journal ref: Scientific Reports 13, 19171 (2023)

  7. arXiv:2307.12491  [pdf, other

    cs.LG q-bio.BM

    Learning Universal and Robust 3D Molecular Representations with Graph Convolutional Networks

    Authors: Shuo Zhang, Yang Liu, Li Xie, Lei Xie

    Abstract: To learn accurate representations of molecules, it is essential to consider both chemical and geometric features. To encode geometric information, many descriptors have been proposed in constrained circumstances for specific types of molecules and do not have the properties to be ``robust": 1. Invariant to rotations and translations; 2. Injective when embedding molecular structures. In this work,… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: Preprint. Work in progress

  8. arXiv:2304.04673  [pdf

    q-bio.NC cs.AI

    Regional Deep Atrophy: a Self-Supervised Learning Method to Automatically Identify Regions Associated With Alzheimer's Disease Progression From Longitudinal MRI

    Authors: Mengjin Dong, Long Xie, Sandhitsu R. Das, Jiancong Wang, Laura E. M. Wisse, Robin deFlores, David A. Wolk, Paul A. Yushkevich

    Abstract: Longitudinal assessment of brain atrophy, particularly in the hippocampus, is a well-studied biomarker for neurodegenerative diseases, such as Alzheimer's disease (AD). In clinical trials, estimation of brain progressive rates can be applied to track therapeutic efficacy of disease modifying treatments. However, most state-of-the-art measurements calculate changes directly by segmentation and/or d… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: Submitted to NeuroImage for review

  9. arXiv:2212.06772  [pdf

    q-bio.OT

    A Tribute to Phil Bourne -- Scientist and Human

    Authors: Cameron Mura, Emma Candelier, Lei Xie

    Abstract: This Special Issue of Biomolecules, commissioned in honor of Dr. Philip E. Bourne, focuses on a new field of biomolecular data science. In this brief retrospective, we consider the arc of Bourne's 40-year scientific and professional career, particularly as it relates to the origins of this new field.

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 5 pages, 1 figure

  10. arXiv:2210.16392  [pdf, other

    cs.LG q-bio.BM q-bio.QM

    Physics-aware Graph Neural Network for Accurate RNA 3D Structure Prediction

    Authors: Shuo Zhang, Yang Liu, Lei Xie

    Abstract: Biological functions of RNAs are determined by their three-dimensional (3D) structures. Thus, given the limited number of experimentally determined RNA structures, the prediction of RNA structures will facilitate elucidating RNA functions and RNA-targeted drug discovery, but remains a challenging task. In this work, we propose a Graph Neural Network (GNN)-based scoring function trained only with t… ▽ More

    Submitted 23 July, 2023; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted by the Machine Learning for Structural Biology Workshop (MLSB) at the 36th Conference on Neural Information Processing Systems (NeurIPS 2022)

  11. arXiv:2206.02789  [pdf, other

    q-bio.BM cs.LG

    Efficient and Accurate Physics-aware Multiplex Graph Neural Networks for 3D Small Molecules and Macromolecule Complexes

    Authors: Shuo Zhang, Yang Liu, Lei Xie

    Abstract: Recent advances in applying Graph Neural Networks (GNNs) to molecular science have showcased the power of learning three-dimensional (3D) structure representations with GNNs. However, most existing GNNs suffer from the limitations of insufficient modeling of diverse interactions, computational expensive operations, and ignorance of vectorial values. Here, we tackle these limitations by proposing a… ▽ More

    Submitted 18 November, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

    Comments: An enhanced version of this preprint has been published in Scientific Reports (DOI: 10.1038/s41598-023-46382-8)

  12. arXiv:2203.08820  [pdf, other

    q-bio.QM cs.CV cs.LG

    DePS: An improved deep learning model for de novo peptide sequencing

    Authors: Cheng Ge, Yi Lu, Jia Qu, Liangxu Xie, Feng Wang, Hong Zhang, Ren Kong, Shan Chang

    Abstract: De novo peptide sequencing from mass spectrometry data is an important method for protein identification. Recently, various deep learning approaches were applied for de novo peptide sequencing and DeepNovoV2 is one of the represetative models. In this study, we proposed an enhanced model, DePS, which can improve the accuracy of de novo peptide sequencing even with missing signal peaks or large num… ▽ More

    Submitted 16 March, 2022; originally announced March 2022.

    Comments: 10 pages, 7 figures

  13. arXiv:2201.08894  [pdf

    q-bio.BM cs.LG

    Reinforcement Learning for Personalized Drug Discovery and Design for Complex Diseases: A Systems Pharmacology Perspective

    Authors: Ryan K. Tan, Yang Liu, Lei Xie

    Abstract: Many multi-genic systemic diseases such as neurological disorders, inflammatory diseases, and the majority of cancers do not have effective treatments yet. Reinforcement learning powered systems pharmacology is a potentially effective approach to design personalized therapies for untreatable complex diseases. In this survey, state-of-the-art reinforcement learning methods and their latest applicat… ▽ More

    Submitted 23 February, 2022; v1 submitted 21 January, 2022; originally announced January 2022.

    Comments: 26 pages, 3 figure

  14. arXiv:2112.13153  [pdf

    q-bio.QM

    Mathematical Properties of Incremental Effect Additivity and Other Synergy Theories

    Authors: Leonid Hanin, Liyang Xie, Rainer Sachs

    Abstract: Synergy theories for multi-component agent mixtures use 1-agent dose-effect relations, assumed known from analyzing previous 1-agent experiments, to calculate baseline Neither-Synergy-Nor-Antagonism mixture dose-effect relations. The most commonly used synergy theory, Simple Effect Additivity, is not self-consistent mathematically. Many nonlinear alternatives have been suggested, almost all of whi… ▽ More

    Submitted 24 December, 2021; originally announced December 2021.

    Comments: 53 pages including 14 figures and supplementary materials

    MSC Class: 34A99; 92-08; 92F05; 92F99

  15. arXiv:2111.14283  [pdf, other

    q-bio.QM cs.AI cs.LG

    Exploration of Dark Chemical Genomics Space via Portal Learning: Applied to Targeting the Undruggable Genome and COVID-19 Anti-Infective Polypharmacology

    Authors: Tian Cai, Li Xie, Muge Chen, Yang Liu, Di He, Shuo Zhang, Cameron Mura, Philip E. Bourne, Lei Xie

    Abstract: Advances in biomedicine are largely fueled by exploring uncharted territories of human biology. Machine learning can both enable and accelerate discovery, but faces a fundamental hurdle when applied to unseen data with distributions that differ from previously observed ones -- a common dilemma in scientific inquiry. We have developed a new deep learning framework, called {\textit{Portal Learning}}… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: 18 pages, 6 figures

    MSC Class: 68T07

  16. arXiv:2102.00538  [pdf, other

    cs.LG q-bio.GN

    CODE-AE: A Coherent De-confounding Autoencoder for Predicting Patient-Specific Drug Response From Cell Line Transcriptomics

    Authors: Di He, Lei Xie

    Abstract: Accurate and robust prediction of patient's response to drug treatments is critical for developing precision medicine. However, it is often difficult to obtain a sufficient amount of coherent drug response data from patients directly for training a generalized machine learning model. Although the utilization of rich cell line data provides an alternative solution, it is challenging to transfer the… ▽ More

    Submitted 31 January, 2021; originally announced February 2021.

  17. arXiv:2011.07457  [pdf, other

    cs.LG physics.comp-ph q-bio.QM

    Molecular Mechanics-Driven Graph Neural Network with Multiplex Graph for Molecular Structures

    Authors: Shuo Zhang, Yang Liu, Lei Xie

    Abstract: The prediction of physicochemical properties from molecular structures is a crucial task for artificial intelligence aided molecular design. A growing number of Graph Neural Networks (GNNs) have been proposed to address this challenge. These models improve their expressive power by incorporating auxiliary information in molecules while inevitably increase their computational complexity. In this wo… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: Accepted by the Machine Learning for Structural Biology Workshop (MLSB 2020) and the Machine Learning for Molecules Workshop (ML4Molecules 2020) at the 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  18. arXiv:2010.12948  [pdf

    cs.LG eess.IV q-bio.NC

    DeepAtrophy: Teaching a Neural Network to Differentiate Progressive Changes from Noise on Longitudinal MRI in Alzheimer's Disease

    Authors: Mengjin Dong, Long Xie, Sandhitsu R. Das, Jiancong Wang, Laura E. M. Wisse, Robin deFlores, David A. Wolk, Paul Yushkevich

    Abstract: Volume change measures derived from longitudinal MRI (e.g. hippocampal atrophy) are a well-studied biomarker of disease progression in Alzheimer's Disease (AD) and are used in clinical trials to track the therapeutic efficacy of disease-modifying treatments. However, longitudinal MRI change measures can be confounded by non-biological factors, such as different degrees of head motion and susceptib… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: Submitted to a journal, IF ~ 6

  19. arXiv:2010.04824  [pdf, other

    cs.LG q-bio.GN

    A Cross-Level Information Transmission Network for Predicting Phenotype from New Genotype: Application to Cancer Precision Medicine

    Authors: Di He, Lei Xie

    Abstract: An unsolved fundamental problem in biology and ecology is to predict observable traits (phenotypes) from a new genetic constitution (genotype) of an organism under environmental perturbations (e.g., drug treatment). The emergence of multiple omics data provides new opportunities but imposes great challenges in the predictive modeling of genotype-phenotype associations. Firstly, the high-dimensiona… ▽ More

    Submitted 9 October, 2020; originally announced October 2020.

  20. Structural insights into characterizing binding sites in EGFR kinase mutants

    Authors: Zheng Zhao, Lei Xie, Philip E. Bourne

    Abstract: Over the last two decades epidermal growth factor receptor (EGFR) kinase has become an important target to treat non-small cell lung cancer (NSCLC). Currently, three generations of EGFR kinase-targeted small molecule drugs have been FDA approved. They nominally produce a response at the start of treatment and lead to a substantial survival benefit for patients. However, long-term treatment results… ▽ More

    Submitted 26 December, 2018; originally announced December 2018.

    Comments: 32 pages, 7 figures

    Journal ref: Journal of Chemical Information and Modeling, 2018

  21. arXiv:1711.01705  [pdf

    q-bio.BM physics.bio-ph physics.chem-ph

    The Activation Entropy Change in Enzymatic Reaction Catalyzed by Isochorismate-Pyruvate Lyase of Pseudomonas Aeruginosa PchB

    Authors: Liangxu Xie, Zhe-Ning Chen, Mingjun Yang

    Abstract: The elucidation of entropic contribution to enzyme catalysis has been debated over decades. The recent experimentally measured activation enthalpy and entropy, for chorismate rearrangement reaction in PchB brings up a hotly debated issue whether the chorismate mutase catalyzed reaction is entropy-driven reaction. Extensive configurational sampling combined with quantum mechanics/molecular mechanic… ▽ More

    Submitted 5 November, 2017; originally announced November 2017.

  22. Automated deconvolution of structured mixtures from bulk tumor genomic data

    Authors: Theodore Roman, Lu Xie, Russell Schwartz

    Abstract: Motivation: As cancer researchers have come to appreciate the importance of intratumor heterogeneity, much attention has focused on the challenges of accurately profiling heterogeneity in individual patients. Experimental technologies for directly profiling genomes of single cells are rapidly improving, but they are still impractical for large-scale sampling. Bulk genomic assays remain the standar… ▽ More

    Submitted 8 April, 2016; originally announced April 2016.

    Comments: Paper accepted at RECOMB-CCB 2016

  23. arXiv:1507.02148  [pdf

    q-bio.QM physics.bio-ph

    Derivative-free optimization of rate parameters of capsid assembly models from bulk in vitro data

    Authors: Lu Xie, Gregory R. Smith, Russell Schwartz

    Abstract: The assembly of virus capsids from free coat proteins proceeds by a complicated cascade of association and dissociation steps, the great majority of which cannot be directly experimentally observed. This has made capsid assembly a rich field for computational models to attempt to fill the gaps in what is experimentally observable. Nonetheless, accurate simulation predictions depend on accurate mod… ▽ More

    Submitted 7 July, 2015; originally announced July 2015.

  24. arXiv:1210.5234  [pdf

    q-bio.QM q-bio.MN

    Avoid Internal Loops in Steady State Flux Space Sampling

    Authors: Lu Xie

    Abstract: As a widely used method in metabolic network studies, Monte-Carlo sampling in the steady state flux space is known for its flexibility and convenience of carrying out different purposes, simply by alternating constraints or objective functions, or appending post processes. Recently the concept of a non-linear constraint based on the second thermodynamic law, known as "Loop Law", is challenging cur… ▽ More

    Submitted 18 October, 2012; originally announced October 2012.

    Comments: arXiv admin note: substantial text overlap with arXiv:0711.1193

  25. Implications of 3-step swimming patterns in bacterial chemotaxis

    Authors: Tuba Altindal, Li Xie, Xiao-Lun Wu

    Abstract: We recently found that marine bacteria Vibrio alginolyticus execute a cyclic 3-step (run- reverse-flick) motility pattern that is distinctively different from the 2-step (run-tumble) pattern of Escherichia coli. How this novel swimming pattern is regulated by cells of V. alginolyticus is not currently known, but its significance for bacterial chemotaxis is self- evident and will be delineated here… ▽ More

    Submitted 14 November, 2010; originally announced November 2010.

    Comments: 18 pages, 4 figures, submitted to biophysical journal

  26. arXiv:0711.1193  [pdf

    q-bio.OT

    Imposition of Different Optimizing Object with Non-Linear Constraints on Flux Sampling and Elimination of Free Futile Pathways

    Authors: Lu Xie, Yi Zhang

    Abstract: Constraint-based modeling has been widely used on metabolic networks analysis, such as biosynthetic prediction and flux optimization. The linear constraints, like mass conservation constraint, reversibility constraint, biological capacity constraint, can be imposed on linear algorithms. However, recently a non-linear constraint based on the second thermodynamic law, known as "loop law", has emer… ▽ More

    Submitted 28 November, 2009; v1 submitted 7 November, 2007; originally announced November 2007.

    Comments: 31 pages including 13 figures and 1 table, in preparation