Showing 1–11 of 11 results for author: Yao, J

Searching in archive q-bio.
  1. arXiv:2507.07201  [pdf, ps, other]

    q-bio.BM cs.AI cs.LG

    MODA: A Unified 3D Diffusion Framework for Multi-Task Target-Aware Molecular Generation

    Authors: Dong Xu, Zhangfan Yang, Sisi Yuan, Jenna Xinyi Yao, Jiangqiang Li, Junkai Ji

    Abstract: Three-dimensional molecular generators based on diffusion models can now reach near-crystallographic accuracy, yet they remain fragmented across tasks. SMILES-only inputs, two-stage pretrain-finetune pipelines, and one-task-one-model practices hinder stereochemical fidelity, task alignment, and zero-shot transfer. We introduce MODA, a diffusion framework that unifies fragment growing, linker desig…

    Submitted 9 July, 2025; originally announced July 2025.

  2. arXiv:2504.00020  [pdf, other]

    q-bio.GN cs.AI cs.LG

    Celler: A Genomic Language Model for Long-Tailed Single-Cell Annotation

    Authors: Huan Zhao, Yiming Liu, Jina Yao, Ling Xiong, Zexin Zhou, Zixing Zhang

    Abstract: Recent breakthroughs in single-cell technology have ushered in unparalleled opportunities to decode the molecular intricacy of intricate biological systems, especially those linked to diseases unique to humans. However, these progressions have also ushered in novel obstacles, specifically the efficient annotation of extensive, long-tailed single-cell data pertaining to disease conditions. To effec…

    Submitted 27 March, 2025; originally announced April 2025.

  3. arXiv:2502.15867  [pdf]

    q-bio.OT cs.AI

    Strategic priorities for transformative progress in advancing biology with proteomics and artificial intelligence

    Authors: Yingying Sun, Jun A, Zhiwei Liu, Rui Sun, Liujia Qian, Samuel H. Payne, Wout Bittremieux, Markus Ralser, Chen Li, Yi Chen, Zhen Dong, Yasset Perez-Riverol, Asif Khan, Chris Sander, Ruedi Aebersold, Juan Antonio Vizcaíno, Jonathan R Krieger, Jianhua Yao, Han Wen, Linfeng Zhang, Yunping Zhu, Yue Xuan, Benjamin Boyang Sun, Liang Qiao, Henning Hermjakob, et al. (37 additional authors not shown)

    Abstract: Artificial intelligence (AI) is transforming scientific research, including proteomics. Advances in mass spectrometry (MS)-based proteomics data quality, diversity, and scale, combined with groundbreaking AI techniques, are unlocking new challenges and opportunities in biological discovery. Here, we highlight key areas where AI is driving innovation, from data analysis to new biological insights.…

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: 28 pages, 2 figures, perspective in AI proteomics

  4. arXiv:2502.14934  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Fast and Accurate Blind Flexible Docking

    Authors: Zizhuo Zhang, Lijun Wu, Kaiyuan Gao, Jiangchao Yao, Tao Qin, Bo Han

    Abstract: Molecular docking, which predicts the bound structures of small molecules (ligands) to their protein targets, plays a vital role in drug discovery. However, existing docking methods often face limitations: they either overlook crucial structural changes by assuming protein rigidity or suffer from low computational efficiency due to their reliance on generative models for structure sampling. To addre…

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: 25 pages, Accepted by ICLR 2025

  5. arXiv:2404.16880  [pdf, other]

    q-bio.QM cs.AI cs.CL

    Atomas: Hierarchical Alignment on Molecule-Text for Unified Molecule Understanding and Generation

    Authors: Yikun Zhang, Geyan Ye, Chaohao Yuan, Bo Han, Long-Kai Huang, Jianhua Yao, Wei Liu, Yu Rong

    Abstract: Molecule-and-text cross-modal representation learning has emerged as a promising direction for enhancing the quality of molecular representation, thereby improving performance in various scientific fields. However, most approaches employ a global alignment approach to learn the knowledge from different modalities that may fail to capture fine-grained information, such as molecule-and-text fragment…

    Submitted 3 March, 2025; v1 submitted 23 April, 2024; originally announced April 2024.

  6. arXiv:2404.16866  [pdf, other]

    q-bio.QM cs.AI cs.LG

    Annotation-guided Protein Design with Multi-Level Domain Alignment

    Authors: Chaohao Yuan, Songyou Li, Geyan Ye, Yikun Zhang, Long-Kai Huang, Wenbing Huang, Wei Liu, Jianhua Yao, Yu Rong

    Abstract: The core challenge of de novo protein design lies in creating proteins with specific functions or properties, guided by certain conditions. Current models explore to generate protein using structural and evolutionary guidance, which only provide indirect conditions concerning functions and properties. However, textual annotations of proteins, especially the annotations for protein domains, which d…

    Submitted 12 December, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Accepted by KDD 2025

  7. arXiv:2402.16894  [pdf, other]

    q-bio.NC eess.IV

    Topological Analysis of Mouse Brain Vasculature via 3D Light-sheet Microscopy Images

    Authors: Jiachen Yao, Nina Hagemann, Qiaojie Xiong, Jianxu Chen, Dirk M. Hermann, Chao Chen

    Abstract: Vascular networks play a crucial role in understanding brain functionalities. Brain integrity and function, neuronal activity and plasticity, which are crucial for learning, are actively modulated by their local environments, specifically vascular networks. With recent developments in high-resolution 3D light-sheet microscopy imaging together with tissue processing techniques, it becomes feasible…

    Submitted 23 February, 2024; originally announced February 2024.

  8. arXiv:2311.01276  [pdf, other]

    cs.LG q-bio.QM

    Neural Atoms: Propagating Long-range Interaction in Molecular Graphs through Efficient Communication Channel

    Authors: Xuan Li, Zhanke Zhou, Jiangchao Yao, Yu Rong, Lu Zhang, Bo Han

    Abstract: Graph Neural Networks (GNNs) have been widely adopted for drug discovery with molecular graphs. Nevertheless, current GNNs mainly excel in leveraging short-range interactions (SRI) but struggle to capture long-range interactions (LRI), both of which are crucial for determining molecular properties. To tackle this issue, we propose a method to abstract the collective information of atomic groups in…

    Submitted 31 March, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

  9. arXiv:2307.05628  [pdf, other]

    q-bio.GN cs.LG

    DNAGPT: A Generalized Pre-trained Tool for Versatile DNA Sequence Analysis Tasks

    Authors: Daoan Zhang, Weitong Zhang, Yu Zhao, Jianguo Zhang, Bing He, Chenchen Qin, Jianhua Yao

    Abstract: Pre-trained large language models demonstrate potential in extracting information from DNA sequences, yet adapting to a variety of tasks and data modalities remains a challenge. To address this, we propose DNAGPT, a generalized DNA pre-training model trained on over 200 billion base pairs from all mammals. By enhancing the classic GPT model with a binary classification task (DNA sequence order), a…

    Submitted 30 August, 2023; v1 submitted 11 July, 2023; originally announced July 2023.

  10. arXiv:1910.06659  [pdf]

    q-bio.QM q-bio.NC

    Ballistocardiogram artifact reduction in simultaneous EEG-fMRI using deep learning

    Authors: J. R. McIntosh, J. Yao, Linbi Hong, J. Faller, P. Sajda

    Abstract: Objective: The concurrent recording of electroencephalography (EEG) and functional magnetic resonance imaging (fMRI) is a technique that has received much attention due to its potential for combined high temporal and spatial resolution. However, the ballistocardiogram (BCG), a large-amplitude artifact caused by cardiac-induced movement, contaminates the EEG during EEG-fMRI recordings. Removal of BC…

    Submitted 15 October, 2019; originally announced October 2019.

  11. Osteoporotic and Neoplastic Compression Fracture Classification on Longitudinal CT

    Authors: Yinong Wang, Jianhua Yao, Joseph E. Burns, Ronald M. Summers

    Abstract: Classification of vertebral compression fractures (VCF) having osteoporotic or neoplastic origin is fundamental to the planning of treatment. We developed a fracture classification system by acquiring quantitative morphologic and bone density determinants of fracture progression through the use of automated measurements from longitudinal studies. A total of 250 CT studies were acquired for the tas…

    Submitted 27 January, 2016; originally announced January 2016.

    Comments: Contributed 4-Page Paper to be presented at the 2016 IEEE International Symposium on Biomedical Imaging (ISBI), April 13-16, 2016, Prague, Czech Republic