Showing 1–12 of 12 results for author: Ying, R

  1. arXiv:2506.11152  [pdf, ps, other]

    q-bio.GN cs.LG q-bio.CB

    HEIST: A Graph Foundation Model for Spatial Transcriptomics and Proteomics Data

    Authors: Hiren Madhu, João Felipe Rocha, Tinglin Huang, Siddharth Viswanath, Smita Krishnaswamy, Rex Ying

    Abstract: Single-cell transcriptomics and proteomics have become a great source for data-driven insights into biology, enabling the use of advanced deep learning methods to understand cellular heterogeneity and gene expression at the single-cell level. With the advent of spatial-omics data, we have the promise of characterizing cells within their tissue context as it provides both spatial coordinates and in…

    Submitted 25 September, 2025; v1 submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2506.05361  [pdf, other]

    cs.CV q-bio.GN

    Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

    Authors: Tinglin Huang, Tianyu Liu, Mehrtash Babadi, Wengong Jin, Rex Ying

    Abstract: Spatial transcriptomics (ST) has emerged as a powerful technology for bridging histology imaging with gene expression profiling. However, its application has been limited by low throughput and the need for specialized experimental facilities. Prior works sought to predict ST from whole-slide histology images to accelerate this process, but they suffer from two major limitations. First, they do not…

    Submitted 24 May, 2025; originally announced June 2025.

    Comments: Accepted at ICML 2025

  3. arXiv:2502.15786  [pdf, ps, other]

    q-bio.NC cs.AI cs.LG eess.SP

    MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding

    Authors: Weikang Qiu, Zheng Huang, Haoyu Hu, Aosong Feng, Yujun Yan, Rex Ying

    Abstract: Decoding functional magnetic resonance imaging (fMRI) signals into text has been a key challenge in the neuroscience community, with the potential to advance brain-computer interfaces and uncover deeper insights into brain mechanisms. However, existing approaches often struggle with suboptimal predictive performance, limited task variety, and poor generalization across subjects. In response to thi…

    Submitted 6 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Forty-Second International Conference on Machine Learning (ICML 2025)

  4. arXiv:2411.16694  [pdf, other]

    q-bio.BM cs.AI

    Reaction-conditioned De Novo Enzyme Design with GENzyme

    Authors: Chenqing Hua, Jiarui Lu, Yong Liu, Odin Zhang, Jian Tang, Rex Ying, Wengong Jin, Guy Wolf, Doina Precup, Shuangjia Zheng

    Abstract: The introduction of models like RFDiffusionAA, AlphaFold3, AlphaProteo, and Chai1 has revolutionized protein structure modeling and interaction prediction, primarily from a binding perspective, focusing on creating ideal lock-and-key models. However, these methods can fall short for enzyme-substrate interactions, where perfect binding models are rare, and induced fit states are more common. To add…

    Submitted 9 November, 2024; originally announced November 2024.

  5. arXiv:2406.13839  [pdf, ps, other]

    q-bio.BM cs.LG q-bio.GN

    RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design

    Authors: Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian R. Jamasb, Charles Harris, Simon V. Mathis, Kieran Didi, Rex Ying, Bryan Hooi, Pietro Liò

    Abstract: We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally fle…

    Submitted 11 August, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Published in Transactions on Machine Learning Research (https://openreview.net/forum?id=wOc1Yx5s09). Also presented as an Oral at Machine Learning in Computational Biology 2024, ICML 2024 Structured Probabilistic Inference & Generative Modeling Workshop, and a Spotlight at ICML 2024 AI4Science Workshop

  6. arXiv:2406.09586  [pdf, other]

    q-bio.BM

    Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer

    Authors: Tinglin Huang, Zhenqiao Song, Rex Ying, Wengong Jin

    Abstract: Nucleic acid-based drugs like aptamers have recently demonstrated great therapeutic potential. However, experimental platforms for aptamer screening are costly, and the scarcity of labeled data presents a challenge for supervised methods to learn protein-aptamer binding. To this end, we develop an unsupervised learning approach based on the predicted pairwise contact map between a protein and a nu…

    Submitted 3 November, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at NeurIPS 2024

  7. arXiv:2312.02203  [pdf, other]

    q-bio.NC cs.LG

    Learning High-Order Relationships of Brain Regions

    Authors: Weikang Qiu, Huangrui Chu, Selena Wang, Haolan Zuo, Xiaoxiao Li, Yize Zhao, Rex Ying

    Abstract: Discovering reliable and informative relationships among brain regions from functional magnetic resonance imaging (fMRI) signals is essential in phenotypic predictions. Most of the current methods fail to accurately characterize those interactions because they only focus on pairwise connections and overlook the high-order relationships of brain regions. We propose that these high-order relationshi…

    Submitted 8 June, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at ICML 2024, Camera Ready Version

  8. arXiv:2310.02275  [pdf, other]

    cs.LG q-bio.GN

    MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph Data

    Authors: Tianyu Liu, Yuge Wang, Rex Ying, Hongyu Zhao

    Abstract: Discovering genes with similar functions across diverse biomedical contexts poses a significant challenge in gene representation learning due to data heterogeneity. In this study, we resolve this problem by introducing a novel model called Multimodal Similarity Learning Graph Neural Network, which combines Multimodal Machine Learning and Deep Graph Neural Networks to learn gene representations fro…

    Submitted 29 September, 2023; originally announced October 2023.

    Comments: Accepted by NeurIPS 2023

  9. arXiv:2307.04052  [pdf, other]

    q-bio.BM cs.LG

    Learning to Group Auxiliary Datasets for Molecule

    Authors: Tinglin Huang, Ziniu Hu, Rex Ying

    Abstract: The limited availability of annotations in small molecule datasets presents a challenge to machine learning models. To address this, one common strategy is to collaborate with additional auxiliary datasets. However, having more data does not always guarantee improvements. Negative transfer can occur when the knowledge in the target dataset differs or contradicts that of the auxiliary molecule data…

    Submitted 8 November, 2023; v1 submitted 8 July, 2023; originally announced July 2023.

    Comments: Accepted at NeurIPS 2023, Camera Ready Version

  10. arXiv:2304.14621  [pdf, other]

    cs.LG q-bio.BM

    MUDiff: Unified Diffusion for Complete Molecule Generation

    Authors: Chenqing Hua, Sitao Luan, Minkai Xu, Rex Ying, Jie Fu, Stefano Ermon, Doina Precup

    Abstract: Molecule generation is a very important practical problem, with uses in drug discovery and material design, and AI methods promise to provide useful solutions. However, existing methods for molecule generation focus either on 2D graph structure or on 3D geometric structure, which is not sufficient to represent a complete molecule as 2D graph captures mainly topology while 3D geometry captures main…

    Submitted 5 February, 2024; v1 submitted 28 April, 2023; originally announced April 2023.

  11. arXiv:2209.15315  [pdf, other]

    cs.LG physics.chem-ph q-bio.BM q-bio.QM

    FusionRetro: Molecule Representation Fusion via In-Context Learning for Retrosynthetic Planning

    Authors: Songtao Liu, Zhengkai Tu, Minkai Xu, Zuobai Zhang, Lu Lin, Rex Ying, Jian Tang, Peilin Zhao, Dinghao Wu

    Abstract: Retrosynthetic planning aims to devise a complete multi-step synthetic route from starting materials to a target molecule. Current strategies use a decoupled approach of single-step retrosynthesis models and search algorithms, taking only the product as the input to predict the reactants for each planning step and ignoring valuable context information along the synthetic route. In this work, we pr…

    Submitted 31 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted by ICML 2023

  12. arXiv:2109.09740  [pdf, other]

    q-bio.QM cs.LG

    Neural Distance Embeddings for Biological Sequences

    Authors: Gabriele Corso, Rex Ying, Michal Pándy, Petar Veličković, Jure Leskovec, Pietro Liò

    Abstract: The development of data-dependent heuristics and representations for biological sequences that reflect their evolutionary distance is critical for large-scale biological research. However, popular machine learning approaches, based on continuous Euclidean spaces, have struggled with the discrete combinatorial formulation of the edit distance that models evolution and the hierarchical relationship…

    Submitted 11 October, 2021; v1 submitted 20 September, 2021; originally announced September 2021.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2021)