Skip to main content

Showing 1–50 of 107 results for author: Ying, R

.
  1. arXiv:2506.09114  [pdf, other

    cs.LG

    TRACE: Grounding Time Series in Context for Multimodal Embedding and Retrieval

    Authors: Jialin Chen, Ziyu Zhao, Gaukhar Nurbek, Aosong Feng, Ali Maatouk, Leandros Tassiulas, Yifeng Gao, Rex Ying

    Abstract: The ubiquity of dynamic data in domains such as weather, healthcare, and energy underscores a growing need for effective interpretation and retrieval of time-series data. These data are inherently tied to domain-specific contexts, such as clinical notes or weather narratives, making cross-modal retrieval essential not only for downstream tasks but also for developing robust time-series foundation… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2506.05826  [pdf, ps, other

    cs.LG

    Learning Along the Arrow of Time: Hyperbolic Geometry for Backward-Compatible Representation Learning

    Authors: Ngoc Bui, Menglin Yang, Runjin Chen, Leonardo Neves, Mingxuan Ju, Rex Ying, Neil Shah, Tong Zhao

    Abstract: Backward compatible representation learning enables updated models to integrate seamlessly with existing ones, avoiding to reprocess stored data. Despite recent advances, existing compatibility approaches in Euclidean space neglect the uncertainty in the old embedding model and force the new model to reconstruct outdated representations regardless of their quality, thereby hindering the learning p… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

  3. arXiv:2506.05361  [pdf, other

    cs.CV q-bio.GN

    Scalable Generation of Spatial Transcriptomics from Histology Images via Whole-Slide Flow Matching

    Authors: Tinglin Huang, Tianyu Liu, Mehrtash Babadi, Wengong Jin, Rex Ying

    Abstract: Spatial transcriptomics (ST) has emerged as a powerful technology for bridging histology imaging with gene expression profiling. However, its application has been limited by low throughput and the need for specialized experimental facilities. Prior works sought to predict ST from whole-slide histology images to accelerate this process, but they suffer from two major limitations. First, they do not… ▽ More

    Submitted 24 May, 2025; originally announced June 2025.

    Comments: Accepted at ICML 2025

  4. arXiv:2505.24722  [pdf, ps, other

    cs.LG cs.AI

    HELM: Hyperbolic Large Language Models via Mixture-of-Curvature Experts

    Authors: Neil He, Rishabh Anand, Hiren Madhu, Ali Maatouk, Smita Krishnaswamy, Leandros Tassiulas, Menglin Yang, Rex Ying

    Abstract: Large language models (LLMs) have shown great success in text modeling tasks across domains. However, natural language exhibits inherent semantic hierarchies and nuanced geometric structure, which current LLMs do not capture completely owing to their reliance on Euclidean operations. Recent studies have also shown that not respecting the geometry of token embeddings leads to training instabilities… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  5. Towards Non-Euclidean Foundation Models: Advancing AI Beyond Euclidean Frameworks

    Authors: Menglin Yang, Yifei Zhang, Jialin Chen, Melanie Weber, Rex Ying

    Abstract: In the era of foundation models and Large Language Models (LLMs), Euclidean space is the de facto geometric setting of our machine learning architectures. However, recent literature has demonstrated that this choice comes with fundamental limitations. To that end, non-Euclidean learning is quickly gaining traction, particularly in web-related applications where complex relationships and structures… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: WWW 2025 Companion

  6. arXiv:2505.03209  [pdf, other

    cs.LG

    DYSTIL: Dynamic Strategy Induction with Large Language Models for Reinforcement Learning

    Authors: Borui Wang, Kathleen McKeown, Rex Ying

    Abstract: Reinforcement learning from expert demonstrations has long remained a challenging research problem, and existing state-of-the-art methods using behavioral cloning plus further RL training often suffer from poor generalization, low sample efficiency, and poor model interpretability. Inspired by the strong reasoning abilities of large language models (LLMs), we propose a novel strategy-based reinfor… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

  7. arXiv:2504.11698  [pdf, other

    cs.RO cs.CV

    An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World

    Authors: Xingwu Ji, Haochen Niu, Dexin Duan, Rendong Ying, Fei Wen, Peilin Liu

    Abstract: Recently, learning-based robotic navigation systems have gained extensive research attention and made significant progress. However, the diversity of open-world scenarios poses a major challenge for the generalization of such systems to practical scenarios. Specifically, learned systems for scene measurement and state estimation tend to degrade when the application scenarios deviate from the train… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

    Comments: 11 pages, 14 figures

  8. arXiv:2504.10917  [pdf, other

    cs.LG cs.AI

    Towards A Universal Graph Structural Encoder

    Authors: Jialin Chen, Haolan Zuo, Haoyu Peter Wang, Siqi Miao, Pan Li, Rex Ying

    Abstract: Recent advancements in large-scale pre-training have shown the potential to learn generalizable representations for downstream tasks. In the graph domain, however, capturing and transferring structural information across different graph domains remains challenging, primarily due to the inherent differences in topological patterns across various contexts. Additionally, most existing models struggle… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  9. arXiv:2504.08912  [pdf, other

    cs.LG cs.AI

    HyperCore: The Core Framework for Building Hyperbolic Foundation Models with Comprehensive Modules

    Authors: Neil He, Menglin Yang, Rex Ying

    Abstract: Hyperbolic neural networks have emerged as a powerful tool for modeling hierarchical data across diverse modalities. Recent studies show that token distributions in foundation models exhibit scale-free properties, suggesting that hyperbolic space is a more suitable ambient space than Euclidean space for many pre-training and downstream tasks. However, existing tools lack essential components for b… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 11 pages, 4 figures

  10. arXiv:2504.08896  [pdf, other

    cs.LG cs.AI

    Position: Beyond Euclidean -- Foundation Models Should Embrace Non-Euclidean Geometries

    Authors: Neil He, Jiahong Liu, Buze Zhang, Ngoc Bui, Ali Maatouk, Menglin Yang, Irwin King, Melanie Weber, Rex Ying

    Abstract: In the era of foundation models and Large Language Models (LLMs), Euclidean space has been the de facto geometric setting for machine learning architectures. However, recent literature has demonstrated that this choice comes with fundamental limitations. At a large scale, real-world data often exhibit inherently non-Euclidean structures, such as multi-way relationships, hierarchies, symmetries, an… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 22 pages, 4 figures

  11. arXiv:2504.08150  [pdf, other

    cs.LG

    Beyond Feature Importance: Feature Interactions in Predicting Post-Stroke Rigidity with Graph Explainable AI

    Authors: Jiawei Xu, Yonggeon Lee, Anthony Elkommos Youssef, Eunjin Yun, Tinglin Huang, Tianjian Guo, Hamidreza Saber, Rex Ying, Ying Ding

    Abstract: This study addresses the challenge of predicting post-stroke rigidity by emphasizing feature interactions through graph-based explainable AI. Post-stroke rigidity, characterized by increased muscle tone and stiffness, significantly affects survivors' mobility and quality of life. Despite its prevalence, early prediction remains limited, delaying intervention. We analyze 519K stroke hospitalization… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Jiawei Xu and Yonggeon Lee contributed equally to this work

  12. arXiv:2504.05019  [pdf, other

    cs.LG cs.CL

    Mixture-of-Personas Language Models for Population Simulation

    Authors: Ngoc Bui, Hieu Trung Nguyen, Shantanu Kumar, Julian Theodore, Weikang Qiu, Viet Anh Nguyen, Rex Ying

    Abstract: Advances in Large Language Models (LLMs) paved the way for their emerging applications in various domains, such as human behavior simulations, where LLMs could augment human-generated data in social science research and machine learning model training. However, pretrained LLMs often fail to capture the behavioral diversity of target populations due to the inherent variability across individuals an… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  13. arXiv:2503.17933  [pdf, other

    cs.CL cs.AI cs.IR

    Experience Retrieval-Augmentation with Electronic Health Records Enables Accurate Discharge QA

    Authors: Justice Ou, Tinglin Huang, Yilun Zhao, Ziyang Yu, Peiqing Lu, Rex Ying

    Abstract: To improve the reliability of Large Language Models (LLMs) in clinical applications, retrieval-augmented generation (RAG) is extensively applied to provide factual medical knowledge. However, beyond general medical knowledge from open-ended datasets, clinical case-based knowledge is also critical for effective medical reasoning, as it provides context grounded in real-world patient experiences.Mot… ▽ More

    Submitted 28 May, 2025; v1 submitted 23 March, 2025; originally announced March 2025.

  14. arXiv:2503.16858  [pdf, other

    cs.CL cs.AI

    MTBench: A Multimodal Time Series Benchmark for Temporal Reasoning and Question Answering

    Authors: Jialin Chen, Aosong Feng, Ziyu Zhao, Juan Garza, Gaukhar Nurbek, Cheng Qin, Ali Maatouk, Leandros Tassiulas, Yifeng Gao, Rex Ying

    Abstract: Understanding the relationship between textual news and time-series evolution is a critical yet under-explored challenge in applied data science. While multimodal learning has gained traction, existing multimodal time-series datasets fall short in evaluating cross-modal reasoning and complex question answering, which are essential for capturing complex interactions between narrative information an… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: 14 pages

  15. arXiv:2503.04184  [pdf

    cs.NI cs.AI cs.CL

    Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

    Authors: Adnan Shahid, Adrian Kliks, Ahmed Al-Tahmeesschi, Ahmed Elbakary, Alexandros Nikou, Ali Maatouk, Ali Mokh, Amirreza Kazemi, Antonio De Domenico, Athanasios Karapantelakis, Bo Cheng, Bo Yang, Bohao Wang, Carlo Fischione, Chao Zhang, Chaouki Ben Issaid, Chau Yuen, Chenghui Peng, Chongwen Huang, Christina Chaccour, Christo Kurisummoottil Thomas, Dheeraj Sharma, Dimitris Kalogiros, Dusit Niyato, Eli De Poorter , et al. (110 additional authors not shown)

    Abstract: This white paper discusses the role of large-scale AI in the telecommunications industry, with a specific focus on the potential of generative AI to revolutionize network functions and user experiences, especially in the context of 6G systems. It highlights the development and deployment of Large Telecom Models (LTMs), which are tailored AI models designed to address the complex challenges faced b… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  16. arXiv:2502.15786  [pdf, ps, other

    q-bio.NC cs.AI cs.LG eess.SP

    MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding

    Authors: Weikang Qiu, Zheng Huang, Haoyu Hu, Aosong Feng, Yujun Yan, Rex Ying

    Abstract: Decoding functional magnetic resonance imaging (fMRI) signals into text has been a key challenge in the neuroscience community, with the potential to advance brain-computer interfaces and uncover deeper insights into brain mechanisms. However, existing approaches often struggle with suboptimal predictive performance, limited task variety, and poor generalization across subjects. In response to thi… ▽ More

    Submitted 6 June, 2025; v1 submitted 17 February, 2025; originally announced February 2025.

    Comments: Forty-Second International Conference on Machine Learning (ICML 2025)

  17. arXiv:2502.09767  [pdf, other

    cs.LG cs.AI cs.CL

    Non-Markovian Discrete Diffusion with Causal Language Models

    Authors: Yangtian Zhang, Sizhuang He, Daniel Levine, Lawrence Zhao, David Zhang, Syed A Rizvi, Emanuele Zappala, Rex Ying, David van Dijk

    Abstract: Discrete diffusion models offer a flexible, controllable approach to structured sequence generation, yet they still lag behind causal language models in expressive power. A key limitation lies in their reliance on the Markovian assumption, which restricts each step to condition only on the current state, leading to potential uncorrectable error accumulation. In this paper, we introduce CaDDi, a di… ▽ More

    Submitted 27 May, 2025; v1 submitted 13 February, 2025; originally announced February 2025.

    Comments: Under Review

  18. arXiv:2502.07746  [pdf, other

    cs.LG math.AT

    HiPoNet: A Multi-View Simplicial Complex Network for High Dimensional Point-Cloud and Single-Cell Data

    Authors: Siddharth Viswanath, Hiren Madhu, Dhananjay Bhaskar, Jake Kovalic, David R Johnson, Christopher Tape, Ian Adelstein, Rex Ying, Michael Perlmutter, Smita Krishnaswamy

    Abstract: In this paper, we propose HiPoNet, an end-to-end differentiable neural network for regression, classification, and representation learning on high-dimensional point clouds. Our work is motivated by single-cell data which can have very high-dimensionality --exceeding the capabilities of existing methods for point clouds which are mostly tailored for 3D data. Moreover, modern single-cell and spatial… ▽ More

    Submitted 26 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  19. arXiv:2502.07303  [pdf, other

    cs.IR

    Flow Matching for Collaborative Filtering

    Authors: Chengkai Liu, Yangtian Zhang, Jianling Wang, Rex Ying, James Caverlee

    Abstract: Generative models have shown great promise in collaborative filtering by capturing the underlying distribution of user interests and preferences. However, existing approaches struggle with inaccurate posterior approximations and misalignment with the discrete nature of recommendation data, limiting their expressiveness and real-world performance. To address these limitations, we propose FlowCF, a… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  20. arXiv:2501.00365  [pdf, other

    cs.LG cs.AI

    Low-Rank Adaptation for Foundation Models: A Comprehensive Review

    Authors: Menglin Yang, Jialin Chen, Yifei Zhang, Jiahong Liu, Jiasheng Zhang, Qiyao Ma, Harshit Verma, Qianru Zhang, Min Zhou, Irwin King, Rex Ying

    Abstract: The rapid advancement of foundation modelslarge-scale neural networks trained on diverse, extensive datasetshas revolutionized artificial intelligence, enabling unprecedented advancements across domains such as natural language processing, computer vision, and scientific discovery. However, the substantial parameter count of these models, often reaching billions or trillions, poses significant cha… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    MSC Class: I.2

  21. arXiv:2412.14695  [pdf, other

    cs.LG

    Lorentzian Residual Neural Networks

    Authors: Neil He, Menglin Yang, Rex Ying

    Abstract: Hyperbolic neural networks have emerged as a powerful tool for modeling hierarchical data structures prevalent in real-world datasets. Notably, residual connections, which facilitate the direct flow of information across layers, have been instrumental in the success of deep neural networks. However, current methods for constructing hyperbolic residual networks suffer from limitations such as incre… ▽ More

    Submitted 12 January, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: 12 pages, 3 figures, KDD 2025

  22. arXiv:2412.13664  [pdf, ps, other

    cs.RO

    A Skeleton-Based Topological Planner for Exploration in Complex Unknown Environments

    Authors: Haochen Niu, Xingwu Ji, Lantao Zhang, Fei Wen, Rendong Ying, Peilin Liu

    Abstract: The capability of autonomous exploration in complex, unknown environments is important in many robotic applications. While recent research on autonomous exploration have achieved much progress, there are still limitations, e.g., existing methods relying on greedy heuristics or optimal path planning are often hindered by repetitive paths and high computational demands. To address such limitations,… ▽ More

    Submitted 7 June, 2025; v1 submitted 18 December, 2024; originally announced December 2024.

    Comments: 7 pages, 7 figures. Accepted to be presented at the ICRA 2025

  23. arXiv:2411.16694  [pdf, other

    q-bio.BM cs.AI

    Reaction-conditioned De Novo Enzyme Design with GENzyme

    Authors: Chenqing Hua, Jiarui Lu, Yong Liu, Odin Zhang, Jian Tang, Rex Ying, Wengong Jin, Guy Wolf, Doina Precup, Shuangjia Zheng

    Abstract: The introduction of models like RFDiffusionAA, AlphaFold3, AlphaProteo, and Chai1 has revolutionized protein structure modeling and interaction prediction, primarily from a binding perspective, focusing on creating ideal lock-and-key models. However, these methods can fall short for enzyme-substrate interactions, where perfect binding models are rare, and induced fit states are more common. To add… ▽ More

    Submitted 9 November, 2024; originally announced November 2024.

  24. arXiv:2411.13865  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Breaking Information Cocoons: A Hyperbolic Graph-LLM Framework for Exploration and Exploitation in Recommender Systems

    Authors: Qiyao Ma, Menglin Yang, Mingxuan Ju, Tong Zhao, Neil Shah, Rex Ying

    Abstract: Modern recommender systems often create information cocoons, restricting users' exposure to diverse content. A key challenge lies in balancing content exploration and exploitation while allowing users to adjust their recommendation preferences. Intuitively, this balance can be modeled as a tree-structured representation, where depth search facilitates exploitation and breadth search enables explor… ▽ More

    Submitted 22 May, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

  25. arXiv:2411.08767  [pdf, other

    cs.NI cs.AI

    SANDWICH: Towards an Offline, Differentiable, Fully-Trainable Wireless Neural Ray-Tracing Surrogate

    Authors: Yifei Jin, Ali Maatouk, Sarunas Girdzijauskas, Shugong Xu, Leandros Tassiulas, Rex Ying

    Abstract: Wireless ray-tracing (RT) is emerging as a key tool for three-dimensional (3D) wireless channel modeling, driven by advances in graphical rendering. Current approaches struggle to accurately model beyond 5G (B5G) network signaling, which often operates at higher frequencies and is more susceptible to environmental conditions and changes. Existing online learning solutions require real-time environ… ▽ More

    Submitted 20 February, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

    Comments: Accepted in ICMLCN 2025

  26. arXiv:2410.20926  [pdf, other

    cs.CL

    Long Sequence Modeling with Attention Tensorization: From Sequence to Tensor Learning

    Authors: Aosong Feng, Rex Ying, Leandros Tassiulas

    Abstract: As the demand for processing extended textual data grows, the ability to handle long-range dependencies and maintain computational efficiency is more critical than ever. One of the key issues for long-sequence modeling using attention-based model is the mismatch between the limited-range modeling power of full attention and the long-range token dependency in the input sequence. In this work, we pr… ▽ More

    Submitted 22 May, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  27. arXiv:2410.09207  [pdf, other

    cs.AI cs.CL

    P-FOLIO: Evaluating and Improving Logical Reasoning with Abundant Human-Written Reasoning Chains

    Authors: Simeng Han, Aaron Yu, Rui Shen, Zhenting Qi, Martin Riddell, Wenfei Zhou, Yujie Qiao, Yilun Zhao, Semih Yavuz, Ye Liu, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Dragomir Radev, Rex Ying, Arman Cohan

    Abstract: Existing methods on understanding the capabilities of LLMs in logical reasoning rely on binary entailment classification or synthetically derived rationales, which are not sufficient for proper investigation of model's capabilities. We present P-FOLIO, a human-annotated dataset consisting of diverse and complex reasoning chains for a set of realistic logical reasoning stories also written by human… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

  28. arXiv:2410.04010  [pdf, other

    cs.LG cs.AI cs.CL cs.NE

    Hyperbolic Fine-tuning for Large Language Models

    Authors: Menglin Yang, Aosong Feng, Bo Xiong, Jihong Liu, Irwin King, Rex Ying

    Abstract: Large language models (LLMs) have demonstrated remarkable performance on various tasks. However, it remains an open question whether the default Euclidean space is the most suitable choice for embedding tokens in LLMs. In this study, we first investigate the non-Euclidean characteristics of LLMs. Our findings reveal that token frequency follows a power-law distribution, with high-frequency tokens… ▽ More

    Submitted 4 October, 2024; originally announced October 2024.

    Comments: The preliminary work was accepted for the ICML 2024 LLM Cognition Workshop, and this version includes new investigations, analyses, experiments, and results

  29. arXiv:2409.12177  [pdf, other

    cs.SI cs.DL

    LitFM: A Retrieval Augmented Structure-aware Foundation Model For Citation Graphs

    Authors: Jiasheng Zhang, Jialin Chen, Ali Maatouk, Ngoc Bui, Qianqian Xie, Leandros Tassiulas, Jie Shao, Hua Xu, Rex Ying

    Abstract: With the advent of large language models (LLMs), managing scientific literature via LLMs has become a promising direction of research. However, existing approaches often overlook the rich structural and semantic relevance among scientific literature, limiting their ability to discern the relationships between pieces of scientific knowledge, and suffer from various types of hallucinations. These me… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: 18 pages, 12 figures

  30. arXiv:2409.05314  [pdf, other

    cs.IT cs.AI cs.LG

    Tele-LLMs: A Series of Specialized Large Language Models for Telecommunications

    Authors: Ali Maatouk, Kenny Chirino Ampudia, Rex Ying, Leandros Tassiulas

    Abstract: The emergence of large language models (LLMs) has significantly impacted various fields, from natural language processing to sectors like medicine and finance. However, despite their rapid proliferation, the applications of LLMs in telecommunications remain limited, often relying on general-purpose models that lack domain-specific specialization. This lack of specialization results in underperform… ▽ More

    Submitted 5 May, 2025; v1 submitted 8 September, 2024; originally announced September 2024.

  31. arXiv:2408.06603  [pdf, other

    cs.AI

    Simple but Effective Compound Geometric Operations for Temporal Knowledge Graph Completion

    Authors: Rui Ying, Mengting Hu, Jianfeng Wu, Yalan Xie, Xiaoyi Liu, Zhunheng Wang, Ming Jiang, Hang Gao, Linlin Zhang, Renhong Cheng

    Abstract: Temporal knowledge graph completion aims to infer the missing facts in temporal knowledge graphs. Current approaches usually embed factual knowledge into continuous vector space and apply geometric operations to learn potential patterns in temporal knowledge graphs. However, these methods only adopt a single operation, which may have limitations in capturing the complex temporal dynamics present i… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  32. arXiv:2408.00872  [pdf, other

    cs.AI cs.DB cs.LG

    Online Detection of Anomalies in Temporal Knowledge Graphs with Interpretability

    Authors: Jiasheng Zhang, Rex Ying, Jie Shao

    Abstract: Temporal knowledge graphs (TKGs) are valuable resources for capturing evolving relationships among entities, yet they are often plagued by noise, necessitating robust anomaly detection mechanisms. Existing dynamic graph anomaly detection approaches struggle to capture the rich semantics introduced by node and edge categories within TKGs, while TKG embedding methods lack interpretability, undermini… ▽ More

    Submitted 2 September, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

    Comments: 26 pages, 10 figures. Accepted by SIGMOD 2025

  33. arXiv:2407.09618  [pdf, other

    cs.LG cs.SI

    The Heterophilic Graph Learning Handbook: Benchmarks, Models, Theoretical Analysis, Applications and Challenges

    Authors: Sitao Luan, Chenqing Hua, Qincheng Lu, Liheng Ma, Lirong Wu, Xinyu Wang, Minkai Xu, Xiao-Wen Chang, Doina Precup, Rex Ying, Stan Z. Li, Jian Tang, Guy Wolf, Stefanie Jegelka

    Abstract: Homophily principle, \ie{} nodes with the same labels or similar attributes are more likely to be connected, has been commonly believed to be the main reason for the superiority of Graph Neural Networks (GNNs) over traditional Neural Networks (NNs) on graph-structured data, especially on node-level tasks. However, recent work has identified a non-trivial set of datasets where GNN's performance com… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: Suggestions and comments are welcomed at [email protected]!

  34. arXiv:2407.01290  [pdf, other

    cs.LG cs.AI

    Hypformer: Exploring Efficient Hyperbolic Transformer Fully in Hyperbolic Space

    Authors: Menglin Yang, Harshit Verma, Delvin Ce Zhang, Jiahong Liu, Irwin King, Rex Ying

    Abstract: Hyperbolic geometry have shown significant potential in modeling complex structured data, particularly those with underlying tree-like and hierarchical structures. Despite the impressive performance of various hyperbolic neural networks across numerous domains, research on adapting the Transformer to hyperbolic space remains limited. Previous attempts have mainly focused on modifying self-attentio… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: KDD 2024

  35. arXiv:2407.00849  [pdf, other

    cs.LG

    Towards Understanding Sensitive and Decisive Patterns in Explainable AI: A Case Study of Model Interpretation in Geometric Deep Learning

    Authors: Jiajun Zhu, Siqi Miao, Rex Ying, Pan Li

    Abstract: The interpretability of machine learning models has gained increasing attention, particularly in scientific domains where high precision and accountability are crucial. This research focuses on distinguishing between two critical data patterns -- sensitive patterns (model-related) and decisive patterns (task-related) -- which are commonly used as model interpretations but often lead to confusion.… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  36. arXiv:2406.13839  [pdf, other

    q-bio.BM cs.LG q-bio.GN

    RNA-FrameFlow: Flow Matching for de novo 3D RNA Backbone Design

    Authors: Rishabh Anand, Chaitanya K. Joshi, Alex Morehead, Arian R. Jamasb, Charles Harris, Simon V. Mathis, Kieran Didi, Rex Ying, Bryan Hooi, Pietro Liò

    Abstract: We introduce RNA-FrameFlow, the first generative model for 3D RNA backbone design. We build upon SE(3) flow matching for protein backbone generation and establish protocols for data preparation and evaluation to address unique challenges posed by RNA modeling. We formulate RNA structures as a set of rigid-body frames and associated loss functions which account for larger, more conformationally fle… ▽ More

    Submitted 18 March, 2025; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Oral presentation at Machine Learning in Computational Biology (MLCB), 2024. Also presented as an Oral at ICML 2024 Structured Probabilistic Inference & Generative Modeling Workshop, and a Spotlight at ICML 2024 AI4Science Workshop

  37. arXiv:2406.12072  [pdf, other

    cs.AI cs.LG

    DTGB: A Comprehensive Benchmark for Dynamic Text-Attributed Graphs

    Authors: Jiasheng Zhang, Jialin Chen, Menglin Yang, Aosong Feng, Shuang Liang, Jie Shao, Rex Ying

    Abstract: Dynamic text-attributed graphs (DyTAGs) are prevalent in various real-world scenarios, where each node and edge are associated with text descriptions, and both the graph structure and text descriptions evolve over time. Despite their broad applicability, there is a notable scarcity of benchmark datasets tailored to DyTAGs, which hinders the potential advancement in many research fields. To address… ▽ More

    Submitted 4 November, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 28 pages, 13 figures, camera-ready version for NeurIPS 2024 Datasets and Benchmarks Track

  38. arXiv:2406.09586  [pdf, other

    q-bio.BM

    Protein-Nucleic Acid Complex Modeling with Frame Averaging Transformer

    Authors: Tinglin Huang, Zhenqiao Song, Rex Ying, Wengong Jin

    Abstract: Nucleic acid-based drugs like aptamers have recently demonstrated great therapeutic potential. However, experimental platforms for aptamer screening are costly, and the scarcity of labeled data presents a challenge for supervised methods to learn protein-aptamer binding. To this end, we develop an unsupervised learning approach based on the predicted pairwise contact map between a protein and a nu… ▽ More

    Submitted 3 November, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted at NeurIPS 2024

  39. arXiv:2405.14352  [pdf, other

    cs.LG

    Explaining Graph Neural Networks via Structure-aware Interaction Index

    Authors: Ngoc Bui, Hieu Trung Nguyen, Viet Anh Nguyen, Rex Ying

    Abstract: The Shapley value is a prominent tool for interpreting black-box machine learning models thanks to its strong theoretical foundation. However, for models with structured inputs, such as graph neural networks, existing Shapley-based explainability approaches either focus solely on node-wise importance or neglect the graph structure when perturbing the input instance. This paper introduces the Myers… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 30 pages, ICML'24

  40. arXiv:2404.01340  [pdf, other

    cs.LG cs.AI

    From Similarity to Superiority: Channel Clustering for Time Series Forecasting

    Authors: Jialin Chen, Jan Eric Lenssen, Aosong Feng, Weihua Hu, Matthias Fey, Leandros Tassiulas, Jure Leskovec, Rex Ying

    Abstract: Time series forecasting has attracted significant attention in recent decades. Previous studies have demonstrated that the Channel-Independent (CI) strategy improves forecasting performance by treating different channels individually, while it leads to poor generalization on unseen instances and ignores potentially necessary interactions between channels. Conversely, the Channel-Dependent (CD) str… ▽ More

    Submitted 6 November, 2024; v1 submitted 30 March, 2024; originally announced April 2024.

    Comments: NeurIPS 2024

  41. arXiv:2403.04882  [pdf, other

    cs.LG

    Efficient High-Resolution Time Series Classification via Attention Kronecker Decomposition

    Authors: Aosong Feng, Jialin Chen, Juan Garza, Brooklyn Berry, Francisco Salazar, Yifeng Gao, Rex Ying, Leandros Tassiulas

    Abstract: The high-resolution time series classification problem is essential due to the increasing availability of detailed temporal data in various domains. To tackle this challenge effectively, it is imperative that the state-of-the-art attention model is scalable to accommodate the growing sequence lengths typically encountered in high-resolution time series data, while also demonstrating robustness in… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

  42. arXiv:2403.04880  [pdf, other

    cs.CV

    An Item is Worth a Prompt: Versatile Image Editing with Disentangled Control

    Authors: Aosong Feng, Weikang Qiu, Jinbin Bai, Xiao Zhang, Zhen Dong, Kaicheng Zhou, Rex Ying, Leandros Tassiulas

    Abstract: Building on the success of text-to-image diffusion models (DPMs), image editing is an important application to enable human interaction with AI-generated content. Among various editing methods, editing within the prompt space gains more attention due to its capacity and simplicity of controlling semantics. However, since diffusion models are commonly pretrained on descriptive text captions, direct… ▽ More

    Submitted 26 January, 2025; v1 submitted 7 March, 2024; originally announced March 2024.

  43. arXiv:2402.14367  [pdf, other

    cs.LG cs.SI

    Representation Learning for Frequent Subgraph Mining

    Authors: Rex Ying, Tianyu Fu, Andrew Wang, Jiaxuan You, Yu Wang, Jure Leskovec

    Abstract: Identifying frequent subgraphs, also called network motifs, is crucial in analyzing and predicting properties of real-world networks. However, finding large commonly-occurring motifs remains a challenging problem not only due to its NP-hard subroutine of subgraph counting, but also the exponential growth of the number of possible subgraphs patterns. Here we present Subgraph Pattern Miner (SPMiner)… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: Oral Presentation in The Graph Representation Learning and Beyond (GRL+) Workshop from The 37th International Conference on Ma- chine Learning, 2020

  44. arXiv:2401.05561  [pdf, other

    cs.CL

    TrustLLM: Trustworthiness in Large Language Models

    Authors: Yue Huang, Lichao Sun, Haoran Wang, Siyuan Wu, Qihui Zhang, Yuan Li, Chujie Gao, Yixin Huang, Wenhan Lyu, Yixuan Zhang, Xiner Li, Zhengliang Liu, Yixin Liu, Yijue Wang, Zhikun Zhang, Bertie Vidgen, Bhavya Kailkhura, Caiming Xiong, Chaowei Xiao, Chunyuan Li, Eric Xing, Furong Huang, Hao Liu, Heng Ji, Hongyi Wang , et al. (45 additional authors not shown)

    Abstract: Large language models (LLMs), exemplified by ChatGPT, have gained considerable attention for their excellent natural language processing capabilities. Nonetheless, these LLMs present many challenges, particularly in the realm of trustworthiness. Therefore, ensuring the trustworthiness of LLMs emerges as an important topic. This paper introduces TrustLLM, a comprehensive study of trustworthiness in… ▽ More

    Submitted 30 September, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: This work is still under work and we welcome your contribution

  45. arXiv:2312.04615  [pdf, other

    cs.LG cs.DB

    Relational Deep Learning: Graph Representation Learning on Relational Databases

    Authors: Matthias Fey, Weihua Hu, Kexin Huang, Jan Eric Lenssen, Rishabh Ranjan, Joshua Robinson, Rex Ying, Jiaxuan You, Jure Leskovec

    Abstract: Much of the world's most valued data is stored in relational databases and data warehouses, where the data is organized into many tables connected by primary-foreign key relations. However, building machine learning models using this data is both challenging and time consuming. The core problem is that no machine learning method is capable of learning on multiple tables interconnected by primary-f… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: https://relbench.stanford.edu

  46. arXiv:2312.02203  [pdf, other

    q-bio.NC cs.LG

    Learning High-Order Relationships of Brain Regions

    Authors: Weikang Qiu, Huangrui Chu, Selena Wang, Haolan Zuo, Xiaoxiao Li, Yize Zhao, Rex Ying

    Abstract: Discovering reliable and informative relationships among brain regions from functional magnetic resonance imaging (fMRI) signals is essential in phenotypic predictions. Most of the current methods fail to accurately characterize those interactions because they only focus on pairwise connections and overlook the high-order relationships of brain regions. We propose that these high-order relationshi… ▽ More

    Submitted 8 June, 2024; v1 submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted at ICML 2024, Camera Ready Version

  47. arXiv:2311.05767  [pdf, other

    cs.LG

    Dirichlet Energy Enhancement of Graph Neural Networks by Framelet Augmentation

    Authors: Jialin Chen, Yuelin Wang, Cristian Bodnar, Rex Ying, Pietro Lio, Yu Guang Wang

    Abstract: Graph convolutions have been a pivotal element in learning graph representations. However, recursively aggregating neighboring information with graph convolutions leads to indistinguishable node features in deep layers, which is known as the over-smoothing issue. The performance of graph neural networks decays fast as the number of stacked layers increases, and the Dirichlet energy associated with… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  48. arXiv:2311.05764  [pdf, other

    cs.LG

    Generative Explanations for Graph Neural Network: Methods and Evaluations

    Authors: Jialin Chen, Kenza Amara, Junchi Yu, Rex Ying

    Abstract: Graph Neural Networks (GNNs) achieve state-of-the-art performance in various graph-related tasks. However, the black-box nature often limits their interpretability and trustworthiness. Numerous explainability methods have been proposed to uncover the decision-making logic of GNNs, by generating underlying explanatory substructures. In this paper, we conduct a comprehensive review of the existing e… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

  49. arXiv:2310.19324  [pdf, other

    cs.LG cs.AI

    TempME: Towards the Explainability of Temporal Graph Neural Networks via Motif Discovery

    Authors: Jialin Chen, Rex Ying

    Abstract: Temporal graphs are widely used to model dynamic systems with time-varying interactions. In real-world scenarios, the underlying mechanisms of generating future interactions in dynamic systems are typically governed by a set of recurring substructures within the graph, known as temporal motifs. Despite the success and prevalence of current temporal graph neural networks (TGNN), it remains uncertai… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023, Camera Ready Version

  50. arXiv:2310.19321  [pdf, other

    cs.LG cs.AI

    D4Explainer: In-Distribution GNN Explanations via Discrete Denoising Diffusion

    Authors: Jialin Chen, Shirley Wu, Abhijit Gupta, Rex Ying

    Abstract: The widespread deployment of Graph Neural Networks (GNNs) sparks significant interest in their explainability, which plays a vital role in model auditing and ensuring trustworthy graph learning. The objective of GNN explainability is to discern the underlying graph structures that have the most significant impact on model predictions. Ensuring that explanations generated are reliable necessitates… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted at NeurIPS 2023, Camera Ready Version