Skip to main content

Showing 1–50 of 356 results for author: Yin, M

.
  1. arXiv:2507.01224  [pdf, ps, other

    cs.DC

    FLARE: A Dataflow-Aware and Scalable Hardware Architecture for Neural-Hybrid Scientific Lossy Compression

    Authors: Wenqi Jia, Ying Huang, Jian Xu, Zhewen Hu, Sian Jin, Jiannan Tian, Yuede Ji, Miao Yin

    Abstract: Scientific simulation leveraging high-performance computing (HPC) systems is crucial for modeling complex systems and phenomena in fields such as astrophysics, climate science, and fluid dynamics, generating massive datasets that often reach petabyte to exabyte scales. However, managing these vast data volumes introduces significant I/O and network bottlenecks, limiting practical performance and s… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  2. arXiv:2506.22675  [pdf, ps, other

    stat.ML cs.LG

    Bayesian Invariance Modeling of Multi-Environment Data

    Authors: Luhuan Wu, Mingzhang Yin, Yixin Wang, John P. Cunningham, David M. Blei

    Abstract: Invariant prediction [Peters et al., 2016] analyzes feature/outcome data from multiple environments to identify invariant features - those with a stable predictive relationship to the outcome. Such features support generalization to new environments and help reveal causal mechanisms. Previous methods have primarily tackled this problem through hypothesis testing or regularized optimization. Here w… ▽ More

    Submitted 2 July, 2025; v1 submitted 27 June, 2025; originally announced June 2025.

  3. arXiv:2506.22365  [pdf, ps, other

    cs.LG cs.RO

    Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation

    Authors: Tao Li, Haozhe Lei, Mingsheng Yin, Yaqi Hu

    Abstract: When using reinforcement learning (RL) to tackle physical control tasks, inductive biases that encode physics priors can help improve sample efficiency during training and enhance generalization in testing. However, the current practice of incorporating these helpful physics-informed inductive biases inevitably runs into significant manual labor and domain expertise, making them prohibitive for ge… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Spotlight paper at Reinforcement Learning Conference 2025, Workshop on Inductive Biases in Reinforcement Learning

  4. arXiv:2506.21923  [pdf, ps, other

    cs.CV

    ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction

    Authors: Juming Xiong, Ruining Deng, Jialin Yue, Siqi Lu, Junlin Guo, Marilyn Lionts, Tianyuan Yao, Can Cui, Junchao Zhu, Chongyu Qu, Mengmeng Yin, Haichun Yang, Yuankai Huo

    Abstract: Histological analysis plays a crucial role in understanding tissue structure and pathology. While recent advancements in registration methods have improved 2D histological analysis, they often struggle to preserve critical 3D spatial relationships, limiting their utility in both clinical and research applications. Specifically, constructing accurate 3D models from 2D slices remains challenging due… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

  5. arXiv:2506.18269  [pdf, ps, other

    cs.HC

    Co-persona: Leveraging LLMs and Expert Collaboration to Understand User Personas through Social Media Data Analysis

    Authors: Min Yin, Haoyu Liu, Boyi Lian, Chunlei Chai

    Abstract: This study introduces Co-Persona, a methodological framework bridging large-scale social media analysis with authentic user understanding through systematic integration of Large Language Models and expert validation. Through a case study of B.Co, a Chinese manufacturer, we investigated Co-Persona application in bedside lamp development. Our methodology analyzed over 38 million posts from Xiao Hong… ▽ More

    Submitted 24 June, 2025; v1 submitted 22 June, 2025; originally announced June 2025.

    Comments: 17pages,5figures,8tables

  6. arXiv:2505.22855  [pdf, ps, other

    eess.IV cs.CV

    IRS: Incremental Relationship-guided Segmentation for Digital Pathology

    Authors: Ruining Deng, Junchao Zhu, Juming Xiong, Can Cui, Tianyuan Yao, Junlin Guo, Siqi Lu, Marilyn Lionts, Mengmeng Yin, Yu Wang, Shilin Zhao, Yucheng Tang, Yihe Yang, Paul Dennis Simonson, Mert R. Sabuncu, Haichun Yang, Yuankai Huo

    Abstract: Continual learning is rapidly emerging as a key focus in computer vision, aiming to develop AI systems capable of continuous improvement, thereby enhancing their value and practicality in diverse real-world applications. In healthcare, continual learning holds great promise for continuously acquired digital pathology data, which is collected in hospitals on a daily basis. However, panoramic segmen… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

  7. arXiv:2505.19501  [pdf, ps, other

    cs.AI

    Toward Scientific Reasoning in LLMs: Training from Expert Discussions via Reinforcement Learning

    Authors: Ming Yin, Yuanhao Qu, Ling Yang, Le Cong, Mengdi Wang

    Abstract: We investigate how to teach large language models (LLMs) to perform scientific reasoning by leveraging expert discussions as a learning signal. Focusing on the genomics domain, we develop an automated pipeline to extract trainable data and introduce Genome-Bench, a new benchmark constructed from over a decade of scientific forum discussions on genome engineering. Our pipeline transforms raw intera… ▽ More

    Submitted 2 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  8. arXiv:2505.17925  [pdf, other

    cs.IR

    Enhancing CTR Prediction with De-correlated Expert Networks

    Authors: Jiancheng Wang, Mingjia Yin, Junwei Pan, Ximei Wang, Hao Wang, Enhong Chen

    Abstract: Modeling feature interactions is essential for accurate click-through rate (CTR) prediction in advertising systems. Recent studies have adopted the Mixture-of-Experts (MoE) approach to improve performance by ensembling multiple feature interaction experts. These studies employ various strategies, such as learning independent embedding tables for each expert or utilizing heterogeneous expert archit… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  9. arXiv:2505.04815  [pdf, other

    math.DS

    Causal Discovery in Symmetric Dynamic Systems with Convergent Cross Mapping

    Authors: Yiting Duan, Yi Guo, Jack Yang, Ming Yin

    Abstract: This paper systematically discusses how the inherent properties of chaotic attractors influence the results of discovering causality from time series using convergent cross mapping, particularly how convergent cross mapping misleads bidirectional causality as unidirectional when the chaotic attractor exhibits symmetry. We propose a novel method based on the k-means clustering method to address the… ▽ More

    Submitted 7 May, 2025; originally announced May 2025.

  10. arXiv:2505.00212  [pdf, ps, other

    cs.MA cs.CL

    Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems

    Authors: Shaokun Zhang, Ming Yin, Jieyu Zhang, Jiale Liu, Zhiguang Han, Jingyang Zhang, Beibin Li, Chi Wang, Huazheng Wang, Yiran Chen, Qingyun Wu

    Abstract: Failure attribution in LLM multi-agent systems-identifying the agent and step responsible for task failures-provides crucial clues for systems debugging but remains underexplored and labor-intensive. In this paper, we propose and formulate a new research area: automated failure attribution for LLM multi-agent systems. To support this initiative, we introduce the Who&When dataset, comprising extens… ▽ More

    Submitted 1 June, 2025; v1 submitted 30 April, 2025; originally announced May 2025.

    Comments: camera-ready

  11. arXiv:2504.21583  [pdf, other

    cs.NI

    Toward Realization of Low-Altitude Economy Networks: Core Architecture, Integrated Technologies, and Future Directions

    Authors: Yixian Wang, Geng Sun, Zemin Sun, Jiacheng Wang, Jiahui Li, Changyuan Zhao, Jing Wu, Shuang Liang, Minghao Yin, Pengfei Wang, Dusit Niyato, Sumei Sun, Dong In Kim

    Abstract: The rise of the low-altitude economy (LAE) is propelling urban development and emerging industries by integrating advanced technologies to enhance efficiency, safety, and sustainability in low-altitude operations. The widespread adoption of unmanned aerial vehicles (UAVs) and electric vertical takeoff and landing (eVTOL) aircraft plays a crucial role in enabling key applications within LAE, such a… ▽ More

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 25 pages, 12 figures, published to TCCN

  12. arXiv:2504.14148  [pdf, other

    cond-mat.mtrl-sci

    Charge Densities in Crystals and Triply-Periodic Minimal Surfaces

    Authors: Mengdi Yin, Dimitri D. Vvedensky

    Abstract: The relationship between surfaces of constant charge density and triply-periodic minimal surfaces (TPMS) has been the subject of considerable speculation over many years. Zero-potential surfaces generated in crystals by an electrostatic field from a distribution of point charges provide an approximate description of the TPMS for that crystal. We have recently provided a first-principles alternativ… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  13. arXiv:2504.10983  [pdf, other

    cs.LG cs.AI q-bio.BM

    ProtFlow: Fast Protein Sequence Design via Flow Matching on Compressed Protein Language Model Embeddings

    Authors: Zitai Kong, Yiheng Zhu, Yinlong Xu, Hanjing Zhou, Mingzhe Yin, Jialu Wu, Hongxia Xu, Chang-Yu Hsieh, Tingjun Hou, Jian Wu

    Abstract: The design of protein sequences with desired functionalities is a fundamental task in protein engineering. Deep generative methods, such as autoregressive models and diffusion models, have greatly accelerated the discovery of novel protein sequences. However, these methods mainly focus on local or shallow residual semantics and suffer from low inference efficiency, large modeling space and high tr… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  14. arXiv:2504.10044  [pdf, ps, other

    cs.CV

    Aligning Anime Video Generation with Human Feedback

    Authors: Bingwen Zhu, Yudong Jiang, Baohan Xu, Siqian Yang, Mingyu Yin, Yidi Wu, Huyang Sun, Zuxuan Wu

    Abstract: Anime video generation faces significant challenges due to the scarcity of anime data and unusual motion patterns, leading to issues such as motion distortion and flickering artifacts, which result in misalignment with human preferences. Existing reward models, designed primarily for real-world videos, fail to capture the unique appearance and consistency requirements of anime. In this work, we pr… ▽ More

    Submitted 24 June, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: 10 pages, 7 figures, 7 tables

  15. Entertainers Between Real and Virtual -- Investigating Viewer Interaction, Engagement, and Relationships with Avatarized Virtual Livestreamers

    Authors: Michael Yin, Chenxinran Shen, Robert Xiao

    Abstract: Virtual YouTubers (VTubers) are avatar-based livestreamers that are voiced and played by human actors. VTubers have been popular in East Asia for years and have more recently seen widespread international growth. Despite their emergent popularity, research has been scarce into the interactions and relationships that exist between avatarized VTubers and their viewers, particularly in contrast to no… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 15 pages, to be published in the ACM International Conference on Interactive Media Experiences (IMX'25)

  16. VIBES: Exploring Viewer Spatial Interactions as Direct Input for Livestreamed Content

    Authors: Michael Yin, Robert Xiao

    Abstract: Livestreaming has rapidly become a popular online pastime, with real-time interaction between streamer and viewer being a key motivating feature. However, viewers have traditionally had limited opportunity to directly influence the streamed content; even when such interactions are possible, it has been reliant on text-based chat. We investigate the potential of spatial interaction on the livestrea… ▽ More

    Submitted 11 April, 2025; originally announced April 2025.

    Comments: 20 pages, 11 figures, to be published in the ACM International Conference on Interactive Media Experiences (IMX'25)

  17. arXiv:2504.00334  [pdf

    q-bio.QM

    Pharmacokinetic characteristics of Jinhong tablets in normal, chronic superficial gastritis and intestinal microbial disorder rats

    Authors: Tingyu Zhang, Jian Feng, Xia Gao, Xialin Chen, Hongyu Peng, Xiaoxue Fan, Xin Meng, Mingke Yin, Zhenzhong Wang, Bo Zhang, Liang Cao

    Abstract: Jinhong tablet (JHT), a traditional Chinese medicine made from four herbs, effectively treats chronic superficial gastritis (CSG) by soothing the liver, relieving depression, regulating qi, and promoting blood circulation. However, its pharmacokinetics are underexplored. This study investigates JHT's pharmacokinetics in normal rats and its differences in normal, CSG, and intestinal microbial disor… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  18. arXiv:2503.16033  [pdf

    cond-mat.mes-hall

    Dynamic Carrier Modulation via Nonlinear Acoustoelectric Transport in van der Waals Heterostructures

    Authors: Timothy J. McSorley, Kaustubh Simha, James E. Corcoran, Izzie J. Catanzaro, Haochong Zhang, Meitong Yin, Tzu-Ming Lu, Davis Thuillier, Marshall A. Campbell, Thomas Scaffidi, Luis A. Jauregui

    Abstract: Dynamically manipulating carriers in van der Waals heterostructures could enable solid-state quantum simulators with tunable lattice parameters. A key requirement is forming deep potential wells to reliably trap excitations. Here, we report the observation of nonlinear acoustoelectric transport and dynamic carrier modulation in boron nitride-encapsulated graphene devices coupled to intense surface… ▽ More

    Submitted 20 March, 2025; originally announced March 2025.

  19. arXiv:2503.12555  [pdf, other

    cond-mat.mtrl-sci

    Density-Functional Theory and Triply-Periodic Minimal Surfaces

    Authors: Mengdi Yin, Jing Zhang, Dimitri D Vvedensky

    Abstract: Several authors have suggested that the surfaces of vanishing potential generated by the electrostatic fields from a distribution of point charges resemble triply periodic minimal surfaces (TPMS) corresponding to the positions of the point charges. We provide a theoretical basis for this phenomenological comparison by starting with the Boltzmann equation to show that the surface corresponding to z… ▽ More

    Submitted 18 April, 2025; v1 submitted 16 March, 2025; originally announced March 2025.

  20. arXiv:2503.10742  [pdf, other

    cs.LG cs.CL

    Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing

    Authors: Yudong Liu, Jingwei Sun, Yueqian Lin, Jingyang Zhang, Ming Yin, Qinsi Wang, Jianyi Zhang, Hai Li, Yiran Chen

    Abstract: Vision language models (VLMs) demonstrate strong capabilities in jointly processing visual and textual data. However, they often incur substantial computational overhead due to redundant visual information, particularly in long-form video scenarios. Existing approaches predominantly focus on either vision token pruning, which may overlook spatio-temporal dependencies, or keyframe selection, which… ▽ More

    Submitted 24 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  21. arXiv:2503.10569  [pdf, ps, other

    eess.SY eess.SP

    Low-Rank Matrix Regression via Least-Angle Regression

    Authors: Mingzhou Yin, Matthias A. Müller

    Abstract: Low-rank matrix regression is a fundamental problem in data science with various applications in systems and control. Nuclear norm regularization has been widely applied to solve this problem due to its convexity. However, it suffers from high computational complexity and the inability to directly specify the rank. This work introduces a novel framework for low-rank matrix regression that addresse… ▽ More

    Submitted 3 June, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  22. arXiv:2503.08125  [pdf, other

    eess.SP

    Quantization Design for Deep Learning-Based CSI Feedback

    Authors: Manru Yin, Shengqian Han, Chenyang Yang

    Abstract: Deep learning-based autoencoders have been employed to compress and reconstruct channel state information (CSI) in frequency-division duplex systems. Practical implementations require judicious quantization of encoder outputs for digital transmission. In this paper, we propose a novel quantization module with bit allocation among encoder outputs and develop a method for joint training the module a… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  23. arXiv:2503.06638  [pdf, other

    eess.SP

    Learning of Uplink Resource Allocation with Multiuser QoS Constraints

    Authors: Manru Yin, Shengqian Han, Chenyang Yang

    Abstract: In the paper the joint optimization of uplink multiuser power and resource block (RB) allocation are studied, where each user has quality of service (QoS) constraints on both long- and short-blocklength transmissions. The objective is to minimize the consumption of RBs for meeting the QoS requirements, leading to a mixed-integer nonlinear programming (MINLP) problem. We resort to deep learning to… ▽ More

    Submitted 9 March, 2025; originally announced March 2025.

  24. arXiv:2503.04362  [pdf, other

    cs.LG cs.AI q-bio.BM

    A Generalist Cross-Domain Molecular Learning Framework for Structure-Based Drug Discovery

    Authors: Yiheng Zhu, Mingyang Li, Junlong Liu, Kun Fu, Jiansheng Wu, Qiuyi Li, Mingze Yin, Jieping Ye, Jian Wu, Zheng Wang

    Abstract: Structure-based drug discovery (SBDD) is a systematic scientific process that develops new drugs by leveraging the detailed physical structure of the target protein. Recent advancements in pre-trained models for biomolecules have demonstrated remarkable success across various biochemical applications, including drug discovery and protein engineering. However, in most approaches, the pre-trained mo… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  25. arXiv:2503.02990  [pdf, ps, other

    math.CO

    Descents and flag major index on conjugacy classes of colored permutation groups without short cycles

    Authors: Kevin Liu, Mei Yin

    Abstract: We consider the descent and flag major index statistics on the colored permutation groups, which are wreath products of the form $\mathfrak{S}_{n,r}=\mathbb{Z}_r\wr \mathfrak{S}_n$. We show that the $k$-th moments of these statistics on $\mathfrak{S}_{n,r}$ will coincide with the corresponding moments on all conjugacy classes without cycles of lengths $1,2,\ldots,2k$. Using this, we establish the… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    MSC Class: 05A05; 05E16; 60C05

  26. arXiv:2502.21011  [pdf, other

    cs.CV

    MagNet: Multi-Level Attention Graph Network for Predicting High-Resolution Spatial Transcriptomics

    Authors: Junchao Zhu, Ruining Deng, Tianyuan Yao, Juming Xiong, Chongyu Qu, Junlin Guo, Siqi Lu, Yucheng Tang, Daguang Xu, Mengmeng Yin, Yu Wang, Shilin Zhao, Yaohong Wang, Haichun Yang, Yuankai Huo

    Abstract: The rapid development of spatial transcriptomics (ST) offers new opportunities to explore the gene expression patterns within the spatial microenvironment. Current research integrates pathological images to infer gene expression, addressing the high costs and time-consuming processes to generate spatial transcriptomics data. However, as spatial transcriptomics resolution continues to improve, exis… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

  27. arXiv:2502.11919  [pdf, other

    cs.HC cs.CL

    From Text to Trust: Empowering AI-assisted Decision Making with Adaptive LLM-powered Analysis

    Authors: Zhuoyan Li, Hangxiao Zhu, Zhuoran Lu, Ziang Xiao, Ming Yin

    Abstract: AI-assisted decision making becomes increasingly prevalent, yet individuals often fail to utilize AI-based decision aids appropriately especially when the AI explanations are absent, potentially as they do not %understand reflect on AI's decision recommendations critically. Large language models (LLMs), with their exceptional conversational and analytical capabilities, present great opportunities… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: CHI 2025

  28. arXiv:2502.07302  [pdf, other

    cs.CV

    CASC-AI: Consensus-aware Self-corrective Learning for Noise Cell Segmentation

    Authors: Ruining Deng, Yihe Yang, David J. Pisapia, Benjamin Liechty, Junchao Zhu, Juming Xiong, Junlin Guo, Zhengyi Lu, Jiacheng Wang, Xing Yao, Runxuan Yu, Rendong Zhang, Gaurav Rudravaram, Mengmeng Yin, Pinaki Sarder, Haichun Yang, Yuankai Huo, Mert R. Sabuncu

    Abstract: Multi-class cell segmentation in high-resolution gigapixel whole slide images (WSIs) is crucial for various clinical applications. However, training such models typically requires labor-intensive, pixel-wise annotations by domain experts. Recent efforts have democratized this process by involving lay annotators without medical expertise. However, conventional non-corrective approaches struggle to… ▽ More

    Submitted 10 March, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  29. arXiv:2502.07292  [pdf, other

    cs.HC

    Investigating Creativity in Humans and Generative AI Through Circles Exercises

    Authors: Runlin Duan, Shao-Kang Hsia, Yuzhao Chen, Yichen Hu, Ming Yin, Karthik Ramani

    Abstract: Generative AI (GenAI) is transforming the creativity process. However, as presented in this paper, GenAI encounters "narrow creativity" barriers. We observe that both humans and GenAI focus on limited subsets of the design space. We investigate this phenomenon using the "Circles Exercise," a creativity test widely used to examine the creativity of humans. Quantitative analysis reveals that humans… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  30. arXiv:2502.07288  [pdf, other

    cs.CV cs.AI

    KPIs 2024 Challenge: Advancing Glomerular Segmentation from Patch- to Slide-Level

    Authors: Ruining Deng, Tianyuan Yao, Yucheng Tang, Junlin Guo, Siqi Lu, Juming Xiong, Lining Yu, Quan Huu Cap, Pengzhou Cai, Libin Lan, Ze Zhao, Adrian Galdran, Amit Kumar, Gunjan Deotale, Dev Kumar Das, Inyoung Paik, Joonho Lee, Geongyu Lee, Yujia Chen, Wangkai Li, Zhaoyang Li, Xuege Hou, Zeyuan Wu, Shengjin Wang, Maximilian Fischer , et al. (22 additional authors not shown)

    Abstract: Chronic kidney disease (CKD) is a major global health issue, affecting over 10% of the population and causing significant mortality. While kidney biopsy remains the gold standard for CKD diagnosis and treatment, the lack of comprehensive benchmarks for kidney pathology segmentation hinders progress in the field. To address this, we organized the Kidney Pathology Image Segmentation (KPIs) Challenge… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  31. arXiv:2502.07110  [pdf, other

    math.PR math.CO

    Limit distributions for cycles of random parking functions

    Authors: J. E. Paguyo, Mei Yin

    Abstract: We study the asymptotic behavior of cycles of uniformly random parking functions. Our results are multifold: we obtain an explicit formula for the number of parking functions with a prescribed number of cyclic points and show that the scaled number of cyclic points of a random parking function is asymptotically Rayleigh distributed; we establish the classical trio of limit theorems (law of large n… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: 19 pages, 2 figures, comments welcome!

    MSC Class: 60C05; 60F05

  32. arXiv:2502.06453  [pdf, other

    cs.LG cs.AI cs.CL

    MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations

    Authors: Kaixuan Huang, Jiacheng Guo, Zihao Li, Xiang Ji, Jiawei Ge, Wenzhe Li, Yingqing Guo, Tianle Cai, Hui Yuan, Runzhe Wang, Yue Wu, Ming Yin, Shange Tang, Yangsibo Huang, Chi Jin, Xinyun Chen, Chiyuan Zhang, Mengdi Wang

    Abstract: Large language models have demonstrated impressive performance on challenging mathematical reasoning tasks, which has triggered the discussion of whether the performance is achieved by true reasoning capability or memorization. To investigate this question, prior work has constructed mathematical benchmarks when questions undergo simple perturbations -- modifications that still preserve the underl… ▽ More

    Submitted 12 February, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: v2: fix bugs in Fig. 1

  33. arXiv:2502.03266  [pdf, other

    cs.CV cs.RO

    ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models

    Authors: Ying Zhang, Maoliang Yin, Wenfu Bi, Haibao Yan, Shaohan Bian, Cui-Hua Zhang, Changchun Hua

    Abstract: Service robots operating in unstructured environments must effectively recognize and segment unknown objects to enhance their functionality. Traditional supervised learningbased segmentation techniques require extensive annotated datasets, which are impractical for the diversity of objects encountered in real-world scenarios. Unseen Object Instance Segmentation (UOIS) methods aim to address this b… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  34. TD3: Tucker Decomposition Based Dataset Distillation Method for Sequential Recommendation

    Authors: Jiaqing Zhang, Mingjia Yin, Hao Wang, Yawen Li, Yuyang Ye, Xingyu Lou, Junping Du, Enhong Chen

    Abstract: In the era of data-centric AI, the focus of recommender systems has shifted from model-centric innovations to data-centric approaches. The success of modern AI models is built on large-scale datasets, but this also results in significant training costs. Dataset distillation has emerged as a key solution, condensing large datasets to accelerate model training while preserving model performance. How… ▽ More

    Submitted 6 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: This work has been accepted by WWW2025

  35. arXiv:2502.01160  [pdf, ps, other

    cs.AI cs.IT

    Scalable Precise Computation of Shannon Entropy

    Authors: Yong Lai, Haolong Tong, Zhenghang Xu, Minghao Yin

    Abstract: Quantitative information flow analyses (QIF) are a class of techniques for measuring the amount of confidential information leaked by a program to its public outputs. Shannon entropy is an important method to quantify the amount of leakage in QIF. This paper focuses on the programs modeled in Boolean constraints and optimizes the two stages of the Shannon entropy computation to implement a scalabl… ▽ More

    Submitted 14 June, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: 19 pages, 5 figures

  36. arXiv:2502.00269  [pdf, other

    math.PR math.CO

    Probabilistic $(m,n)$-Parking Functions

    Authors: Pamela E. Harris, Rodrigo Ribeiro, Mei Yin

    Abstract: In this article, we establish new results on the probabilistic parking model (introduced by Durmíc, Han, Harris, Ribeiro, and Yin) with $m$ cars and $n$ parking spots and probability parameter $p\in[0,1]$. For any $ m \leq n$ and $p \in [0,1]$, we study the parking preference of the last car, denoted $a_m$, and determine the conditional distribution of $a_m$ and compute its expected value. We show… ▽ More

    Submitted 31 January, 2025; originally announced February 2025.

    Comments: 17 pages, 2 figures

  37. arXiv:2501.15849  [pdf, ps, other

    eess.SY cs.LG

    Gaussian Process-Based Prediction and Control of Hammerstein-Wiener Systems

    Authors: Mingzhou Yin, Matthias A. Müller

    Abstract: This work investigates data-driven prediction and control of Hammerstein-Wiener systems using physics-informed Gaussian process models. Data-driven prediction algorithms have been developed for structured nonlinear systems based on Willems' fundamental lemma. However, existing frameworks cannot treat output nonlinearities and require a dictionary of basis functions for Hammerstein systems. In this… ▽ More

    Submitted 27 January, 2025; originally announced January 2025.

  38. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  39. arXiv:2501.09804  [pdf, other

    cs.LG cs.AI cs.CL

    Enhancing Generalization in Chain of Thought Reasoning for Smaller Models

    Authors: Maxwell J. Yin, Dingyi Jiang, Yongbing Chen, Boyu Wang, Charles Ling

    Abstract: Chain-of-Thought (CoT) reasoning in smaller language models is a challenging natural language process problem yet highly desirable in many real-life applications. Existing CoT knowledge distillation methods often suffer from overly conservative memorization in smaller LLMs, leading to low generalization confidence. As fully preserving the CoT ability of teacher model is impossible, we hypothesize… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  40. arXiv:2501.08555  [pdf, other

    eess.SP

    Low-Complex Waveform, Modulation and Coding Designs for 3GPP Ambient IoT

    Authors: Mingxi Yin, Chao Wei, Kazuki Takeda, Yinhua Jia, Changlong Xu, Chengjin Zhang, Hao Xu

    Abstract: This paper presents a comprehensive study on low-complexity waveform, modulation and coding (WMC) designs for the 3rd Generation Partnership Project (3GPP) Ambient Internet of Things (A-IoT). A-IoT is a low-cost, low-power IoT system inspired by Ultra High Frequency (UHF) Radio Frequency Identification (RFID) and aims to leverage existing cellular network infrastructure for efficient RF tag manage… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: This work has been submitted to the IEEE (IEEE Communications Standards Magazine, Special Issue for Ambient IoT) for possible publication

  41. arXiv:2501.06151  [pdf, other

    eess.IV cs.CV

    PySpatial: A High-Speed Whole Slide Image Pathomics Toolkit

    Authors: Yuechen Yang, Yu Wang, Tianyuan Yao, Ruining Deng, Mengmeng Yin, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Whole Slide Image (WSI) analysis plays a crucial role in modern digital pathology, enabling large-scale feature extraction from tissue samples. However, traditional feature extraction pipelines based on tools like CellProfiler often involve lengthy workflows, requiring WSI segmentation into patches, feature extraction at the patch level, and subsequent mapping back to the original WSI. To address… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

  42. arXiv:2501.05361  [pdf, other

    cs.LG

    No-Regret Linear Bandits under Gap-Adjusted Misspecification

    Authors: Chong Liu, Dan Qiao, Ming Yin, Ilija Bogunovic, Yu-Xiang Wang

    Abstract: This work studies linear bandits under a new notion of gap-adjusted misspecification and is an extension of Liu et al. (2023). When the underlying reward function is not linear, existing linear bandits work usually relies on a uniform misspecification parameter $ε$ that measures the sup-norm error of the best linear approximation. This results in an unavoidable linear regret whenever $ε> 0$. We pr… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2302.13252

  43. arXiv:2501.02089  [pdf, other

    cs.LG cs.AI

    On the Statistical Complexity for Offline and Low-Adaptive Reinforcement Learning with Structures

    Authors: Ming Yin, Mengdi Wang, Yu-Xiang Wang

    Abstract: This article reviews the recent advances on the statistical foundation of reinforcement learning (RL) in the offline and low-adaptive settings. We will start by arguing why offline RL is the appropriate model for almost any real-life ML problems, even if they have nothing to do with the recent AI breakthroughs that use RL. Then we will zoom into two fundamental problems of offline RL: offline poli… ▽ More

    Submitted 3 January, 2025; originally announced January 2025.

    Comments: Review Article

  44. arXiv:2412.20014  [pdf, other

    cs.LG cs.AI q-bio.BM

    ProtCLIP: Function-Informed Protein Multi-Modal Learning

    Authors: Hanjing Zhou, Mingze Yin, Wei Wu, Mingyang Li, Kun Fu, Jintai Chen, Jian Wu, Zheng Wang

    Abstract: Multi-modality pre-training paradigm that aligns protein sequences and biological descriptions has learned general protein representations and achieved promising performance in various downstream applications. However, these works were still unable to replicate the extraordinary success of language-supervised visual foundation models due to the ineffective usage of aligned protein-text paired data… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

    Journal ref: AAAI 2025

  45. arXiv:2412.16118  [pdf, other

    physics.med-ph cs.AI

    Convolutional Deep Operator Networks for Learning Nonlinear Focused Ultrasound Wave Propagation in Heterogeneous Spinal Cord Anatomy

    Authors: Avisha Kumar, Xuzhe Zhi, Zan Ahmad, Minglang Yin, Amir Manbachi

    Abstract: Focused ultrasound (FUS) therapy is a promising tool for optimally targeted treatment of spinal cord injuries (SCI), offering submillimeter precision to enhance blood flow at injury sites while minimizing impact on surrounding tissues. However, its efficacy is highly sensitive to the placement of the ultrasound source, as the spinal cord's complex geometry and acoustic heterogeneity distort and at… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: Accepted for oral presentation at AAAI Conference on Artificial Intelligence: AI for Accelerating Science and Engineering Workshop 2025

  46. arXiv:2412.10255  [pdf, other

    cs.GR cs.AI

    AniSora: Exploring the Frontiers of Animation Video Generation in the Sora Era

    Authors: Yudong Jiang, Baohan Xu, Siqian Yang, Mingyu Yin, Jing Liu, Chao Xu, Siqi Wang, Yidi Wu, Bingwen Zhu, Xinwen Zhang, Xingyu Zheng, Jixuan Xu, Yue Zhang, Jinlong Hou, Huyang Sun

    Abstract: Animation has gained significant interest in the recent film and TV industry. Despite the success of advanced video generation models like Sora, Kling, and CogVideoX in generating natural videos, they lack the same effectiveness in handling animation videos. Evaluating animation video generation is also a great challenge due to its unique artist styles, violating the laws of physics and exaggerate… ▽ More

    Submitted 22 May, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

  47. arXiv:2412.03026  [pdf, other

    cs.CV

    ASIGN: An Anatomy-aware Spatial Imputation Graphic Network for 3D Spatial Transcriptomics

    Authors: Junchao Zhu, Ruining Deng, Tianyuan Yao, Juming Xiong, Chongyu Qu, Junlin Guo, Siqi Lu, Mengmeng Yin, Yu Wang, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Spatial transcriptomics (ST) is an emerging technology that enables medical computer vision scientists to automatically interpret the molecular profiles underlying morphological features. Currently, however, most deep learning-based ST analyses are limited to two-dimensional (2D) sections, which can introduce diagnostic errors due to the heterogeneity of pathological tissues across 3D sections. Ex… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

  48. arXiv:2411.18795  [pdf, other

    cs.CV

    GloFinder: AI-empowered QuPath Plugin for WSI-level Glomerular Detection, Visualization, and Curation

    Authors: Jialin Yue, Tianyuan Yao, Ruining Deng, Siqi Lu, Junlin Guo, Quan Liu, Mengmeng Yin, Juming Xiong, Haichun Yang, Yuankai Huo

    Abstract: Artificial intelligence (AI) has demonstrated significant success in automating the detection of glomeruli, the key functional units of the kidney, from whole slide images (WSIs) in kidney pathology. However, existing open-source tools are often distributed as source code or Docker containers, requiring advanced programming skills that hinder accessibility for non-programmers, such as clinicians.… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

  49. arXiv:2411.18014  [pdf, other

    cs.LG

    Diffeomorphic Latent Neural Operators for Data-Efficient Learning of Solutions to Partial Differential Equations

    Authors: Zan Ahmad, Shiyi Chen, Minglang Yin, Avisha Kumar, Nicolas Charon, Natalia Trayanova, Mauro Maggioni

    Abstract: A computed approximation of the solution operator to a system of partial differential equations (PDEs) is needed in various areas of science and engineering. Neural operators have been shown to be quite effective at predicting these solution generators after training on high-fidelity ground truth data (e.g. numerical simulations). However, in order to generalize well to unseen spatial domains, neu… ▽ More

    Submitted 29 November, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

  50. arXiv:2411.16961  [pdf, other

    eess.IV cs.CV

    Glo-In-One-v2: Holistic Identification of Glomerular Cells, Tissues, and Lesions in Human and Mouse Histopathology

    Authors: Lining Yu, Mengmeng Yin, Ruining Deng, Quan Liu, Tianyuan Yao, Can Cui, Junlin Guo, Yu Wang, Yaohong Wang, Shilin Zhao, Haichun Yang, Yuankai Huo

    Abstract: Segmenting glomerular intraglomerular tissue and lesions traditionally depends on detailed morphological evaluations by expert nephropathologists, a labor-intensive process susceptible to interobserver variability. Our group previously developed the Glo-In-One toolkit for integrated detection and segmentation of glomeruli. In this study, we leverage the Glo-In-One toolkit to version 2 with fine-gr… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.