Skip to main content

Showing 1–50 of 561 results for author: Fang, L

.
  1. arXiv:2506.09909  [pdf, ps, other

    cs.GR

    TransGI: Real-Time Dynamic Global Illumination With Object-Centric Neural Transfer Model

    Authors: Yijie Deng, Lei Han, Lu Fang

    Abstract: Neural rendering algorithms have revolutionized computer graphics, yet their impact on real-time rendering under arbitrary lighting conditions remains limited due to strict latency constraints in practical applications. The key challenge lies in formulating a compact yet expressive material representation. To address this, we propose TransGI, a novel neural rendering method for real-time, high-fid… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  2. arXiv:2506.09420  [pdf, ps, other

    cs.AI cs.CL cs.HC cs.LG cs.MA

    A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy

    Authors: Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Chunyu Miao, Dongyuan Li, Aiwei Liu, Yue Zhou, Yankai Chen, Weizhi Zhang, Yangning Li, Liancheng Fang, Renhe Jiang, Philip S. Yu

    Abstract: Recent improvements in large language models (LLMs) have led many researchers to focus on building fully autonomous AI agents. This position paper questions whether this approach is the right path forward, as these autonomous systems still have problems with reliability, transparency, and understanding the actual requirements of human. We suggest a different approach: LLM-based Human-Agent Systems… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  3. arXiv:2506.08377  [pdf, ps, other

    cond-mat.soft math-ph

    Micro-Macro Modeling of Polymeric Fluids with Multi-Bead Polymer Chain

    Authors: Xuelian Bao, Lidong Fang, Huaxiong Huang, Zilong Song, Shixin Xu

    Abstract: This work extends the classical dumbbell (two-bead) model of polymer chains to a more detailed multi-bead representation, where each polymer chain consists of $N$ beads connected by $N-1$ springs. We develop a thermodynamically consistent micro-macro model based on the energy variational method to describe the coupled dynamics of polymer configurations and fluid flow. The resulting framework captu… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2506.06341  [pdf, ps, other

    cs.IR cs.AI cs.CY

    NR4DER: Neural Re-ranking for Diversified Exercise Recommendation

    Authors: Xinghe Cheng, Xufang Zhou, Liangda Fang, Chaobo He, Yuyu Zhou, Weiqi Luo, Zhiguo Gong, Quanlong Guan

    Abstract: With the widespread adoption of online education platforms, an increasing number of students are gaining new knowledge through Massive Open Online Courses (MOOCs). Exercise recommendation have made strides toward improving student learning outcomes. However, existing methods not only struggle with high dropout rates but also fail to match the diverse learning pace of students. They frequently face… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: accepted for presentation at the SIGIR 2025 Full Papers track

  5. arXiv:2505.24480  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Effective Code-Integrated Reasoning

    Authors: Fei Bai, Yingqian Min, Beichen Zhang, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

    Abstract: In this paper, we investigate code-integrated reasoning, where models generate code when necessary and integrate feedback by executing it through a code interpreter. To acquire this capability, models must learn when and how to use external code tools effectively, which is supported by tool-augmented reinforcement learning (RL) through interactive learning. Despite its benefits, tool-augmented RL… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: Technical Report on Slow Thinking with LLMs: Code-Integrated Reasoning

  6. arXiv:2505.24267  [pdf, ps, other

    cs.CR

    MUSE: Model-Agnostic Tabular Watermarking via Multi-Sample Selection

    Authors: Liancheng Fang, Aiwei Liu, Henry Peng Zou, Yankai Chen, Hengrui Zhang, Zhongfen Deng, Philip S. Yu

    Abstract: We introduce MUSE, a watermarking algorithm for tabular generative models. Previous approaches typically leverage DDIM invertibility to watermark tabular diffusion models, but tabular diffusion models exhibit significantly poorer invertibility compared to other modalities, compromising performance. Simultaneously, tabular diffusion models require substantially less computation than other modalitie… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

  7. arXiv:2505.23125  [pdf, ps, other

    eess.SY

    Interturn Fault Detection in IPMSMs: Two Adaptive Observer-based Solutions

    Authors: Romeo Ortega, Alexey Bobtsov, Leyan Fang, Oscar Texis-Loaiza, Johannes Schiffer

    Abstract: In this paper we address the problem of online detection of inter-turn short-circuit faults (ITSCFs) that occur in permanent magnet synchronous motors (PMSMs). We propose two solutions to this problem: (i) a very simple linear observer and (ii) a generalized parameter estimation based observer, that incorporates a high performance estimator -- with both observers detecting the short-circuit curren… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  8. arXiv:2505.23112  [pdf, ps, other

    eess.SY

    Voltage Control of the Boost Converter: PI vs. Nonlinear Passivity-based Control

    Authors: Leyan Fang, Romeo Ortega, Robert Griñó

    Abstract: We carry-out a detailed analysis of direct voltage control of a Boost converter feeding a simple resistive load. First, we prove that using a classical PI control to stabilize a desired equilibrium leads to a very complicated dynamic behavior consisting of two equilibrium points, one of them always unstable for all PI gains and circuit parameter values. Interestingly, the second equilibrium point… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  9. arXiv:2505.19813  [pdf, ps, other

    cs.CV

    GoLF-NRT: Integrating Global Context and Local Geometry for Few-Shot View Synthesis

    Authors: You Wang, Li Fang, Hao Zhu, Fei Hu, Long Ye, Zhan Ma

    Abstract: Neural Radiance Fields (NeRF) have transformed novel view synthesis by modeling scene-specific volumetric representations directly from images. While generalizable NeRF models can generate novel views across unknown scenes by learning latent ray representations, their performance heavily depends on a large number of multi-view observations. However, with limited input views, these methods experien… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: CVPR 2025

  10. arXiv:2505.19793  [pdf, ps, other

    cs.CV

    Depth-Guided Bundle Sampling for Efficient Generalizable Neural Radiance Field Reconstruction

    Authors: Li Fang, Hao Zhu, Longlong Chen, Fei Hu, Long Ye, Zhan Ma

    Abstract: Recent advancements in generalizable novel view synthesis have achieved impressive quality through interpolation between nearby views. However, rendering high-resolution images remains computationally intensive due to the need for dense sampling of all rays. Recognizing that natural scenes are typically piecewise smooth and sampling all rays is often redundant, we propose a novel depth-guided bund… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: CVPR 2025

  11. arXiv:2505.17005  [pdf, ps, other

    cs.CL cs.AI cs.IR

    R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning

    Authors: Huatong Song, Jinhao Jiang, Wenqing Tian, Zhipeng Chen, Yuhuan Wu, Jiahao Zhao, Yingqian Min, Wayne Xin Zhao, Lei Fang, Ji-Rong Wen

    Abstract: Large Language Models (LLMs) are powerful but prone to hallucinations due to static knowledge. Retrieval-Augmented Generation (RAG) helps by injecting external information, but current methods often are costly, generalize poorly, or ignore the internal knowledge of the model. In this paper, we introduce R1-Searcher++, a novel framework designed to train LLMs to adaptively leverage both internal an… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  12. arXiv:2505.16834  [pdf, other

    cs.CL cs.AI cs.IR

    SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis

    Authors: Shuang Sun, Huatong Song, Yuhao Wang, Ruiyang Ren, Jinhao Jiang, Junjie Zhang, Fei Bai, Jia Deng, Wayne Xin Zhao, Zheng Liu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen

    Abstract: Retrieval-augmented generation (RAG) systems have advanced large language models (LLMs) in complex deep search scenarios requiring multi-step reasoning and iterative information retrieval. However, existing approaches face critical limitations that lack high-quality training trajectories or suffer from the distributional mismatches in simulated environments and prohibitive computational costs for… ▽ More

    Submitted 25 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

  13. arXiv:2505.13633  [pdf, ps, other

    cs.CV

    IPENS:Interactive Unsupervised Framework for Rapid Plant Phenotyping Extraction via NeRF-SAM2 Fusion

    Authors: Wentao Song, He Huang, Youqiang Sun, Fang Qu, Jiaqi Zhang, Longhui Fang, Yuwei Hao, Chenyang Peng

    Abstract: Advanced plant phenotyping technologies play a crucial role in targeted trait improvement and accelerating intelligent breeding. Due to the species diversity of plants, existing methods heavily rely on large-scale high-precision manually annotated data. For self-occluded objects at the grain level, unsupervised methods often prove ineffective. This study proposes IPENS, an interactive unsupervised… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  14. arXiv:2505.13478  [pdf, ps, other

    cs.PL cs.DB

    An Extensive Study on Text Serialization Formats and Methods

    Authors: Wang Wei, Li Na, Zhang Lei, Liu Fang, Chen Hao, Yang Xiuying, Huang Lei, Zhao Min, Wu Gang, Zhou Jie, Xu Jing, Sun Tao, Ma Li, Zhu Qiang, Hu Jun, Guo Wei, He Yong, Gao Yuan, Lin Dan, Zheng Yi, Shi Li

    Abstract: Text serialization is a fundamental concept in modern computing, enabling the conversion of complex data structures into a format that can be easily stored, transmitted, and reconstructed. This paper provides an extensive overview of text serialization, exploring its importance, prevalent formats, underlying methods, and comparative performance characteristics. We dive into the advantages and disa… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

  15. arXiv:2505.10063  [pdf, ps, other

    cs.CL

    CAFE: Retrieval Head-based Coarse-to-Fine Information Seeking to Enhance Multi-Document QA Capability

    Authors: Han Peng, Jinhao Jiang, Zican Dong, Wayne Xin Zhao, Lei Fang

    Abstract: Advancements in Large Language Models (LLMs) have extended their input context length, yet they still struggle with retrieval and reasoning in long-context inputs. Existing methods propose to utilize the prompt strategy and retrieval head to alleviate this limitation. However, they still face challenges in balancing retrieval precision and recall, impacting their efficacy in answering questions. T… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  16. arXiv:2505.04994  [pdf, other

    cs.CL cs.AI

    Rethinking Invariance in In-context Learning

    Authors: Lizhe Fang, Yifei Wang, Khashayar Gatmiry, Lei Fang, Yisen Wang

    Abstract: In-Context Learning (ICL) has emerged as a pivotal capability of auto-regressive large language models, yet it is hindered by a notable sensitivity to the ordering of context examples regardless of their mutual independence. To address this issue, recent studies have introduced several variant algorithms of ICL that achieve permutation invariance. However, many of these do not exhibit comparable p… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

  17. arXiv:2505.00753  [pdf, other

    cs.CL cs.LG

    A Survey on Large Language Model based Human-Agent Systems

    Authors: Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Yankai Chen, Chunyu Miao, Hoang Nguyen, Yue Zhou, Weizhi Zhang, Liancheng Fang, Langzhou He, Yangning Li, Dongyuan Li, Renhe Jiang, Xue Liu, Philip S. Yu

    Abstract: Recent advances in large language models (LLMs) have sparked growing interest in building fully autonomous agents. However, fully autonomous LLM-based agents still face significant challenges, including limited reliability due to hallucinations, difficulty in handling complex tasks, and substantial safety and ethical risks, all of which limit their feasibility and trustworthiness in real-world app… ▽ More

    Submitted 20 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

    Comments: Paper lists and resources are available at https://github.com/HenryPengZou/Awesome-LLM-Based-Human-Agent-Systems

  18. arXiv:2505.00029  [pdf, ps, other

    cs.CL cs.AI

    Keep the General, Inject the Specific: Structured Dialogue Fine-Tuning for Knowledge Injection without Catastrophic Forgetting

    Authors: Yijie Hong, Xiaofei Yin, Xinzhong Wang, Yi Tu, Ya Guo, Sufeng Duan, Weiqiang Wang, Lingyong Fang, Depeng Wang, Huijia Zhu

    Abstract: Large Vision Language Models have demonstrated impressive versatile capabilities through extensive multimodal pre-training, but face significant limitations when incorporating specialized knowledge domains beyond their training distribution. These models struggle with a fundamental dilemma: direct adaptation approaches that inject domain-specific knowledge often trigger catastrophic forgetting of… ▽ More

    Submitted 27 April, 2025; originally announced May 2025.

    Comments: 13 pages, 3 figures

  19. arXiv:2504.14772  [pdf, other

    cs.CL cs.LG stat.ML

    Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

    Authors: Luyang Fang, Xiaowei Yu, Jiazhang Cai, Yongkai Chen, Shushan Wu, Zhengliang Liu, Zhenyuan Yang, Haoran Lu, Xilin Gong, Yufang Liu, Terry Ma, Wei Ruan, Ali Abbasi, Jing Zhang, Tao Wang, Ehsan Latif, Wei Liu, Wei Zhang, Soheil Kolouri, Xiaoming Zhai, Dajiang Zhu, Wenxuan Zhong, Tianming Liu, Ping Ma

    Abstract: The exponential growth of Large Language Models (LLMs) continues to highlight the need for efficient strategies to meet ever-expanding computational and data demands. This survey provides a comprehensive analysis of two complementary paradigms: Knowledge Distillation (KD) and Dataset Distillation (DD), both aimed at compressing LLMs while preserving their advanced reasoning capabilities and lingui… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

  20. arXiv:2504.11064  [pdf

    cs.MA cs.RO eess.SY

    A Multi-UAV Formation Obstacle Avoidance Method Combined Improved Simulated Annealing and Adaptive Artificial Potential Field

    Authors: Bo Ma, Yi Ji, Liyong Fang

    Abstract: The traditional Artificial Potential Field (APF) method exhibits limitations in its force distribution: excessive attraction when UAVs are far from the target may cause collisions with obstacles, while insufficient attraction near the goal often results in failure to reach the target. Furthermore, APF is highly susceptible to local minima, compromising motion reliability in complex environments. T… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  21. arXiv:2504.10852  [pdf, other

    cs.CV

    Enhancing Features in Long-tailed Data Using Large Vision Model

    Authors: Pengxiao Han, Changkun Ye, Jinguang Tong, Cuicui Jiang, Jie Hong, Li Fang, Xuesong Li

    Abstract: Language-based foundation models, such as large language models (LLMs) or large vision-language models (LVLMs), have been widely studied in long-tailed recognition. However, the need for linguistic data is not applicable to all practical tasks. In this study, we aim to explore using large vision models (LVMs) or visual foundation models (VFMs) to enhance long-tailed data features without any langu… ▽ More

    Submitted 22 April, 2025; v1 submitted 15 April, 2025; originally announced April 2025.

  22. arXiv:2504.07394  [pdf, other

    cs.LG cs.AI

    ClimateBench-M: A Multi-Modal Climate Data Benchmark with a Simple Generative Method

    Authors: Dongqi Fu, Yada Zhu, Zhining Liu, Lecheng Zheng, Xiao Lin, Zihao Li, Liri Fang, Katherine Tieu, Onkar Bhardwaj, Kommy Weldemariam, Hanghang Tong, Hendrik Hamann, Jingrui He

    Abstract: Climate science studies the structure and dynamics of Earth's climate system and seeks to understand how climate changes over time, where the data is usually stored in the format of time series, recording the climate features, geolocation, time attributes, etc. Recently, much research attention has been paid to the climate benchmarks. In addition to the most common task of weather forecasting, sev… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

    Comments: Preprint, 29 pages

  23. The Mini-SiTian Array: first-two-year operation

    Authors: Min He, Hong Wu, Liang Ge, Jian-feng Tian, Zheng Wang, Hai-yang Mu, Yu Zhang, Yang Huang, Jie Zheng, Zhou Fan, Zheng-yang Li, Hong-hui Gu, Heng-geng Han, Kai Xiao, Zhi-rui Li, Jun-jie Jin, Bei-chuan Wang, Jun Ma, Jin-hang Zou, Ying Wu, Jiu-peng Guo, Li-guo Fang, Zhi-gang Hou, Bo-wen Zhang, Yun-fei Xu , et al. (48 additional authors not shown)

    Abstract: The SiTian project, designed to utilize 60 telescopes distributed across multiple sites in China, is a next-generation time-domain survey initiative. As a pathfinder for the SiTian project, the Mini-SiTian (MST) has been proposed and implemented to test the SiTian's brain and data pipeline, and to evaluate the feasibility of its technology and science cases. Mounted at the Xinglong Observatory, th… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 10 pages, 11 figures, Accepted for publication in a special issue of Research in Astronomy and Astrophysics on the Mini-SiTian Array

  24. arXiv:2503.21380  [pdf, other

    cs.CL

    Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models

    Authors: Haoxiang Sun, Yingqian Min, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Zheng Liu, Zhongyuan Wang, Ji-Rong Wen

    Abstract: In recent years, the rapid development of large reasoning models has resulted in the saturation of existing benchmarks for evaluating mathematical reasoning, highlighting the urgent need for more challenging and rigorous evaluation frameworks. To address this gap, we introduce OlymMATH, a novel Olympiad-level mathematical benchmark, designed to rigorously test the complex reasoning capabilities of… ▽ More

    Submitted 19 May, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: Technical Report on Slow Thinking with LLMs: Evaluation Benchmark

  25. arXiv:2503.13493  [pdf

    eess.SP cs.LG stat.AP

    Analysis of Learning-based Offshore Wind Power Prediction Models with Various Feature Combinations

    Authors: Linhan Fang, Fan Jiang, Ann Mary Toms, Xingpeng Li

    Abstract: Accurate wind speed prediction is crucial for designing and selecting sites for offshore wind farms. This paper investigates the effectiveness of various machine learning models in predicting offshore wind power for a site near the Gulf of Mexico by analyzing meteorological data. After collecting and preprocessing meteorological data, nine different input feature combinations were designed to asse… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  26. arXiv:2503.12375  [pdf, ps, other

    math.NA math-ph

    Optimization-based method for conjugate heat transfer problems

    Authors: Liang Fang, Xiandong Liu, Lei Zhang

    Abstract: We propose a numerical approach for solving conjugate heat transfer problems using the finite volume method. This approach combines a semi-implicit scheme for fluid flow, governed by the incompressible Navier-Stokes equations, with an optimization-based approach for heat transfer across the fluid-solid interface. In the semi-implicit method, the convective term in the momentum equation is treated… ▽ More

    Submitted 16 March, 2025; originally announced March 2025.

  27. arXiv:2503.11140  [pdf, other

    cs.CV

    Minding Fuzzy Regions: A Data-driven Alternating Learning Paradigm for Stable Lesion Segmentation

    Authors: Lexin Fang, Yunyang Xu, Xiang Ma, Xuemei Li, Caiming Zhang

    Abstract: Deep learning has achieved significant advancements in medical image segmentation, but existing models still face challenges in accurately segmenting lesion regions. The main reason is that some lesion regions in medical images have unclear boundaries, irregular shapes, and small tissue density differences, leading to label ambiguity. However, the existing model treats all data equally without tak… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 10 pages, 11 figures, accepted by CVPR 2025

  28. arXiv:2503.09774  [pdf, other

    cs.CL

    Efficient Multi-Task Inferencing: Model Merging with Gromov-Wasserstein Feature Alignment

    Authors: Luyang Fang, Ehsan Latif, Haoran Lu, Yifan Zhou, Ping Ma, Xiaoming Zhai

    Abstract: Automatic scoring of student responses enhances efficiency in education, but deploying a separate neural network for each task increases storage demands, maintenance efforts, and redundant computations. To address these challenges, this paper introduces the Gromov-Wasserstein Scoring Model Merging (GW-SMM) method, which merges models based on feature distribution similarities measured via the Grom… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Submitted to AIED2025

  29. arXiv:2503.05592  [pdf, other

    cs.AI cs.CL cs.IR

    R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

    Authors: Huatong Song, Jinhao Jiang, Yingqian Min, Jie Chen, Zhipeng Chen, Wayne Xin Zhao, Lei Fang, Ji-Rong Wen

    Abstract: Existing Large Reasoning Models (LRMs) have shown the potential of reinforcement learning (RL) to enhance the complex reasoning capabilities of Large Language Models~(LLMs). While they achieve remarkable performance on challenging tasks such as mathematics and coding, they often rely on their internal knowledge to solve problems, which can be inadequate for time-sensitive or knowledge-intensive qu… ▽ More

    Submitted 18 March, 2025; v1 submitted 7 March, 2025; originally announced March 2025.

  30. arXiv:2503.04548  [pdf, other

    cs.CL

    An Empirical Study on Eliciting and Improving R1-like Reasoning Models

    Authors: Zhipeng Chen, Yingqian Min, Beichen Zhang, Jie Chen, Jinhao Jiang, Daixuan Cheng, Wayne Xin Zhao, Zheng Liu, Xu Miao, Yang Lu, Lei Fang, Zhongyuan Wang, Ji-Rong Wen

    Abstract: In this report, we present the third technical report on the development of slow-thinking models as part of the STILL project. As the technical pathway becomes clearer, scaling RL training has become a central technique for implementing such reasoning models. We systematically experiment with and document the effects of various factors influencing RL training, conducting experiments on both base m… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: Technical Report on Slow Thinking with LLMs: Part III

  31. arXiv:2503.01205  [pdf, ps, other

    math.RA

    Simultaneous direct sum decompositions of several multivariate polynomials

    Authors: Lishan Fang, Hua-Lin Huang, Lili Liao

    Abstract: We consider the problem of simultaneous direct sum decomposition of a set of multivariate polynomials. To this end, we extend Harrison's center theory for a single homogeneous polynomial to this broader setting. It is shown that the center of a set of polynomials is a special Jordan algebra, and simultaneous direct sum decompositions of the given polynomials are in bijection with complete sets of… ▽ More

    Submitted 8 March, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    MSC Class: 15A69; 13P05

  32. arXiv:2503.01166  [pdf, ps, other

    math.RA

    Simultaneous block diagonalization of a set of symmetric matrices via congruence

    Authors: Lishan Fang, Hua-Lin Huang, Jiayan Huang

    Abstract: This article studies canonical forms derived from the finest simultaneous block diagonalization of a set of symmetric matrices via congruence. Our technique relies on Harrison's center theory, which is extended from a single higher degree form to multiple quadratic forms, hence a set of symmetric matrices. The algebraic structures of centers and the bijective relationship between the simultaneous… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

    MSC Class: 15A69; 15A20; 13P05

  33. arXiv:2503.00618  [pdf, other

    cs.SE cs.HC

    Show Me Why It's Correct: Saving 1/3 of Debugging Time in Program Repair with Interactive Runtime Comparison

    Authors: Ruixin Wang, Zhongkai Zhao, Le Fang, Nan Jiang, Yiling Lou, Lin Tan, Tianyi Zhang

    Abstract: Automated Program Repair (APR) holds the promise of alleviating the burden of debugging and fixing software bugs. Despite this, developers still need to manually inspect each patch to confirm its correctness, which is tedious and time-consuming. This challenge is exacerbated in the presence of plausible patches, which accidentally pass test cases but may not correctly fix the bug. To address this… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

    Comments: 27 pages, 8 figures, OOPSLA 2025

    Journal ref: Proc. ACM Program. Lang. 9, OOPSLA1, Article 145 (April 2025)

  34. arXiv:2502.19305  [pdf, other

    cs.LG cs.AI q-fin.RM q-fin.ST

    Corporate Fraud Detection in Rich-yet-Noisy Financial Graph

    Authors: Shiqi Wang, Zhibo Zhang, Libing Fang, Cam-Tu Nguyen, Wenzhong Li

    Abstract: Corporate fraud detection aims to automatically recognize companies that conduct wrongful activities such as fraudulent financial statements or illegal insider trading. Previous learning-based methods fail to effectively integrate rich interactions in the company network. To close this gap, we collect 18-year financial records in China to form three graph datasets with fraud labels. We analyze the… ▽ More

    Submitted 29 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  35. arXiv:2502.19163  [pdf, ps, other

    cs.CL cs.AI cs.IR cs.LG

    TestNUC: Enhancing Test-Time Computing Approaches and Scaling through Neighboring Unlabeled Data Consistency

    Authors: Henry Peng Zou, Zhengyao Gu, Yue Zhou, Yankai Chen, Weizhi Zhang, Liancheng Fang, Yibo Wang, Yangning Li, Kay Liu, Philip S. Yu

    Abstract: Test-time computing approaches, which leverage additional computational resources during inference, have been proven effective in enhancing large language model performance. This work introduces a novel, linearly scaling approach, TestNUC, that improves test-time predictions by leveraging the local consistency of neighboring unlabeled data-it classifies an input instance by considering not only th… ▽ More

    Submitted 31 May, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    Comments: Accepted by ACL 2025 main conference

  36. arXiv:2502.16804  [pdf, other

    cs.MA cs.AI

    Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances

    Authors: Yaozu Wu, Dongyuan Li, Yankai Chen, Renhe Jiang, Henry Peng Zou, Liancheng Fang, Zhen Wang, Philip S. Yu

    Abstract: Autonomous Driving Systems (ADSs) are revolutionizing transportation by reducing human intervention, improving operational efficiency, and enhancing safety. Large Language Models (LLMs), known for their exceptional planning and reasoning capabilities, have been integrated into ADSs to assist with driving decision-making. However, LLM-based single-agent ADSs face three major challenges: limited per… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  37. arXiv:2502.16414  [pdf, other

    cs.LG cs.AI

    TabGen-ICL: Residual-Aware In-Context Example Selection for Tabular Data Generation

    Authors: Liancheng Fang, Aiwei Liu, Hengrui Zhang, Henry Peng Zou, Weizhi Zhang, Philip S. Yu

    Abstract: Large Language models (LLMs) have achieved encouraging results in tabular data generation. However, existing approaches require fine-tuning, which is computationally expensive. This paper explores an alternative: prompting a fixed LLM with in-context examples. We observe that using randomly selected in-context examples hampers the LLM's performance, resulting in sub-optimal generation quality. To… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  38. arXiv:2502.14205  [pdf, other

    cs.LG cs.AI

    Accurate Forgetting for Heterogeneous Federated Continual Learning

    Authors: Abudukelimu Wuerkaixi, Sen Cui, Jingfeng Zhang, Kunda Yan, Bo Han, Gang Niu, Lei Fang, Changshui Zhang, Masashi Sugiyama

    Abstract: Recent years have witnessed a burgeoning interest in federated learning (FL). However, the contexts in which clients engage in sequential learning remain under-explored. Bridging FL and continual learning (CL) gives rise to a challenging practical problem: federated continual learning (FCL). Existing research in FCL primarily focuses on mitigating the catastrophic forgetting issue of continual lea… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: published in ICLR 2024

  39. arXiv:2502.07349  [pdf, other

    physics.flu-dyn

    The contribution of dilatational motion to energy flux in homogeneous compressible turbulence

    Authors: Chensheng Luo, Le Fang, Jian Fang, Haitao Xu, Alain Pumir, Ping-Fan Yang

    Abstract: We analyze the energy flux in compressible turbulence by generalizing the exact decomposition recently proposed by Johnson (Phys. Rev. Lett., vol. 124, 2020. 104501) to study incompressible turbulent flows. This allows us to characterize the effect of dilatational motion on the inter-scale energy transfer in three-dimensional compressible turbulence. Our analysis reveals that the contribution of d… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  40. SparseFormer: Detecting Objects in HRW Shots via Sparse Vision Transformer

    Authors: Wenxi Li, Yuchen Guo, Jilai Zheng, Haozhe Lin, Chao Ma, Lu Fang, Xiaokang Yang

    Abstract: Recent years have seen an increase in the use of gigapixel-level image and video capture systems and benchmarks with high-resolution wide (HRW) shots. However, unlike close-up shots in the MS COCO dataset, the higher resolution and wider field of view raise unique challenges, such as extreme sparsity and huge scale changes, causing existing close-up detectors inaccuracy and inefficiency. In this p… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: This paper is accepted to ACM MM 2024

  41. arXiv:2502.00748  [pdf, ps, other

    nucl-ex nucl-th

    Double-beta decay of $^{150}$Nd to excited levels of $^{150}$Sm

    Authors: A. S. Barabash, P. Belli, R. Bernabei, R. S. Boiko, F. Cappella, V. Caracciolo, R. Cerulli, F. A. Danevich, D. L. Fang, F. Ferella, A. Incicchitti, V. V. Kobychev, S. I. Konovalov, M. Laubenstein, A. Leoncini, V. Merlo, S. Nisi, O. Nitescu, D. V. Poda, O. G. Polischuk, I. B. -K. Shcherbakov, F. Simkovic, A. Timonina, V. S. Tinkova, V. I. Tretyak , et al. (1 additional authors not shown)

    Abstract: The $2\nu2β$ decay of $^{150}$Nd to the first excited 740.5 keV $0^{+}_{1}$ level of $^{150}$Sm was measured over 5.845 yr with the help of a four-crystal low-background HPGe $γ$ spectrometry system in the underground low-background laboratory STELLA of LNGS-INFN. A 2.381 kg highly purified Nd-containing sample was employed as the decay source. The expected de-excitation gamma-quanta of the… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: 61 pages, 19 figures, 14 tables

  42. arXiv:2501.16645  [pdf

    physics.optics

    Optical centroid orbiting metrology

    Authors: Liang Fang, Jinman Chen, Qinjun Chen, Chujun Zhao

    Abstract: Optical interferometry has dramatically advanced the development of modern science and technology. Here we introduce an interesting centroid evolution phenomenon of orbital angular momentum (OAM) interference fields with broken rotational symmetry, and establish a novel interferometric paradigm by fully exploiting centroid orbiting information. The centroid positions and their geometric trajectori… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

  43. arXiv:2501.07917  [pdf

    cs.ET physics.app-ph physics.optics

    Roadmap on Neuromorphic Photonics

    Authors: Daniel Brunner, Bhavin J. Shastri, Mohammed A. Al Qadasi, H. Ballani, Sylvain Barbay, Stefano Biasi, Peter Bienstman, Simon Bilodeau, Wim Bogaerts, Fabian Böhm, G. Brennan, Sonia Buckley, Xinlun Cai, Marcello Calvanese Strinati, B. Canakci, Benoit Charbonnier, Mario Chemnitz, Yitong Chen, Stanley Cheung, Jeff Chiles, Suyeon Choi, Demetrios N. Christodoulides, Lukas Chrostowski, J. Chu, J. H. Clegg , et al. (125 additional authors not shown)

    Abstract: This roadmap consolidates recent advances while exploring emerging applications, reflecting the remarkable diversity of hardware platforms, neuromorphic concepts, and implementation philosophies reported in the field. It emphasizes the critical role of cross-disciplinary collaboration in this rapidly evolving field.

    Submitted 16 January, 2025; v1 submitted 14 January, 2025; originally announced January 2025.

  44. arXiv:2501.06271  [pdf, other

    q-bio.QM cs.AI cs.CE

    Large Language Models for Bioinformatics

    Authors: Wei Ruan, Yanjun Lyu, Jing Zhang, Jiazhang Cai, Peng Shu, Yang Ge, Yao Lu, Shang Gao, Yue Wang, Peilong Wang, Lin Zhao, Tao Wang, Yufang Liu, Luyang Fang, Ziyu Liu, Zhengliang Liu, Yiwei Li, Zihao Wu, Junhao Chen, Hanqi Jiang, Yi Pan, Zhenyuan Yang, Jingyuan Chen, Shizhe Liang, Wei Zhang , et al. (30 additional authors not shown)

    Abstract: With the rapid advancements in large language model (LLM) technology and the emergence of bioinformatics-specific language models (BioLMs), there is a growing need for a comprehensive analysis of the current landscape, computational characteristics, and diverse applications. This survey aims to address this need by providing a thorough review of BioLMs, focusing on their evolution, classification,… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: 64 pages, 1 figure

  45. arXiv:2412.16918  [pdf, other

    cs.CV

    Detect Changes like Humans: Incorporating Semantic Priors for Improved Change Detection

    Authors: Yuhang Gan, Wenjie Xuan, Zhiming Luo, Lei Fang, Zengmao Wang, Juhua Liu, Bo Du

    Abstract: When given two similar images, humans identify their differences by comparing the appearance ({\it e.g., color, texture}) with the help of semantics ({\it e.g., objects, relations}). However, mainstream change detection models adopt a supervised training paradigm, where the annotated binary change map is the main constraint. Thus, these methods primarily emphasize the difference-aware features bet… ▽ More

    Submitted 22 December, 2024; originally announced December 2024.

  46. arXiv:2412.15546  [pdf, other

    math.OC cs.LG

    De-singularity Subgradient for the $q$-th-Powered $\ell_p$-Norm Weber Location Problem

    Authors: Zhao-Rong Lai, Xiaotian Wu, Liangda Fang, Ziliang Chen, Cheng Li

    Abstract: The Weber location problem is widely used in several artificial intelligence scenarios. However, the gradient of the objective does not exist at a considerable set of singular points. Recently, a de-singularity subgradient method has been proposed to fix this problem, but it can only handle the $q$-th-powered $\ell_2$-norm case ($1\leqslant q<2$), which has only finite singular points. In this pap… ▽ More

    Submitted 3 February, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: AAAI 2025

  47. arXiv:2412.11217  [pdf, ps, other

    cs.LO

    A Syntactic Approach to Computing Complete and Sound Abstraction in the Situation Calculus

    Authors: Liangda Fang, Xiaoman Wang, Zhang Chen, Kailun Luo, Zhenhe Cui, Quanlong Guan

    Abstract: Abstraction is an important and useful concept in the field of artificial intelligence. To the best of our knowledge, there is no syntactic method to compute a sound and complete abstraction from a given low-level basic action theory and a refinement mapping. This paper aims to address this issue.To this end, we first present a variant of situation calculus,namely linear integer situation calculus… ▽ More

    Submitted 13 January, 2025; v1 submitted 15 December, 2024; originally announced December 2024.

  48. arXiv:2412.06724  [pdf, other

    cs.DB cs.CL

    AutoDCWorkflow: LLM-based Data Cleaning Workflow Auto-Generation and Benchmark

    Authors: Lan Li, Liri Fang, Vetle I. Torvik

    Abstract: We investigate the reasoning capabilities of large language models (LLMs) for automatically generating data-cleaning workflows. To evaluate LLMs' ability to complete data-cleaning tasks, we implemented a pipeline for LLM-based Auto Data Cleaning Workflow (AutoDCWorkflow), prompting LLMs on data cleaning operations to repair three types of data quality issues: duplicates, missing values, and incons… ▽ More

    Submitted 12 December, 2024; v1 submitted 9 December, 2024; originally announced December 2024.

  49. arXiv:2412.02540  [pdf, other

    cs.CR

    Automatic State Machine Inference for Binary Protocol Reverse Engineering

    Authors: Junhai Yang, Fenghua Li, Yixuan Zhang, Junhao Zhang, Liang Fang, Yunchuan Guo

    Abstract: Protocol Reverse Engineering (PRE) is used to analyze protocols by inferring their structure and behavior. However, current PRE methods mainly focus on field identification within a single protocol and neglect Protocol State Machine (PSM) analysis in mixed protocol environments. This results in insufficient analysis of protocols' abnormal behavior and potential vulnerabilities, which are crucial f… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 4 pages,5 figures

  50. arXiv:2412.02454  [pdf, other

    cs.CL cs.AI cs.CR

    Gracefully Filtering Backdoor Samples for Generative Large Language Models without Retraining

    Authors: Zongru Wu, Pengzhou Cheng, Lingyong Fang, Zhuosheng Zhang, Gongshen Liu

    Abstract: Backdoor attacks remain significant security threats to generative large language models (LLMs). Since generative LLMs output sequences of high-dimensional token logits instead of low-dimensional classification logits, most existing backdoor defense methods designed for discriminative models like BERT are ineffective for generative LLMs. Inspired by the observed differences in learning behavior be… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: Accepted at COLING 2025