Skip to main content

Showing 1–50 of 163 results for author: Lou, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.03042  [pdf, ps, other

    cs.CL cs.AI

    Dynamic Long Short-Term Memory Based Memory Storage For Long Horizon LLM Interaction

    Authors: Yuyang Lou, Charles Li

    Abstract: Memory storage for Large Language models (LLMs) is becoming an increasingly active area of research, particularly for enabling personalization across long conversations. We propose Pref-LSTM, a dynamic and lightweight framework that combines a BERT-based classifier with a LSTM memory module that generates memory embedding which then is soft-prompt injected into a frozen LLM. We synthetically curat… ▽ More

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: 7 pages, 4 figures, 2 tables

    MSC Class: 68T05

  2. arXiv:2507.00390  [pdf, ps, other

    cs.LG

    MoNE: Replacing Redundant Experts with Lightweight Novices for Structured Pruning of MoE

    Authors: Geng Zhang, Yuxuan Han, Yuxuan Lou, Wangbo Zhao, Yiqi Zhang, Yang You

    Abstract: Mixture-of-Experts (MoE) enables efficient scaling of large language models by activating only a subset of experts per input token. However, deploying MoE-based models incurs significant memory overhead due to the need to retain all experts in memory. While structured pruning is promising to reduce memory costs, existing methods often show suboptimal performance and unstable degradation in three d… ▽ More

    Submitted 30 June, 2025; originally announced July 2025.

  3. arXiv:2506.10322  [pdf, ps, other

    cs.SE

    Minimizing False Positives in Static Bug Detection via LLM-Enhanced Path Feasibility Analysis

    Authors: Xueying Du, Kai Yu, Chong Wang, Yi Zou, Wentai Deng, Zuoyu Ou, Xin Peng, Lingming Zhang, Yiling Lou

    Abstract: Static bug analyzers play a crucial role in ensuring software quality. However, existing analyzers for bug detection in large codebases often suffer from high false positive rates. This is primarily due to the limited capabilities of analyzers in path feasibility validation with multiple conditional branches and complex data dependencies. While current LLM-based approaches attempt to address this… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  4. arXiv:2505.20749  [pdf, ps, other

    cs.AI cs.SE

    Can Agents Fix Agent Issues?

    Authors: Alfin Wijaya Rahardja, Junwei Liu, Weitong Chen, Zhenpeng Chen, Yiling Lou

    Abstract: LLM-based agent systems are emerging as a new software paradigm and have been widely adopted across diverse domains such as medicine, robotics, and programming. However, maintaining these systems requires substantial effort, as they are inevitably prone to bugs and continually evolve to meet changing external requirements. Therefore, automatically resolving agent issues (i.e., bug reports or featu… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 18 pages, 7 figures

  5. arXiv:2505.19489  [pdf, ps, other

    cs.AI cs.SE

    Benchmarking and Enhancing LLM Agents in Localizing Linux Kernel Bugs

    Authors: Zhenhao Zhou, Zhuochen Huang, Yike He, Chong Wang, Jiajun Wang, Yijian Wu, Xin Peng, Yiling Lou

    Abstract: The Linux kernel is a critical system, serving as the foundation for numerous systems. Bugs in the Linux kernel can cause serious consequences, affecting billions of users. Fault localization (FL), which aims at identifying the buggy code elements in software, plays an essential role in software quality assurance. While recent LLM agents have achieved promising accuracy in FL on recent benchmarks… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  6. arXiv:2505.17702  [pdf, ps, other

    cs.CV cs.AI

    Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

    Authors: Xueyang Li, Jiahao Li, Yu Song, Yunzhong Lou, Xiangdong Zhou

    Abstract: The advent of Computer-Aided Design (CAD) generative modeling will significantly transform the design of industrial products. The recent research endeavor has extended into the realm of Large Language Models (LLMs). In contrast to fine-tuning methods, training-free approaches typically utilize the advanced closed-source LLMs, thereby offering enhanced flexibility and efficiency in the development… ▽ More

    Submitted 29 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  7. arXiv:2505.16875  [pdf, ps, other

    cs.CV cs.AI cs.LG

    T2I-ConBench: Text-to-Image Benchmark for Continual Post-training

    Authors: Zhehao Huang, Yuhang Liu, Yixin Lou, Zhengbao He, Mingzhen He, Wenxing Zhou, Tao Li, Kehan Li, Zeyi Huang, Xiaolin Huang

    Abstract: Continual post-training adapts a single text-to-image diffusion model to learn new tasks without incurring the cost of separate models, but naive post-training causes forgetting of pretrained knowledge and undermines zero-shot compositionality. We observe that the absence of a standardized evaluation protocol hampers related research for continual post-training. To address this, we introduce T2I-C… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  8. arXiv:2505.14634  [pdf, other

    cs.CV eess.IV

    A General Framework for Group Sparsity in Hyperspectral Unmixing Using Endmember Bundles

    Authors: Gokul Bhusal, Yifei Lou, Cristina Garcia-Cardona, Ekaterina Merkurjev

    Abstract: Due to low spatial resolution, hyperspectral data often consists of mixtures of contributions from multiple materials. This limitation motivates the task of hyperspectral unmixing (HU), a fundamental problem in hyperspectral imaging. HU aims to identify the spectral signatures (\textit{endmembers}) of the materials present in an observed scene, along with their relative proportions (\textit{fracti… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  9. arXiv:2505.13905  [pdf, ps, other

    cs.CV cs.RO

    4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision

    Authors: Ruihan Liu, Xiaoyi Wu, Xijun Chen, Liang Hu, Yunjiang Lou

    Abstract: A comprehensive understanding of 3D scenes is essential for autonomous vehicles (AVs), and among various perception tasks, occupancy estimation plays a central role by providing a general representation of drivable and occupied space. However, most existing occupancy estimation methods rely on LiDAR or cameras, which perform poorly in degraded environments such as smoke, rain, snow, and fog. In th… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  10. arXiv:2505.04481  [pdf, ps, other

    cs.CV

    CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation

    Authors: Jiahao Li, Weijian Ma, Xueyang Li, Yunzhong Lou, Guichun Zhou, Xiangdong Zhou

    Abstract: Recently, Large Language Models (LLMs) have achieved significant success, prompting increased interest in expanding their generative capabilities beyond general text into domain-specific areas. This study investigates the generation of parametric sequences for computer-aided design (CAD) models using LLMs. This endeavor represents an initial step towards creating parametric 3D shapes with LLMs, as… ▽ More

    Submitted 10 June, 2025; v1 submitted 7 May, 2025; originally announced May 2025.

  11. arXiv:2504.16529  [pdf, other

    cs.DC

    6G EdgeAI: Performance Evaluation and Analysis

    Authors: Chien-Sheng Yang, Yu-Jen Ku, Yuan-Yao Lou, Nathan Tenny, Alex C. -C. Hsu

    Abstract: Generative AI (GenAI) services powered by large language models (LLMs) increasingly deliver real-time interactions, yet existing 5G multi-access edge computing (MEC) architectures often treat communication and computing as separate domains, limiting their ability to meet stringent latency requirements. To address this challenge, we introduce an Integrated Communication and Computing (ICC) framewor… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

  12. arXiv:2504.15223  [pdf

    cs.LG

    A Deep Learning Framework for Sequence Mining with Bidirectional LSTM and Multi-Scale Attention

    Authors: Tao Yang, Yu Cheng, Yaokun Ren, Yujia Lou, Minggu Wei, Honghui Xin

    Abstract: This paper addresses the challenges of mining latent patterns and modeling contextual dependencies in complex sequence data. A sequence pattern mining algorithm is proposed by integrating Bidirectional Long Short-Term Memory (BiLSTM) with a multi-scale attention mechanism. The BiLSTM captures both forward and backward dependencies in sequences, enhancing the model's ability to perceive global cont… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  13. arXiv:2504.12608  [pdf, other

    cs.SE cs.AI

    Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation

    Authors: Mingwei Liu, Juntao Li, Ying Wang, Xueying Du, Zuoyu Ou, Qiuyuan Chen, Bingxu An, Zhao Wei, Yong Xu, Fangming Zou, Xin Peng, Yiling Lou

    Abstract: Despite recent advances in Large Language Models (LLMs) for code generation, the quality of LLM-generated code still faces significant challenges. One significant issue is code repetition, which refers to the model's tendency to generate structurally redundant code, resulting in inefficiencies and reduced readability. To address this, we conduct the first empirical study to investigate the prevale… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

  14. arXiv:2504.12312  [pdf, other

    cs.CL

    Socrates or Smartypants: Testing Logic Reasoning Capabilities of Large Language Models with Logic Programming-based Test Oracles

    Authors: Zihao Xu, Junchen Ding, Yiling Lou, Kun Zhang, Dong Gong, Yuekang Li

    Abstract: Large Language Models (LLMs) have achieved significant progress in language understanding and reasoning. Evaluating and analyzing their logical reasoning abilities has therefore become essential. However, existing datasets and benchmarks are often limited to overly simplistic, unnatural, or contextually constrained examples. In response to the growing demand, we introduce SmartyPat-Bench, a challe… ▽ More

    Submitted 9 April, 2025; originally announced April 2025.

  15. arXiv:2504.11073  [pdf, other

    cs.RO

    FreeDOM: Online Dynamic Object Removal Framework for Static Map Construction Based on Conservative Free Space Estimation

    Authors: Chen Li, Wanlei Li, Wenhao Liu, Yixiang Shu, Yunjiang Lou

    Abstract: Online map construction is essential for autonomous robots to navigate in unknown environments. However, the presence of dynamic objects may introduce artifacts into the map, which can significantly degrade the performance of localization and path planning. To tackle this problem, a novel online dynamic object removal framework for static map construction based on conservative free space estimatio… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  16. arXiv:2504.05758  [pdf

    cs.LG

    Addressing Class Imbalance with Probabilistic Graphical Models and Variational Inference

    Authors: Yujia Lou, Jie Liu, Yuan Sheng, Jiawei Wang, Yiwei Zhang, Yaokun Ren

    Abstract: This study proposes a method for imbalanced data classification based on deep probabilistic graphical models (DPGMs) to solve the problem that traditional methods have insufficient learning ability for minority class samples. To address the classification bias caused by class imbalance, we introduce variational inference optimization probability modeling, which enables the model to adaptively adju… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  17. arXiv:2504.00234  [pdf, other

    cs.GR cs.CV

    CBIL: Collective Behavior Imitation Learning for Fish from Real Videos

    Authors: Yifan Wu, Zhiyang Dou, Yuko Ishiwaka, Shun Ogawa, Yuke Lou, Wenping Wang, Lingjie Liu, Taku Komura

    Abstract: Reproducing realistic collective behaviors presents a captivating yet formidable challenge. Traditional rule-based methods rely on hand-crafted principles, limiting motion diversity and realism in generated collective behaviors. Recent imitation learning methods learn from data but often require ground truth motion trajectories and struggle with authenticity, especially in high-density groups with… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

    Journal ref: ACM Transactions on Graphics (TOG), 2024, Volume 43, Issue 6 Article No.: 242, Pages 1 - 17

  18. arXiv:2503.23046  [pdf, other

    cs.RO cs.LG

    VLM-C4L: Continual Core Dataset Learning with Corner Case Optimization via Vision-Language Models for Autonomous Driving

    Authors: Haibo Hu, Jiacheng Zuo, Yang Lou, Yufei Cui, Jianping Wang, Nan Guan, Jin Wang, Yung-Hui Li, Chun Jason Xue

    Abstract: With the widespread adoption and deployment of autonomous driving, handling complex environments has become an unavoidable challenge. Due to the scarcity and diversity of extreme scenario datasets, current autonomous driving models struggle to effectively manage corner cases. This limitation poses a significant safety risk, according to the National Highway Traffic Safety Administration (NHTSA), a… ▽ More

    Submitted 29 March, 2025; originally announced March 2025.

  19. arXiv:2503.22091  [pdf, other

    cs.DB

    A Graph-native Optimization Framework for Complex Graph Queries

    Authors: Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, Jingren Zhou

    Abstract: This technical report extends the SIGMOD 2025 paper "A Modular Graph-Native Query Optimization Framework" by providing a comprehensive exposition of GOpt's advanced technical mechanisms, implementation strategies, and extended evaluations. While the original paper introduced GOpt's unified intermediate representation (GIR) and demonstrated its performance benefits, this report delves into the fram… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  20. arXiv:2503.22050  [pdf

    cs.CV

    A Deep Learning Framework for Boundary-Aware Semantic Segmentation

    Authors: Tai An, Weiqiang Huang, Da Xu, Qingyuan He, Jiacheng Hu, Yujia Lou

    Abstract: As a fundamental task in computer vision, semantic segmentation is widely applied in fields such as autonomous driving, remote sensing image analysis, and medical image processing. In recent years, Transformer-based segmentation methods have demonstrated strong performance in global feature modeling. However, they still struggle with blurred target boundaries and insufficient recognition of small… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  21. arXiv:2503.20118  [pdf, other

    cs.GR cs.AI cs.CV

    Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors

    Authors: Yuke Lou, Yiming Wang, Zhen Wu, Rui Zhao, Wenjia Wang, Mingyi Shi, Taku Komura

    Abstract: Human-object interaction (HOI) synthesis is important for various applications, ranging from virtual reality to robotics. However, acquiring 3D HOI data is challenging due to its complexity and high cost, limiting existing methods to the narrow diversity of object types and interaction patterns in training datasets. This paper proposes a novel zero-shot HOI synthesis framework without relying on e… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

  22. arXiv:2503.11125  [pdf

    cs.LG

    Context-Aware Rule Mining Using a Dynamic Transformer-Based Framework

    Authors: Jie Liu, Yiwei Zhang, Yuan Sheng, Yujia Lou, Haige Wang, Bohuan Yang

    Abstract: This study proposes a dynamic rule data mining algorithm based on an improved Transformer architecture, aiming to improve the accuracy and efficiency of rule mining in a dynamic data environment. With the increase in data volume and complexity, traditional data mining methods are difficult to cope with dynamic data with strong temporal and variable characteristics, so new algorithms are needed to… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  23. arXiv:2503.08097  [pdf, other

    cs.LG

    Evidential Uncertainty Probes for Graph Neural Networks

    Authors: Linlin Yu, Kangshuo Li, Pritom Kumar Saha, Yifei Lou, Feng Chen

    Abstract: Accurate quantification of both aleatoric and epistemic uncertainties is essential when deploying Graph Neural Networks (GNNs) in high-stakes applications such as drug discovery and financial fraud detection, where reliable predictions are critical. Although Evidential Deep Learning (EDL) efficiently quantifies uncertainty using a Dirichlet distribution over predictive probabilities, existing EDL-… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

    Comments: AISTATS 2025

  24. arXiv:2503.07967  [pdf, ps, other

    cs.SE

    Code Digital Twin: Empowering LLMs with Tacit Knowledge for Complex Software Maintenance

    Authors: Xin Peng, Chong Wang, Mingwei Liu, Yiling Lou, Yijian Wu

    Abstract: While large language models (LLMs) have demonstrated promise in software engineering tasks like code completion and generation, their support for the maintenance of complex software systems remains limited. These models often struggle with understanding the tacit knowledge embedded in systems, such as responsibility allocation and collaboration across different modules. To address this gap, we int… ▽ More

    Submitted 3 July, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: A vision paper that will be continuously updated

  25. arXiv:2503.07044  [pdf, other

    cs.CL cs.AI

    DatawiseAgent: A Notebook-Centric LLM Agent Framework for Automated Data Science

    Authors: Ziming You, Yumiao Zhang, Dexuan Xu, Yiwei Lou, Yandong Yan, Wei Wang, Huaming Zhang, Yu Huang

    Abstract: Data Science tasks are multifaceted, dynamic, and often domain-specific. Existing LLM-based approaches largely concentrate on isolated phases, neglecting the interdependent nature of many data science tasks and limiting their capacity for comprehensive end-to-end support. We propose DatawiseAgent, a notebook-centric LLM agent framework that unifies interactions among user, agent and the computatio… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

  26. arXiv:2503.03586  [pdf, other

    cs.CR

    Benchmarking LLMs and LLM-based Agents in Practical Vulnerability Detection for Code Repositories

    Authors: Alperen Yildiz, Sin G. Teo, Yiling Lou, Yebo Feng, Chong Wang, Dinil M. Divakaran

    Abstract: Large Language Models (LLMs) have shown promise in software vulnerability detection, particularly on function-level benchmarks like Devign and BigVul. However, real-world detection requires interprocedural analysis, as vulnerabilities often emerge through multi-hop function calls rather than isolated functions. While repository-level benchmarks like ReposVul and VulEval introduce interprocedural c… ▽ More

    Submitted 18 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

  27. arXiv:2503.02649  [pdf, other

    cs.RO eess.SY

    Learning-Based Passive Fault-Tolerant Control of a Quadrotor with Rotor Failure

    Authors: Jiehao Chen, Kaidong Zhao, Zihan Liu, YanJie Li, Yunjiang Lou

    Abstract: This paper proposes a learning-based passive fault-tolerant control (PFTC) method for quadrotor capable of handling arbitrary single-rotor failures, including conditions ranging from fault-free to complete rotor failure, without requiring any rotor fault information or controller switching. Unlike existing methods that treat rotor faults as disturbances and rely on a single controller for multiple… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  28. arXiv:2503.00618  [pdf, other

    cs.SE cs.HC

    Show Me Why It's Correct: Saving 1/3 of Debugging Time in Program Repair with Interactive Runtime Comparison

    Authors: Ruixin Wang, Zhongkai Zhao, Le Fang, Nan Jiang, Yiling Lou, Lin Tan, Tianyi Zhang

    Abstract: Automated Program Repair (APR) holds the promise of alleviating the burden of debugging and fixing software bugs. Despite this, developers still need to manually inspect each patch to confirm its correctness, which is tedious and time-consuming. This challenge is exacerbated in the presence of plausible patches, which accidentally pass test cases but may not correctly fix the bug. To address this… ▽ More

    Submitted 1 March, 2025; originally announced March 2025.

    Comments: 27 pages, 8 figures, OOPSLA 2025

    Journal ref: Proc. ACM Program. Lang. 9, OOPSLA1, Article 145 (April 2025)

  29. arXiv:2502.16868  [pdf, other

    cs.DB cs.AI cs.HC

    Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data

    Authors: Longbin Lai, Changwei Luo, Yunkai Lou, Mingchen Ju, Zhengyi Yang

    Abstract: Large Language Models (LLMs) have recently demonstrated remarkable performance in tasks such as Retrieval-Augmented Generation (RAG) and autonomous AI agent workflows. Yet, when faced with large sets of unstructured documents requiring progressive exploration, analysis, and synthesis, such as conducting literature survey, existing approaches often fall short. We address this challenge -- termed Pr… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: 4 pages

  30. arXiv:2501.02216  [pdf, other

    cs.SE

    Automatically Learning a Precise Measurement for Fault Diagnosis Capability of Test Cases

    Authors: Yifan Zhao, Zeyu Sun, Guoqing Wang, Qingyuan Liang, Yakun Zhang, Yiling Lou, Dan Hao, Lu Zhang

    Abstract: Prevalent Fault Localization (FL) techniques rely on tests to localize buggy program elements. Tests could be treated as fuel to further boost FL by providing more debugging information. Therefore, it is highly valuable to measure the Fault Diagnosis Capability (FDC) of a test for diagnosing faults, so as to select or generate tests to better help FL. To this end, researchers have proposed many FD… ▽ More

    Submitted 4 January, 2025; originally announced January 2025.

    Comments: This paper has been accepted by TOSEM

  31. arXiv:2412.04468  [pdf, other

    cs.CV

    NVILA: Efficient Frontier Visual Language Models

    Authors: Zhijian Liu, Ligeng Zhu, Baifeng Shi, Zhuoyang Zhang, Yuming Lou, Shang Yang, Haocheng Xi, Shiyi Cao, Yuxian Gu, Dacheng Li, Xiuyu Li, Yunhao Fang, Yukang Chen, Cheng-Yu Hsieh, De-An Huang, An-Chieh Cheng, Vishwesh Nath, Jinyi Hu, Sifei Liu, Ranjay Krishna, Daguang Xu, Xiaolong Wang, Pavlo Molchanov, Jan Kautz, Hongxu Yin , et al. (2 additional authors not shown)

    Abstract: Visual language models (VLMs) have made significant advances in accuracy in recent years. However, their efficiency has received much less attention. This paper introduces NVILA, a family of open VLMs designed to optimize both efficiency and accuracy. Building on top of VILA, we improve its model architecture by first scaling up the spatial and temporal resolutions, and then compressing visual tok… ▽ More

    Submitted 5 March, 2025; v1 submitted 5 December, 2024; originally announced December 2024.

  32. arXiv:2412.03844  [pdf, other

    cs.CV cs.AI

    HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

    Authors: Jingyu Lin, Jiaqi Gu, Lubin Fan, Bojian Wu, Yujing Lou, Renjie Chen, Ligang Liu, Jieping Ye

    Abstract: Generating high-quality novel view renderings of 3D Gaussian Splatting (3DGS) in scenes featuring transient objects is challenging. We propose a novel hybrid representation, termed as HybridGS, using 2D Gaussians for transient objects per image and maintaining traditional 3D Gaussians for the whole static scenes. Note that, the 3DGS itself is better suited for modeling static scenes that assume mu… ▽ More

    Submitted 28 February, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: Accpeted by CVPR 2025. Project page: https://gujiaqivadin.github.io/hybridgs/ Code: https://github.com/Yeyuqqwx/HybridGS Data: https://huggingface.co/Eto63277/HybridGS/tree/main

  33. arXiv:2411.19921  [pdf, other

    cs.CV cs.AI cs.CL cs.GR

    SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation

    Authors: Wenjia Wang, Liang Pan, Zhiyang Dou, Jidong Mei, Zhouyingcheng Liao, Yuke Lou, Yifan Wu, Lei Yang, Jingbo Wang, Taku Komura

    Abstract: Simulating stylized human-scene interactions (HSI) in physical environments is a challenging yet fascinating task. Prior works emphasize long-term execution but fall short in achieving both diverse style and physical plausibility. To tackle this challenge, we introduce a novel hierarchical framework named SIMS that seamlessly bridges highlevel script-driven intent with a low-level control policy,… ▽ More

    Submitted 16 March, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

  34. arXiv:2411.00608  [pdf, other

    cs.CV

    HopTrack: A Real-time Multi-Object Tracking System for Embedded Devices

    Authors: Xiang Li, Cheng Chen, Yuan-yao Lou, Mustafa Abdallah, Kwang Taik Kim, Saurabh Bagchi

    Abstract: Multi-Object Tracking (MOT) poses significant challenges in computer vision. Despite its wide application in robotics, autonomous driving, and smart manufacturing, there is limited literature addressing the specific challenges of running MOT on embedded devices. State-of-the-art MOT trackers designed for high-end GPUs often experience low processing rates (<11fps) when deployed on embedded devices… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  35. arXiv:2411.00585  [pdf, other

    cs.CY cs.AI

    Benchmarking Bias in Large Language Models during Role-Playing

    Authors: Xinyue Li, Zhenpeng Chen, Jie M. Zhang, Yiling Lou, Tianlin Li, Weisong Sun, Yang Liu, Xuanzhe Liu

    Abstract: Large Language Models (LLMs) have become foundational in modern language-driven applications, profoundly influencing daily life. A critical technique in leveraging their potential is role-playing, where LLMs simulate diverse roles to enhance their real-world utility. However, while research has highlighted the presence of social biases in LLM outputs, it remains unclear whether and to what extent… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

  36. arXiv:2410.16919  [pdf, other

    cs.RO cs.AI cs.CL cs.LG

    EnvBridge: Bridging Diverse Environments with Cross-Environment Knowledge Transfer for Embodied AI

    Authors: Tomoyuki Kagaya, Yuxuan Lou, Thong Jing Yuan, Subramanian Lakshmi, Jayashree Karlekar, Sugiri Pranata, Natsuki Murakami, Akira Kinose, Koki Oguri, Felix Wick, Yang You

    Abstract: In recent years, Large Language Models (LLMs) have demonstrated high reasoning capabilities, drawing attention for their applications as agents in various decision-making processes. One notably promising application of LLM agents is robotic manipulation. Recent research has shown that LLMs can generate text planning or control code for robots, providing substantial flexibility and interaction capa… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  37. arXiv:2410.02841  [pdf, other

    cs.CR cs.SE

    Demonstration Attack against In-Context Learning for Code Intelligence

    Authors: Yifei Ge, Weisong Sun, Yihang Lou, Chunrong Fang, Yiran Zhang, Yiming Li, Xiaofang Zhang, Yang Liu, Zhihong Zhao, Zhenyu Chen

    Abstract: Recent advancements in large language models (LLMs) have revolutionized code intelligence by improving programming productivity and alleviating challenges faced by software developers. To further improve the performance of LLMs on specific code intelligence tasks and reduce training costs, researchers reveal a new capability of LLMs: in-context learning (ICL). ICL allows LLMs to learn from a few d… ▽ More

    Submitted 3 October, 2024; originally announced October 2024.

    Comments: 17 pages, 5 figures

  38. arXiv:2410.00695  [pdf, other

    cs.DC cs.RO

    E-MPC: Edge-assisted Model Predictive Control

    Authors: Yuan-Yao Lou, Jonathan Spencer, Kwang Taik Kim, Mung Chiang

    Abstract: Model predictive control (MPC) has become the de facto standard action space for local planning and learning-based control in many continuous robotic control tasks, including autonomous driving. MPC solves a long-horizon cost optimization as a series of short-horizon optimizations based on a global planner-supplied reference path. The primary challenge in MPC, however, is that the computational bu… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

  39. arXiv:2409.19894  [pdf, other

    cs.SE cs.AI

    TRANSAGENT: An LLM-Based Multi-Agent System for Code Translation

    Authors: Zhiqiang Yuan, Weitong Chen, Hanlin Wang, Kai Yu, Xin Peng, Yiling Lou

    Abstract: Code translation converts code from one programming language to another while maintaining its original functionality, which is crucial for software migration, system refactoring, and cross-platform development. Traditional rule-based methods rely on manually-written rules, which can be time-consuming and often result in less readable code. To overcome this, learning-based methods have been develop… ▽ More

    Submitted 1 October, 2024; v1 submitted 29 September, 2024; originally announced September 2024.

  40. arXiv:2409.10839  [pdf, other

    cs.NI cs.DC

    Dynamic DAG-Application Scheduling for Multi-Tier Edge Computing in Heterogeneous Networks

    Authors: Xiang Li, Mustafa Abdallah, Yuan-Yao Lou, Mung Chiang, Kwang Taik Kim, Saurabh Bagchi

    Abstract: Edge computing is deemed a promising technique to execute latency-sensitive applications by offloading computation-intensive tasks to edge servers. Extensive research has been conducted in the field of end-device to edge server task offloading for several goals, including latency minimization, energy optimization, and resource optimization. However, few of them consider our mobile computing device… ▽ More

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 12 pages

  41. arXiv:2409.02977  [pdf, other

    cs.SE cs.AI

    Large Language Model-Based Agents for Software Engineering: A Survey

    Authors: Junwei Liu, Kaixin Wang, Yixuan Chen, Xin Peng, Zhenpeng Chen, Lingming Zhang, Yiling Lou

    Abstract: The recent advance in Large Language Models (LLMs) has shaped a new paradigm of AI agents, i.e., LLM-based agents. Compared to standalone LLMs, LLM-based agents substantially extend the versatility and expertise of LLMs by enhancing LLMs with the capabilities of perceiving and utilizing external resources and tools. To date, LLM-based agents have been applied and shown remarkable effectiveness in… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

  42. arXiv:2408.13480  [pdf, other

    cs.DB

    Towards a Converged Relational-Graph Optimization Framework

    Authors: Yunkai Lou, Longbin Lai, Bingqing Lyu, Yufan Yang, Xiaoli Zhou, Wenyuan Yu, Ying Zhang, Jingren Zhou

    Abstract: The recent ISO SQL:2023 standard adopts SQL/PGQ (Property Graph Queries), facilitating graph-like querying within relational databases. This advancement, however, underscores a significant gap in how to effectively optimize SQL/PGQ queries within relational database systems. To address this gap, we extend the foundational SPJ (Select-Project-Join) queries to SPJM queries, which include an addition… ▽ More

    Submitted 8 December, 2024; v1 submitted 24 August, 2024; originally announced August 2024.

  43. arXiv:2407.08555  [pdf, other

    eess.IV cs.CV

    SLoRD: Structural Low-Rank Descriptors for Shape Consistency in Vertebrae Segmentation

    Authors: Xin You, Yixin Lou, Minghui Zhang, Jie Yang, Yun Gu

    Abstract: Automatic and precise multi-class vertebrae segmentation from CT images is crucial for various clinical applications. However, due to similar appearances between adjacent vertebrae and the existence of various pathologies, existing single-stage and multi-stage methods suffer from imprecise vertebrae segmentation. Essentially, these methods fail to explicitly impose both contour precision and intra… ▽ More

    Submitted 28 May, 2025; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: JBHI accepted

  44. arXiv:2407.02095  [pdf, other

    cs.SE

    TIGER: A Generating-Then-Ranking Framework for Practical Python Type Inference

    Authors: Chong Wang, Jian Zhang, Yiling Lou, Mingwei Liu, Weisong Sun, Yang Liu, Xin Peng

    Abstract: Python's dynamic typing system offers flexibility and expressiveness but can lead to type-related errors, prompting the need for automated type inference to enhance type hinting. While existing learning-based approaches show promising inference accuracy, they struggle with practical challenges in comprehensively handling various types, including complex generic types and (unseen) user-defined type… ▽ More

    Submitted 13 August, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: Accepted by ICSE'25

  45. arXiv:2406.15806  [pdf, other

    cs.RO

    Robust Dynamic Control Barrier Function Based Trajectory Planning for Mobile Manipulator

    Authors: Lihao Xu, Xiaogang Xiong, Bai Yang, Yunjiang Lou

    Abstract: High-dimensional robot dynamic trajectory planning poses many challenges for traditional planning algorithms. Existing planning methods suffer from issues such as long computation times, limited capacity to address intricate obstacle models, and lack of consideration for external disturbances and measurement inaccuracies in these high-dimensional systems. To tackle these challenges, this paper pro… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  46. arXiv:2406.11707  [pdf, other

    cs.CR cs.CV cs.LG

    A First Physical-World Trajectory Prediction Attack via LiDAR-induced Deceptions in Autonomous Driving

    Authors: Yang Lou, Yi Zhu, Qun Song, Rui Tan, Chunming Qiao, Wei-Bin Lee, Jianping Wang

    Abstract: Trajectory prediction forecasts nearby agents' moves based on their historical trajectories. Accurate trajectory prediction is crucial for autonomous vehicles. Existing attacks compromise the prediction model of a victim AV by directly manipulating the historical trajectory of an attacker AV, which has limited real-world applicability. This paper, for the first time, explores an indirect attack ap… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: In Proceedings of the 33rd USENIX Security Symposium 2024

  47. arXiv:2406.11147  [pdf, ps, other

    cs.SE cs.AI

    Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG

    Authors: Xueying Du, Geng Zheng, Kaixin Wang, Yi Zou, Yujia Wang, Wentai Deng, Jiayi Feng, Mingwei Liu, Bihuan Chen, Xin Peng, Tao Ma, Yiling Lou

    Abstract: Although LLMs have shown promising potential in vulnerability detection, this study reveals their limitations in distinguishing between vulnerable and similar-but-benign patched code (only 0.06 - 0.14 accuracy). It shows that LLMs struggle to capture the root causes of vulnerabilities during vulnerability detection. To address this challenge, we propose enhancing LLMs with multi-dimensional vulner… ▽ More

    Submitted 17 June, 2025; v1 submitted 16 June, 2024; originally announced June 2024.

  48. arXiv:2406.10018  [pdf, other

    cs.SE

    STALL+: Boosting LLM-based Repository-level Code Completion with Static Analysis

    Authors: Junwei Liu, Yixuan Chen, Mingwei Liu, Xin Peng, Yiling Lou

    Abstract: Repository-level code completion is challenging as it involves complicated contexts from multiple files in the repository. To date, researchers have proposed two technical categories to enhance LLM-based repository-level code completion, i.e., retrieval-augmented generation (RAG) and static analysis integration. This work performs the first study on the static analysis integration in LLM-based rep… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 12 pages, 5 figures

  49. arXiv:2406.03803  [pdf, ps, other

    cs.IT

    Determining the Weight Spectrum of the Reed--Muller Codes RM(m-6,m)

    Authors: Yueying Lou, Qichun Wang

    Abstract: The weight spectra of the Reed-Muller codes $RM(r,m)$ were unknown for $r=3,...,m-5$. In IEEE Trans. Inform. Theory 2024, Carlet determined the weight spectrum of $RM(m-5,m)$ for $m\ge10$ using the Maiorana-McFarland construction, where the result was tried to be extended to $RM(m-6,m)$, but many problems occurred and much work needed to be done. In this paper, we propose a novel way of constructi… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  50. arXiv:2404.14294  [pdf, other

    cs.CL cs.AI

    A Survey on Efficient Inference for Large Language Models

    Authors: Zixuan Zhou, Xuefei Ning, Ke Hong, Tianyu Fu, Jiaming Xu, Shiyao Li, Yuming Lou, Luning Wang, Zhihang Yuan, Xiuhong Li, Shengen Yan, Guohao Dai, Xiao-Ping Zhang, Yuhan Dong, Yu Wang

    Abstract: Large Language Models (LLMs) have attracted extensive attention due to their remarkable performance across various tasks. However, the substantial computational and memory requirements of LLM inference pose challenges for deployment in resource-constrained scenarios. Efforts within the field have been directed towards developing techniques aimed at enhancing the efficiency of LLM inference. This p… ▽ More

    Submitted 19 July, 2024; v1 submitted 22 April, 2024; originally announced April 2024.