Showing 1–50 of 451 results for author: Hu, L

Searching in archive cs.
  1. arXiv:2507.05972  [pdf, ps, other]

    cs.CC cs.LG

    Generalized and Unified Equivalences between Hardness and Pseudoentropy

    Authors: Lunjia Hu, Salil Vadhan

    Abstract: Pseudoentropy characterizations provide a quantitatively precise demonstration of the close relationship between computational hardness and computational randomness. We prove a unified pseudoentropy characterization that generalizes and strengthens previous results for both uniform and non-uniform models of computation. Our characterization holds for a general family of entropy notions that encomp…

    Submitted 8 July, 2025; originally announced July 2025.

  2. arXiv:2507.05216  [pdf, ps, other]

    cs.LG cs.CY stat.AP stat.ML

    Bridging Prediction and Intervention Problems in Social Systems

    Authors: Lydia T. Liu, Inioluwa Deborah Raji, Angela Zhou, Luke Guerdan, Jessica Hullman, Daniel Malinsky, Bryan Wilder, Simone Zhang, Hammaad Adam, Amanda Coston, Ben Laufer, Ezinne Nwankwo, Michael Zanger-Tishler, Eli Ben-Michael, Solon Barocas, Avi Feller, Marissa Gerchick, Talia Gillis, Shion Guha, Daniel Ho, Lily Hu, Kosuke Imai, Sayash Kapoor, Joshua Loftus, Razieh Nabi, et al. (10 additional authors not shown)

    Abstract: Many automated decision systems (ADS) are designed to solve prediction problems -- where the goal is to learn patterns from a sample of the population and apply them to individuals from the same population. In reality, these prediction systems operationalize holistic policy interventions in deployment. Once deployed, ADS can shape impacted population outcomes through an effective policy change in…

    Submitted 7 July, 2025; originally announced July 2025.

  3. arXiv:2507.04059  [pdf, ps, other]

    cs.LG cs.AI cs.CV stat.ML

    Attributing Data for Sharpness-Aware Minimization

    Authors: Chenyang Ren, Yifan Jia, Huanyi Xie, Zhaobin Xu, Tianxing Wei, Liangyu Wang, Lijie Hu, Di Wang

    Abstract: Sharpness-aware Minimization (SAM) improves generalization in large-scale model training by linking loss landscape geometry to generalization. However, challenges such as mislabeled noisy data and privacy concerns have emerged as significant issues. Data attribution, which identifies the contributions of specific training samples, offers a promising solution. However, directly rendering existing d…

    Submitted 5 July, 2025; originally announced July 2025.

    Comments: 25 pages

  4. arXiv:2507.01299  [pdf, ps, other]

    cs.CL

    La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation

    Authors: Kai Liu, Bowen Xu, Shaoyu Wu, Xin Chen, Hao Zhou, Yongliang Tao, Lulu Hu

    Abstract: Activation sparsity can reduce the computational overhead and memory transfers during the forward pass of Large Language Model (LLM) inference. Existing methods face limitations, either demanding time-consuming recovery training that hinders real-world adoption, or relying on empirical magnitude-based pruning, which causes fluctuating sparsity and unstable inference speed-up. This paper introduces…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: ICML 2025 Acceptance

  5. arXiv:2506.19496  [pdf, ps, other]

    cs.LG

    COLUR: Confidence-Oriented Learning, Unlearning and Relearning with Noisy-Label Data for Model Restoration and Refinement

    Authors: Zhihao Sui, Liang Hu, Jian Cao, Usman Naseem, Zhongyuan Lai, Qi Zhang

    Abstract: Large deep learning models have achieved significant success in various tasks. However, the performance of a model can degrade significantly if it must be trained on datasets whose noisy labels carry misleading or ambiguous information. To date, few studies have investigated how to restore performance once noisy-label data has degraded a model. Inspired by the "forgetting…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: IJCAI 2025

  6. arXiv:2506.19486  [pdf, ps, other]

    cs.LG cs.AI cs.CR

    Recalling The Forgotten Class Memberships: Unlearned Models Can Be Noisy Labelers to Leak Privacy

    Authors: Zhihao Sui, Liang Hu, Jian Cao, Dora D. Liu, Usman Naseem, Zhongyuan Lai, Qi Zhang

    Abstract: Machine Unlearning (MU) technology facilitates the removal of the influence of specific data instances from trained models on request. Despite rapid advancements in MU technology, its vulnerabilities are still underexplored, posing potential risks of privacy breaches through leaks of ostensibly unlearned information. Current limited research on MU attacks requires access to original models contain…

    Submitted 24 June, 2025; originally announced June 2025.

    Comments: IJCAI 2025

  7. arXiv:2506.18717  [pdf]

    cs.CE cs.AI

    A Study of Dynamic Stock Relationship Modeling and S&P500 Price Forecasting Based on Differential Graph Transformer

    Authors: Linyue Hu, Qi Wang

    Abstract: Stock price prediction is vital for investment decisions and risk management, yet remains challenging due to markets' nonlinear dynamics and time-varying inter-stock correlations. Traditional static-correlation models fail to capture evolving stock relationships. To address this, we propose a Differential Graph Transformer (DGT) framework for dynamic relationship modeling and price prediction. Our…

    Submitted 23 June, 2025; originally announced June 2025.

  8. arXiv:2506.17880  [pdf, ps, other]

    cs.LG stat.ME stat.ML

    Choice of Scoring Rules for Indirect Elicitation of Properties with Parametric Assumptions

    Authors: Lingfang Hu, Ian A. Kash

    Abstract: People are commonly interested in predicting a statistical property of a random event, such as its mean or variance. Proper scoring rules assess the quality of predictions and require that the expected score is uniquely maximized at the precise prediction, in which case we say the score directly elicits the property. Previous research has widely studied the existence and the characterization o…

    Submitted 21 June, 2025; originally announced June 2025.

    Comments: Key words: proper scoring rules, property elicitation, parametric model estimation. Paper length: 20 pages of main text + 2 pages of references + 21 pages of appendices

  9. arXiv:2506.17869  [pdf, ps, other]

    cs.CV cs.RO

    Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation

    Authors: Xiaodong Guo, Zi'ang Lin, Luwen Hu, Zhihong Deng, Tong Liu, Wujie Zhou

    Abstract: The integration of RGB and thermal data can significantly improve semantic segmentation performance in wild environments for field robots. Nevertheless, multi-source data processing (e.g. Transformer-based approaches) imposes significant computational overhead, presenting challenges for resource-constrained systems. To resolve this critical limitation, we introduced CM-SSM, an efficient RGB-therma…

    Submitted 21 June, 2025; originally announced June 2025.

  10. arXiv:2506.16704  [pdf, ps, other]

    cs.LG stat.ML

    How Many Domains Suffice for Domain Generalization? A Tight Characterization via the Domain Shattering Dimension

    Authors: Cynthia Dwork, Lunjia Hu, Han Shao

    Abstract: We study a fundamental question of domain generalization: given a family of domains (i.e., data distributions), how many randomly sampled domains do we need to collect data from in order to learn a model that performs reasonably well on every seen and unseen domain in the family? We model this problem in the PAC framework and introduce a new combinatorial measure, which we call the domain shatteri…

    Submitted 19 June, 2025; originally announced June 2025.

  11. arXiv:2506.16012  [pdf, ps, other]

    cs.RO

    DualTHOR: A Dual-Arm Humanoid Simulation Platform for Contingency-Aware Planning

    Authors: Boyu Li, Siyuan He, Hang Xu, Haoqi Yuan, Yu Zang, Liwei Hu, Junpeng Yue, Zhenxiong Jiang, Pengbo Hu, Börje F. Karlsson, Yehui Tang, Zongqing Lu

    Abstract: Developing embodied agents capable of performing complex interactive tasks in real-world scenarios remains a fundamental challenge in embodied AI. Although recent advances in simulation platforms have greatly enhanced task diversity to train embodied Vision Language Models (VLMs), most platforms rely on simplified robot morphologies and bypass the stochastic nature of low-level execution, which li…

    Submitted 19 June, 2025; originally announced June 2025.

  12. arXiv:2506.15808  [pdf, ps, other]

    cs.IT eess.SP

    Hybrid Near-Far Field 6D Movable Antenna Design Exploiting Directional Sparsity and Deep Learning

    Authors: Xiaodan Shao, Limei Hu, Yulong Sun, Xing Li, Yixiao Zhang, Jingze Ding, Xiaoming Shi, Feng Chen, Derrick Wing Kwan Ng, Robert Schober

    Abstract: The six-dimensional movable antenna (6DMA) has been identified as a new disruptive technology for future wireless systems to support a large number of users with only a few antennas. However, the intricate relationships between the signal carrier wavelength and the transceiver region size lead to inaccuracies in the traditional far-field 6DMA channel model, causing discrepancies between the model predicti…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 13 pages

  13. arXiv:2506.15617  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    The Compositional Architecture of Regret in Large Language Models

    Authors: Xiangxiang Cui, Shu Yang, Tianjin Huang, Wanyu Lin, Lijie Hu, Di Wang

    Abstract: Regret in Large Language Models refers to their explicit regret expression when presented with evidence contradicting their previously generated misinformation. Studying the regret mechanism is crucial for enhancing model reliability and helps in revealing how cognition is coded in neural networks. To understand this mechanism, we need to first identify regret expressions in model outputs, then an…

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: 23 pages

  14. arXiv:2506.14382  [pdf, ps, other]

    cs.CV cs.AI

    DepthSeg: Depth prompting in remote sensing semantic segmentation

    Authors: Ning Zhou, Shanxiong Chen, Mingting Zhou, Haigang Sui, Lieyun Hu, Han Li, Li Hua, Qiming Zhou

    Abstract: Remote sensing semantic segmentation is crucial for extracting detailed land surface information, enabling applications such as environmental monitoring, land use planning, and resource assessment. In recent years, advancements in artificial intelligence have spurred the development of automatic remote sensing semantic segmentation methods. However, the existing semantic segmentation methods focus…

    Submitted 17 June, 2025; originally announced June 2025.

  15. arXiv:2506.13695  [pdf, ps, other]

    cs.IR

    OneRec Technical Report

    Authors: Guorui Zhou, Jiaxin Deng, Jinghao Zhang, Kuo Cai, Lejian Ren, Qiang Luo, Qianqian Wang, Qigen Hu, Rui Huang, Shiyao Wang, Weifeng Ding, Wuchao Li, Xinchen Luo, Xingmei Wang, Zexuan Cheng, Zixing Zhang, Bin Zhang, Boxuan Wang, Chaoyi Ma, Chengru Song, Chenhui Wang, Di Wang, Dongxue Meng, Fan Yang, Fangyu Zhang, et al. (40 additional authors not shown)

    Abstract: Recommender systems have been widely used in various large-scale user-oriented platforms for many years. However, compared to the rapid developments in the AI community, recommendation systems have not achieved a breakthrough in recent years. For instance, they still rely on a multi-stage cascaded architecture rather than an end-to-end approach, leading to computational fragmentation and optimizat…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: Authors are listed alphabetically by their first name

  16. arXiv:2506.13224  [pdf, ps, other]

    cs.CV

    SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds

    Authors: Jinfeng Xu, Xianzhi Li, Yuan Tang, Xu Han, Qiao Yu, Yixue Hao, Long Hu, Min Chen

    Abstract: Recent advancements in deep learning have greatly enhanced 3D object recognition, but most models are limited to closed-set scenarios, unable to handle unknown samples in real-world applications. Open-set recognition (OSR) addresses this limitation by enabling models to both classify known classes and identify novel classes. However, current OSR methods rely on global features to differentiate kno…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 10 pages, conference

  17. arXiv:2506.08396  [pdf, ps, other]

    cs.PL

    Linguine: A Natural-Language Programming Language with Formal Semantics and a Clean Compiler Pipeline

    Authors: Lifan Hu

    Abstract: Linguine is a natural-language-inspired programming language that enables users to write programs in a fluent, controlled subset of English while preserving formal semantics. The language introduces anaphoric constructs, such as pronoun variables (e.g., "it", "them"), that are statically resolved through referent-tracking analysis combined with a Hindley-Milner-style type system. Each pronoun is g…

    Submitted 9 June, 2025; originally announced June 2025.

  18. arXiv:2506.07184  [pdf, ps, other]

    cs.AI cs.CL cs.CV

    Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images

    Authors: Liangliang You, Junchi Yao, Shu Yang, Guimin Hu, Lijie Hu, Di Wang

    Abstract: While multimodal large language models excel at various tasks, they still suffer from hallucinations, which limit their reliability and scalability for broader domain applications. To address this issue, recent research mainly focuses on objective hallucination. However, for sequential images, besides objective hallucination, there is also behavioral hallucination, which is less studied. This work…

    Submitted 8 June, 2025; originally announced June 2025.

  19. arXiv:2506.07180  [pdf, ps, other]

    cs.CL cs.AI cs.CV

    Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs

    Authors: Wenrui Zhou, Shu Yang, Qingsong Yang, Zikun Guo, Lijie Hu, Di Wang

    Abstract: As video large language models (Video-LLMs) become increasingly integrated into real-world applications that demand grounded multimodal reasoning, ensuring their factual consistency and reliability is of critical importance. However, sycophancy, the tendency of these models to align with user input even when it contradicts the visual evidence, undermines their trustworthiness in such contexts. Cur…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 24 pages

  20. arXiv:2506.07168  [pdf, ps, other]

    cs.LG cs.AI cs.CL

    Efficient Text-Attributed Graph Learning through Selective Annotation and Graph Alignment

    Authors: Huanyi Xie, Lijie Hu, Lu Yu, Tianhao Huang, Longfei Li, Meng Li, Jun Zhou, Huan Wang, Di Wang

    Abstract: In the realm of Text-attributed Graphs (TAGs), traditional graph neural networks (GNNs) often fall short due to the complex textual information associated with each node. Recent methods have improved node representations by leveraging large language models (LLMs) to enhance node text features, but these approaches typically require extensive annotations or fine-tuning across all nodes, which is bo…

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: 23 pages

  21. arXiv:2506.05877  [pdf, ps, other]

    cs.LG

    Interpretable Clustering Ensemble

    Authors: Hang Lv, Lianyu Hu, Mudi Jiang, Xinying Liu, Zengyou He

    Abstract: Clustering ensemble has emerged as an important research topic in the field of machine learning. Although numerous methods have been proposed to improve clustering quality, most existing approaches overlook the need for interpretability in high-stakes applications. In domains such as medical diagnosis and financial risk assessment, algorithms must not only be accurate but also interpretable to ens…

    Submitted 6 June, 2025; originally announced June 2025.

  22. arXiv:2506.05767  [pdf, ps, other]

    cs.CL cs.AI

    dots.llm1 Technical Report

    Authors: Bi Huo, Bin Tu, Cheng Qin, Da Zheng, Debing Zhang, Dongjie Zhang, En Li, Fu Guo, Jian Yao, Jie Lou, Junfeng Tian, Li Hu, Ran Zhu, Shengdong Chen, Shuo Liu, Su Guang, Te Wo, Weijun Zhang, Xiaoming Shi, Xinxin Peng, Xing Wu, Yawen Liu, Yuqiu Ji, Ze Wen, Zhenhai Liu, et al. (2 additional authors not shown)

    Abstract: Mixture of Experts (MoE) models have emerged as a promising paradigm for scaling language models efficiently by activating only a subset of parameters for each input token. In this report, we present dots.llm1, a large-scale MoE model that activates 14B parameters out of a total of 142B parameters, delivering performance on par with state-of-the-art models while reducing training and inference cos…

    Submitted 6 June, 2025; originally announced June 2025.

  23. arXiv:2506.05286  [pdf, ps, other]

    cs.CV cs.LG

    Stable Vision Concept Transformers for Medical Diagnosis

    Authors: Lijie Hu, Songning Lai, Yuan Hua, Shu Yang, Jingfeng Zhang, Di Wang

    Abstract: Transparency is a paramount concern in the medical field, prompting researchers to delve into the realm of explainable AI (XAI). Among these XAI methods, Concept Bottleneck Models (CBMs) aim to restrict the model's latent space to human-understandable high-level concepts by generating a conceptual layer for extracting conceptual features, which has drawn much attention recently. However, existing…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2304.06129 by other authors

  24. arXiv:2506.04768  [pdf, ps, other]

    cs.NI

    Grey Rhino Warning: IPv6 is Becoming Fertile Ground for Reflection Amplification Attacks

    Authors: Ling Hu, Tao Yang, Yu Pang, Bingnan Hou, Zhiping Cai, Bo Yu

    Abstract: Distributed Denial-of-Service (DDoS) attacks represent a cost-effective and potent threat to network stability. While extensively studied in IPv4 networks, DDoS implications in IPv6 remain underexplored. The vast IPv6 address space renders brute-force scanning and amplifier testing for all active addresses impractical. Innovatively, this work investigates AS-level vulnerabilities to reflection amp…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: This paper has been accepted by IWQoS 2025 as a short paper

  25. arXiv:2506.00894  [pdf, ps, other]

    cs.SE cs.AI cs.CL cs.LG

    CODEMENV: Benchmarking Large Language Models on Code Migration

    Authors: Keyuan Cheng, Xudong Shen, Yihao Yang, Tengyue Wang, Yang Cao, Muhammad Asif Ali, Hanbin Wang, Lijie Hu, Di Wang

    Abstract: Large language models (LLMs) have shown remarkable capabilities across various software engineering tasks; however, their effectiveness in code migration, adapting code to run in different environments, remains insufficiently studied. In this work, we introduce CODEMENV: Code Migration Across Environment, a new benchmark specifically designed to assess LLMs' abilities in code migration scenarios.…

    Submitted 1 June, 2025; originally announced June 2025.

    Comments: Accepted by ACL 2025 Findings

  26. arXiv:2506.00829  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    COMPKE: Complex Question Answering under Knowledge Editing

    Authors: Keyuan Cheng, Zijian Kan, Zhixian He, Zhuoran Zhang, Muhammad Asif Ali, Ke Xu, Lijie Hu, Di Wang

    Abstract: Knowledge Editing, which efficiently modifies the knowledge in large language models, has gathered great attention. Current benchmarks primarily use multi-hop question answering to assess and analyze newly injected or updated knowledge. However, we argue that these benchmarks fail to effectively evaluate how well the updated models apply this knowledge in real-life scenarios, particularly when que…

    Submitted 3 June, 2025; v1 submitted 1 June, 2025; originally announced June 2025.

    Comments: Accepted by ACL 2025 Findings

  27. arXiv:2506.00759  [pdf, ps, other]

    cs.CL

    Understanding and Mitigating Cross-lingual Privacy Leakage via Language-specific and Universal Privacy Neurons

    Authors: Wenshuo Dong, Qingsong Yang, Shu Yang, Lijie Hu, Meng Ding, Wanyu Lin, Tianhang Zheng, Di Wang

    Abstract: Large Language Models (LLMs) trained on massive data capture rich information embedded in the training data. However, this also introduces the risk of privacy leakage, particularly involving personally identifiable information (PII). Although previous studies have shown that this risk can be mitigated through methods such as privacy neurons, they all assume that both the (sensitive) training data…

    Submitted 8 June, 2025; v1 submitted 31 May, 2025; originally announced June 2025.

  28. arXiv:2505.20814  [pdf, other]

    cs.RO cs.CV

    Spatial RoboGrasp: Generalized Robotic Grasping Control Policy

    Authors: Yiqi Huang, Travis Davies, Jiahuan Yan, Jiankai Sun, Xiang Chen, Luhui Hu

    Abstract: Achieving generalizable and precise robotic manipulation across diverse environments remains a critical challenge, largely due to limitations in spatial perception. While prior imitation-learning approaches have made progress, their reliance on raw RGB inputs and handcrafted features often leads to overfitting and poor 3D reasoning under varied lighting, occlusion, and object conditions. In this p…

    Submitted 27 May, 2025; originally announced May 2025.

  29. arXiv:2505.18798  [pdf, ps, other]

    cs.LG stat.ML

    Governing Equation Discovery from Data Based on Differential Invariants

    Authors: Lexiang Hu, Yikang Li, Zhouchen Lin

    Abstract: The explicit governing equation is one of the simplest and most intuitive forms for characterizing physical laws. However, directly discovering partial differential equations (PDEs) from data poses significant challenges, primarily in determining relevant terms from a vast search space. Symmetry, as crucial prior knowledge in scientific fields, has been widely applied in tasks such as designing…

    Submitted 24 May, 2025; originally announced May 2025.

  30. arXiv:2505.17712  [pdf, ps, other]

    cs.CL

    Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

    Authors: Yi Su, Jiayi Zhang, Shu Yang, Xinhai Wang, Lijie Hu, Di Wang

    Abstract: Rapid integration of large language models (LLMs) into societal applications has intensified concerns about their alignment with universal ethical principles, as their internal value representations remain opaque despite behavioral alignment advancements. Current approaches struggle to systematically interpret how values are encoded in neural architectures, limited by datasets that prioritize supe…

    Submitted 23 May, 2025; originally announced May 2025.

  31. arXiv:2505.17169  [pdf, ps, other]

    cs.CL cs.AI

    Next Token Perception Score: Analytical Assessment of your LLM Perception Skills

    Authors: Yu-Ang Cheng, Leyang Hu, Hai Huang, Randall Balestriero

    Abstract: Autoregressive pretraining has become the de facto paradigm for learning general-purpose representations in large language models (LLMs). However, linear probe performance across downstream perception tasks shows substantial variability, suggesting that features optimized for next-token prediction do not consistently transfer well to downstream perception tasks. We demonstrate that representations…

    Submitted 22 May, 2025; originally announced May 2025.

  32. arXiv:2505.15146  [pdf, ps, other]

    cs.AI

    lmgame-Bench: How Good are LLMs at Playing Games?

    Authors: Lanxiang Hu, Mingjia Huo, Yuxuan Zhang, Haoyang Yu, Eric P. Xing, Ion Stoica, Tajana Rosing, Haojian Jin, Hao Zhang

    Abstract: Playing video games requires perception, memory, and planning, exactly the faculties modern large language model (LLM) agents are expected to master. We study the major challenges in using popular video games to evaluate modern LLMs and find that directly dropping LLMs into games cannot make an effective evaluation, for three reasons -- brittle vision perception, prompt sensitivity, and potential…

    Submitted 3 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  33. arXiv:2505.13905  [pdf, ps, other]

    cs.CV cs.RO

    4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision

    Authors: Ruihan Liu, Xiaoyi Wu, Xijun Chen, Liang Hu, Yunjiang Lou

    Abstract: A comprehensive understanding of 3D scenes is essential for autonomous vehicles (AVs), and among various perception tasks, occupancy estimation plays a central role by providing a general representation of drivable and occupied space. However, most existing occupancy estimation methods rely on LiDAR or cameras, which perform poorly in degraded environments such as smoke, rain, snow, and fog. In th…

    Submitted 20 May, 2025; originally announced May 2025.

  34. arXiv:2505.13489  [pdf, ps, other]

    cs.AI cs.CL

    Contrastive Cross-Course Knowledge Tracing via Concept Graph Guided Knowledge Transfer

    Authors: Wenkang Han, Wang Lin, Liya Hu, Zhenlong Dai, Yiyun Zhou, Mengze Li, Zemin Liu, Chang Yao, Jingyuan Chen

    Abstract: Knowledge tracing (KT) aims to predict learners' future performance based on historical learning interactions. However, existing KT models predominantly focus on data from a single course, limiting their ability to capture a comprehensive understanding of learners' knowledge states. In this paper, we propose TransKT, a contrastive cross-course knowledge tracing method that leverages concept graph…

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI 2025

  35. arXiv:2505.13039  [pdf, ps, other]

    cs.CV

    Expert-Like Reparameterization of Heterogeneous Pyramid Receptive Fields in Efficient CNNs for Fair Medical Image Classification

    Authors: Xiao Wu, Xiaoqing Zhang, Zunjie Xiao, Lingxi Hu, Risa Higashita, Jiang Liu

    Abstract: Efficient convolutional neural network (CNN) architecture designs have attracted growing research interest. However, they usually apply a single receptive field (RF), small asymmetric RFs, or pyramid RFs to learn different feature representations, still encountering two significant challenges in medical image classification tasks: 1) They have limitations in capturing diverse lesion characteristics…

    Submitted 19 May, 2025; originally announced May 2025.

  36. arXiv:2505.10940  [pdf, ps, other]

    cs.IR cs.AI

    Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation

    Authors: Qing Yu, Xiaobei Wang, Shuchang Liu, Yandong Bai, Xiaoyu Yang, Xueliang Wang, Chang Meng, Shanshan Wu, Hailan Yang, Huihui Xiao, Xiang Li, Fan Yang, Xiaoqiang Feng, Lantao Hu, Han Li, Kun Gai, Lixin Zou

    Abstract: Recommender systems filter contents/items valuable to users by inferring preferences from user features and historical behaviors. Mainstream approaches follow the learning-to-rank paradigm, which focuses on discovering and modeling item topics (e.g., categories), and capturing user preferences on these topics based on historical interactions. However, this paradigm often neglects the modeling of use…

    Submitted 20 May, 2025; v1 submitted 16 May, 2025; originally announced May 2025.

  37. arXiv:2505.07360  [pdf, ps, other]

    cs.SE

    BinMetric: A Comprehensive Binary Analysis Benchmark for Large Language Models

    Authors: Xiuwei Shang, Guoqiang Chen, Shaoyin Cheng, Benlong Wu, Li Hu, Gangyang Li, Weiming Zhang, Nenghai Yu

    Abstract: Binary analysis remains pivotal in software security, offering insights into compiled programs without source code access. As large language models (LLMs) continue to excel in diverse language understanding and generation tasks, their potential in decoding complex binary data structures becomes evident. However, the lack of standardized benchmarks in this domain limits the assessment and compariso…

    Submitted 12 May, 2025; originally announced May 2025.

    Comments: 23 pages, 5 figures, to be published in IJCAI 2025

  38. arXiv:2505.06603  [pdf, other]

    cs.CV

    ReplayCAD: Generative Diffusion Replay for Continual Anomaly Detection

    Authors: Lei Hu, Zhiyong Gan, Ling Deng, Jinglin Liang, Lingyu Liang, Shuangping Huang, Tianshui Chen

    Abstract: Continual Anomaly Detection (CAD) enables anomaly detection models to learn new classes while preserving knowledge of historical classes. CAD faces two key challenges: catastrophic forgetting and segmentation of small anomalous regions. Existing CAD methods store image distributions or patch features to mitigate catastrophic forgetting, but they fail to preserve pixel-level detailed features fo…

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: Accepted by IJCAI 2025

  39. SOAP: Style-Omniscient Animatable Portraits

    Authors: Tingting Liao, Yujian Zheng, Adilbek Karmanov, Liwen Hu, Leyang Jin, Yuliang Xiu, Hao Li

    Abstract: Creating animatable 3D avatars from a single image remains challenging due to style limitations (realistic, cartoon, anime) and difficulties in handling accessories or hairstyles. While 3D diffusion models advance single-view reconstruction for general objects, outputs often lack animation controls or suffer from artifacts because of the domain gap. We propose SOAP, a style-omniscient framework to…

    Submitted 18 May, 2025; v1 submitted 8 May, 2025; originally announced May 2025.

    Journal ref: Siggraph 2025, page: https://tingtingliao.github.io/soap/

  40. arXiv:2505.04254  [pdf, other]

    cs.SE

    CompileAgent: Automated Real-World Repo-Level Compilation with Tool-Integrated LLM-based Agent System

    Authors: Li Hu, Guoqiang Chen, Xiuwei Shang, Shaoyin Cheng, Benlong Wu, Gangyang Li, Xu Zhu, Weiming Zhang, Nenghai Yu

    Abstract: With open-source projects growing in size and complexity, manual compilation becomes tedious and error-prone, highlighting the need for automation to improve efficiency and accuracy. However, the complexity of compilation instruction search and error resolution makes automatic compilation challenging. Inspired by the success of LLM-based agents in various fields, we propose CompileAgent, the first…

    Submitted 7 May, 2025; originally announced May 2025.

    Comments: 12 pages, 4 figures

  41. arXiv:2505.02573  [pdf, other]

    cs.LG cs.AI cs.DB cs.SI

    Rethinking Federated Graph Learning: A Data Condensation Perspective

    Authors: Hao Zhang, Xunkai Li, Yinlin Zhu, Lianglin Hu

    Abstract: Federated graph learning is a widely recognized technique that promotes collaborative training of graph neural networks (GNNs) by multi-client graphs. However, existing approaches heavily rely on the communication of model parameters or gradients for federated optimization and fail to adequately address the data heterogeneity introduced by intricate and diverse graph distributions. Although some me…

    Submitted 5 May, 2025; originally announced May 2025.

  42. arXiv:2505.01713  [pdf, other]

    cs.CV

    Vision and Intention Boost Large Language Model in Long-Term Action Anticipation

    Authors: Congqi Cao, Lanshu Hu, Yating Yu, Yanning Zhang

    Abstract: Long-term action anticipation (LTA) aims to predict future actions over an extended period. Previous approaches primarily focus on learning exclusively from video data but lack prior knowledge. Recent research leverages large language models (LLMs) by utilizing text-based inputs, which suffer from severe information loss. To tackle the limitations that single-modality methods face, we propose a novel Inte…

    Submitted 3 May, 2025; originally announced May 2025.

  43. arXiv:2504.21803  [pdf, other]

    cs.SE cs.CR

    An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding

    Authors: Xiuwei Shang, Zhenkan Fu, Shaoyin Cheng, Guoqiang Chen, Gangyang Li, Li Hu, Weiming Zhang, Nenghai Yu

    Abstract: Binary code analysis plays a pivotal role in the field of software security and is widely used in tasks such as software maintenance, malware detection, software vulnerability discovery, patch analysis, etc. However, unlike source code, reverse engineers face significant challenges in understanding binary code due to the lack of intuitive semantic information. Although traditional reverse tools ca…

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 38 pages, 9 figures

  44. Comprehensive List Generation for Multi-Generator Reranking

    Authors: Hailan Yang, Zhenyu Qi, Shuchang Liu, Xiaoyu Yang, Xiaobei Wang, Xiang Li, Lantao Hu, Han Li, Kun Gai

    Abstract: Reranking models solve the final recommendation lists that best fulfill users' demands. While existing solutions focus on finding parametric models that approximate optimal policies, recent approaches find that it is better to generate multiple lists to compete for a "pass" ticket from an evaluator, where the evaluator serves as the supervisor who accurately estimates the performance of the cand…

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 11 pages, 6 figures, 9 tables

    ACM Class: H.3.3

    Journal ref: Proceedings of the 48th International ACM SIGIR, 2025

  45. arXiv:2504.14218  [pdf, ps, other]

    cs.CL

    Understanding the Repeat Curse in Large Language Models from a Feature Perspective

    Authors: Junchi Yao, Shu Yang, Jianhua Xu, Lijie Hu, Mengdi Li, Di Wang

    Abstract: Large language models (LLMs) have made remarkable progress in various domains, yet they often suffer from repetitive text generation, a phenomenon we refer to as the "Repeat Curse". While previous studies have proposed decoding strategies to mitigate repetition, the underlying mechanism behind this issue remains insufficiently explored. In this work, we investigate the root causes of repetition in…

    Submitted 13 June, 2025; v1 submitted 19 April, 2025; originally announced April 2025.

    Comments: Accepted by ACL 2025, Findings, Long Paper

  46. arXiv:2504.13914  [pdf, other]

    cs.CL

    Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning

    Authors: ByteDance Seed: Jiaze Chen, Tiantian Fan, Xin Liu, Lingjun Liu, Zhiqi Lin, Mingxuan Wang, Chengyi Wang, Xiangpeng Wei, Wenyuan Xu, Yufeng Yuan, Yu Yue, Lin Yan, Qiying Yu, Xiaochen Zuo, Chi Zhang, Ruofei Zhu, Zhecheng An, Zhihao Bai, Yu Bao, Xingyan Bin, Jiangjie Chen, Feng Chen, Hongmin Chen, et al. (249 additional authors not shown)

    Abstract: We introduce Seed1.5-Thinking, capable of reasoning through thinking before responding, resulting in improved performance on a wide range of benchmarks. Seed1.5-Thinking achieves 86.7 on AIME 2024, 55.0 on Codeforces and 77.3 on GPQA, demonstrating excellent reasoning abilities in STEM and coding. Beyond reasoning tasks, the method demonstrates notable generalization across diverse domains. For in…

    Submitted 29 April, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  47. arXiv:2504.09441  [pdf, other]

    cs.CV eess.IV

    Structure-Accurate Medical Image Translation via Dynamic Frequency Balance and Knowledge Guidance

    Authors: Jiahua Xu, Dawei Zhou, Lei Hu, Zaiyi Liu, Nannan Wang, Xinbo Gao

    Abstract: Multimodal medical images play a crucial role in the precise and comprehensive clinical diagnosis. Diffusion model is a powerful strategy to synthesize the required medical images. However, existing approaches still suffer from the problem of anatomical structure distortion due to the overfitting of high-frequency information and the weakening of low-frequency information. Thus, we propose a novel…

    Submitted 27 May, 2025; v1 submitted 13 April, 2025; originally announced April 2025.

    Comments: Medical image translation, Diffusion model, 16 pages

  48. arXiv:2504.07575  [pdf, other]

    cs.IR

    Explicit Uncertainty Modeling for Video Watch Time Prediction

    Authors: Shanshan Wu, Shuchang Liu, Shuai Zhang, Xiaoyu Yang, Xiang Li, Lantao Hu, Han Li

    Abstract: In video recommendation, a critical component that determines the system's recommendation accuracy is the watch-time prediction module, since how long a user watches a video directly reflects personalized preferences. One of the key challenges of this problem is the user's stochastic watch-time behavior. To improve the prediction accuracy for such an uncertain behavior, existing approaches show th…

    Submitted 10 April, 2025; originally announced April 2025.

  49. arXiv:2504.04994  [pdf, other]

    cs.CL cs.AI

    Following the Whispers of Values: Unraveling Neural Mechanisms Behind Value-Oriented Behaviors in LLMs

    Authors: Ling Hu, Yuemei Xu, Xiaoyang Gu, Letao Han

    Abstract: Despite the impressive performance of large language models (LLMs), they can present unintended biases and harmful behaviors driven by encoded values, emphasizing the urgent need to understand the value mechanisms behind them. However, current research primarily evaluates these values through external responses with a focus on AI safety, lacking interpretability and failing to assess social values…

    Submitted 20 April, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  50. arXiv:2504.03220  [pdf, other]

    cs.LG

    Learning Lie Group Generators from Trajectories

    Authors: Lifan Hu

    Abstract: This work investigates the inverse problem of generator recovery in matrix Lie groups from discretized trajectories. Let $G$ be a real matrix Lie group and $\mathfrak{g} = \text{Lie}(G)$ its corresponding Lie algebra. A smooth trajectory $γ(t)$ generated by a fixed Lie algebra element $ξ \in \mathfrak{g}$ follows the exponential flow $γ(t) = g_0 \cdot \exp(t ξ)$. The central task addressed in t…

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 7 pages, 12 figures
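
    Illustration for entry 50: the abstract defines the exponential flow $γ(t) = g_0 \cdot \exp(t ξ)$ and the task of recovering $ξ$ from sampled points. The sketch below is not the paper's method (the abstract is truncated before it is described); it is a minimal baseline under stated assumptions: noise-free samples at a uniform step `dt`, and a step small enough that $\exp(dt\,ξ)$ lies close to the identity so a series logarithm converges. All function names (`expm_taylor`, `logm_series`, `recover_generator`) and the example generator are hypothetical. It uses the identity $γ(t_k)^{-1} γ(t_{k+1}) = \exp(dt\,ξ)$.

```python
import numpy as np


def expm_taylor(A, terms=30):
    """Matrix exponential by truncated Taylor series (adequate for small norms)."""
    result = np.eye(A.shape[0])
    term = np.eye(A.shape[0])
    for k in range(1, terms):
        term = term @ A / k          # term now holds A^k / k!
        result = result + term
    return result


def logm_series(M, terms=60):
    """Matrix logarithm via the Mercator series; assumes ||M - I|| < 1."""
    A = M - np.eye(M.shape[0])
    result = np.zeros_like(A)
    power = np.eye(M.shape[0])
    for k in range(1, terms):
        power = power @ A            # power now holds A^k
        result = result + ((-1) ** (k + 1)) * power / k
    return result


def recover_generator(trajectory, dt):
    """Average log(g_k^{-1} g_{k+1}) / dt over consecutive samples."""
    logs = [
        logm_series(np.linalg.solve(g_k, g_next))  # solve gives g_k^{-1} g_next
        for g_k, g_next in zip(trajectory[:-1], trajectory[1:])
    ]
    return np.mean(logs, axis=0) / dt


# Hypothetical example in SO(3): xi is a skew-symmetric generator.
xi = np.array([[0.0, -0.3, 0.2],
               [0.3, 0.0, -0.1],
               [-0.2, 0.1, 0.0]])
g0 = expm_taylor(np.array([[0.0, -0.5, 0.0],
                           [0.5, 0.0, 0.0],
                           [0.0, 0.0, 0.0]]))
dt = 0.1
trajectory = [g0 @ expm_taylor(k * dt * xi) for k in range(20)]

xi_hat = recover_generator(trajectory, dt)
print(np.max(np.abs(xi_hat - xi)))  # reconstruction error
```

    With noisy or coarsely sampled trajectories the series logarithm may fail to converge or the average may be biased, which is presumably where a learned or regularized estimator becomes relevant.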