
Showing 1–50 of 772 results for author: Lin, S

  1. arXiv:2505.08744  [pdf, other]

    cs.AI

    DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models

    Authors: Xiaoyang Chen, Xinan Dai, Yu Du, Qian Feng, Naixu Guo, Tingshuo Gu, Yuting Gao, Yingyi Gao, Xudong Han, Xiang Jiang, Yilin Jin, Hongyi Lin, Shisheng Lin, Xiangnan Li, Yuante Li, Yixing Li, Zhentao Lai, Zilu Ma, Yingrong Peng, Jiacheng Qian, Hao-Yu Sun, Jianbo Sun, Zirui Wang, Siwei Wu, Zian Wang, et al. (6 additional authors not shown)

    Abstract: To advance the mathematical proficiency of large language models (LLMs), the DeepMath team has launched an open-source initiative aimed at developing an open mathematical LLM and systematically evaluating its mathematical creativity. This paper represents the initial contribution of this initiative. While recent developments in mathematical LLMs have predominantly emphasized reasoning skills, as e…

    Submitted 13 May, 2025; originally announced May 2025.

    Comments: 14 pages, 4 figures

  2. arXiv:2505.07886  [pdf, ps, other]

    cs.CL cs.AI

    PLHF: Prompt Optimization with Few-Shot Human Feedback

    Authors: Chun-Pai Yang, Kan Zheng, Shou-De Lin

    Abstract: Automatic prompt optimization frameworks are developed to obtain suitable prompts for large language models (LLMs) with respect to desired output quality metrics. Although existing approaches can handle conventional tasks such as fixed-solution question answering, defining the metric becomes complicated when the output quality cannot be easily assessed by comparisons with standard golden samples.…

    Submitted 10 May, 2025; originally announced May 2025.

  3. arXiv:2505.07309  [pdf, ps, other]

    cs.LG

    Uncertainty Profiles for LLMs: Uncertainty Source Decomposition and Adaptive Model-Metric Selection

    Authors: Pei-Fu Guo, Yun-Da Tsai, Shou-De Lin

    Abstract: Large language models (LLMs) often generate fluent but factually incorrect outputs, known as hallucinations, which undermine their reliability in real-world applications. While uncertainty estimation has emerged as a promising strategy for detecting such errors, current metrics offer limited interpretability and lack clarity about the types of uncertainty they capture. In this paper, we present a…

    Submitted 12 May, 2025; originally announced May 2025.

  4. arXiv:2505.00041  [pdf, other]

    cs.AR

    MCMComm: Hardware-Software Co-Optimization for End-to-End Communication in Multi-Chip-Modules

    Authors: Ritik Raj, Shengjie Lin, William Won, Tushar Krishna

    Abstract: Increasing AI computing demands and slowing transistor scaling have led to the advent of Multi-Chip-Module (MCM) based accelerators. MCMs enable cost-effective scalability, higher yield, and modular reuse by partitioning large chips into smaller chiplets. However, MCMs come at an increased communication cost, which requires critical analysis and optimization. This paper makes three main contribut…

    Submitted 2 May, 2025; v1 submitted 29 April, 2025; originally announced May 2025.

  5. arXiv:2504.20490  [pdf, other]

    cs.DC

    Hetu v2: A General and Scalable Deep Learning System with Hierarchical and Heterogeneous Single Program Multiple Data Annotations

    Authors: Haoyang Li, Fangcheng Fu, Hao Ge, Sheng Lin, Xuanyu Wang, Jiawen Niu, Xupeng Miao, Bin Cui

    Abstract: The Single Program Multiple Data (SPMD) paradigm provides a unified abstraction to annotate various parallel dimensions in distributed deep learning (DL) training. With SPMD, users can write training programs from the viewpoint of a single device, and the system will automatically deduce the tensor sharding and communication patterns. However, with the recent development in large-scale DL models,…

    Submitted 29 April, 2025; originally announced April 2025.

  6. arXiv:2504.17365  [pdf, other]

    cs.CV cs.CL

    TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation

    Authors: Ling You, Wenxuan Huang, Xinni Xie, Xiangyi Wei, Bangyan Li, Shaohui Lin, Yang Li, Changbo Wang

    Abstract: Soccer is a globally popular sporting event, typically characterized by long matches and distinctive highlight moments. While recent advances in Multimodal Large Language Models (MLLMs) offer promising capabilities in temporal grounding and video understanding, soccer commentary generation often requires precise temporal localization and semantically rich descriptions over long-form video. However, exis…

    Submitted 28 April, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  7. arXiv:2504.17255  [pdf]

    eess.IV cs.AI physics.optics

    3D Deep-learning-based Segmentation of Human Skin Sweat Glands and Their 3D Morphological Response to Temperature Variations

    Authors: Shaoyu Pei, Renxiong Wu, Hao Zheng, Lang Qin, Shuaichen Lin, Yuxing Gan, Wenjing Huang, Zhixuan Wang, Mohan Qin, Yong Liu, Guangming Ni

    Abstract: Skin, the primary regulator of heat exchange, relies on sweat glands for thermoregulation. Alterations in sweat gland morphology play a crucial role in various pathological conditions and clinical diagnoses. Current methods for observing sweat gland morphology are limited by their two-dimensional, in vitro, and destructive nature, underscoring the urgent need for real-time, non-invasive, quantifia…

    Submitted 24 April, 2025; originally announced April 2025.

  8. Optimizing SIA Development: A Case Study in User-Centered Design for Estuary, a Multimodal Socially Interactive Agent Framework

    Authors: Spencer Lin, Miru Jun, Basem Rizk, Karen Shieh, Scott Fisher, Sharon Mozgai

    Abstract: This case study presents our user-centered design model for Socially Intelligent Agent (SIA) development frameworks through our experience developing Estuary, an open source multimodal framework for building low-latency real-time socially interactive agents. We leverage the Rapid Assessment Process (RAP) to collect the thoughts of leading researchers in the field of SIAs regarding the current stat…

    Submitted 19 April, 2025; originally announced April 2025.

  9. arXiv:2504.12007  [pdf, other]

    cs.IR cs.AI

    Generative Recommendation with Continuous-Token Diffusion

    Authors: Haohao Qu, Wenqi Fan, Shanru Lin

    Abstract: In recent years, there has been a significant trend toward using large language model (LLM)-based recommender systems (RecSys). Current research primarily focuses on representing complex user-item interactions within a discrete space to align with the inherent discrete nature of language models. However, this approach faces limitations due to its discrete nature: (i) information is often compresse…

    Submitted 16 April, 2025; originally announced April 2025.

  10. arXiv:2504.10325  [pdf, other]

    cs.LO

    Cumulative-Time Signal Temporal Logic

    Authors: Hongkai Chen, Zeyu Zhang, Shouvik Roy, Ezio Bartocci, Scott A. Smolka, Scott D. Stoller, Shan Lin

    Abstract: Signal Temporal Logic (STL) is a widely adopted specification language in cyber-physical systems for expressing critical temporal requirements, such as safety conditions and response time. However, STL's expressivity is not sufficient to capture the cumulative duration during which a property holds within an interval of time. To overcome this limitation, we introduce Cumulative-Time Signal Tempora…

    Submitted 14 April, 2025; originally announced April 2025.

    Comments: 20 pages, 7 figures, 2 tables

  11. arXiv:2504.08685  [pdf, other]

    cs.CV cs.AI

    Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

    Authors: Team Seawead, Ceyuan Yang, Zhijie Lin, Yang Zhao, Shanchuan Lin, Zhibei Ma, Haoyuan Guo, Hao Chen, Lu Qi, Sen Wang, Feng Cheng, Feilong Zuo, Xuejiao Zeng, Ziyan Yang, Fangyuan Kong, Meng Wei, Zhiwu Qing, Fei Xiao, Tuyen Hoang, Siyu Zhang, Peihao Zhu, Qi Zhao, Jiangqiao Yan, Liangke Gui, Sheng Bi, et al. (30 additional authors not shown)

    Abstract: This technical report presents a cost-efficient strategy for training a video generation foundation model. We present a mid-sized research model with approximately 7 billion parameters (7B) called Seaweed-7B trained from scratch using 665,000 H100 GPU hours. Despite being trained with moderate computational resources, Seaweed-7B demonstrates highly competitive performance compared to contemporary…

    Submitted 4 May, 2025; v1 submitted 11 April, 2025; originally announced April 2025.

    Comments: Technical report (some typos fixed)

  12. arXiv:2504.06684  [pdf, other]

    cs.RO cs.MA

    SDHN: Skewness-Driven Hypergraph Networks for Enhanced Localized Multi-Robot Coordination

    Authors: Delin Zhao, Yanbo Shan, Chang Liu, Shenghang Lin, Yingxin Shou, Bin Xu

    Abstract: Multi-Agent Reinforcement Learning is widely used for multi-robot coordination, where simple graphs typically model pairwise interactions. However, such representations fail to capture higher-order collaborations, limiting effectiveness in complex tasks. While hypergraph-based approaches enhance cooperation, existing methods often generate arbitrary hypergraph structures and lack adaptability to e…

    Submitted 9 April, 2025; originally announced April 2025.

  13. arXiv:2504.06358  [pdf, other]

    cs.CV

    Towards Calibration Enhanced Network by Inverse Adversarial Attack

    Authors: Yupeng Cheng, Zi Pong Lim, Sarthak Ketanbhai Modi, Yon Shin Teo, Yushi Cao, Shang-Wei Lin

    Abstract: Test automation has become increasingly important as the complexity of both design and content in Human Machine Interface (HMI) software continues to grow. Current standard practice uses Optical Character Recognition (OCR) techniques to automatically extract textual information from HMI screens for validation. At present, one of the key challenges faced during the automation of HMI screen validati…

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 11 pages

  14. arXiv:2504.04224  [pdf, other]

    cs.SE eess.SY

    Exploration of Approaches for Robustness and Safety in a Low Code Open Environment for Factory Automation

    Authors: Gustavo Quiros A., Yi Peng Zhu, Tao Cui, Shaokai Lin, Marten Lohstroh, Edward A. Lee

    Abstract: This report is a compilation of technical knowledge and concepts that were produced by the authors and additional contributors in the context of the collaboration projects "Abstraction Requirements for Language of Choice in Industrial Automation" (FY21-22) and "Approaches for Robust and Safe Low-Code" (FY23-24) from Siemens Technology and the University of California, Berkeley. The primary objecti…

    Submitted 5 April, 2025; originally announced April 2025.

    Comments: 15 pages, 4 figures, technical report

  15. arXiv:2504.03793  [pdf, other]

    cs.LG cs.AI

    Outlook Towards Deployable Continual Learning for Particle Accelerators

    Authors: Kishansingh Rajput, Sen Lin, Auralee Edelen, Willem Blokland, Malachi Schram

    Abstract: Particle Accelerators are high power complex machines. To ensure uninterrupted operation of these machines, thousands of pieces of equipment need to be synchronized, which requires addressing many challenges including design, optimization and control, anomaly detection and machine protection. With recent advancements, Machine Learning (ML) holds promise to assist in more advanced prognostics, optim…

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 41 pages, 6 figures, submitted to Machine Learning: Science and Technology Journal

  16. arXiv:2504.00954  [pdf, other]

    cs.CV cs.AI

    IDMR: Towards Instance-Driven Precise Visual Correspondence in Multimodal Retrieval

    Authors: Bangwei Liu, Yicheng Bao, Shaohui Lin, Xuhong Wang, Xin Tan, Yingchun Wang, Yuan Xie, Chaochao Lu

    Abstract: Multimodal retrieval systems are becoming increasingly vital for cutting-edge AI technologies, such as embodied AI and AI-driven digital content industries. However, current multimodal retrieval tasks lack sufficient complexity and demonstrate limited practical application value. This inspires us to design Instance-Driven Multimodal Image Retrieval (IDMR), a novel task that requires models to retrieve…

    Submitted 1 April, 2025; originally announced April 2025.

  17. arXiv:2503.23350  [pdf, other]

    cs.AI

    A Survey of WebAgents: Towards Next-Generation AI Agents for Web Automation with Large Foundation Models

    Authors: Liangbo Ning, Ziran Liang, Zhuohang Jiang, Haohao Qu, Yujuan Ding, Wenqi Fan, Xiao-yong Wei, Shanru Lin, Hui Liu, Philip S. Yu, Qing Li

    Abstract: Advances in web techniques have significantly revolutionized various aspects of people's lives. Despite the importance of the web, many tasks performed on it are repetitive and time-consuming, negatively impacting overall quality of life. To efficiently handle these tedious daily tasks, one of the most promising approaches is to advance autonomous agents based on Artificial Intel…

    Submitted 10 May, 2025; v1 submitted 30 March, 2025; originally announced March 2025.

    Comments: Accepted by KDD 2025

  18. arXiv:2503.22740  [pdf, other]

    cs.LG cs.AI

    CSPO: Cross-Market Synergistic Stock Price Movement Forecasting with Pseudo-volatility Optimization

    Authors: Sida Lin, Yankai Chen, Yiyan Qi, Chenhao Ma, Bokai Cao, Yifei Zhang, Xue Liu, Jian Guo

    Abstract: The stock market, as a cornerstone of the financial markets, places forecasting stock price movements at the forefront of challenges in quantitative finance. Emerging learning-based approaches have made significant progress in capturing the intricate and ever-evolving data patterns of modern markets. The rapidly expanding stock market presents two characteristics, i.e., stock exogene…

    Submitted 26 March, 2025; originally announced March 2025.

  19. arXiv:2503.20454  [pdf, other]

    cs.LG cs.CV

    Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks

    Authors: Yangqi Feng, Shing-Ho J. Lin, Baoyuan Gao, Xian Wei

    Abstract: Recent research has revealed that high compression of Deep Neural Networks (DNNs), e.g., massive pruning of the weight matrix of a DNN, leads to a severe drop in accuracy and susceptibility to adversarial attacks. Integration of network pruning into an adversarial training framework has been proposed to promote adversarial robustness. It has been observed that a highly pruned weight matrix tends t…

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 13 pages, 6 figures

  20. arXiv:2503.18940  [pdf, other]

    cs.CV

    Training-free Diffusion Acceleration with Bottleneck Sampling

    Authors: Ye Tian, Xin Xia, Yuxi Ren, Shanchuan Lin, Xing Wang, Xuefeng Xiao, Yunhai Tong, Ling Yang, Bin Cui

    Abstract: Diffusion models have demonstrated remarkable capabilities in visual content generation but remain challenging to deploy due to their high computational cost during inference. This computational burden primarily arises from the quadratic complexity of self-attention with respect to image or video resolution. While existing acceleration methods often compromise output quality or necessitate costly…

    Submitted 27 March, 2025; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: Project Page: https://tyfeld.github.io/BottleneckSampling.github.io/

  21. arXiv:2503.18676  [pdf, ps, other]

    cs.LG

    Feature Qualification by Deep Nets: A Constructive Approach

    Authors: Feilong Cao, Shao-Bo Lin

    Abstract: The great success of deep learning has stimulated avid research activities in verifying the power of depth in theory, a common consensus of which is that deep nets are versatile in approximating and learning numerous functions. Such versatility certainly enhances the understanding of the power of depth, but makes it difficult to judge which data features are crucial in a specific learning task. T…

    Submitted 24 March, 2025; originally announced March 2025.

  22. arXiv:2503.17578  [pdf, other]

    cs.LG

    Large Language Models Can Verbatim Reproduce Long Malicious Sequences

    Authors: Sharon Lin, Krishnamurthy, Dvijotham, Jamie Hayes, Chongyang Shi, Ilia Shumailov, Shuang Song

    Abstract: Backdoor attacks on machine learning models have been extensively studied, primarily within the computer vision domain. Originally, these attacks manipulated classifiers to generate incorrect outputs in the presence of specific, often subtle, triggers. This paper re-examines the concept of backdoor attacks in the context of Large Language Models (LLMs), focusing on the generation of long, verbatim…

    Submitted 21 March, 2025; originally announced March 2025.

  23. The Immersive Archive: Archival Strategies for the Sensorama & Sutherland HMD

    Authors: Zeynep Abes, Nathan Fairchild, Spencer Lin, Michael Wahba, Katrina Xiao, Scott S. Fisher

    Abstract: The Immersive Archive is an initiative dedicated to preserving and restoring the groundbreaking works from across Extended Reality (XR) history. Originating at the University of Southern California's Mobile and Environmental Media Lab, this archive is committed to developing and exhibiting simulations of influential XR devices that have shaped immersive media over time. This paper examines the challen…

    Submitted 17 March, 2025; originally announced March 2025.

    Journal ref: Proc. IEEE Conf. AI & XR, 2025, pp. 307-312

  24. arXiv:2503.10592  [pdf, other]

    cs.CV

    CameraCtrl II: Dynamic Scene Exploration via Camera-controlled Video Diffusion Models

    Authors: Hao He, Ceyuan Yang, Shanchuan Lin, Yinghao Xu, Meng Wei, Liangke Gui, Qi Zhao, Gordon Wetzstein, Lu Jiang, Hongsheng Li

    Abstract: This paper introduces CameraCtrl II, a framework that enables large-scale dynamic scene exploration through a camera-controlled video diffusion model. Previous camera-conditioned video generative models suffer from diminished video dynamics and limited range of viewpoints when generating videos with large camera movement. We take an approach that progressively expands the generation of dynamic sce…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Project page: https://hehao13.github.io/Projects-CameraCtrl-II/

  25. arXiv:2503.09991  [pdf, other]

    cs.IT

    Finite Field Multiple Access II: from Symbol-wise to Codeword-wise

    Authors: Qi-yue Yu, Shi-wen Lin, Ting-wei Yang

    Abstract: A finite-field multiple-access (FFMA) system separates users within a finite field by utilizing different element-pairs (EPs) as virtual resources. The Cartesian product of distinct EPs forms an EP code, which serves as the input to a finite-field multiplexing module (FF-MUX), allowing the FFMA technique to interchange the order of channel coding and multiplexing. This flexibility enables the FFMA…

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 50 pages, 9 figures

  26. arXiv:2503.08073  [pdf, other]

    cs.CV

    Seeing Beyond Haze: Generative Nighttime Image Dehazing

    Authors: Beibei Lin, Stephen Lin, Robby Tan

    Abstract: Nighttime image dehazing is particularly challenging when dense haze and intense glow severely degrade or completely obscure background information. Existing methods often encounter difficulties due to insufficient background priors and limited generative ability, both essential for handling such conditions. In this paper, we introduce BeyondHaze, a generative nighttime dehazing method that not on…

    Submitted 11 March, 2025; originally announced March 2025.

  27. arXiv:2503.07487  [pdf, other]

    cs.CV

    LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?

    Authors: Bangyan Li, Wenxuan Huang, Yunhang Shen, Yeqiang Wang, Shaohui Lin, Jingzhong Lin, Ling You, Yinqi Zhang, Ke Li, Xing Sun, Yuling Sun

    Abstract: Recently, multimodal large models (MLLMs) have demonstrated exceptional capabilities in visual understanding and reasoning across various vision-language tasks. However, MLLMs usually perform poorly in zero-shot medical disease recognition, as they do not fully exploit the captured features and available medical knowledge. To address this challenge, we propose LLaVA-RadZ, a simple yet effective fr…

    Submitted 10 March, 2025; originally announced March 2025.

  28. arXiv:2503.07137  [pdf, other]

    cs.LG cs.AI

    A Comprehensive Survey of Mixture-of-Experts: Algorithms, Theory, and Applications

    Authors: Siyuan Mu, Sen Lin

    Abstract: Artificial intelligence (AI) has achieved astonishing successes in many domains, especially with the recent breakthroughs in the development of foundational large models. These large models, leveraging their extensive training data, provide versatile solutions for a wide range of downstream tasks. However, as modern datasets become increasingly diverse and complex, the development of large AI mode…

    Submitted 17 April, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: 29 pages, 3 figures

  29. arXiv:2503.07125  [pdf, other]

    cs.CV

    Learning A Zero-shot Occupancy Network from Vision Foundation Models via Self-supervised Adaptation

    Authors: Sihao Lin, Daqi Liu, Ruochong Fu, Dongrui Liu, Andy Song, Hongwei Xie, Zhihui Li, Bing Wang, Xiaojun Chang

    Abstract: Estimating the 3D world from 2D monocular images is a fundamental yet challenging task due to the labour-intensive nature of 3D annotations. To simplify label acquisition, this work proposes a novel approach that bridges 2D vision foundation models (VFMs) with 3D tasks by decoupling 3D supervision into an ensemble of image-level primitives, e.g., semantic and geometric components. As a key motivat…

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: preprint

  30. arXiv:2503.06749  [pdf, other]

    cs.CV cs.AI cs.CL cs.LG

    Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

    Authors: Wenxuan Huang, Bohan Jia, Zijie Zhai, Shaosheng Cao, Zheyu Ye, Fei Zhao, Zhe Xu, Yao Hu, Shaohui Lin

    Abstract: DeepSeek-R1-Zero has successfully demonstrated the emergence of reasoning capabilities in LLMs purely through Reinforcement Learning (RL). Inspired by this breakthrough, we explore how RL can be utilized to enhance the reasoning capability of MLLMs. However, direct training with RL struggles to activate complex reasoning capabilities such as questioning and reflection in MLLMs, due to the absence…

    Submitted 11 March, 2025; v1 submitted 9 March, 2025; originally announced March 2025.

  31. Towards Understanding the Use of MLLM-Enabled Applications for Visual Interpretation by Blind and Low Vision People

    Authors: Ricardo E. Gonzalez Penuela, Ruiying Hu, Sharon Lin, Tanisha Shende, Shiri Azenkot

    Abstract: Blind and Low Vision (BLV) people have adopted AI-powered visual interpretation applications to address their daily needs. While these applications have been helpful, prior work has found that users remain unsatisfied by their frequent errors. Recently, multimodal large language models (MLLMs) have been integrated into visual interpretation applications, and they show promise for more descriptive…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 8 pages, 1 figure, 4 tables, to appear at CHI 2025

    ACM Class: I.2.1; H.5.2

  32. arXiv:2503.03207  [pdf, other]

    cs.PL

    PolyVer: A Compositional Approach for Polyglot System Modeling and Verification

    Authors: Pei-Wei Chen, Shaokai Lin, Adwait Godbole, Ramneet Singh, Elizabeth Polgreen, Edward A. Lee, Sanjit A. Seshia

    Abstract: Several software systems are polyglot; that is, they comprise programs implemented in a combination of programming languages. Verifiers that directly run on mainstream programming languages are currently customized for single languages. Thus, to verify polyglot systems, one usually translates them into a common verification language or formalism on which the verifier runs. In this paper, we presen…

    Submitted 12 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 27 pages, 8 figures; acknowledgements added, typos fixed

  33. arXiv:2503.02915  [pdf, ps, other]

    eess.IV cs.CV cs.LG math.NA physics.med-ph

    Computer-aided shape features extraction and regression models for predicting the ascending aortic aneurysm growth rate

    Authors: Leonardo Geronzi, Antonio Martinez, Michel Rochette, Kexin Yan, Aline Bel-Brunon, Pascal Haigron, Pierre Escrig, Jacques Tomasi, Morgan Daniel, Alain Lalande, Siyu Lin, Diana Marcela Marin-Castrillon, Olivier Bouchot, Jean Porterie, Pier Paolo Valentini, Marco Evangelos Biancolini

    Abstract: Objective: ascending aortic aneurysm growth prediction is still challenging in clinics. In this study, we evaluate and compare the ability of local and global shape features to predict ascending aortic aneurysm growth. Material and methods: 70 patients with aneurysm, for which two 3D acquisitions were available, are included. Following segmentation, three local shape features are computed: (1) t…

    Submitted 4 March, 2025; originally announced March 2025.

    Journal ref: Volume 162, August 2023, 107052, Computers in Biology and Medicine

  34. arXiv:2503.01768  [pdf, other]

    cs.LG cs.CV

    SHADE-AD: An LLM-Based Framework for Synthesizing Activity Data of Alzheimer's Patients

    Authors: Heming Fu, Hongkai Chen, Shan Lin, Guoliang Xing

    Abstract: Alzheimer's Disease (AD) has become an increasingly critical global health concern, which necessitates effective monitoring solutions in smart health applications. However, the development of such solutions is significantly hindered by the scarcity of AD-specific activity datasets. To address this challenge, we propose SHADE-AD, a Large Language Model (LLM) framework for Synthesizing Human Activit…

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 7 pages, 6 figures, ACM SenSys'25

  35. arXiv:2503.00923  [pdf, other]

    cs.RO

    HWC-Loco: A Hierarchical Whole-Body Control Approach to Robust Humanoid Locomotion

    Authors: Sixu Lin, Guanren Qiao, Yunxin Tai, Ang Li, Kui Jia, Guiliang Liu

    Abstract: Humanoid robots, capable of assuming human roles in various workplaces, have become essential to the advancement of embodied intelligence. However, as robots with complex physical structures, learning a control model that can operate robustly across diverse environments remains inherently challenging, particularly under the discrepancies between training and deployment environments. In this study,…

    Submitted 10 March, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

  36. arXiv:2503.00402  [pdf, other]

    cs.DB

    A Topology-Aware Localized Update Strategy for Graph-Based ANN Index

    Authors: Song Yu, Shengyuan Lin, Shufeng Gong, Yongqing Xie, Ruicheng Liu, Yijie Zhou, Ji Sun, Yanfeng Zhang, Guoliang Li, Ge Yu

    Abstract: The graph-based index has been widely adopted to meet the demand for approximate nearest neighbor search (ANNS) for high-dimensional vectors. In dynamic scenarios involving frequent vector insertions and deletions, existing systems improve update throughput by adopting a batch update method. However, a large batch size leads to significant degradation in search accuracy. This work aims…

    Submitted 18 March, 2025; v1 submitted 1 March, 2025; originally announced March 2025.

  37. arXiv:2502.20807  [pdf, other]

    cs.LG

    Digital Player: Evaluating Large Language Models based Human-like Agent in Games

    Authors: Jiawei Wang, Kai Wang, Shaojie Lin, Runze Wu, Bihan Xu, Lingeng Jiang, Shiwei Zhao, Renyu Zhu, Haoyu Liu, Zhipeng Hu, Zhong Fan, Le Li, Tangjie Lyu, Changjie Fan

    Abstract: With the rapid advancement of Large Language Models (LLMs), LLM-based autonomous agents have shown the potential to function as digital employees, such as digital analysts, teachers, and programmers. In this paper, we develop an application-level testbed based on the open-source strategy game "Unciv", which has millions of active players, to enable researchers to build a "data flywheel" for studyi…

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: neurips datasets and benchmarks 2024, not accepted

  38. arXiv:2502.20576  [pdf, other]

    cs.DB cs.CL

    Smart Routing: Cost-Effective Multi-LLM Serving for Multi-Core AIOS

    Authors: Kai Mei, Wujiang Xu, Shuhang Lin, Yongfeng Zhang

    Abstract: As large language models (LLMs) are increasingly deployed as service endpoints in systems, the surge in query volume creates significant scheduling challenges. Existing scheduling frameworks mainly target latency optimization while neglecting the capability of LLMs to serve different levels of queries, which could lead to computational resource waste. For example, those simple queries can be saf…

    Submitted 2 April, 2025; v1 submitted 27 February, 2025; originally announced February 2025.

  39. arXiv:2502.18699  [pdf, other]

    cs.CL cs.LG stat.ME

    MPO: An Efficient Post-Processing Framework for Mixing Diverse Preference Alignment

    Authors: Tianze Wang, Dongnan Gui, Yifan Hu, Shuhang Lin, Linjun Zhang

    Abstract: Reinforcement Learning from Human Feedback (RLHF) has shown promise in aligning large language models (LLMs). Yet its reliance on a singular reward model often overlooks the diversity of human preferences. Recent approaches address this limitation by leveraging multi-dimensional feedback to fine-tune corresponding reward models and train LLMs using reinforcement learning. However, the process is c…

    Submitted 25 February, 2025; originally announced February 2025.

  40. arXiv:2502.16965  [pdf, other]

    cs.CV

    Autoregressive Image Generation with Vision Full-view Prompt

    Authors: Miaomiao Cai, Guanjie Wang, Wei Li, Zhijun Tu, Hanting Chen, Shaohui Lin, Jie Hu

    Abstract: In autoregressive (AR) image generation, models based on the 'next-token prediction' paradigm of LLMs have shown comparable performance to diffusion models by reducing inductive biases. However, directly applying LLMs to complex image generation can struggle with reconstructing the image's structure and details, impacting the generation's accuracy and stability. Additionally, the 'next-token predi…

    Submitted 12 March, 2025; v1 submitted 24 February, 2025; originally announced February 2025.

  41. arXiv:2502.15130  [pdf, other]

    cs.CV

    TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba

    Authors: Xiuwei Chen, Sihao Lin, Xiao Dong, Zisheng Chen, Meng Cao, Jianhua Han, Hang Xu, Xiaodan Liang

    Abstract: Transformers have been favored in both uni-modal and multi-modal foundation models for their flexible scalability in attention modules. Consequently, a number of pre-trained Transformer models, e.g., LLaVA, CLIP, and DEIT, are publicly available. Recent research has introduced subquadratic architectures like Mamba, which enables global awareness with linear complexity. Nevertheless, training speci…

    Submitted 20 February, 2025; originally announced February 2025.

  42. arXiv:2502.11436  [pdf, other]

    cs.LG

    ADO: Automatic Data Optimization for Inputs in LLM Prompts

    Authors: Sam Lin, Wenyue Hua, Lingyao Li, Zhenting Wang, Yongfeng Zhang

    Abstract: This study explores a novel approach to enhance the performance of Large Language Models (LLMs) through the optimization of input data within prompts. While previous research has primarily focused on refining instruction components and augmenting input data with in-context examples, our work investigates the potential benefits of optimizing the input data itself. We introduce a two-pronged strateg…

    Submitted 16 February, 2025; originally announced February 2025.

  43. arXiv:2502.09156  [pdf]

    cs.CL

    Improving TCM Question Answering through Tree-Organized Self-Reflective Retrieval with LLMs

    Authors: Chang Liu, Ying Chang, Jianmin Li, Yiqian Qu, Yu Li, Lingyong Cao, Shuyuan Lin

    Abstract: Objectives: Large language models (LLMs) can harness medical knowledge for intelligent question answering (Q&A), promising support for auxiliary diagnosis and medical talent cultivation. However, there is a deficiency of highly efficient retrieval-augmented generation (RAG) frameworks within the domain of Traditional Chinese Medicine (TCM). Our purpose is to observe the effect of the Tree-Organize…

    Submitted 13 February, 2025; originally announced February 2025.

  44. arXiv:2502.07325  [pdf]

    cs.LG math.NA

    Long-term simulation of physical and mechanical behaviors using curriculum-transfer-learning based physics-informed neural networks

    Authors: Yuan Guo, Zhuojia Fu, Jian Min, Shiyu Lin, Xiaoting Liu, Youssef F. Rashed, Xiaoying Zhuang

    Abstract: This paper proposes a Curriculum-Transfer-Learning based physics-informed neural network (CTL-PINN) for long-term simulation of physical and mechanical behaviors. The main innovation of CTL-PINN lies in decomposing long-term problems into a sequence of short-term subproblems. Initially, the standard PINN is employed to solve the first sub-problem. As the simulation progresses, subsequent time-doma…

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 31 pages, 18 figures

  45. QPRAC: Towards Secure and Practical PRAC-based Rowhammer Mitigation using Priority Queues

    Authors: Jeonghyun Woo, Chris S. Lin, Prashant J. Nair, Aamer Jaleel, Gururaj Saileshwar

    Abstract: JEDEC has introduced the Per Row Activation Counting (PRAC) framework for DDR5 and future DRAMs to enable precise counting of DRAM row activations. PRAC enables a holistic mitigation of Rowhammer attacks even at ultra-low Rowhammer thresholds. PRAC uses an Alert Back-Off (ABO) protocol to request the memory controller to issue Rowhammer mitigation requests. However, recent PRAC implementations are…

    Submitted 15 May, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

    Comments: 15 pages, including appendices. The paper was presented at HPCA 2025 (https://hpca-conf.org/2025/)

    Journal ref: 2025 IEEE Symposium on High-Performance Computer Architecture (HPCA 2025)

  46. arXiv:2501.18841  [pdf, other]

    cs.LG cs.CR

    Trading Inference-Time Compute for Adversarial Robustness

    Authors: Wojciech Zaremba, Evgenia Nitishinskaya, Boaz Barak, Stephanie Lin, Sam Toyer, Yaodong Yu, Rachel Dias, Eric Wallace, Kai Xiao, Johannes Heidecke, Amelia Glaese

    Abstract: We conduct experiments on the impact of increasing inference-time compute in reasoning models (specifically OpenAI o1-preview and o1-mini) on their robustness to adversarial attacks. We find that across a variety of attacks, increased inference-time compute leads to improved robustness. In many cases (with important exceptions), the fraction of model samples where the attack succeeds tends to zero…

    Submitted 30 January, 2025; originally announced January 2025.

  47. arXiv:2501.17186  [pdf, other]

    cs.AI cs.CL cs.LG

    Complete Chess Games Enable LLM Become A Chess Master

    Authors: Yinqi Zhang, Xintian Han, Haolong Li, Kedi Chen, Shaohui Lin

    Abstract: Large language models (LLM) have shown remarkable abilities in text generation, question answering, language translation, reasoning and many other tasks. It continues to advance rapidly and is becoming increasingly influential in various fields, from technology and business to education and entertainment. Despite LLM's success in multiple areas, its ability to play abstract games, such as chess, i…

    Submitted 29 January, 2025; v1 submitted 26 January, 2025; originally announced January 2025.

    Comments: NAACL 2025

  48. arXiv:2501.16744  [pdf, other]

    cs.LG cs.AI

    LLM Assisted Anomaly Detection Service for Site Reliability Engineers: Enhancing Cloud Infrastructure Resilience

    Authors: Nimesh Jha, Shuxin Lin, Srideepika Jayaraman, Kyle Frohling, Christodoulos Constantinides, Dhaval Patel

    Abstract: This paper introduces a scalable Anomaly Detection Service with a generalizable API tailored for industrial time-series data, designed to assist Site Reliability Engineers (SREs) in managing cloud infrastructure. The service enables efficient anomaly detection in complex data streams, supporting proactive identification and resolution of issues. Furthermore, it presents an innovative approach to a…

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: Accepted at the AAAI-2025 Deployable AI Workshop

  49. arXiv:2501.12162  [pdf, other]

    cs.CL cs.AI cs.DC cs.LG

    AdaServe: SLO-Customized LLM Serving with Fine-Grained Speculative Decoding

    Authors: Zikun Li, Zhuofu Chen, Remi Delacourt, Gabriele Oliaro, Zeyu Wang, Qinghan Chen, Shuhuai Lin, April Yang, Zhihao Zhang, Zhuoming Chen, Sean Lai, Xupeng Miao, Zhihao Jia

    Abstract: This paper introduces AdaServe, the first LLM serving system to support SLO customization through fine-grained speculative decoding. AdaServe leverages the logits of a draft model to predict the speculative accuracy of tokens and employs a theoretically optimal algorithm to construct token trees for verification. To accommodate diverse SLO requirements without compromising throughput, AdaServe emp…

    Submitted 21 January, 2025; originally announced January 2025.

  50. arXiv:2501.10963  [pdf, other]

    cs.CE

    Open FinLLM Leaderboard: Towards Financial AI Readiness

    Authors: Shengyuan Colin Lin, Felix Tian, Keyi Wang, Xingjian Zhao, Jimin Huang, Qianqian Xie, Luca Borella, Matt White, Christina Dan Wang, Kairong Xiao, Xiao-Yang Liu Yanglet, Li Deng

    Abstract: Financial large language models (FinLLMs) with multimodal capabilities are envisioned to revolutionize applications across business, finance, accounting, and auditing. However, real-world adoption requires robust benchmarks of FinLLMs' and FinAgents' performance. Maintaining an open leaderboard is crucial for encouraging innovative adoption and improving model effectiveness. In collaboration with…

    Submitted 29 April, 2025; v1 submitted 19 January, 2025; originally announced January 2025.