Skip to main content

Showing 1–4 of 4 results for author: Tie, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.19793  [pdf, other

    cs.CR

    Prompt Injection Attack to Tool Selection in LLM Agents

    Authors: Jiawen Shi, Zenghui Yuan, Guiyao Tie, Pan Zhou, Neil Zhenqiang Gong, Lichao Sun

    Abstract: Tool selection is a key component of LLM agents. The process operates through a two-step mechanism - \emph{retrieval} and \emph{selection} - to pick the most appropriate tool from a tool library for a given task. In this work, we introduce \textit{ToolHijacker}, a novel prompt injection attack targeting tool selection in no-box scenarios. ToolHijacker injects a malicious tool document into the too… ▽ More

    Submitted 28 April, 2025; originally announced April 2025.

  2. arXiv:2503.11074  [pdf, other

    cs.AI cs.CL

    Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities

    Authors: Xueyang Zhou, Guiyao Tie, Guowen Zhang, Weidong Wang, Zhigang Zuo, Di Wu, Duanfeng Chu, Pan Zhou, Lichao Sun, Neil Zhenqiang Gong

    Abstract: The rise of Large Reasoning Models (LRMs) signifies a paradigm shift toward advanced computational reasoning. Yet, this progress disrupts traditional agent frameworks, traditionally anchored by execution-oriented Large Language Models (LLMs). To explore this transformation, we propose the LaRMA framework, encompassing nine tasks across Tool Usage, Plan Design, and Problem Solving, assessed with th… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 71 pages, 5 figures, 6 tables

  3. arXiv:2503.06254  [pdf, other

    cs.CR cs.LG

    Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation

    Authors: Yinuo Liu, Zenghui Yuan, Guiyao Tie, Jiawen Shi, Pan Zhou, Lichao Sun, Neil Zhenqiang Gong

    Abstract: Multimodal retrieval-augmented generation (RAG) enhances the visual reasoning capability of vision-language models (VLMs) by dynamically accessing information from external knowledge bases. In this work, we introduce \textit{Poisoned-MRAG}, the first knowledge poisoning attack on multimodal RAG systems. Poisoned-MRAG injects a few carefully crafted image-text pairs into the multimodal knowledge da… ▽ More

    Submitted 14 March, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

  4. arXiv:2503.06072  [pdf, other

    cs.CL cs.AI

    A Survey on Post-training of Large Language Models

    Authors: Guiyao Tie, Zeli Zhao, Dingjie Song, Fuyang Wei, Rong Zhou, Yurou Dai, Wen Yin, Zhejian Yang, Jiangyue Yan, Yao Su, Zhenhan Dai, Yifeng Xie, Yihan Cao, Lichao Sun, Pan Zhou, Lifang He, Hechang Chen, Yu Zhang, Qingsong Wen, Tianming Liu, Neil Zhenqiang Gong, Jiliang Tang, Caiming Xiong, Heng Ji, Philip S. Yu , et al. (1 additional authors not shown)

    Abstract: The emergence of Large Language Models (LLMs) has fundamentally transformed natural language processing, making them indispensable across domains ranging from conversational systems to scientific exploration. However, their pre-trained architectures often reveal limitations in specialized contexts, including restricted reasoning capacities, ethical uncertainties, and suboptimal domain-specific per… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: 87 pages, 21 figures, 9 tables