Skip to main content

Showing 1–50 of 75 results for author: Pan, J Z

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00675  [pdf, other

    cs.CL

    Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions

    Authors: Yiming Du, Wenyu Huang, Danna Zheng, Zhaowei Wang, Sebastien Montella, Mirella Lapata, Kam-Fai Wong, Jeff Z. Pan

    Abstract: Memory is a fundamental component of AI systems, underpinning large language models (LLMs) based agents. While prior surveys have focused on memory applications with LLMs, they often overlook the atomic operations that underlie memory dynamics. In this survey, we first categorize memory representations into parametric, contextual structured, and contextual unstructured and then introduce six funda… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

  2. arXiv:2504.13631  [pdf, other

    cs.AI

    Multi-modal Knowledge Graph Generation with Semantics-enriched Prompts

    Authors: Yajing Xu, Zhiqiang Liu, Jiaoyan Chen, Mingchen Tu, Zhuo Chen, Jeff Z. Pan, Yichi Zhang, Yushan Zhu, Wen Zhang, Huajun Chen

    Abstract: Multi-modal Knowledge Graphs (MMKGs) have been widely applied across various domains for knowledge representation. However, the existing MMKGs are significantly fewer than required, and their construction faces numerous challenges, particularly in ensuring the selection of high-quality, contextually relevant images for knowledge graph enrichment. To address these challenges, we present a framework… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: Accepted by IJCNN 2025

  3. arXiv:2503.19470  [pdf, other

    cs.AI cs.CL

    ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

    Authors: Mingyang Chen, Tianpeng Li, Haoze Sun, Yijie Zhou, Chenzheng Zhu, Haofen Wang, Jeff Z. Pan, Wen Zhang, Huajun Chen, Fan Yang, Zenan Zhou, Weipeng Chen

    Abstract: Large Language Models (LLMs) have shown remarkable capabilities in reasoning, exemplified by the success of OpenAI-o1 and DeepSeek-R1. However, integrating reasoning with external search processes remains challenging, especially for complex multi-hop questions requiring multiple retrieval steps. We propose ReSearch, a novel framework that trains LLMs to Reason with Search via reinforcement learnin… ▽ More

    Submitted 27 March, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: Work in progress

  4. arXiv:2503.09218  [pdf, other

    cs.CL

    N2C2: Nearest Neighbor Enhanced Confidence Calibration for Cross-Lingual In-Context Learning

    Authors: Jie He, Simon Yu, Deyi Xiong, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Recent advancements of in-context learning (ICL) show language models can significantly improve their performance when demonstrations are provided. However, little attention has been paid to model calibration and prediction confidence of ICL in cross-lingual scenarios. To bridge this gap, we conduct a thorough analysis of ICL for cross-lingual sentiment classification. Our findings suggest that IC… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  5. arXiv:2503.05281  [pdf, other

    cs.CL

    Similarity-Based Domain Adaptation with LLMs

    Authors: Jie He, Wendi Zhou, Xiang Lorraine Li, Jeff Z. Pan

    Abstract: Unsupervised domain adaptation leverages abundant labeled data from various source domains to generalize onto unlabeled target data. Prior research has primarily focused on learning domain-invariant features across the source and target domains. However, these methods often require training a model using source domain data, which is time-consuming and can limit model usage for applications with di… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

  6. arXiv:2502.20175  [pdf, ps, other

    cs.AI cs.CL

    An Extensive Evaluation of PDDL Capabilities in off-the-shelf LLMs

    Authors: Kaustubh Vyas, Damien Graux, Sébastien Montella, Pavlos Vougiouklis, Ruofei Lai, Keshuang Li, Yang Ren, Jeff Z. Pan

    Abstract: In recent advancements, large language models (LLMs) have exhibited proficiency in code generation and chain-of-thought reasoning, laying the groundwork for tackling automatic formal planning tasks. This study evaluates the potential of LLMs to understand and generate Planning Domain Definition Language (PDDL), an essential representation in artificial intelligence planning. We conduct an extensiv… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: Under review

  7. arXiv:2502.18990  [pdf, other

    cs.CL

    GenTool: Enhancing Tool Generalization in Language Models through Zero-to-One and Weak-to-Strong Simulation

    Authors: Jie He, Jennifer Neville, Mengting Wan, Longqi Yang, Hui Liu, Xiaofeng Xu, Xia Song, Jeff Z. Pan, Pei Zhou

    Abstract: Large Language Models (LLMs) can enhance their capabilities as AI assistants by integrating external tools, allowing them to access a wider range of information. While recent LLMs are typically fine-tuned with tool usage examples during supervised fine-tuning (SFT), questions remain about their ability to develop robust tool-usage skills and can effectively generalize to unseen queries and tools.… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

  8. arXiv:2501.14497  [pdf, other

    cs.CL

    Evaluating and Improving Graph to Text Generation with Large Language Models

    Authors: Jie He, Yijun Yang, Wanqiu Long, Deyi Xiong, Victor Gutierrez-Basulto, Jeff Z. Pan

    Abstract: Large language models (LLMs) have demonstrated immense potential across various tasks. However, research for exploring and improving the capabilities of LLMs in interpreting graph structures remains limited. To address this gap, we conduct a comprehensive evaluation of prompting current open-source LLMs on graph-to-text generation tasks. Although we explored the optimal prompting strategies and pr… ▽ More

    Submitted 14 February, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: NAACL 2025

  9. arXiv:2412.18431  [pdf, other

    cs.CL cs.AI cs.IR

    GeAR: Graph-enhanced Agent for Retrieval-augmented Generation

    Authors: Zhili Shen, Chenxin Diao, Pavlos Vougiouklis, Pascual Merita, Shriram Piramanayagam, Damien Graux, Dandan Tu, Zeren Jiang, Ruofei Lai, Yang Ren, Jeff Z. Pan

    Abstract: Retrieval-augmented generation systems rely on effective document retrieval capabilities. By design, conventional sparse or dense retrievers face challenges in multi-hop retrieval scenarios. In this paper, we present GeAR, which advances RAG performance through two key innovations: (i) graph expansion, which enhances any conventional base retriever, such as BM25, and (ii) an agent framework that i… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  10. arXiv:2412.17032  [pdf, other

    cs.CL

    MINTQA: A Multi-Hop Question Answering Benchmark for Evaluating LLMs on New and Tail Knowledge

    Authors: Jie He, Nan Hu, Wanqiu Long, Jiaoyan Chen, Jeff Z. Pan

    Abstract: Large language models (LLMs) have demonstrated impressive capabilities in various reasoning tasks but face significant challenges with complex, knowledge-intensive multi-hop queries, particularly those involving new or long-tail knowledge. Existing benchmarks often fail to fully address these challenges. To bridge this gap, we introduce MINTQA (Multi-hop Question Answering on New and Tail Knowledg… ▽ More

    Submitted 28 January, 2025; v1 submitted 22 December, 2024; originally announced December 2024.

  11. arXiv:2412.12839  [pdf, other

    cs.AI

    From An LLM Swarm To A PDDL-Empowered HIVE: Planning Self-Executed Instructions In A Multi-Modal Jungle

    Authors: Kaustubh Vyas, Damien Graux, Yijun Yang, Sébastien Montella, Chenxin Diao, Wendi Zhou, Pavlos Vougiouklis, Ruofei Lai, Yang Ren, Keshuang Li, Jeff Z. Pan

    Abstract: In response to the call for agent-based solutions that leverage the ever-increasing capabilities of the deep models' ecosystem, we introduce Hive -- a comprehensive solution for selecting appropriate models and subsequently planning a set of atomic actions to satisfy the end-users' instructions. Hive operates over sets of models and, upon receiving natural language instructions (i.e. user queries)… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: Under review

  12. arXiv:2412.10440  [pdf, other

    cs.CV cs.AI

    Multi-level Matching Network for Multimodal Entity Linking

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Ru Li, Jeff Z. Pan

    Abstract: Multimodal entity linking (MEL) aims to link ambiguous mentions within multimodal contexts to corresponding entities in a multimodal knowledge base. Most existing approaches to MEL are based on representation learning or vision-and-language pre-training mechanisms for exploring the complementary effect among multiple modalities. However, these methods suffer from two limitations. On the one hand,… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: Accepted at KDD'25

  13. arXiv:2410.19743  [pdf, other

    cs.SE cs.AI

    AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction

    Authors: Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong

    Abstract: Large Language Models (LLMs) can interact with the real world by connecting with versatile external APIs, resulting in better problem-solving and task automation capabilities. Previous research primarily focuses on APIs with limited arguments from a single source or overlooks the complex dependency relationship between different APIs. However, it is essential to utilize multiple APIs collaborative… ▽ More

    Submitted 10 October, 2024; originally announced October 2024.

  14. arXiv:2410.16708  [pdf, other

    cs.CL

    Atomic Fact Decomposition Helps Attributed Question Answering

    Authors: Zhichao Yan, Jiapu Wang, Jiaoyan Chen, Xiaoli Li, Ru Li, Jeff Z. Pan

    Abstract: Attributed Question Answering (AQA) aims to provide both a trustworthy answer and a reliable attribution report for a given question. Retrieval is a widely adopted approach, including two general paradigms: Retrieval-Then-Read (RTR) and post-hoc retrieval. Recently, Large Language Models (LLMs) have shown remarkable proficiency, prompting growing interest in AQA among researchers. However, RTR-bas… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  15. arXiv:2410.14668  [pdf, other

    cs.CL

    MiCEval: Unveiling Multimodal Chain of Thought's Quality via Image Description and Reasoning Steps

    Authors: Xiongtao Zhou, Jie He, Lanyu Chen, Jingyu Li, Haojing Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan, Hanjie Chen

    Abstract: Multimodal Chain of Thought (MCoT) is a popular prompting strategy for improving the performance of multimodal large language models (MLLMs) across a range of complex reasoning tasks. Despite its popularity, there is a notable absence of automated methods for evaluating the quality of reasoning steps in MCoT. To address this gap, we propose Multimodal Chain-of-Thought Evaluation (MiCEval), a frame… ▽ More

    Submitted 28 February, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: NAACL 2025

  16. arXiv:2410.06121  [pdf, other

    cs.CL

    Less is More: Making Smaller Language Models Competent Subgraph Retrievers for Multi-hop KGQA

    Authors: Wenyu Huang, Guancheng Zhou, Hongru Wang, Pavlos Vougiouklis, Mirella Lapata, Jeff Z. Pan

    Abstract: Retrieval-Augmented Generation (RAG) is widely used to inject external non-parametric knowledge into large language models (LLMs). Recent works suggest that Knowledge Graphs (KGs) contain valuable external knowledge for LLMs. Retrieving information from KGs differs from extracting it from document sets. Most existing approaches seek to directly retrieve relevant subgraphs, thereby eliminating the… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: Accepted by EMNLP 2024 Findings

  17. arXiv:2409.19753  [pdf, other

    cs.CL

    CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering

    Authors: Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan

    Abstract: Recent studies have explored the use of Large Language Models (LLMs) with Retrieval Augmented Generation (RAG) for Knowledge Graph Question Answering (KGQA). They typically require rewriting retrieved subgraphs into natural language formats comprehensible to LLMs. However, when tackling complex questions, the knowledge rewritten by existing methods may include irrelevant information, omit crucial… ▽ More

    Submitted 19 March, 2025; v1 submitted 29 September, 2024; originally announced September 2024.

  18. arXiv:2407.13578  [pdf, other

    cs.CL cs.AI

    How Reliable are LLMs as Knowledge Bases? Re-thinking Facutality and Consistency

    Authors: Danna Zheng, Mirella Lapata, Jeff Z. Pan

    Abstract: Large Language Models (LLMs) are increasingly explored as knowledge bases (KBs), yet current evaluation methods focus too narrowly on knowledge retention, overlooking other crucial criteria for reliable performance. In this work, we rethink the requirements for evaluating reliable LLM-as-KB usage and highlight two essential factors: factuality, ensuring accurate responses to seen and unseen knowle… ▽ More

    Submitted 16 December, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  19. arXiv:2407.03227  [pdf, other

    cs.CL cs.AI cs.DB

    Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning

    Authors: Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan

    Abstract: We focus on Text-to-SQL semantic parsing from the perspective of retrieval-augmented generation. Motivated by challenges related to the size of commercial database schemata and the deployability of business intelligence solutions, we propose $\text{ASTReS}$ that dynamically retrieves input database information and uses abstract syntax trees to select few-shot examples for in-context learning. Fu… ▽ More

    Submitted 4 November, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: EMNLP 2024 Main

  20. Start from Zero: Triple Set Prediction for Automatic Knowledge Graph Completion

    Authors: Wen Zhang, Yajing Xu, Peng Ye, Zhiwei Huang, Zezhong Xu, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graph (KG) completion aims to find out missing triples in a KG. Some tasks, such as link prediction and instance completion, have been proposed for KG completion. They are triple-level tasks with some elements in a missing triple given to predict the missing element of the triple. However, knowing some elements of the missing triple in advance is not always a realistic setting. In this p… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Paper accepted by TKDE in 2024

  21. arXiv:2406.14282  [pdf, other

    cs.CL cs.AI

    Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

    Authors: Junjie Wang, Mingyang Chen, Binbin Hu, Dan Yang, Ziqi Liu, Yue Shen, Peng Wei, Zhiqiang Zhang, Jinjie Gu, Jun Zhou, Jeff Z. Pan, Wen Zhang, Huajun Chen

    Abstract: Improving the performance of large language models (LLMs) in complex question-answering (QA) scenarios has always been a research focal point. Recent studies have attempted to enhance LLMs' performance by combining step-wise planning with external retrieval. While effective for advanced models like GPT-3.5, smaller LLMs face challenges in decomposing complex questions, necessitating supervised fin… ▽ More

    Submitted 23 October, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: EMNLP2024 Findings

  22. arXiv:2406.05130  [pdf, other

    cs.CL

    An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models

    Authors: Xiongtao Zhou, Jie He, Yuhua Ke, Guangyao Zhu, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Multimodal large language models (MLLMs) fine-tuned with multimodal instruction datasets have demonstrated remarkable capabilities in multimodal tasks. However, fine-tuning all parameters of MLLMs has become challenging as they usually contain billions of parameters. To address this issue, we study parameter-efficient fine-tuning (PEFT) methods for MLLMs. We aim to identify effective methods for e… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

    Comments: ACL finding 2024

  23. arXiv:2405.15984  [pdf, other

    cs.CL cs.AI

    Evaluating and Safeguarding the Adversarial Robustness of Retrieval-Based In-Context Learning

    Authors: Simon Yu, Jie He, Pasquale Minervini, Jeff Z. Pan

    Abstract: With the emergence of large language models, such as LLaMA and OpenAI GPT-3, In-Context Learning (ICL) gained significant attention due to its effectiveness and efficiency. However, ICL is very sensitive to the choice, order, and verbaliser used to encode the demonstrations in the prompt. Retrieval-Augmented ICL methods try to address this problem by leveraging retrievers to extract semantically r… ▽ More

    Submitted 8 October, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: COLM 2024, 31 pages, 6 figures

  24. arXiv:2405.13602  [pdf, other

    cs.AI cs.CL cs.LG

    COTET: Cross-view Optimal Transport for Knowledge Graph Entity Typing

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Knowledge graph entity typing (KGET) aims to infer missing entity type instances in knowledge graphs. Previous research has predominantly centered around leveraging contextual information associated with entities, which provides valuable clues for inference. However, they have long ignored the dual nature of information inherent in entities, encompassing both high-level coarse-grained cluster know… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  25. arXiv:2405.06524  [pdf, other

    cs.CL

    Prompting Large Language Models with Knowledge Graphs for Question Answering Involving Long-tail Facts

    Authors: Wenyu Huang, Guancheng Zhou, Mirella Lapata, Pavlos Vougiouklis, Sebastien Montella, Jeff Z. Pan

    Abstract: Although Large Language Models (LLMs) are effective in performing various NLP tasks, they still struggle to handle tasks that require extensive, real-world knowledge, especially when dealing with long-tail facts (facts related to long-tail entities). This limitation highlights the need to supplement LLMs with non-parametric knowledge. To address this issue, we analysed the effects of different typ… ▽ More

    Submitted 10 May, 2024; originally announced May 2024.

  26. arXiv:2404.17590  [pdf, other

    cs.IR cs.AI

    Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Multi-modal entity alignment (MMEA) aims to identify equivalent entity pairs across different multi-modal knowledge graphs (MMKGs). Existing approaches focus on how to better encode and aggregate information from different modalities. However, it is not trivial to leverage multi-modal knowledge in entity alignment due to the modal heterogeneity. In this paper, we propose a Multi-Grained Interactio… ▽ More

    Submitted 19 April, 2024; originally announced April 2024.

  27. arXiv:2404.09848  [pdf, other

    cs.AI cs.LG

    HyperMono: A Monotonicity-aware Approach to Hyper-Relational Knowledge Representation

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: In a hyper-relational knowledge graph (HKG), each fact is composed of a main triple associated with attribute-value qualifiers, which express additional factual knowledge. The hyper-relational knowledge graph completion (HKGC) task aims at inferring plausible missing links in a HKG. Most existing approaches to HKGC focus on enhancing the communication between qualifier pairs and main triples, whil… ▽ More

    Submitted 13 August, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  28. arXiv:2404.01253  [pdf, other

    cs.CL

    UniArk: Improving Generalisation and Consistency for Factual Knowledge Extraction through Debiasing

    Authors: Yijun Yang, Jie He, Pinzhen Chen, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Several recent papers have investigated the potential of language models as knowledge bases as well as the existence of severe biases when extracting factual knowledge. In this work, we focus on the factual probing performance over unseen prompts from tuning, and using a probabilistic view we show the inherent misalignment between pre-training and downstream tuning objectives in language models fo… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  29. arXiv:2403.06832  [pdf, other

    cs.CL cs.AI

    Noise-powered Multi-modal Knowledge Graph Representation Framework

    Authors: Zhuo Chen, Yin Fang, Yichi Zhang, Lingbing Guo, Jiaoyan Chen, Jeff Z. Pan, Huajun Chen, Wen Zhang

    Abstract: The rise of Multi-modal Pre-training highlights the necessity for a unified Multi-Modal Knowledge Graph (MMKG) representation learning framework. Such a framework is essential for embedding structured knowledge into multi-modal Large Language Models effectively, alleviating issues like knowledge misconceptions and multi-modal hallucinations. In this work, we explore the efficacy of models in accur… ▽ More

    Submitted 15 January, 2025; v1 submitted 11 March, 2024; originally announced March 2024.

    Comments: COLING 2025 Accepted, Repo is available at https://github.com/zjukg/SNAG

  30. arXiv:2402.14901  [pdf, other

    cs.CL cs.AI

    A Usage-centric Take on Intent Understanding in E-Commerce

    Authors: Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan

    Abstract: Identifying and understanding user intents is a pivotal task for E-Commerce. Despite its essential role in product recommendation and business user profiling analysis, intent understanding has not been consistently defined or accurately benchmarked. In this paper, we focus on predicative user intents as "how a customer uses a product", and pose intent understanding as a natural language reasoning… ▽ More

    Submitted 7 October, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: Acepted by EMNLP 2024 main

  31. arXiv:2402.12554  [pdf, other

    cs.CL

    Archer: A Human-Labeled Text-to-SQL Dataset with Arithmetic, Commonsense and Hypothetical Reasoning

    Authors: Danna Zheng, Mirella Lapata, Jeff Z. Pan

    Abstract: We present Archer, a challenging bilingual text-to-SQL dataset specific to complex reasoning, including arithmetic, commonsense and hypothetical reasoning. It contains 1,042 English questions and 1,042 Chinese questions, along with 521 unique SQL queries, covering 20 English databases across 20 domains. Notably, this dataset demonstrates a significantly higher level of complexity compared to exist… ▽ More

    Submitted 24 February, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: EACL 2024

  32. arXiv:2402.12545  [pdf, other

    cs.CL

    TrustScore: Reference-Free Evaluation of LLM Response Trustworthiness

    Authors: Danna Zheng, Danyang Liu, Mirella Lapata, Jeff Z. Pan

    Abstract: Large Language Models (LLMs) have demonstrated impressive capabilities across various domains, prompting a surge in their practical applications. However, concerns have arisen regarding the trustworthiness of LLMs outputs, particularly in closed-book question-answering tasks, where non-experts may struggle to identify inaccuracies due to the absence of contextual or ground truth information. This… ▽ More

    Submitted 6 May, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  33. arXiv:2402.05391  [pdf, other

    cs.AI cs.CV cs.IR cs.LG

    Knowledge Graphs Meet Multi-Modal Learning: A Comprehensive Survey

    Authors: Zhuo Chen, Yichi Zhang, Yin Fang, Yuxia Geng, Lingbing Guo, Xiang Chen, Qian Li, Wen Zhang, Jiaoyan Chen, Yushan Zhu, Jiaqi Li, Xiaoze Liu, Jeff Z. Pan, Ningyu Zhang, Huajun Chen

    Abstract: Knowledge Graphs (KGs) play a pivotal role in advancing various AI applications, with the semantic web community's exploration into multi-modal dimensions unlocking new avenues for innovation. In this survey, we carefully review over 300 articles, focusing on KG-aware research in two principal aspects: KG-driven Multi-Modal (KG4MM) learning, where KGs support multi-modal tasks, and Multi-Modal Kno… ▽ More

    Submitted 26 February, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Ongoing work; 41 pages (Main Text), 55 pages (Total), 11 Tables, 13 Figures, 619 citations; Paper list is available at https://github.com/zjukg/KG-MM-Survey

  34. arXiv:2401.15820  [pdf, other

    cs.CV cs.AI

    Knowledge-Aware Neuron Interpretation for Scene Classification

    Authors: Yong Guan, Freddy Lecue, Jiaoyan Chen, Ru Li, Jeff Z. Pan

    Abstract: Although neural models have achieved remarkable performance, they still encounter doubts due to the intransparency. To this end, model prediction explanation is attracting more and more attentions. However, current methods rarely incorporate external knowledge and still suffer from three limitations: (1) Neglecting concept completeness. Merely selecting concepts may not sufficient for prediction.… ▽ More

    Submitted 28 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024

  35. arXiv:2401.14640  [pdf, other

    cs.CL

    Benchmarking Large Language Models in Complex Question Answering Attribution using Knowledge Graphs

    Authors: Nan Hu, Jiaoyan Chen, Yike Wu, Guilin Qi, Sheng Bi, Tongtong Wu, Jeff Z. Pan

    Abstract: The attribution of question answering is to provide citations for supporting generated statements, and has attracted wide research attention. The current methods for automatically evaluating the attribution, which are often based on Large Language Models (LLMs), are still inadequate, particularly in recognizing subtle differences between attributions, and complex relationships between citations an… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: 13 pages, 5 figures

  36. arXiv:2401.13256  [pdf, other

    cs.CL cs.AI

    UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems

    Authors: Hongru Wang, Wenyu Huang, Yang Deng, Rui Wang, Zezhong Wang, Yufei Wang, Fei Mi, Jeff Z. Pan, Kam-Fai Wong

    Abstract: Large Language Models (LLMs) has shown exceptional capabilities in many natual language understanding and generation tasks. However, the personalization issue still remains a much-coveted property, especially when it comes to the multiple sources involved in the dialogue system. To better plan and incorporate the use of multiple sources in generating personalized response, we firstly decompose it… ▽ More

    Submitted 26 November, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

  37. Prompting Disentangled Embeddings for Knowledge Graph Completion with Pre-trained Language Model

    Authors: Yuxia Geng, Jiaoyan Chen, Yuhang Zeng, Zhuo Chen, Wen Zhang, Jeff Z. Pan, Yuxiang Wang, Xiaoliang Xu

    Abstract: Both graph structures and textual information play a critical role in Knowledge Graph Completion (KGC). With the success of Pre-trained Language Models (PLMs) such as BERT, they have been applied for text encoding for KGC. However, the current methods mostly prefer to fine-tune PLMs, leading to huge training costs and limited scalability to larger PLMs. In contrast, we propose to utilize prompts a… ▽ More

    Submitted 3 January, 2025; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: published in Expert Systems With Applications, 2025

    Journal ref: Expert Systems With Applications 268 (2025) 126175

  38. arXiv:2310.14050  [pdf, other

    cs.CL

    Code-Switching with Word Senses for Pretraining in Neural Machine Translation

    Authors: Vivek Iyer, Edoardo Barba, Alexandra Birch, Jeff Z. Pan, Roberto Navigli

    Abstract: Lexical ambiguity is a significant and pervasive challenge in Neural Machine Translation (NMT), with many state-of-the-art (SOTA) NMT systems struggling to handle polysemous words (Campolungo et al., 2022). The same holds for the NMT pretraining paradigm of denoising synthetic "code-switched" text (Pan et al., 2021; Iyer et al., 2023), where word senses are ignored in the noising stage -- leading… ▽ More

    Submitted 21 October, 2023; originally announced October 2023.

    Comments: EMNLP (Findings) 2023 Long Paper

  39. arXiv:2310.12008  [pdf, other

    cs.CL cs.AI

    Multi-view Contrastive Learning for Entity Typing over Knowledge Graphs

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Knowledge graph entity typing (KGET) aims at inferring plausible types of entities in knowledge graphs. Existing approaches to KGET focus on how to better encode the knowledge provided by the neighbors and types of an entity into its representation. However, they ignore the semantic knowledge provided by the way in which types can be clustered together. In this paper, we propose a novel method cal… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 Main

  40. arXiv:2310.05128  [pdf, other

    cs.CL cs.AI cs.LG

    Instances and Labels: Hierarchy-aware Joint Supervised Contrastive Learning for Hierarchical Multi-Label Text Classification

    Authors: Simon Yu, Jie He, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Hierarchical multi-label text classification (HMTC) aims at utilizing a label hierarchy in multi-label classification. Recent approaches to HMTC deal with the problem of imposing an over-constrained premise on the output space by using contrastive learning on generated samples in a semi-supervised manner to bring text and label embeddings closer. However, the generation of samples tends to introdu… ▽ More

    Submitted 19 June, 2024; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: 18 pages; 10 figures. Published as a conference paper at EMNLP 2023 Findings (Long Paper). Code and data available at https://github.com/simonucl/HJCL

  41. arXiv:2308.06512  [pdf, other

    cs.AI cs.CL

    HyperFormer: Enhancing Entity and Relation Interaction for Hyper-Relational Knowledge Graph Completion

    Authors: Zhiwei Hu, Víctor Gutiérrez-Basulto, Zhiliang Xiang, Ru Li, Jeff Z. Pan

    Abstract: Hyper-relational knowledge graphs (HKGs) extend standard knowledge graphs by associating attribute-value qualifiers to triples, which effectively represent additional fine-grained information about its associated triple. Hyper-relational knowledge graph completion (HKGC) aims at inferring unknown triples while considering its qualifiers. Most existing approaches to HKGC exploit a global-level grap… ▽ More

    Submitted 12 August, 2023; originally announced August 2023.

    Comments: Accepted at CIKM'23

  42. arXiv:2308.06374  [pdf, other

    cs.AI cs.CL

    Large Language Models and Knowledge Graphs: Opportunities and Challenges

    Authors: Jeff Z. Pan, Simon Razniewski, Jan-Christoph Kalo, Sneha Singhania, Jiaoyan Chen, Stefan Dietze, Hajira Jabeen, Janna Omeliyanenko, Wen Zhang, Matteo Lissandrini, Russa Biswas, Gerard de Melo, Angela Bonifati, Edlira Vakaj, Mauro Dragoni, Damien Graux

    Abstract: Large Language Models (LLMs) have taken Knowledge Representation -- and the world -- by storm. This inflection point marks a shift from explicit knowledge representation to a renewed focus on the hybrid representation of both explicit knowledge and parametric knowledge. In this position paper, we will discuss some of the common debate points within the community on LLMs (parametric knowledge) and… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 30 pages

  43. arXiv:2307.16210  [pdf, other

    cs.AI cs.CV cs.LG cs.MM

    Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

    Authors: Zhuo Chen, Lingbing Guo, Yin Fang, Yichi Zhang, Jiaoyan Chen, Jeff Z. Pan, Yangning Li, Huajun Chen, Wen Zhang

    Abstract: As a crucial extension of entity alignment (EA), multi-modal entity alignment (MMEA) aims to identify identical entities across disparate knowledge graphs (KGs) by exploiting associated visual information. However, existing MMEA approaches primarily concentrate on the fusion paradigm of multi-modal entity features, while neglecting the challenges presented by the pervasive phenomenon of missing an… ▽ More

    Submitted 1 August, 2023; v1 submitted 30 July, 2023; originally announced July 2023.

    Comments: International Semantic Web Conference '23 (ISWC 2023), https://github.com/zjukg/UMAEA

  44. arXiv:2305.15932  [pdf, other

    cs.CL

    BUCA: A Binary Classification Approach to Unsupervised Commonsense Question Answering

    Authors: Jie He, Simon Chi Lok U, Víctor Gutiérrez-Basulto, Jeff Z. Pan

    Abstract: Unsupervised commonsense reasoning (UCR) is becoming increasingly popular as the construction of commonsense reasoning datasets is expensive, and they are inevitably limited in their scope. A popular approach to UCR is to fine-tune language models with external knowledge (e.g., knowledge graphs), but this usually requires a large number of training examples. In this paper, we propose to transform… ▽ More

    Submitted 11 April, 2025; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  45. arXiv:2305.11527  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    InstructIE: A Bilingual Instruction-based Information Extraction Dataset

    Authors: Honghao Gui, Shuofei Qiao, Jintian Zhang, Hongbin Ye, Mengshu Sun, Lei Liang, Jeff Z. Pan, Huajun Chen, Ningyu Zhang

    Abstract: Large language models can perform well on general natural language tasks, but their effectiveness is still suboptimal for information extraction (IE). Recent works indicate that the main reason lies in the lack of extensive data on IE instructions. Note that the existing datasets on IE instructions not only have limited coverage but also involve high construction costs. To address this issue, we i… ▽ More

    Submitted 28 July, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: ISWC 2024; project homepage: https://www.zjukg.org/project/InstructIE/ dataset: https://huggingface.co/datasets/zjunlp/InstructIE

  46. arXiv:2303.10368  [pdf, other

    cs.CL

    An Empirical Study of Pre-trained Language Models in Simple Knowledge Graph Question Answering

    Authors: Nan Hu, Yike Wu, Guilin Qi, Dehai Min, Jiaoyan Chen, Jeff Z. Pan, Zafar Ali

    Abstract: Large-scale pre-trained language models (PLMs) such as BERT have recently achieved great success and become a milestone in natural language processing (NLP). It is now the consensus of the NLP community to adopt PLMs as the backbone for downstream tasks. In recent works on knowledge graph question answering (KGQA), BERT or its variants have become necessary in their KGQA models. However, there is… ▽ More

    Submitted 18 March, 2023; originally announced March 2023.

    Comments: Accepted by World Wide Web Journal

  47. arXiv:2303.09189  [pdf, other

    cs.SI cs.CY

    Wiki-based Communities of Interest: Demographics and Outliers

    Authors: Hiba Arnaout, Simon Razniewski, Jeff Z. Pan

    Abstract: In this paper, we release data about demographic information and outliers of communities of interest. Identified from Wiki-based sources, mainly Wikidata, the data covers 7.5k communities, such as members of the White House Coronavirus Task Force, and 345k subjects, e.g., Deborah Birx. We describe the statistical inference methodology adopted to mine such data. We release subject-centric and group… ▽ More

    Submitted 17 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Accepted to ICWSM 2023. For demo, see https://wikiknowledge.onrender.com/demographics/ and for dataset see https://doi.org/10.5281/zenodo.7410436

  48. arXiv:2302.01859  [pdf, other

    cs.CL

    Generalizing to Unseen Elements: A Survey on Knowledge Extrapolation for Knowledge Graphs

    Authors: Mingyang Chen, Wen Zhang, Yuxia Geng, Zezhong Xu, Jeff Z. Pan, Huajun Chen

    Abstract: Knowledge graphs (KGs) have become valuable knowledge resources in various applications, and knowledge graph embedding (KGE) methods have garnered increasing attention in recent years. However, conventional KGE methods still face challenges when it comes to handling unseen entities or relations during model testing. To address this issue, much effort has been devoted to various fields of KGs. In t… ▽ More

    Submitted 16 December, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted to IJCAI 2023 Survey Track

  49. arXiv:2302.01849  [pdf, other

    cs.CL

    Entity-Agnostic Representation Learning for Parameter-Efficient Knowledge Graph Embedding

    Authors: Mingyang Chen, Wen Zhang, Zhen Yao, Yushan Zhu, Yang Gao, Jeff Z. Pan, Huajun Chen

    Abstract: We propose an entity-agnostic representation learning method for handling the problem of inefficient parameter storage costs brought by embedding knowledge graphs. Conventional knowledge graph embedding methods map elements in a knowledge graph, including entities and relations, into continuous vector spaces by assigning them one or multiple specific embeddings (i.e., vector representations). Thus… ▽ More

    Submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted to AAAI 2023 conference

  50. arXiv:2212.14454  [pdf, other

    cs.AI cs.CL

    MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

    Authors: Zhuo Chen, Jiaoyan Chen, Wen Zhang, Lingbing Guo, Yin Fang, Yufeng Huang, Yichi Zhang, Yuxia Geng, Jeff Z. Pan, Wenting Song, Huajun Chen

    Abstract: Multi-modal entity alignment (MMEA) aims to discover identical entities across different knowledge graphs (KGs) whose entities are associated with relevant images. However, current MMEA algorithms rely on KG-level modality fusion strategies for multi-modal entity representation, which ignores the variations of modality preferences of different entities, thus compromising robustness against noise i… ▽ More

    Submitted 30 July, 2023; v1 submitted 29 December, 2022; originally announced December 2022.

    Comments: ACM Multimedia 2023 Accpeted, Repo: https://github.com/zjukg/MEAformer

    Journal ref: ACM MM 2023