Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective

Yang, Rui; Yang, Boming; Feng, Aosong; Ouyang, Sixun; Blum, Moritz; She, Tianwei; Jiang, Yuang; Lecue, Freddy; Lu, Jinghui; Li, Irene

arXiv:2410.17600 (cs)

[Submitted on 23 Oct 2024 (v1), last revised 3 Feb 2025 (this version, v2)]

Abstract: Knowledge Graphs (KGs) are crucial in the field of artificial intelligence and are widely used in downstream tasks such as question answering (QA). Constructing KGs typically requires significant effort from domain experts. Large Language Models (LLMs) have recently been used for Knowledge Graph Construction (KGC). However, most existing approaches take a local perspective: they extract knowledge triplets from individual sentences or documents and lack a fusion process that combines the knowledge into a global KG. This work introduces Graphusion, a zero-shot framework for KGC from free text. It contains three steps: in Step 1, we extract a list of seed entities using topic modeling so that the final KG includes the most relevant entities; in Step 2, we conduct candidate triplet extraction using LLMs; in Step 3, we design a novel fusion module that provides a global view of the extracted knowledge, incorporating entity merging, conflict resolution, and novel triplet discovery. Results show that Graphusion achieves scores of 2.92 and 2.37 out of 3 for entity extraction and relation recognition, respectively. Moreover, we showcase how Graphusion can be applied to the Natural Language Processing (NLP) domain and validate it in an educational scenario. Specifically, we introduce TutorQA, a new expert-verified benchmark for QA, comprising six tasks and a total of 1,200 QA pairs. Using the Graphusion-constructed KG, we achieve a significant improvement on the benchmark, for example, a 9.2% accuracy improvement on sub-graph completion.
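To make the fusion idea in Step 3 concrete, the following is a minimal sketch, not the authors' implementation. It assumes two simplifications that are not in the abstract: entity merging is done with a hand-written alias table (`ALIASES`, hypothetical) rather than an LLM, and conflict resolution is a majority vote over the relations proposed for the same entity pair. The relation names `is_a` and `used_for` and the sample triplets are also illustrative. Novel triplet discovery is omitted.

```python
from collections import Counter, defaultdict

# Hypothetical alias table (an assumption for this sketch); the abstract
# does not specify how entities are matched for merging.
ALIASES = {"lstm": "long short-term memory", "rnns": "rnn"}


def canonical(entity: str) -> str:
    """Lower-case, trim, and map known aliases to one canonical name."""
    name = entity.strip().lower()
    return ALIASES.get(name, name)


def fuse(candidates):
    """Merge entities, then keep one relation per (head, tail) pair.

    Conflicts (several relations proposed for the same pair) are resolved
    by majority vote, a stand-in for the paper's conflict resolution.
    """
    votes = defaultdict(Counter)
    for head, relation, tail in candidates:
        votes[(canonical(head), canonical(tail))][relation] += 1

    fused = []
    for (head, tail), counter in sorted(votes.items()):
        relation, _ = counter.most_common(1)[0]
        fused.append((head, relation, tail))
    return fused


# Candidate triplets as they might come out of per-document extraction.
candidates = [
    ("LSTM", "is_a", "RNN"),
    ("long short-term memory", "is_a", "rnn"),
    ("lstm", "used_for", "RNNs"),
    ("RNN", "used_for", "language modeling"),
]
fused_graph = fuse(candidates)
for triplet in fused_graph:
    print(triplet)
```

In this toy input, three surface forms of the same entity pair collapse into one edge, and the minority relation `used_for` between them is dropped. This is the kind of global consolidation that per-sentence extraction alone does not provide.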

Comments:	arXiv admin note: substantial text overlap with arXiv:2407.10794
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as:	arXiv:2410.17600 [cs.CL]
	(or arXiv:2410.17600v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2410.17600

Submission history

From: Moritz Blum
[v1] Wed, 23 Oct 2024 06:54:03 UTC (15,869 KB)
[v2] Mon, 3 Feb 2025 09:48:26 UTC (15,876 KB)
