Towards Transparent AI: A Survey on Explainable Large Language Models

Palikhe, Avash; Yu, Zhenyu; Wang, Zichong; Zhang, Wenbin

Computer Science > Computation and Language

arXiv:2506.21812 (cs)

[Submitted on 26 Jun 2025]

Title:Towards Transparent AI: A Survey on Explainable Large Language Models

Authors:Avash Palikhe, Zhenyu Yu, Zichong Wang, Wenbin Zhang

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have played a pivotal role in advancing Artificial Intelligence (AI). However, despite their achievements, LLMs often struggle to explain their decision-making processes, making them a 'black box' and presenting a substantial challenge to explainability. This lack of transparency poses a significant obstacle to the adoption of LLMs in high-stakes domain applications, where interpretability is particularly essential. To overcome these limitations, researchers have developed various explainable artificial intelligence (XAI) methods that provide human-interpretable explanations for LLMs. However, a systematic understanding of these methods remains limited. To address this gap, this survey provides a comprehensive review of explainability techniques by categorizing XAI methods based on the underlying transformer architectures of LLMs: encoder-only, decoder-only, and encoder-decoder models. Then these techniques are examined in terms of their evaluation for assessing explainability, and the survey further explores how these explanations are leveraged in practical applications. Finally, it discusses available resources, ongoing research challenges, and future directions, aiming to guide continued efforts toward developing transparent and responsible LLMs.

Subjects:	Computation and Language (cs.CL); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2506.21812 [cs.CL]
	(or arXiv:2506.21812v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2506.21812

Submission history

From: Zhenyu Yu [view email]
[v1] Thu, 26 Jun 2025 23:25:22 UTC (305 KB)

Computer Science > Computation and Language

Title:Towards Transparent AI: A Survey on Explainable Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Towards Transparent AI: A Survey on Explainable Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators