Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications
Authors: Jiawei He, Boya Zhang, Hossein Rouhizadeh, Yingjian Chen, Rui Yang, Jin Lu, Xudong Chen, Nan Liu, Irene Li, Douglas Teodoro
Abstract:
Recent advances in large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. However, their application in the biomedical domain presents unique challenges, particularly regarding factual accuracy and up-to-date knowledge integration. Retrieval-Augmented Generation (RAG) has emerged as a promising solution to address these challenges by combining the generative capabilities of LLMs with external knowledge retrieval. This comprehensive survey examines the application of RAG in the biomedical domain, focusing on its technological components, available datasets, and clinical applications. We present a systematic analysis of retrieval methods, ranking strategies, and generation models, and explore the challenges and future directions in this rapidly evolving field. Our work provides researchers and practitioners with a thorough understanding of the current state of biomedical RAG systems and identifies key areas for future research and development.
Submitted 11 May, 2025; v1 submitted 2 May, 2025; originally announced May 2025.