Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

Zhong, Zhusi; Li, Jie; Sollee, John; Collins, Scott; Bai, Harrison; Zhang, Paul; Healey, Terrence; Atalay, Michael; Gao, Xinbo; Jiao, Zhicheng

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2405.14113 (eess)

COVID-19 e-print

Important: e-prints posted on arXiv are not peer-reviewed by arXiv; they should not be relied upon without context to guide clinical practice or health-related behavior and should not be reported in news media as established information without consulting multiple experts in the field.

[Submitted on 23 May 2024]

Title:Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

Authors:Zhusi Zhong, Jie Li, John Sollee, Scott Collins, Harrison Bai, Paul Zhang, Terrence Healey, Michael Atalay, Xinbo Gao, Zhicheng Jiao

View PDF HTML (experimental)

Abstract:In response to the worldwide COVID-19 pandemic, advanced automated technologies have emerged as valuable tools to aid healthcare professionals in managing an increased workload by improving radiology report generation and prognostic analysis. This study proposes Multi-modality Regional Alignment Network (MRANet), an explainable model for radiology report generation and survival prediction that focuses on high-risk regions. By learning spatial correlation in the detector, MRANet visually grounds region-specific descriptions, providing robust anatomical regions with a completion strategy. The visual features of each region are embedded using a novel survival attention mechanism, offering spatially and risk-aware features for sentence encoding while maintaining global coherence across tasks. A cross LLMs alignment is employed to enhance the image-to-text transfer process, resulting in sentences rich with clinical detail and improved explainability for radiologist. Multi-center experiments validate both MRANet's overall performance and each module's composition within the model, encouraging further advancements in radiology report generation research emphasizing clinical interpretation and trustworthiness in AI models applied to medical studies. The code is available at this https URL.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2405.14113 [eess.IV]
	(or arXiv:2405.14113v1 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2405.14113

Submission history

From: Zhusi Zhong [view email]
[v1] Thu, 23 May 2024 02:41:08 UTC (5,198 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Multi-modality Regional Alignment Network for Covid X-Ray Survival Prediction and Report Generation

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators