A Survey on Multi-modal Summarization

Jangra, Anubhav; Mukherjee, Sourajit; Jatowt, Adam; Saha, Sriparna; Hasanuzzaman, Mohammad

Computer Science > Computation and Language

arXiv:2109.05199 (cs)

[Submitted on 11 Sep 2021 (v1), last revised 13 Feb 2023 (this version, v2)]

Title:A Survey on Multi-modal Summarization

Authors:Anubhav Jangra, Sourajit Mukherjee, Adam Jatowt, Sriparna Saha, Mohammad Hasanuzzaman

View PDF

Abstract:The new era of technology has brought us to the point where it is convenient for people to share their opinions over an abundance of platforms. These platforms have a provision for the users to express themselves in multiple forms of representations, including text, images, videos, and audio. This, however, makes it difficult for users to obtain all the key information about a topic, making the task of automatic multi-modal summarization (MMS) essential. In this paper, we present a comprehensive survey of the existing research in the area of MMS, covering various modalities like text, image, audio, and video. Apart from highlighting the different evaluation metrics and datasets used for the MMS task, our work also discusses the current challenges and future directions in this field.

Comments:	Accepted in ACM CSUR 2023
Subjects:	Computation and Language (cs.CL); Multimedia (cs.MM); Neural and Evolutionary Computing (cs.NE)
Cite as:	arXiv:2109.05199 [cs.CL]
	(or arXiv:2109.05199v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.05199

Submission history

From: Anubhav Jangra [view email]
[v1] Sat, 11 Sep 2021 06:39:54 UTC (579 KB)
[v2] Mon, 13 Feb 2023 17:36:49 UTC (1,305 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2021-09

Change to browse by:

cs
cs.MM
cs.NE

References & Citations

DBLP - CS Bibliography

listing | bibtex

Adam Jatowt
Sriparna Saha

export BibTeX citation

Computer Science > Computation and Language

Title:A Survey on Multi-modal Summarization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:A Survey on Multi-modal Summarization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators