Fairness-Preserving Text Summarzation

Dash, Abhisek; Shandilya, Anurag; Biswas, Arindam; Chakraborty, Abhijnan; Ghosh, Kripabandhu; Ghosh, Saptarshi

Computer Science > Information Retrieval

arXiv:1810.09147v3 (cs)

[Submitted on 22 Oct 2018 (v1), revised 9 Nov 2018 (this version, v3), latest version 2 Sep 2019 (v5)]

Title:Fairness-Preserving Text Summarzation

Authors:Abhisek Dash, Anurag Shandilya, Arindam Biswas, Abhijnan Chakraborty, Kripabandhu Ghosh, Saptarshi Ghosh

View PDF

Abstract:Given the rapid growth in online information content, text summarization algorithms are progressively used to provide users a succinct idea about the total information content. Historically, summarization algorithms are evaluated only based on how close their outputs are to human-written gold standard summaries. In this work, we propose to evaluate summarization algorithms from an unprecedented viewpoint. Considering that an extractive summarization algorithm selects a subset of the textual units (tweets / sentences) in the input data for inclusion in the summary, we examine the fairness of this selection. Importantly, if the data to be summarized is generated by different socially salient groups, or different political groups, or different news media sources, then we check whether the generated summaries fairly represent these different groups or sources. In real-world datasets we observe that existing summarization algorithms often represent the groups very differently compared to their distributions in the input data. To alleviate such adverse impacts, we propose a novel fair summarization algorithm 'FairSumm' capable of generating high-quality fair summaries.

Subjects:	Information Retrieval (cs.IR)
Cite as:	arXiv:1810.09147 [cs.IR]
	(or arXiv:1810.09147v3 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.1810.09147

Submission history

From: Abhisek Dash [view email]
[v1] Mon, 22 Oct 2018 09:22:28 UTC (158 KB)
[v2] Tue, 6 Nov 2018 08:54:43 UTC (158 KB)
[v3] Fri, 9 Nov 2018 10:21:02 UTC (158 KB)
[v4] Mon, 6 May 2019 19:56:04 UTC (521 KB)
[v5] Mon, 2 Sep 2019 11:52:28 UTC (247 KB)

Computer Science > Information Retrieval

Title:Fairness-Preserving Text Summarzation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Fairness-Preserving Text Summarzation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators