BESSTIE: A Benchmark for Sentiment and Sarcasm Classification for Varieties of English

Srirag, Dipankar; Joshi, Aditya; Painter, Jordan; Kanojia, Diptesh

arXiv:2412.04726 (cs)

[Submitted on 6 Dec 2024 (v1), last revised 17 Jun 2025 (this version, v3)]

Abstract: Despite large language models (LLMs) being known to exhibit bias against non-standard language varieties, there are no known labelled datasets for sentiment analysis of varieties of English. To address this gap, we introduce BESSTIE, a benchmark for sentiment and sarcasm classification for three varieties of English: Australian (en-AU), Indian (en-IN), and British (en-UK). We collect datasets for these language varieties using two methods: location-based filtering for Google Places reviews, and topic-based filtering for Reddit comments. To assess whether the dataset accurately represents these varieties, we conduct two validation steps: (a) manual annotation of language varieties and (b) automatic language variety prediction. Native speakers of the language varieties manually annotate the datasets with sentiment and sarcasm labels. We perform an additional annotation exercise to validate the reliability of the annotated labels. Subsequently, we fine-tune nine LLMs (representing a range of encoder/decoder and mono/multilingual models) on these datasets, and evaluate their performance on the two tasks. Our results show that the models consistently perform better on inner-circle varieties (i.e., en-AU and en-UK) than on en-IN, particularly for sarcasm classification. We also report challenges in cross-variety generalisation, highlighting the need for language variety-specific datasets such as ours. BESSTIE promises to be a useful evaluative benchmark for future research in equitable LLMs, specifically in terms of language varieties. The BESSTIE dataset is publicly available at: this https URL datasets/unswnlporg/BESSTIE.
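The cross-variety evaluation mentioned in the abstract can be pictured as a grid in which a model trained on one variety is scored on the test set of every variety. The following is a minimal sketch of that idea only, not the authors' code: the function names `macro_f1` and `cross_variety_grid` are hypothetical, macro-averaged F1 is an assumed choice of metric, and the predictors are assumed to be plain callables standing in for fine-tuned models.

```python
# Hypothetical sketch: names, metric choice, and data layout are assumptions,
# not taken from the BESSTIE paper or its released code.

def macro_f1(gold, pred):
    """Macro-averaged F1 over every label that appears in gold or pred."""
    labels = sorted(set(gold) | set(pred))
    scores = []
    for label in labels:
        tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
        fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
        fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the label never occurs.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores) if scores else 0.0


def cross_variety_grid(predict_fns, test_sets):
    """Score every (training variety, test variety) pair.

    predict_fns: maps a variety code to a callable texts -> predicted labels
                 (a stand-in for a model fine-tuned on that variety).
    test_sets:   maps a variety code to a (texts, gold_labels) pair.
    """
    grid = {}
    for train_variety, predict in predict_fns.items():
        for test_variety, (texts, gold) in test_sets.items():
            grid[(train_variety, test_variety)] = macro_f1(gold, predict(texts))
    return grid
```

In such a grid, the diagonal entries (same training and test variety) give in-variety scores, and the off-diagonal entries show how well a model transfers to another variety.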

Comments:	Findings of ACL: ACL 2025
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2412.04726 [cs.CL]
	(or arXiv:2412.04726v3 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.04726

Submission history

From: Dipankar Srirag
[v1] Fri, 6 Dec 2024 02:34:40 UTC (2,638 KB)
[v2] Tue, 18 Feb 2025 02:34:18 UTC (2,252 KB)
[v3] Tue, 17 Jun 2025 13:04:28 UTC (1,377 KB)
