Creating emoji lexica from unsupervised sentiment analysis of their descriptions

Fernández-Gavilanes, Milagros; Juncal-Martínez, Jonathan; García-Méndez, Silvia; Costa-Montenegro, Enrique; González-Castaño, Francisco Javier

doi:10.1016/j.eswa.2018.02.043

Computer Science > Computation and Language

arXiv:2404.01439 (cs)

[Submitted on 1 Apr 2024]

Title:Creating emoji lexica from unsupervised sentiment analysis of their descriptions

Authors:Milagros Fernández-Gavilanes, Jonathan Juncal-Martínez, Silvia García-Méndez, Enrique Costa-Montenegro, Francisco Javier González-Castaño

View PDF HTML (experimental)

Abstract:Online media, such as blogs and social networking sites, generate massive volumes of unstructured data of great interest to analyze the opinions and sentiments of individuals and organizations. Novel approaches beyond Natural Language Processing are necessary to quantify these opinions with polarity metrics. So far, the sentiment expressed by emojis has received little attention. The use of symbols, however, has boomed in the past four years. About twenty billion are typed in Twitter nowadays, and new emojis keep appearing in each new Unicode version, making them increasingly relevant to sentiment analysis tasks. This has motivated us to propose a novel approach to predict the sentiments expressed by emojis in online textual messages, such as tweets, that does not require human effort to manually annotate data and saves valuable time for other analysis tasks. For this purpose, we automatically constructed a novel emoji sentiment lexicon using an unsupervised sentiment analysis system based on the definitions given by emoji creators in Emojipedia. Additionally, we automatically created lexicon variants by also considering the sentiment distribution of the informal texts accompanying emojis. All these lexica are evaluated and compared regarding the improvement obtained by including them in sentiment analysis of the annotated datasets provided by Kralj Novak et al. (2015). The results confirm the competitiveness of our approach.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2404.01439 [cs.CL]
	(or arXiv:2404.01439v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2404.01439
Related DOI:	https://doi.org/10.1016/j.eswa.2018.02.043

Submission history

From: Silvia García-Méndez [view email]
[v1] Mon, 1 Apr 2024 19:22:58 UTC (2,061 KB)

Computer Science > Computation and Language

Title:Creating emoji lexica from unsupervised sentiment analysis of their descriptions

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Creating emoji lexica from unsupervised sentiment analysis of their descriptions

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators