Predicting the Type and Target of Offensive Posts in Social Media

Zampieri, Marcos; Malmasi, Shervin; Nakov, Preslav; Rosenthal, Sara; Farra, Noura; Kumar, Ritesh

arXiv:1902.09666v1 (cs)

[Submitted on 25 Feb 2019 (this version), latest version 16 Apr 2019 (v2)]

Abstract: As offensive content has become pervasive in social media, there has been much research on identifying potentially offensive messages. Previous work in this area, however, did not consider the problem as a whole, but rather focused on detecting very specific types of offensive content, e.g., hate speech, cyberbullying, or cyber-aggression. In contrast, here we target several different kinds of offensive content. In particular, we propose to model the task hierarchically, identifying the type and the target of offensive messages in social media. We use the Offensive Language Identification Dataset (OLID), a new dataset with a fine-grained three-layer annotation scheme compiled specifically for this purpose. OLID, which we make publicly available, contains tweets annotated for offensive content. We discuss the main similarities and differences of this dataset compared to other datasets for hate speech identification, aggression detection, and similar tasks. We also evaluate the data with a number of classification methods for this task.

Comments:	NAACL Submission
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1902.09666 [cs.CL]
	(or arXiv:1902.09666v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1902.09666

Submission history

From: Marcos Zampieri
[v1] Mon, 25 Feb 2019 23:54:40 UTC (119 KB)
[v2] Tue, 16 Apr 2019 16:30:35 UTC (17 KB)
