Introducing an Abusive Language Classification Framework for Telegram to Investigate the German Hater Community

Wich, Maximilian; Gorniak, Adrian; Eder, Tobias; Bartmann, Daniel; Çakici, Burak Enes; Groh, Georg

Computer Science > Computation and Language

arXiv:2109.07346 (cs)

[Submitted on 15 Sep 2021 (v1), last revised 24 Nov 2021 (this version, v2)]

Title:Introducing an Abusive Language Classification Framework for Telegram to Investigate the German Hater Community

Authors:Maximilian Wich, Adrian Gorniak, Tobias Eder, Daniel Bartmann, Burak Enes Çakici, Georg Groh

View PDF

Abstract:Since traditional social media platforms continue to ban actors spreading hate speech or other forms of abusive languages (a process known as deplatforming), these actors migrate to alternative platforms that do not moderate users content. One popular platform relevant for the German hater community is Telegram for which limited research efforts have been made so far. This study aims to develop a broad framework comprising (i) an abusive language classification model for German Telegram messages and (ii) a classification model for the hatefulness of Telegram channels. For the first part, we use existing abusive language datasets containing posts from other platforms to develop our classification models. For the channel classification model, we develop a method that combines channel-specific content information collected from a topic model with a social graph to predict the hatefulness of channels. Furthermore, we complement these two approaches for hate speech detection with insightful results on the evolution of the hater community on Telegram in Germany. We also propose methods for conducting scalable network analyses for social media platforms to the hate speech research community. As an additional output of this study, we provide an annotated abusive language dataset containing 1,149 annotated Telegram messages.

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2109.07346 [cs.CL]
	(or arXiv:2109.07346v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2109.07346

Submission history

From: Tobias Eder [view email]
[v1] Wed, 15 Sep 2021 14:58:46 UTC (387 KB)
[v2] Wed, 24 Nov 2021 09:39:43 UTC (483 KB)

Computer Science > Computation and Language

Title:Introducing an Abusive Language Classification Framework for Telegram to Investigate the German Hater Community

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Introducing an Abusive Language Classification Framework for Telegram to Investigate the German Hater Community

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators