Hypers at ComMA@ICON: Modelling Aggressiveness, Gender Bias and Communal Bias Identification
Authors:
Sean Benhur,
Roshan Nayak,
Kanchana Sivanraju,
Adeep Hande,
Subalalitha Chinnaudayar Navaneethakrishnan,
Ruba Priyadharshini,
Bharathi Raja Chakravarthi
Abstract:
Due to the exponentially increasing reach of social media, it is essential to focus on its negative aspects as it can potentially divide society and incite people into violence. In this paper, we present our system description of work on the shared task ComMA@ICON, where we have to classify how aggressive the sentence is and if the sentence is gender-biased or communal biased. These three could be…
▽ More
Due to the exponentially increasing reach of social media, it is essential to focus on its negative aspects as it can potentially divide society and incite people into violence. In this paper, we present our system description of work on the shared task ComMA@ICON, where we have to classify how aggressive the sentence is and if the sentence is gender-biased or communal biased. These three could be the primary reasons to cause significant problems in society. As team Hypers we have proposed an approach that utilizes different pretrained models with Attention and mean pooling methods. We were able to get Rank 3 with 0.223 Instance F1 score on Bengali, Rank 2 with 0.322 Instance F1 score on Multi-lingual set, Rank 4 with 0.129 Instance F1 score on Meitei and Rank 5 with 0.336 Instance F1 score on Hindi. The source code and the pretrained models of this work can be found here.
△ Less
Submitted 13 January, 2022; v1 submitted 31 December, 2021;
originally announced December 2021.
Developing Successful Shared Tasks on Offensive Language Identification for Dravidian Languages
Authors:
Bharathi Raja Chakravarthi,
Dhivya Chinnappa,
Ruba Priyadharshini,
Anand Kumar Madasamy,
Sangeetha Sivanesan,
Subalalitha Chinnaudayar Navaneethakrishnan,
Sajeetha Thavareesan,
Dhanalakshmi Vadivel,
Rahul Ponnusamy,
Prasanna Kumar Kumaresan
Abstract:
With the fast growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Since offensive language identification in local languages is essential to moderate the social media content, in this paper we work with three Dravidian languages, namely Malayalam, Tamil, and Kannada, that are under-resourced. We present an evaluation task at…
▽ More
With the fast growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Since offensive language identification in local languages is essential to moderate the social media content, in this paper we work with three Dravidian languages, namely Malayalam, Tamil, and Kannada, that are under-resourced. We present an evaluation task at FIRE 2020- HASOC-DravidianCodeMix and DravidianLangTech at EACL 2021, designed to provide a framework for comparing different approaches to this problem. This paper describes the data creation, defines the task, lists the participating systems, and discusses various methods.
△ Less
Submitted 5 November, 2021;
originally announced November 2021.