Skip to main content

Showing 1–3 of 3 results for author: Romim, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.00372  [pdf

    cs.CL

    BD-SHS: A Benchmark Dataset for Learning to Detect Online Bangla Hate Speech in Different Social Contexts

    Authors: Nauros Romim, Mosahed Ahmed, Md. Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin

    Abstract: Social media platforms and online streaming services have spawned a new breed of Hate Speech (HS). Due to the massive amount of user-generated content on these sites, modern machine learning techniques are found to be feasible and cost-effective to tackle this problem. However, linguistically diverse datasets covering different social contexts in which offensive language is typically used are requ… ▽ More

    Submitted 1 June, 2022; originally announced June 2022.

  2. arXiv:2112.01902  [pdf, other

    cs.CL

    HS-BAN: A Benchmark Dataset of Social Media Comments for Hate Speech Detection in Bangla

    Authors: Nauros Romim, Mosahed Ahmed, Md Saiful Islam, Arnab Sen Sharma, Hriteshwar Talukder, Mohammad Ruhul Amin

    Abstract: In this paper, we present HS-BAN, a binary class hate speech (HS) dataset in Bangla language consisting of more than 50,000 labeled comments, including 40.17% hate and rest are non hate speech. While preparing the dataset a strict and detailed annotation guideline was followed to reduce human annotation bias. The HS dataset was also preprocessed linguistically to extract different types of slang c… ▽ More

    Submitted 3 December, 2021; originally announced December 2021.

    Comments: Submitted to ICON 21 (Rejected)

  3. arXiv:2012.09686  [pdf

    cs.CL

    Hate Speech detection in the Bengali language: A dataset and its baseline evaluation

    Authors: Nauros Romim, Mosahed Ahmed, Hriteshwar Talukder, Md Saiful Islam

    Abstract: Social media sites such as YouTube and Facebook have become an integral part of everyone's life and in the last few years, hate speech in the social media comment section has increased rapidly. Detection of hate speech on social media websites faces a variety of challenges including small imbalanced data sets, the findings of an appropriate model and also the choice of feature analysis method. fur… ▽ More

    Submitted 17 December, 2020; originally announced December 2020.

    Comments: 13 pages, 02 figures. To appear on International Joint Conference on Advances in Computational Intelligence, 20-21 November 2020