Showing 1–3 of 3 results for author: Pranesh, R R

  1. arXiv:2108.12521  [pdf, other]

    cs.CL

    TweetBLM: A Hate Speech Dataset and Analysis of Black Lives Matter-related Microblogs on Twitter

    Authors: Sumit Kumar, Raj Ratn Pranesh

    Abstract: In the past few years, there has been a significant rise in toxic and hateful content on various social media platforms. Recently, the Black Lives Matter movement came into the picture, causing an avalanche of user-generated responses on the internet. In this paper, we propose TweetBLM, a hate speech dataset of Black Lives Matter-related tweets. Our dataset comprises 9165 manually annotated tweets tha…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: https://zenodo.org/record/4000539#.YSlrN9NKhQI (Link to data)

  2. arXiv:2105.03313  [pdf, other]

    cs.CL cs.DB

    Looking for COVID-19 misinformation in multilingual social media texts

    Authors: Raj Ratn Pranesh, Mehrdad Farokhnejad, Ambesh Shekhar, Genoveva Vargas-Solar

    Abstract: This paper presents the Multilingual COVID-19 Analysis Method (CMTA) for detecting and observing the spread of misinformation about this disease within texts. CMTA proposes a data science (DS) pipeline that applies machine learning models for processing, classifying (Dense-CNN) and analyzing (MBERT) multilingual (micro)-texts. DS pipeline data preparation tasks extract features from multilingual t…

    Submitted 3 May, 2021; originally announced May 2021.

  3. arXiv:2010.08591  [pdf]

    cs.IR cs.AI

    A Conglomerate of Multiple OCR Table Detection and Extraction

    Authors: Smita Pallavi, Raj Ratn Pranesh, Sumit Kumar

    Abstract: Representing information as tables is a compact and concise method that eases searching, indexing, and storage requirements. Extracting and cloning tables from parsable documents is easier and widely used; however, industry still faces challenges in detecting and extracting tables from OCR documents or images. This paper proposes an algorithm that detects and extracts multiple tables from OCR docum…

    Submitted 16 October, 2020; originally announced October 2020.

    Comments: For ICDAR proceedings, see https://panel.waset.org/abstracts/127575