Showing 1–9 of 9 results for author: Thavareesan, S

  1. arXiv:2205.06118  [pdf, other]

    cs.CL

    Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

    Authors: Manikandan Ravikiran, Bharathi Raja Chakravarthi, Anand Kumar Madasamy, Sangeetha Sivanesan, Ratnavel Rajalakshmi, Sajeetha Thavareesan, Rahul Ponnusamy, Shankar Mahadevan

    Abstract: Offensive content moderation is vital in social media platforms to support healthy online discussions. However, their prevalence in codemixed Dravidian languages is limited to classifying whole comments without identifying part of it contributing to offensiveness. Such limitation is primarily due to the lack of annotated data for offensive spans. Accordingly, in this shared task, we provide Tamil-…

    Submitted 12 May, 2022; originally announced May 2022.

    Comments: System Description of Shared Task https://competitions.codalab.org/competitions/36395

  2. arXiv:2111.09811  [pdf, other]

    cs.CL

    Findings of the Sentiment Analysis of Dravidian Languages in Code-Mixed Text

    Authors: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Sajeetha Thavareesan, Dhivya Chinnappa, Durairaj Thenmozhi, Elizabeth Sherly, John P. McCrae, Adeep Hande, Rahul Ponnusamy, Shubhanker Banerjee, Charangan Vasantharajan

    Abstract: We present the results of the Dravidian-CodeMix shared task held at FIRE 2021, a track on sentiment analysis for Dravidian Languages in Code-Mixed Text. We describe the task, its organization, and the submitted systems. This shared task is the continuation of last year's Dravidian-CodeMix shared task held at FIRE 2020. This year's tasks included code-mixing at the intra-token and inter-token level…

    Submitted 18 November, 2021; originally announced November 2021.

  3. arXiv:2111.03375  [pdf]

    cs.CL

    Developing Successful Shared Tasks on Offensive Language Identification for Dravidian Languages

    Authors: Bharathi Raja Chakravarthi, Dhivya Chinnappa, Ruba Priyadharshini, Anand Kumar Madasamy, Sangeetha Sivanesan, Subalalitha Chinnaudayar Navaneethakrishnan, Sajeetha Thavareesan, Dhanalakshmi Vadivel, Rahul Ponnusamy, Prasanna Kumar Kumaresan

    Abstract: With the fast growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Since offensive language identification in local languages is essential to moderate the social media content, in this paper we work with three Dravidian languages, namely Malayalam, Tamil, and Kannada, that are under-resourced. We present an evaluation task at…

    Submitted 5 November, 2021; originally announced November 2021.

    Comments: 23

  4. arXiv:2108.12177  [pdf, other]

    cs.CL

    Offensive Language Identification in Low-resourced Code-mixed Dravidian languages using Pseudo-labeling

    Authors: Adeep Hande, Karthik Puranik, Konthala Yasaswini, Ruba Priyadharshini, Sajeetha Thavareesan, Anbukkarasi Sampath, Kogilavani Shanmugavadivel, Durairaj Thenmozhi, Bharathi Raja Chakravarthi

    Abstract: Social media has effectively become the prime hub of communication and digital marketing. As these platforms enable the free manifestation of thoughts and facts in text, images and video, there is an extensive need to screen them to protect individuals and groups from offensive content targeted at them. Our work intends to classify codemixed social media comments/posts in the Dravidian languages o…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: 27 pages, 12 figures, 10 tables

  5. arXiv:2108.03886  [pdf, other]

    cs.CL

    Do Images really do the Talking? Analysing the significance of Images in Tamil Troll meme classification

    Authors: Siddhanth U Hegde, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Ratnasingam Sakuntharaj, Sathiyaraj Thangasamy, B Bharathi, Bharathi Raja Chakravarthi

    Abstract: A meme is a part of media created to share an opinion or emotion across the internet. Due to their popularity, memes have become a new form of communication on social media. However, due to their nature, they are increasingly being used in harmful ways such as trolling and cyberbullying. Various data modelling methods create different possibilities in feature extraction and turning them into bene…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 12 pages

  6. arXiv:2108.03867  [pdf, other]

    cs.CL

    Benchmarking Multi-Task Learning for Sentiment Analysis and Offensive Language Identification in Under-Resourced Dravidian Languages

    Authors: Adeep Hande, Siddhanth U Hegde, Ruba Priyadharshini, Rahul Ponnusamy, Prasanna Kumar Kumaresan, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: To obtain extensive annotated data for under-resourced languages is challenging, so in this research, we have investigated whether it is beneficial to train models using multi-task learning. Sentiment analysis and offensive language identification share similar discourse properties. The selection of these tasks is motivated by the lack of large labelled data for user-generated code-mixed datasets.…

    Submitted 9 August, 2021; originally announced August 2021.

    Comments: 29 pages

  7. arXiv:2104.09081  [pdf, other]

    cs.CL

    UVCE-IIITT@DravidianLangTech-EACL2021: Tamil Troll Meme Classification: You need to Pay more Attention

    Authors: Siddhanth U Hegde, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: Tamil is a Dravidian language that is commonly used and spoken in the southern part of Asia. In the era of social media, memes have been a fun moment in the day-to-day life of people. Here, we try to analyze the true meaning of Tamil memes by categorizing them as troll and non-troll. We propose an ingenious model comprising a transformer-transformer architecture that tries to attain state-of-th…

    Submitted 19 April, 2021; originally announced April 2021.

  8. arXiv:2104.09066  [pdf, other]

    cs.CL

    IIITT@LT-EDI-EACL2021-Hope Speech Detection: There is always Hope in Transformers

    Authors: Karthik Puranik, Adeep Hande, Ruba Priyadharshini, Sajeetha Thavareesan, Bharathi Raja Chakravarthi

    Abstract: In a world filled with serious challenges like climate change, religious and political conflicts, global pandemics, terrorism, and racial discrimination, an internet full of hate speech, abusive and offensive content is the last thing we desire. In this paper, we work to identify and promote positive and supportive content on these platforms. We work with several transformer-based models to cl…

    Submitted 19 April, 2021; originally announced April 2021.

  9. arXiv:2010.07773  [pdf, other]

    cs.CL cs.AI cs.LG cs.NE

    NUIG-Shubhanker@Dravidian-CodeMix-FIRE2020: Sentiment Analysis of Code-Mixed Dravidian text using XLNet

    Authors: Shubhanker Banerjee, Arun Jayapal, Sajeetha Thavareesan

    Abstract: Social media has penetrated into multilingual societies; however, most of them use English as a preferred language for communication. So it looks natural for them to mix their cultural language with English during conversations, resulting in an abundance of multilingual data, called code-mixed data, available in today's world. Downstream NLP tasks using such data are challenging due to the semantic…

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: 7 pages