Showing 1–13 of 13 results for author: Chavan, T

  1. arXiv:2507.05157  [pdf, ps, other]

    cs.CL cs.AI

    AI Generated Text Detection Using Instruction Fine-tuned Large Language and Transformer-Based Models

    Authors: Chinnappa Guggilla, Budhaditya Roy, Trupti Ramdas Chavan, Abdul Rahman, Edward Bowen

    Abstract: Large Language Models (LLMs) possess an extraordinary capability to produce text that is not only coherent and contextually relevant but also strikingly similar to human writing. They adapt to various styles and genres, producing content that is both grammatically correct and semantically meaningful. Recently, LLMs have been misused to create highly realistic phishing emails, spread fake news, gen…

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 7 pages, 3 figures

  2. arXiv:2312.02590  [pdf, other]

    cs.CL

    Text Intimacy Analysis using Ensembles of Multilingual Transformers

    Authors: Tanmay Chavan, Ved Patwardhan

    Abstract: Intimacy estimation of a given text has recently gained importance due to the increase in direct interaction of NLP systems with humans. Intimacy is an important aspect of natural language and has a substantial impact on our everyday communication. Thus the level of intimacy can provide us with deeper insights and richer semantics of conversations. In this paper, we present our work on the SemEval…

    Submitted 5 December, 2023; originally announced December 2023.

  3. arXiv:2312.02578  [pdf, other]

    cs.CL

    Empathy and Distress Detection using Ensembles of Transformer Models

    Authors: Tanmay Chavan, Kshitij Deshpande, Sheetal Sonawane

    Abstract: This paper presents our approach for the WASSA 2023 Empathy, Emotion and Personality Shared Task. Empathy and distress are human feelings that are implicitly expressed in natural discourses. Empathy and distress detection are crucial challenges in Natural Language Processing that can aid our understanding of conversations. The provided dataset consists of several long-text examples in the English…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted at the WASSA 2023 workshop at ACL 2023

  4. arXiv:2311.18778  [pdf, other]

    cs.CL

    Mavericks at BLP-2023 Task 1: Ensemble-based Approach Using Language Models for Violence Inciting Text Detection

    Authors: Saurabh Page, Sudeep Mangalvedhekar, Kshitij Deshpande, Tanmay Chavan, Sheetal Sonawane

    Abstract: This paper presents our work for the Violence Inciting Text Detection shared task in the First Workshop on Bangla Language Processing. Social media has accelerated the propagation of hate and violence-inciting speech in society. It is essential to develop efficient mechanisms to detect and curb the propagation of such texts. The problem of detecting violence-inciting texts is further exacerbated i…

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 6 pages, 1 figure, accepted at the BLP Workshop, EMNLP 2023

  5. arXiv:2311.17722  [pdf, other]

    cs.CL cs.LG

    SenTest: Evaluating Robustness of Sentence Encoders

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Geetanjali Kale, Raviraj Joshi

    Abstract: Contrastive learning has proven to be an effective method for pre-training models using weakly labeled data in the vision domain. Sentence transformers are the NLP counterparts to this architecture, and have been growing in popularity due to their rich and effective sentence representations. Having effective sentence representations is paramount in multiple tasks, such as information retrieval, re…

    Submitted 29 November, 2023; originally announced November 2023.

  6. arXiv:2306.14030  [pdf, other]

    cs.CL cs.LG

    My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks

    Authors: Tanmay Chavan, Omkar Gokhale, Aditya Kane, Shantanu Patankar, Raviraj Joshi

    Abstract: The research on code-mixed data is limited due to the unavailability of dedicated code-mixed datasets and pre-trained language models. In this work, we focus on the low-resource Indian language Marathi which lacks any prior work in code-mixing. We present L3Cube-MeCorpus, a large code-mixed Marathi-English (Mr-En) corpus with 10 million social media sentences for pretraining. We also release L3Cub…

    Submitted 20 July, 2023; v1 submitted 24 June, 2023; originally announced June 2023.

  7. arXiv:2212.10039  [pdf, other]

    cs.CL

    A Twitter BERT Approach for Offensive Language Detection in Marathi

    Authors: Tanmay Chavan, Shantanu Patankar, Aditya Kane, Omkar Gokhale, Raviraj Joshi

    Abstract: Automated offensive language detection is essential in combating the spread of hate speech, particularly in social media. This paper describes our work on Offensive Language Identification in low resource Indic language Marathi. The problem is formulated as a text classification task to identify a tweet as offensive or non-offensive. We evaluate different mono-lingual and multi-lingual BERT models…

    Submitted 20 December, 2022; originally announced December 2022.

  8. arXiv:2210.08209  [pdf, other]

    cs.CL

    Large Language Models for Multi-label Propaganda Detection

    Authors: Tanmay Chavan, Aditya Kane

    Abstract: The spread of propaganda through the internet has increased drastically over the past years. Lately, propaganda detection has started gaining importance because of the negative impact it has on society. In this work, we describe our approach for the WANLP 2022 shared task which handles the task of propaganda detection in a multi-label setting. The task demands the model to label the given text as…

    Submitted 20 October, 2022; v1 submitted 15 October, 2022; originally announced October 2022.

  9. arXiv:2210.04267  [pdf, other]

    cs.CL cs.AI

    Spread Love Not Hate: Undermining the Importance of Hateful Pre-training for Hate Speech Detection

    Authors: Omkar Gokhale, Aditya Kane, Shantanu Patankar, Tanmay Chavan, Raviraj Joshi

    Abstract: Pre-training large neural language models, such as BERT, has led to impressive gains on many natural language processing (NLP) tasks. Although this method has proven to be effective for many domains, it might not always provide desirable benefits. In this paper, we study the effects of hateful pre-training on low-resource hate speech classification tasks. While previous studies on the English lang…

    Submitted 11 December, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

  10. arXiv:2004.11120  [pdf, other]

    cs.LG cs.ET cs.NE eess.SP stat.ML

    Software-Level Accuracy Using Stochastic Computing With Charge-Trap-Flash Based Weight Matrix

    Authors: Varun Bhatt, Shalini Shrivastava, Tanmay Chavan, Udayan Ganguly

    Abstract: The in-memory computing paradigm with emerging memory devices has been recently shown to be a promising way to accelerate deep learning. Resistive processing unit (RPU) has been proposed to enable the vector-vector outer product in a crossbar array using a stochastic train of identical pulses to enable one-shot weight update, promising intense speed-up in matrix multiplication operations, which fo…

    Submitted 8 March, 2020; originally announced April 2020.

    Comments: 8 pages, 8 figures, submitted to the International Joint Conference on Neural Networks (IJCNN) 2020

  11. Band-to-Band Tunneling based Ultra-Energy Efficient Silicon Neuron

    Authors: Tanmay Chavan, Sangya Dutta, Nihar R. Mohapatra, Udayan Ganguly

    Abstract: The human brain comprises about a hundred billion neurons connected through quadrillion synapses. Spiking Neural Networks (SNNs) take inspiration from the brain to model complex cognitive and learning tasks. Neuromorphic engineering implements SNNs in hardware, aspiring to mimic the brain at scale (i.e., 100 billion neurons) with biological area and energy efficiency. The design of ultra-energy ef…

    Submitted 25 February, 2019; originally announced February 2019.

  12. arXiv:1902.09417  [pdf]

    cs.ET

    Ultra-low Energy charge trap flash based synapse enabled by parasitic leakage mitigation

    Authors: Shalini Shrivastava, Tanmay Chavan, Udayan Ganguly

    Abstract: Brain-inspired computation promises complex cognitive tasks at biological energy efficiencies. The brain contains $10^4$ synapses per neuron. Hence, ultra-low energy, high-density synapses are needed for spiking neural networks (SNN). In this paper, we use tunneling enabled CTF (Charge Trap Flash) stack for ultra-low-energy operation (1F); further, CTF on an SOI platform and back-to-back connected…

    Submitted 25 February, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

  13. arXiv:1602.02493  [pdf]

    cs.NI

    Analysis of Location Management Schemes for MANET using Synthetic Mobility Models

    Authors: Harsha Bhute, G. T. Chavan, Avinash Bhute

    Abstract: In the performance evaluation of a protocol for an ad hoc network, the protocol should be tested under realistic conditions including, but not limited to, a sensible transmission range, limited buffer space for the storage of messages, representative data traffic models, and realistic movements of the mobile users and several mobility models that represent mobile nodes whose movements are dependen…

    Submitted 8 February, 2016; originally announced February 2016.

    Comments: 7 pages, 2 figures