
Showing 1–10 of 10 results for author: Patro, J

  1. arXiv:2505.20937  [pdf, ps, other]

    cs.CL

    On VLMs for Diverse Tasks in Multimodal Meme Classification

    Authors: Deepesh Gavit, Debajyoti Mazumder, Samiran Das, Jasabanta Patro

    Abstract: In this paper, we present a comprehensive and systematic analysis of vision-language models (VLMs) for disparate meme classification tasks. We introduced a novel approach that generates a VLM-based understanding of meme images and fine-tunes the LLMs on textual understanding of the embedded meme text for improving the performance. Our contributions are threefold: (1) Benchmarking VLMs with diverse…

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 16 pages

  2. arXiv:2505.15050  [pdf, ps, other]

    cs.CL

    Improving the fact-checking performance of language models by relying on their entailment ability

    Authors: Gaurav Kumar, Debajyoti Mazumder, Ayush Garg, Jasabanta Patro

    Abstract: Automated fact-checking is a crucial task in this digital age. To verify a claim, current approaches mostly follow one of two strategies: (i) relying on the embedded knowledge of language models, or (ii) fine-tuning them with evidence pieces. While the former can make systems hallucinate, the latter has not been very successful to date. The primary reason behind this is that fact verificati…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 44 pages

  3. arXiv:2412.12761  [pdf, other]

    cs.CL cs.AI

    Revealing the impact of synthetic native samples and multi-tasking strategies in Hindi-English code-mixed humour and sarcasm detection

    Authors: Debajyoti Mazumder, Aakash Kumar, Jasabanta Patro

    Abstract: In this paper, we report our experiments with various strategies to improve code-mixed humour and sarcasm detection. We did all of our experiments for the Hindi-English code-mixed scenario, as we have the linguistic expertise for the same. We experimented with three approaches, namely (i) native sample mixing, (ii) multi-task learning (MTL), and (iii) prompting very large multilingual language model…

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: 26 pages; under review

  4. arXiv:2405.20755  [pdf]

    cs.CL

    Improving code-mixed hate detection by native sample mixing: A case study for Hindi-English code-mixed scenario

    Authors: Debajyoti Mazumder, Aakash Kumar, Jasabanta Patro

    Abstract: Hate detection has long been a challenging task for the NLP community. The task becomes complex in a code-mixed environment because the models must understand the context and the hate expressed through language alteration. Compared to the monolingual setup, we see much less work on code-mixed hate as large-scale annotated hate corpora are unavailable for the study. To overcome this bottleneck, we…

    Submitted 20 October, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Generated from XeLaTeX

  5. arXiv:2203.02244  [pdf, other]

    cs.CL

    IISERB Brains at SemEval 2022 Task 6: A Deep-learning Framework to Identify Intended Sarcasm in English

    Authors: Tanuj Singh Shekhawat, Manoj Kumar, Udaybhan Rathore, Aditya Joshi, Jasabanta Patro

    Abstract: This paper describes the system architectures and the models submitted by our team "IISERBBrains" to SemEval 2022 Task 6 competition. We contested for all three sub-tasks floated for the English dataset. On the leader-board, we got 19th rank out of 43 teams for sub-task A, the 8th rank out of 22 teams for sub-task B, and 13th rank out of 16 teams for sub-task C. Apart from the submitted results and models…

    Submitted 4 March, 2022; originally announced March 2022.

    Comments: 7 pages

  6. arXiv:2005.02295  [pdf, other]

    cs.CL

    Code-switching patterns can be an effective route to improve performance of downstream NLP applications: A case study of humour, sarcasm and hate speech detection

    Authors: Srijan Bansal, Vishal Garimella, Ayush Suhane, Jasabanta Patro, Animesh Mukherjee

    Abstract: In this paper we demonstrate how code-switching patterns can be utilised to improve various downstream NLP applications. In particular, we encode different switching features to improve humour, sarcasm and hate speech detection tasks. We believe that this simple linguistic observation can also be potentially helpful in improving other similar NLP applications.

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: This work is accepted as a short paper in the proceedings of ACL 2020

  7. arXiv:1811.07853  [pdf, other]

    cs.SI

    Characterizing the spread of exaggerated news content over social media

    Authors: Jasabanta Patro, Sabyasachee Baruah, Vivek Gupta, Monojit Choudhury, Pawan Goyal, Animesh Mukherjee

    Abstract: In this paper, we consider a dataset comprising press releases about health research from different universities in the UK along with a corresponding set of news articles. First, we do an exploratory analysis to understand how the basic information published in the scientific journals gets exaggerated as it is reported in these press releases or news articles. This initial analysis shows that so…

    Submitted 19 November, 2018; originally announced November 2018.

    Comments: 10 pages

  8. arXiv:1811.07169  [pdf, other]

    cs.SI

    What Propels Celebrity Follower Counts? Language Use or Social Connectivity

    Authors: Jasabanta Patro, Rameshwar Bhaskaran, Animesh Mukherjee

    Abstract: Follower count is a factor that quantifies the popularity of celebrities. It is a reflection of their power, prestige and overall social reach. In this paper we investigate whether the social connectivity or the language choice is more correlated to the future follower count of a celebrity. We collect data about tweets, retweets and mentions of 471 Indian celebrities with verified Twitter accounts…

    Submitted 19 November, 2018; v1 submitted 17 November, 2018; originally announced November 2018.

    Comments: 8 pages

  9. All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media

    Authors: Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Abhipsa Basu, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee

    Abstract: In this paper, we present a set of computational methods to identify the likeliness of a word being borrowed, based on the signals from social media. In terms of Spearman correlation coefficient values, our methods perform more than two times better (nearly 0.62) in predicting the borrowing likeliness compared to the best performing baseline (nearly 0.26) reported in literature. Based on this like…

    Submitted 29 July, 2017; v1 submitted 25 July, 2017; originally announced July 2017.

    Comments: 11 pages, accepted at the 2017 Conference on Empirical Methods in Natural Language Processing (EMNLP 2017). arXiv admin note: substantial text overlap with arXiv:1703.05122

  10. arXiv:1703.05122  [pdf, other]

    cs.CL

    Is this word borrowed? An automatic approach to quantify the likeliness of borrowing in social media

    Authors: Jasabanta Patro, Bidisha Samanta, Saurabh Singh, Prithwish Mukherjee, Monojit Choudhury, Animesh Mukherjee

    Abstract: Code-mixing or code-switching is the effortless phenomenon of natural switching between two or more languages in a single conversation. Use of a foreign word in a language, however, does not necessarily mean that the speaker is code-switching, because languages often borrow lexical items from other languages. If a word is borrowed, it becomes a part of the lexicon of a language; whereas, during cod…

    Submitted 15 March, 2017; originally announced March 2017.

    Comments: 11 pages, 3 Figures