Skip to main content

Showing 51–100 of 137 results for author: Chadha, A

.
  1. arXiv:2403.04786  [pdf, other

    cs.CR cs.CL

    Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models

    Authors: Arijit Ghosh Chowdhury, Md Mofijul Islam, Vaibhav Kumar, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha

    Abstract: Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text. However, with their rising prominence, the security and vulnerability aspects of these models have garnered significant attention. This paper presents a comprehensive survey of the various forms of attacks ta… ▽ More

    Submitted 23 March, 2024; v1 submitted 2 March, 2024; originally announced March 2024.

  2. OffensiveLang: A Community Based Implicit Offensive Language Dataset

    Authors: Amit Das, Mostafa Rahgouy, Dongji Feng, Zheng Zhang, Tathagata Bhattacharya, Nilanjana Raychawdhary, Fatemeh Jamshidi, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

    Abstract: The widespread presence of hateful languages on social media has resulted in adverse effects on societal well-being. As a result, addressing this issue with high priority has become very important. Hate speech or offensive languages exist in both explicit and implicit forms, with the latter being more challenging to detect. Current research in this domain encounters several challenges. Firstly, th… ▽ More

    Submitted 14 December, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: in IEEE Access, vol. 12, pp. 185661-185672, 2024

  3. arXiv:2403.02246  [pdf, other

    cs.CL

    PHAnToM: Persona-based Prompting Has An Effect on Theory-of-Mind Reasoning in Large Language Models

    Authors: Fiona Anting Tan, Gerard Christopher Yeo, Kokil Jaidka, Fanyou Wu, Weijie Xu, Vinija Jain, Aman Chadha, Yang Liu, See-Kiong Ng

    Abstract: The use of LLMs in natural language reasoning has shown mixed results, sometimes rivaling or even surpassing human performance in simpler classification tasks while struggling with social-cognitive reasoning, a domain where humans naturally excel. These differences have been attributed to many factors, such as variations in prompting and the specific LLMs used. However, no reasons appear conclusiv… ▽ More

    Submitted 22 October, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: 8 pages

  4. arXiv:2403.01152  [pdf, other

    cs.CL cs.AI

    A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization

    Authors: Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, Huan Liu

    Abstract: We have witnessed lately a rapid proliferation of advanced Large Language Models (LLMs) capable of generating high-quality text. While these LLMs have revolutionized text generation across various domains, they also pose significant risks to the information ecosystem, such as the potential for generating convincing propaganda, misinformation, and disinformation at scale. This paper offers a review… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  5. arXiv:2402.18590  [pdf, ps, other

    cs.IR cs.AI

    Exploring the Impact of Large Language Models on Recommender Systems: An Extensive Review

    Authors: Arpita Vats, Vinija Jain, Rahul Raja, Aman Chadha

    Abstract: The paper underscores the significance of Large Language Models (LLMs) in reshaping recommender systems, attributing their value to unique reasoning abilities absent in traditional recommenders. Unlike conventional systems lacking direct user interaction data, LLMs exhibit exceptional proficiency in recommending items, showcasing their adeptness in comprehending intricacies of language. This marks… ▽ More

    Submitted 19 March, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  6. arXiv:2402.18139  [pdf, other

    cs.CL cs.AI

    Cause and Effect: Can Large Language Models Truly Understand Causality?

    Authors: Swagata Ashwani, Kshiteesh Hegde, Nishith Reddy Mannuru, Mayank Jindal, Dushyant Singh Sengar, Krishna Chaitanya Rao Kathala, Dishant Banga, Vinija Jain, Aman Chadha

    Abstract: With the rise of Large Language Models(LLMs), it has become crucial to understand their capabilities and limitations in deciphering and explaining the complex web of causal relationships that language entails. Current methods use either explicit or implicit causal reasoning, yet there is a strong need for a unified approach combining both to tackle a wide array of causal relationships more effecti… ▽ More

    Submitted 29 September, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: AI Trustworthiness and Risk Assessment for Challenged Contexts (ATRACC) AAAI 2024 Fall Symposium

  7. COBIAS: Assessing the Contextual Reliability of Bias Benchmarks for Language Models

    Authors: Priyanshul Govil, Hemang Jain, Vamshi Krishna Bonagiri, Aman Chadha, Ponnurangam Kumaraguru, Manas Gaur, Sanorita Dey

    Abstract: Large Language Models (LLMs) often inherit biases from the web data they are trained on, which contains stereotypes and prejudices. Current methods for evaluating and mitigating these biases rely on bias-benchmark datasets. These benchmarks measure bias by observing an LLM's behavior on biased statements. However, these statements lack contextual considerations of the situations they try to presen… ▽ More

    Submitted 16 May, 2025; v1 submitted 22 February, 2024; originally announced February 2024.

  8. arXiv:2402.11512  [pdf, other

    cs.CL cs.CY

    From Prejudice to Parity: A New Approach to Debiasing Large Language Model Word Embeddings

    Authors: Aishik Rakshit, Smriti Singh, Shuvam Keshari, Arijit Ghosh Chowdhury, Vinija Jain, Aman Chadha

    Abstract: Embeddings play a pivotal role in the efficacy of Large Language Models. They are the bedrock on which these models grasp contextual relationships and foster a more nuanced understanding of language and consequently perform remarkably on a plethora of complex tasks that require a fundamental understanding of human language. Given that these embeddings themselves often reflect or exhibit bias, it s… ▽ More

    Submitted 6 January, 2025; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted at COLING 2025

  9. arXiv:2402.09346  [pdf, other

    cs.AI

    LLMAuditor: A Framework for Auditing Large Language Models Using Human-in-the-Loop

    Authors: Maryam Amirizaniani, Jihan Yao, Adrian Lavergne, Elizabeth Snell Okada, Aman Chadha, Tanya Roosta, Chirag Shah

    Abstract: As Large Language Models (LLMs) become more pervasive across various users and scenarios, identifying potential issues when using these models becomes essential. Examples of such issues include: bias, inconsistencies, and hallucination. Although auditing the LLM for these problems is often warranted, such a process is neither easy nor accessible for most. An effective method is to probe the LLM us… ▽ More

    Submitted 22 May, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  10. arXiv:2402.09334  [pdf, other

    cs.AI

    AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach

    Authors: Maryam Amirizaniani, Elias Martin, Tanya Roosta, Aman Chadha, Chirag Shah

    Abstract: As Large Language Models (LLMs) are integrated into various sectors, ensuring their reliability and safety is crucial. This necessitates rigorous probing and auditing to maintain their effectiveness and trustworthiness in practical applications. Subjecting LLMs to varied iterations of a single query can unveil potential inconsistencies in their knowledge base or functional capacity. However, a too… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  11. arXiv:2402.07927  [pdf, other

    cs.AI cs.CL cs.HC

    A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications

    Authors: Pranab Sahoo, Ayush Kumar Singh, Sriparna Saha, Vinija Jain, Samrat Mondal, Aman Chadha

    Abstract: Prompt engineering has emerged as an indispensable technique for extending the capabilities of large language models (LLMs) and vision-language models (VLMs). This approach leverages task-specific instructions, known as prompts, to enhance model efficacy without modifying the core model parameters. Rather than updating the model parameters, prompts allow seamless integration of pre-trained models… ▽ More

    Submitted 16 March, 2025; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: 12 pages, 2 figures

  12. arXiv:2402.04929  [pdf, other

    cs.CV cs.AI cs.LG

    Source-Free Domain Adaptation with Diffusion-Guided Source Data Generation

    Authors: Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

    Abstract: This paper introduces a novel approach to leverage the generalizability of Diffusion Models for Source-Free Domain Adaptation (DM-SFDA). Our proposed DMSFDA method involves fine-tuning a pre-trained text-to-image diffusion model to generate source domain images using features from the target images to guide the diffusion process. Specifically, the pre-trained diffusion model is fine-tuned to gener… ▽ More

    Submitted 26 June, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2310.01701

  13. Post-Training Embedding Alignment for Decoupling Enrollment and Runtime Speaker Recognition Models

    Authors: Chenyang Gao, Brecht Desplanques, Chelsea J. -T. Ju, Aman Chadha, Andreas Stolcke

    Abstract: Automated speaker identification (SID) is a crucial step for the personalization of a wide range of speech-enabled services. Typical SID systems use a symmetric enrollment-verification framework with a single model to derive embeddings both offline for voice profiles extracted from enrollment utterances, and online from runtime utterances. Due to the distinct circumstances of enrollment and runtim… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: Accepted to ICASSP 2024

  14. arXiv:2401.11143  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.SD eess.AS eess.SP

    Density Adaptive Attention is All You Need: Robust Parameter-Efficient Fine-Tuning Across Multiple Modalities

    Authors: Georgios Ioannides, Aman Chadha, Aaron Elkins

    Abstract: We propose the Multi-Head Density Adaptive Attention Mechanism (DAAM), a novel probabilistic attention framework that can be used for Parameter-Efficient Fine-tuning (PEFT), and the Density Adaptive Transformer (DAT), designed to enhance information aggregation across multiple modalities, including Speech, Text, and Vision. DAAM integrates learnable mean and variance into its attention mechanism,… ▽ More

    Submitted 28 September, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  15. arXiv:2401.07872  [pdf, other

    cs.CL

    The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey

    Authors: Saurav Pawar, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Aman Chadha, Amitava Das

    Abstract: The advent of Large Language Models (LLMs) represents a notable breakthrough in Natural Language Processing (NLP), contributing to substantial progress in both text comprehension and generation. However, amidst these advancements, it is noteworthy that LLMs often face a limitation in terms of context length extrapolation. Understanding and extending the context length for LLMs is crucial in enhanc… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

  16. arXiv:2401.06709  [pdf, other

    cs.CL cs.AI

    Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text

    Authors: Muskan Garg, MSVPJ Sathvik, Amrit Chadha, Shaina Raza, Sunghwan Sohn

    Abstract: The social NLP research community witness a recent surge in the computational advancements of mental health analysis to build responsible AI models for a complex interplay between language use and self-perception. Such responsible AI models aid in quantifying the psychological concepts from user-penned texts on social media. On thinking beyond the low-level (classification) task, we advance the ex… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  17. arXiv:2401.03378  [pdf, other

    cs.DC math.NA

    CG-Kit: Code Generation Toolkit for Performant and Maintainable Variants of Source Code Applied to Flash-X Hydrodynamics Simulations

    Authors: Johann Rudi, Youngjun Lee, Aidan H. Chadha, Mohamed Wahib, Klaus Weide, Jared P. O'Neal, Anshu Dubey

    Abstract: CG-Kit is a new code generation toolkit that we propose as a solution for portability and maintainability for scientific computing applications. The development of CG-Kit is rooted in the urgent need created by the shifting landscape of high-performance computing platforms and the algorithmic complexities of a particular large-scale multiphysics application: Flash-X. This combination leads to uniq… ▽ More

    Submitted 6 January, 2024; originally announced January 2024.

    Comments: submitted

  18. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024

  19. arXiv:2401.01313  [pdf, other

    cs.CL

    A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models

    Authors: S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Vinija Jain, Anku Rani, Vipula Rawte, Aman Chadha, Amitava Das

    Abstract: As Large Language Models (LLMs) continue to advance in their ability to write human-like text, a key challenge remains around their tendency to hallucinate generating content that appears factual but is ungrounded. This issue of hallucination is arguably the biggest hindrance to safely deploying these powerful LLMs into real-world production systems that impact people's lives. The journey toward w… ▽ More

    Submitted 8 January, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

  20. arXiv:2312.11541  [pdf, other

    cs.AI cs.CL

    CLIPSyntel: CLIP and LLM Synergy for Multimodal Question Summarization in Healthcare

    Authors: Akash Ghosh, Arkadeep Acharya, Raghav Jain, Sriparna Saha, Aman Chadha, Setu Sinha

    Abstract: In the era of modern healthcare, swiftly generating medical question summaries is crucial for informed and timely patient care. Despite the increasing complexity and volume of medical data, existing studies have focused solely on text-based summarization, neglecting the integration of visual information. Recognizing the untapped potential of combining textual queries with visual representations of… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

    Comments: AAAI 2024

  21. arXiv:2312.07028  [pdf, other

    cs.CL cs.AI

    Dynamic Corrective Self-Distillation for Better Fine-Tuning of Pretrained Models

    Authors: Ibtihel Amara, Vinija Jain, Aman Chadha

    Abstract: We tackle the challenging issue of aggressive fine-tuning encountered during the process of transfer learning of pre-trained language models (PLMs) with limited labeled downstream data. This problem primarily results in a decline in performance on the subsequent task. Inspired by the adaptive boosting method in traditional machine learning, we present an effective dynamic corrective self-distillat… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

  22. arXiv:2312.00292  [pdf, other

    cs.CL

    SEPSIS: I Can Catch Your Lies -- A New Paradigm for Deception Detection

    Authors: Anku Rani, Dwip Dalal, Shreya Gautam, Pankaj Gupta, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Deception is the intentional practice of twisting information. It is a nuanced societal practice deeply intertwined with human societal evolution, characterized by a multitude of facets. This research explores the problem of deception through the lens of psychology, employing a framework that categorizes deception into three forms: lies of omission, lies of commission, and lies of influence. The p… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  23. arXiv:2310.09680  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Improved Contextual Recognition In Automatic Speech Recognition Systems By Semantic Lattice Rescoring

    Authors: Ankitha Sudarshan, Vinay Samuel, Parth Patwa, Ibtihel Amara, Aman Chadha

    Abstract: Automatic Speech Recognition (ASR) has witnessed a profound research interest. Recent breakthroughs have given ASR systems different prospects such as faithfully transcribing spoken language, which is a pivotal advancement in building conversational agents. However, there is still an imminent challenge of accurately discerning context-dependent words and phrases. In this work, we propose a novel a… ▽ More

    Submitted 3 March, 2024; v1 submitted 14 October, 2023; originally announced October 2023.

  24. arXiv:2310.07818  [pdf, other

    cs.CL cs.AI

    On the Relationship between Sentence Analogy Identification and Sentence Structure Encoding in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Aishwarya Naresh Reganti, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: The ability of Large Language Models (LLMs) to encode syntactic and semantic structures of language is well examined in NLP. Additionally, analogy identification, in the form of word analogies are extensively studied in the last decade of language modeling literature. In this work we specifically look at how LLMs' abilities to capture sentence analogies (sentences that convey analogous meaning to… ▽ More

    Submitted 5 February, 2024; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: To appear in Findings of EACL 2024

  25. arXiv:2310.05280  [pdf, other

    cs.CL cs.AI

    Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems

    Authors: Yixin Wan, Jieyu Zhao, Aman Chadha, Nanyun Peng, Kai-Wei Chang

    Abstract: Recent advancements in Large Language Models empower them to follow freeform instructions, including imitating generic or specific demographic personas in conversations. We define generic personas to represent demographic groups, such as "an Asian person", whereas specific personas may take the form of specific popular Asian names like "Yumi". While the adoption of personas enriches user experienc… ▽ More

    Submitted 2 November, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

  26. arXiv:2310.05030  [pdf, other

    cs.CL cs.AI

    Counter Turing Test CT^2: AI-Generated Text Detection is Not as Easy as You May Think -- Introducing AI Detectability Index

    Authors: Megha Chakraborty, S. M Towhidul Islam Tonmoy, S M Mehedi Zaman, Krish Sharma, Niyar R Barman, Chandan Gupta, Shreya Gautam, Tanay Kumar, Vinija Jain, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: With the rise of prolific ChatGPT, the risk and consequences of AI-generated text has increased alarmingly. To address the inevitable question of ownership attribution for AI-generated artifacts, the US Copyright Office released a statement stating that 'If a work's traditional elements of authorship were produced by a machine, the work lacks human authorship and the Office will not register it'.… ▽ More

    Submitted 23 October, 2023; v1 submitted 8 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 Main

  27. arXiv:2310.04988  [pdf, other

    cs.AI

    The Troubling Emergence of Hallucination in Large Language Models -- An Extensive Definition, Quantification, and Prescriptive Remediations

    Authors: Vipula Rawte, Swagata Chakraborty, Agnibh Pathak, Anubhav Sarkar, S. M Towhidul Islam Tonmoy, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: The recent advancements in Large Language Models (LLMs) have garnered widespread acclaim for their remarkable emerging capabilities. However, the issue of hallucination has parallelly emerged as a by-product, posing significant concerns. While some recent endeavors have been made to identify and mitigate different types of hallucination, there has been a limited emphasis on the nuanced categorizat… ▽ More

    Submitted 22 October, 2023; v1 submitted 7 October, 2023; originally announced October 2023.

  28. arXiv:2310.01701   

    cs.CV cs.AI

    Transcending Domains through Text-to-Image Diffusion: A Source-Free Approach to Domain Adaptation

    Authors: Shivang Chopra, Suraj Kothawade, Houda Aynaou, Aman Chadha

    Abstract: Domain Adaptation (DA) is a method for enhancing a model's performance on a target domain with inadequate annotated data by applying the information the model has acquired from a related source domain with sufficient labeled data. The escalating enforcement of data-privacy regulations like HIPAA, COPPA, FERPA, etc. have sparked a heightened interest in adapting models to novel domains while circum… ▽ More

    Submitted 6 February, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Revamped the whole paper; new version will be re-submitted

  29. arXiv:2309.12426  [pdf, other

    cs.CL cs.AI

    Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges

    Authors: Vinay Samuel, Houda Aynaou, Arijit Ghosh Chowdhury, Karthik Venkat Ramanan, Aman Chadha

    Abstract: Large Language Models (LLMs) have demonstrated impressive zero shot performance on a wide range of NLP tasks, demonstrating the ability to reason and apply commonsense. A relevant application is to use them for creating high quality synthetic datasets for downstream tasks. In this work, we probe whether GPT-4 can be used to augment existing extractive reading comprehension datasets. Automating dat… ▽ More

    Submitted 9 July, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: ACL 2024 SRW

  30. arXiv:2309.06517  [pdf, other

    cs.CL

    Overview of Memotion 3: Sentiment and Emotion Analysis of Codemixed Hinglish Memes

    Authors: Shreyash Mishra, S Suryavardan, Megha Chakraborty, Parth Patwa, Anku Rani, Aman Chadha, Aishwarya Reganti, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: Analyzing memes on the internet has emerged as a crucial endeavor due to the impact this multi-modal form of content wields in shaping online discourse. Memes have become a powerful tool for expressing emotions and sentiments, possibly even spreading hate and misinformation, through humor and sarcasm. In this paper, we present the overview of the Memotion 3 shared task, as part of the DeFactify 2… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

    Comments: Defactify2 @AAAI 2023

  31. arXiv:2309.06358  [pdf, other

    cs.CL cs.AI

    Generative Data Augmentation using LLMs improves Distributional Robustness in Question Answering

    Authors: Arijit Ghosh Chowdhury, Aman Chadha

    Abstract: Robustness in Natural Language Processing continues to be a pertinent issue, where state of the art models under-perform under naturally shifted distributions. In the context of Question Answering, work on domain adaptation methods continues to be a growing body of research. However, very little attention has been given to the notion of domain generalization under natural distribution shifts, wher… ▽ More

    Submitted 8 February, 2024; v1 submitted 2 September, 2023; originally announced September 2023.

    Comments: 10 tables, 1 figure, To appear at EACL 2024 Student Research Workshop

  32. arXiv:2309.05270  [pdf, other

    cs.CL cs.LG

    CONFLATOR: Incorporating Switching Point based Rotatory Positional Encodings for Code-Mixed Language Modeling

    Authors: Mohsin Ali, Kandukuri Sai Teja, Neeharika Gupta, Parth Patwa, Anubhab Chatterjee, Vinija Jain, Aman Chadha, Amitava Das

    Abstract: The mixing of two or more languages is called Code-Mixing (CM). CM is a social norm in multilingual societies. Neural Language Models (NLMs) like transformers have been effective on many NLP tasks. However, NLM for CM is an under-explored area. Though transformers are capable and powerful, they cannot always encode positional information since they are non-recurrent. Therefore, to enrich word info… ▽ More

    Submitted 18 October, 2023; v1 submitted 11 September, 2023; originally announced September 2023.

    Comments: Workshop on Computational Approaches to Linguistic Code-Switching @EMNLP2023

  33. arXiv:2308.14659  [pdf, other

    cs.LG

    RESTORE: Graph Embedding Assessment Through Reconstruction

    Authors: Hong Yung Yip, Chidaksh Ravuru, Neelabha Banerjee, Shashwat Jha, Amit Sheth, Aman Chadha, Amitava Das

    Abstract: Following the success of Word2Vec embeddings, graph embeddings (GEs) have gained substantial traction. GEs are commonly generated and evaluated extrinsically on downstream applications, but intrinsic evaluations of the original graph properties in terms of topological structure and semantic information have been lacking. Understanding these will help identify the deficiency of the various families… ▽ More

    Submitted 5 September, 2023; v1 submitted 28 August, 2023; originally announced August 2023.

  34. arXiv:2308.14301  [pdf, other

    cs.AI

    Artificial Intelligence in Career Counseling: A Test Case with ResumAI

    Authors: Muhammad Rahman, Sachi Figliolini, Joyce Kim, Eivy Cedeno, Charles Kleier, Chirag Shah, Aman Chadha

    Abstract: The rise of artificial intelligence (AI) has led to various means of integration of AI aimed to provide efficiency in tasks, one of which is career counseling. A key part of getting a job is having a solid resume that passes through the first round of programs and recruiters. It is difficult to find good resources or schedule an appointment with a career counselor to help with editing a resume for… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  35. arXiv:2308.09862  [pdf, other

    cs.CL

    Breaking Language Barriers: A Question Answering Dataset for Hindi and Marathi

    Authors: Maithili Sabane, Onkar Litake, Aman Chadha

    Abstract: The recent advances in deep-learning have led to the development of highly sophisticated systems with an unquenchable appetite for data. On the other hand, building good deep-learning models for low-resource languages remains a challenging task. This paper focuses on developing a Question Answering dataset for two such languages- Hindi and Marathi. Despite Hindi being the 3rd most spoken language… ▽ More

    Submitted 17 February, 2024; v1 submitted 18 August, 2023; originally announced August 2023.

  36. arXiv:2308.02080  [pdf, other

    cs.CL cs.LG

    Causality Guided Disentanglement for Cross-Platform Hate Speech Detection

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Social media platforms, despite their value in promoting open discourse, are often exploited to spread harmful content. Current deep learning and natural language processing models used for detecting this harmful content overly rely on domain-specific terms affecting their capabilities to adapt to generalizable hate speech detection. This is because they tend to focus too narrowly on particular li… ▽ More

    Submitted 10 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

    Comments: Accepted to WSDM'24

  37. arXiv:2307.10475  [pdf

    cs.CL cs.CV

    Findings of Factify 2: Multimodal Fake News Detection

    Authors: S Suryavardan, Shreyash Mishra, Megha Chakraborty, Parth Patwa, Anku Rani, Aman Chadha, Aishwarya Reganti, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: With social media usage growing exponentially in the past few years, fake news has also become extremely prevalent. The detrimental impact of fake news emphasizes the need for research focused on automating the detection of false information and verifying its accuracy. In this work, we present the outcome of the Factify 2 shared task, which provides a multi-modal fact verification and satire news… ▽ More

    Submitted 12 September, 2023; v1 submitted 19 July, 2023; originally announced July 2023.

    Comments: Defactify2 @AAAI 2023

  38. arXiv:2306.09331  [pdf, other

    cs.CV

    Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers

    Authors: Dominick Reilly, Aman Chadha, Srijan Das

    Abstract: Human perception of surroundings is often guided by the various poses present within the environment. Many computer vision tasks, such as human action recognition and robot imitation learning, rely on pose-based entities like human skeletons or robotic arms. However, conventional Vision Transformer (ViT) models uniformly process all patches, neglecting valuable pose priors in input videos. We argu… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: Pre-print. 18 pages

  39. arXiv:2306.08804  [pdf, other

    cs.CL cs.LG

    PEACE: Cross-Platform Hate Speech Detection- A Causality-guided Framework

    Authors: Paras Sheth, Tharindu Kumarage, Raha Moraffah, Aman Chadha, Huan Liu

    Abstract: Hate speech detection refers to the task of detecting hateful content that aims at denigrating an individual or a group based on their religion, gender, sexual orientation, or other characteristics. Due to the different policies of the platforms, different groups of people express hate in different ways. Furthermore, due to the lack of labeled data in some platforms it becomes challenging to build… ▽ More

    Submitted 8 October, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: ECML PKDD 2023

  40. arXiv:2306.05523  [pdf, other

    cs.CL cs.AI cs.CV cs.MM

    FACTIFY3M: A Benchmark for Multimodal Fact Verification with Explainability through 5W Question-Answering

    Authors: Megha Chakraborty, Khushbu Pahwa, Anku Rani, Shreyas Chatterjee, Dwip Dalal, Harshit Dave, Ritvik G, Preethi Gurumurthy, Adarsh Mahor, Samahriti Mukherjee, Aditya Pakala, Ishan Paul, Janvita Reddy, Arghya Sarkar, Kinjal Sensharma, Aman Chadha, Amit P. Sheth, Amitava Das

    Abstract: Combating disinformation is one of the burning societal crises -- about 67% of the American population believes that disinformation produces a lot of uncertainty, and 10% of them knowingly propagate disinformation. Evidence shows that disinformation can manipulate democratic processes and public opinion, causing disruption in the share market, panic and anxiety in society, and even death during cr… ▽ More

    Submitted 30 October, 2023; v1 submitted 22 May, 2023; originally announced June 2023.

    Comments: arXiv admin note: text overlap with arXiv:2305.04329

  41. arXiv:2306.02196  [pdf, other

    cs.CL

    Question-Context Alignment and Answer-Context Dependencies for Effective Answer Sentence Selection

    Authors: Minh Van Nguyen, Kishan KC, Toan Nguyen, Thien Huu Nguyen, Ankit Chadha, Thuy Vu

    Abstract: Answer sentence selection (AS2) in open-domain question answering finds answer for a question by ranking candidate sentences extracted from web documents. Recent work exploits answer context, i.e., sentences around a candidate, by incorporating them as additional input string to the Transformer models to improve the correctness scoring. In this paper, we propose to improve the candidate scoring by… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: final copy for INTERSPEECH 2023

  42. arXiv:2305.19230  [pdf, other

    cs.CL cs.AI

    Controlled Text Generation with Hidden Representation Transformations

    Authors: Vaibhav Kumar, Hana Koorehdavoudi, Masud Moshtaghi, Amita Misra, Ankit Chadha, Emilio Ferrara

    Abstract: We propose CHRT (Control Hidden Representation Transformation) - a controlled language generation framework that steers large language models to generate text pertaining to certain attributes (such as toxicity). CHRT gains attribute control by modifying the hidden representation of the base model through learned transformations. We employ a contrastive-learning framework to learn these transformat… ▽ More

    Submitted 31 May, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 as a long paper (Findings)

  43. arXiv:2305.18727  [pdf, other

    cs.CL cs.IR

    An Annotated Dataset for Explainable Interpersonal Risk Factors of Mental Disturbance in Social Media Posts

    Authors: Muskan Garg, Amirmohammad Shahbandegan, Amrit Chadha, Vijay Mago

    Abstract: With a surge in identifying suicidal risk and its severity in social media posts, we argue that a more consequential and explainable research is required for optimal impact on clinical psychology practice and personalized mental healthcare. The success of computational intelligence techniques for inferring mental illness from social media resources, points to natural language processing as a lens… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

  44. Cross-Lingual Knowledge Distillation for Answer Sentence Selection in Low-Resource Languages

    Authors: Shivanshu Gupta, Yoshitomo Matsubara, Ankit Chadha, Alessandro Moschitti

    Abstract: While impressive performance has been achieved on the task of Answer Sentence Selection (AS2) for English, the same does not hold for languages that lack large labeled datasets. In this work, we propose Cross-Lingual Knowledge Distillation (CLKD) from a strong English AS2 teacher as a method to train AS2 models for low-resource languages in the tasks without the need of labeled data for the target… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL 2023 as a long paper (Findings). Datasets are available at https://huggingface.co/datasets/AmazonScience/xtr-wiki_qa and https://huggingface.co/datasets/AmazonScience/tydi-as2

    Journal ref: Findings of the Association for Computational Linguistics: ACL 2023

  45. arXiv:2305.10438  [pdf, other

    cs.CL cs.AI cs.CV cs.MM

    IMAGINATOR: Pre-Trained Image+Text Joint Embeddings using Word-Level Grounding of Images

    Authors: Varuna Krishna, S Suryavardan, Shreyash Mishra, Sathyanarayanan Ramamoorthy, Parth Patwa, Megha Chakraborty, Aman Chadha, Amitava Das, Amit Sheth

    Abstract: Word embeddings, i.e., semantically meaningful vector representation of words, are largely influenced by the distributional hypothesis "You shall know a word by the company it keeps" (Harris, 1954), whereas modern prediction-based neural network embeddings rely on design choices and hyperparameter optimization. Word embeddings like Word2Vec, GloVe etc. well capture the contextuality and real-world… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

  46. arXiv:2305.05050  [pdf, other

    cs.CL cs.AI

    ANALOGICAL -- A Novel Benchmark for Long Text Analogy Evaluation in Large Language Models

    Authors: Thilini Wijesiriwardene, Ruwan Wickramarachchi, Bimal G. Gajera, Shreeyash Mukul Gowaikar, Chandan Gupta, Aman Chadha, Aishwarya Naresh Reganti, Amit Sheth, Amitava Das

    Abstract: Over the past decade, analogies, in the form of word-level analogies, have played a significant role as an intrinsic measure of evaluating the quality of word embedding methods such as word2vec. Modern large language models (LLMs), however, are primarily evaluated on extrinsic measures based on benchmarks such as GLUE and SuperGLUE, and there are only a few investigations on whether LLMs can draw… ▽ More

    Submitted 25 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted as a long paper at Findings of ACL 2023

  47. arXiv:2305.04329  [pdf, other

    cs.CL

    FACTIFY-5WQA: 5W Aspect-based Fact Verification through Question Answering

    Authors: Anku Rani, S. M Towhidul Islam Tonmoy, Dwip Dalal, Shreya Gautam, Megha Chakraborty, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: Automatic fact verification has received significant attention recently. Contemporary automatic fact-checking systems focus on estimating truthfulness using numerical scores which are not human-interpretable. A human fact-checker generally follows several logical steps to verify a verisimilitude claim and conclude whether its truthful or a mere masquerade. Popular fact-checking websites follow a c… ▽ More

    Submitted 28 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted at ACL main conference 2023

  48. arXiv:2304.03897  [pdf

    cs.CL cs.CV

    Factify 2: A Multimodal Fake News and Satire News Dataset

    Authors: S Suryavardan, Shreyash Mishra, Parth Patwa, Megha Chakraborty, Anku Rani, Aishwarya Reganti, Aman Chadha, Amitava Das, Amit Sheth, Manoj Chinnakotla, Asif Ekbal, Srijan Kumar

    Abstract: The internet gives the world an open platform to express their views and share their stories. While this is very valuable, it makes fake news one of our society's most pressing problems. Manual fact checking process is time consuming, which makes it challenging to disprove misleading assertions before they cause significant harm. This is he driving interest in automatic fact or claim verification.… ▽ More

    Submitted 2 October, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Defactify2 @AAAI2023

  49. arXiv:2304.03232  [pdf, other

    cs.RO

    Computationally-efficient Motion Cueing Algorithm via Model Predictive Control

    Authors: Akhil Chadha, Vishrut Jain, Andrea Michelle Rios Lazcano, Barys Shyrokau

    Abstract: Driving simulators have been used in the automotive industry for many years because of their ability to perform tests in a safe, reproducible and controlled immersive virtual environment. The improved performance of the simulator and its ability to recreate in-vehicle experience for the user is established through motion cueing algorithms (MCA). Such algorithms have constantly been developed with… ▽ More

    Submitted 6 April, 2023; originally announced April 2023.

    Comments: 6 pages, 7 figures, 1 table, conference

  50. arXiv:2303.12489  [pdf, other

    cs.LG cs.AI cs.CL cs.CV cs.MM

    Few-shot Multimodal Multitask Multilingual Learning

    Authors: Aman Chadha, Vinija Jain

    Abstract: While few-shot learning as a transfer learning paradigm has gained significant traction for scenarios with limited data, it has primarily been explored in the context of building unimodal and unilingual models. Furthermore, a significant part of the existing literature in the domain of few-shot multitask learning perform in-context learning which requires manually generated prompts as the input, y… ▽ More

    Submitted 18 February, 2023; originally announced March 2023.