Skip to main content

Showing 1–50 of 195 results for author: Choudhury, M

.
  1. arXiv:2506.01789  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.CV eess.AS

    Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability

    Authors: Genta Indra Winata, David Anugraha, Emmy Liu, Alham Fikri Aji, Shou-Yi Hung, Aditya Parashar, Patrick Amadeus Irawan, Ruochen Zhang, Zheng-Xin Yong, Jan Christian Blaise Cruz, Niklas Muennighoff, Seungone Kim, Hanyang Zhao, Sudipta Kar, Kezia Erina Suryoraharjo, M. Farid Adilazuarda, En-Shiun Annie Lee, Ayu Purwarianti, Derry Tanti Wijaya, Monojit Choudhury

    Abstract: High-quality datasets are fundamental to training and evaluating machine learning models, yet their creation-especially with accurate human annotations-remains a significant challenge. Many dataset paper submissions lack originality, diversity, or rigorous quality control, and these shortcomings are often overlooked during peer review. Submissions also frequently omit essential details about datas… ▽ More

    Submitted 3 June, 2025; v1 submitted 2 June, 2025; originally announced June 2025.

    Comments: Preprint

  2. arXiv:2506.00308  [pdf, ps, other

    cs.CY cs.AI cs.CL cs.HC

    MythTriage: Scalable Detection of Opioid Use Disorder Myths on a Video-Sharing Platform

    Authors: Hayoung Jung, Shravika Mittal, Ananya Aatreya, Navreet Kaur, Munmun De Choudhury, Tanushree Mitra

    Abstract: Understanding the prevalence of misinformation in health topics online can inform public health policies and interventions. However, measuring such misinformation at scale remains a challenge, particularly for high-stakes but understudied topics like opioid-use disorder (OUD)--a leading cause of death in the U.S. We present the first large-scale study of OUD-related myths on YouTube, a widely-used… ▽ More

    Submitted 30 May, 2025; originally announced June 2025.

    Comments: 34 pages, 14 figures, 21 tables. In submission

  3. arXiv:2505.23231  [pdf, ps, other

    cs.CY

    REDDIX-NET: A Novel Dataset and Benchmark for Moderating Online Explicit Services

    Authors: MSVPJ Sathvik, Manan Roy Choudhury, Rishita Agarwal, Sathwik Narkedimilli, Vivek Gupta

    Abstract: The rise of online platforms has enabled covert illicit activities, including online prostitution, to pose challenges for detection and regulation. In this study, we introduce REDDIX-NET, a novel benchmark dataset specifically designed for moderating online sexual services and going beyond traditional NSFW filters. The dataset is derived from thousands of web-scraped NSFW posts on Reddit and categ… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 29 pages, 15 figures

  4. arXiv:2505.20201  [pdf, other

    cs.CL

    Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations

    Authors: Mohit Chandra, Siddharth Sriraman, Harneet Singh Khanuja, Yiqiao Jin, Munmun De Choudhury

    Abstract: Limited access to mental healthcare, extended wait times, and increasing capabilities of Large Language Models (LLMs) has led individuals to turn to LLMs for fulfilling their mental health needs. However, examining the multi-turn mental health conversation capabilities of LLMs remains under-explored. Existing evaluation frameworks typically focus on diagnostic accuracy and win-rates and often over… ▽ More

    Submitted 28 May, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: 34 pages, 5 figures, 30 tables

  5. arXiv:2505.08143  [pdf, ps, other

    cs.HC cs.AI

    Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information

    Authors: Jiawei Zhou, Kritika Venkatachalam, Minje Choi, Koustuv Saha, Munmun De Choudhury

    Abstract: With the wide adoption of large language models (LLMs) in information assistance, it is essential to examine their alignment with human communication styles and values. We situate this study within the context of fact-checking health information, given the critical challenge of rectifying conceptions and building trust. Recent studies have explored the potential of LLM for health communication, bu… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  6. arXiv:2505.03770  [pdf, other

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin , et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  7. arXiv:2505.00373  [pdf, other

    astro-ph.CO

    Constraints on the state of the IGM at $z\sim 8-10$ using redshifted 21-cm observations with LOFAR

    Authors: R. Ghara, S. Zaroubi, B. Ciardi, G. Mellema, S. K. Giri, F. G. Mertens, M. Mevius, L. V. E. Koopmans, I. T. Iliev, A. Acharya, S. A. Brackenhoff, E. Ceccotti, K. Chege, I. Georgiev, S. Ghosh, I. Hothi, C. Höfer, Q. Ma, S. Munshi, A. R. Offringa, A. K. Shaw, V. N. Pandey, S. Yatawatta, M. Choudhury

    Abstract: The power spectra of the redshifted 21-cm signal from the Epoch of Reionization (EoR) contain information about the ionization and thermal states of the intergalactic medium (IGM), and depend on the properties of the EoR sources. Recently, Mertens et al 2025 has analysed 10 nights of LOFAR high-band data and estimated upper limits on the 21-cm power spectrum at redshifts 8.3, 9.1 and 10.1. Here we… ▽ More

    Submitted 1 May, 2025; originally announced May 2025.

    Comments: 23 pages, 20 figures; accepted for publication in Astronomy and Astrophysics (A&A)

  8. arXiv:2504.16715  [pdf, other

    physics.plasm-ph

    Modeling of Experimentally Observed Two-Dimensional Precursor Solitons in a Dusty Plasma by the forced Kadomtsev-Petviashvili Equation

    Authors: Ajaz Mir, Pintu Bandyopadhyay, Madhurima Choudhury, Krishan Kumar, Abhijit Sen

    Abstract: We compare model solutions of a forced Kadomtsev-Petviashvili (fKP) equation with experimental observations of dust acoustic precursor solitons excited by a supersonically moving charged cylindrical object in a dusty plasma medium. The fKP equation is derived from a three-fluid-Poisson model of the dusty plasma using the reductive perturbation technique and numerically solved for parameters close… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    Comments: 9 pages, 5 figures

  9. arXiv:2504.10501  [pdf, other

    cs.SI cs.CL cs.CY cs.HC

    Exposure to Content Written by Large Language Models Can Reduce Stigma Around Opioid Use Disorder in Online Communities

    Authors: Shravika Mittal, Darshi Shah, Shin Won Do, Mai ElSherief, Tanushree Mitra, Munmun De Choudhury

    Abstract: Widespread stigma, both in the offline and online spaces, acts as a barrier to harm reduction efforts in the context of opioid use disorder (OUD). This stigma is prominently directed towards clinically approved medications for addiction treatment (MAT), people with the condition, and the condition itself. Given the potential of artificial intelligence based technologies in promoting health equity,… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  10. arXiv:2504.09271  [pdf, other

    cs.HC cs.AI cs.CL cs.SI

    Linguistic Comparison of AI- and Human-Written Responses to Online Mental Health Queries

    Authors: Koustuv Saha, Yoshee Jain, Munmun De Choudhury

    Abstract: The ubiquity and widespread use of digital and online technologies have transformed mental health support, with online mental health communities (OMHCs) providing safe spaces for peer support. More recently, generative AI and large language models (LLMs) have introduced new possibilities for scalable, around-the-clock mental health assistance that could potentially augment and supplement the capab… ▽ More

    Submitted 12 April, 2025; originally announced April 2025.

  11. arXiv:2504.08044  [pdf, other

    cs.SI cs.CL

    Large-Scale Analysis of Online Questions Related to Opioid Use Disorder on Reddit

    Authors: Tanmay Laud, Akadia Kacha-Ochana, Steven A. Sumner, Vikram Krishnasamy, Royal Law, Lyna Schieber, Munmun De Choudhury, Mai ElSherief

    Abstract: Opioid use disorder (OUD) is a leading health problem that affects individual well-being as well as general public health. Due to a variety of reasons, including the stigma faced by people using opioids, online communities for recovery and support were formed on different social media platforms. In these communities, people share their experiences and solicit information by asking questions to lea… ▽ More

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: Accepted to ICWSM 2025

    Journal ref: Proceedings of the International AAAI Conference on Web and Social Media (ICWSM'25) (2025)

  12. arXiv:2504.06160  [pdf, other

    cs.CL cs.AI cs.CY cs.LG cs.SI

    Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups

    Authors: Rijul Magu, Arka Dutta, Sean Kim, Ashiqur R. KhudaBukhsh, Munmun De Choudhury

    Abstract: Large Language Models (LLMs) have been shown to demonstrate imbalanced biases against certain groups. However, the study of unprovoked targeted attacks by LLMs towards at-risk populations remains underexplored. Our paper presents three novel contributions: (1) the explicit evaluation of LLM-generated attacks on highly vulnerable mental health groups; (2) a network-based framework to study the prop… ▽ More

    Submitted 11 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    ACM Class: J.4; K.4.1; K.4.2

  13. arXiv:2504.06011  [pdf, other

    cs.CL

    Llama-3-Nanda-10B-Chat: An Open Generative Large Language Model for Hindi

    Authors: Monojit Choudhury, Shivam Chauhan, Rocktim Jyoti Das, Dhruv Sahnan, Xudong Han, Haonan Li, Aaryamonvikram Singh, Alok Anil Jadhav, Utkarsh Agarwal, Mukund Choudhary, Debopriyo Banerjee, Fajri Koto, Junaid Bhat, Awantika Shukla, Samujjwal Ghosh, Samta Kamboj, Onkar Pandit, Lalit Pradhan, Rahul Pal, Sunil Sahu, Soundar Doraiswamy, Parvez Mullah, Ali El Filali, Neha Sengupta, Gokul Ramakrishnan , et al. (5 additional authors not shown)

    Abstract: Developing high-quality large language models (LLMs) for moderately resourced languages presents unique challenges in data availability, model adaptation, and evaluation. We introduce Llama-3-Nanda-10B-Chat, or Nanda for short, a state-of-the-art Hindi-centric instruction-tuned generative LLM, designed to push the boundaries of open-source Hindi language models. Built upon Llama-3-8B, Nanda incorp… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  14. arXiv:2504.02793  [pdf, other

    cs.AI cs.CL cs.CY cs.HC

    A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models

    Authors: Gaurav Verma, Jiawei Zhou, Mohit Chandra, Srijan Kumar, Munmun De Choudhury

    Abstract: Large artificial intelligence (AI) models have garnered significant attention for their remarkable, often "superhuman", performance on standardized benchmarks. However, when these models are deployed in high-stakes verticals such as healthcare, education, and law, they often reveal notable limitations. For instance, they exhibit brittleness to minor variations in input data, present contextually u… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: pre-print; 7 pages of main content, 1 figure, 1 table

  15. arXiv:2503.11740  [pdf, other

    astro-ph.IM astro-ph.CO

    Square Kilometre Array Science Data Challenge 3a: foreground removal for an EoR experiment

    Authors: A. Bonaldi, P. Hartley, R. Braun, S. Purser, A. Acharya, K. Ahn, M. Aparicio Resco, O. Bait, M. Bianco, A. Chakraborty, E. Chapman, S. Chatterjee, K. Chege, H. Chen, X. Chen, Z. Chen, L. Conaboy, M. Cruz, L. Darriba, M. De Santis, P. Denzel, K. Diao, J. Feron, C. Finlay, B. Gehlot , et al. (159 additional authors not shown)

    Abstract: We present and analyse the results of the Science data challenge 3a (SDC3a, https://sdc3.skao.int/challenges/foregrounds), an EoR foreground-removal community-wide exercise organised by the Square Kilometre Array Observatory (SKAO). The challenge ran for 8 months, from March to October 2023. Participants were provided with realistic simulations of SKA-Low data between 106 MHz and 196 MHz, includin… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 29 pages, 10 figures, submitted to MNRAS

  16. arXiv:2503.01493  [pdf, other

    cs.CL

    Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

    Authors: Fajri Koto, Rituraj Joshi, Nurdaulet Mukhituly, Yuxia Wang, Zhuohan Xie, Rahul Pal, Daniil Orel, Parvez Mullah, Diana Turmakhan, Maiya Goloburda, Mohammed Kamran, Samujjwal Ghosh, Bokang Jia, Jonibek Mansurov, Mukhammed Togmanov, Debopriyo Banerjee, Nurkhan Laiyk, Akhmed Sakip, Xudong Han, Ekaterina Kochmar, Alham Fikri Aji, Aaryamonvikram Singh, Alok Anil Jadhav, Satheesh Katipomu, Samta Kamboj , et al. (10 additional authors not shown)

    Abstract: Llama-3.1-Sherkala-8B-Chat, or Sherkala-Chat (8B) for short, is a state-of-the-art instruction-tuned open generative large language model (LLM) designed for Kazakh. Sherkala-Chat (8B) aims to enhance the inclusivity of LLM advancements for Kazakh speakers. Adapted from the LLaMA-3.1-8B model, Sherkala-Chat (8B) is trained on 45.3B tokens across Kazakh, English, Russian, and Turkish. With 8 billion… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Technical Report

  17. arXiv:2502.12414  [pdf, other

    cs.CL

    Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models

    Authors: Hanin Atwany, Abdul Waheed, Rita Singh, Monojit Choudhury, Bhiksha Raj

    Abstract: Speech foundation models trained at a massive scale, both in terms of model and data size, result in robust systems capable of performing multiple speech tasks, including automatic speech recognition (ASR). These models transcend language and domain barriers, yet effectively measuring their performance remains a challenge. Traditional metrics like word error rate (WER) and character error rate (CE… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: The first two authors contributed equally as co-first authors. The manuscript is 21 pages long and is a work in progress

  18. arXiv:2502.09637  [pdf, other

    cs.CY cs.AI cs.CL

    Meta-Cultural Competence: Climbing the Right Hill of Cultural Awareness

    Authors: Sougata Saha, Saurabh Kumar Pandey, Monojit Choudhury

    Abstract: Numerous recent studies have shown that Large Language Models (LLMs) are biased towards a Western and Anglo-centric worldview, which compromises their usefulness in non-Western cultural settings. However, "culture" is a complex, multifaceted topic, and its awareness, representation, and modeling in LLMs and LLM-based applications can be defined and measured in numerous ways. In this position paper… ▽ More

    Submitted 8 February, 2025; originally announced February 2025.

  19. arXiv:2502.09636  [pdf, other

    cs.CL cs.AI

    Reading between the Lines: Can LLMs Identify Cross-Cultural Communication Gaps?

    Authors: Sougata Saha, Saurabh Kumar Pandey, Harshit Gupta, Monojit Choudhury

    Abstract: In a rapidly globalizing and digital world, content such as book and product reviews created by people from diverse cultures are read and consumed by others from different corners of the world. In this paper, we investigate the extent and patterns of gaps in understandability of book reviews due to the presence of culturally-specific items and elements that might be alien to users from another cul… ▽ More

    Submitted 20 February, 2025; v1 submitted 8 February, 2025; originally announced February 2025.

  20. arXiv:2502.07328  [pdf, other

    cs.SD cs.AI cs.CL cs.LG cs.MM

    Music for All: Representational Bias and Cross-Cultural Adaptability of Music Generation Models

    Authors: Atharva Mehta, Shivam Chauhan, Amirbek Djanibekov, Atharva Kulkarni, Gus Xia, Monojit Choudhury

    Abstract: The advent of Music-Language Models has greatly enhanced the automatic music generation capability of AI systems, but they are also limited in their coverage of the musical genres and cultures of the world. We present a study of the datasets and research papers for music generation and quantify the bias and under-representation of genres. We find that only 5.7% of the total hours of existing music… ▽ More

    Submitted 6 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 17 pages, 5 figures, accepted to NAACL'25

  21. arXiv:2502.07101  [pdf, other

    cs.CL

    SMAB: MAB based word Sensitivity Estimation Framework and its Applications in Adversarial Text Generation

    Authors: Saurabh Kumar Pandey, Sachin Vashistha, Debrup Das, Somak Aditya, Monojit Choudhury

    Abstract: To understand the complexity of sequence classification tasks, Hahn et al. (2021) proposed sensitivity as the number of disjoint subsets of the input sequence that can each be individually changed to change the output. Though effective, calculating sensitivity at scale using this framework is costly because of exponential time complexity. Therefore, we introduce a Sensitivity-based Multi-Armed Ban… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  22. arXiv:2501.05621  [pdf, ps, other

    cs.CY cs.HC

    Employing Social Media to Improve Mental Health Outcomes

    Authors: Munmun De Choudhury

    Abstract: As social media platforms are increasingly adopted, the data the data people leave behind is shining new light into our understanding of phenomena, ranging from socio-economic-political events to the spread of infectious diseases. This chapter presents research conducted in the past decade that has harnessed social media data in the service of mental health and well-being. The discussion is organi… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

  23. arXiv:2501.03479  [pdf

    cs.CL

    Women, Infamous, and Exotic Beings: What Honorific Usages in Wikipedia Reveal about the Socio-Cultural Norms

    Authors: Sourabrata Mukherjee, Soumya Teotia, Sougata Saha, Monojit Choudhury

    Abstract: Honorifics serve as powerful linguistic markers that reflect social hierarchies and cultural values. This paper presents a large-scale, cross-linguistic exploration of usage of honorific pronouns in Bengali and Hindi Wikipedia articles, shedding light on how socio-cultural factors shape language. Using LLM (GPT-4o), we annotated 10, 000 articles of real and fictional beings in each language for se… ▽ More

    Submitted 6 March, 2025; v1 submitted 6 January, 2025; originally announced January 2025.

  24. arXiv:2412.18551  [pdf, other

    cs.CL

    Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability

    Authors: Haonan Li, Xudong Han, Zenan Zhai, Honglin Mu, Hao Wang, Zhenxuan Zhang, Yilin Geng, Shom Lin, Renxi Wang, Artem Shelmanov, Xiangyu Qi, Yuxia Wang, Donghai Hong, Youliang Yuan, Meng Chen, Haoqin Tu, Fajri Koto, Tatsuki Kuribayashi, Cong Zeng, Rishabh Bhardwaj, Bingchen Zhao, Yawen Duan, Yi Liu, Emad A. Alghamdi, Yaodong Yang , et al. (10 additional authors not shown)

    Abstract: To address this gap, we introduce Libra-Leaderboard, a comprehensive framework designed to rank LLMs through a balanced evaluation of performance and safety. Combining a dynamic leaderboard with an interactive LLM arena, Libra-Leaderboard encourages the joint optimization of capability and safety. Unlike traditional approaches that average performance and safety metrics, Libra-Leaderboard uses a d… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

  25. arXiv:2412.07951  [pdf, ps, other

    cs.HC cs.AI cs.CY

    From Lived Experience to Insight: Unpacking the Psychological Risks of Using AI Conversational Agents

    Authors: Mohit Chandra, Suchismita Naik, Denae Ford, Ebele Okoli, Munmun De Choudhury, Mahsa Ershadi, Gonzalo Ramos, Javier Hernandez, Ananya Bhattacharjee, Shahed Warreth, Jina Suh

    Abstract: Recent gains in popularity of AI conversational agents have led to their increased use for improving productivity and supporting well-being. While previous research has aimed to understand the risks associated with interactions with AI conversational agents, these studies often fall short in capturing the lived experiences of individuals. Additionally, psychological risks have often been presented… ▽ More

    Submitted 29 May, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: 31 pages, 6 figures, 8 tables; Accepted at ACM FAccT 2025

  26. arXiv:2412.04100  [pdf, other

    cs.SD cs.AI cs.CL cs.LG eess.AS

    Missing Melodies: AI Music Generation and its "Nearly" Complete Omission of the Global South

    Authors: Atharva Mehta, Shivam Chauhan, Monojit Choudhury

    Abstract: Recent advances in generative AI have sparked renewed interest and expanded possibilities for music generation. However, the performance and versatility of these systems across musical genres are heavily influenced by the availability of training data. We conducted an extensive analysis of over one million hours of audio datasets used in AI music generation research and manually reviewed more than… ▽ More

    Submitted 12 December, 2024; v1 submitted 5 December, 2024; originally announced December 2024.

    Comments: Submitted to CACM, 12 pages, 2 figures

  27. arXiv:2411.19925  [pdf, other

    physics.optics physics.app-ph

    Efficiency Enhancement of c-Si/TiO$_2$ Heterojunction Thin Film Solar Cell Using Hybrid Metal-Dielectric Nanostructures

    Authors: Soikot Sarkar, Sajid Muhaimin Choudhury

    Abstract: The hybrid metal-dielectric nanostructures (HMDN) are promising candidates to address the ohmic loss by conventional nanostructures in photovoltaic applications by strong confinement and high scattering directivity. In this study, we present a c-Si/TiO$_2$ heterojunction thin film solar cell (TFSC) where a pair of triangular HMDN comprised of Ag and AZO was utilized to enhance the longer wavelengt… ▽ More

    Submitted 14 May, 2025; v1 submitted 29 November, 2024; originally announced November 2024.

    Comments: 46 page 10 figures

    Journal ref: Solar Energy, Volume 296, August 2025, 113535

  28. arXiv:2411.16508  [pdf, other

    cs.CV cs.CL

    All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages

    Authors: Ashmal Vayani, Dinura Dissanayake, Hasindri Watawana, Noor Ahsan, Nevasini Sasikumar, Omkar Thawakar, Henok Biadglign Ademtew, Yahya Hmaiti, Amandeep Kumar, Kartik Kuckreja, Mykola Maslych, Wafa Al Ghallabi, Mihail Mihaylov, Chao Qin, Abdelrahman M Shaker, Mike Zhang, Mahardika Krisna Ihsani, Amiel Esplana, Monil Gokani, Shachar Mirkin, Harsh Singh, Ashay Srivastava, Endre Hamerlik, Fathinah Asma Izzati, Fadillah Adamsyah Maani , et al. (44 additional authors not shown)

    Abstract: Existing Large Multimodal Models (LMMs) generally focus on only a few regions and languages. As LMMs continue to improve, it is increasingly important to ensure they understand cultural contexts, respect local sensitivities, and support low-resource languages, all while effectively integrating corresponding visual cues. In pursuit of culturally diverse global multimodal models, our proposed All La… ▽ More

    Submitted 30 April, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

    Comments: A Multilingual Multimodal cultural benchmark for 100 languages

  29. arXiv:2411.14720  [pdf

    cs.CL

    Optimizing Social Media Annotation of HPV Vaccine Skepticism and Misinformation Using Large Language Models: An Experimental Evaluation of In-Context Learning and Fine-Tuning Stance Detection Across Multiple Models

    Authors: Luhang Sun, Varsha Pendyala, Yun-Shiuan Chuang, Shanglin Yang, Jonathan Feldman, Andrew Zhao, Munmun De Choudhury, Sijia Yang, Dhavan Shah

    Abstract: This paper leverages large-language models (LLMs) to experimentally determine optimal strategies for scaling up social media content annotation for stance detection on HPV vaccine-related tweets. We examine both conventional fine-tuning and emergent in-context learning methods, systematically varying strategies of prompt engineering across widely used LLMs and their variants (e.g., GPT4, Mistral,… ▽ More

    Submitted 2 April, 2025; v1 submitted 21 November, 2024; originally announced November 2024.

  30. arXiv:2411.12356  [pdf, other

    physics.optics physics.app-ph

    Design of Dual-Band Plasmonic Absorber for Biomedical Sensing and Environmental Monitoring

    Authors: Ayon Sarker, Sajid Muhaimin Choudhury

    Abstract: This study introduces a dual-band plasmonic absorber designed for simultaneous sensing applications in the near-infrared (NIR) and mid-infrared (MIR) regions. The absorber, composed of silver nanostructures on a metal plate with a dielectric spacer, exhibits a combination of localized and gap surface plasmon resonances, resulting in two distinct absorption peaks in theoretical analysis based on th… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 13 pages, 10 figures, submitted to Journal of Optics and Laser Technology for review

  31. arXiv:2411.02798  [pdf, other

    cs.CR

    TRANSPOSE: Transitional Approaches for Spatially-Aware LFI Resilient FSM Encoding

    Authors: Muhtadi Choudhury, Minyan Gao, Avinash Varna, Elad Peer, Domenic Forte

    Abstract: Finite state machines (FSMs) regulate sequential circuits, including access to sensitive information and privileged CPU states. Courtesy of contemporary research on laser attacks, laser-based fault injection (LFI) is becoming even more precise where an adversary can thwart chip security by altering individual flip-flop (FF) values. Different laser models, e.g., bit flip, bit set, and bit reset, ha… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

    Comments: 14 pages, 11 figures

  32. arXiv:2411.02594  [pdf, other

    cs.HC cs.AI cs.CL

    "It's a conversation, not a quiz": A Risk Taxonomy and Reflection Tool for LLM Adoption in Public Health

    Authors: Jiawei Zhou, Amy Z. Chen, Darshi Shah, Laura Schwab Reese, Munmun De Choudhury

    Abstract: Recent breakthroughs in large language models (LLMs) have generated both interest and concern about their potential adoption as accessible information sources or communication tools across different domains. In public health -- where stakes are high and impacts extend across populations -- adopting LLMs poses unique challenges that require thorough evaluation. However, structured approaches for as… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  33. arXiv:2410.22446  [pdf, other

    cs.CL cs.AI

    Do Large Language Models Align with Core Mental Health Counseling Competencies?

    Authors: Viet Cuong Nguyen, Mohammad Taher, Dongwan Hong, Vinicius Konkolics Possobom, Vibha Thirunellayi Gopalakrishnan, Ekta Raj, Zihang Li, Heather J. Soled, Michael L. Birnbaum, Srijan Kumar, Munmun De Choudhury

    Abstract: The rapid evolution of Large Language Models (LLMs) presents a promising solution to the global shortage of mental health professionals. However, their alignment with essential counseling competencies remains underexplored. We introduce CounselingBench, a novel NCMHCE-based benchmark evaluating 22 general-purpose and medical-finetuned LLMs across five key competencies. While frontier models surpas… ▽ More

    Submitted 26 February, 2025; v1 submitted 29 October, 2024; originally announced October 2024.

    Comments: 10 Pages, Accepted to Findings of NAACL 2025

  34. arXiv:2410.20817  [pdf, other

    cs.CL

    The Zeno's Paradox of `Low-Resource' Languages

    Authors: Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury

    Abstract: The disparity in the languages commonly studied in Natural Language Processing (NLP) is typically reflected by referring to languages as low vs high-resourced. However, there is limited consensus on what exactly qualifies as a `low-resource language.' To understand how NLP papers define and study `low resource' languages, we qualitatively analyzed 150 papers from the ACL Anthology and popular spee… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: Accepted at EMNLP 2024

  35. arXiv:2410.19155  [pdf, other

    cs.CL cs.AI cs.CY

    Lived Experience Not Found: LLMs Struggle to Align with Experts on Addressing Adverse Drug Reactions from Psychiatric Medication Use

    Authors: Mohit Chandra, Siddharth Sriraman, Gaurav Verma, Harneet Singh Khanuja, Jose Suarez Campayo, Zihang Li, Michael L. Birnbaum, Munmun De Choudhury

    Abstract: Adverse Drug Reactions (ADRs) from psychiatric medications are the leading cause of hospitalizations among mental health patients. With healthcare systems and online communities facing limitations in resolving ADR-related issues, Large Language Models (LLMs) have the potential to fill this gap. Despite the increasing capabilities of LLMs, past research has not explored their capabilities in detect… ▽ More

    Submitted 7 January, 2025; v1 submitted 24 October, 2024; originally announced October 2024.

    Comments: 30 pages, 8 figures, 16 tables

  36. arXiv:2409.19492  [pdf, ps, other

    cs.CL cs.AI

    MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models

    Authors: Vibhor Agarwal, Yiqiao Jin, Mohit Chandra, Munmun De Choudhury, Srijan Kumar, Nishanth Sastry

    Abstract: The remarkable capabilities of large language models (LLMs) in language understanding and generation have not rendered them immune to hallucinations. LLMs can still generate plausible-sounding but factually incorrect or fabricated information. As LLM-empowered chatbots become popular, laypeople may frequently ask health-related queries and risk falling victim to these LLM hallucinations, resulting… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

    Comments: 14 pages

  37. arXiv:2409.15704  [pdf, other

    cs.OS

    Assessing FIFO and Round Robin Scheduling:Effects on Data Pipeline Performance and Energy Usage

    Authors: Malobika Roy Choudhury, Akshat Mehrotra

    Abstract: In the case of compute-intensive machine learning, efficient operating system scheduling is crucial for performance and energy efficiency. This paper conducts a comparative study over FIFO(First-In-First-Out) and RR(Round-Robin) scheduling policies with the application of real-time machine learning training processes and data pipelines on Ubuntu-based systems. Knowing a few patterns of CPU usage a… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  38. arXiv:2409.10658  [pdf, other

    physics.optics

    Lithium Niobate Photonic Topological Insulator-based Multi-Wavelength Optical Demultiplexer with Piezoelectric Switch-Off

    Authors: Prithu Mahmud, Kaniz Fatema Supti, Sajid Muhaimin Choudhury

    Abstract: Photonic topological insulators provide unidirectional, robust, wavelength-selective transport of light at an interface while keeping it insulated at the bulk of the material. The non-trivial topology results in an immunity to backscattering, sharp turns, and fabrication defects. This work leverages these unique properties to design a 2-channel optical demultiplexer based on a lithium niobate phot… ▽ More

    Submitted 8 December, 2024; v1 submitted 16 September, 2024; originally announced September 2024.

    Comments: added reference to journal

    Journal ref: Opt. Express 32, 45786-45800 (2024)

  39. arXiv:2409.09662  [pdf, other

    cs.HC cs.AI cs.CL

    ExploreSelf: Fostering User-driven Exploration and Reflection on Personal Challenges with Adaptive Guidance by Large Language Models

    Authors: Inhwa Song, SoHyun Park, Sachin R. Pendse, Jessica Lee Schleider, Munmun De Choudhury, Young-Ho Kim

    Abstract: Expressing stressful experiences in words is proven to improve mental and physical health, but individuals often disengage with writing interventions as they struggle to organize their thoughts and emotions. Reflective prompts have been used to provide direction, and large language models (LLMs) have demonstrated the potential to provide tailored guidance. However, current systems often limit user… ▽ More

    Submitted 5 February, 2025; v1 submitted 15 September, 2024; originally announced September 2024.

    Comments: 17 pages excluding reference and appendix. Accepted at ACM CHI 2025. https://naver-ai.github.io/exploreself

    Report number: 306 ACM Class: H.5.2; I.2.7

    Journal ref: CHI '25: Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems

  40. arXiv:2407.15227  [pdf, other

    cs.CL cs.SI

    A Community-Centric Perspective for Characterizing and Detecting Anti-Asian Violence-Provoking Speech

    Authors: Gaurav Verma, Rynaa Grover, Jiawei Zhou, Binny Mathew, Jordan Kraemer, Munmun De Choudhury, Srijan Kumar

    Abstract: Violence-provoking speech -- speech that implicitly or explicitly promotes violence against the members of the targeted community, contributed to a massive surge in anti-Asian crimes during the pandemic. While previous works have characterized and built tools for detecting other forms of harmful speech, like fear speech and hate speech, our work takes a community-centric approach to studying anti-… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Accepted to ACL 2024 Main

  41. arXiv:2407.03523  [pdf, other

    astro-ph.CO

    Inferring IGM parameters from the redshifted 21-cm Power Spectrum using Artificial Neural Networks

    Authors: Madhurima Choudhury, Raghunath Ghara, Saleem Zaroubi, Benedetta Ciardi, Leon V. E. Koopmans, Garrelt Mellema, Abinash Kumar Shaw, Anshuman Acharya, I. T. Iliev, Qing-Bo Ma, Sambit K. Giri

    Abstract: The high redshift 21-cm signal promises to be a crucial probe of the state of the intergalactic medium (IGM). Understanding the connection between the observed 21-cm power spectrum and the physical quantities intricately associated with the IGM is crucial to fully understand the evolution of our Universe. In this study, we develop an emulator using artificial neural network (ANN) to predict the 21… ▽ More

    Submitted 6 May, 2025; v1 submitted 3 July, 2024; originally announced July 2024.

  42. arXiv:2407.02662  [pdf, other

    cs.SI cs.CL cs.CY

    Supporters and Skeptics: LLM-based Analysis of Engagement with Mental Health (Mis)Information Content on Video-sharing Platforms

    Authors: Viet Cuong Nguyen, Mini Jain, Abhijat Chauhan, Heather Jaime Soled, Santiago Alvarez Lesmes, Zihang Li, Michael L. Birnbaum, Sunny X. Tang, Srijan Kumar, Munmun De Choudhury

    Abstract: Over one in five adults in the US lives with a mental illness. In the face of a shortage of mental health professionals and offline resources, online short-form video content has grown to serve as a crucial conduit for disseminating mental health help and resources. However, the ease of content creation and access also contributes to the spread of misinformation, posing risks to accurate diagnosis… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 12 pages, in submission to ICWSM

  43. arXiv:2406.12702  [pdf, other

    cs.CL

    [WIP] Jailbreak Paradox: The Achilles' Heel of LLMs

    Authors: Abhinav Rao, Monojit Choudhury, Somak Aditya

    Abstract: We introduce two paradoxes concerning jailbreak of foundation models: First, it is impossible to construct a perfect jailbreak classifier, and second, a weaker model cannot consistently detect whether a stronger (in a pareto-dominant sense) model is jailbroken or not. We provide formal proofs for these paradoxes and a short case study on Llama and GPT4-o to demonstrate this. We discuss broader the… ▽ More

    Submitted 20 June, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

  44. arXiv:2406.11661  [pdf, other

    cs.CL

    Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting

    Authors: Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury

    Abstract: Socio-demographic prompting is a commonly employed approach to study cultural biases in LLMs as well as for aligning models to certain cultures. In this paper, we systematically probe four LLMs (Llama 3, Mistral v0.2, GPT-3.5 Turbo and GPT-4) with prompts that are conditioned on culturally sensitive and non-sensitive cues, on datasets that are supposed to be culturally sensitive (EtiCor and CALI)… ▽ More

    Submitted 20 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

  45. arXiv:2406.05519  [pdf, other

    physics.optics physics.app-ph

    Synergizing Deep Learning and Phase Change Materials for Four-state Broadband Multifunctional Metasurfaces in the Visible Range

    Authors: Md. Ehsanul Karim, Md. Redwanul Karim, Sajid Muhaimin Choudhury

    Abstract: In this article, we report, for the first time, broadband multifunctional metasurfaces with more than four distinct functionalities. The constituent meta-atoms combine two different phase change materials, $\mathrm{VO_2}$ and $\mathrm{Sb_2S_3}$ in a multi-stage configuration. FDTD simulations demonstrate a broadband reflection amplitude switching between the four states in visible range due to the… ▽ More

    Submitted 28 July, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Journal ref: Optics & Laser Technology Volume 181, Part A , February 2025, 111730

  46. arXiv:2406.00314  [pdf, other

    cs.CL cs.AI cs.LG

    CASE: Efficient Curricular Data Pre-training for Building Assistive Psychology Expert Models

    Authors: Sarthak Harne, Monjoy Narayan Choudhury, Madhav Rao, TK Srikanth, Seema Mehrotra, Apoorva Vashisht, Aarushi Basu, Manjit Sodhi

    Abstract: The limited availability of psychologists necessitates efficient identification of individuals requiring urgent mental healthcare. This study explores the use of Natural Language Processing (NLP) pipelines to analyze text data from online mental health forums used for consultations. By analyzing forum posts, these pipelines can flag users who may require immediate professional attention. A crucial… ▽ More

    Submitted 2 October, 2024; v1 submitted 1 June, 2024; originally announced June 2024.

  47. arXiv:2405.17840  [pdf, other

    cs.CL

    Benchmarks Underestimate the Readiness of Multi-lingual Dialogue Agents

    Authors: Andrew H. Lee, Sina J. Semnani, Galo Castillo-López, Gäel de Chalendar, Monojit Choudhury, Ashna Dua, Kapil Rajesh Kavitha, Sungkyun Kim, Prashant Kodali, Ponnurangam Kumaraguru, Alexis Lombard, Mehrad Moradshahi, Gihyun Park, Nasredine Semmar, Jiwon Seo, Tianhao Shen, Manish Shrivastava, Deyi Xiong, Monica S. Lam

    Abstract: Creating multilingual task-oriented dialogue (TOD) agents is challenging due to the high cost of training data acquisition. Following the research trend of improving training data efficiency, we show for the first time, that in-context learning is sufficient to tackle multilingual TOD. To handle the challenging dialogue state tracking (DST) subtask, we break it down to simpler steps that are mor… ▽ More

    Submitted 16 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

  48. arXiv:2405.05572  [pdf, other

    cs.CL cs.AI

    From Human Judgements to Predictive Models: Unravelling Acceptability in Code-Mixed Sentences

    Authors: Prashant Kodali, Anmol Goel, Likhith Asapu, Vamshi Krishna Bonagiri, Anirudh Govil, Monojit Choudhury, Ponnurangam Kumaraguru, Manish Shrivastava

    Abstract: Current computational approaches for analysing or generating code-mixed sentences do not explicitly model ``naturalness'' or ``acceptability'' of code-mixed sentences, but rely on training corpora to reflect distribution of acceptable code-mixed sentences. Modelling human judgement for the acceptability of code-mixed text can help in distinguishing natural code-mixed text and enable quality-contro… ▽ More

    Submitted 5 May, 2025; v1 submitted 9 May, 2024; originally announced May 2024.

  49. arXiv:2405.05378  [pdf, other

    cs.CL cs.AI cs.CY cs.HC cs.LG

    "They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations

    Authors: Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra

    Abstract: Large language models (LLMs) have emerged as an integral part of modern societies, powering user-facing applications such as personal assistants and enterprise applications like recruitment tools. Despite their utility, research indicates that LLMs perpetuate systemic biases. Yet, prior works on LLM harms predominantly focus on Western concepts like race and gender, often overlooking cultural conc… ▽ More

    Submitted 8 May, 2024; originally announced May 2024.

  50. arXiv:2404.18460  [pdf, other

    cs.CL cs.AI

    Ethical Reasoning and Moral Value Alignment of LLMs Depend on the Language we Prompt them in

    Authors: Utkarsh Agarwal, Kumar Tanmay, Aditi Khandelwal, Monojit Choudhury

    Abstract: Ethical reasoning is a crucial skill for Large Language Models (LLMs). However, moral values are not universal, but rather influenced by language and culture. This paper explores how three prominent LLMs -- GPT-4, ChatGPT, and Llama2-70B-Chat -- perform ethical reasoning in different languages and if their moral judgement depend on the language in which they are prompted. We extend the study of et… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.