Skip to main content

Showing 1–50 of 118 results for author: Shivani

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.11022  [pdf, ps, other

    cs.SE cs.AI cs.CL cs.CR cs.LG

    Security Degradation in Iterative AI Code Generation -- A Systematic Analysis of the Paradox

    Authors: Shivani Shukla, Himanshu Joshi, Romilla Syed

    Abstract: The rapid adoption of Large Language Models(LLMs) for code generation has transformed software development, yet little attention has been given to how security vulnerabilities evolve through iterative LLM feedback. This paper analyzes security degradation in AI-generated code through a controlled experiment with 400 code samples across 40 rounds of "improvements" using four distinct prompting stra… ▽ More

    Submitted 19 May, 2025; originally announced June 2025.

    Comments: Keywords - Large Language Models, Security Vulnerabilities, AI-Generated Code, Iterative Feedback, Software Security, Secure Coding Practices, Feedback Loops, LLM Prompting Strategies

  2. arXiv:2506.09661  [pdf, ps, other

    eess.IV cs.CV q-bio.TO

    A Cytology Dataset for Early Detection of Oral Squamous Cell Carcinoma

    Authors: Garima Jain, Sanghamitra Pati, Mona Duggal, Amit Sethi, Abhijeet Patil, Gururaj Malekar, Nilesh Kowe, Jitender Kumar, Jatin Kashyap, Divyajeet Rout, Deepali, Hitesh, Nishi Halduniya, Sharat Kumar, Heena Tabassum, Rupinder Singh Dhaliwal, Sucheta Devi Khuraijam, Sushma Khuraijam, Sharmila Laishram, Simmi Kharb, Sunita Singh, K. Swaminadtan, Ranjana Solanki, Deepika Hemranjani, Shashank Nath Singh , et al. (12 additional authors not shown)

    Abstract: Oral squamous cell carcinoma OSCC is a major global health burden, particularly in several regions across Asia, Africa, and South America, where it accounts for a significant proportion of cancer cases. Early detection dramatically improves outcomes, with stage I cancers achieving up to 90 percent survival. However, traditional diagnosis based on histopathology has limited accessibility in low-res… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

    Comments: 7 pages, 2 figurs

  3. arXiv:2506.06971  [pdf, ps, other

    cs.CL cs.CR

    Chain-of-Code Collapse: Reasoning Failures in LLMs via Adversarial Prompting in Code Generation

    Authors: Jaechul Roh, Varun Gandhi, Shivani Anilkumar, Arin Garg

    Abstract: Large Language Models (LLMs) have achieved remarkable success in tasks requiring complex reasoning, such as code generation, mathematical problem solving, and algorithmic synthesis -- especially when aided by reasoning tokens and Chain-of-Thought prompting. Yet, a core question remains: do these models truly reason, or do they merely exploit shallow statistical patterns? In this paper, we introduc… ▽ More

    Submitted 12 June, 2025; v1 submitted 7 June, 2025; originally announced June 2025.

  4. arXiv:2506.05182  [pdf, ps, other

    cs.IR

    On the Comprehensibility of Multi-structured Financial Documents using LLMs and Pre-processing Tools

    Authors: Shivani Upadhyay, Messiah Ataey, Shariyar Murtuza, Yifan Nie, Jimmy Lin

    Abstract: The proliferation of complex structured data in hybrid sources, such as PDF documents and web pages, presents unique challenges for current Large Language Models (LLMs) and Multi-modal Large Language Models (MLLMs) in providing accurate answers. Despite the recent advancements of MLLMs, they still often falter when interpreting intricately structured information, such as nested tables and multi-di… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 15 pages, 5 figures, 9 tables

  5. arXiv:2506.03182  [pdf, ps, other

    cs.CV cs.LG

    TerraIncognita: A Dynamic Benchmark for Species Discovery Using Frontier Models

    Authors: Shivani Chiranjeevi, Hossein Zaremehrjerdi, Zi K. Deng, Talukder Z. Jubery, Ari Grele, Arti Singh, Asheesh K Singh, Soumik Sarkar, Nirav Merchant, Harold F. Greeney, Baskar Ganapathysubramanian, Chinmay Hegde

    Abstract: The rapid global loss of biodiversity, particularly among insects, represents an urgent ecological crisis. Current methods for insect species discovery are manual, slow, and severely constrained by taxonomic expertise, hindering timely conservation actions. We introduce TerraIncognita, a dynamic benchmark designed to evaluate state-of-the-art multimodal models for the challenging problem of identi… ▽ More

    Submitted 29 May, 2025; originally announced June 2025.

  6. arXiv:2505.24090  [pdf, other

    cs.DB cs.AI

    Searching Clinical Data Using Generative AI

    Authors: Karan Hanswadkar, Anika Kanchi, Shivani Tripathi, Shi Qiao, Rony Chatterjee, Alekh Jindal

    Abstract: Artificial Intelligence (AI) is making a major impact on healthcare, particularly through its application in natural language processing (NLP) and predictive analytics. The healthcare sector has increasingly adopted AI for tasks such as clinical data analysis and medical code assignment. However, searching for clinical information in large and often unorganized datasets remains a manual and error-… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

  7. arXiv:2505.00306  [pdf, other

    cs.RO

    J-PARSE: Jacobian-based Projection Algorithm for Resolving Singularities Effectively in Inverse Kinematic Control of Serial Manipulators

    Authors: Shivani Guptasarma, Matthew Strong, Honghao Zhen, Monroe Kennedy III

    Abstract: J-PARSE is a method for smooth first-order inverse kinematic control of a serial manipulator near kinematic singularities. The commanded end-effector velocity is interpreted component-wise, according to the available mobility in each dimension of the task space. First, a substitute "Safety" Jacobian matrix is created, keeping the aspect ratio of the manipulability ellipsoid above a threshold value… ▽ More

    Submitted 6 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

    Comments: 18 pages, 25 figures. v1: Fig. 1 replaced with faster-loading version

  8. arXiv:2504.20006  [pdf, other

    cs.IR

    Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses

    Authors: Sahel Sharifymoghaddam, Shivani Upadhyay, Nandan Thakur, Ronak Pradeep, Jimmy Lin

    Abstract: Battles, or side-by-side comparisons in so-called arenas that elicit human preferences, have emerged as a popular approach for assessing the output quality of LLMs. Recently, this idea has been extended to retrieval-augmented generation (RAG) systems. While undoubtedly representing an advance in evaluation, battles have at least two drawbacks, particularly in the context of complex information-see… ▽ More

    Submitted 25 May, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

    Comments: 10 pages, 8 figures, 3 tables

  9. arXiv:2504.15205  [pdf, other

    cs.CL cs.AI cs.IR

    Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges

    Authors: Nandan Thakur, Ronak Pradeep, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Retrieval-augmented generation (RAG) enables large language models (LLMs) to generate answers with citations from source documents containing "ground truth", thereby reducing system hallucinations. A crucial factor in RAG evaluation is "support", whether the information in the cited documents supports the answer. To this end, we conducted a large-scale comparative study of 45 participant submissio… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Accepted at SIGIR 2025 (short)

  10. arXiv:2504.15068  [pdf, other

    cs.IR cs.CL

    The Great Nugget Recall: Automating Fact Extraction and RAG Evaluation with Large Language Models

    Authors: Ronak Pradeep, Nandan Thakur, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: Large Language Models (LLMs) have significantly enhanced the capabilities of information access systems, especially with retrieval-augmented generation (RAG). Nevertheless, the evaluation of RAG systems remains a barrier to continued progress, a challenge we tackle in this work by proposing an automatic evaluation framework that is validated against human annotations. We believe that the nugget ev… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: To appear in SIGIR 2025. Significant updates and revisions to arXiv:2411.09607

  11. arXiv:2504.11812  [pdf, ps, other

    cs.NE cs.AI

    Learning Strategies in Particle Swarm Optimizer: A Critical Review and Performance Analysis

    Authors: Dikshit Chauhan, Shivani, P. N. Suganthan

    Abstract: Nature has long inspired the development of swarm intelligence (SI), a key branch of artificial intelligence that models collective behaviors observed in biological systems for solving complex optimization problems. Particle swarm optimization (PSO) is widely adopted among SI algorithms due to its simplicity and efficiency. Despite numerous learning strategies proposed to enhance PSO's performance… ▽ More

    Submitted 16 April, 2025; originally announced April 2025.

    Comments: 53 pages, 14 figures

  12. arXiv:2504.00717  [pdf, ps, other

    cs.NE cs.AI

    Advancements in Multimodal Differential Evolution: A Comprehensive Review and Future Perspectives

    Authors: Dikshit Chauhan, Shivani, Donghwi Jung, Anupam Yadav

    Abstract: Multi-modal optimization involves identifying multiple global and local optima of a function, offering valuable insights into diverse optimal solutions within the search space. Evolutionary algorithms (EAs) excel at finding multiple solutions in a single run, providing a distinct advantage over classical optimization techniques that often require multiple restarts without guarantee of obtaining di… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

  13. arXiv:2502.14083  [pdf, other

    cs.CL

    Are Rules Meant to be Broken? Understanding Multilingual Moral Reasoning as a Computational Pipeline with UniMoral

    Authors: Shivani Kumar, David Jurgens

    Abstract: Moral reasoning is a complex cognitive process shaped by individual experiences and cultural contexts and presents unique challenges for computational analysis. While natural language processing (NLP) offers promising tools for studying this phenomenon, current research lacks cohesion, employing discordant datasets and tasks that examine isolated aspects of moral reasoning. We bridge this gap with… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 21 pages, 10 figures, 8 tables

  14. arXiv:2502.04569  [pdf, other

    cs.HC

    Localization of Vibrotactile Stimuli on the Face

    Authors: Shivani Guptasarma, Allison M. Okamura, Monroe Kennedy III

    Abstract: The face remains relatively unexplored as a target region for haptic feedback, despite providing a considerable surface area consisting of highly sensitive skin. There are promising applications for facial haptic feedback, especially in cases of severe upper limb loss or spinal cord injury, where the face is typically less impacted than other body parts. Moreover, the neural representation of the… ▽ More

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: 5 pages, 5 figures

  15. arXiv:2501.19395  [pdf, other

    cs.RO

    Precision Harvesting in Cluttered Environments: Integrating End Effector Design with Dual Camera Perception

    Authors: Kendall Koe, Poojan Kalpeshbhai Shah, Benjamin Walt, Jordan Westphal, Samhita Marri, Shivani Kamtikar, James Seungbum Nam, Naveen Kumar Uppalapati, Girish Krishnan, Girish Chowdhary

    Abstract: Due to labor shortages in specialty crop industries, a need for robotic automation to increase agricultural efficiency and productivity has arisen. Previous manipulation systems perform well in harvesting in uncluttered and structured environments. High tunnel environments are more compact and cluttered in nature, requiring a rethinking of the large form factor systems and grippers. We propose a n… ▽ More

    Submitted 31 January, 2025; originally announced January 2025.

  16. arXiv:2501.18493  [pdf, ps, other

    cs.HC

    Examining the Expanding Role of Synthetic Data Throughout the AI Development Pipeline

    Authors: Shivani Kapania, Stephanie Ballard, Alex Kessler, Jennifer Wortman Vaughan

    Abstract: Alongside the growth of generative AI, we are witnessing a surge in the use of synthetic data across all stages of the AI development pipeline. It is now common practice for researchers and practitioners to use one large generative model (which we refer to as an auxiliary model) to generate synthetic data that is used to train or evaluate another, reconfiguring AI workflows and reshaping the very… ▽ More

    Submitted 12 May, 2025; v1 submitted 30 January, 2025; originally announced January 2025.

  17. arXiv:2412.14527  [pdf, other

    stat.ML cs.LG

    Statistical Undersampling with Mutual Information and Support Points

    Authors: Alex Mak, Shubham Sahoo, Shivani Pandey, Yidan Yue, Linglong Kong

    Abstract: Class imbalance and distributional differences in large datasets present significant challenges for classification tasks machine learning, often leading to biased models and poor predictive performance for minority classes. This work introduces two novel undersampling approaches: mutual information-based stratified simple random sampling and support points optimization. These methods prioritize re… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  18. arXiv:2411.11575  [pdf

    cs.NE

    Analysis of Generalized Hebbian Learning Algorithm for Neuromorphic Hardware Using Spinnaker

    Authors: Shivani Sharma, Darshika G. Perera

    Abstract: Neuromorphic computing, inspired by biological neural networks, has emerged as a promising approach for solving complex machine learning tasks with greater efficiency and lower power consumption. The integration of biologically plausible learning algorithms, such as the Generalized Hebbian Algorithm (GHA), is key to enhancing the performance of neuromorphic systems. In this paper, we explore the a… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

    Comments: 8 pages, 1 figure, 7 tables

  19. arXiv:2411.09607  [pdf, other

    cs.IR cs.CL

    Initial Nugget Evaluation Results for the TREC 2024 RAG Track with the AutoNuggetizer Framework

    Authors: Ronak Pradeep, Nandan Thakur, Shivani Upadhyay, Daniel Campos, Nick Craswell, Jimmy Lin

    Abstract: This report provides an initial look at partial results from the TREC 2024 Retrieval-Augmented Generation (RAG) Track. We have identified RAG evaluation as a barrier to continued progress in information access (and more broadly, natural language processing and artificial intelligence), and it is our hope that we can contribute to tackling the many challenges in this space. The central hypothesis w… ▽ More

    Submitted 14 November, 2024; originally announced November 2024.

  20. arXiv:2411.08275  [pdf, other

    cs.IR cs.CL

    A Large-Scale Study of Relevance Assessments with Large Language Models: An Initial Look

    Authors: Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Daniel Campos, Nick Craswell, Ian Soboroff, Hoa Trang Dang, Jimmy Lin

    Abstract: The application of large language models to provide relevance assessments presents exciting opportunities to advance information retrieval, natural language processing, and beyond, but to date many unknowns remain. This paper reports on the results of a large-scale evaluation (the TREC 2024 RAG Track) where four different relevance assessment approaches were deployed in situ: the "standard" fully… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  21. arXiv:2409.19430  [pdf, other

    cs.HC cs.CL cs.LG

    'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants

    Authors: Shivani Kapania, William Agnew, Motahhare Eslami, Hoda Heidari, Sarah Fox

    Abstract: The recent excitement around generative models has sparked a wave of proposals suggesting the replacement of human participation and labor in research and development--e.g., through surveys, experiments, and interviews--with synthetic research data generated by large language models (LLMs). We conducted interviews with 19 qualitative researchers to understand their perspectives on this paradigm sh… ▽ More

    Submitted 28 September, 2024; originally announced September 2024.

  22. arXiv:2409.15994  [pdf, other

    cs.NE

    A Multi-operator Ensemble LSHADE with Restart and Local Search Mechanisms for Single-objective Optimization

    Authors: Dikshit Chauhan, Anupam Trivedi, Shivani

    Abstract: In recent years, multi-operator and multi-method algorithms have succeeded, encouraging their combination within single frameworks. Despite promising results, there remains room for improvement as only some evolutionary algorithms (EAs) consistently excel across all optimization problems. This paper proposes mLSHADE-RL, an enhanced version of LSHADE-cnEpSin, which is one of the winners of the CEC… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  23. arXiv:2409.08330  [pdf, other

    cs.CL cs.CY cs.HC

    Real or Robotic? Assessing Whether LLMs Accurately Simulate Qualities of Human Responses in Dialogue

    Authors: Jonathan Ivey, Shivani Kumar, Jiayu Liu, Hua Shen, Sushrita Rakshit, Rohan Raju, Haotian Zhang, Aparna Ananthasubramaniam, Junghwan Kim, Bowen Yi, Dustin Wright, Abraham Israeli, Anders Giovanni Møller, Lechen Zhang, David Jurgens

    Abstract: Studying and building datasets for dialogue tasks is both expensive and time-consuming due to the need to recruit, train, and collect data from study participants. In response, much recent work has sought to use large language models (LLMs) to simulate both human-human and human-LLM interactions, as they have been shown to generate convincingly human-like text in many settings. However, to what ex… ▽ More

    Submitted 16 September, 2024; v1 submitted 12 September, 2024; originally announced September 2024.

  24. arXiv:2408.13182  [pdf, other

    cs.IT eess.SP

    Target Detection for OTFS-Aided Cell-Free MIMO ISAC System

    Authors: Shivani Singh, Amudheesan Nakkeeran, Prem Singh, Ekant Sharma, Jyotsna Bapat

    Abstract: This letter focuses on enhancing target detection performance for a multi-user integrated sensing and communication (ISAC) system using orthogonal time frequency space (OTFS)-aided cell-free multiple-input multiple-output (MIMO) technology in high-speed vehicular environments. We propose a sensing-centric (SC) approach for target detection using communication signals with or without sensing signal… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  25. arXiv:2408.10816  [pdf, other

    eess.SP cs.LG

    Deep Learning-based Classification of Dementia using Image Representation of Subcortical Signals

    Authors: Shivani Ranjan, Ayush Tripathi, Harshal Shende, Robin Badal, Amit Kumar, Pramod Yadav, Deepak Joshi, Lalan Kumar

    Abstract: Dementia is a neurological syndrome marked by cognitive decline. Alzheimer's disease (AD) and Frontotemporal dementia (FTD) are the common forms of dementia, each with distinct progression patterns. EEG, a non-invasive tool for recording brain activity, has shown potential in distinguishing AD from FTD and mild cognitive impairment (MCI). Previous studies have utilized various EEG features, such a… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  26. arXiv:2408.00112  [pdf, other

    cs.CV

    Automated Sperm Morphology Analysis Based on Instance-Aware Part Segmentation

    Authors: Wenyuan Chen, Haocong Song, Changsheng Dai, Aojun Jiang, Guanqiao Shan, Hang Liu, Yanlong Zhou, Khaled Abdalla, Shivani N Dhanani, Katy Fatemeh Moosavi, Shruti Pathak, Clifford Librach, Zhuoran Zhang, Yu Sun

    Abstract: Traditional sperm morphology analysis is based on tedious manual annotation. Automated morphology analysis of a high number of sperm requires accurate segmentation of each sperm part and quantitative morphology evaluation. State-of-the-art instance-aware part segmentation networks follow a "detect-then-segment" paradigm. However, due to sperm's slim shape, their segmentation suffers from large con… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted to ICRA 2024

  27. arXiv:2407.09688  [pdf, other

    cs.CL

    Large Language Models for Integrating Social Determinant of Health Data: A Case Study on Heart Failure 30-Day Readmission Prediction

    Authors: Chase Fensore, Rodrigo M. Carrillo-Larco, Shivani A. Patel, Alanna A. Morris, Joyce C. Ho

    Abstract: Social determinants of health (SDOH) $-$ the myriad of circumstances in which people live, grow, and age $-$ play an important role in health outcomes. However, existing outcome prediction models often only use proxies of SDOH as features. Recent open data initiatives present an opportunity to construct a more comprehensive view of SDOH, but manually integrating the most relevant data for individu… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 36 pages including references and appendix. This is a work in progress

  28. arXiv:2407.05025  [pdf, other

    cs.RO cs.HC eess.SY

    ProACT: An Augmented Reality Testbed for Intelligent Prosthetic Arms

    Authors: Shivani Guptasarma, Monroe D. Kennedy III

    Abstract: Upper-limb amputees face tremendous difficulty in operating dexterous powered prostheses. Previous work has shown that aspects of prosthetic hand, wrist, or elbow control can be improved through "intelligent" control, by combining movement-based or gaze-based intent estimation with low-level robotic autonomy. However, no such solutions exist for whole-arm control. Moreover, hardware platforms for… ▽ More

    Submitted 2 December, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: 12 pages, 8 figures. Under review. Code and data are available at https://arm.stanford.edu/proact

    Journal ref: IEEE Transactions on Neural Systems and Rehabilitation Engineering, vol. 33, pp. 354-365, 2025

  29. arXiv:2406.17720  [pdf, other

    cs.CV

    BioTrove: A Large Curated Image Dataset Enabling AI for Biodiversity

    Authors: Chih-Hsuan Yang, Benjamin Feuer, Zaki Jubery, Zi K. Deng, Andre Nakkab, Md Zahid Hasan, Shivani Chiranjeevi, Kelly Marshall, Nirmal Baishnab, Asheesh K Singh, Arti Singh, Soumik Sarkar, Nirav Merchant, Chinmay Hegde, Baskar Ganapathysubramanian

    Abstract: We introduce BioTrove, the largest publicly accessible dataset designed to advance AI applications in biodiversity. Curated from the iNaturalist platform and vetted to include only research-grade data, BioTrove contains 161.9 million images, offering unprecedented scale and diversity from three primary kingdoms: Animalia ("animals"), Fungi ("fungi"), and Plantae ("plants"), spanning approximately… ▽ More

    Submitted 27 January, 2025; v1 submitted 25 June, 2024; originally announced June 2024.

  30. arXiv:2406.06519  [pdf, other

    cs.IR

    UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor

    Authors: Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Nick Craswell, Jimmy Lin

    Abstract: Copious amounts of relevance judgments are necessary for the effective training and accurate evaluation of retrieval systems. Conventionally, these judgments are made by human assessors, rendering this process expensive and laborious. A recent study by Thomas et al. from Microsoft Bing suggested that large language models (LLMs) can accurately perform the relevance assessment task and provide huma… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 5 pages, 3 figures

  31. arXiv:2405.10311  [pdf, other

    cs.IR

    UniRAG: Universal Retrieval Augmentation for Large Vision Language Models

    Authors: Sahel Sharifymoghaddam, Shivani Upadhyay, Wenhu Chen, Jimmy Lin

    Abstract: Recently, Large Vision Language Models (LVLMs) have unlocked many complex use cases that require Multi-Modal (MM) understanding (e.g., image captioning or visual question answering) and MM generation (e.g., text-guided image generation or editing) capabilities. To further improve the output fidelityof LVLMs we introduce UniRAG, a plug-and-play technique that adds relevant retrieved information to… ▽ More

    Submitted 9 March, 2025; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 14 pages, 6 figures

  32. arXiv:2405.04727  [pdf, other

    cs.IR

    LLMs Can Patch Up Missing Relevance Judgments in Evaluation

    Authors: Shivani Upadhyay, Ehsan Kamalloo, Jimmy Lin

    Abstract: Unjudged documents or holes in information retrieval benchmarks are considered non-relevant in evaluation, yielding no gains in measuring effectiveness. However, these missing judgments may inadvertently introduce biases into the evaluation as their prevalence for a retrieval model is heavily contingent on the pooling process. Thus, filling holes becomes crucial in ensuring reliable and accurate e… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures

  33. arXiv:2405.01674  [pdf

    cs.CR

    Generative AI in Cybersecurity

    Authors: Shivani Metta, Isaac Chang, Jack Parker, Michael P. Roman, Arturo F. Ehuan

    Abstract: The dawn of Generative Artificial Intelligence (GAI), characterized by advanced models such as Generative Pre-trained Transformers (GPT) and other Large Language Models (LLMs), has been pivotal in reshaping the field of data analysis, pattern recognition, and decision-making processes. This surge in GAI technology has ushered in not only innovative opportunities for data processing and automation… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  34. arXiv:2404.14591  [pdf, other

    cs.CE

    Predicting the Temporal Dynamics of Prosthetic Vision

    Authors: Yuchen Hou, Laya Pullela, Jiaxin Su, Sriya Aluru, Shivani Sista, Xiankun Lu, Michael Beyeler

    Abstract: Retinal implants are a promising treatment option for degenerative retinal disease. While numerous models have been developed to simulate the appearance of elicited visual percepts ("phosphenes"), these models often either focus solely on spatial characteristics or inadequately capture the complex temporal dynamics observed in clinical trials, which vary heavily across implant technologies, subjec… ▽ More

    Submitted 1 May, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

  35. arXiv:2403.19876  [pdf, other

    cs.HC

    "I'm categorizing LLM as a productivity tool": Examining ethics of LLM use in HCI research practices

    Authors: Shivani Kapania, Ruiyi Wang, Toby Jia-Jun Li, Tianshi Li, Hong Shen

    Abstract: Large language models are increasingly applied in real-world scenarios, including research and education. These models, however, come with well-known ethical issues, which may manifest in unexpected ways in human-computer interaction research due to the extensive engagement with human subjects. This paper reports on research practices related to LLM use, drawing on 16 semi-structured interviews an… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  36. arXiv:2403.00887  [pdf, other

    eess.AS cs.AI cs.CL cs.LG cs.SD

    SEGAA: A Unified Approach to Predicting Age, Gender, and Emotion in Speech

    Authors: Aron R, Indra Sigicharla, Chirag Periwal, Mohanaprasad K, Nithya Darisini P S, Sourabh Tiwari, Shivani Arora

    Abstract: The interpretation of human voices holds importance across various applications. This study ventures into predicting age, gender, and emotion from vocal cues, a field with vast applications. Voice analysis tech advancements span domains, from improving customer interactions to enhancing healthcare and retail experiences. Discerning emotions aids mental health, while age and gender detection are vi… ▽ More

    Submitted 1 March, 2024; originally announced March 2024.

  37. arXiv:2402.19052  [pdf

    cs.CL cs.HC

    Exploring the Efficacy of Large Language Models in Summarizing Mental Health Counseling Sessions: A Benchmark Study

    Authors: Prottay Kumar Adhikary, Aseem Srivastava, Shivani Kumar, Salam Michael Singh, Puneet Manuja, Jini K Gopinath, Vijay Krishnan, Swati Kedia, Koushik Sinha Deb, Tanmoy Chakraborty

    Abstract: Comprehensive summaries of sessions enable an effective continuity in mental health counseling, facilitating informed therapy planning. Yet, manual summarization presents a significant challenge, diverting experts' attention from the core counseling process. This study evaluates the effectiveness of state-of-the-art Large Language Models (LLMs) in selectively summarizing various components of ther… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  38. arXiv:2402.18944  [pdf, other

    cs.CL cs.AI

    SemEval 2024 -- Task 10: Emotion Discovery and Reasoning its Flip in Conversation (EDiReF)

    Authors: Shivani Kumar, Md Shad Akhtar, Erik Cambria, Tanmoy Chakraborty

    Abstract: We present SemEval-2024 Task 10, a shared task centred on identifying emotions and finding the rationale behind their flips within monolingual English and Hindi-English code-mixed dialogues. This task comprises three distinct subtasks - emotion recognition in conversation for code-mixed dialogues, emotion flip reasoning for code-mixed dialogues, and emotion flip reasoning for English dialogues. Pa… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

    Comments: 11 pages, 3 figures, 7 tables

  39. arXiv:2402.18354  [pdf, other

    physics.ao-ph cs.LG physics.comp-ph physics.flu-dyn

    SuperdropNet: a Stable and Accurate Machine Learning Proxy for Droplet-based Cloud Microphysics

    Authors: Shivani Sharma, David Greenberg

    Abstract: Cloud microphysics has important consequences for climate and weather phenomena, and inaccurate representations can limit forecast accuracy. While atmospheric models increasingly resolve storms and clouds, the accuracy of the underlying microphysics remains limited by computationally expedient bulk moment schemes based on simplifying assumptions. Droplet-based Lagrangian schemes are more accurate… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

  40. arXiv:2402.05398  [pdf, other

    cs.CV

    On the Effect of Image Resolution on Semantic Segmentation

    Authors: Ritambhara Singh, Abhishek Jain, Pietro Perona, Shivani Agarwal, Junfeng Yang

    Abstract: High-resolution semantic segmentation requires substantial computational resources. Traditional approaches in the field typically downscale the input images before processing and then upscale the low-resolution outputs back to their original dimensions. While this strategy effectively identifies broad regions, it often misses finer details. In this study, we demonstrate that a streamlined model ca… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2209.08667 by other authors

  41. arXiv:2402.04744  [pdf, other

    cs.LG cs.AR

    Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers

    Authors: Abhimanyu Rajeshkumar Bambhaniya, Amir Yazdanbakhsh, Suvinay Subramanian, Sheng-Chun Kao, Shivani Agrawal, Utku Evci, Tushar Krishna

    Abstract: N:M Structured sparsity has garnered significant interest as a result of relatively modest overhead and improved efficiency. Additionally, this form of sparsity holds considerable appeal for reducing the memory footprint owing to their modest representation overhead. There have been efforts to develop training recipes for N:M structured sparsity, they primarily focus on low-sparsity regions (… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 18 pages, 8 figures, 17 tables. Code is available at https://github.com/abhibambhaniya/progressive_gradient_flow_nm_sparsity

  42. arXiv:2402.01787  [pdf, other

    cs.CY cs.AI cs.LG

    Harm Amplification in Text-to-Image Models

    Authors: Susan Hao, Renee Shelby, Yuchi Liu, Hansa Srinivasan, Mukul Bhutani, Burcu Karagol Ayan, Ryan Poplin, Shivani Poddar, Sarah Laszlo

    Abstract: Text-to-image (T2I) models have emerged as a significant advancement in generative AI; however, there exist safety concerns regarding their potential to produce harmful image outputs even when users input seemingly safe prompts. This phenomenon, where T2I models generate harmful representations that were not explicit in the input prompt, poses a potentially greater risk than adversarial prompts, l… ▽ More

    Submitted 15 August, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  43. arXiv:2402.01055  [pdf, other

    cs.LG stat.ML

    Multiclass Learning from Noisy Labels for Non-decomposable Performance Measures

    Authors: Mingyuan Zhang, Shivani Agarwal

    Abstract: There has been much interest in recent years in learning good classifiers from data with noisy labels. Most work on learning from noisy labels has focused on standard loss-based performance measures. However, many machine learning problems require using non-decomposable performance measures which cannot be expressed as the expectation or sum of a loss on individual examples; these include for exam… ▽ More

    Submitted 23 April, 2024; v1 submitted 1 February, 2024; originally announced February 2024.

  44. arXiv:2401.12995  [pdf, other

    cs.CL

    Harmonizing Code-mixed Conversations: Personality-assisted Code-mixed Response Generation in Dialogues

    Authors: Shivani Kumar, Tanmoy Chakraborty

    Abstract: Code-mixing, the blending of multiple languages within a single conversation, introduces a distinctive challenge, particularly in the context of response generation. Capturing the intricacies of code-mixing proves to be a formidable task, given the wide-ranging variations influenced by individual speaking styles and cultural backgrounds. In this study, we explore response generation within code-mi… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: 14 pages, 8 figures, 7 tables. Accepted at EACL (findings) 2024

  45. arXiv:2401.01596  [pdf, other

    cs.AI cs.CL

    MedSumm: A Multimodal Approach to Summarizing Code-Mixed Hindi-English Clinical Queries

    Authors: Akash Ghosh, Arkadeep Acharya, Prince Jha, Aniket Gaudgaul, Rajdeep Majumdar, Sriparna Saha, Aman Chadha, Raghav Jain, Setu Sinha, Shivani Agarwal

    Abstract: In the healthcare domain, summarizing medical questions posed by patients is critical for improving doctor-patient interactions and medical decision-making. Although medical data has grown in complexity and quantity, the current body of research in this domain has primarily concentrated on text-based methods, overlooking the integration of visual cues. Also prior works in the area of medical quest… ▽ More

    Submitted 3 January, 2024; originally announced January 2024.

    Comments: ECIR 2024

  46. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  47. arXiv:2312.08553  [pdf, other

    eess.AS cs.SD

    USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models

    Authors: Shaojin Ding, David Qiu, David Rim, Yanzhang He, Oleg Rybakov, Bo Li, Rohit Prabhavalkar, Weiran Wang, Tara N. Sainath, Zhonglin Han, Jian Li, Amir Yazdanbakhsh, Shivani Agrawal

    Abstract: End-to-end automatic speech recognition (ASR) models have seen revolutionary quality gains with the recent development of large-scale universal speech models (USM). However, deploying these massive USMs is extremely expensive due to the enormous memory usage and computational cost. Therefore, model compression is an important research topic to fit USM-based ASR under budget in real-world scenarios… ▽ More

    Submitted 16 January, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted by ICASSP 2024. Preprint

  48. arXiv:2311.14635  [pdf

    cs.CV cs.RO

    Automated Detection and Counting of Windows using UAV Imagery based Remote Sensing

    Authors: Dhruv Patel, Shivani Chepuri, Sarvesh Thakur, K. Harikumar, Ravi Kiran S., K. Madhava Krishna

    Abstract: Despite the technological advancements in the construction and surveying sector, the inspection of salient features like windows in an under-construction or existing building is predominantly a manual process. Moreover, the number of windows present in a building is directly related to the magnitude of deformation it suffers under earthquakes. In this research, a method to accurately detect and co… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  49. arXiv:2311.09086  [pdf, other

    cs.CL cs.AI cs.SI

    The Uli Dataset: An Exercise in Experience Led Annotation of oGBV

    Authors: Arnav Arora, Maha Jinadoss, Cheshta Arora, Denny George, Brindaalakshmi, Haseena Dawood Khan, Kirti Rawat, Div, Ritash, Seema Mathur, Shivani Yadav, Shehla Rashid Shora, Rie Raut, Sumit Pawar, Apurva Paithane, Sonia, Vivek, Dharini Priscilla, Khairunnisha, Grace Banu, Ambika Tandon, Rishav Thakker, Rahul Dev Korra, Aatman Vaidya, Tarunima Prabhakar

    Abstract: Online gender based violence has grown concomitantly with adoption of the internet and social media. Its effects are worse in the Global majority where many users use social media in languages other than English. The scale and volume of conversations on the internet has necessitated the need for automated detection of hate speech, and more specifically gendered abuse. There is, however, a lack of… ▽ More

    Submitted 24 June, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  50. arXiv:2310.19336  [pdf, other

    cs.RO eess.SY

    Considerations for the Control Design of Augmentative Robots

    Authors: Shivani Guptasarma, Monroe Kennedy III

    Abstract: Robotic systems that are intended to augment human capabilities commonly require the use of semi-autonomous control and artificial sensing, while at the same time aiming to empower the user to make decisions and take actions. This work identifies principles and techniques from the literature that can help to resolve this apparent contradiction. It is postulated that augmentative robots must functi… ▽ More

    Submitted 31 October, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: 7 pages. Presented at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2021) Workshop on Building and Evaluating Ethical Robotic Systems, Prague, Czech Republic, 28-30 September 2021