
Showing 1–50 of 170 results for author: Krishnan, S

Searching in archive cs.
  1. arXiv:2509.14528  [pdf, ps, other]

    cs.HC

    Why Johnny Can't Use Agents: Industry Aspirations vs. User Realities with AI Agent Software

    Authors: Pradyumna Shome, Sashreek Krishnan, Sauvik Das

    Abstract: There is growing imprecision about what "AI agents" are, what they can do, and how effectively they can be used by their intended users. We pose two key research questions: (i) How does the tech industry conceive of and market "AI agents"? (ii) What challenges do end-users face when attempting to use commercial AI agents for their advertised uses? We first performed a systematic review of marketed…

    Submitted 17 September, 2025; originally announced September 2025.

  2. arXiv:2509.04498  [pdf, ps, other]

    cs.CL cs.AI

    Where Should I Study? Biased Language Models Decide! Evaluating Fairness in LMs for Academic Recommendations

    Authors: Krithi Shailya, Akhilesh Kumar Mishra, Gokul S Krishnan, Balaraman Ravindran

    Abstract: Large Language Models (LLMs) are increasingly used as daily recommendation systems for tasks like education planning, yet their recommendations risk perpetuating societal biases. This paper empirically examines geographic, demographic, and economic biases in university and program suggestions from three open-source LLMs: LLaMA-3.1-8B, Gemma-7B, and Mistral-7B. Using 360 simulated user profiles var…

    Submitted 1 September, 2025; originally announced September 2025.

  3. arXiv:2509.02007  [pdf, ps, other]

    cs.AI

    mFARM: Towards Multi-Faceted Fairness Assessment based on HARMs in Clinical Decision Support

    Authors: Shreyash Adappanavar, Krithi Shailya, Gokul S Krishnan, Sriraam Natarajan, Balaraman Ravindran

    Abstract: The deployment of Large Language Models (LLMs) in high-stakes medical settings poses a critical AI alignment challenge, as models can inherit and amplify societal biases, leading to significant disparities. Existing fairness evaluation methods fall short in these contexts as they typically use simplistic metrics that overlook the multi-dimensional nature of medical harms. This also promotes models…

    Submitted 2 September, 2025; originally announced September 2025.

  4. arXiv:2508.13390  [pdf, ps, other]

    cs.IR

    FLAIR: Feedback Learning for Adaptive Information Retrieval

    Authors: William Zhang, Yiwen Zhu, Yunlei Lu, Mathieu Demarne, Wenjing Wang, Kai Deng, Nutan Sahoo, Katherine Lin, Miso Cilimdzic, Subru Krishnan

    Abstract: Recent advances in Large Language Models (LLMs) have driven the adoption of copilots in complex technical scenarios, underscoring the growing need for specialized information retrieval solutions. In this paper, we introduce FLAIR, a lightweight, feedback learning framework that adapts copilot systems' retrieval strategies by integrating domain-specific expert feedback. FLAIR operates in two stages…

    Submitted 18 August, 2025; originally announced August 2025.

    Comments: Accepted to CIKM2025

    ACM Class: H.3

  5. arXiv:2508.06814  [pdf, ps, other]

    cs.DB

    Metadata Management for AI-Augmented Data Workflows

    Authors: Jinjin Zhao, Sanjay Krishnan

    Abstract: AI-augmented data workflows introduce complex governance challenges, as both human and model-driven processes generate, transform, and consume data artifacts. These workflows blend heterogeneous tools, dynamic execution patterns, and opaque model decisions, making comprehensive metadata capture difficult. In this work, we present TableVault, a metadata governance framework designed for human-AI co…

    Submitted 9 August, 2025; originally announced August 2025.

  6. arXiv:2507.19844  [pdf, ps, other]

    cs.LG cs.AI cs.MA eess.SY

    VAE-GAN Based Price Manipulation in Coordinated Local Energy Markets

    Authors: Biswarup Mukherjee, Li Zhou, S. Gokul Krishnan, Milad Kabirifar, Subhash Lakshminarayana, Charalambos Konstantinou

    Abstract: This paper introduces a model for coordinating prosumers with heterogeneous distributed energy resources (DERs), participating in the local energy market (LEM) that interacts with the market-clearing entity. The proposed LEM scheme utilizes a data-driven, model-free reinforcement learning approach based on the multi-agent deep deterministic policy gradient (MADDPG) framework, enabling prosumers to…

    Submitted 26 July, 2025; originally announced July 2025.

    Comments: 2025 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm)

  7. arXiv:2507.16249  [pdf, ps, other]

    cs.LG cs.MA

    Multi-Agent Reinforcement Learning for Sample-Efficient Deep Neural Network Mapping

    Authors: Srivatsan Krishnan, Jason Jabbour, Dan Zhang, Natasha Jaques, Aleksandra Faust, Shayegan Omidshafiei, Vijay Janapa Reddi

    Abstract: Mapping deep neural networks (DNNs) to hardware is critical for optimizing latency, energy consumption, and resource utilization, making it a cornerstone of high-performance accelerator design. Due to the vast and complex mapping space, reinforcement learning (RL) has emerged as a promising approach, but its effectiveness is often limited by sample inefficiency. We present a decentralized multi-age…

    Submitted 22 July, 2025; originally announced July 2025.

  8. arXiv:2507.08002  [pdf]

    cs.HC cs.AI

    Human vs. LLM-Based Thematic Analysis for Digital Mental Health Research: Proof-of-Concept Comparative Study

    Authors: Karisa Parkington, Bazen G. Teferra, Marianne Rouleau-Tang, Argyrios Perivolaris, Alice Rueda, Adam Dubrowski, Bill Kapralos, Reza Samavi, Andrew Greenshaw, Yanbo Zhang, Bo Cao, Yuqi Wu, Sirisha Rambhatla, Sridhar Krishnan, Venkat Bhat

    Abstract: Thematic analysis provides valuable insights into participants' experiences through coding and theme development, but its resource-intensive nature limits its use in large healthcare studies. Large language models (LLMs) can analyze text at scale and identify key content automatically, potentially addressing these challenges. However, their application in mental health interviews needs comparison…

    Submitted 2 May, 2025; originally announced July 2025.

  9. arXiv:2507.06261  [pdf, ps, other]

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu, et al. (3284 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde…

    Submitted 22 July, 2025; v1 submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  10. arXiv:2506.18257  [pdf, ps, other]

    cs.DB

    TableVault: Managing Dynamic Data Collections for LLM-Augmented Workflows

    Authors: Jinjin Zhao, Sanjay Krishnan

    Abstract: Large Language Models (LLMs) have emerged as powerful tools for automating and executing complex data tasks. However, their integration into more complex data workflows introduces significant management challenges. In response, we present TableVault, a data management system designed to handle dynamic data collections in LLM-augmented environments. TableVault meets the demands of these workflows…

    Submitted 22 June, 2025; originally announced June 2025.

  11. arXiv:2506.18255  [pdf, ps, other]

    cs.DB

    Fast Capture of Cell-Level Provenance in Numpy

    Authors: Jinjin Zhao, Sanjay Krishnan

    Abstract: Effective provenance tracking enhances reproducibility, governance, and data quality in array workflows. However, significant challenges arise in capturing this provenance, including: (1) rapidly evolving APIs, (2) diverse operation types, and (3) large-scale datasets. To address these challenges, this paper presents a prototype annotation system designed for arrays, which captures cell-level prov…

    Submitted 22 June, 2025; originally announced June 2025.

  12. arXiv:2506.10999  [pdf]

    cs.SE cs.AI

    Automated Validation of COBOL to Java Transformation

    Authors: Atul Kumar, Diptikalyan Saha, Toshikai Yasue, Kohichi Ono, Saravanan Krishnan, Sandeep Hans, Fumiko Satoh, Gerald Mitchell, Sachin Kumar

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterprise-level code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code. We propose a framework and a tool t…

    Submitted 14 April, 2025; originally announced June 2025.

    Comments: arXiv admin note: text overlap with arXiv:2504.10548

    Journal ref: ASE 2024

  13. arXiv:2505.15020  [pdf, ps, other]

    cs.DC

    COSMIC: Enabling Full-Stack Co-Design and Optimization of Distributed Machine Learning Systems

    Authors: Aditi Raju, Jared Ni, William Won, Changhai Man, Srivatsan Krishnan, Srinivas Sridharan, Amir Yazdanbakhsh, Tushar Krishna, Vijay Janapa Reddi

    Abstract: Large-scale machine learning models necessitate distributed systems, posing significant design challenges due to the large parameter space across distinct design stacks. Existing studies often focus on optimizing individual system aspects in isolation. This work challenges this limitation and introduces COSMIC, a full-stack distributed machine learning systems environment enabling end-to-end simul…

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 11 pages (excluding references), 10 figures, 6 tables

  14. arXiv:2505.06885  [pdf]

    cs.SE cs.IR

    Incremental Analysis of Legacy Applications Using Knowledge Graphs for Application Modernization

    Authors: Saravanan Krishnan, Amith Singhee, Keerthi Narayan Raghunath, Alex Mathai, Atul Kumar, David Wenk

    Abstract: Industries such as banking, telecom, and airlines often have large software systems that are several decades old. Many of these systems are written in old programming languages such as COBOL, PL/1, Assembler, etc. In many cases, the documentation is not updated, and those who developed/designed these systems are no longer around. Understanding these systems for either modernization or even regular…

    Submitted 11 May, 2025; originally announced May 2025.

  15. arXiv:2505.06151  [pdf, ps, other]

    cs.CL

    Estimating Quality in Therapeutic Conversations: A Multi-Dimensional Natural Language Processing Framework

    Authors: Alice Rueda, Argyrios Perivolaris, Niloy Roy, Dylan Weston, Sarmed Shaya, Zachary Cote, Martin Ivanov, Bazen G. Teferra, Yuqi Wu, Sirisha Rambhatla, Divya Sharma, Andrew Greenshaw, Rakesh Jetly, Yanbo Zhang, Bo Cao, Reza Samavi, Sridhar Krishnan, Venkat Bhat

    Abstract: Engagement between client and therapist is a critical determinant of therapeutic success. We propose a multi-dimensional natural language processing (NLP) framework that objectively classifies engagement quality in counseling sessions based on textual transcripts. Using 253 motivational interviewing transcripts (150 high-quality, 103 low-quality), we extracted 42 features across four domains: conv…

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 12 pages, 4 figures, 7 tables

  16. arXiv:2505.01482  [pdf, ps, other]

    cs.AI

    Understanding LLM Scientific Reasoning through Promptings and Model's Explanation on the Answers

    Authors: Alice Rueda, Mohammed S. Hassan, Argyrios Perivolaris, Bazen G. Teferra, Reza Samavi, Sirisha Rambhatla, Yuqi Wu, Yanbo Zhang, Bo Cao, Divya Sharma, Sridhar Krishnan, Venkat Bhat

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding, reasoning, and problem-solving across various domains. However, their ability to perform complex, multi-step reasoning tasks, essential for applications in science, medicine, and law, remains an area of active investigation. This paper examines the reasoning capabilities of contemporary LLMs, ana…

    Submitted 25 July, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

  17. arXiv:2504.10548  [pdf]

    cs.SE cs.AI

    Automated Testing of COBOL to Java Transformation

    Authors: Sandeep Hans, Atul Kumar, Toshikai Yasue, Kouichi Ono, Saravanan Krishnan, Devika Sondhi, Fumiko Satoh, Gerald Mitchell, Sachin Kumar, Diptikalyan Saha

    Abstract: Recent advances in Large Language Model (LLM) based Generative AI techniques have made it feasible to translate enterprise-level code from legacy languages such as COBOL to modern languages such as Java or Python. While the results of LLM-based automatic transformation are encouraging, the resulting code cannot be trusted to correctly translate the original code, making manual validation of transl…

    Submitted 14 April, 2025; originally announced April 2025.

  18. arXiv:2504.06227  [pdf, other]

    cs.CL

    LExT: Towards Evaluating Trustworthiness of Natural Language Explanations

    Authors: Krithi Shailya, Shreya Rajpal, Gokul S Krishnan, Balaraman Ravindran

    Abstract: As Large Language Models (LLMs) become increasingly integrated into high-stakes domains, there have been several approaches proposed toward generating natural language explanations. These explanations are crucial for enhancing the interpretability of a model, especially in sensitive domains like healthcare, where transparency and reliability are key. In light of such explanations being generated b…

    Submitted 8 April, 2025; originally announced April 2025.

  19. arXiv:2503.05675  [pdf, other]

    cs.LG cs.DB

    Algorithmic Data Minimization for Machine Learning over Internet-of-Things Data Streams

    Authors: Ted Shaowang, Shinan Liu, Jonatas Marques, Nick Feamster, Sanjay Krishnan

    Abstract: Machine learning can analyze vast amounts of data generated by IoT devices to identify patterns, make predictions, and enable real-time decision-making. By processing sensor data, machine learning models can optimize processes, improve efficiency, and enhance personalized user experiences in smart systems. However, IoT systems are often deployed in sensitive environments such as households and off…

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 9 pages, 18 figures

  20. arXiv:2503.03986  [pdf, other]

    cs.LG cs.AI

    Training neural networks faster with minimal tuning using pre-computed lists of hyperparameters for NAdamW

    Authors: Sourabh Medapati, Priya Kasimbeg, Shankar Krishnan, Naman Agarwal, George Dahl

    Abstract: If we want to train a neural network using any of the most popular optimization algorithms, we are immediately faced with a dilemma: how to set the various optimization and regularization hyperparameters? When computational resources are abundant, there are a variety of methods for finding good hyperparameter settings, but when resources are limited the only realistic choices are using standard de…

    Submitted 5 March, 2025; originally announced March 2025.

    Comments: Good defaults for NadamW Optimizer, generalizes well to unseen problems

  21. arXiv:2503.03056  [pdf, other]

    cs.LG

    A2Perf: Real-World Autonomous Agents Benchmark

    Authors: Ikechukwu Uchendu, Jason Jabbour, Korneel Van den Berghe, Joel Runevic, Matthew Stewart, Jeffrey Ma, Srivatsan Krishnan, Izzeddin Gur, Austin Huang, Colton Bishop, Paige Bailey, Wenjie Jiang, Ebrahim M. Songhori, Sergio Guadarrama, Jie Tan, Jordan K. Terry, Aleksandra Faust, Vijay Janapa Reddi

    Abstract: Autonomous agents and systems cover a number of application areas, from robotics and digital assistants to combinatorial optimization, all sharing common, unresolved research challenges. It is not sufficient for agents to merely solve a given task; they must generalize to out-of-distribution tasks, perform reliably, and use hardware resources efficiently during training and inference, among other…

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 32 pages, 12 figures, preprint

  22. arXiv:2412.12493  [pdf, other]

    cs.DB cs.AI

    A Simple and Fast Way to Handle Semantic Errors in Transactions

    Authors: Jinghan Zeng, Eugene Wu, Sanjay Krishnan

    Abstract: Many computer systems are now being redesigned to incorporate LLM-powered agents, enabling natural language input and more flexible operations. This paper focuses on handling database transactions created by large language models (LLMs). Transactions generated by LLMs may include semantic errors, requiring systems to treat them as long-lived. This allows for human review and, if the transaction is…

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: 14 pages, 13 figures

  23. arXiv:2412.06099  [pdf, other]

    cs.SE cs.AI

    DECO: Life-Cycle Management of Enterprise-Grade Copilots

    Authors: Yiwen Zhu, Mathieu Demarne, Kai Deng, Wenjing Wang, Nutan Sahoo, Divya Vermareddy, Hannah Lerner, Yunlei Lu, Swati Bararia, Anjali Bhavan, William Zhang, Xia Li, Katherine Lin, Miso Cilimdzic, Subru Krishnan

    Abstract: Software engineers frequently grapple with the challenge of accessing disparate documentation and telemetry data, including TroubleShooting Guides (TSGs), incident reports, code repositories, and various internal tools developed by multiple stakeholders. While on-call duties are inevitable, incident resolution becomes even more daunting due to the obscurity of legacy sources and the pressures of s…

    Submitted 10 March, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

  24. Intelligent Pooling: Proactive Resource Provisioning in Large-scale Cloud Service

    Authors: Deepak Ravikumar, Alex Yeo, Yiwen Zhu, Aditya Lakra, Harsha Nagulapalli, Santhosh Kumar Ravindran, Steve Suh, Niharika Dutta, Andrew Fogarty, Yoonjae Park, Sumeet Khushalani, Arijit Tarafdar, Kunal Parekh, Subru Krishnan

    Abstract: The proliferation of big data and analytic workloads has driven the need for cloud compute and cluster-based job processing. With Apache Spark, users can process terabytes of data with ease using hundreds of parallel executors. At Microsoft, we aim at providing a fast and succinct interface for users to run Spark applications, such as through creating simple notebook "sessions" by abstracting the und…

    Submitted 18 November, 2024; originally announced November 2024.

    Journal ref: Proceedings of the VLDB Endowment, Vol. 17, No. 7 ISSN 2150-8097, 2024

  25. Lorentz: Learned SKU Recommendation Using Profile Data

    Authors: Nicholas Glaze, Tria McNeely, Yiwen Zhu, Matthew Gleeson, Helen Serr, Rajeev Bhopi, Subru Krishnan

    Abstract: Cloud operators have expanded their service offerings, known as Stock Keeping Units (SKUs), to accommodate diverse demands, resulting in increased complexity for customers to select appropriate configurations. In a studied system, only 43% of the resource capacity was correctly chosen. Automated solutions addressing this issue often require enriched data, such as workload traces, which are unavail…

    Submitted 18 November, 2024; originally announced November 2024.

    Journal ref: Proc. ACM Manag. Data, Vol. 2, No. 3 (SIGMOD), Article 149. Publication date: June 2024

  26. arXiv:2411.01251  [pdf]

    eess.IV cs.CV cs.LG

    Enhancing Diabetic Retinopathy Detection with CNN-Based Models: A Comparative Study of UNET and Stacked UNET Architectures

    Authors: Ameya Uppina, S Navaneetha Krishnan, Talluri Krishna Sai Teja, Nikhil N Iyer, Joe Dhanith P R

    Abstract: Diabetic Retinopathy (DR) is a severe complication of diabetes. Damaged or abnormal blood vessels can cause loss of vision. The need for massive screening of a large population of diabetic patients has generated an interest in a computer-aided fully automatic diagnosis of DR. In the realm of Deep learning frameworks, particularly convolutional neural networks (CNNs), have shown great interest and prom…

    Submitted 20 January, 2025; v1 submitted 2 November, 2024; originally announced November 2024.

  27. arXiv:2409.19044  [pdf, other]

    cs.CL cs.AI cs.LG

    On the Inductive Bias of Stacking Towards Improving Reasoning

    Authors: Nikunj Saunshi, Stefani Karp, Shankar Krishnan, Sobhan Miryoosefi, Sashank J. Reddi, Sanjiv Kumar

    Abstract: Given the increasing scale of model sizes, novel training strategies like gradual stacking [Gong et al., 2019, Reddi et al., 2023] have garnered interest. Stacking enables efficient training by gradually growing the depth of a model in stages and using layers from a smaller model in an earlier stage to initialize the next stage. Although efficient for training, the model biases induced by such gro…

    Submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted at NeurIPS 2024

  28. arXiv:2409.17048  [pdf, other]

    cs.LG cs.NI eess.SP

    Predictive Covert Communication Against Multi-UAV Surveillance Using Graph Koopman Autoencoder

    Authors: Sivaram Krishnan, Jihong Park, Gregory Sherman, Benjamin Campbell, Jinho Choi

    Abstract: Low Probability of Detection (LPD) communication aims to obscure the presence of radio frequency (RF) signals to evade surveillance. In the context of mobile surveillance utilizing unmanned aerial vehicles (UAVs), achieving LPD communication presents significant challenges due to the UAVs' rapid and continuous movements, which are characterized by unknown nonlinear dynamics. Therefore, accurately…

    Submitted 25 September, 2024; originally announced September 2024.

  29. arXiv:2409.16441  [pdf, other]

    eess.IV cs.CV cs.LG

    A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation

    Authors: Avisha Kumar, Kunal Kotkar, Kelly Jiang, Meghana Bhimreddy, Daniel Davidar, Carly Weber-Levine, Siddharth Krishnan, Max J. Kerensky, Ruixing Liang, Kelley Kempski Leadingham, Denis Routkevitch, Andrew M. Hersh, Kimberly Ashayeri, Betty Tyler, Ian Suk, Jennifer Son, Nicholas Theodore, Nitish Thakor, Amir Manbachi

    Abstract: While deep learning has catalyzed breakthroughs across numerous domains, its broader adoption in clinical settings is inhibited by the costly and time-intensive nature of data acquisition and annotation. To further facilitate medical machine learning, we present an ultrasound dataset of 10,223 Brightness-mode (B-mode) images consisting of sagittal slices of porcine spinal cords (N=25) before and a…

    Submitted 24 September, 2024; originally announced September 2024.

  30. arXiv:2408.11303  [pdf, other]

    cs.LG eess.SP

    Koopman AutoEncoder via Singular Value Decomposition for Data-Driven Long-Term Prediction

    Authors: Jinho Choi, Sivaram Krishnan, Jihong Park

    Abstract: The Koopman autoencoder, a data-driven technique, has gained traction for modeling nonlinear dynamics using deep learning methods in recent years. Given the linear characteristics inherent to the Koopman operator, controlling its eigenvalues offers an opportunity to enhance long-term prediction performance, a critical task for forecasting future trends in time-series datasets with long-term behavi…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 6 pages, 5 figures, to be presented at IEEE MLSP 2024

  31. arXiv:2407.13103  [pdf]

    cs.CY

    Participatory Approaches in AI Development and Governance: Case Studies

    Authors: Ambreesh Parthasarathy, Aditya Phalnikar, Gokul S Krishnan, Ameen Jauhar, Balaraman Ravindran

    Abstract: This paper forms the second of a two-part series on the value of a participatory approach to AI development and deployment. The first paper had crafted a principled, as well as pragmatic, justification for deploying participatory methods in these two exercises (that is, development and deployment of AI). The pragmatic justification is that it improves the quality of the overall algorithm by provid…

    Submitted 3 June, 2024; originally announced July 2024.

  32. arXiv:2407.13100  [pdf]

    cs.CY

    Participatory Approaches in AI Development and Governance: A Principled Approach

    Authors: Ambreesh Parthasarathy, Aditya Phalnikar, Ameen Jauhar, Dhruv Somayajula, Gokul S Krishnan, Balaraman Ravindran

    Abstract: The widespread adoption of Artificial Intelligence (AI) technologies in the public and private sectors has resulted in them significantly impacting the lives of people in new and unexpected ways. In this context, it becomes important to inquire how their design, development and deployment take place. Upon this inquiry, it is seen that persons who will be impacted by the deployment of these system…

    Submitted 3 June, 2024; originally announced July 2024.

  33. arXiv:2407.09141  [pdf, other]

    cs.LG

    Accuracy is Not All You Need

    Authors: Abhinav Dutta, Sanjeev Krishnan, Nipun Kwatra, Ramachandran Ramjee

    Abstract: When Large Language Models (LLMs) are compressed using techniques such as quantization, the predominant way to demonstrate the validity of such techniques is by measuring the model's accuracy on various benchmarks. If the accuracies of the baseline model and the compressed model are close, it is assumed that there was negligible degradation in quality. However, even when the accuracy of baseline and…

    Submitted 12 July, 2024; originally announced July 2024.

    Journal ref: https://proceedings.neurips.cc/paper_files/paper/2024/hash/e0e956681b04ac126679e8c7dd706b2e-Abstract-Conference.html

  34. arXiv:2407.07858  [pdf, other]

    cs.LG cs.CL

    FACTS About Building Retrieval Augmented Generation-based Chatbots

    Authors: Rama Akkiraju, Anbang Xu, Deepak Bora, Tan Yu, Lu An, Vishal Seth, Aaditya Shukla, Pritam Gundecha, Hridhay Mehta, Ashwin Jha, Prithvi Raj, Abhinav Balasubramanian, Murali Maram, Guru Muthusamy, Shivakesh Reddy Annepally, Sidney Knowles, Min Du, Nick Burnett, Sean Javiya, Ashok Marannan, Mamta Kumari, Surbhi Jha, Ethan Dereszenski, Anupam Chakraborty, Subhash Ranjan, et al. (13 additional authors not shown)

    Abstract: Enterprise chatbots, powered by generative AI, are emerging as key applications to enhance employee productivity. Retrieval Augmented Generation (RAG), Large Language Models (LLMs), and orchestration frameworks like Langchain and Llamaindex are crucial for building these chatbots. However, creating effective enterprise chatbots is challenging and requires meticulous RAG pipeline engineering. This…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 8 pages, 6 figures, 2 tables, Preprint submission to ACM CIKM 2024

  35. arXiv:2405.17845  [pdf, other]

    cs.HC

    A System for Quantifying Data Science Workflows with Fine-Grained Procedural Logging and a Pilot Study

    Authors: Jinjin Zhao, Avidgor Gal, Sanjay Krishnan

    Abstract: It is important for researchers to understand precisely how data scientists turn raw data into insights, including typical programming patterns, workflow, and methodology. This paper contributes a novel system, called DataInquirer, that tracks incremental code executions in Jupyter notebooks (a type of computational notebook). The system allows us to quantitatively measure timing, workflow, and op…

    Submitted 28 May, 2024; originally announced May 2024.

  36. arXiv:2405.17701  [pdf, other]

    cs.DB

    Compression and In-Situ Query Processing for Fine-Grained Array Lineage

    Authors: Jinjin Zhao, Sanjay Krishnan

    Abstract: Tracking data lineage is important for data integrity, reproducibility, and debugging data science workflows. However, fine-grained lineage (i.e., at a cell level) is challenging to store, even for the smallest datasets. This paper introduces DSLog, a storage system that efficiently stores, indexes, and queries array data lineage, agnostic to capture methodology. A main contribution is our new com…

    Submitted 27 May, 2024; originally announced May 2024.

  37. arXiv:2405.17690  [pdf, other]

    cs.HC

    Data Makes Better Data Scientists

    Authors: Jinjin Zhao, Avidgor Gal, Sanjay Krishnan

    Abstract: With the goal of identifying common practices in data science projects, this paper proposes a framework for logging and understanding incremental code executions in Jupyter notebooks. This framework aims to allow reasoning about how insights are generated in data science and extract key observations into best data science practices in the wild. In this paper, we show an early prototype of this fra…

    Submitted 27 May, 2024; originally announced May 2024.

  38. arXiv:2405.17686  [pdf, other]

    cs.CV

    Towards Causal Physical Error Discovery in Video Analytics Systems

    Authors: Jinjin Zhao, Ted Shaowang, Stavos Sintos, Sanjay Krishnan

    Abstract: Video analytics systems based on deep learning models are often opaque and brittle and require explanation systems to help users debug. Current model explanation systems are very good at giving literal explanations of behavior in terms of pixel contributions but cannot integrate information about the physical or systems processes that might influence a prediction. This paper introduces the idea tha…

    Submitted 27 May, 2024; originally announced May 2024.

  39. arXiv:2405.15893  [pdf, other]

    cs.SI

    Quantifying Influencer Impact on Affective Polarization

    Authors: Rezaur Rashid, Joshua Melton, Ouldouz Ghorbani, Siddharth Krishnan, Shannon Reid, Gabriel Terejanu

    Abstract: In today's digital age, social media platforms play a crucial role in shaping public opinion. This study explores how discussions led by influencers on Twitter, now known as 'X', affect public sentiment and contribute to online polarization. We developed a counterfactual framework to analyze the polarization scores of conversations in scenarios both with and without the presence of an influential…

    Submitted 16 September, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  40. arXiv:2405.07870  [pdf]

    cs.SE

    Mapping the Invisible: A Framework for Tracking COVID-19 Spread Among College Students with Google Location Data

    Authors: Prajindra Sankar Krishnan, Chai Phing Chen, Gamal Alkawsi, Sieh Kiong Tiong, Luiz Fernando Capretz

    Abstract: The COVID-19 pandemic and the implementation of social distancing policies have rapidly changed people's visiting patterns, as reflected in mobility data that tracks mobility traffic using location trackers on cell phones. However, the frequency and duration of concurrent occupancy at specific locations govern the transmission rather than the number of customers visiting. Therefore, understanding…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 8 pages

    Journal ref: Latin American Workshop on Data Fusion (LAFUSION 2023), November/2023, pp 1-8, Rio de Janeiro, Brazil

  41. Towards Building Autonomous Data Services on Azure

    Authors: Yiwen Zhu, Yuanyuan Tian, Joyce Cahoon, Subru Krishnan, Ankita Agarwal, Rana Alotaibi, Jesús Camacho-Rodríguez, Bibin Chundatt, Andrew Chung, Niharika Dutta, Andrew Fogarty, Anja Gruenheid, Brandon Haynes, Matteo Interlandi, Minu Iyer, Nick Jurgens, Sumeet Khushalani, Brian Kroth, Manoj Kumar, Jyoti Leeka, Sergiy Matusevych, Minni Mittal, Andreas Mueller, Kartheek Muthyala, Harsha Nagulapalli, et al. (13 additional authors not shown)

    Abstract: Modern cloud has turned data services into easily accessible commodities. With just a few clicks, users are now able to access a catalog of data processing systems for a wide range of tasks. However, the cloud brings in both complexity and opportunity. While cloud users can quickly start an application by using various data services, it can be difficult to configure and optimize these services to…

    Submitted 2 May, 2024; originally announced May 2024.

    Comments: SIGMOD Companion of the 2023 International Conference on Management of Data. 2023

  42. arXiv:2404.10228  [pdf, other]

    cs.LG cs.CL cs.SI

    Two-Stage Stance Labeling: User-Hashtag Heuristics with Graph Neural Networks

    Authors: Joshua Melton, Shannon Reid, Gabriel Terejanu, Siddharth Krishnan

    Abstract: The high volume and rapid evolution of content on social media present major challenges for studying the stance of social media users. In this work, we develop a two-stage stance labeling method that utilizes the user-hashtag bipartite graph and the user-user interaction graph. In the first stage, a simple and efficient heuristic for stance labeling uses the user-hashtag bipartite graph to iterati…

    Submitted 17 May, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

  43. arXiv:2403.20329  [pdf, other]

    cs.CL cs.AI cs.LG

    ReALM: Reference Resolution As Language Modeling

    Authors: Joel Ruben Antony Moniz, Soundarya Krishnan, Melis Ozyildirim, Prathamesh Saraf, Halim Cagri Ates, Yuan Zhang, Hong Yu

    Abstract: Reference resolution is an important problem, one that is essential to understand and successfully handle context of different kinds. This context includes both previous turns and context that pertains to non-conversational entities, such as entities on the user's screen or those running in the background. While LLMs have been shown to be extremely powerful for a variety of tasks, their use in ref…

    Submitted 18 August, 2024; v1 submitted 29 March, 2024; originally announced March 2024.

    Comments: Accepted at SIGDIAL 2024 (Oral presentation)

  44. arXiv:2402.18707  [pdf, other]

    cs.HC cs.RO

    Embodied Supervision: Haptic Display of Automation Command to Improve Supervisory Performance

    Authors: Alia Gilbert, Sachit Krishnan, R. Brent Gillespie

    Abstract: A human operator using a manual control interface has ready access to their own command signal, both by efference copy and proprioception. In contrast, a human supervisor typically relies on visual information alone. We propose supplying a supervisor with a copy of the operator's command signal, hypothesizing improved performance, especially when that copy is provided through haptic display. We exp…

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: IEEE Haptics Symposium 2024

  45. arXiv:2402.10567  [pdf, other]

    cs.CL cs.AI

    InSaAF: Incorporating Safety through Accuracy and Fairness | Are LLMs ready for the Indian Legal Domain?

    Authors: Yogesh Tripathi, Raghav Donakanti, Sahil Girhepuje, Ishan Kavathekar, Bhaskara Hanuma Vedula, Gokul S Krishnan, Shreya Goyal, Anmol Goel, Balaraman Ravindran, Ponnurangam Kumaraguru

    Abstract: Recent advancements in language technology and Artificial Intelligence have resulted in numerous Language Models being proposed to perform various tasks in the legal domain ranging from predicting judgments to generating summaries. Despite their immense potential, these models have been proven to learn and exhibit societal biases and make unfair predictions. In this study, we explore the ability o…

    Submitted 17 June, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

  46. arXiv:2402.09426  [pdf, other]

    eess.SP cs.LG eess.SY

    Graph Koopman Autoencoder for Predictive Covert Communication Against UAV Surveillance

    Authors: Sivaram Krishnan, Jihong Park, Gregory Sherman, Benjamin Campbell, Jinho Choi

    Abstract: Low Probability of Detection (LPD) communication aims to obscure the very presence of radio frequency (RF) signals, going beyond just hiding the content of the communication. However, the use of Unmanned Aerial Vehicles (UAVs) introduces a challenge, as UAVs can detect RF signals from the ground by hovering over specific areas of interest. With the growing utilization of UAVs in modern surveillanc…

    Submitted 23 January, 2024; originally announced February 2024.

  47. arXiv:2402.07332  [pdf, other]

    cs.DB cs.CR

    DePLOI: Applying NL2SQL to Synthesize and Audit Database Access Control

    Authors: Pranav Subramaniam, Sanjay Krishnan

    Abstract: In every enterprise database, administrators must define an access control policy that specifies which users have access to which tables. Access control straddles two worlds: policy (organization-level principles that define who should have access) and process (database-level primitives that actually implement the policy). Assessing and enforcing process compliance with a policy is a manual and ad…

    Submitted 21 May, 2025; v1 submitted 11 February, 2024; originally announced February 2024.

    Comments: 13 pages, 5 figures, 2 tables

  48. arXiv:2402.06869  [pdf]

    cs.CR

    Digital Footprints of Streaming Devices

    Authors: Sundar Krishnan, William Bradley Glisson

    Abstract: These days, there are many ways to watch streaming videos on television. When compared to a standalone smart television, streaming devices such as Roku and Amazon Fire Stick have a plethora of app selections. While these devices are platform agnostic and compatible with smartphones, they can still leave behind crumbs of sensitive data that can cause privacy, security, and forensic issues. In this…

    Submitted 9 February, 2024; originally announced February 2024.

  49. arXiv:2402.03694  [pdf, other]

    cs.NI cs.AI

    ServeFlow: A Fast-Slow Model Architecture for Network Traffic Analysis

    Authors: Shinan Liu, Ted Shaowang, Gerry Wan, Jeewon Chae, Jonatas Marques, Sanjay Krishnan, Nick Feamster

    Abstract: Network traffic analysis increasingly uses complex machine learning models as the internet consolidates and traffic gets more encrypted. However, over high-bandwidth networks, flows can easily arrive faster than model inference rates. The temporal nature of network flows limits simple scale-out approaches leveraged in other high-traffic machine learning applications. Accordingly, this paper presen…

    Submitted 24 October, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  50. arXiv:2312.15959  [pdf, ps, other]

    cs.DS cs.DB

    Range (Rényi) Entropy Queries and Partitioning

    Authors: Aryan Esmailpour, Sanjay Krishnan, Stavros Sintos

    Abstract: Data partitioning that maximizes/minimizes the Shannon entropy, or more generally the Rényi entropy, is a crucial subroutine in data compression, columnar storage, and cardinality estimation algorithms. These partition algorithms can be accelerated if we have a data structure to compute the entropy in different subsets of data when the algorithm needs to decide what block to construct. Such a data…

    Submitted 4 August, 2025; v1 submitted 26 December, 2023; originally announced December 2023.