Showing 1–30 of 30 results for author: Mukherjee, K

Searching in archive cs.

  1. arXiv:2504.15546  [pdf, other]

    cs.SE cs.AI

    A Framework for Testing and Adapting REST APIs as LLM Tools

    Authors: Jayachandu Bandlamudi, Ritwik Chaudhuri, Neelamadhav Gantayat, Kushal Mukherjee, Prerna Agarwal, Renuka Sindhgatta, Sameep Mehta

    Abstract: Large Language Models (LLMs) are enabling autonomous agents to perform complex workflows using external tools or functions, often provided via REST APIs in enterprise systems. However, directly utilizing these APIs as tools poses challenges due to their complex input schemas, elaborate responses, and often ambiguous documentation. Current benchmarks for tool testing do not adequately address these…

    Submitted 1 May, 2025; v1 submitted 21 April, 2025; originally announced April 2025.

    ACM Class: I.2.7

  2. arXiv:2503.18001  [pdf, other]

    cs.IR cs.LG cs.SI

    Z-REx: Human-Interpretable GNN Explanations for Real Estate Recommendations

    Authors: Kunal Mukherjee, Zachary Harrison, Saeid Balaneshin

    Abstract: Transparency and interpretability are crucial for enhancing customer confidence and user engagement, especially when dealing with black-box Machine Learning (ML)-based recommendation systems. Modern recommendation systems leverage Graph Neural Networks (GNNs) due to their ability to produce high-quality recommendations in terms of both relevance and diversity. Therefore, the explainability of GNNs is…

    Submitted 11 February, 2025; originally announced March 2025.

    ACM Class: I.2; I.5

  3. arXiv:2502.11767  [pdf, other]

    cs.LG cs.CL

    From Selection to Generation: A Survey of LLM-based Active Learning

    Authors: Yu Xia, Subhojyoti Mukherjee, Zhouhang Xie, Junda Wu, Xintong Li, Ryan Aponte, Hanjia Lyu, Joe Barrow, Hongjie Chen, Franck Dernoncourt, Branislav Kveton, Tong Yu, Ruiyi Zhang, Jiuxiang Gu, Nesreen K. Ahmed, Yu Wang, Xiang Chen, Hanieh Deilamsalehy, Sungchul Kim, Zhengmian Hu, Yue Zhao, Nedim Lipka, Seunghyun Yoon, Ting-Hao Kenneth Huang, Zichao Wang, et al. (9 additional authors not shown)

    Abstract: Active Learning (AL) has been a powerful paradigm for improving model efficiency and performance by selecting the most informative data points for labeling and training. In recent active learning frameworks, Large Language Models (LLMs) have been employed not only for selection but also for generating entirely new data instances and providing more cost-effective annotations. Motivated by the incre…

    Submitted 17 February, 2025; originally announced February 2025.

  4. arXiv:2501.14249  [pdf, other]

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes, et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of…

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  5. arXiv:2412.05710  [pdf, other]

    cs.CL cs.AI cs.IR

    PromptRefine: Enhancing Few-Shot Performance on Low-Resource Indic Languages with Example Selection from Related Example Banks

    Authors: Soumya Suvra Ghosal, Soumyabrata Pal, Koyel Mukherjee, Dinesh Manocha

    Abstract: Large Language Models (LLMs) have recently demonstrated impressive few-shot learning capabilities through in-context learning (ICL). However, ICL performance is highly dependent on the choice of few-shot demonstrations, making the selection of the most optimal examples a persistent research challenge. This issue is further amplified in low-resource Indic languages, where the scarcity of ground-tru…

    Submitted 7 December, 2024; originally announced December 2024.

  6. arXiv:2410.12513  [pdf, other]

    cs.CL

    FiRST: Finetuning Router-Selective Transformers for Input-Adaptive Latency Reduction

    Authors: Akriti Jain, Saransh Sharma, Koyel Mukherjee, Soumyabrata Pal

    Abstract: Auto-regressive Large Language Models (LLMs) demonstrate remarkable performance across different domains such as vision and language processing. However, due to sequential processing through a stack of transformer layers, autoregressive decoding faces significant computation/latency challenges, particularly in resource-constrained environments like mobile and edge devices. Existing approaches in l…

    Submitted 17 December, 2024; v1 submitted 16 October, 2024; originally announced October 2024.

  7. arXiv:2406.17781  [pdf, other]

    cs.CV cs.AI cs.HC

    Large Language Models estimate fine-grained human color-concept associations

    Authors: Kushin Mukherjee, Timothy T. Rogers, Karen B. Schloss

    Abstract: Concepts, both abstract and concrete, elicit a distribution of association strengths across perceptual color space, which influence aspects of visual cognition ranging from object recognition to interpretation of information visualizations. While prior work has hypothesized that color-concept associations may be learned from the cross-modal statistical structure of experience, it has been unclear…

    Submitted 4 May, 2024; originally announced June 2024.

  8. arXiv:2402.11741  [pdf, other]

    cs.DS cs.CC cs.DB cs.DC

    To Store or Not to Store: a graph theoretical approach for Dataset Versioning

    Authors: Anxin Guo, Jingwei Li, Pattara Sukprasert, Samir Khuller, Amol Deshpande, Koyel Mukherjee

    Abstract: In this work, we study the cost-efficient data versioning problem, where the goal is to optimize the storage and reconstruction (retrieval) costs of data versions, given a graph of datasets as nodes and edges capturing edit/delta information. One central variant we study is MinSum Retrieval (MSR) where the goal is to minimize the total retrieval costs, while keeping the storage costs bounded. This…

    Submitted 18 February, 2024; originally announced February 2024.

    Comments: Accepted by IPDPS 2024

  9. arXiv:2402.01742  [pdf, other]

    cs.CL cs.AI cs.LG

    Towards Optimizing the Costs of LLM Usage

    Authors: Shivanshu Shekhar, Tanishq Dubey, Koyel Mukherjee, Apoorv Saxena, Atharv Tyagi, Nishanth Kotla

    Abstract: Generative AI and LLMs in particular are heavily used nowadays for various document processing tasks such as question answering and summarization. However, different LLMs come with different capabilities for different tasks as well as with different costs, tokenization, and latency. In fact, enterprises are already incurring huge costs of operating or using LLMs for their respective use cases. I…

    Submitted 29 January, 2024; originally announced February 2024.

    Comments: 8 pages + Appendix, Total 12 pages

  10. R2D2: Reducing Redundancy and Duplication in Data Lakes

    Authors: Raunak Shah, Koyel Mukherjee, Atharv Tyagi, Sai Keerthana Karnam, Dhruv Joshi, Shivam Bhosale, Subrata Mitra

    Abstract: Enterprise data lakes often suffer from substantial amounts of duplicate and redundant data, with data volumes ranging from terabytes to petabytes. This leads to both increased storage costs and unnecessarily high maintenance costs for these datasets. In this work, we focus on identifying and reducing redundancy in enterprise data lakes by addressing the problem of 'dataset containment'. To the be…

    Submitted 20 December, 2023; originally announced December 2023.

    Comments: The first two authors contributed equally. 25 pages, accepted to the International Conference on Management of Data (SIGMOD) 2024. ©Raunak Shah | ACM 2023. This is the author's version of the work. Not for redistribution. The definitive Version of Record was published in Proceedings of the ACM on Management of Data (PACMMOD), http://dx.doi.org/10.1145/3626762

    Journal ref: Proc. ACM Manag. Data 1, 4, Article 268 (December 2023), 25 pages

  11. arXiv:2312.04429  [pdf, other]

    cs.CV

    Approximate Caching for Efficiently Serving Diffusion Models

    Authors: Shubham Agarwal, Subrata Mitra, Sarthak Chakraborty, Srikrishna Karanam, Koyel Mukherjee, Shiv Saini

    Abstract: Text-to-image generation using diffusion models has seen explosive popularity owing to their ability to produce high-quality images adhering to text prompts. However, production-grade diffusion model serving is a resource-intensive task that not only requires expensive high-end GPUs but also incurs considerable latency. In this paper, we introduce a technique called approximate-caching…

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: Accepted at NSDI'24

  12. arXiv:2312.03035  [pdf, other]

    cs.CV

    SEVA: Leveraging sketches to evaluate alignment between human and machine visual abstraction

    Authors: Kushin Mukherjee, Holly Huey, Xuanchen Lu, Yael Vinker, Rio Aguina-Kang, Ariel Shamir, Judith E. Fan

    Abstract: Sketching is a powerful tool for creating abstract images that are sparse but meaningful. Sketch understanding poses fundamental challenges for general-purpose vision algorithms because it requires robustness to the sparsity of sketches relative to natural visual inputs and because it demands tolerance for semantic ambiguity, as sketches can reliably evoke multiple meanings. While current vision a…

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: Accepted to the Advances in Neural Information Processing Systems (Datasets and Benchmarks Track) 2023

  13. arXiv:2306.00934  [pdf, other]

    cs.CR cs.LG

    Interpreting GNN-based IDS Detections Using Provenance Graph Structural Features

    Authors: Kunal Mukherjee, Joshua Wiedemeier, Tianhao Wang, Muhyun Kim, Feng Chen, Murat Kantarcioglu, Kangkook Jee

    Abstract: Advanced cyber threats (e.g., Fileless Malware and Advanced Persistent Threat (APT)) have driven the adoption of provenance-based security solutions. These solutions employ Machine Learning (ML) models for behavioral modeling and critical security tasks such as malware and anomaly detection. However, the opacity of ML-based security models limits their broader adoption, as the lack of transparency…

    Submitted 16 December, 2024; v1 submitted 1 June, 2023; originally announced June 2023.

  14. arXiv:2305.14818  [pdf, other]

    cs.DB

    Towards Optimizing Storage Costs on the Cloud

    Authors: Koyel Mukherjee, Raunak Shah, Shiv Kumar Saini, Karanpreet Singh, Khushi, Harsh Kesarwani, Kavya Barnwal, Ayush Chauhan

    Abstract: We study the problem of optimizing data storage and access costs on the cloud while ensuring that the desired performance or latency is unaffected. We first propose an optimizer that optimizes the data placement tier (on the cloud) and the choice of compression schemes to apply, for given data partitions with temporal access predictions. Secondly, we propose a model to learn the compression perfor…

    Submitted 6 July, 2023; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: The first two authors contributed equally. 12 pages, Accepted to the International Conference on Data Engineering (ICDE) 2023

  15. arXiv:2304.05591  [pdf, other]

    cs.CL cs.AI cs.LG

    Semantic Feature Verification in FLAN-T5

    Authors: Siddharth Suresh, Kushin Mukherjee, Timothy T. Rogers

    Abstract: This study evaluates the potential of a large language model for aiding in the generation of semantic feature norms - a critical tool for evaluating conceptual structure in cognitive science. Building from an existing human-generated dataset, we show that machine-verified norms capture aspects of conceptual structure beyond what is expressed in human norms alone, and better explain human judgments of…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To appear as a Tiny Paper at ICLR 2023

  16. arXiv:2304.05012  [pdf, other]

    cs.CL cs.AI

    Human-machine cooperation for semantic feature listing

    Authors: Kushin Mukherjee, Siddharth Suresh, Timothy T. Rogers

    Abstract: Semantic feature norms, lists of features that concepts do and do not possess, have played a central role in characterizing human conceptual knowledge, but require extensive human labor. Large language models (LLMs) offer a novel avenue for the automatic generation of such feature lists, but are prone to significant error. Here, we present a new method for combining a learned model of human lexica…

    Submitted 11 April, 2023; originally announced April 2023.

    Comments: To be published in the ICLR TinyPaper track

  17. arXiv:2304.02754  [pdf, other]

    cs.AI cs.CL cs.LG

    Conceptual structure coheres in human cognition but not in large language models

    Authors: Siddharth Suresh, Kushin Mukherjee, Xizheng Yu, Wei-Chun Huang, Lisa Padua, Timothy T Rogers

    Abstract: Neural network models of language have long been used as a tool for developing hypotheses about conceptual representation in the mind and brain. For many years, such use involved extracting vector-space representations of words and using distances among these to predict or understand human behavior in various semantic tasks. Contemporary large language models (LLMs), however, make it possible to i…

    Submitted 10 November, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

  18. arXiv:2206.06868  [pdf, other]

    cs.CL cs.AI

    Natural Language Sentence Generation from API Specifications

    Authors: Siyu Huo, Kushal Mukherjee, Jayachandu Bandlamudi, Vatche Isahagian, Vinod Muthusamy, Yara Rizk

    Abstract: APIs are everywhere; they provide access to automation solutions that could help businesses automate some of their tasks. Unfortunately, they may not be accessible to the business users who need them but are not equipped with the necessary technical skills to leverage them. Wrapping these APIs with chatbot capabilities is one solution to make these automation solutions interactive. In this work, w…

    Submitted 1 June, 2022; originally announced June 2022.

  19. arXiv:2108.03685  [pdf, other]

    cs.HC

    Context Matters: A Theory of Semantic Discriminability for Perceptual Encoding Systems

    Authors: Kushin Mukherjee, Brian Yin, Brianne E. Sherman, Laurent Lessard, Karen B. Schloss

    Abstract: People's associations between colors and concepts influence their ability to interpret the meanings of colors in information visualizations. Previous work has suggested such effects are limited to concepts that have strong, specific associations with colors. However, although a concept may not be strongly associated with any colors, its mapping can be disambiguated in the context of other concepts…

    Submitted 21 September, 2023; v1 submitted 8 August, 2021; originally announced August 2021.

    Comments: Published in IEEE Transactions on Visualization and Computer Graphics

  20. arXiv:1910.11605  [pdf, other]

    cs.LG cs.CV stat.ML

    A Simple Dynamic Learning Rate Tuning Algorithm For Automated Training of DNNs

    Authors: Koyel Mukherjee, Alind Khare, Ashish Verma

    Abstract: Training neural networks on image datasets generally requires extensive experimentation to find the optimal learning rate regime. Especially in the cases of adversarial training or training a newly synthesized model, one would not know the best learning rate regime beforehand. We propose an automated algorithm for determining the learning rate trajectory that works across datasets and models…

    Submitted 25 October, 2019; originally announced October 2019.

  21. arXiv:1904.10689  [pdf, other]

    cs.LG stat.ML

    Layer Dynamics of Linearised Neural Nets

    Authors: Saurav Basu, Koyel Mukherjee, Shrihari Vasudevan

    Abstract: Despite the phenomenal success of deep learning in recent years, there remains a gap in understanding the fundamental mechanics of neural nets. More research is focussed on handcrafting complex and larger networks, and the design decisions are often ad-hoc and based on intuition. Some recent research has aimed to demystify the learning dynamics in neural nets by attempting to build a theory from f…

    Submitted 24 April, 2019; originally announced April 2019.

  22. arXiv:1712.03724  [pdf, other]

    cs.CY cs.AI cs.HC

    Cogniculture: Towards a Better Human-Machine Co-evolution

    Authors: Rakesh R Pimplikar, Kushal Mukherjee, Gyana Parija, Harit Vishwakarma, Ramasuri Narayanam, Sarthak Ahuja, Rohith D Vallam, Ritwik Chaudhuri, Joydeep Mondal

    Abstract: Research in Artificial Intelligence is breaking technology barriers every day. New algorithms and high performance computing are making things possible which we could only have imagined earlier. Though the enhancements in AI are making life easier for human beings day by day, there is constant fear that AI based systems will pose a threat to humanity. People in the AI community have a diverse set of opi…

    Submitted 11 December, 2017; originally announced December 2017.

  23. arXiv:1711.08413  [pdf]

    cs.CV stat.AP stat.ML

    SolarisNet: A Deep Regression Network for Solar Radiation Prediction

    Authors: Subhadip Dey, Sawon Pratiher, Saon Banerjee, Chanchal Kumar Mukherjee

    Abstract: Effective utilization of photovoltaic (PV) plants requires weather variability robust global solar radiation (GSR) forecasting models. Random weather turbulence phenomena coupled with assumptions of clear sky model as suggested by Hottel pose significant challenges to parametric & non-parametric models in GSR conversion rate estimation. Also, a decent GSR estimate requires costly high-tech radiome…

    Submitted 10 December, 2017; v1 submitted 22 November, 2017; originally announced November 2017.

  24. arXiv:1706.02682  [pdf, other]

    math.OC cs.DS cs.GT

    Impact of Detour-Aware Policies on Maximizing Profit in Ridesharing

    Authors: Arpita Biswas, Ragavendran Gopalakrishnan, Theja Tulabandhula, Asmita Metrewar, Koyel Mukherjee, Raja Subramaniam Thangaraj

    Abstract: This paper provides efficient solutions to maximize profit for commercial ridesharing services, under a pricing model with detour-based discounts for passengers. We propose greedy heuristics for real-time ride matching that offer different trade-offs between optimality and speed. Simulations on New York City (NYC) taxi trip data show that our heuristics are up to 90% optimal and 10^5 times faster…

    Submitted 8 June, 2017; originally announced June 2017.

    Comments: 18 pages, 10 figures

  25. arXiv:1703.07807  [pdf, other]

    cs.LG

    Learning to Partition using Score Based Compatibilities

    Authors: Arun Rajkumar, Koyel Mukherjee, Theja Tulabandhula

    Abstract: We study the problem of learning to partition users into groups, where one must learn the compatibilities between the users to achieve optimal groupings. We define four natural objectives that optimize for average and worst case compatibilities and propose new algorithms for adaptively learning optimal groupings. When we do not impose any structure on the compatibilities, we show that the group fo…

    Submitted 22 March, 2017; originally announced March 2017.

    Comments: Appears in the Proceedings of the 16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2017)

  26. arXiv:1610.08154  [pdf, other]

    cs.DS

    LP Rounding and Combinatorial Algorithms for Minimizing Active and Busy Time

    Authors: Jessica Chang, Samir Khuller, Koyel Mukherjee

    Abstract: We consider fundamental scheduling problems motivated by energy issues. In this framework, we are given a set of jobs, each with a release time, deadline and required processing length. The jobs need to be scheduled on a machine so that at most g jobs are active at any given time. The duration for which a machine is active (i.e., "on") is referred to as its active time. The goal is to find a feasi…

    Submitted 25 October, 2016; originally announced October 2016.

    Comments: 31 pages, originally appeared in SPAA 2014

  27. arXiv:1607.07306  [pdf, other]

    cs.GT cs.CC cs.DS

    The Costs and Benefits of Sharing: Sequential Individual Rationality and Sequential Fairness

    Authors: Ragavendran Gopalakrishnan, Koyel Mukherjee, Theja Tulabandhula

    Abstract: In designing dynamic shared service systems that incentivize customers to opt for shared rather than exclusive service, the traditional notion of individual rationality may be insufficient, as a customer's estimated utility could fluctuate arbitrarily during their time in the shared system, as long as their realized utility at service completion is not worse than that for exclusive service. In thi…

    Submitted 20 June, 2017; v1 submitted 25 July, 2016; originally announced July 2016.

    Comments: Presented as a poster at EC 2016. Presented as an invited talk (sponsored session) at INFORMS Annual Meeting 2016. Presented at MSOM Service Operations SIG 2017. Currently under review at Management Science

  28. Online Tracking of Skin Colour Regions Against a Complex Background

    Authors: Subhadip Basu, S. Chakraborty, K. Mukherjee, S. K. Pandit

    Abstract: Online tracking of human activity against a complex background is a challenging task for many applications. In this paper, we have developed a robust technique for localizing skin colour regions from unconstrained image frames. A simple and fast segmentation algorithm is used to train a multilayer perceptron (MLP) for detection of skin colours. Stepper motors are synchronized with the MLP to trac…

    Submitted 15 October, 2014; originally announced October 2014.

    Journal ref: Proc. of IEEE INDICON, pp. 184-186, Dec-2004, Kharagpur

  29. arXiv:1410.4012  [pdf]

    cs.CV

    Online interpretation of numeric sign language using 2-d skeletal model

    Authors: Subhadip Basu, S. Dey, K. Mukherjee, T. S. Jana

    Abstract: Gesturing is one of the natural modes of human communication. Signs produced by gestures can have a basic meaning coupled with additional information that is layered over the basic meaning of the sign. Sign language is an important example of communicative gestures that are highly structured and well accepted across the world as a communication medium for deaf and dumb. In this paper, an online re…

    Submitted 15 October, 2014; originally announced October 2014.

    Journal ref: Proc. of Intl. Conf. on Communication Devices and Intelligent Systems (CODIS), pp. 570-573, Jan-2004, Kolkata

  30. arXiv:1204.6552  [pdf, other]

    cs.GT cs.AI cs.MA

    A Game-Theoretic Model Motivated by the DARPA Network Challenge

    Authors: Rajesh Chitnis, MohammadTaghi Hajiaghayi, Jonathan Katz, Koyel Mukherjee

    Abstract: In this paper we propose a game-theoretic model to analyze events similar to the 2009 DARPA Network Challenge, which was organized by the Defense Advanced Research Projects Agency (DARPA) for exploring the roles that the Internet and social networks play in incentivizing wide-area collaborations. The challenge was to form a group that would be the first to find the locations of ten moored w…

    Submitted 30 January, 2013; v1 submitted 30 April, 2012; originally announced April 2012.