Skip to main content

Showing 1–50 of 109 results for author: Maity, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.23763  [pdf, ps, other

    cs.CV

    Sketch Down the FLOPs: Towards Efficient Networks for Human Sketch

    Authors: Aneeshan Sain, Subhajit Maity, Pinaki Nath Chowdhury, Subhadeep Koley, Ayan Kumar Bhunia, Yi-Zhe Song

    Abstract: As sketch research has collectively matured over time, its adaptation for at-mass commercialisation emerges on the immediate horizon. Despite an already mature research endeavour for photos, there is no research on the efficient inference specifically designed for sketch data. In this paper, we first demonstrate existing state-of-the-art efficient light-weight models designed for photos do not wor… ▽ More

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: Accepted at CVPR 2025, Project Page: https://subhajitmaity.me/SketchDownTheFLOPs

  2. arXiv:2504.05693  [pdf, other

    cs.CL cs.AI

    STRIVE: A Think & Improve Approach with Iterative Refinement for Enhancing Question Quality Estimation

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Automatically assessing question quality is crucial for educators as it saves time, ensures consistency, and provides immediate feedback for refining teaching materials. We propose a novel methodology called STRIVE (Structured Thinking and Refinement with multiLLMs for Improving Verified Question Estimation) using a series of Large Language Models (LLMs) for automatic question evaluation. This app… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 5 pages, 6 figures

  3. arXiv:2504.05683  [pdf, other

    cs.CL cs.AI

    Towards Smarter Hiring: Are Zero-Shot and Few-Shot Pre-trained LLMs Ready for HR Spoken Interview Transcript Analysis?

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: This research paper presents a comprehensive analysis of the performance of prominent pre-trained large language models (LLMs), including GPT-4 Turbo, GPT-3.5 Turbo, text-davinci-003, text-babbage-001, text-curie-001, text-ada-001, llama-2-7b-chat, llama-2-13b-chat, and llama-2-70b-chat, in comparison to expert human evaluators in providing scores, identifying errors, and offering feedback and imp… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 32 pages, 24 figures

  4. arXiv:2504.05642  [pdf, other

    cs.CL

    Leveraging Prompt-Tuning for Bengali Grammatical Error Explanation Using Large Language Models

    Authors: Subhankar Maity, Aniket Deroy

    Abstract: We propose a novel three-step prompt-tuning method for Bengali Grammatical Error Explanation (BGEE) using state-of-the-art large language models (LLMs) such as GPT-4, GPT-3.5 Turbo, and Llama-2-70b. Our approach involves identifying and categorizing grammatical errors in Bengali sentences, generating corrected versions of the sentences, and providing natural language explanations for each identifi… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: 9 pages, 2 figures

  5. arXiv:2503.10632  [pdf, other

    cs.LG cs.CV

    Kolmogorov-Arnold Attention: Is Learnable Attention Better For Vision Transformers?

    Authors: Subhajit Maity, Killian Hitsman, Xin Li, Aritra Dutta

    Abstract: Kolmogorov-Arnold networks (KANs) are a remarkable innovation consisting of learnable activation functions with the potential to capture more complex relationships from data. Presently, KANs are deployed by replacing multilayer perceptrons (MLPs) in deep networks, including advanced architectures such as vision Transformers (ViTs). This work asks whether a similar replacement in the attention can… ▽ More

    Submitted 28 May, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    Comments: Preprint, Appendix included

    MSC Class: 68T07 ACM Class: I.2.6; I.5.1; I.5.5; I.5.4; I.4.10

  6. arXiv:2502.10449  [pdf, ps, other

    cs.CC

    MaxMin Separation Problems: FPT Algorithms for $st$-Separator and Odd Cycle Transversal

    Authors: Ajinkya Gaikwad, Hitendra Kumar, Soumen Maity, Saket Saurabh, Roohani Sharma

    Abstract: In this paper, we study the parameterized complexity of the MaxMin versions of two fundamental separation problems: Maximum Minimal $st$-Separator and Maximum Minimal Odd Cycle Transversal (OCT), both parameterized by the solution size. In the Maximum Minimal $st$-Separator problem, given a graph $G$, two distinct vertices $s$ and $t$ and a positive integer $k$, the goal is to determine whether th… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Accepted to STACS 2025

  7. arXiv:2502.04662  [pdf, other

    cs.LG eess.SY math.OC

    Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits

    Authors: Sreejeet Maity, Aritra Mitra

    Abstract: One of the most basic problems in reinforcement learning (RL) is policy evaluation: estimating the long-term return, i.e., value function, corresponding to a given fixed policy. The celebrated Temporal Difference (TD) learning algorithm addresses this problem, and recent work has investigated finite-time convergence guarantees for this algorithm and variants thereof. However, these guarantees hing… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted to AISTATS 2025

  8. arXiv:2502.03261  [pdf, other

    stat.ML cs.LG cs.NI math.ST

    CARROT: A Cost Aware Rate Optimal Router

    Authors: Seamus Somerstep, Felipe Maia Polo, Allysson Flavio Melo de Oliveira, Prattyush Mangal, Mírian Silva, Onkar Bhardwaj, Mikhail Yurochkin, Subha Maity

    Abstract: With the rapid growth in the number of Large Language Models (LLMs), there has been a recent interest in LLM routing, or directing queries to the cheapest LLM that can deliver a suitable response. We conduct a minimax analysis of the routing problem, providing a lower bound and finding that a simple router that predicts both cost and accuracy for each question can be minimax optimal. Inspired by t… ▽ More

    Submitted 19 May, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: v2: Added o3-mini to CARROT and SPROUT

  9. arXiv:2501.17397  [pdf, ps, other

    cs.CL

    Leveraging In-Context Learning and Retrieval-Augmented Generation for Automatic Question Generation in Educational Domains

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: Question generation in education is a time-consuming and cognitively demanding task, as it requires creating questions that are both contextually relevant and pedagogically sound. Current automated question generation methods often generate questions that are out of context. In this work, we explore advanced techniques for automated question generation in educational contexts, focusing on In-Conte… ▽ More

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: Accepted at the 16th Meeting of the Forum for Information Retrieval Evaluation as a Regular Paper

  10. arXiv:2411.09214  [pdf, other

    cs.CL

    HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: The widespread use of social media platforms like Twitter and Facebook has enabled people of all ages to share their thoughts and experiences, leading to an immense accumulation of user-generated content. However, alongside the benefits, these platforms also face the challenge of managing hate speech and offensive content, which can undermine rational discourse and threaten democratic values. As a… ▽ More

    Submitted 25 March, 2025; v1 submitted 14 November, 2024; originally announced November 2024.

    Comments: Updated and Final Version

  11. arXiv:2411.08998  [pdf, other

    stat.ML cs.LG stat.ME

    Microfoundation Inference for Strategic Prediction

    Authors: Daniele Bracale, Subha Maity, Felipe Maia Polo, Seamus Somerstep, Moulinath Banerjee, Yuekai Sun

    Abstract: Often in prediction tasks, the predictive model itself can influence the distribution of the target variable, a phenomenon termed performative prediction. Generally, this influence stems from strategic actions taken by stakeholders with a vested interest in predictive models. A key challenge that hinders the widespread adaptation of performative prediction in machine learning is that practitioners… ▽ More

    Submitted 10 April, 2025; v1 submitted 13 November, 2024; originally announced November 2024.

  12. arXiv:2411.07917  [pdf, other

    cs.CL

    CryptoLLM: Unleashing the Power of Prompted LLMs for SmartQnA and Classification of Crypto Posts

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: The rapid growth of social media has resulted in an large volume of user-generated content, particularly in niche domains such as cryptocurrency. This task focuses on developing robust classification models to accurately categorize cryptocurrency-related social media posts into predefined classes, including but not limited to objective, positive, negative, etc. Additionally, the task requires part… ▽ More

    Submitted 18 March, 2025; v1 submitted 12 November, 2024; originally announced November 2024.

    Comments: Updated and Final Version

  13. arXiv:2411.06946  [pdf, other

    cs.CL

    Cancer-Answer: Empowering Cancer Care with Advanced Large Language Models

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Gastrointestinal (GI) tract cancers account for a substantial portion of the global cancer burden, where early diagnosis is critical for improved management and patient outcomes. The complex aetiologies and overlapping symptoms across GI cancers often delay diagnosis, leading to suboptimal treatment strategies. Cancer-related queries are crucial for timely diagnosis, treatment, and patient educati… ▽ More

    Submitted 18 March, 2025; v1 submitted 11 November, 2024; originally announced November 2024.

    Comments: Updated and Final Version

  14. arXiv:2411.05039  [pdf, other

    cs.CL cs.AI

    YouTube Comments Decoded: Leveraging LLMs for Low Resource Language Classification

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Sarcasm detection is a significant challenge in sentiment analysis, particularly due to its nature of conveying opinions where the intended meaning deviates from the literal expression. This challenge is heightened in social media contexts where code-mixing, especially in Dravidian languages, is prevalent. Code-mixing involves the blending of multiple languages within a single utterance, often wit… ▽ More

    Submitted 13 March, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: Updated and Final Version

  15. arXiv:2411.04752  [pdf, other

    cs.CL

    RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Code-mixing, the integration of lexical and grammatical elements from multiple languages within a single sentence, is a widespread linguistic phenomenon, particularly prevalent in multilingual societies. In India, social media users frequently engage in code-mixed conversations using the Roman script, especially among migrant communities who form online groups to share relevant local information.… ▽ More

    Submitted 26 March, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: Final and Updated version

  16. arXiv:2411.04025  [pdf, other

    cs.CL

    Prompt Engineering Using GPT for Word-Level Code-Mixed Language Identification in Low-Resource Dravidian Languages

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Language Identification (LI) is crucial for various natural language processing tasks, serving as a foundational step in applications such as sentiment analysis, machine translation, and information retrieval. In multilingual societies like India, particularly among the youth engaging on social media, text often exhibits code-mixing, blending local languages with English at different linguistic le… ▽ More

    Submitted 12 March, 2025; v1 submitted 6 November, 2024; originally announced November 2024.

    Comments: Updated and Final Version

  17. arXiv:2410.19822  [pdf, ps, other

    cs.CY cs.AI cs.HC

    Human-Centric eXplainable AI in Education

    Authors: Subhankar Maity, Aniket Deroy

    Abstract: As artificial intelligence (AI) becomes more integrated into educational environments, how can we ensure that these systems are both understandable and trustworthy? The growing demand for explainability in AI systems is a critical area of focus. This paper explores Human-Centric eXplainable AI (HCXAI) in the educational landscape, emphasizing its role in enhancing learning outcomes, fostering trus… ▽ More

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: Preprint. Under Review

  18. arXiv:2410.12893  [pdf, other

    cs.CL cs.AI

    MIRROR: A Novel Approach for the Automated Evaluation of Open-Ended Question Generation

    Authors: Aniket Deroy, Subhankar Maity, Sudeshna Sarkar

    Abstract: Automatic question generation is a critical task that involves evaluating question quality by considering factors such as engagement, pedagogical value, and the ability to stimulate critical thinking. These aspects require human-like understanding and judgment, which automated systems currently lack. However, human evaluations are costly and impractical for large-scale samples of generated questio… ▽ More

    Submitted 25 March, 2025; v1 submitted 16 October, 2024; originally announced October 2024.

    Comments: Updated Version

  19. arXiv:2410.10650  [pdf, ps, other

    cs.CL cs.AI

    Generative AI and Its Impact on Personalized Intelligent Tutoring Systems

    Authors: Subhankar Maity, Aniket Deroy

    Abstract: Generative Artificial Intelligence (AI) is revolutionizing educational technology by enabling highly personalized and adaptive learning environments within Intelligent Tutoring Systems (ITS). This report delves into the integration of Generative AI, particularly large language models (LLMs) like GPT-4, into ITS to enhance personalized education through dynamic content generation, real-time feedbac… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Scientific Report (Under Review)

  20. arXiv:2410.10542  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models

    Authors: Shubham Kumar Nigam, Aniket Deroy, Subhankar Maity, Arnab Bhattacharya

    Abstract: This study investigates judgment prediction in a realistic scenario within the context of Indian judgments, utilizing a range of transformer-based models, including InLegalBERT, BERT, and XLNet, alongside LLMs such as Llama-2 and GPT-3.5 Turbo. In this realistic scenario, we simulate how judgments are predicted at the point when a case is presented for a decision in court, using only the informati… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted on NLLP at EMNLP 2024

  21. arXiv:2410.09576  [pdf, ps, other

    cs.CL cs.AI

    The Future of Learning in the Age of Generative AI: Automated Question Generation and Assessment with Large Language Models

    Authors: Subhankar Maity, Aniket Deroy

    Abstract: In recent years, large language models (LLMs) and generative AI have revolutionized natural language processing (NLP), offering unprecedented capabilities in education. This chapter explores the transformative potential of LLMs in automated question generation and answer assessment. It begins by examining the mechanisms behind LLMs, emphasizing their ability to comprehend and generate human-like t… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.

    Comments: Book Chapter (Under Review)

  22. arXiv:2409.19027  [pdf, ps, other

    cs.CL cs.SE

    Code Generation and Algorithmic Problem Solving Using Llama 3.1 405B

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Code generation by Llama 3.1 models, such as Meta's Llama 3.1 405B, represents a significant advancement in the field of artificial intelligence, particularly in natural language processing and programming automation. This paper explores the capabilities and applications of Llama-driven code generation, highlighting its ability to translate natural language prompts into executable code across mult… ▽ More

    Submitted 2 April, 2025; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: updated version

  23. arXiv:2409.03237  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Robust Q-Learning under Corrupted Rewards

    Authors: Sreejeet Maity, Aritra Mitra

    Abstract: Recently, there has been a surge of interest in analyzing the non-asymptotic behavior of model-free reinforcement learning algorithms. However, the performance of such algorithms in non-ideal environments, such as in the presence of corrupted rewards, is poorly understood. Motivated by this gap, we investigate the robustness of the celebrated Q-learning algorithm to a strong-contamination attack m… ▽ More

    Submitted 5 September, 2024; originally announced September 2024.

    Comments: Accepted to the Decision and Control Conference (CDC) 2024

  24. arXiv:2406.19391  [pdf, other

    cs.CV

    Fibottention: Inceptive Visual Representation Learning with Diverse Attention Across Heads

    Authors: Ali Khaleghi Rahimian, Manish Kumar Govind, Subhajit Maity, Dominick Reilly, Christian Kümmerle, Srijan Das, Aritra Dutta

    Abstract: Transformer architectures such as Vision Transformers (ViT) have proven effective for solving visual perception tasks. However, they suffer from two major limitations; first, the quadratic complexity of self-attention limits the number of tokens that can be processed, and second, Transformers often require large amounts of training data to attain state-of-the-art performance. In this paper, we pro… ▽ More

    Submitted 19 December, 2024; v1 submitted 27 June, 2024; originally announced June 2024.

    Comments: The complete implementation, including source code and evaluation scripts, is publicly available at: https://github.com/Charlotte-CharMLab/Fibottention

  25. arXiv:2406.15211  [pdf, other

    cs.CL cs.AI

    How Effective is GPT-4 Turbo in Generating School-Level Questions from Textbooks Based on Bloom's Revised Taxonomy?

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: We evaluate the effectiveness of GPT-4 Turbo in generating educational questions from NCERT textbooks in zero-shot mode. Our study highlights GPT-4 Turbo's ability to generate questions that require higher-order thinking skills, especially at the "understanding" level according to Bloom's Revised Taxonomy. While we find a notable consistency between questions generated by GPT-4 Turbo and those ass… ▽ More

    Submitted 21 June, 2024; originally announced June 2024.

    Comments: Accepted at Learnersourcing: Student-Generated Content @ Scale 2024

  26. arXiv:2406.08226  [pdf, other

    cs.CV cs.AI cs.LG

    DistilDoc: Knowledge Distillation for Visually-Rich Document Applications

    Authors: Jordy Van Landeghem, Subhajit Maity, Ayan Banerjee, Matthew Blaschko, Marie-Francine Moens, Josep Lladós, Sanket Biswas

    Abstract: This work explores knowledge distillation (KD) for visually-rich document (VRD) applications such as document layout analysis (DLA) and document image classification (DIC). While VRD research is dependent on increasingly sophisticated and cumbersome models, the field has neglected to study efficiency via model compression. Here, we design a KD experimentation methodology for more lean, performant… ▽ More

    Submitted 12 March, 2025; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: Accepted to ICDAR 2024 (Athens, Greece)

  27. arXiv:2406.00039  [pdf

    cs.CL

    How Ready Are Generative Pre-trained Large Language Models for Explaining Bengali Grammatical Errors?

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: Grammatical error correction (GEC) tools, powered by advanced generative artificial intelligence (AI), competently correct linguistic inaccuracies in user input. However, they often fall short in providing essential natural language explanations, which are crucial for learning languages and gaining a deeper understanding of the grammatical rules. There is limited exploration of these tools in low-… ▽ More

    Submitted 27 May, 2024; originally announced June 2024.

    Comments: Accepted at Educational Data Mining 2024

  28. arXiv:2405.15172  [pdf, other

    stat.ML cs.LG

    Learning the Distribution Map in Reverse Causal Performative Prediction

    Authors: Daniele Bracale, Subha Maity, Moulinath Banerjee, Yuekai Sun

    Abstract: In numerous predictive scenarios, the predictive model affects the sampling distribution; for example, job applicants often meticulously craft their resumes to navigate through a screening systems. Such shifts in distribution are particularly prevalent in the realm of social computing, yet, the strategies to learn these shifts from data remain remarkably limited. Inspired by a microeconomic model… ▽ More

    Submitted 10 April, 2025; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 17 pages, 4 figures

  29. arXiv:2405.11579  [pdf, ps, other

    cs.CL

    Exploring the Capabilities of Prompted Large Language Models in Educational and Assessment Applications

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: In the era of generative artificial intelligence (AI), the fusion of large language models (LLMs) offers unprecedented opportunities for innovation in the field of modern education. We embark on an exploration of prompted LLMs within the context of educational and assessment applications to uncover their potential. Through a series of carefully crafted research questions, we investigate the effect… ▽ More

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: Accepted at EDM 2024

  30. arXiv:2404.10023  [pdf, other

    cs.DS cs.CC

    Parameterized Algorithms for Editing to Uniform Cluster Graph

    Authors: Ajinkya Gaikwad, Hitendra Kumar, Soumen Maity

    Abstract: We study the parameterized complexity of transforming graphs into Uniform Cluster graphs, where each component is an equal-sized clique. We consider Uniform Cluster Vertex Deletion (UCVD), Uniform Cluster Edge Deletion (UCED), Uniform Cluster Edge Addition (UCEA), Uniform Cluster Edge Editing (UCEE), Uniform Cluster Exclusive Vertex Splitting (UCEVS), and Uniform Cluster Inclusive Vertex Splitting… ▽ More

    Submitted 5 February, 2025; v1 submitted 15 April, 2024; originally announced April 2024.

  31. arXiv:2403.04224  [pdf, other

    cs.CL cs.AI cs.LG

    Aligners: Decoupling LLMs and Alignment

    Authors: Lilian Ngweta, Mayank Agarwal, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin

    Abstract: Large Language Models (LLMs) need to be aligned with human expectations to ensure their safety and utility in most applications. Alignment is challenging, costly, and needs to be repeated for every LLM and alignment criterion. We propose to decouple LLMs and alignment by training aligner models that can be used to align any LLM for a given criteria on an as-needed basis, thus also reducing the pot… ▽ More

    Submitted 4 October, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

    Comments: Short version accepted as a Tiny Paper at the International Conference on Learning Representations (ICLR) 2024. Long version accepted to the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2024 Findings

  32. arXiv:2401.07098  [pdf, other

    cs.CL

    A Novel Multi-Stage Prompting Approach for Language Agnostic MCQ Generation using GPT

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: We introduce a multi-stage prompting approach (MSP) for the generation of multiple choice questions (MCQs), harnessing the capabilities of GPT models such as text-davinci-003 and GPT-4, renowned for their excellence across various NLP tasks. Our approach incorporates the innovative concept of chain-of-thought prompting, a progressive technique in which the GPT model is provided with a series of in… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Accepted at ECIR 2024(short paper)

  33. arXiv:2312.13622  [pdf, other

    eess.SP cs.IT

    Jointly Optimal RIS Placement and Power Allocation for Underlay D2D Communications: An Outage Probability Minimization Approach

    Authors: Sarbani Ghose, Deepak Mishra, Santi P. Maity, George C. Alexandropoulos

    Abstract: In this paper, we study underlay device-to-device (D2D) communication systems empowered by a reconfigurable intelligent surface (RIS) for cognitive cellular networks. Considering Rayleigh fading channels and the general case where there exist both the direct and RIS-enabled D2D channels, the outage probability (OP) of the D2D communication link is presented in closed-form. Next, for the considered… ▽ More

    Submitted 7 January, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

  34. arXiv:2312.10748  [pdf, other

    cs.CL cs.SI

    Multi-Label Classification of COVID-Tweets Using Large Language Models

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: Vaccination is important to minimize the risk and spread of various diseases. In recent years, vaccination has been a key step in countering the COVID-19 pandemic. However, many people are skeptical about the use of vaccines for various reasons, including the politics involved, the potential side effects of vaccines, etc. The goal in this task is to build an effective multi-label classifier to lab… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

  35. arXiv:2312.04601  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    Weak Supervision Performance Evaluation via Partial Identification

    Authors: Felipe Maia Polo, Subha Maity, Mikhail Yurochkin, Moulinath Banerjee, Yuekai Sun

    Abstract: Programmatic Weak Supervision (PWS) enables supervised model training without direct access to ground truth labels, utilizing weak labels from heuristics, crowdsourcing, or pre-trained models. However, the absence of ground truth complicates model evaluation, as traditional metrics such as accuracy, precision, and recall cannot be directly calculated. In this work, we present a novel method to add… ▽ More

    Submitted 31 October, 2024; v1 submitted 7 December, 2023; originally announced December 2023.

    Comments: NeurIPS 2024

  36. arXiv:2312.01087  [pdf, other

    cs.CL cs.AI

    Prompted Zero-Shot Multi-label Classification of Factual Incorrectness in Machine-Generated Summaries

    Authors: Aniket Deroy, Subhankar Maity, Saptarshi Ghosh

    Abstract: This study addresses the critical issue of factual inaccuracies in machine-generated text summaries, an increasingly prevalent issue in information dissemination. Recognizing the potential of such errors to compromise information reliability, we investigate the nature of factual inconsistencies across machine-summarized content. We introduce a prompt-based classification system that categorizes er… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  37. arXiv:2312.01032  [pdf, other

    cs.CL cs.AI

    Harnessing the Power of Prompt-based Techniques for Generating School-Level Questions using Large Language Models

    Authors: Subhankar Maity, Aniket Deroy, Sudeshna Sarkar

    Abstract: Designing high-quality educational questions is a challenging and time-consuming task. In this work, we propose a novel approach that utilizes prompt-based techniques to generate descriptive and reasoning-based questions. However, current question-answering (QA) datasets are inadequate for conducting our experiments on prompt-based question generation (QG) in an educational setting. Therefore, we… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

  38. arXiv:2312.00554  [pdf, other

    cs.CL cs.AI

    Questioning Biases in Case Judgment Summaries: Legal Datasets or Large Language Models?

    Authors: Aniket Deroy, Subhankar Maity

    Abstract: The evolution of legal datasets and the advent of large language models (LLMs) have significantly transformed the legal field, particularly in the generation of case judgment summaries. However, a critical concern arises regarding the potential biases embedded within these summaries. This study scrutinizes the biases present in case judgment summaries produced by legal datasets and large language… ▽ More

    Submitted 1 December, 2023; originally announced December 2023.

  39. arXiv:2310.01583  [pdf, other

    stat.ML cs.LG

    An Investigation of Representation and Allocation Harms in Contrastive Learning

    Authors: Subha Maity, Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun

    Abstract: The effect of underrepresentation on the performance of minority groups is known to be a serious problem in supervised learning settings; however, it has been underexplored so far in the context of self-supervised learning (SSL). In this paper, we demonstrate that contrastive learning (CL), a popular variant of SSL, tends to collapse representations of minority groups with certain majority groups.… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  40. arXiv:2308.00005  [pdf

    cs.IR

    Detection and Classification of Novel Attacks and Anomaly in IoT Network using Rule based Deep Learning Model

    Authors: Sanjay Chakraborty, Saroj Kumar Pandey, Saikat Maity, Lopamudra Dey

    Abstract: Attackers are now using sophisticated techniques, like polymorphism, to change the attack pattern for each new attack. Thus, the detection of novel attacks has become the biggest challenge for cyber experts and researchers. Recently, anomaly and hybrid approaches are used for the detection of network attacks. Detecting novel attacks, on the other hand, is a key enabler for a wide range of IoT appl… ▽ More

    Submitted 29 July, 2023; originally announced August 2023.

  41. Image Hash Minimization for Tamper Detection

    Authors: Subhajit Maity, Ram Kumar Karsh

    Abstract: Tamper detection using image hash is a very common problem of modern days. Several research and advancements have already been done to address this problem. However, most of the existing methods lack the accuracy of tamper detection when the tampered area is low, as well as requiring long image hashes. In this paper, we propose a novel method objectively to minimize the hash length while enhancing… ▽ More

    Submitted 28 May, 2023; originally announced May 2023.

    Comments: Published at the 9th International Conference on Advances in Pattern Recognition, 2017

    Journal ref: 2017 Ninth International Conference on Advances in Pattern Recognition (ICAPR), Bangalore, India, 2017, pp. 1-6

  42. arXiv:2305.11292  [pdf, other

    physics.chem-ph cs.LG physics.comp-ph

    Multi-Fidelity Machine Learning for Excited State Energies of Molecules

    Authors: Vivin Vinod, Sayan Maity, Peter Zaspel, Ulrich Kleinekathöfer

    Abstract: The accurate but fast calculation of molecular excited states is still a very challenging topic. For many applications, detailed knowledge of the energy funnel in larger molecular aggregates is of key importance requiring highly accurate excited state energies. To this end, machine learning techniques can be an extremely useful tool though the cost of generating highly accurate training datasets s… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

  43. SelfDocSeg: A Self-Supervised vision-based Approach towards Document Segmentation

    Authors: Subhajit Maity, Sanket Biswas, Siladittya Manna, Ayan Banerjee, Josep Lladós, Saumik Bhattacharya, Umapada Pal

    Abstract: Document layout analysis is a known problem to the documents research community and has been vastly explored yielding a multitude of solutions ranging from text mining, and recognition to graph-based representation, visual feature extraction, etc. However, most of the existing works have ignored the crucial fact regarding the scarcity of labeled data. With growing internet connectivity to personal… ▽ More

    Submitted 20 August, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted at The 17th International Conference on Document Analysis and Recognition (ICDAR 2023)

    Journal ref: ICDAR 2023 (International Conference on Document Analysis and Recognition) Lecture Notes in Computer Science, vol 14187, pp. 342-360. Springer Nature

  44. arXiv:2304.06574  [pdf, other

    stat.ML cs.LG

    Bayes classifier cannot be learned from noisy responses with unknown noise rates

    Authors: Soham Bakshi, Subha Maity

    Abstract: Training a classifier with noisy labels typically requires the learner to specify the distribution of label noise, which is often unknown in practice. Although there have been some recent attempts to relax that requirement, we show that the Bayes decision rule is unidentified in most classification problems with noisy labels. This suggests it is generally not possible to bypass/relax the requireme… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Invited to present in ICLR Tiny Paper 2023

  45. arXiv:2303.14732  [pdf

    cs.DL cs.SI econ.GN

    Interdisciplinary Papers Supported by Disciplinary Grants Garner Deep and Broad Scientific Impact

    Authors: Minsu Park, Suman Kalyan Maity, Stefan Wuchty, Dashun Wang

    Abstract: Interdisciplinary research has emerged as a hotbed for innovation and a key approach to addressing complex societal challenges. The increasing dominance of grant-supported research in shaping scientific advances, coupled with growing interest in funding interdisciplinary work, raises fundamental questions about the effectiveness of interdisciplinary grants in fostering high-impact interdisciplinar… ▽ More

    Submitted 14 March, 2025; v1 submitted 26 March, 2023; originally announced March 2023.

  46. arXiv:2303.10866  [pdf, other

    cs.DS

    An Improved Exact Algorithm for Knot-Free Vertex Deletion

    Authors: Ajaykrishnan E S, Soumen Maity, Abhishek Sahu, Saket Saurabh

    Abstract: A knot $K$ in a directed graph $D$ is a strongly connected component of size at least two such that there is no arc $(u,v)$ with $u \in V(K)$ and $v\notin V(K)$. Given a directed graph $D=(V,E)$, we study Knot-Free Vertex Deletion (KFVD), where the goal is to remove the minimum number of vertices such that the resulting graph contains no knots. This problem naturally emerges from its application i… ▽ More

    Submitted 20 March, 2023; originally announced March 2023.

  47. arXiv:2302.09795  [pdf, other

    cs.LG cs.CV stat.ML

    Simple Disentanglement of Style and Content in Visual Representations

    Authors: Lilian Ngweta, Subha Maity, Alex Gittens, Yuekai Sun, Mikhail Yurochkin

    Abstract: Learning visual representations with interpretable features, i.e., disentangled representations, remains a challenging problem. Existing methods demonstrate some success but are hard to apply to large-scale vision datasets like ImageNet. In this work, we propose a simple post-processing framework to disentangle content and style in learned representations from pre-trained vision models. We model t… ▽ More

    Submitted 31 May, 2023; v1 submitted 20 February, 2023; originally announced February 2023.

    Comments: International Conference on Machine Learning (ICML) 2023

  48. arXiv:2211.05829  [pdf, other

    cs.CY cs.LG

    A Machine Learning system to monitor student progress in educational institutes

    Authors: Bibhuprasad Mahakud, Bibhuti Parida, Ipsit Panda, Souvik Maity, Arpita Sahoo, Reeta Sharma

    Abstract: In order to track and comprehend the academic achievement of students, both private and public educational institutions devote a significant amount of resources and labour. One of the difficult issues that institutes deal with on a regular basis is understanding the exam shortcomings of students. The performance of a student is influenced by a variety of factors, including attendance, attentivenes… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

    Comments: 9 pages, 7 figures

  49. arXiv:2209.06378  [pdf, other

    cs.HC

    RMExplorer: A Visual Analytics Approach to Explore the Performance and the Fairness of Disease Risk Models on Population Subgroups

    Authors: Bum Chul Kwon, Uri Kartoun, Shaan Khurshid, Mikhail Yurochkin, Subha Maity, Deanna G Brockman, Amit V Khera, Patrick T Ellinor, Steven A Lubitz, Kenney Ng

    Abstract: Disease risk models can identify high-risk patients and help clinicians provide more personalized care. However, risk models developed on one dataset may not generalize across diverse subpopulations of patients in different datasets and may have unexpected performance. It is challenging for clinical researchers to inspect risk models across different subgroups without any tools. Therefore, we deve… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: IEEE VIS 2022 Short

  50. arXiv:2209.05556  [pdf, other

    cs.RO eess.SY

    Fragile object transportation by a multi-robot system in an unknown environment using a semi-decentralized control approach

    Authors: Dibyendu Roy, Sreejeet Maity, Madhubanti Maitra, Samar Bhattacharya

    Abstract: In this paper, we introduce a semi-decentralized control technique for a swarm of robots transporting a fragile object to a destination in an uncertain occluded environment.The proposed approach has been split into two parts. The initial part (Phase 1) includes a centralized control strategy for creating a specific formation among the agents so that the object to be transported, can be positioned… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 7 pages,8 figures, IEEE International Conference on Robotics and Automation (ICRA) 2023