Showing 1–50 of 183 results for author: Bhattacharya, A

  1. arXiv:2507.02920  [pdf, ps, other]

    cs.HC cs.AI cs.LG

    Visual-Conversational Interface for Evidence-Based Explanation of Diabetes Risk Prediction

    Authors: Reza Samimi, Aditya Bhattacharya, Lucija Gosak, Gregor Stiglic, Katrien Verbert

    Abstract: Healthcare professionals need effective ways to use, understand, and validate AI-driven clinical decision support systems. Existing systems face two key limitations: complex visualizations and a lack of grounding in scientific evidence. We present an integrated decision support system that combines interactive visualizations with a conversational agent to explain diabetes risk assessments. We prop…

    Submitted 25 June, 2025; originally announced July 2025.

    Comments: 18 pages, 5 figures, 7th ACM Conference on Conversational User Interfaces

  2. arXiv:2507.01913  [pdf, ps, other]

    cond-mat.mtrl-sci cs.LG

    Advancing Magnetic Materials Discovery -- A structure-based machine learning approach for magnetic ordering and magnetic moment prediction

    Authors: Apoorv Verma, Junaid Jami, Amrita Bhattacharya

    Abstract: Accurately predicting magnetic behavior across diverse materials systems remains a longstanding challenge due to the complex interplay of structural and electronic factors and is pivotal for the accelerated discovery and design of next-generation magnetic materials. In this work, a refined descriptor is proposed that significantly improves the prediction of two critical magnetic properties -- magn…

    Submitted 2 July, 2025; originally announced July 2025.

  3. arXiv:2506.18770  [pdf, ps, other]

    cs.HC

    Importance of User Control in Data-Centric Steering for Healthcare Experts

    Authors: Aditya Bhattacharya, Simone Stumpf, Katrien Verbert

    Abstract: As Artificial Intelligence (AI) becomes increasingly integrated into high-stakes domains like healthcare, effective collaboration between healthcare experts and AI systems is critical. Data-centric steering, which involves fine-tuning prediction models by improving training data quality, plays a key role in this process. However, little research has explored how varying levels of user control affe…

    Submitted 22 May, 2025; originally announced June 2025.

    Comments: It is a pre-print version. For the full paper, please view the actual published version

  4. arXiv:2506.16247  [pdf, ps, other]

    cs.CL

    Comparative Analysis of Abstractive Summarization Models for Clinical Radiology Reports

    Authors: Anindita Bhattacharya, Tohida Rehman, Debarshi Kumar Sanyal, Samiran Chattopadhyay

    Abstract: The findings section of a radiology report is often detailed and lengthy, whereas the impression section is comparatively more compact and captures key diagnostic conclusions. This research explores the use of advanced abstractive summarization models to generate the concise impression from the findings section of a radiology report. We have used the publicly available MIMIC-CXR dataset. A compara…

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: 14 pages, 2 figures, 6 tables

  5. arXiv:2506.13886  [pdf, ps, other]

    cs.CL cs.AI

    Investigating the interaction of linguistic and mathematical reasoning in language models using multilingual number puzzles

    Authors: Antara Raaghavi Bhattacharya, Isabel Papadimitriou, Kathryn Davidson, David Alvarez-Melis

    Abstract: Across languages, numeral systems vary widely in how they construct and combine numbers. While humans consistently learn to navigate this diversity, large language models (LLMs) struggle with linguistic-mathematical puzzles involving cross-linguistic numeral systems, which humans can learn to solve successfully. We investigate why this task is difficult for LLMs through a series of experiments tha…

    Submitted 16 June, 2025; originally announced June 2025.

  6. arXiv:2506.13327  [pdf, ps, other]

    cs.CV

    Joint Analysis of Optical and SAR Vegetation Indices for Vineyard Monitoring: Assessing Biomass Dynamics and Phenological Stages over Po Valley, Italy

    Authors: Andrea Bergamaschi, Abhinav Verma, Avik Bhattacharya, Fabio Dell'Acqua

    Abstract: Multi-polarized Synthetic Aperture Radar (SAR) technology has gained increasing attention in agriculture, offering unique capabilities for monitoring vegetation dynamics thanks to its all-weather, day-and-night operation and high revisit frequency. This study presents, for the first time, a comprehensive analysis combining dual-polarimetric radar vegetation index (DpRVI) with optical indices to ch…

    Submitted 16 June, 2025; originally announced June 2025.

  7. arXiv:2506.12181  [pdf, ps, other]

    cs.LG cs.CL

    Generative or Discriminative? Revisiting Text Classification in the Era of Transformers

    Authors: Siva Rajesh Kasa, Karan Gupta, Sumegh Roychowdhury, Ashutosh Kumar, Yaswanth Biruduraju, Santhosh Kumar Kasa, Nikhil Priyatam Pattisapu, Arindam Bhattacharya, Shailendra Agarwal, Vijay Huddar

    Abstract: The comparison between discriminative and generative classifiers has intrigued researchers since Efron's seminal analysis of logistic regression versus discriminant analysis. While early theoretical work established that generative classifiers exhibit lower sample complexity but higher asymptotic error in simple linear settings, these trade-offs remain unexplored in the transformer era. We present…

    Submitted 13 June, 2025; originally announced June 2025.

    Comments: 19 pages

  8. arXiv:2505.20312  [pdf, ps, other]

    cs.CY cs.AI cs.MA

    Let's Get You Hired: A Job Seeker's Perspective on Multi-Agent Recruitment Systems for Explaining Hiring Decisions

    Authors: Aditya Bhattacharya, Katrien Verbert

    Abstract: During job recruitment, traditional applicant selection methods often lack transparency. Candidates are rarely given sufficient justifications for recruiting decisions, whether they are made manually by human recruiters or through the use of black-box Applicant Tracking Systems (ATS). To address this problem, our work introduces a multi-agent AI system that uses Large Language Models (LLMs) to gui…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Pre-print version only. Please check the published version for any reference or citation

  9. arXiv:2505.17102  [pdf]

    cs.CL

    BanglaByT5: Byte-Level Modelling for Bangla

    Authors: Pramit Bhattacharyya, Arnab Bhattacharya

    Abstract: Large language models (LLMs) have achieved remarkable success across various natural language processing tasks. However, most LLM models use traditional tokenizers like BPE and SentencePiece, which fail to capture the finer nuances of a morphologically rich language like Bangla (Bengali). In this work, we introduce BanglaByT5, the first byte-level encoder-decoder model explicitly tailored for Bang…

    Submitted 21 May, 2025; originally announced May 2025.

  10. arXiv:2505.13173  [pdf, ps, other]

    cs.CL

    A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs

    Authors: V. S. D. S. Mahesh Akavarapu, Hrishikesh Terdalkar, Pramit Bhattacharyya, Shubhangi Agarwal, Vishakha Deulgaonkar, Pralay Manna, Chaitali Dangarikar, Arnab Bhattacharya

    Abstract: Large Language Models (LLMs) have demonstrated remarkable generalization capabilities across diverse tasks and languages. In this study, we focus on natural language understanding in three classical languages -- Sanskrit, Ancient Greek and Latin -- to investigate the factors affecting cross-lingual zero-shot generalization. First, we explore named entity recognition and machine translation into En…

    Submitted 31 May, 2025; v1 submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 Findings

    ACM Class: I.2.7

  11. arXiv:2505.04634  [pdf, other]

    cs.LG cs.CE

    MatMMFuse: Multi-Modal Fusion model for Material Property Prediction

    Authors: Abhiroop Bhattacharya, Sylvain G. Cloutier

    Abstract: The recent progress of using graph-based encoding of crystal structures for high-throughput material property prediction has been quite successful. However, using a single modality model prevents us from exploiting the advantages of an enhanced feature space by combining different representations. Specifically, pre-trained large language models (LLMs) can encode a large amount of knowledge which i…

    Submitted 30 April, 2025; originally announced May 2025.

    Comments: Presented at AI for Accelerated Materials Design (AI4Mat), ICLR 2025 (https://openreview.net/forum?id=pN4Zg6HBlq#discussion)

  12. Show Me How: Benefits and Challenges of Agent-Augmented Counterfactual Explanations for Non-Expert Users

    Authors: Aditya Bhattacharya, Tim Vanherwegen, Katrien Verbert

    Abstract: Counterfactual explanations offer actionable insights by illustrating how changes to inputs can lead to different outcomes. However, these explanations often suffer from ambiguity and impracticality, limiting their utility for non-expert users with limited AI knowledge. Augmenting counterfactual explanations with Large Language Models (LLMs) has been proposed as a solution, but little research has…

    Submitted 6 April, 2025; originally announced April 2025.

    Comments: This is a pre-print version, the original version is available in the proceedings of ACM UMAP 2025

  13. arXiv:2504.06994  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration

    Authors: Omar Alama, Avigyan Bhattacharya, Haoyang He, Seungchan Kim, Yuheng Qiu, Wenshan Wang, Cherie Ho, Nikhil Keetha, Sebastian Scherer

    Abstract: Open-set semantic mapping is crucial for open-world robots. Current mapping approaches either are limited by the depth range or only map beyond-range entities in constrained settings, where overall they fail to combine within-range and beyond-range observations. Furthermore, these methods make a trade-off between fine-grained semantics and efficiency. We introduce RayFronts, a unified representati…

    Submitted 9 April, 2025; originally announced April 2025.

  14. arXiv:2504.04737  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    TathyaNyaya and FactLegalLlama: Advancing Factual Judgment Prediction and Explanation in the Indian Legal Context

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In the landscape of Fact-based Judgment Prediction and Explanation (FJPE), reliance on factual data is essential for developing robust and realistic AI-driven decision-making tools. This paper introduces TathyaNyaya, the largest annotated dataset for FJPE tailored to the Indian legal context, encompassing judgments from the Supreme Court of India and various High Courts. Derived from the Hindi ter…

    Submitted 7 April, 2025; originally announced April 2025.

  15. arXiv:2504.03486  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Structured Legal Document Generation in India: A Model-Agnostic Wrapper Approach with VidhikDastaavej

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Ajay Varghese Thomas, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: Automating legal document drafting can significantly enhance efficiency, reduce manual effort, and streamline legal workflows. While prior research has explored tasks such as judgment prediction and case summarization, the structured generation of private legal documents in the Indian legal domain remains largely unaddressed. To bridge this gap, we introduce VidhikDastaavej, a novel, anonymized da…

    Submitted 4 April, 2025; originally announced April 2025.

  16. arXiv:2502.15333  [pdf, ps, other]

    cs.DS

    Improved Sublinear-time Moment Estimation using Weighted Sampling

    Authors: Anup Bhattacharya, Pinki Pradhan

    Abstract: In this work we study the moment estimation problem using weighted sampling. Given sample access to a set $A$ with $n$ weighted elements, and a parameter $t>0$, we estimate the $t$-th moment of $A$ given as $S_t=\sum_{a\in A} w(a)^t$. For $t=1$, this is the sum estimation problem. The moment estimation problem along with a number of its variants has been extensively studied in streaming, subl…

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: Abstract shortened to meet submission criteria

  17. arXiv:2502.05836  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    LegalSeg: Unlocking the Structure of Indian Legal Judgments Through Rhetorical Role Classification

    Authors: Shubham Kumar Nigam, Tanmay Dubey, Govind Sharma, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In this paper, we address the task of semantic segmentation of legal documents through rhetorical role classification, with a focus on Indian legal judgments. We introduce LegalSeg, the largest annotated dataset for this task, comprising over 7,000 documents and 1.4 million sentences, labeled with 7 rhetorical roles. To benchmark performance, we evaluate multiple state-of-the-art models, including…

    Submitted 9 February, 2025; originally announced February 2025.

    Comments: Accepted at NAACL 2025

  18. arXiv:2501.03988  [pdf]

    cs.CL

    Semantically Cohesive Word Grouping in Indian Languages

    Authors: N J Karthika, Adyasha Patra, Nagasai Saketh Naidu, Arnab Bhattacharya, Ganesh Ramakrishnan, Chaitali Dangarikar

    Abstract: Indian languages are inflectional and agglutinative and typically follow clause-free word order. The structure of sentences across most major Indian languages is similar when their dependency parse trees are considered. While some differences in the parsing structure occur due to peculiarities of a language or its preferred natural way of conveying meaning, several apparent differences are simply…

    Submitted 7 January, 2025; originally announced January 2025.

  19. Explanatory Debiasing: Involving Domain Experts in the Data Generation Process to Mitigate Representation Bias in AI Systems

    Authors: Aditya Bhattacharya, Simone Stumpf, Robin De Croon, Katrien Verbert

    Abstract: Representation bias is one of the most common types of biases in artificial intelligence (AI) systems, causing AI models to perform poorly on underrepresented data segments. Although AI practitioners use various methods to reduce representation bias, their effectiveness is often constrained by insufficient domain knowledge in the debiasing process. To address this gap, this paper introduces a set…

    Submitted 27 February, 2025; v1 submitted 26 December, 2024; originally announced January 2025.

    Comments: Pre-print version, please cite the main article instead of the pre-print version

    Journal ref: ACM CHI 2025

  20. arXiv:2412.17853  [pdf, other]

    cs.LG

    Zero Shot Time Series Forecasting Using Kolmogorov Arnold Networks

    Authors: Abhiroop Bhattacharya, Nandinee Haq

    Abstract: Accurate energy price forecasting is crucial for participants in day-ahead energy markets, as it significantly influences their decision-making processes. While machine learning-based approaches have shown promise in enhancing these forecasts, they often remain confined to the specific markets on which they are trained, thereby limiting their adaptability to new or unseen markets. In this paper, w…

    Submitted 14 February, 2025; v1 submitted 19 December, 2024; originally announced December 2024.

    Comments: Published In: 2024 NeurIPS Workshop on Time Series in the Age of Large Models

  21. arXiv:2412.08385  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    NyayaAnumana & INLegalLlama: The Largest Indian Legal Judgment Prediction Dataset and Specialized Language Model for Enhanced Decision Analysis

    Authors: Shubham Kumar Nigam, Balaramamahanthi Deepak Patnaik, Shivam Mishra, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: The integration of artificial intelligence (AI) in legal judgment prediction (LJP) has the potential to transform the legal landscape, particularly in jurisdictions like India, where a significant backlog of cases burdens the legal system. This paper introduces NyayaAnumana, the largest and most diverse corpus of Indian legal cases compiled for LJP, encompassing a total of 7,02,945 preprocessed ca…

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: Accepted at COLING 2025

  22. arXiv:2411.14571  [pdf, other]

    cs.CR cs.AI cs.CL cs.HC

    Assessment of LLM Responses to End-user Security Questions

    Authors: Vijay Prakash, Kevin Lee, Arkaprabha Bhattacharya, Danny Yuxing Huang, Jessica Staddon

    Abstract: Answering end user security questions is challenging. While large language models (LLMs) like GPT, LLAMA, and Gemini are far from error-free, they have shown promise in answering a variety of questions outside of security. We studied LLM performance in the area of end user security by qualitatively evaluating 3 popular LLMs on 900 systematically collected end user security questions. While LLMs…

    Submitted 21 November, 2024; originally announced November 2024.

    Comments: 18 pages, 1 figure, 8 tables

  23. arXiv:2411.03303  [pdf, other]

    cs.RO

    Monocular Event-Based Vision for Obstacle Avoidance with a Quadrotor

    Authors: Anish Bhattacharya, Marco Cannici, Nishanth Rao, Yuezhan Tao, Vijay Kumar, Nikolai Matni, Davide Scaramuzza

    Abstract: We present the first static-obstacle avoidance method for quadrotors using just an onboard, monocular event camera. Quadrotors are capable of fast and agile flight in cluttered environments when piloted manually, but vision-based autonomous flight in unknown environments is difficult in part due to the sensor limitations of traditional onboard cameras. Event cameras, however, promise nearly zero m…

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: 18 pages with supplementary

    Journal ref: Conference on Robot Learning (CoRL), Munich, Germany, 2024

  24. arXiv:2410.10542  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    Rethinking Legal Judgement Prediction in a Realistic Scenario in the Era of Large Language Models

    Authors: Shubham Kumar Nigam, Aniket Deroy, Subhankar Maity, Arnab Bhattacharya

    Abstract: This study investigates judgment prediction in a realistic scenario within the context of Indian judgments, utilizing a range of transformer-based models, including InLegalBERT, BERT, and XLNet, alongside LLMs such as Llama-2 and GPT-3.5 Turbo. In this realistic scenario, we simulate how judgments are predicted at the point when a case is presented for a decision in court, using only the informati…

    Submitted 14 October, 2024; originally announced October 2024.

    Comments: Accepted at NLLP, EMNLP 2024

  25. arXiv:2410.09176  [pdf, other]

    cs.CV

    Cross-Domain Evaluation of Few-Shot Classification Models: Natural Images vs. Histopathological Images

    Authors: Ardhendu Sekhar, Aditya Bhattacharya, Vinayak Goyal, Vrinda Goel, Aditya Bhangale, Ravi Kant Gupta, Amit Sethi

    Abstract: In this study, we investigate the performance of few-shot classification models across different domains, specifically natural images and histopathological images. We first train several few-shot classification models on natural images and evaluate their performance on histopathological images. Subsequently, we train the same models on histopathological images and compare their performance. We inc…

    Submitted 11 October, 2024; originally announced October 2024.

  26. arXiv:2409.20157  [pdf, other]

    cs.DS

    RSVP: Beyond Weisfeiler Lehman Graph Isomorphism Test

    Authors: Sourav Dutta, Arnab Bhattacharya

    Abstract: Graph isomorphism, a classical algorithmic problem, determines whether two input graphs are structurally identical or not. Interestingly, it is one of the few problems that is not yet known to belong to either the P or NP-complete complexity classes. As such, intelligent search-space pruning based strategies were proposed for developing isomorphism testing solvers like nauty and bliss, which are s…

    Submitted 30 September, 2024; originally announced September 2024.

  27. arXiv:2408.17171  [pdf, other]

    cs.LG

    SafeTail: Efficient Tail Latency Optimization in Edge Service Scheduling via Computational Redundancy Management

    Authors: Jyoti Shokhanda, Utkarsh Pal, Aman Kumar, Soumi Chattopadhyay, Arani Bhattacharya

    Abstract: Optimizing tail latency while efficiently managing computational resources is crucial for delivering high-performance, latency-sensitive services in edge computing. Emerging applications, such as augmented reality, require low-latency computing services with high reliability on user devices, which often have limited computational capabilities. Consequently, these devices depend on nearby edge serv…

    Submitted 30 August, 2024; originally announced August 2024.

    Comments: This work has been submitted to the IEEE for possible publication

  28. arXiv:2408.12274  [pdf, other]

    cs.NI

    A Deadline-Aware Scheduler for Smart Factory using WiFi 6

    Authors: Mohit Jain, Anis Mishra, Syamantak Das, Andreas Wiese, Arani Bhattacharya, Mukulika Maity

    Abstract: A key strategy for making production in factories more efficient is to collect data about the functioning of machines, and dynamically adapt their working. Such smart factories have data packets with a mix of stringent and non-stringent deadlines with varying levels of importance that need to be delivered via a wireless network. However, the scheduling of packets in the wireless network is crucial…

    Submitted 22 August, 2024; originally announced August 2024.

  29. Representation Debiasing of Generated Data Involving Domain Experts

    Authors: Aditya Bhattacharya, Simone Stumpf, Katrien Verbert

    Abstract: Biases in Artificial Intelligence (AI) or Machine Learning (ML) systems due to skewed datasets problematise the application of prediction models in practice. Representation bias is a prevalent form of bias found in the majority of datasets. This bias arises when training data inadequately represents certain segments of the data space, resulting in poor generalisation of prediction models. Despite…

    Submitted 17 May, 2024; originally announced July 2024.

    Comments: Pre-print of a paper accepted for ACM UMAP 2024

    Journal ref: Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization (UMAP Adjunct '24), July 1--4, 2024, Cagliari, Italy

  30. arXiv:2407.05280  [pdf, other]

    cs.DC

    Perpetual Exploration of a Ring in Presence of Byzantine Black Hole

    Authors: Pritam Goswami, Adri Bhattacharya, Raja Das, Partha Sarathi Mandal

    Abstract: Perpetual exploration is a fundamental problem in the domain of mobile agents, where an agent needs to visit each node infinitely often. This issue has received a lot of attention, mainly for ring topologies; the presence of black holes adds more complexity. A black hole can destroy any incoming agent without any observable trace. In \cite{BampasImprovedPeriodicDataRetrieval,KralovivcPeriodicDataRetriev…

    Submitted 14 November, 2024; v1 submitted 7 July, 2024; originally announced July 2024.

  31. arXiv:2406.14284  [pdf]

    cs.CL cs.AI

    Leveraging LLMs for Bangla Grammar Error Correction: Error Categorization, Synthetic Data, and Model Evaluation

    Authors: Pramit Bhattacharyya, Arnab Bhattacharya

    Abstract: Large Language Models (LLMs) perform exceedingly well in Natural Language Understanding (NLU) tasks for many languages including English. However, despite being the fifth most-spoken language globally, Grammatical Error Correction (GEC) in Bangla remains underdeveloped. In this work, we investigate how LLMs can be leveraged for improving Bangla GEC. For that, we first do an extensive categorizatio…

    Submitted 5 June, 2025; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at ACL Findings, 2025

  32. arXiv:2406.04136  [pdf, other]

    cs.CL cs.AI cs.LG

    Legal Judgment Reimagined: PredEx and the Rise of Intelligent AI Interpretation in Indian Courts

    Authors: Shubham Kumar Nigam, Anurag Sharma, Danush Khanna, Noel Shallum, Kripabandhu Ghosh, Arnab Bhattacharya

    Abstract: In the era of Large Language Models (LLMs), predicting judicial outcomes poses significant challenges due to the complexity of legal proceedings and the scarcity of expert-annotated datasets. Addressing this, we introduce Prediction with Explanation (PredEx), the largest expert-annotated dataset for legal judgment prediction and explanation in the Indian context, featuri…

    Submitted 6 June, 2024; originally announced June 2024.

  33. arXiv:2406.00375  [pdf, other]

    cs.RO

    Teledrive: An Embodied AI based Telepresence System

    Authors: Snehasis Banerjee, Sayan Paul, Ruddradev Roychoudhury, Abhijan Bhattacharya, Chayan Sarkar, Ashis Sau, Pradip Pramanick, Brojeshwar Bhowmick

    Abstract: This article presents Teledrive, a telepresence robotic system with embodied AI features that empowers an operator to navigate the telerobot in any unknown remote place with minimal human intervention. We conceive Teledrive in the context of democratizing remote care-giving for elderly citizens as well as for isolated patients, affected by contagious diseases. In particular, this paper focuses on…

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: Accepted in Journal of Intelligent Robotic System

    Journal ref: Journal of Intelligent Robotic System 2024

  34. An Explanatory Model Steering System for Collaboration between Domain Experts and AI

    Authors: Aditya Bhattacharya, Simone Stumpf, Katrien Verbert

    Abstract: With the increasing adoption of Artificial Intelligence (AI) systems in high-stake domains, such as healthcare, effective collaboration between domain experts and AI is imperative. To facilitate effective collaboration between domain experts and AI systems, we introduce an Explanatory Model Steering system that allows domain experts to steer prediction models using their domain knowledge. The syst…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Demo paper accepted for ACM UMAP 2024

    Journal ref: Adjunct Proceedings of the 32nd ACM Conference on User Modeling, Adaptation and Personalization (UMAP Adjunct '24), July 1--4, 2024, Cagliari, Italy

  35. arXiv:2405.10391  [pdf, other]

    cs.RO cs.AI eess.IV

    Vision Transformers for End-to-End Vision-Based Quadrotor Obstacle Avoidance

    Authors: Anish Bhattacharya, Nishanth Rao, Dhruv Parikh, Pratik Kunapuli, Yuwei Wu, Yuezhan Tao, Nikolai Matni, Vijay Kumar

    Abstract: We demonstrate the capabilities of an attention-based end-to-end approach for high-speed vision-based quadrotor obstacle avoidance in dense, cluttered environments, with comparison to various state-of-the-art learning architectures. Quadrotor unmanned aerial vehicles (UAVs) have tremendous maneuverability when flown fast; however, as flight speed increases, traditional model-based approaches to na…

    Submitted 1 April, 2025; v1 submitted 16 May, 2024; originally announced May 2024.

    Comments: 11 pages, 18 figures, 3 tables (with supplementary)

  36. arXiv:2405.06295  [pdf, other]

    cs.CL cs.AI

    Aspect-oriented Consumer Health Answer Summarization

    Authors: Rochana Chaturvedi, Abari Bhattacharya, Shweta Yadav

    Abstract: Community Question-Answering (CQA) forums have revolutionized how people seek information, especially those related to their healthcare needs, placing their trust in the collective wisdom of the public. However, there can be several answers in response to a single query, which makes it hard to grasp the key information related to the specific health concern. Typically, CQA forums feature a single…

    Submitted 10 May, 2024; originally announced May 2024.

    ACM Class: H.4.3; I.2.7; J.3; J.7; K.6.4

  37. arXiv:2404.14395  [pdf, other]

    cs.CL cs.AI cs.LG

    PARAMANU-GANITA: Can Small Math Language Models Rival with Large Language Models on Mathematical Reasoning?

    Authors: Mitodru Niyogi, Arnab Bhattacharya

    Abstract: In this paper, we study whether domain specific pretraining of small generative language models (SLM) from scratch with domain specialized tokenizer and Chain-of-Thought (CoT) instruction fine-tuning results in competitive performance on mathematical reasoning compared to LLMs? Secondly, whether this approach is environmentally sustainable, highly cost efficient? To address these research question…

    Submitted 5 March, 2025; v1 submitted 22 April, 2024; originally announced April 2024.

  38. arXiv:2404.00284  [pdf, other]

    cs.CL

    A Likelihood Ratio Test of Genetic Relationship among Languages

    Authors: V. S. D. S. Mahesh Akavarapu, Arnab Bhattacharya

    Abstract: Lexical resemblances among a group of languages indicate that the languages could be genetically related, i.e., they could have descended from a common ancestral language. However, such resemblances can arise by chance and, hence, need not always imply an underlying genetic relationship. Many tests of significance based on permutation of wordlists and word similarity measures appeared in the past…

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: Accepted at NAACL-2024 (Main Conference)

    ACM Class: I.2.7

  39. arXiv:2403.14696  [pdf, other]

    cs.CY cs.GR cs.SI

    MOTIV: Visual Exploration of Moral Framing in Social Media

    Authors: Andrew Wentzel, Lauren Levine, Vipul Dhariwal, Zarah Fatemi, Abarai Bhattacharya, Barbara Di Eugenio, Andrew Rojecki, Elena Zheleva, G. Elisabeta Marai

    Abstract: We present a visual computing framework for analyzing moral rhetoric on social media around controversial topics. Using Moral Foundation Theory, we propose a methodology for deconstructing and visualizing the when, where, and who behind each of these moral dimensions as expressed in microblog data. We characterize the design of this framework, developed in collaboration…

    Submitted 15 March, 2024; originally announced March 2024.

  40. arXiv:2403.13944  [pdf, other]

    cs.CY cs.CR cs.HC

    Shortchanged: Uncovering and Analyzing Intimate Partner Financial Abuse in Consumer Complaints

    Authors: Arkaprabha Bhattacharya, Kevin Lee, Vineeth Ravi, Jessica Staddon, Rosanna Bellini

    Abstract: Digital financial services can introduce new digital-safety risks for users, particularly survivors of intimate partner financial abuse (IPFA). To offer improved support for such users, a comprehensive understanding of their support needs and the barriers they face to redress by financial institutions is essential. Drawing from a dataset of 2.7 million customer complaints, we implement a bespoke w…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: 20 pages, 9 figures, 8 tables, This paper will be published in CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing Systems

  41. arXiv:2403.13681  [pdf, other]

    cs.CL cs.AI cs.LG

    PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?

    Authors: Mitodru Niyogi, Arnab Bhattacharya

    Abstract: In this paper, we present Paramanu-Ayn, a collection of legal language models trained exclusively on Indian legal case documents. This 97-million-parameter Auto-Regressive (AR) decoder-only model was pretrained from scratch with a context size of 8192 on a single GPU for just 185 hours, achieving an efficient MFU of 41.35. We also developed a legal domain specialized BPE tokenizer. We evaluated ou…

    Submitted 3 October, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

  42. arXiv:2402.04746  [pdf, other]

    cs.DC

    Black Hole Search in Dynamic Tori

    Authors: Adri Bhattacharya, Giuseppe F. Italiano, Partha Sarathi Mandal

    Abstract: We investigate the black hole search problem by a set of mobile agents in a dynamic torus. Black hole is defined to be a dangerous stationary node which has the capability to destroy any number of incoming agents without leaving any trace of its existence. A torus of size $n\times m$ ($3\leq n \leq m$) is a collection of $n$ row rings and $m$ column rings, and the dynamicity is such that each ring…

    Submitted 7 February, 2024; originally announced February 2024.

  43. arXiv:2402.02926  [pdf, other]

    cs.CL cs.LG cs.SI

    Automated Cognate Detection as a Supervised Link Prediction Task with Cognate Transformer

    Authors: V. S. D. S. Mahesh Akavarapu, Arnab Bhattacharya

    Abstract: Identification of cognates across related languages is one of the primary problems in historical linguistics. Automated cognate identification is helpful for several downstream tasks including identifying sound correspondences, proto-language reconstruction, phylogenetic classification, etc. Previous state-of-the-art methods for cognate identification are mostly based on distributions of phonemes…

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: Accepted to EACL-2024 main conference

    ACM Class: I.2.7

  44. EXMOS: Explanatory Model Steering Through Multifaceted Explanations and Data Configurations

    Authors: Aditya Bhattacharya, Simone Stumpf, Lucija Gosak, Gregor Stiglic, Katrien Verbert

    Abstract: Explanations in interactive machine-learning systems facilitate debugging and improving prediction models. However, the effectiveness of various global model-centric and data-centric explanations in aiding domain experts to detect and resolve potential data issues for model improvement remains unexplored. This research investigates the influence of data-centric and model-centric global explanation…

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: This is a pre-print version only for early release. Please view the conference published version from ACM CHI 2024 to get the latest version of the paper

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems (CHI '24), May 11--16, 2024, Honolulu, HI, USA

  45. arXiv:2401.18034  [pdf]

    cs.CL cs.AI

    Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages

    Authors: Mitodru Niyogi, Arnab Bhattacharya

    Abstract: We present "Paramanu", a family of novel language models (LM) for Indian languages, consisting of auto-regressive monolingual, bilingual, and multilingual models pretrained from scratch. Currently, it covers 10 languages (Assamese, Bangla, Hindi, Konkani, Maithili, Marathi, Odia, Sanskrit, Tamil, Telugu) across 5 scripts (Bangla, Devanagari, Odia, Tamil, Telugu). The models are pretrained on a sin…

    Submitted 10 October, 2024; v1 submitted 31 January, 2024; originally announced January 2024.

  46. arXiv:2401.08858  [pdf, ps, other]

    cs.OS

    File System Aging

    Authors: Alex Conway, Ainesh Bakshi, Arghya Bhattacharya, Rory Bennett, Yizheng Jiao, Eric Knorr, Yang Zhan, Michael A. Bender, William Jannen, Rob Johnson, Bradley C. Kuszmaul, Donald E. Porter, Jun Yuan, Martin Farach-Colton

    Abstract: File systems must allocate space for files without knowing what will be added or removed in the future. Over the life of a file system, this may cause suboptimal file placement decisions that eventually lead to slower performance, or aging. Conventional wisdom suggests that file system aging is a solved problem in the common case; heuristics to avoid aging, such as colocating related files and dat…

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: 36 pages, 12 figures. Article is an extension of Conway et al., FAST '17 (see https://www.usenix.org/conference/fast17/technical-sessions/presentation/conway) and Conway et al., HotStorage '19 (see https://www.usenix.org/conference/hotstorage19/presentation/conway)

    ACM Class: H.3.2; D.4.3; D.4.2; D.4.8; E.1; E.5; H.3.4

  47. Towards Directive Explanations: Crafting Explainable AI Systems for Actionable Human-AI Interactions

    Authors: Aditya Bhattacharya

    Abstract: With Artificial Intelligence (AI) becoming ubiquitous in every application domain, the need for explanations is paramount to enhance transparency and trust among non-technical users. Despite the potential shown by Explainable AI (XAI) for enhancing understanding of complex AI systems, most XAI methods are designed for technical AI experts rather than non-technical consumers. Consequently, such exp…

    Submitted 2 February, 2024; v1 submitted 29 December, 2023; originally announced January 2024.

    Comments: Pre-print version. Please check the published version in ACM CHI 2024 from the related DOI

  48. arXiv:2311.16496  [pdf, other]

    cs.LG

    Can Out-of-Domain data help to Learn Domain-Specific Prompts for Multimodal Misinformation Detection?

    Authors: Amartya Bhattacharya, Debarshi Brahma, Suraj Nagaje Mahadev, Anmol Asati, Vikas Verma, Soma Biswas

    Abstract: Spread of fake news using out-of-context images and captions has become widespread in this era of information overload. Since fake news can belong to different domains like politics, sports, etc. with their unique characteristics, inference on a test image-caption pair is contingent on how well the model has been trained on similar data. Since training individual models for each domain is not prac…

    Submitted 6 January, 2025; v1 submitted 27 November, 2023; originally announced November 2023.

  49. arXiv:2311.15812  [pdf, other]

    cs.CV

    C-SAW: Self-Supervised Prompt Learning for Image Generalization in Remote Sensing

    Authors: Avigyan Bhattacharya, Mainak Singha, Ankit Jha, Biplab Banerjee

    Abstract: We focus on domain and class generalization problems in analyzing optical remote sensing images, using the large-scale pre-trained vision-language model (VLM), CLIP. While contrastively trained VLMs show impressive zero-shot generalization performance, their effectiveness is limited when dealing with diverse domains during training and testing. Existing prompt learning techniques overlook the impo…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in ACM ICVGIP 2023

  50. arXiv:2311.10984  [pdf, other]

    cs.DC

    Black Hole Search in Dynamic Cactus Graph

    Authors: Adri Bhattacharya, Giuseppe F. Italiano, Partha Sarathi Mandal

    Abstract: We study the problem of black hole search by a set of mobile agents, where the underlying graph is a dynamic cactus. A black hole is a dangerous vertex in the graph that eliminates any visiting agent without leaving any trace behind. Key parameters that dictate the complexity of finding the black hole include: the number of agents required (termed as \textit{size}), the number of moves performed b…

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: This paper recently got accepted in WALCOM 2024