Skip to main content

Showing 1–50 of 73 results for author: Tathagata

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04166  [pdf, ps, other

    cs.LG stat.CO stat.ML

    N$^2$: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion

    Authors: Caleb Chin, Aashish Khubchandani, Harshvardhan Maskara, Kyuseong Choi, Jacob Feitelberg, Albert Gong, Manit Paul, Tathagata Sadhukhan, Anish Agarwal, Raaz Dwivedi

    Abstract: Nearest neighbor (NN) methods have re-emerged as competitive tools for matrix completion, offering strong empirical performance and recent theoretical guarantees, including entry-wise error bounds, confidence intervals, and minimax optimality. Despite their simplicity, recent work has shown that NN approaches are robust to a range of missingness patterns and effective across diverse applications.… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: 21 pages, 6 figures

  2. arXiv:2505.09612  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Adaptively-weighted Nearest Neighbors for Matrix Completion

    Authors: Tathagata Sadhukhan, Manit Paul, Raaz Dwivedi

    Abstract: In this technical note, we introduce and analyze AWNN: an adaptively weighted nearest neighbor method for performing matrix completion. Nearest neighbor (NN) methods are widely used in missing data problems across multiple disciplines such as in recommender systems and for performing counterfactual inference in panel data settings. Prior works have shown that in addition to being very intuitive an… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 25 pages, 6 figures

  3. arXiv:2503.21459  [pdf, other

    cs.CV

    RoadSocial: A Diverse VideoQA Dataset and Benchmark for Road Event Understanding from Social Video Narratives

    Authors: Chirag Parikh, Deepti Rawat, Rakshitha R. T., Tathagata Ghosh, Ravi Kiran Sarvadevabhatla

    Abstract: We introduce RoadSocial, a large-scale, diverse VideoQA dataset tailored for generic road event understanding from social media narratives. Unlike existing datasets limited by regional bias, viewpoint bias and expert-driven annotations, RoadSocial captures the global complexity of road events with varied geographies, camera viewpoints (CCTV, handheld, drones) and rich social discourse. Our scalabl… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

    Comments: Accepted at CVPR 2025; Project Page: https://roadsocial.github.io/

  4. arXiv:2503.10302  [pdf, other

    quant-ph cond-mat.dis-nn cs.ET

    Pushing the Boundary of Quantum Advantage in Hard Combinatorial Optimization with Probabilistic Computers

    Authors: Shuvro Chowdhury, Navid Anjum Aadit, Andrea Grimaldi, Eleonora Raimondo, Atharva Raut, P. Aaron Lott, Johan H. Mentink, Marek M. Rams, Federico Ricci-Tersenghi, Massimo Chiappini, Luke S. Theogarajan, Tathagata Srimani, Giovanni Finocchio, Masoud Mohseni, Kerem Y. Camsari

    Abstract: Recent demonstrations on specialized benchmarks have reignited excitement for quantum computers, yet whether they can deliver an advantage for practical real-world problems remains an open question. Here, we show that probabilistic computers (p-computers) when co-designed with hardware to implement powerful Monte Carlo algorithms can surpass state-of-the-art quantum annealers <a href="https://www.… ▽ More

    Submitted 7 April, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

  5. Scalable Connectivity for Ising Machines: Dense to Sparse

    Authors: M Mahmudul Hasan Sajeeb, Navid Anjum Aadit, Shuvro Chowdhury, Tong Wu, Cesely Smith, Dhruv Chinmay, Atharva Raut, Kerem Y. Camsari, Corentin Delacour, Tathagata Srimani

    Abstract: In recent years, hardware implementations of Ising machines have emerged as a viable alternative to quantum computing for solving hard optimization problems among other applications. Unlike quantum hardware, dense connectivity can be achieved in classical systems. However, we show that dense connectivity leads to severe frequency slowdowns and interconnect congestion scaling unfavorably with syste… ▽ More

    Submitted 2 June, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Journal ref: Physical Review Applied (2025)

  6. arXiv:2501.09825  [pdf, ps, other

    cs.CL cs.AI

    Bridging Language Barriers in Healthcare: A Study on Arabic LLMs

    Authors: Nada Saadi, Tathagata Raha, Clément Christophe, Marco AF Pimentel, Ronnie Rajan, Praveen K Kanithi

    Abstract: This paper investigates the challenges of developing large language models (LLMs) proficient in both multilingual understanding and medical knowledge. We demonstrate that simply translating medical data does not guarantee strong performance on clinical tasks in the target language. Our experiments reveal that the optimal language mix in training data varies significantly across different medical t… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

  7. arXiv:2412.06853  [pdf, other

    cs.LG cs.AI

    Tube Loss: A Novel Approach for Prediction Interval Estimation and probabilistic forecasting

    Authors: Pritam Anand, Tathagata Bandyopadhyay, Suresh Chandra

    Abstract: This paper proposes a novel loss function, called 'Tube Loss', for simultaneous estimation of bounds of a Prediction Interval (PI) in the regression setup. The PIs obtained by minimizing the empirical risk based on the Tube Loss are shown to be of better quality than the PIs obtained by the existing methods in the following sense. First, it yields intervals that attain the prespecified confidence… ▽ More

    Submitted 17 May, 2025; v1 submitted 8 December, 2024; originally announced December 2024.

  8. arXiv:2411.12965  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    On adaptivity and minimax optimality of two-sided nearest neighbors

    Authors: Tathagata Sadhukhan, Manit Paul, Raaz Dwivedi

    Abstract: Nearest neighbor (NN) algorithms have been extensively used for missing data problems in recommender systems and sequential decision-making systems. Prior theoretical analysis has established favorable guarantees for NN when the underlying data is sufficiently smooth and the missingness probabilities are lower bounded. Here we analyze NN with non-smooth non-linear functions with vast amounts of mi… ▽ More

    Submitted 19 November, 2024; originally announced November 2024.

    Comments: 29 pages, 7 figures

  9. arXiv:2410.16012  [pdf, other

    cs.CV cs.AI cs.LG stat.AP

    Massimo: Public Queue Monitoring and Management using Mass-Spring Model

    Authors: Abhijeet Kumar, Unnati Singh, Rajdeep Chatterjee, Tathagata Bandyopadhyay

    Abstract: An efficient system of a queue control and regulation in public spaces is very important in order to avoid the traffic jams and to improve the customer satisfaction. This article offers a detailed road map based on a merger of intelligent systems and creating an efficient systems of queues in public places. Through the utilization of different technologies i.e. computer vision, machine learning al… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: 8 pages, 6 figures, 3 algorithms, 3 tables

  10. arXiv:2410.14363  [pdf, other

    cs.GT stat.AP

    Skill vs. Chance Quantification for Popular Card & Board Games

    Authors: Tathagata Banerjee, Anushka De, Subhamoy Maitra, Diganta Mukherjee

    Abstract: This paper presents a data-driven statistical framework to quantify the role of skill in games, addressing the long-standing question of whether success in a game is predominantly driven by skill or chance. We analyze player level data from four popular games Chess, Rummy, Ludo, and Teen Patti, using empirical win statistics across varying levels of experience. By modeling win rate as a function o… ▽ More

    Submitted 27 May, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

    Comments: 25 pages, 9 figures

  11. arXiv:2410.05046  [pdf, other

    cs.CL cs.AI

    Named Clinical Entity Recognition Benchmark

    Authors: Wadood M Abdul, Marco AF Pimentel, Muhammad Umar Salman, Tathagata Raha, Clément Christophe, Praveen K Kanithi, Nasir Hayat, Ronnie Rajan, Shadab Khan

    Abstract: This technical report introduces a Named Clinical Entity Recognition Benchmark for evaluating language models in healthcare, addressing the crucial natural language processing (NLP) task of extracting structured information from clinical narratives to support applications like automated coding, clinical trial cohort identification, and clinical decision support. The leaderboard provides a standa… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

    Comments: Technical Report

  12. arXiv:2409.16608  [pdf, other

    cs.ET cs.AR

    Omni 3D: BEOL-Compatible 3D Logic with Omnipresent Power, Signal, and Clock

    Authors: Suhyeong Choi, Carlo Gilardi, Paul Gutwin, Robert M. Radway, Tathagata Srimani, Subhasish Mitra

    Abstract: This paper presents Omni 3D - a 3D-stacked device architecture that is naturally enabled by back-end-of-line (BEOL)-compatible transistors. Omni 3D arbitrarily interleaves metal layers for both signal/power with FETs in 3D (i.e., nFETs and pFETs are stacked in 3D). Thus, signal/power routing layers have fine-grained, all-sided access to the FET active regions maximizing 3D standard cell design fle… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 8 pages, 15 figures

  13. arXiv:2409.14988  [pdf, other

    cs.CL

    Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs

    Authors: Clément Christophe, Tathagata Raha, Svetlana Maslenkova, Muhammad Umar Salman, Praveen K Kanithi, Marco AF Pimentel, Shadab Khan

    Abstract: Large Language Models (LLMs) have demonstrated significant potential in transforming clinical applications. In this study, we investigate the efficacy of four techniques in adapting LLMs for clinical use-cases: continuous pretraining, instruct fine-tuning, NEFTune, and prompt engineering. We employ these methods on Mistral 7B and Mixtral 8x7B models, leveraging a large-scale clinical pretraining d… ▽ More

    Submitted 23 September, 2024; originally announced September 2024.

  14. arXiv:2409.11422  [pdf

    cs.DC cs.AR

    Next-generation Probabilistic Computing Hardware with 3D MOSAICs, Illusion Scale-up, and Co-design

    Authors: Tathagata Srimani, Robert Radway, Masoud Mohseni, Kerem Çamsarı, Subhasish Mitra

    Abstract: The vast majority of 21st century AI workloads are based on gradient-based deterministic algorithms such as backpropagation. One of the key reasons for the dominance of deterministic ML algorithms is the emergence of powerful hardware accelerators (GPU and TPU) that have enabled the wide-scale adoption and implementation of these algorithms. Meanwhile, discrete and probabilistic Monte Carlo algori… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: 2 pages, 1 figure

  15. arXiv:2409.11297  [pdf

    cs.ET physics.app-ph

    Overcoming Ambient Drift and Negative-Bias Temperature Instability in Foundry Carbon Nanotube Transistors

    Authors: Andrew Yu, Tathagata Srimani, Max Shulaker

    Abstract: Back-end-of-line (BEOL) logic integration is emerging as a complementary scaling path to supplement front-end-of-line (FEOL) Silicon. Among various options for BEOL logic, Carbon Nanotube Field-Effect Transistors (CNFETs) have been integrated within commercial silicon foundries, and complex CNFET circuits (e.g., RISC-V core, SRAM arrays) have been demonstrated. However, there lacks comprehensive s… ▽ More

    Submitted 14 February, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: 14 pages, 8 figures

  16. arXiv:2409.07314  [pdf, other

    cs.CL cs.AI

    MEDIC: Towards a Comprehensive Framework for Evaluating LLMs in Clinical Applications

    Authors: Praveen K Kanithi, Clément Christophe, Marco AF Pimentel, Tathagata Raha, Nada Saadi, Hamza Javed, Svetlana Maslenkova, Nasir Hayat, Ronnie Rajan, Shadab Khan

    Abstract: The rapid development of Large Language Models (LLMs) for healthcare applications has spurred calls for holistic evaluation beyond frequently-cited benchmarks like USMLE, to better reflect real-world performance. While real-world assessments are valuable indicators of utility, they often lag behind the pace of LLM evolution, likely rendering findings obsolete upon deployment. This temporal disconn… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

    Comments: Technical report

  17. arXiv:2409.01352  [pdf, other

    cs.SD cs.LG cs.MM eess.AS

    Spectron: Target Speaker Extraction using Conditional Transformer with Adversarial Refinement

    Authors: Tathagata Bandyopadhyay

    Abstract: Recently, attention-based transformers have become a de facto standard in many deep learning applications including natural language processing, computer vision, signal processing, etc.. In this paper, we propose a transformer-based end-to-end model to extract a target speaker's speech from a monaural multi-speaker mixed audio signal. Unlike existing speaker extraction methods, we introduce two ad… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

  18. arXiv:2409.00376  [pdf, other

    cs.GT

    Skill Dominance Analysis of Two(Four) player, Three(Five) dice Variant of the Ludo Game

    Authors: Tathagata Banerjee, Diganta Mukherjee

    Abstract: This paper examines two different variants of the Ludo game, involving multiple dice and a fixed number of total turns. Within each variant, multiple game lengths (total no. of turns) are considered. To compare the two variants, a set of intuitive, rule-based strategies is designed, representing different broad methods of strategic play. Game play is simulated between bots (automated software appl… ▽ More

    Submitted 11 November, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: 28 pages, 9 figures

  19. arXiv:2408.06142  [pdf, ps, other

    cs.CL cs.AI

    Med42-v2: A Suite of Clinical LLMs

    Authors: Clément Christophe, Praveen K Kanithi, Tathagata Raha, Shadab Khan, Marco AF Pimentel

    Abstract: Med42-v2 introduces a suite of clinical large language models (LLMs) designed to address the limitations of generic models in healthcare settings. These models are built on Llama3 architecture and fine-tuned using specialized clinical data. They underwent multi-stage preference alignment to effectively respond to natural prompts. While generic models are often preference-aligned to avoid answering… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  20. arXiv:2407.21072  [pdf, other

    cs.AI cs.CL

    Beyond Metrics: A Critical Analysis of the Variability in Large Language Model Evaluation Frameworks

    Authors: Marco AF Pimentel, Clément Christophe, Tathagata Raha, Prateek Munjal, Praveen K Kanithi, Shadab Khan

    Abstract: As large language models (LLMs) continue to evolve, the need for robust and standardized evaluation benchmarks becomes paramount. Evaluating the performance of these models is a complex challenge that requires careful consideration of various linguistic tasks, model architectures, and benchmarking methodologies. In recent years, various frameworks have emerged as noteworthy contributions to the fi… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 15 pages, 3 figures

  21. arXiv:2406.11109  [pdf, other

    cs.CL cs.AI cs.LG

    Investigating Annotator Bias in Large Language Models for Hate Speech Detection

    Authors: Amit Das, Zheng Zhang, Najib Hasan, Souvika Sarkar, Fatemeh Jamshidi, Tathagata Bhattacharya, Mostafa Rahgouy, Nilanjana Raychawdhary, Dongji Feng, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

    Abstract: Data annotation, the practice of assigning descriptive labels to raw data, is pivotal in optimizing the performance of machine learning models. However, it is a resource-intensive process susceptible to biases introduced by annotators. The emergence of sophisticated Large Language Models (LLMs) presents a unique opportunity to modernize and streamline this complex procedure. While existing researc… ▽ More

    Submitted 16 November, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: Accepted at NeurIPS Safe Generative AI Workshop, 2024

  22. arXiv:2406.10560  [pdf, other

    cs.CL

    Facts-and-Feelings: Capturing both Objectivity and Subjectivity in Table-to-Text Generation

    Authors: Tathagata Dey, Pushpak Bhattacharyya

    Abstract: Table-to-text generation, a long-standing challenge in natural language generation, has remained unexplored through the lens of subjectivity. Subjectivity here encompasses the comprehension of information derived from the table that cannot be described solely by objective data. Given the absence of pre-existing datasets, we introduce the Ta2TS dataset with 3849 data instances. We perform the task… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  23. arXiv:2405.16129  [pdf, other

    cs.CL

    iREL at SemEval-2024 Task 9: Improving Conventional Prompting Methods for Brain Teasers

    Authors: Harshit Gupta, Manav Chaudhary, Tathagata Raha, Shivansh Subramanian, Vasudeva Varma

    Abstract: This paper describes our approach for SemEval-2024 Task 9: BRAINTEASER: A Novel Task Defying Common Sense. The BRAINTEASER task comprises multiple-choice Question Answering designed to evaluate the models' lateral thinking capabilities. It consists of Sentence Puzzle and Word Puzzle subtasks that require models to defy default common-sense associations and exhibit unconventional thinking. We propo… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  24. arXiv:2404.14779  [pdf, other

    cs.CL

    Med42 -- Evaluating Fine-Tuning Strategies for Medical LLMs: Full-Parameter vs. Parameter-Efficient Approaches

    Authors: Clément Christophe, Praveen K Kanithi, Prateek Munjal, Tathagata Raha, Nasir Hayat, Ronnie Rajan, Ahmed Al-Mahrooqi, Avani Gupta, Muhammad Umar Salman, Gurpreet Gosal, Bhargav Kanakiya, Charles Chen, Natalia Vassilieva, Boulbaba Ben Amor, Marco AF Pimentel, Shadab Khan

    Abstract: This study presents a comprehensive analysis and comparison of two predominant fine-tuning methodologies - full-parameter fine-tuning and parameter-efficient tuning - within the context of medical Large Language Models (LLMs). We developed and refined a series of LLMs, based on the Llama-2 architecture, specifically designed to enhance medical knowledge retrieval, reasoning, and question-answering… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

    Comments: Published at AAAI 2024 Spring Symposium - Clinical Foundation Models

  25. OffensiveLang: A Community Based Implicit Offensive Language Dataset

    Authors: Amit Das, Mostafa Rahgouy, Dongji Feng, Zheng Zhang, Tathagata Bhattacharya, Nilanjana Raychawdhary, Fatemeh Jamshidi, Vinija Jain, Aman Chadha, Mary Sandage, Lauramarie Pope, Gerry Dozier, Cheryl Seals

    Abstract: The widespread presence of hateful languages on social media has resulted in adverse effects on societal well-being. As a result, addressing this issue with high priority has become very important. Hate speech or offensive languages exist in both explicit and implicit forms, with the latter being more challenging to detect. Current research in this domain encounters several challenges. Firstly, th… ▽ More

    Submitted 14 December, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Journal ref: in IEEE Access, vol. 12, pp. 185661-185672, 2024

  26. arXiv:2312.11828  [pdf, other

    cs.CL cs.MA

    TESS: A Multi-intent Parser for Conversational Multi-Agent Systems with Decentralized Natural Language Understanding Models

    Authors: Burak Aksar, Yara Rizk, Tathagata Chakraborti

    Abstract: Chatbots have become one of the main pathways for the delivery of business automation tools. Multi-agent systems offer a framework for designing chatbots at scale, making it easier to support complex conversations that span across multiple domains as well as enabling developers to maintain and expand their capabilities incrementally over time. However, multi-agent systems complicate the natural la… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

    Comments: 16 pages

  27. arXiv:2311.18355  [pdf, other

    cs.RO

    Enabling Robots to Identify Missing Steps in Robot Tasks for Guided Learning from Demonstration

    Authors: Maximilian Diehl, Tathagata Chakraborti, Karinne Ramirez-Amaro

    Abstract: Learning from Demonstration (LfD) systems are commonly used to teach robots new tasks by generating a set of skills from user-provided demonstrations. These skills can then be sequenced by planning algorithms to execute complex tasks. However, LfD systems typically require a full demonstration of the entire task, even when parts of it are already known to the robot. This limitation comes from the… ▽ More

    Submitted 11 December, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: To appear at IEEE/SICE International Symposium on System Integrations (SII 2025)

  28. arXiv:2311.13720  [pdf, other

    cs.AI

    Can LLMs Fix Issues with Reasoning Models? Towards More Likely Models for AI Planning

    Authors: Turgay Caglar, Sirine Belhaj, Tathagata Chakraborti, Michael Katz, Sarath Sreedharan

    Abstract: This is the first work to look at the application of large language models (LLMs) for the purpose of model space edits in automated planning tasks. To set the stage for this union, we explore two different flavors of model space problems that have been studied in the AI planning literature and explore the effect of an LLM on those tasks. We empirically demonstrate how the performance of an LLM con… ▽ More

    Submitted 4 March, 2024; v1 submitted 22 November, 2023; originally announced November 2023.

    Comments: 24 pages

  29. arXiv:2310.01995  [pdf, other

    cs.CV

    Development of Machine Vision Approach for Mechanical Component Identification based on its Dimension and Pitch

    Authors: Toshit Jain, Faisel Mushtaq, K Ramesh, Sandip Deshmukh, Tathagata Ray, Chandu Parimi, Praveen Tandon, Pramod Kumar Jha

    Abstract: In this work, a highly customizable and scalable vision based system for automation of mechanical assembly lines is described. The proposed system calculates the features that are required to classify and identify the different kinds of bolts that are used in the assembly line. The system describes a novel method of calculating the pitch of the bolt in addition to bolt identification and calculati… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages

    ACM Class: I.4.7

  30. arXiv:2306.10051  [pdf, other

    cs.DL cs.HC cs.IR

    TOBY: A Tool for Exploring Data in Academic Survey Papers

    Authors: Tathagata Chakraborti, Jungkoo Kang, Christian Muise, Sarath Sreedharan, Michael Walker, Daniel Szafir, Tom Williams

    Abstract: This paper describes TOBY, a visualization tool that helps a user explore the contents of an academic survey paper. The visualization consists of four components: a hierarchical view of taxonomic data in the survey, a document similarity view in the space of taxonomic classes, a network view of citations, and a new paper recommendation tool. In this paper, we will discuss these features in the con… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

  31. arXiv:2306.08872  [pdf, other

    cs.CL cs.AI

    Neural models for Factual Inconsistency Classification with Explanations

    Authors: Tathagata Raha, Mukund Choudhary, Abhinav Menon, Harshit Gupta, KV Aditya Srivatsa, Manish Gupta, Vasudeva Varma

    Abstract: Factual consistency is one of the most important requirements when editing high quality documents. It is extremely important for automatic text generation systems like summarization, question answering, dialog modeling, and language modeling. Still, automated factual inconsistency detection is rather under-studied. Existing work has focused on (a) finding fake news keeping a knowledge base in cont… ▽ More

    Submitted 15 June, 2023; originally announced June 2023.

    Comments: ECML-PKDD 2023

  32. arXiv:2206.06530  [pdf, other

    cs.AI

    MACQ: A Holistic View of Model Acquisition Techniques

    Authors: Ethan Callanan, Rebecca De Venezia, Victoria Armstrong, Alison Paredes, Tathagata Chakraborti, Christian Muise

    Abstract: For over three decades, the planning community has explored countless methods for data-driven model acquisition. These range in sophistication (e.g., simple set operations to full-blown reformulations), methodology (e.g., logic-based vs. planing-based), and assumptions (e.g., fully vs. partially observable). With no fewer than 43 publications in the space, it can be overwhelming to understand what… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 8 pages, 7 figures, KEPS Workshop Submission

    MSC Class: 68T05 ACM Class: I.2.6

  33. arXiv:2205.02209  [pdf, other

    cs.LG cs.AI

    Semi-Supervised Cascaded Clustering for Classification of Noisy Label Data

    Authors: Ashit Gupta, Anirudh Deodhar, Tathagata Mukherjee, Venkataramana Runkana

    Abstract: The performance of supervised classification techniques often deteriorates when the data has noisy labels. Even the semi-supervised classification approaches have largely focused only on the problem of handling missing labels. Most of the approaches addressing the noisy label data rely on deep neural networks (DNN) that require huge datasets for classification tasks. This poses a serious challenge… ▽ More

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: 11 pages

    ACM Class: I.2.1

  34. arXiv:2202.11249  [pdf, other

    cs.RO cs.AI cs.GR cs.HC

    Virtual, Augmented, and Mixed Reality for Human-Robot Interaction: A Survey and Virtual Design Element Taxonomy

    Authors: Michael Walker, Thao Phung, Tathagata Chakraborti, Tom Williams, Daniel Szafir

    Abstract: Virtual, Augmented, and Mixed Reality for Human-Robot Interaction (VAM-HRI) has been gaining considerable attention in research in recent years. However, the HRI community lacks a set of shared terminology and framework for characterizing aspects of mixed reality interfaces, presenting serious problems for future research. Therefore, it is important to have a common set of terms and concepts that… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

    Comments: Explore contents at ibm.biz/vam-hri

  35. arXiv:2110.02311  [pdf, other

    cs.CL

    COVID-19 India Dataset: Parsing COVID-19 Data in Daily Health Bulletins from States in India

    Authors: Mayank Agarwal, Tathagata Chakraborti, Sachin Grover, Arunima Chaudhary

    Abstract: While India has been one of the hotspots of COVID-19, data about the pandemic from the country has proved to be largely inaccessible at scale. Much of the data exists in unstructured form on the web, and limited aspects of such data are available through public APIs maintained manually through volunteer effort. This has proved to be difficult both in terms of ease of access to detailed data and wi… ▽ More

    Submitted 6 December, 2021; v1 submitted 27 September, 2021; originally announced October 2021.

    Comments: URL: ibm.biz/covid-data-india. Accepted at the Machine Learning in Public Health workshop at NeurIPS 2021

  36. arXiv:2109.03029  [pdf, other

    cs.LG

    Predicting Mood Disorder Symptoms with Remotely Collected Videos Using an Interpretable Multimodal Dynamic Attention Fusion Network

    Authors: Tathagata Banerjee, Matthew Kollada, Pablo Gersberg, Oscar Rodriguez, Jane Tiller, Andrew E Jaffe, John Reynders

    Abstract: We developed a novel, interpretable multimodal classification method to identify symptoms of mood disorders viz. depression, anxiety and anhedonia using audio, video and text collected from a smartphone application. We used CNN-based unimodal encoders to learn dynamic embeddings for each modality and then combined these through a transformer encoder. We applied these methods to a novel dataset - c… ▽ More

    Submitted 7 September, 2021; originally announced September 2021.

    Comments: 8 pages, 3 figures, Published in the Computational Approaches to Mental Health Workshop of the International Conference on Machine Learning 2021, https://sites.google.com/view/ca2mh/accepted-papers

  37. arXiv:2103.02523  [pdf, other

    cs.CL

    NeurIPS 2020 NLC2CMD Competition: Translating Natural Language to Bash Commands

    Authors: Mayank Agarwal, Tathagata Chakraborti, Quchen Fu, David Gros, Xi Victoria Lin, Jaron Maene, Kartik Talamadupula, Zhongwei Teng, Jules White

    Abstract: The NLC2CMD Competition hosted at NeurIPS 2020 aimed to bring the power of natural language processing to the command line. Participants were tasked with building models that can transform descriptions of command line tasks in English to their Bash syntax. This is a report on the competition with details of the task, metrics, data, attempted solutions, and lessons learned.

    Submitted 8 August, 2021; v1 submitted 3 March, 2021; originally announced March 2021.

    Comments: Appears in PMLR Volume 133: NeurIPS 2020 Competition and Demonstration Track. Competition URL: http://ibm.biz/nlc2cmd

  38. arXiv:2101.11954  [pdf, ps, other

    cs.CL cs.AI cs.IR

    Identifying COVID-19 Fake News in Social Media

    Authors: Tathagata Raha, Vijayasaradhi Indurthi, Aayush Upadhyaya, Jeevesh Kataria, Pramud Bommakanti, Vikram Keswani, Vasudeva Varma

    Abstract: The evolution of social media platforms have empowered everyone to access information easily. Social media users can easily share information with the rest of the world. This may sometimes encourage spread of fake news, which can result in undesirable consequences. In this work, we train models which can identify health news related to COVID-19 pandemic as real or fake. Our models achieve a high F… ▽ More

    Submitted 1 February, 2021; v1 submitted 28 January, 2021; originally announced January 2021.

    Comments: CONSTRAINT@AAAI

  39. arXiv:2101.03382  [pdf, other

    cs.CL cs.IR cs.LG

    Task Adaptive Pretraining of Transformers for Hostility Detection

    Authors: Tathagata Raha, Sayar Ghosh Roy, Ujwal Narayan, Zubair Abid, Vasudeva Varma

    Abstract: Identifying adverse and hostile content on the web and more particularly, on social media, has become a problem of paramount interest in recent years. With their ever increasing popularity, fine-tuning of pretrained Transformer-based encoder models with a classifier head are gradually becoming the new baseline for natural language classification tasks. In our work, we explore the gains attributed… ▽ More

    Submitted 9 January, 2021; originally announced January 2021.

    Comments: To be published in: Proceedings of the First Workshop on Combating Online Hostile Posts in Regional Languages during Emergency Situation (CONSTRAINT) at AAAI 2021

  40. arXiv:2101.03207  [pdf, other

    cs.CL cs.AI cs.CY cs.IR cs.LG

    Leveraging Multilingual Transformers for Hate Speech Detection

    Authors: Sayar Ghosh Roy, Ujwal Narayan, Tathagata Raha, Zubair Abid, Vasudeva Varma

    Abstract: Detecting and classifying instances of hate in social media text has been a problem of interest in Natural Language Processing in the recent years. Our work leverages state of the art Transformer language models to identify hate speech in a multilingual setting. Capturing the intent of a post or a comment on social media involves careful evaluation of the language style, semantic content and addit… ▽ More

    Submitted 8 January, 2021; originally announced January 2021.

    Comments: To be published in: FIRE (Working Notes) 2020, Hate Speech and Offensive Content Identification in Indo-European Languages, HASOC 2020

  41. arXiv:2011.10920  [pdf, other

    cs.AI

    A Bayesian Account of Measures of Interpretability in Human-AI Interaction

    Authors: Sarath Sreedharan, Anagha Kulkarni, Tathagata Chakraborti, David E. Smith, Subbarao Kambhampati

    Abstract: Existing approaches for the design of interpretable agent behavior consider different measures of interpretability in isolation. In this paper we posit that, in the design and deployment of human-aware agents in the real world, notions of interpretability are just some among many considerations; and the techniques developed in isolation lack two key properties to be useful when considered together… ▽ More

    Submitted 21 November, 2020; originally announced November 2020.

  42. arXiv:2011.10707  [pdf, other

    cs.AI

    Explainable Composition of Aggregated Assistants

    Authors: Sarath Sreedharan, Tathagata Chakraborti, Yara Rizk, Yasaman Khazaeni

    Abstract: A new design of an AI assistant that has become increasingly popular is that of an "aggregated assistant" -- realized as an orchestrated composition of several individual skills or agents that can each perform atomic tasks. In this paper, we will talk about the role of planning in the automated composition of such assistants and explore how concepts in automated planning can help to establish tran… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  43. arXiv:2007.14576  [pdf, other

    cs.CL

    Development of POS tagger for English-Bengali Code-Mixed data

    Authors: Tathagata Raha, Sainik Kumar Mahata, Dipankar Das, Sivaji Bandyopadhyay

    Abstract: Code-mixed texts are widespread nowadays due to the advent of social media. Since these texts combine two languages to formulate a sentence, it gives rise to various research problems related to Natural Language Processing. In this paper, we try to excavate one such problem, namely, Parts of Speech tagging of code-mixed texts. We have built a system that can POS tag English-Bengali code-mixed data… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Accepted and published in The sixteenth International Conference on Natural Language Processing (ICON-2019)

  44. arXiv:2007.13257  [pdf, other

    cs.AI

    From Robotic Process Automation to Intelligent Process Automation: Emerging Trends

    Authors: Tathagata Chakraborti, Vatche Isahagian, Rania Khalaf, Yasaman Khazaeni, Vinod Muthusamy, Yara Rizk, Merve Unuvar

    Abstract: In this survey, we study how recent advances in machine intelligence are disrupting the world of business processes. Over the last decade, there has been steady progress towards the automation of business processes under the umbrella of ``robotic process automation'' (RPA). However, we are currently at an inflection point in this evolution, as a new paradigm called ``Intelligent Process Automation… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

    Comments: Internation Conference on Business Process Management 2020 RPA Forum

  45. arXiv:2007.00820  [pdf, other

    cs.AI

    Designing Environments Conducive to Interpretable Robot Behavior

    Authors: Anagha Kulkarni, Sarath Sreedharan, Sarah Keren, Tathagata Chakraborti, David Smith, Subbarao Kambhampati

    Abstract: Designing robots capable of generating interpretable behavior is a prerequisite for achieving effective human-robot collaboration. This means that the robots need to be capable of generating behavior that aligns with human expectations and, when required, provide explanations to the humans in the loop. However, exhibiting such behavior in arbitrary environments could be quite expensive for robots,… ▽ More

    Submitted 2 August, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  46. arXiv:2002.11697  [pdf, other

    cs.AI cs.HC

    The Emerging Landscape of Explainable AI Planning and Decision Making

    Authors: Tathagata Chakraborti, Sarath Sreedharan, Subbarao Kambhampati

    Abstract: In this paper, we provide a comprehensive outline of the different threads of work in Explainable AI Planning (XAIP) that has emerged as a focus area in the last couple of years and contrast that with earlier efforts in the field in terms of techniques, target users, and delivery mechanisms. We hope that the survey will provide guidance to new researchers in automated planning towards the role of… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

  47. arXiv:2002.00762  [pdf, other

    cs.HC cs.AI

    Project CLAI: Instrumenting the Command Line as a New Environment for AI Agents

    Authors: Mayank Agarwal, Jorge J. Barroso, Tathagata Chakraborti, Eli M. Dow, Kshitij Fadnis, Borja Godoy, Madhavan Pallan, Kartik Talamadupula

    Abstract: This whitepaper reports on Project CLAI (Command Line AI), which aims to bring the power of AI to the command line interface (CLI). The CLAI platform sets up the CLI as a new environment for AI researchers to conquer by surfacing the command line as a generic environment that researchers can interface to using a simple sense-act API, much like the traditional AI agent architecture. In this paper,… ▽ More

    Submitted 17 June, 2020; v1 submitted 31 January, 2020; originally announced February 2020.

    Comments: http://ibm.biz/clai-home

  48. arXiv:2001.03543  [pdf, other

    cs.AI

    A Unified Conversational Assistant Framework for Business Process Automation

    Authors: Yara Rizk, Abhishek Bhandwalder, Scott Boag, Tathagata Chakraborti, Vatche Isahagian, Yasaman Khazaeni, Falk Pollock, Merve Unuvar

    Abstract: Business process automation is a booming multi-billion-dollar industry that promises to remove menial tasks from workers' plates -- through the introduction of autonomous agents -- and free up their time and brain power for more creative and engaging tasks. However, an essential component to the successful deployment of such autonomous agents is the ability of business users to monitor their perfo… ▽ More

    Submitted 7 January, 2020; originally announced January 2020.

  49. arXiv:2001.02619  [pdf, other

    cs.AI

    D3BA: A Tool for Optimizing Business Processes Using Non-Deterministic Planning

    Authors: Tathagata Chakraborti, Yasaman Khazaeni

    Abstract: This paper builds upon recent work in the declarative design of dialogue agents and proposes an exciting new tool -- D3BA -- Declarative Design for Digital Business Automation, built to optimize business processes using the power of AI planning. The tool provides a powerful framework to build, optimize, and maintain complex business processes and optimize them by composing with services that autom… ▽ More

    Submitted 4 February, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: Appears in the Proceedings of the AAAI 2020 Workshop on Intelligent Process Automation

  50. arXiv:1912.10127  [pdf, other

    eess.IV cs.LG q-bio.QM stat.ML

    A Generalizable Method for Automated Quality Control of Functional Neuroimaging Datasets

    Authors: Matthew Kollada, Qingzhu Gao, Monika S Mellem, Tathagata Banerjee, William J Martin

    Abstract: Over the last twenty five years, advances in the collection and analysis of fMRI data have enabled new insights into the brain basis of human health and disease. Individual behavioral variation can now be visualized at a neural level as patterns of connectivity among brain regions. Functional brain imaging is enhancing our understanding of clinical psychiatric disorders by revealing ties between r… ▽ More

    Submitted 20 December, 2019; originally announced December 2019.