Skip to main content

Showing 1–50 of 67 results for author: Paul, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.09244  [pdf, other

    cs.CV q-bio.QM stat.AP

    How To Make Your Cell Tracker Say "I dunno!"

    Authors: Richard D. Paul, Johannes Seiffarth, David Rügamer, Hanno Scharr, Katharina Nöh

    Abstract: Cell tracking is a key computational task in live-cell microscopy, but fully automated analysis of high-throughput imaging requires reliable and, thus, uncertainty-aware data analysis tools, as the amount of data recorded within a single experiment exceeds what humans are able to overlook. We here propose and benchmark various methods to reason about and quantify uncertainty in linear assignment-b… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  2. arXiv:2503.08333  [pdf, other

    cs.CV

    1LoRA: Summation Compression for Very Low-Rank Adaptation

    Authors: Alessio Quercia, Zhuo Cao, Arya Bangun, Richard D. Paul, Abigail Morrison, Ira Assent, Hanno Scharr

    Abstract: Parameter-Efficient Fine-Tuning (PEFT) methods have transformed the approach to fine-tuning large models for downstream tasks by enabling the adjustment of significantly fewer parameters than those in the original model matrices. In this work, we study the "very low rank regime", where we fine-tune the lowest amount of parameters per linear layer for each considered PEFT method. We propose 1LoRA (… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  3. arXiv:2502.08438  [pdf, other

    cs.CV cs.AI cs.CL cs.IR cs.MM

    Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions

    Authors: Prajwal Gatti, Kshitij Parikh, Dhriti Prasanna Paul, Manish Gupta, Anand Mishra

    Abstract: Non-native speakers with limited vocabulary often struggle to name specific objects despite being able to visualize them, e.g., people outside Australia searching for numbats. Further, users may want to search for such elusive objects with difficult-to-sketch interactions, e.g., numbat digging in the ground. In such common but complex situations, users desire a search interface that accepts compos… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: Accepted at AAAI 2024, 9 pages. Project Website: https://vl2g.github.io/projects/cstbir

  4. arXiv:2501.12521  [pdf, other

    cs.SE cs.AI

    An Empirically-grounded tool for Automatic Prompt Linting and Repair: A Case Study on Bias, Vulnerability, and Optimization in Developer Prompts

    Authors: Dhia Elhaq Rzig, Dhruba Jyoti Paul, Kaiser Pister, Jordan Henkel, Foyzul Hassan

    Abstract: The tidal wave of advancements in Large Language Models (LLMs) has led to their swift integration into application-level logic. Many software systems now use prompts to interact with these black-box models, combining natural language with dynamic values interpolated at runtime, to perform tasks ranging from sentiment analysis to question answering. Due to the programmatic and structured natural la… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

  5. arXiv:2501.12501  [pdf, other

    eess.AS cs.SD

    A Domain Adaptation Framework for Speech Recognition Systems with Only Synthetic data

    Authors: Minh Tran, Yutong Pang, Debjyoti Paul, Laxmi Pandey, Kevin Jiang, Jinxi Guo, Ke Li, Shun Zhang, Xuedong Zhang, Xin Lei

    Abstract: We introduce DAS (Domain Adaptation with Synthetic data), a novel domain adaptation framework for pre-trained ASR model, designed to efficiently adapt to various language-defined domains without requiring any real data. In particular, DAS first prompts large language models (LLMs) to generate domain-specific texts before converting these texts to speech via text-to-speech technology. The synthetic… ▽ More

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: ICASSP 2025

  6. arXiv:2501.09333  [pdf, other

    cs.CV cs.AI

    Prompt-CAM: Making Vision Transformers Interpretable for Fine-Grained Analysis

    Authors: Arpita Chowdhury, Dipanjyoti Paul, Zheda Mai, Jianyang Gu, Ziheng Zhang, Kazi Sajeed Mehrab, Elizabeth G. Campolongo, Daniel Rubenstein, Charles V. Stewart, Anuj Karpatne, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

    Abstract: We present a simple approach to make pre-trained Vision Transformers (ViTs) interpretable for fine-grained analysis, aiming to identify and localize the traits that distinguish visually similar categories, such as bird species. Pre-trained ViTs, such as DINO, have demonstrated remarkable capabilities in extracting localized, discriminative features. However, saliency maps like Grad-CAM often fail… ▽ More

    Submitted 7 April, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

    Comments: Accepted by CVPR 2025 Main Conference

  7. arXiv:2412.06927  [pdf, ps, other

    cs.CR cs.CV

    Gradient-based facial encoding for key generation to encrypt and decrypt multimedia data

    Authors: Ankit Kumar Patel, Dewanshi Paul, Sarthak Giri, Sneha Chaudhary, Bikalpa Gautam

    Abstract: Security systems relying on passwords are vulnerable to being forgotten, guessed, or breached. Likewise, biometric systems that operate independently are at risk of template spoofing and replay incidents. This paper introduces a biocryptosystem utilizing face recognition techniques to address these issues, allowing for the encryption and decryption of various file types through the Advanced Encryp… ▽ More

    Submitted 9 January, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: 12 pages, 2 figures, This work has been submitted to the IEEE for possible publication

  8. arXiv:2412.01558  [pdf, other

    cs.CV cs.AI

    VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval

    Authors: Dhiman Paul, Md Rizwan Parvez, Nabeel Mohammed, Shafin Rahman

    Abstract: Video Highlight Detection and Moment Retrieval (HD/MR) are essential in video analysis. Recent joint prediction transformer models often overlook their cross-task dynamics and video-text alignment and refinement. Moreover, most models typically use limited, uni-directional attention mechanisms, resulting in weakly integrated representations and suboptimal performance in capturing the interdependen… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    ACM Class: I.2.10; I.2.7

  9. arXiv:2411.19799  [pdf, other

    cs.CL

    INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

    Authors: Angelika Romanou, Negar Foroutan, Anna Sotnikova, Zeming Chen, Sree Harsha Nelaturu, Shivalika Singh, Rishabh Maheshwary, Micol Altomare, Mohamed A. Haggag, Snegha A, Alfonso Amayuelas, Azril Hafizi Amirudin, Viraat Aryabumi, Danylo Boiko, Michael Chang, Jenny Chim, Gal Cohen, Aditya Kumar Dalmia, Abraham Diress, Sharad Duwal, Daniil Dzenhaliou, Daniel Fernando Erazo Florez, Fabian Farestam, Joseph Marvin Imperial, Shayekh Bin Islam , et al. (34 additional authors not shown)

    Abstract: The performance differential of large language models (LLM) between languages hinders their effective deployment in many regions, inhibiting the potential economic and societal value of generative AI tools in many communities. However, the development of functional LLMs in many languages (\ie, multilingual LLMs) is bottlenecked by the lack of high-quality evaluation resources in languages other th… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

  10. arXiv:2411.00552  [pdf, other

    cs.CV

    Tracking one-in-a-million: Large-scale benchmark for microbial single-cell tracking with experiment-aware robustness metrics

    Authors: J. Seiffarth, L. Blöbaum, R. D. Paul, N. Friederich, A. J. Yamachui Sitcheu, R. Mikut, H. Scharr, A. Grünberger, K. Nöh

    Abstract: Tracking the development of living cells in live-cell time-lapses reveals crucial insights into single-cell behavior and presents tremendous potential for biomedical and biotechnological applications. In microbial live-cell imaging (MLCI), a few to thousands of cells have to be detected and tracked within dozens of growing cell colonies. The challenge of tracking cells is heavily influenced by the… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: 17 pages, 4 figures, 3 tables, BioImage Computing @ ECCV 2024

  11. arXiv:2410.17218  [pdf, other

    cs.AI cs.CL

    Creativity in AI: Progresses and Challenges

    Authors: Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Lonneke van der Plas

    Abstract: Creativity is the ability to produce novel, useful, and surprising ideas, and has been widely studied as a crucial aspect of human cognition. Machine creativity on the other hand has been a long-standing challenge. With the rise of advanced generative AI, there has been renewed interest and debate regarding AI's creative capabilities. Therefore, it is imperative to revisit the state of creativity… ▽ More

    Submitted 9 December, 2024; v1 submitted 22 October, 2024; originally announced October 2024.

    Comments: minor updates to content + figures

  12. arXiv:2410.04254  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.SI

    Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia

    Authors: Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West

    Abstract: Links are a fundamental part of information networks, turning isolated pieces of knowledge into a network of information that is much richer than the sum of its parts. However, adding a new link to the network is not trivial: it requires not only the identification of a suitable pair of source and target entities but also the understanding of the content of the source to locate a suitable position… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

    Comments: EMNLP 2024; 24 pages; 62 figures

  13. arXiv:2409.17085  [pdf, other

    cs.CV stat.ML

    Parameter-efficient Bayesian Neural Networks for Uncertainty-aware Depth Estimation

    Authors: Richard D. Paul, Alessio Quercia, Vincent Fortuin, Katharina Nöh, Hanno Scharr

    Abstract: State-of-the-art computer vision tasks, like monocular depth estimation (MDE), rely heavily on large, modern Transformer-based architectures. However, their application in safety-critical domains demands reliable predictive performance and uncertainty quantification. While Bayesian neural networks provide a conceptually simple approach to serve those requirements, they suffer from the high dimensi… ▽ More

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: Presented at UnCV Workshop at ECCV'24

  14. arXiv:2408.11845  [pdf, other

    cs.CL

    LLaMA based Punctuation Restoration With Forward Pass Only Decoding

    Authors: Yutong Pang, Debjyoti Paul, Kevin Jiang, Xuedong Zhang, Xin Lei

    Abstract: This paper introduces two advancements in the field of Large Language Model Annotation with a focus on punctuation restoration tasks. Our first contribution is the application of LLaMA for punctuation restoration, which demonstrates superior performance compared to the established benchmark. Despite its impressive quality, LLaMA faces challenges regarding inference speed and hallucinations. To a… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  15. arXiv:2408.11841  [pdf, other

    cs.CY cs.AI cs.CL

    Could ChatGPT get an Engineering Degree? Evaluating Higher Education Vulnerability to AI Assistants

    Authors: Beatriz Borges, Negar Foroutan, Deniz Bayazit, Anna Sotnikova, Syrielle Montariol, Tanya Nazaretzky, Mohammadreza Banaei, Alireza Sakhaeirad, Philippe Servant, Seyed Parsa Neshaei, Jibril Frej, Angelika Romanou, Gail Weiss, Sepideh Mamooler, Zeming Chen, Simin Fan, Silin Gao, Mete Ismayilzada, Debjit Paul, Alexandre Schöpfer, Andrej Janchevski, Anja Tiede, Clarence Linden, Emanuele Troiani, Francesco Salvi , et al. (65 additional authors not shown)

    Abstract: AI assistants are being increasingly used by students enrolled in higher education institutions. While these tools provide opportunities for improved teaching and education, they also pose significant challenges for assessment and learning outcomes. We conceptualize these challenges through the lens of vulnerability, the potential for university assessments and learning outcomes to be impacted by… ▽ More

    Submitted 27 November, 2024; v1 submitted 7 August, 2024; originally announced August 2024.

    Comments: 20 pages, 8 figures

    Journal ref: PNAS (2024) Vol. 121 | No. 49

  16. arXiv:2408.03618  [pdf, other

    cs.CL cs.AI cs.LG

    A Logical Fallacy-Informed Framework for Argument Generation

    Authors: Luca Mouchel, Debjit Paul, Shaobo Cui, Robert West, Antoine Bosselut, Boi Faltings

    Abstract: Despite the remarkable performance of Large Language Models (LLMs) in natural language processing tasks, they still struggle with generating logically sound arguments, resulting in potential risks such as spreading misinformation. To address this issue, we introduce FIPO, a fallacy-informed framework that leverages preference optimization methods to steer LLMs toward logically sound arguments. FIP… ▽ More

    Submitted 3 May, 2025; v1 submitted 7 August, 2024; originally announced August 2024.

  17. arXiv:2407.16664  [pdf, other

    cs.CL eess.AS

    Towards scalable efficient on-device ASR with transfer learning

    Authors: Laxmi Pandey, Ke Li, Jinxi Guo, Debjyoti Paul, Arthur Guo, Jay Mahadeokar, Xuedong Zhang

    Abstract: Multilingual pretraining for transfer learning significantly boosts the robustness of low-resource monolingual ASR models. This study systematically investigates three main aspects: (a) the impact of transfer learning on model performance during initial training or fine-tuning, (b) the influence of transfer learning across dataset domains and languages, and (c) the effect on rare-word recognition… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  18. Future of Home-living: Designing Smart Spaces for Modern Domestic Life

    Authors: Fatemeh Alizadeh, Dave Randall, Peter Tolmie, Minha Lee, Yuhui Xu, Sarah Mennicken, Mikołaj P. Woźniak, Dennis Paul, Dominik Pins

    Abstract: The evolution of smart home technologies, particularly agentic ones such as conversational agents, robots, and virtual avatars, is reshaping our understanding of home and domestic life. This shift highlights the complexities of modern domestic life, with the household landscape now featuring diverse cohabiting units like co-housing and communal living arrangements. These agentic technologies prese… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 13 pages, no figures, ECSCW 2024 Workshop Proposal

    Journal ref: ECSCW 2024: the 22nd European Conference on Computer-Supported Cooperative Work

  19. arXiv:2406.12655  [pdf, ps, other

    cs.AI cs.SE

    Benchmarks and Metrics for Evaluations of Code Generation: A Critical Review

    Authors: Debalina Ghosh Paul, Hong Zhu, Ian Bayley

    Abstract: With the rapid development of Large Language Models (LLMs), a large number of machine learning models have been developed to assist programming tasks including the generation of program code from natural language input. However, how to evaluate such LLMs for this task is still an open problem despite of the great amount of research efforts that have been made and reported to evaluate and compare t… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by the First IEEE International Workshop on Testing and Evaluation of Large Language Models (TELLMe 2024) and will be published in the proceedings of the IEEE AITest 2024 conference

  20. arXiv:2406.12635  [pdf, other

    cs.SE cs.AI

    ScenEval: A Benchmark for Scenario-Based Evaluation of Code Generation

    Authors: Debalina Ghosh Paul, Hong Zhu, Ian Bayley

    Abstract: In the scenario-based evaluation of machine learning models, a key problem is how to construct test datasets that represent various scenarios. The methodology proposed in this paper is to construct a benchmark and attach metadata to each test case. Then a test system can be constructed with test morphisms that filter the test cases based on metadata to form a dataset. The paper demonstrates this… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted for publication in the conference proceedings of IEEE AITest 2024

  21. arXiv:2403.19720  [pdf, other

    math.ST cs.LG stat.ML

    Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation

    Authors: Yanhao Jin, Krishnakumar Balasubramanian, Debashis Paul

    Abstract: Meta-learning involves training models on a variety of training tasks in a way that enables them to generalize well on new, unseen test tasks. In this work, we consider meta-learning within the framework of high-dimensional multivariate random-effects linear models and study generalized ridge-regression based predictions. The statistical intuition of using generalized ridge regression in this sett… ▽ More

    Submitted 27 March, 2024; originally announced March 2024.

  22. PromptSet: A Programmer's Prompting Dataset

    Authors: Kaiser Pister, Dhruba Jyoti Paul, Patrick Brophy, Ishan Joshi

    Abstract: The rise of capabilities expressed by large language models has been quickly followed by the integration of the same complex systems into application level logic. Algorithms, programs, systems, and companies are built around structured prompting to black box models where the majority of the design and implementation lies in capturing and quantifying the `agent mode'. The standard way to shape a cl… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: 8 pages, ICSE '24 LLM4Code Workshop

  23. arXiv:2402.13950  [pdf, other

    cs.CL

    Making Reasoning Matter: Measuring and Improving Faithfulness of Chain-of-Thought Reasoning

    Authors: Debjit Paul, Robert West, Antoine Bosselut, Boi Faltings

    Abstract: Large language models (LLMs) have been shown to perform better when asked to reason step-by-step before answering a question. However, it is unclear to what degree the model's final answer is faithful to the stated reasoning steps. In this paper, we perform a causal mediation analysis on twelve LLMs to examine how intermediate reasoning steps generated by the LLM influence the final outcome and fi… ▽ More

    Submitted 6 October, 2024; v1 submitted 21 February, 2024; originally announced February 2024.

    Comments: Accepted at EMNLP Findings

  24. arXiv:2401.14135  [pdf, other

    cs.CL cs.CY cs.LG

    Convolutional Neural Networks can achieve binary bail judgement classification

    Authors: Amit Barman, Devangan Roy, Debapriya Paul, Indranil Dutta, Shouvik Kumar Guha, Samir Karmakar, Sudip Kumar Naskar

    Abstract: There is an evident lack of implementation of Machine Learning (ML) in the legal domain in India, and any research that does take place in this domain is usually based on data from the higher courts of law and works with English data. The lower courts and data from the different regional languages of India are often overlooked. In this paper, we deploy a Convolutional Neural Network (CNN) architec… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted on 20th International Conference on Natural Language Processing (ICON)

  25. arXiv:2401.03183  [pdf, other

    cs.CL

    Exploring Defeasibility in Causal Reasoning

    Authors: Shaobo Cui, Lazar Milikic, Yiyang Feng, Mete Ismayilzada, Debjit Paul, Antoine Bosselut, Boi Faltings

    Abstract: Defeasibility in causal reasoning implies that the causal relationship between cause and effect can be strengthened or weakened. Namely, the causal strength between cause and effect should increase or decrease with the incorporation of strengthening arguments (supporters) or weakening arguments (defeaters), respectively. However, existing works ignore defeasibility in causal reasoning and fail to… ▽ More

    Submitted 27 June, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: Accepted by ACL 2024 (Findings)

  26. arXiv:2311.15384  [pdf, other

    stat.ML cs.LG stat.ME

    Dirichlet Process-based Robust Clustering using the Median-of-Means Estimator

    Authors: Supratik Basu, Jyotishka Ray Choudhury, Debolina Paul, Swagatam Das

    Abstract: Clustering stands as one of the most prominent challenges in unsupervised machine learning. Among centroid-based methods, the classic $k$-means algorithm, based on Lloyd's heuristic, is widely used. Nonetheless, it is a well-known fact that $k$-means and its variants face several challenges, including heavy reliance on initial cluster centroids, susceptibility to converging into local minima of th… ▽ More

    Submitted 29 January, 2025; v1 submitted 26 November, 2023; originally announced November 2023.

  27. arXiv:2311.04284  [pdf, other

    cs.CL cs.AI

    CRAB: Assessing the Strength of Causal Relationships Between Real-world Events

    Authors: Angelika Romanou, Syrielle Montariol, Debjit Paul, Leo Laugier, Karl Aberer, Antoine Bosselut

    Abstract: Understanding narratives requires reasoning about the cause-and-effect relationships between events mentioned in the text. While existing foundation models yield impressive results in many NLP tasks requiring reasoning, it is unclear whether they understand the complexity of the underlying network of causal relationships of events in narratives. In this work, we present CRAB, a new Causal Reasonin… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  28. arXiv:2311.04157  [pdf, other

    cs.CV cs.AI

    A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis

    Authors: Dipanjyoti Paul, Arpita Chowdhury, Xinqi Xiong, Feng-Ju Chang, David Carlyn, Samuel Stevens, Kaiya L. Provost, Anuj Karpatne, Bryan Carstens, Daniel Rubenstein, Charles Stewart, Tanya Berger-Wolf, Yu Su, Wei-Lun Chao

    Abstract: We present a novel usage of Transformers to make image classification interpretable. Unlike mainstream classifiers that wait until the last fully connected layer to incorporate class information to make predictions, we investigate a proactive approach, asking each class to search for itself in an image. We realize this idea via a Transformer encoder-decoder inspired by DEtection TRansformer (DETR)… ▽ More

    Submitted 14 June, 2024; v1 submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted to International Conference on Learning Representations 2024 (ICLR 2024)

  29. arXiv:2311.03374  [pdf, other

    cs.SE cs.AI cs.IR

    Generative AI for Software Metadata: Overview of the Information Retrieval in Software Engineering Track at FIRE 2023

    Authors: Srijoni Majumdar, Soumen Paul, Debjyoti Paul, Ayan Bandyopadhyay, Samiran Chattopadhyay, Partha Pratim Das, Paul D Clough, Prasenjit Majumder

    Abstract: The Information Retrieval in Software Engineering (IRSE) track aims to develop solutions for automated evaluation of code comments in a machine learning framework based on human and large language model generated labels. In this track, there is a binary classification task to classify comments as useful and not useful. The dataset consists of 9048 code comments and surrounding code snippet pairs e… ▽ More

    Submitted 27 October, 2023; originally announced November 2023.

    Comments: Overview Paper of the Information Retrieval of Software Engineering Track at the Forum for Information Retrieval, 2023

  30. arXiv:2310.15239  [pdf, other

    cs.CL cs.AI

    CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks

    Authors: Mete Ismayilzada, Debjit Paul, Syrielle Montariol, Mor Geva, Antoine Bosselut

    Abstract: Recent efforts in natural language processing (NLP) commonsense reasoning research have yielded a considerable number of new datasets and benchmarks. However, most of these datasets formulate commonsense reasoning challenges in artificial scenarios that are not reflective of the tasks which real-world NLP systems are designed to solve. In this work, we present CRoW, a manually-curated, multi-task… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: 37 pages, camera-ready for EMNLP 2023

  31. arXiv:2309.17339  [pdf, other

    cs.LG

    Scaling Experiments in Self-Supervised Cross-Table Representation Learning

    Authors: Maximilian Schambach, Dominique Paul, Johannes S. Otterbach

    Abstract: To analyze the scaling potential of deep tabular representation learning models, we introduce a novel Transformer-based architecture specifically tailored to tabular data and cross-table representation learning by utilizing table-specific tokenizers and a shared Transformer backbone. Our training approach encompasses both single-table and cross-table models, trained via missing value imputation th… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

  32. arXiv:2309.08628  [pdf, other

    cs.CL cs.CR cs.LG

    Recovering from Privacy-Preserving Masking with Large Language Models

    Authors: Arpita Vats, Zhe Liu, Peng Su, Debjyoti Paul, Yingyi Ma, Yutong Pang, Zeeshan Ahmed, Ozlem Kalinli

    Abstract: Model adaptation is crucial to handle the discrepancy between proxy training data and actual users data received. To effectively perform adaptation, textual data of users is typically stored on servers or their local devices, where downstream natural language processing (NLP) models can be directly trained using such in-domain data. However, this might raise privacy and security concerns due to th… ▽ More

    Submitted 13 December, 2023; v1 submitted 12 September, 2023; originally announced September 2023.

    Comments: Accepted to ICASSP

  33. arXiv:2308.01285  [pdf, other

    cs.AI cs.HC

    Flows: Building Blocks of Reasoning and Collaborating AI

    Authors: Martin Josifoski, Lars Klein, Maxime Peyrard, Nicolas Baldwin, Yifei Li, Saibo Geng, Julian Paul Schnitzler, Yuxing Yao, Jiheng Wei, Debjit Paul, Robert West

    Abstract: Recent advances in artificial intelligence (AI) have produced highly capable and controllable systems. This creates unprecedented opportunities for structured reasoning as well as collaboration among multiple AI systems and humans. To fully realize this potential, it is essential to develop a principled way of designing and studying such structured interactions. For this purpose, we introduce the… ▽ More

    Submitted 7 February, 2024; v1 submitted 2 August, 2023; originally announced August 2023.

  34. arXiv:2305.09359  [pdf, other

    cs.CL

    Constructing and Interpreting Causal Knowledge Graphs from News

    Authors: Fiona Anting Tan, Debdeep Paul, Sahim Yamaura, Miura Koji, See-Kiong Ng

    Abstract: Many financial jobs rely on news to learn about causal events in the past and present, to make informed decisions and predictions about the future. With the ever-increasing amount of news available online, there is a need to automate the extraction of causal events from unstructured texts. In this work, we propose a methodology to construct causal knowledge graphs (KGs) from news using two steps:… ▽ More

    Submitted 30 July, 2023; v1 submitted 16 May, 2023; originally announced May 2023.

    Comments: Accepted to AAAI Summer Symposium 2023 (AI4FinTech)

  35. arXiv:2304.01904  [pdf, other

    cs.CL

    REFINER: Reasoning Feedback on Intermediate Representations

    Authors: Debjit Paul, Mete Ismayilzada, Maxime Peyrard, Beatriz Borges, Antoine Bosselut, Robert West, Boi Faltings

    Abstract: Language models (LMs) have recently shown remarkable performance on reasoning tasks by explicitly generating intermediate inferences, e.g., chain-of-thought prompting. However, these intermediate inference steps may be inappropriate deductions from the initial context and lead to incorrect final predictions. Here we introduce REFINER, a framework for finetuning LMs to explicitly generate intermedi… ▽ More

    Submitted 4 February, 2024; v1 submitted 4 April, 2023; originally announced April 2023.

    Comments: Accepted at EACL 2024

  36. arXiv:2301.08506  [pdf, other

    cs.CL cs.LG

    Language Agnostic Data-Driven Inverse Text Normalization

    Authors: Szu-Jui Chen, Debjyoti Paul, Yutong Pang, Peng Su, Xuedong Zhang

    Abstract: With the emergence of automatic speech recognition (ASR) models, converting the spoken form text (from ASR) to the written form is in urgent need. This inverse text normalization (ITN) problem attracts the attention of researchers from various fields. Recently, several works show that data-driven ITN methods can output high-quality written form text. Due to the scarcity of labeled spoken-written d… ▽ More

    Submitted 23 January, 2023; v1 submitted 20 January, 2023; originally announced January 2023.

  37. arXiv:2210.07228  [pdf, other

    cs.CL cs.LG

    Language Model Decoding as Likelihood-Utility Alignment

    Authors: Martin Josifoski, Maxime Peyrard, Frano Rajic, Jiheng Wei, Debjit Paul, Valentin Hartmann, Barun Patra, Vishrav Chaudhary, Emre Kıcıman, Boi Faltings, Robert West

    Abstract: A critical component of a successful language generation pipeline is the decoding algorithm. However, the general principles that should guide the choice of a decoding algorithm remain unclear. Previous works only compare decoding algorithms in narrow scenarios, and their findings do not generalize across tasks. We argue that the misalignment between the model's likelihood and the task-specific no… ▽ More

    Submitted 16 March, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted at EACL (Findings) 2023

  38. arXiv:2207.09674  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Improving Data Driven Inverse Text Normalization using Data Augmentation

    Authors: Laxmi Pandey, Debjyoti Paul, Pooja Chitkara, Yutong Pang, Xuedong Zhang, Kjell Schubert, Mark Chou, Shu Liu, Yatharth Saraf

    Abstract: Inverse text normalization (ITN) is used to convert the spoken form output of an automatic speech recognition (ASR) system to a written form. Traditional handcrafted ITN rules can be complex to transcribe and maintain. Meanwhile neural modeling approaches require quality large-scale spoken-written pair examples in the same or similar domain as the ASR system (in-domain data), to train. Both these… ▽ More

    Submitted 20 July, 2022; originally announced July 2022.

  39. arXiv:2201.01973  [pdf, other

    stat.ML cs.LG math.ST

    Robust Linear Predictions: Analyses of Uniform Concentration, Fast Rates and Model Misspecification

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: The problem of linear predictions has been extensively studied for the past century under pretty generalized frameworks. Recent advances in the robust statistics literature allow us to analyze robust versions of classical linear models through the prism of Median of Means (MoM). Combining these approaches in a piecemeal way might lead to ad-hoc procedures, and the restricted theoretical conclusion… ▽ More

    Submitted 11 March, 2022; v1 submitted 6 January, 2022; originally announced January 2022.

  40. arXiv:2110.14148  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Uniform Concentration Bounds toward a Unified Framework for Robust Clustering

    Authors: Debolina Paul, Saptarshi Chakraborty, Swagatam Das, Jason Xu

    Abstract: Recent advances in center-based clustering continue to improve upon the drawbacks of Lloyd's celebrated $k$-means algorithm over $60$ years after its introduction. Various methods seek to address poor local minima, sensitivity to outliers, and data that are not well-suited to Euclidean measures of fit, but many are supported largely empirically. Moreover, combining such approaches in a piecemeal m… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: To appear (spotlight) in the Thirty-fifth Conference on Neural Information Processing Systems (NeurIPS), 2021

  41. arXiv:2108.04187  [pdf, other

    cs.MM cs.LG cs.MA

    Scaling New Peaks: A Viewership-centric Approach to Automated Content Curation

    Authors: Subhabrata Majumdar, Deirdre Paul, Eric Zavesky

    Abstract: Summarizing video content is important for video streaming services to engage the user in a limited time span. To this end, current methods involve manual curation or using passive interest cues to annotate potential high-interest segments to form the basis of summarized videos, and are costly and unreliable. We propose a viewership-driven, automated method that accommodates a range of segment ide… ▽ More

    Submitted 9 August, 2021; originally announced August 2021.

  42. arXiv:2106.03973  [pdf, other

    cs.CL cs.AI

    Generating Hypothetical Events for Abductive Inference

    Authors: Debjit Paul, Anette Frank

    Abstract: Abductive reasoning starts from some observations and aims at finding the most plausible explanation for these observations. To perform abduction, humans often make use of temporal and causal inferences, and knowledge about how some hypothetical situation can result in different outcomes. This work offers the first study of how such knowledge impacts the Abductive NLI task -- which consists in cho… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: Proceedings of The Tenth Joint Conference on Lexical and Computational Semantics (STARSEM 2021)

  43. arXiv:2106.02497  [pdf, other

    cs.CL cs.AI

    COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion

    Authors: Debjit Paul, Anette Frank

    Abstract: Despite recent successes of large pre-trained language models in solving reasoning tasks, their inference capabilities remain opaque. We posit that such models can be made more interpretable by explicitly generating interim inference rules, and using them to guide the generation of task-specific textual outputs. In this paper we present COINS, a recursive inference framework that i) iteratively re… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: ACL 2021

  44. arXiv:2105.12287  [pdf, other

    cs.DB cs.AI

    Database Workload Characterization with Query Plan Encoders

    Authors: Debjyoti Paul, Jie Cao, Feifei Li, Vivek Srikumar

    Abstract: Smart databases are adopting artificial intelligence (AI) technologies to achieve {\em instance optimality}, and in the future, databases will come with prepackaged AI models within their core components. The reason is that every database runs on different workloads, demands specific resources, and settings to achieve optimal performance. It prompts the necessity to understand workloads running in… ▽ More

    Submitted 25 May, 2021; originally announced May 2021.

  45. arXiv:2105.03157  [pdf, other

    cs.CL

    CO-NNECT: A Framework for Revealing Commonsense Knowledge Paths as Explicitations of Implicit Knowledge in Texts

    Authors: Maria Becker, Katharina Korfhage, Debjit Paul, Anette Frank

    Abstract: In this work we leverage commonsense knowledge in form of knowledge paths to establish connections between sentences, as a form of explicitation of implicit knowledge. Such connections can be direct (singlehop paths) or require intermediate concepts (multihop paths). To construct such paths we combine two model types in a joint framework we call Co-nnect: a relation classifier that predicts direct… ▽ More

    Submitted 7 May, 2021; originally announced May 2021.

    Comments: Accepted at IWCS 2021

  46. arXiv:2105.00316  [pdf, other

    cs.IT

    t-Entropy: A New Measure of Uncertainty with Some Applications

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: The concept of Entropy plays a key role in Information Theory, Statistics, and Machine Learning.This paper introduces a new entropy measure, called the t-entropy, which exploits the concavity of the inverse-tan function. We analytically show that the proposed t-entropy satisfies the prominent axiomatic properties of an entropy measure. We demonstrate an application of the proposed entropy measure… ▽ More

    Submitted 5 May, 2021; v1 submitted 1 May, 2021; originally announced May 2021.

  47. arXiv:2103.16285  [pdf

    cs.CV cs.AI

    Single Test Image-Based Automated Machine Learning System for Distinguishing between Trait and Diseased Blood Samples

    Authors: Sahar A. Nasser, Debjani Paul, Suyash P. Awate

    Abstract: We introduce a machine learning-based method for fully automated diagnosis of sickle cell disease of poor-quality unstained images of a mobile microscope. Our method is capable of distinguishing between diseased, trait (carrier), and normal samples unlike the previous methods that are limited to distinguishing the normal from the abnormal samples only. The novelty of this method comes from disting… ▽ More

    Submitted 30 March, 2021; originally announced March 2021.

  48. arXiv:2102.03403  [pdf, other

    stat.ML cs.LG math.ST

    Robust Principal Component Analysis: A Median of Means Approach

    Authors: Debolina Paul, Saptarshi Chakraborty, Swagatam Das

    Abstract: Principal Component Analysis (PCA) is a fundamental tool for data visualization, denoising, and dimensionality reduction. It is widely popular in Statistics, Machine Learning, Computer Vision, and related fields. However, PCA is well-known to fall prey to outliers and often fails to detect the true underlying low-dimensional structure within the dataset. Following the Median of Means (MoM) philoso… ▽ More

    Submitted 20 July, 2023; v1 submitted 5 February, 2021; originally announced February 2021.

  49. arXiv:2012.10929  [pdf, other

    cs.LG stat.ML

    Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

    Authors: Saptarshi Chakraborty, Debolina Paul, Swagatam Das

    Abstract: Mean shift is a simple interactive procedure that gradually shifts data points towards the mode which denotes the highest density of data points in the region. Mean shift algorithms have been effectively used for data denoising, mode seeking, and finding the number of clusters in a dataset in an automated fashion. However, the merits of mean shift quickly fade away as the data dimensions increase… ▽ More

    Submitted 10 May, 2021; v1 submitted 20 December, 2020; originally announced December 2020.

    Comments: To appear at the 35-th AAAI Conference on Artificial Intelligence, February 2-9, 2021

  50. Scheduling Beyond CPUs for HPC

    Authors: Yuping Fan, Zhiling Lan, Paul Rich, William E. Allcock, Michael E. Papka, Brian Austin, David Paul

    Abstract: High performance computing (HPC) is undergoing significant changes. The emerging HPC applications comprise both compute- and data-intensive applications. To meet the intense I/O demand from emerging data-intensive applications, burst buffers are deployed in production systems. Existing HPC schedulers are mainly CPU-centric. The extreme heterogeneity of hardware devices, combined with workload chan… ▽ More

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: Accepted by HPDC 2019

    Journal ref: Proceedings of the 28th ACM International Symposium on High-Performance Parallel and Distributed Computing (HPDC'19), 2019