Showing 1–50 of 101 results for author: Abhilash

  1. arXiv:2506.08189  [pdf, ps, other]

    cs.CV cs.CL

    Open World Scene Graph Generation using Vision Language Models

    Authors: Amartya Dutta, Kazi Sajeed Mehrab, Medha Sawhney, Abhilash Neog, Mridul Khurana, Sepideh Fatemi, Aanish Pradhan, M. Maruf, Ismini Lourentzou, Arka Daw, Anuj Karpatne

    Abstract: Scene-Graph Generation (SGG) seeks to recognize objects in an image and distill their salient pairwise relationships. Most methods depend on dataset-specific supervision to learn the variety of interactions, restricting their usefulness in open-world settings, involving novel objects and/or relations. Even methods that leverage large Vision Language Models (VLMs) typically require benchmark-specif…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: Accepted in CVPR 2025 Workshop (CVinW)

  2. arXiv:2506.05629  [pdf, ps, other]

    cs.CL

    Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs

    Authors: Ananth Muppidi, Abhilash Nandy, Sambaran Bandyopadhyay

    Abstract: The performance of large language models in domain-specific tasks necessitates fine-tuning, which is computationally expensive and technically challenging. This paper focuses on parameter-efficient fine-tuning using soft prompting, a promising approach that adapts pre-trained models to downstream tasks by learning a small set of parameters. We propose a novel Input Dependent Soft Prompting techniq…

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: Accepted in ACL 2025 (Main) Conference

  3. arXiv:2505.06548  [pdf, ps, other]

    cs.CL

    REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

    Authors: Aniruddha Roy, Pretam Ray, Abhilash Nandy, Somak Aditya, Pawan Goyal

    Abstract: Instruction-based Large Language Models (LLMs) have proven effective in numerous few-shot or zero-shot Natural Language Processing (NLP) tasks. However, creating human-annotated instruction data is time-consuming, expensive, and often limited in quantity and task diversity. Previous research endeavors have attempted to address this challenge by proposing frameworks capable of generating instructio…

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 11 pages

  4. arXiv:2505.00949  [pdf, other]

    cs.CL cs.AI cs.LG

    Llama-Nemotron: Efficient Reasoning Models

    Authors: Akhiad Bercovich, Itay Levy, Izik Golan, Mohammad Dabbah, Ran El-Yaniv, Omri Puny, Ido Galil, Zach Moshe, Tomer Ronen, Najeeb Nabwani, Ido Shahaf, Oren Tropp, Ehud Karpas, Ran Zilberstein, Jiaqi Zeng, Soumye Singhal, Alexander Bukharin, Yian Zhang, Tugrul Konuk, Gerald Shen, Ameya Sunil Mahabaleshwarkar, Bilal Kartal, Yoshi Suhara, Olivier Delalleau, Zijia Chen, et al. (109 additional authors not shown)

    Abstract: We introduce the Llama-Nemotron series of models, an open family of heterogeneous reasoning models that deliver exceptional reasoning capabilities, inference efficiency, and an open license for enterprise use. The family comes in three sizes -- Nano (8B), Super (49B), and Ultra (253B) -- and performs competitively with state-of-the-art reasoning models such as DeepSeek-R1 while offering superior i…

    Submitted 14 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  5. arXiv:2504.06422  [pdf, other]

    eess.IV cs.CV

    Retuve: Automated Multi-Modality Analysis of Hip Dysplasia with Open Source AI

    Authors: Adam McArthur, Stephanie Wichuk, Stephen Burnside, Andrew Kirby, Alexander Scammon, Damian Sol, Abhilash Hareendranathan, Jacob L. Jaremko

    Abstract: Developmental dysplasia of the hip (DDH) poses significant diagnostic challenges, hindering timely intervention. Current screening methodologies lack standardization, and AI-driven studies suffer from reproducibility issues due to limited data and code availability. To address these limitations, we introduce Retuve, an open-source framework for multi-modality DDH analysis, encompassing both ultras…

    Submitted 8 April, 2025; originally announced April 2025.

    Comments: 12 pages, 8 figures, submitted to Software Impacts

  6. arXiv:2504.01879  [pdf, other]

    cs.CL cs.CV cs.IR

    TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables

    Authors: Abhilash Shankarampeta, Harsh Mahajan, Tushar Kataria, Dan Roth, Vivek Gupta

    Abstract: Humans continuously make new discoveries, and understanding the temporal sequence of events leading to these breakthroughs is essential for advancing science and society. This ability to reason over time allows us to identify future steps and understand the effects of financial and political decisions on our lives. However, large language models (LLMs) are typically trained on static datasets, limitin…

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 19 pages, 21 tables, 1 figure

  7. arXiv:2502.19151  [pdf]

    cs.LG cond-mat.mtrl-sci physics.app-ph

    Design of Resistive Frequency Selective Surface based Radar Absorbing Structure-A Deep Learning Approach

    Authors: Vijay Kumar Sutrakar, Nikhil Morge, Anjana PK, Abhilash PV

    Abstract: In this paper, a deep learning-based approach for the design of a radar absorbing structure using a resistive frequency selective surface is proposed. In the present design, the reflection coefficient is used as the input of the deep learning model and the Jerusalem cross based unit cell dimensions are predicted as the outcome. Sequential neural network based deep learning model with adaptive moment estimation optimizer…

    Submitted 26 February, 2025; originally announced February 2025.

  8. arXiv:2502.18170  [pdf, ps, other]

    quant-ph cs.CC cs.IT

    Pauli measurements are not optimal for single-copy tomography

    Authors: Jayadev Acharya, Abhilash Dharmavarapu, Yuhan Liu, Nengkun Yu

    Abstract: Quantum state tomography is a fundamental problem in quantum computing. Given $n$ copies of an unknown $N$-qubit state $ρ\in \mathbb{C}^{d \times d},d=2^N$, the goal is to learn the state up to an accuracy $ε$ in trace distance, with at least probability 0.99. We are interested in the copy complexity, the minimum number of copies of $ρ$ needed to fulfill the task. Pauli measurements have attract…

    Submitted 25 February, 2025; originally announced February 2025.

    Comments: Accepted at STOC 2025

    ACM Class: E.4; F.2.0; G.3

  9. arXiv:2502.15785  [pdf, other]

    cs.LG cs.AI

    Masking the Gaps: An Imputation-Free Approach to Time Series Modeling with Missing Data

    Authors: Abhilash Neog, Arka Daw, Sepideh Fatemi Khorasgani, Anuj Karpatne

    Abstract: A significant challenge in time-series (TS) modeling is the presence of missing values in real-world TS datasets. Traditional two-stage frameworks, involving imputation followed by modeling, suffer from two key drawbacks: (1) the propagation of imputation errors into subsequent TS modeling, (2) the trade-offs between imputation efficacy and imputation complexity. While one-stage approaches attempt…

    Submitted 17 February, 2025; originally announced February 2025.

    Comments: 15 pages

  10. arXiv:2501.09668  [pdf, other]

    cs.RO

    Model Predictive Path Integral Docking of Fully Actuated Surface Vessel

    Authors: Akash Vijayakumar, Atmanand M A, Abhilash Somayajula

    Abstract: Autonomous docking remains one of the most challenging maneuvers in marine robotics, requiring precise control and robust perception in confined spaces. This paper presents a novel approach integrating Model Predictive Path Integral (MPPI) control with real-time LiDAR-based dock detection for autonomous surface vessel docking. Our framework uniquely combines probabilistic trajectory optimization wi…

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 6 pages, 6 figures, 1 table, UT2025 Conference, IEEE International Symposium on Underwater Technology 2025

  11. arXiv:2412.09222  [pdf, other]

    cs.CR cs.IT

    Building a Privacy Web with SPIDEr -- Secure Pipeline for Information De-Identification with End-to-End Encryption

    Authors: Novoneel Chakraborty, Anshoo Tandon, Kailash Reddy, Kaushal Kirpekar, Bryan Paul Robert, Hari Dilip Kumar, Abhilash Venkatesh, Abhay Sharma

    Abstract: Data de-identification makes it possible to glean insights from data while preserving user privacy. The use of Trusted Execution Environments (TEEs) allows for the execution of de-identification applications on the cloud without the need for a user to trust the third-party application provider. In this paper, we present SPIDEr - Secure Pipeline for Information De-Identification with End-to-…

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: 3 pages, 2 figures

  12. arXiv:2411.07550  [pdf, other]

    cs.RO

    Learning Autonomous Docking Operation of Fully Actuated Autonomous Surface Vessel from Expert data

    Authors: Akash Vijayakumar, Atmanand M A, Abhilash Somayajula

    Abstract: This paper presents an approach for autonomous docking of a fully actuated autonomous surface vessel using expert demonstration data. We frame the docking problem as an imitation learning task and employ inverse reinforcement learning (IRL) to learn a reward function from expert trajectories. A two-stage neural network architecture is implemented to incorporate both environmental context from sens…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: 5 pages, 8 figures, IEEE Oceans Halifax 2024 Conference, Presented in September 2024 in IEEE Oceans Conference in Halifax, Canada as a Student Poster

  13. arXiv:2410.22476  [pdf, other]

    cs.CL cs.IR

    A Pointer Network-based Approach for Joint Extraction and Detection of Multi-Label Multi-Class Intents

    Authors: Ankan Mullick, Sombit Bose, Abhilash Nandy, Gajula Sai Chaitanya, Pawan Goyal

    Abstract: In task-oriented dialogue systems, intent detection is crucial for interpreting user queries and providing appropriate responses. Existing research primarily addresses simple queries with a single intent, lacking effective systems for handling complex queries with multiple intents and extracting different intent spans. Additionally, there is a notable absence of multilingual, multi-intent datasets…

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: Accepted at EMNLP 2024 Findings (Long Paper)

  14. arXiv:2410.01400  [pdf, other]

    cs.CL

    CrowdCounter: A benchmark type-specific multi-target counterspeech dataset

    Authors: Punyajoy Saha, Abhilash Datta, Abhik Jana, Animesh Mukherjee

    Abstract: Counterspeech presents a viable alternative to banning or suspending users for hate speech while upholding freedom of expression. However, writing effective counterspeech is challenging for moderators/users. Hence, developing suggestion tools for writing counterspeech is the need of the hour. One critical challenge in developing such a tool is the lack of quality and diversity of the responses in…

    Submitted 2 October, 2024; originally announced October 2024.

    Comments: 19 pages, 1 figure, 14 tables, Code available https://github.com/hate-alert/CrowdCounter

  15. arXiv:2409.13592  [pdf, other]

    cs.CV cs.AI cs.CL

    YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

    Authors: Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, Ankit Raj, Pawan Goyal, Niloy Ganguly

    Abstract: Understanding satire and humor is a challenging task for even current Vision-Language models. In this paper, we propose the challenging tasks of Satirical Image Detection (detecting whether an image is satirical), Understanding (generating the reason behind the image being satirical), and Completion (given one half of the image, selecting the other half from 2 given options, such that the complete…

    Submitted 20 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024 Main (Long), 18 pages, 14 figures, 12 tables

  16. arXiv:2409.06821  [pdf, other]

    cs.CV

    Sam2Rad: A Segmentation Model for Medical Images with Learnable Prompts

    Authors: Assefa Seyoum Wahd, Banafshe Felfeliyan, Yuyue Zhou, Shrimanti Ghosh, Adam McArthur, Jiechen Zhang, Jacob L. Jaremko, Abhilash Hareendranathan

    Abstract: Foundation models like the segment anything model require high-quality manual prompts for medical image segmentation, which is time-consuming and requires expertise. SAM and its variants often fail to segment structures in ultrasound (US) images due to domain shift. We propose Sam2Rad, a prompt learning approach to adapt SAM and its variants for US bone segmentation without human prompts. It int…

    Submitted 10 September, 2024; originally announced September 2024.

  17. arXiv:2408.16387  [pdf, other]

    cs.CR

    Enhancing MOTION2NX for Efficient, Scalable and Secure Image Inference using Convolutional Neural Networks

    Authors: Haritha K, Ramya Burra, Srishti Mittal, Sarthak Sharma, Abhilash Venkatesh, Anshoo Tandon

    Abstract: This work contributes towards the development of an efficient and scalable open-source Secure Multi-Party Computation (SMPC) protocol on machines with moderate computational resources. We use the ABY2.0 SMPC protocol implemented on the C++ based MOTION2NX framework for secure convolutional neural network (CNN) inference application with semi-honest security. Our list of contributions are as follow…

    Submitted 24 October, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: 20 pages, 1 figure. arXiv admin note: text overlap with arXiv:2310.10133

  18. arXiv:2408.16176  [pdf, other]

    cs.CV

    VLM4Bio: A Benchmark Dataset to Evaluate Pretrained Vision-Language Models for Trait Discovery from Biological Images

    Authors: M. Maruf, Arka Daw, Kazi Sajeed Mehrab, Harish Babu Manogaran, Abhilash Neog, Medha Sawhney, Mridul Khurana, James P. Balhoff, Yasin Bakis, Bahadir Altintas, Matthew J. Thompson, Elizabeth G. Campolongo, Josef C. Uyeda, Hilmar Lapp, Henry L. Bart, Paula M. Mabee, Yu Su, Wei-Lun Chao, Charles Stewart, Tanya Berger-Wolf, Wasila Dahdul, Anuj Karpatne

    Abstract: Images are increasingly becoming the currency for documenting biodiversity on the planet, providing novel opportunities for accelerating scientific discoveries in the field of organismal biology, especially with the advent of large vision-language models (VLMs). We ask if pre-trained VLMs can aid scientists in answering a range of biologically relevant questions without any additional fine-tuning.…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 36 pages, 37 figures, 7 tables

  19. arXiv:2408.04886  [pdf]

    cs.PF

    Automated PMC-based Power Modeling Methodology for Modern Mobile GPUs

    Authors: Pranab Dash, Y. Charlie Hu, Abhilash Jindal

    Abstract: The rise of machine learning workloads on smartphones has propelled GPUs into one of the most power-hungry components of modern smartphones and elevated the need for optimizing the GPU power draw by mobile apps. Optimizing the power consumption of mobile GPUs in turn requires accurate estimation of their power draw during app execution. In this paper, we observe that the prior-art, utilization-freq…

    Submitted 9 August, 2024; originally announced August 2024.

  20. arXiv:2407.14202  [pdf, other]

    cs.NE cs.AI

    SHS: Scorpion Hunting Strategy Swarm Algorithm

    Authors: Abhilash Singh, Seyed Muhammad Hossein Mousavi, Kumar Gaurav

    Abstract: We introduced the Scorpion Hunting Strategy (SHS), a novel population-based, nature-inspired optimisation algorithm. This algorithm draws inspiration from the hunting strategy of scorpions, which identify, locate, and capture their prey using the alpha and beta vibration operators. These operators control the SHS algorithm's exploitation and exploration abilities. To formulate an optimisation meth…

    Submitted 30 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

  21. arXiv:2407.08027  [pdf, other]

    cs.CV

    Fish-Vista: A Multi-Purpose Dataset for Understanding & Identification of Traits from Images

    Authors: Kazi Sajeed Mehrab, M. Maruf, Arka Daw, Abhilash Neog, Harish Babu Manogaran, Mridul Khurana, Zhenyang Feng, Bahadir Altintas, Yasin Bakis, Elizabeth G Campolongo, Matthew J Thompson, Xiaojun Wang, Hilmar Lapp, Tanya Berger-Wolf, Paula Mabee, Henry Bart, Wei-Lun Chao, Wasila M Dahdul, Anuj Karpatne

    Abstract: We introduce Fish-Visual Trait Analysis (Fish-Vista), the first organismal image dataset designed for the analysis of visual traits of aquatic species directly from images using problem formulations in computer vision. Fish-Vista contains 69,126 annotated images spanning 4,154 fish species, curated and organized to serve three downstream tasks of species classification, trait identification, and t…

    Submitted 27 February, 2025; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Preprint. Accepted to CVPR 2025

  22. arXiv:2407.04560  [pdf, other]

    cs.CV

    Real Time Emotion Analysis Using Deep Learning for Education, Entertainment, and Beyond

    Authors: Abhilash Khuntia, Shubham Kale

    Abstract: The significance of emotion detection is increasing in education, entertainment, and various other domains. We are developing a system that can identify and transform facial expressions into emojis to provide immediate feedback. The project consists of two components. Initially, we will employ sophisticated image processing techniques and neural networks to construct a deep learning model capable o…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: 8 pages, 23 figures

  23. arXiv:2407.03305  [pdf, other]

    cs.CV

    Advanced Smart City Monitoring: Real-Time Identification of Indian Citizen Attributes

    Authors: Shubham Kale, Shashank Sharma, Abhilash Khuntia

    Abstract: This project focuses on creating a smart surveillance system for Indian cities that can identify and analyze people's attributes in real time. Using advanced technologies like artificial intelligence and machine learning, the system can recognize attributes such as upper body color, what the person is wearing, accessories they are wearing, headgear, etc., and analyze behavior through cameras insta…

    Submitted 5 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 8 figures; the title was changed and some alignment issues were resolved, but the other contents remain the same

  24. Cross-view geo-localization: a survey

    Authors: Abhilash Durgam, Sidike Paheding, Vikas Dhiman, Vijay Devabhaktuni

    Abstract: Cross-view geo-localization has garnered notable attention in the realm of computer vision, spurred by the widespread availability of copious geotagged datasets and the advancements in machine learning techniques. This paper provides a thorough survey of cutting-edge methodologies, techniques, and associated challenges that are integral to this domain, with a focus on feature-based and deep learni…

    Submitted 14 June, 2024; originally announced June 2024.

  25. arXiv:2404.17912  [pdf, other]

    cs.CL cs.AI cs.LG

    SERPENT-VLM : Self-Refining Radiology Report Generation Using Vision Language Models

    Authors: Manav Nitin Kapadnis, Sohan Patnaik, Abhilash Nandy, Sourjyadip Ray, Pawan Goyal, Debdoot Sheet

    Abstract: Radiology Report Generation (R2Gen) demonstrates how Multi-modal Large Language Models (MLLMs) can automate the creation of accurate and coherent radiological reports. Existing methods often hallucinate details in text-based reports that don't accurately reflect the image content. To mitigate this, we introduce a novel strategy, SERPENT-VLM (SElf Refining Radiology RePort GENeraTion using Vision L…

    Submitted 18 July, 2024; v1 submitted 27 April, 2024; originally announced April 2024.

    Comments: 8 pages, 3 figures, 4 tables, Accepted as oral at Clinical NLP workshop at NAACL 2024

  26. arXiv:2404.04676  [pdf, other]

    cs.CL

    Order-Based Pre-training Strategies for Procedural Text Understanding

    Authors: Abhilash Nandy, Yash Kulkarni, Pawan Goyal, Niloy Ganguly

    Abstract: In this paper, we propose sequence-based pretraining methods to enhance procedural understanding in natural language processing. Procedural text, containing sequential instructions to accomplish a task, is difficult to understand due to the changing attributes of entities in the context. We focus on recipes, which are commonly represented as ordered instructions, and use this order as a supervisio…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: 8 pages (Accepted for publication at NAACL 2024 (Main Conference))

  27. arXiv:2404.01329  [pdf, other]

    cs.SI

    Unraveling the Dynamics of Television Debates and Social Media Engagement: Insights from an Indian News Show

    Authors: Kiran Garimella, Abhilash Datta

    Abstract: The relationship between television shows and social media has become increasingly intertwined in recent years. Social media platforms, particularly Twitter, have emerged as significant sources of public opinion and discourse on topics discussed in television shows. In India, news debates leverage the popularity of social media to promote hashtags and engage users in discussions and debates on a d…

    Submitted 29 March, 2024; originally announced April 2024.

    Comments: Accepted at ICWSM 2024. Please cite the ICWSM version

  28. arXiv:2403.04670  [pdf, other]

    cs.LG

    End-to-end Conditional Robust Optimization

    Authors: Abhilash Chenreddy, Erick Delage

    Abstract: The field of Contextual Optimization (CO) integrates machine learning and optimization to solve decision making problems under uncertainty. Recently, a risk sensitive variant of CO, known as Conditional Robust Optimization (CRO), combines uncertainty quantification with robust optimization in order to promote safety and reliability in high stake applications. Exploiting modern differentiable optim…

    Submitted 7 March, 2024; originally announced March 2024.

  29. arXiv:2402.14300  [pdf, other]

    cs.CV

    A Simple Framework Uniting Visual In-context Learning with Masked Image Modeling to Improve Ultrasound Segmentation

    Authors: Yuyue Zhou, Banafshe Felfeliyan, Shrimanti Ghosh, Jessica Knight, Fatima Alves-Pereira, Christopher Keen, Jessica Küpper, Abhilash Rakkunedeth Hareendranathan, Jacob L. Jaremko

    Abstract: Conventional deep learning models deal with images one-by-one, requiring costly and time-consuming expert labeling in the field of medical imaging, and domain-specific restriction limits model generalizability. Visual in-context learning (ICL) is a new and exciting area of research in computer vision. Unlike conventional deep learning, ICL emphasizes the model's ability to adapt to new tasks based…

    Submitted 8 March, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  30. arXiv:2401.06331  [pdf]

    cs.CV

    Application Of Vision-Language Models For Assessing Osteoarthritis Disease Severity

    Authors: Banafshe Felfeliyan, Yuyue Zhou, Shrimanti Ghosh, Jessica Kupper, Shaobo Liu, Abhilash Hareendranathan, Jacob L. Jaremko

    Abstract: Osteoarthritis (OA) poses a global health challenge, demanding precise diagnostic methods. Current radiographic assessments are time consuming and prone to variability, prompting the need for automated solutions. The existing deep learning models for OA assessment are unimodal single task systems and they don't incorporate relevant text information such as patient demographics, disease history, or…

    Submitted 11 January, 2024; originally announced January 2024.

  31. arXiv:2311.02216  [pdf, other]

    cs.CL cs.LG

    Exploring the Numerical Reasoning Capabilities of Language Models: A Comprehensive Analysis on Tabular Data

    Authors: Mubashara Akhtar, Abhilash Shankarampeta, Vivek Gupta, Arpit Patil, Oana Cocarascu, Elena Simperl

    Abstract: Numbers are crucial for various real-world domains such as finance, economics, and science. Thus, understanding and reasoning with numbers are essential skills for language models to solve different tasks. While different numerical benchmarks have been introduced in recent years, they are limited to specific numerical aspects mostly. In this paper, we propose a hierarchical taxonomy for numerical…

    Submitted 3 November, 2023; originally announced November 2023.

    Comments: Accepted at EMNLP 2023 (Findings)

  32. arXiv:2310.16048  [pdf, ps, other]

    cs.AI cs.CL cs.CY cs.HC cs.LG

    AI Alignment and Social Choice: Fundamental Limitations and Policy Implications

    Authors: Abhilash Mishra

    Abstract: Aligning AI agents to human intentions and values is a key bottleneck in building safe and deployable AI applications. But whose values should AI agents be aligned with? Reinforcement learning with human feedback (RLHF) has emerged as the key framework for AI alignment. RLHF uses feedback from human reinforcers to fine-tune outputs; all widely deployed large language models (LLMs) use RLHF to alig…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 10 pages, no figures

  33. arXiv:2310.14326  [pdf, other]

    cs.CL cs.AI

    CLMSM: A Multi-Task Learning Framework for Pre-training on Procedural Text

    Authors: Abhilash Nandy, Manav Nitin Kapadnis, Pawan Goyal, Niloy Ganguly

    Abstract: In this paper, we propose CLMSM, a domain-specific, continual pre-training framework, that learns from a large set of procedural recipes. CLMSM uses a Multi-Task Learning Framework to optimize two objectives - a) Contrastive Learning using hard triplets to learn fine-grained differences across entities in the procedures, and b) a novel Mask-Step Modelling objective to learn step-wise context of a…

    Submitted 22 October, 2023; originally announced October 2023.

    Comments: Accepted to EMNLP Findings 2023, 14 pages, 4 figures

  34. Collision Avoidance for Autonomous Surface Vessels using Novel Artificial Potential Fields

    Authors: Aditya Kailas Jadhav, Anantha Raj Pandi, Abhilash Somayajula

    Abstract: As the demand for transportation through waterways continues to rise, the number of vessels plying the waters has correspondingly increased. This has resulted in a greater number of accidents and collisions between ships, some of which lead to significant loss of life and financial losses. Research has shown that human error is a major factor responsible for such incidents. The maritime industry i…

    Submitted 9 October, 2023; originally announced October 2023.

    Comments: 28 pages, 30 figures

    Journal ref: Ocean Engineering, 288 (2023), 116011

  35. arXiv:2310.00205  [pdf, other]

    cs.SE cs.CR

    Finding 709 Defects in 258 Projects: An Experience Report on Applying CodeQL to Open-Source Embedded Software (Experience Paper) -- Extended Report

    Authors: Mingjie Shen, Akul Abhilash Pillai, Brian A. Yuan, James C. Davis, Aravind Machiry

    Abstract: In this experience paper, we report on a large-scale empirical study of Static Application Security Testing (SAST) in Open-Source Embedded Software (EMBOSS) repositories. We collected a corpus of 258 of the most popular EMBOSS projects, and then measured their use of SAST tools via program analysis and a survey (N=25) of their developers. Advanced SAST tools are rarely used -- only 3% of projects…

    Submitted 25 April, 2025; v1 submitted 29 September, 2023; originally announced October 2023.

    Comments: This is the extended version of: Mingjie Shen, Akul Abhilash Pillai, Brian A. Yuan, James C. Davis, and Aravind Machiry. 2025. Finding 709 Defects in 258 Projects: An Experience Report on Applying CodeQL to Open-Source Embedded Software (Experience Paper). Proc. ACM Softw. Eng. 2, ISSTA, Article ISSTA048 (July 2025), 24 pages. https://doi.org/10.1145/3728923

  36. arXiv:2309.09490  [pdf, other]

    eess.IV cs.CV

    Self-supervised TransUNet for Ultrasound regional segmentation of the distal radius in children

    Authors: Yuyue Zhou, Jessica Knight, Banafshe Felfeliyan, Christopher Keen, Abhilash Rakkunedeth Hareendranathan, Jacob L. Jaremko

    Abstract: Supervised deep learning offers great promise to automate analysis of medical images from segmentation to diagnosis. However, their performance highly relies on the quality and quantity of the data annotation. Meanwhile, curating large annotated datasets for medical images requires a high level of expertise, which is time-consuming and expensive. Recently, to quench the thirst for large data sets…

    Submitted 18 September, 2023; originally announced September 2023.

  37. arXiv:2309.05497  [pdf, other]

    cs.CL cs.CY

    Personality Detection and Analysis using Twitter Data

    Authors: Abhilash Datta, Souvic Chakraborty, Animesh Mukherjee

    Abstract: Personality types are important in various fields as they hold relevant information about the characteristics of a human being in an explainable format. They are often good predictors of a person's behaviors in a particular environment and have applications ranging from candidate selection to marketing and mental health. Recently automatic detection of personality traits from texts has gained sign…

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: Submitted to ASONAM 2023

  38. arXiv:2307.05911  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    Grain and Grain Boundary Segmentation using Machine Learning with Real and Generated Datasets

    Authors: Peter Warren, Nandhini Raju, Abhilash Prasad, Shajahan Hossain, Ramesh Subramanian, Jayanta Kapat, Navin Manjooran, Ranajay Ghosh

    Abstract: We report significantly improved accuracy of grain boundary segmentation using Convolutional Neural Networks (CNN) trained on a combination of real and generated data. Manual segmentation is accurate but time-consuming, and existing computational methods are faster but often inaccurate. To combat this dilemma, machine learning models can be used to achieve the accuracy of manual segmentation and h…

    Submitted 12 July, 2023; originally announced July 2023.

  39. arXiv:2306.17173  [pdf, other]

    cs.NI

    Photon: A Cross Platform P2P Data Transfer Application

    Authors: Abhilash Shreedhar Hegde, Amruta Narayana Hegde, Adeep Krishna Keelar, Ananya Mathur

    Abstract: Modern computing requires efficient and dependable data transport. Current solutions like Bluetooth, SMS (Short Message Service), and Email have their restrictions on efficiency, file size, compatibility, and cost. In order to facilitate direct communication and resource sharing amongst linked devices, this research study offers a cross-platform peer-to-peer (P2P) data transmission solution that t…

    Submitted 16 June, 2023; originally announced June 2023.

  40. arXiv:2306.10374  [pdf, ps, other]

    math.OC cs.LG

    A Survey of Contextual Optimization Methods for Decision Making under Uncertainty

    Authors: Utsav Sadana, Abhilash Chenreddy, Erick Delage, Alexandre Forel, Emma Frejinger, Thibaut Vidal

    Abstract: Recently there has been a surge of interest in operations research (OR) and the machine learning (ML) community in combining prediction algorithms and optimization techniques to solve decision-making problems in the face of uncertainty. This gave rise to the field of contextual optimization, under which data-driven procedures are developed to prescribe actions to the decision-maker that make the b…

    Submitted 2 February, 2024; v1 submitted 17 June, 2023; originally announced June 2023.

  41. arXiv:2306.06190  [pdf, other]

    cs.CL cs.LG

    $FastDoc$: Domain-Specific Fast Continual Pre-training Technique using Document-Level Metadata and Taxonomy

    Authors: Abhilash Nandy, Manav Nitin Kapadnis, Sohan Patnaik, Yash Parag Butala, Pawan Goyal, Niloy Ganguly

    Abstract: In this paper, we propose $FastDoc$ (Fast Continual Pre-training Technique using Document Level Metadata and Taxonomy), a novel, compute-efficient framework that utilizes Document metadata and Domain-Specific Taxonomy as supervision signals to continually pre-train transformer encoder on a domain-specific corpus. The main innovation is that during domain-specific pretraining, an open-domain encode…

    Submitted 1 November, 2024; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR), 36 pages, 8 figures

    MSC Class: 68T50 ACM Class: I.2.7

  42. arXiv:2305.19271  [pdf, other]

    cs.CL

    Concise Answers to Complex Questions: Summarization of Long-form Answers

    Authors: Abhilash Potluri, Fangyuan Xu, Eunsol Choi

    Abstract: Long-form question answering systems provide rich information by presenting paragraph-level answers, often containing optional background or auxiliary information. While such comprehensive answers are helpful, not all information is required to answer the question (e.g. users with domain knowledge do not need an explanation of background). Can we provide a concise version of the answer by summariz…

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: ACL 2023 Long Paper

  43. arXiv:2303.10782  [pdf, ps, other]

    cs.CL cs.CV

    On the Importance of Signer Overlap for Sign Language Detection

    Authors: Abhilash Pal, Stephan Huber, Cyrine Chaabani, Alessandro Manzotti, Oscar Koller

    Abstract: Sign language detection, identifying if someone is signing or not, is becoming crucially important for its applications in remote conferencing software and for selecting useful sign data for training sign language recognition or translation tasks. We argue that the current benchmark data sets for sign language detection estimate overly positive results that do not generalize well due to signer ove…

    Submitted 19 March, 2023; originally announced March 2023.

  44. A Bayesian Model Combination-based approach to Active Malware Analysis

    Authors: Abhilash Hota, Jurgen Schonwalder

    Abstract: Active Malware Analysis involves modeling malware behavior by executing actions to trigger responses and explore multiple execution paths. One of the aims is making the action selection more efficient. This paper treats Active Malware Analysis as a Bayes-Active Markov Decision Process and uses a Bayesian Model Combination approach to train an analyzer agent. We show an improvement in performance a…

    Submitted 9 December, 2022; originally announced December 2022.

  45. arXiv:2210.13326  [pdf, other]

    cs.CL cs.CV

    Clean Text and Full-Body Transformer: Microsoft's Submission to the WMT22 Shared Task on Sign Language Translation

    Authors: Subhadeep Dey, Abhilash Pal, Cyrine Chaabani, Oscar Koller

    Abstract: This paper describes Microsoft's submission to the first shared task on sign language translation at WMT 2022, a public competition tackling sign language to spoken language translation for Swiss German sign language. The task is very challenging due to data scarcity and an unprecedented vocabulary size of more than 20k words on the target side. Moreover, the data is taken from real broadcast news…

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: accepted for publication at WMT2022

  46. arXiv:2210.12259  [pdf, other]

    cs.CL

    Enhancing Tabular Reasoning with Pattern Exploiting Training

    Authors: Abhilash Reddy Shankarampeta, Vivek Gupta, Shuo Zhang

    Abstract: Recent methods based on pre-trained language models have exhibited superior performance over tabular tasks (e.g., tabular NLI), despite showing inherent problems such as not using the right evidence and inconsistent predictions across inputs while reasoning over the tabular data. In this work, we utilize Pattern-Exploiting Training (PET) (i.e., strategic MLM) on pre-trained language models to stre…

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: The 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing

  47. arXiv:2209.13454  [pdf]

    cs.CR cs.AI cs.CY cs.NE cs.NI

    Artificial Intelligence for Cybersecurity: Threats, Attacks and Mitigation

    Authors: Abhilash Chakraborty, Anupam Biswas, Ajoy Kumar Khan

    Abstract: With the advent of the digital era, every day-to-day task is automated due to technological advances. However, technology has yet to provide people with enough tools and safeguards. As the internet connects more-and-more devices around the globe, the question of securing the connected devices grows at an even spiral rate. Data thefts, identity thefts, fraudulent transactions, password compromises,…

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: Submitted to Springer

    MSC Class: 68T99 ACM Class: E.3; I.2

  48. Weakly Supervised Medical Image Segmentation With Soft Labels and Noise Robust Loss

    Authors: Banafshe Felfeliyan, Abhilash Hareendranathan, Gregor Kuntze, Stephanie Wichuk, Nils D. Forkert, Jacob L. Jaremko, Janet L. Ronsky

    Abstract: Recent advances in deep learning algorithms have led to significant benefits for solving many medical image analysis problems. Training deep learning models commonly requires large datasets with expert-labeled annotations. However, acquiring expert-labeled annotation is not only expensive but also is subjective, error-prone, and inter-/intra- observer variability introduces noise to labels. This i…

    Submitted 16 September, 2022; originally announced September 2022.

  49. Cornucopia: A Framework for Feedback Guided Generation of Binaries

    Authors: Vidush Singhal, Akul Abhilash Pillai, Charitha Saumya, Milind Kulkarni, Aravind Machiry

    Abstract: Binary analysis is an important capability required for many security and software engineering applications. Consequently, there are many binary analysis techniques and tools with varied capabilities. However, testing these tools requires a large, varied binary dataset with corresponding source-level information. In this paper, we present Cornucopia, an architecture agnostic automated framework th…

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: This paper has been accepted at the ASE'22 conference. [37th IEEE/ACM International Conference Automated Software Engineering 2022 (ASE)]

  50. arXiv:2209.05911  [pdf, ps, other]

    cs.CV

    Computer vision based vehicle tracking as a complementary and scalable approach to RFID tagging

    Authors: Pranav Kant Gaur, Abhilash Bhardwaj, Pritam Shete, Mohini Laghate, Dinesh M Sarode

    Abstract: Logging of incoming/outgoing vehicles serves as a piece of critical information for root-cause analysis to combat security breach incidents in various sensitive organizations. RFID tagging hampers the scalability of vehicle tracking solutions on both logistics as well as technical fronts. For instance, requiring each incoming vehicle(departmental or private) to be RFID tagged is a severe constrain…

    Submitted 13 September, 2022; originally announced September 2022.