-
Efficient Computation of Trip-based Group Nearest Neighbor Queries (Full Version)
Authors:
Shahiduz Zaman,
Tanzima Hashem,
Sukarna Barua
Abstract:
In recent years, organizing group meetups for entertainment or other necessities has gained significant importance, especially given the busy nature of daily schedules. People often combine multiple activities, such as dropping kids off at school, commuting to work, and grocery shopping, while seeking opportunities to meet others. To address this need, we propose a novel query type, the Trip-based Group Nearest Neighbor (T-GNN) query, which identifies the optimal meetup Point of Interest (POI) that aligns with users' existing trips. An individual trip consists of a sequence of locations, allowing users the flexibility to detour to the meetup POI at any location within the sequence, known as a detour location. Given a set of trips for the users, the query identifies the optimal meetup POI (e.g., restaurants or movie theaters) and detour locations from each user's trip that minimize the total trip overhead distance. The trip overhead distance refers to the additional distance a user must travel to visit the meetup POI before returning to the next location in their trip. The sum of these overhead distances for all users constitutes the total trip overhead distance. The computation time for processing T-GNN queries increases with the number of POIs. To address this, we introduce three techniques to prune the POIs that cannot contribute to the optimal solution, and thus refine the search space. We also develop an efficient approach for processing T-GNN queries in real-time. Extensive experiments validate the performance of the proposed algorithm.
Submitted 29 August, 2025;
originally announced September 2025.
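The trip overhead distance defined in the T-GNN abstract above can be illustrated with a small exhaustive baseline. This is only a sketch under stated assumptions, not the paper's algorithm: it uses straight-line (Euclidean) distance in place of whatever distance the authors use, it applies none of the three pruning techniques, and the function names `trip_overhead` and `tgnn_brute_force` are hypothetical.

```python
from math import dist  # Euclidean distance (assumption; not taken from the paper)


def trip_overhead(trip, poi):
    """Return (overhead, i): the smallest extra distance for one trip when the
    user leaves at trip[i], visits poi, and rejoins the trip at trip[i + 1]."""
    best = None
    for i in range(len(trip) - 1):
        # Extra distance = detour via the POI minus the direct leg it replaces.
        extra = (dist(trip[i], poi) + dist(poi, trip[i + 1])
                 - dist(trip[i], trip[i + 1]))
        if best is None or extra < best[0]:
            best = (extra, i)
    return best


def tgnn_brute_force(trips, pois):
    """Check every POI and keep the one with the smallest total overhead.
    Returns (best_poi, detour_indices, total_overhead)."""
    best_total, best_poi, best_detours = float("inf"), None, None
    for poi in pois:
        per_user = [trip_overhead(trip, poi) for trip in trips]
        total = sum(overhead for overhead, _ in per_user)
        if total < best_total:
            best_total = total
            best_poi = poi
            best_detours = [index for _, index in per_user]
    return best_poi, best_detours, best_total


# Two users with one-leg trips and two candidate POIs (made-up coordinates).
trips = [[(0, 0), (4, 0)], [(0, 2), (4, 2)]]
pois = [(2, 1), (10, 10)]
best_poi, detours, total = tgnn_brute_force(trips, pois)
```

The baseline visits every POI for every trip leg, which is the cost that grows with the number of POIs and that the abstract's pruning techniques are meant to reduce.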
-
QuXAI: Explainers for Hybrid Quantum Machine Learning Models
Authors:
Saikat Barua,
Mostafizur Rahman,
Shehenaz Khaled,
Md Jafor Sadek,
Rafiul Islam,
Shahnewaz Siddique
Abstract:
The emergence of hybrid quantum-classical machine learning (HQML) models opens new horizons of computational intelligence, but their fundamental complexity frequently leads to black-box behavior that undermines transparency and reliability in their application. Although XAI for quantum systems is still in its infancy, a major research gap is evident in robust global and local explainability approaches designed for HQML architectures that employ quantized feature encoding followed by classical learning. This gap is the focus of this work, which introduces QuXAI, a framework based upon Q-MEDLEY, an explainer for feature importance in these hybrid systems. Our approach entails creating HQML models that incorporate quantum feature maps, using Q-MEDLEY, which combines feature-based inferences, preserving the quantum transformation stage, and visualizing the resulting attributions. Our results show that Q-MEDLEY delineates influential classical aspects in HQML models, separates their noise, and competes well against established XAI techniques in classical validation settings. Ablation studies further expose the virtues of the composite structure used in Q-MEDLEY. The implications of this work are critically important, as it provides a route to improve the interpretability and reliability of HQML models, thus promoting greater confidence in, and safer and more responsible use of, quantum-enhanced AI technology.
Our code and experiments are open-sourced at: https://github.com/GitsSaikat/QuXAI
Submitted 12 June, 2025; v1 submitted 15 May, 2025;
originally announced May 2025.
-
LELANTE: LEveraging LLM for Automated ANdroid TEsting
Authors:
Shamit Fatin,
Mehbubul Hasan Al-Quvi,
Haz Sameen Shahgir,
Sukarna Barua,
Anindya Iqbal,
Sadia Sharmin,
Md. Mostofa Akbar,
Kallol Kumar Pal,
A. Asif Al Rashid
Abstract:
Given a natural language test case description for an Android application, existing testing approaches require developers to manually write scripts using tools such as Appium and Espresso to execute the corresponding test case. This process is labor-intensive and demands significant effort to maintain as user interfaces evolve throughout development. In this work, we introduce LELANTE, a novel framework that utilizes large language models (LLMs) to automate test case execution without requiring pre-written scripts. LELANTE interprets natural language test case descriptions, iteratively generates action plans, and performs the actions directly on the Android screen using its GUI. LELANTE employs a screen refinement process to enhance LLM interpretability, constructs a structured prompt for LLMs, and implements an action generation mechanism based on chain-of-thought reasoning of LLMs. To further reduce computational cost and enhance scalability, LELANTE utilizes model distillation using a foundational LLM. In experiments across 390 test cases spanning 10 popular Android applications, LELANTE achieved a 73% test execution success rate. Our results demonstrate that LLMs can effectively bridge the gap between natural language test case descriptions and automated execution, making mobile testing more scalable and adaptable.
Submitted 29 April, 2025;
originally announced April 2025.
-
Guardians of the Agentic System: Preventing Many Shots Jailbreak with Agentic System
Authors:
Saikat Barua,
Mostafizur Rahman,
Md Jafor Sadek,
Rafiul Islam,
Shehenaz Khaled,
Ahmedul Kabir
Abstract:
Autonomous AI agents built on large language models can create substantial value across society, but they face security threats from adversaries that warrant immediate protective solutions because trust and safety issues arise. Many-shot jailbreaking and deceptive alignment are among the main advanced attacks that cannot be mitigated by the static guardrails used during supervised training, which points to a crucial research priority for real-world robustness. Static guardrails in a dynamic multi-agent system fail to defend against those attacks. We intend to enhance security for LLM-based agents through the development of new evaluation frameworks that identify and counter threats for safe operational deployment. Our work uses three examination methods: it detects rogue agents through a Reverse Turing Test, analyzes deceptive alignment through multi-agent simulations, and develops an anti-jailbreaking system, testing it with the GEMINI 1.5 pro, llama-3.3-70B, and deepseek r1 models in tool-mediated adversarial scenarios. Detection capabilities are strong, such as 94% accuracy for GEMINI 1.5 pro, yet the system suffers persistent vulnerabilities under long attacks: increasing prompt length raises attack success rates (ASR), and diversity metrics become ineffective in prediction, while multiple complex system faults are revealed. The findings demonstrate the necessity of adopting flexible security systems based on active monitoring that can be performed by the agents themselves, together with adaptable interventions by system administrators, as current models can create vulnerabilities that lead to an unreliable and vulnerable system. In this work, we address such situations and propose a comprehensive framework to counteract these security issues.
Submitted 12 June, 2025; v1 submitted 23 February, 2025;
originally announced February 2025.
-
PyGen: A Collaborative Human-AI Approach to Python Package Creation
Authors:
Saikat Barua,
Mostafizur Rahman,
Md Jafor Sadek,
Rafiul Islam,
Shehenaz Khaled,
Md. Shohrab Hossain
Abstract:
The principles of automation and innovation serve as foundational elements for advancement in contemporary science and technology. Here, we introduce Pygen, an automation platform designed to empower researchers, technologists, and hobbyists to bring abstract ideas to life as core, usable software tools written in Python. Pygen leverages the immense power of autoregressive large language models to augment human creativity during the ideation, iteration, and innovation process. By combining state-of-the-art language models with open-source code generation technologies, Pygen has significantly reduced the manual overhead of tool development. From a user prompt, Pygen automatically generates Python packages for a complete workflow from concept to package generation and documentation. The findings of our work show that Pygen considerably enhances the researcher's productivity by enabling the creation of resilient, modular, and well-documented packages for various specialized purposes. We employ a prompt enhancement approach to distill the user's package description into an increasingly specific and actionable form. Although package generation is inherently an open-ended task, we have evaluated the generated packages and the documentation using Human Evaluation, LLM-based evaluation, and CodeBLEU, with detailed results in the results section. Furthermore, we documented our results, analyzed the limitations, and suggested strategies to alleviate them. Pygen is our vision of ethical automation, a framework that promotes inclusivity, accessibility, and collaborative development. This project marks the beginning of a large-scale effort towards creating tools where intelligent agents collaborate with humans to improve scientific and technological development substantially.
Our code and generated examples are open-sourced at [https://github.com/GitsSaikat/Pygen]
Submitted 12 June, 2025; v1 submitted 12 November, 2024;
originally announced November 2024.
-
Exploring Autonomous Agents through the Lens of Large Language Models: A Review
Authors:
Saikat Barua
Abstract:
Large Language Models (LLMs) are transforming artificial intelligence, enabling autonomous agents to perform diverse tasks across various domains. These agents, proficient in human-like text comprehension and generation, have the potential to revolutionize sectors from customer service to healthcare. However, they face challenges such as multimodality, human value alignment, hallucinations, and evaluation. Techniques like prompting, reasoning, tool utilization, and in-context learning are being explored to enhance their capabilities. Evaluation platforms like AgentBench, WebArena, and ToolLLM provide robust methods for assessing these agents in complex scenarios. These advancements are leading to the development of more resilient and capable autonomous agents, anticipated to become integral in our digital lives, assisting in tasks from email responses to disease diagnosis. The future of AI, with LLMs at the forefront, is promising.
Submitted 5 April, 2024;
originally announced April 2024.
-
KAXAI: An Integrated Environment for Knowledge Analysis and Explainable AI
Authors:
Saikat Barua,
Sifat Momen
Abstract:
In order to fully harness the potential of machine learning, it is crucial to establish a system that renders the field more accessible and less daunting for individuals who may not possess a comprehensive understanding of its intricacies. The paper describes the design of a system that integrates AutoML, XAI, and synthetic data generation to provide a great UX design for users. The system allows users to navigate and harness the power of machine learning while abstracting its complexities and providing high usability. The paper proposes two novel classifiers, Logistic Regression Forest and Support Vector Tree, for enhanced model performance, achieving 96% accuracy on a diabetes dataset and 93% on a survey dataset. The paper also introduces a model-dependent local interpreter called MEDLEY and evaluates its interpretation against LIME, Greedy, and Parzen. Additionally, the paper introduces LLM-based synthetic data generation, library-based data generation, and enhancing the original dataset with GAN. The findings on synthetic data suggest that enhancing the original dataset with GAN is the most reliable way to generate synthetic data, as evidenced by KS tests, standard deviation, and feature importance. The authors also found that GAN works best for quantitative datasets.
Submitted 30 December, 2023;
originally announced January 2024.
-
When a CBR in Hand is Better than Twins in the Bush
Authors:
Mobyen Uddin Ahmed,
Shaibal Barua,
Shahina Begum,
Mir Riyanul Islam,
Rosina O Weber
Abstract:
AI methods referred to as interpretable are often discredited as inaccurate by supporters of the existence of a trade-off between interpretability and accuracy. In many problem contexts, however, this trade-off does not hold. This paper discusses a regression problem context to predict flight take-off delays where the most accurate data regression model was trained via the XGBoost implementation of gradient boosted decision trees. While building an XGB-CBR Twin and converting the XGBoost feature importance into global weights in the CBR model, the resultant CBR model alone provides the most accurate local prediction, maintains the global importance to provide a global explanation of the model, and offers the most interpretable representation for local explanations. This resultant CBR model becomes a benchmark of accuracy and interpretability for this problem context, and hence it is used to evaluate the two additive feature attribution methods, SHAP and LIME, to explain the XGBoost regression model. The results with respect to local accuracy and feature attribution lead to potentially valuable future work.
Submitted 8 May, 2023;
originally announced May 2023.
-
Sketch2FullStack: Generating Skeleton Code of Full Stack Website and Application from Sketch using Deep Learning and Computer Vision
Authors:
Somoy Subandhu Barua,
Imam Mohammad Zulkarnain,
Abhishek Roy,
Md. Golam Rabiul Alam,
Md Zia Uddin
Abstract:
Full-stack web or app development requires a software firm or, more specifically, a team of experienced developers to contribute a large portion of their time and resources to design the website and then convert it to code. As a result, the efficiency of the development team is significantly reduced when it comes to converting UI wireframes and database schemas into an actual working system. It would save valuable resources and speed up the overall workflow if clients or developers could automate this process of converting the pre-made full-stack website design to obtain partially, if not fully, working code. In this paper, we present a novel approach for generating the skeleton code from sketched images using Deep Learning and Computer Vision approaches. The training dataset consists of first-hand sketched images of low-fidelity wireframes, database schemas, and class diagrams. The approach consists of three parts: first, detecting and extracting the front-end or UI elements from custom-made UI wireframes; second, creating individual database tables from schema designs; and lastly, creating a class file from class diagrams.
Submitted 26 November, 2022;
originally announced November 2022.
-
Mapping the Landscape of COVID-19 Crisis Visualizations
Authors:
Yixuan Zhang,
Yifan Sun,
Lace Padilla,
Sumit Barua,
Enrico Bertini,
Andrea G. Parker
Abstract:
In response to COVID-19, a vast number of visualizations have been created to communicate information to the public. Information exposure in a public health crisis can impact people's attitudes towards and responses to the crisis and risks, and ultimately the trajectory of a pandemic. As such, there is a need for work that documents, organizes, and investigates what COVID-19 visualizations have been presented to the public. We address this gap through an analysis of 668 COVID-19 visualizations. We present our findings through a conceptual framework derived from our analysis that examines who, (uses) what data, (to communicate) what messages, in what form, under what circumstances in the context of COVID-19 crisis visualizations. We provide a set of factors to be considered within each component of the framework. We conclude with directions for future crisis visualization research.
Submitted 12 January, 2021;
originally announced January 2021.
-
FCC-GAN: A Fully Connected and Convolutional Net Architecture for GANs
Authors:
Sukarna Barua,
Sarah Monazam Erfani,
James Bailey
Abstract:
Generative Adversarial Networks (GANs) are a powerful class of generative models. Despite their successes, the most appropriate choice of a GAN network architecture is still not well understood. GAN models for image synthesis have adopted a deep convolutional network architecture, which eliminates or minimizes the use of fully connected and pooling layers in favor of convolution layers in the generator and discriminator of GANs. In this paper, we demonstrate that a convolution network architecture utilizing deep fully connected layers and pooling layers can be more effective than the traditional convolution-only architecture, and we propose FCC-GAN, a fully connected and convolutional GAN architecture. Models based on our FCC-GAN architecture both learn faster than the conventional architecture and generate higher-quality samples. We demonstrate the effectiveness and stability of our approach across four popular image datasets.
Submitted 27 May, 2019; v1 submitted 7 May, 2019;
originally announced May 2019.
-
Quality Evaluation of GANs Using Cross Local Intrinsic Dimensionality
Authors:
Sukarna Barua,
Xingjun Ma,
Sarah Monazam Erfani,
Michael E. Houle,
James Bailey
Abstract:
Generative Adversarial Networks (GANs) are an elegant mechanism for data generation. However, a key challenge when using GANs is how to best measure their ability to generate realistic data. In this paper, we demonstrate that an intrinsic dimensional characterization of the data space learned by a GAN model leads to an effective evaluation metric for GAN quality. In particular, we propose a new evaluation measure, CrossLID, that assesses the local intrinsic dimensionality (LID) of real-world data with respect to neighborhoods found in GAN-generated samples. Intuitively, CrossLID measures the degree to which manifolds of two data distributions coincide with each other. In experiments on 4 benchmark image datasets, we compare our proposed measure to several state-of-the-art evaluation metrics. Our experiments show that CrossLID is strongly correlated with the progress of GAN training, is sensitive to mode collapse, is robust to small-scale noise and image transformations, and is robust to sample size. Furthermore, we show how CrossLID can be used within the GAN training process to improve generation quality.
Submitted 2 May, 2019;
originally announced May 2019.
-
A novel approach for mobility management in LTE femtocells
Authors:
Pantha Ghosal,
Shouman Barua,
Ramprasad Subramanian,
Shiqi Xing,
Kumbesan Sandrasegaran
Abstract:
LTE is an emerging wireless data communication technology to provide broadband ubiquitous Internet access. Femtocells have been included in 3GPP since Release 8 to enhance indoor network coverage and capacity. The main challenge of mobility management in a hierarchical LTE structure is to guarantee efficient handover to, from, and between femtocells. This paper focuses on different types of handover and compares the performance of different decision algorithms. Furthermore, a speed-based handover algorithm for the macro-femto scenario is proposed with simulation results.
Submitted 10 November, 2014;
originally announced November 2014.
-
Macro with Pico Cells (HetNets) System Behavior Using Well-known scheduling Algorithms
Authors:
Haider Al Kim,
Shouman Barua,
Pantha Ghosal,
Kumbesan Sandrasegaran
Abstract:
This paper demonstrates the concept of using Heterogeneous networks (HetNets) to improve the Long Term Evolution (LTE) system by introducing LTE-Advanced (LTE-A). The type of HetNets chosen for this study is Macro with Pico cells. The system performance with and without Pico cells is compared using three well-known scheduling algorithms (Proportional Fair PF, Maximum Largest Weighted Delay First MLWDF and Exponential/Proportional Fair EXP/PF). The system is judged based on throughput, Packet Loss Ratio PLR, delay and fairness. A simulation platform called LTE-Sim has been used to collect the data and produce the paper outcomes and graphs. The results show that adding Pico cells enhances the overall system performance. From the simulation outcomes, the overall system performance is as follows: throughput is doubled or tripled depending on the number of users, the PLR is almost quartered, the delay is reduced nearly tenfold (PF case) or halved (MLWDF/EXP cases), and the fairness stays closer to a value of 1. Adding Pico cells is therefore considered an efficient and cost-effective way to increase throughput and coverage and to reduce latency.
Submitted 8 November, 2014;
originally announced November 2014.