Skip to main content

Showing 1–50 of 135 results for author: Anwar, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.22174  [pdf, ps, other

    cs.RO cs.LG

    ASVSim (AirSim for Surface Vehicles): A High-Fidelity Simulation Framework for Autonomous Surface Vehicle Research

    Authors: Bavo Lesy, Siemen Herremans, Robin Kerstens, Jan Steckel, Walter Daems, Siegfried Mercelis, Ali Anwar

    Abstract: The transport industry has recently shown significant interest in unmanned surface vehicles (USVs), specifically for port and inland waterway transport. These systems can improve operational efficiency and safety, which is especially relevant in the European Union, where initiatives such as the Green Deal are driving a shift towards increased use of inland waterways. At the same time, a shortage o… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 14 Pages, 11 Figures

  2. arXiv:2506.20302  [pdf, ps, other

    cs.CV

    TDiR: Transformer based Diffusion for Image Restoration Tasks

    Authors: Abbas Anwar, Mohammad Shullar, Ali Arshad Nasir, Mudassir Masood, Saeed Anwar

    Abstract: Images captured in challenging environments often experience various forms of degradation, including noise, color cast, blur, and light scattering. These effects significantly reduce image quality, hindering their applicability in downstream tasks such as object detection, mapping, and classification. Our transformer-based diffusion model was developed to address image restoration tasks, aiming to… ▽ More

    Submitted 25 June, 2025; originally announced June 2025.

  3. arXiv:2505.10911  [pdf, ps, other

    cs.RO

    ReWiND: Language-Guided Rewards Teach Robot Policies without New Demonstrations

    Authors: Jiahui Zhang, Yusen Luo, Abrar Anwar, Sumedh Anand Sontakke, Joseph J Lim, Jesse Thomason, Erdem Biyik, Jesse Zhang

    Abstract: We introduce ReWiND, a framework for learning robot manipulation tasks solely from language instructions without per-task demonstrations. Standard reinforcement learning (RL) and imitation learning methods require expert supervision through human-designed reward functions or demonstrations for every new task. In contrast, ReWiND starts from a small demonstration dataset to learn: (1) a data-effici… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  4. arXiv:2503.15707  [pdf, other

    cs.RO cs.AI

    Safety Aware Task Planning via Large Language Models in Robotics

    Authors: Azal Ahmad Khan, Michael Andrev, Muhammad Ali Murtaza, Sergio Aguilera, Rui Zhang, Jie Ding, Seth Hutchinson, Ali Anwar

    Abstract: The integration of large language models (LLMs) into robotic task planning has unlocked better reasoning capabilities for complex, long-horizon workflows. However, ensuring safety in LLM-driven plans remains a critical challenge, as these models often prioritize task completion over risk mitigation. This paper introduces SAFER (Safety-Aware Framework for Execution in Robotics), a multi-LLM framewo… ▽ More

    Submitted 19 March, 2025; originally announced March 2025.

  5. arXiv:2503.00323  [pdf, other

    cs.LG cs.AI cs.DC

    FLStore: Efficient Federated Learning Storage for non-training workloads

    Authors: Ahmad Faraz Khan, Samuel Fountain, Ahmed M. Abdelmoniem, Ali R. Butt, Ali Anwar

    Abstract: Federated Learning (FL) is an approach for privacy-preserving Machine Learning (ML), enabling model training across multiple clients without centralized data collection. With an aggregator server coordinating training, aggregating model updates, and storing metadata across rounds. In addition to training, a substantial part of FL systems are the non-training workloads such as scheduling, personali… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: 11 pages, 19 figures, 2 tables This paper has been accepted at the The Eighth Annual Conference on Machine Learning and Systems (MLSys 2025)

  6. arXiv:2502.20825  [pdf, other

    cs.LG cs.AI cs.DC cs.SE

    LADs: Leveraging LLMs for AI-Driven DevOps

    Authors: Ahmad Faraz Khan, Azal Ahmad Khan, Anas Mohamed, Haider Ali, Suchithra Moolinti, Sabaat Haroon, Usman Tahir, Mattia Fazzini, Ali R. Butt, Ali Anwar

    Abstract: Automating cloud configuration and deployment remains a critical challenge due to evolving infrastructures, heterogeneous hardware, and fluctuating workloads. Existing solutions lack adaptability and require extensive manual tuning, leading to inefficiencies and misconfigurations. We introduce LADs, the first LLM-driven framework designed to tackle these challenges by ensuring robustness, adaptabi… ▽ More

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 17 pages with Appendix, 8 figures, and 7 tables. This paper is currently Under Review

  7. arXiv:2502.15618  [pdf, other

    cs.CL cs.AI cs.LG

    Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing

    Authors: Qi Le, Enmao Diao, Ziyan Wang, Xinran Wang, Jie Ding, Li Yang, Ali Anwar

    Abstract: We introduce Probe Pruning (PP), a novel framework for online, dynamic, structured pruning of Large Language Models (LLMs) applied in a batch-wise manner. PP leverages the insight that not all samples and tokens contribute equally to the model's output, and probing a small portion of each batch effectively identifies crucial weights, enabling tailored dynamic pruning for different batches. It comp… ▽ More

    Submitted 21 February, 2025; originally announced February 2025.

    Comments: ICLR 2025

  8. arXiv:2502.09829  [pdf, other

    cs.RO cs.AI cs.LG

    Efficient Evaluation of Multi-Task Robot Policies With Active Experiment Selection

    Authors: Abrar Anwar, Rohan Gupta, Zain Merchant, Sayan Ghosh, Willie Neiswanger, Jesse Thomason

    Abstract: Evaluating learned robot control policies to determine their physical task-level capabilities costs experimenter time and effort. The growing number of policies and tasks exacerbates this issue. It is impractical to test every policy on every task multiple times; each trial requires a manual environment reset, and each task change involves re-arranging objects or even changing robots. Naively sele… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  9. arXiv:2501.13416  [pdf, other

    cs.LG cs.AI cs.RO

    M3PT: A Transformer for Multimodal, Multi-Party Social Signal Prediction with Person-aware Blockwise Attention

    Authors: Yiming Tang, Abrar Anwar, Jesse Thomason

    Abstract: Understanding social signals in multi-party conversations is important for human-robot interaction and artificial social intelligence. Social signals include body pose, head pose, speech, and context-specific activities like acquiring and taking bites of food when dining. Past work in multi-party interaction tends to build task-specific models for predicting social signals. In this work, we addres… ▽ More

    Submitted 2 February, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  10. arXiv:2501.02822  [pdf, other

    cs.CV cs.AI cs.RO

    RDD4D: 4D Attention-Guided Road Damage Detection And Classification

    Authors: Asma Alkalbani, Muhammad Saqib, Ahmed Salim Alrawahi, Abbas Anwar, Chandarnath Adak, Saeed Anwar

    Abstract: Road damage detection and assessment are crucial components of infrastructure maintenance. However, current methods often struggle with detecting multiple types of road damage in a single image, particularly at varying scales. This is due to the lack of road datasets with various damage types having varying scales. To overcome this deficiency, first, we present a novel dataset called Diverse Road… ▽ More

    Submitted 6 January, 2025; originally announced January 2025.

  11. arXiv:2412.01461  [pdf, other

    cs.CV

    A comprehensive review of datasets and deep learning techniques for vision in Unmanned Surface Vehicles

    Authors: Linh Trinh, Siegfried Mercelis, Ali Anwar

    Abstract: Unmanned Surface Vehicles (USVs) have emerged as a major platform in maritime operations, capable of supporting a wide range of applications. USVs can help reduce labor costs, increase safety, save energy, and allow for difficult unmanned tasks in harsh maritime environments. With the rapid development of USVs, many vision tasks such as detection and segmentation become increasingly important. Dat… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  12. arXiv:2411.04915  [pdf, other

    cs.LG cs.AI

    Evaluating Robustness of Reinforcement Learning Algorithms for Autonomous Shipping

    Authors: Bavo Lesy, Ali Anwar, Siegfried Mercelis

    Abstract: Recently, there has been growing interest in autonomous shipping due to its potential to improve maritime efficiency and safety. The use of advanced technologies, such as artificial intelligence, can address the current navigational and operational challenges in autonomous shipping. In particular, inland waterway transport (IWT) presents a unique set of challenges, such as crowded waterways and va… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

    Comments: 5 pages, 4 figures. Will be presented at IEEE RAAI 2024

  13. arXiv:2410.19198  [pdf, other

    cs.AI cs.CY cs.ET cs.HC cs.LG

    MAP: Multi-Human-Value Alignment Palette

    Authors: Xinran Wang, Qi Le, Ammar Ahmed, Enmao Diao, Yi Zhou, Nathalie Baracaldo, Jie Ding, Ali Anwar

    Abstract: Ensuring that generative AI systems align with human values is essential but challenging, especially when considering multiple human values and their potential trade-offs. Since human values can be personalized and dynamically change over time, the desirable levels of value alignment vary across different ethnic groups, industry sectors, and user cohorts. Within existing frameworks, it is hard to… ▽ More

    Submitted 24 October, 2024; originally announced October 2024.

  14. arXiv:2409.15867  [pdf, other

    cs.AI

    In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding

    Authors: Moucheng Xu, Evangelos Chatzaroulas, Luc McCutcheon, Abdul Ahad, Hamzah Azeem, Janusz Marecki, Ammar Anwar

    Abstract: A Standard Operating Procedure (SOP) defines a low-level, step-by-step written guide for a business software workflow. SOP generation is a crucial step towards automating end-to-end software workflows. Manually creating SOPs can be time-consuming. Recent advancements in large video-language models offer the potential for automating SOP generation by analyzing recordings of human demonstrations. Ho… ▽ More

    Submitted 20 October, 2024; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: To appear in NeurIPS Workshop on Video-Language Models 2024

  15. arXiv:2409.13682  [pdf, other

    cs.RO cs.AI cs.CL

    ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation

    Authors: Abrar Anwar, John Welsh, Joydeep Biswas, Soha Pouya, Yan Chang

    Abstract: Navigating and understanding complex environments over extended periods of time is a significant challenge for robots. People interacting with the robot may want to ask questions like where something happened, when it occurred, or how long ago it took place, which would require the robot to reason over a long history of their deployment. To address this problem, we introduce a Retrieval-augmented… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  16. arXiv:2409.08416  [pdf, other

    cs.ET cs.NI

    Towards Scalable Quantum Networks

    Authors: Connor Howe, Mohsin Aziz, Ali Anwar

    Abstract: This paper presents a comprehensive study on the scalability challenges and opportunities in quantum communication networks, with the goal of determining parameters that impact networks most as well as the trends that appear when scaling networks. We design simulations of quantum networks comprised of router nodes made up of trapped-ion qubits, separated by quantum repeaters in the form of Bell St… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Comments: 10 pages, 11 figures

  17. arXiv:2409.07631  [pdf, other

    cs.CR cs.DC

    HERL: Tiered Federated Learning with Adaptive Homomorphic Encryption using Reinforcement Learning

    Authors: Jiaxang Tang, Zeshan Fayyaz, Mohammad A. Salahuddin, Raouf Boutaba, Zhi-Li Zhang, Ali Anwar

    Abstract: Federated Learning is a well-researched approach for collaboratively training machine learning models across decentralized data while preserving privacy. However, integrating Homomorphic Encryption to ensure data confidentiality introduces significant computational and communication overheads, particularly in heterogeneous environments where clients have varying computational capacities and securi… ▽ More

    Submitted 11 September, 2024; originally announced September 2024.

  18. arXiv:2409.06805  [pdf, other

    cs.LG cs.AI cs.CR

    Personalized Federated Learning Techniques: Empirical Analysis

    Authors: Azal Ahmad Khan, Ahmad Faraz Khan, Haider Ali, Ali Anwar

    Abstract: Personalized Federated Learning (pFL) holds immense promise for tailoring machine learning models to individual users while preserving data privacy. However, achieving optimal performance in pFL often requires a careful balancing act between memory overhead costs and model accuracy. This paper delves into the trade-offs inherent in pFL, offering valuable insights for selecting the right algorithms… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  19. arXiv:2409.04986  [pdf, other

    cs.LG

    DynamicFL: Federated Learning with Dynamic Communication Resource Allocation

    Authors: Qi Le, Enmao Diao, Xinran Wang, Vahid Tarokh, Jie Ding, Ali Anwar

    Abstract: Federated Learning (FL) is a collaborative machine learning framework that allows multiple users to train models utilizing their local data in a distributed manner. However, considerable statistical heterogeneity in local data across devices often leads to suboptimal model performance compared with independently and identically distributed (IID) data scenarios. In this paper, we introduce DynamicF… ▽ More

    Submitted 8 September, 2024; originally announced September 2024.

  20. arXiv:2408.09556  [pdf, other

    cs.LG cs.AI

    Addressing Heterogeneity in Federated Learning: Challenges and Solutions for a Shared Production Environment

    Authors: Tatjana Legler, Vinit Hegiste, Ahmed Anwar, Martin Ruskowski

    Abstract: Federated learning (FL) has emerged as a promising approach to training machine learning models across decentralized data sources while preserving data privacy, particularly in manufacturing and shared production environments. However, the presence of data heterogeneity variations in data distribution, quality, and volume across different or clients and production sites, poses significant challeng… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  21. arXiv:2408.07704  [pdf, other

    cs.IR cs.AI cs.HC

    Empathic Responding for Digital Interpersonal Emotion Regulation via Content Recommendation

    Authors: Akriti Verma, Shama Islam, Valeh Moghaddam, Adnan Anwar, Sharon Horwood

    Abstract: Interpersonal communication plays a key role in managing people's emotions, especially on digital platforms. Studies have shown that people use social media and consume online content to regulate their emotions and find support for rest and recovery. However, these platforms are not designed for emotion regulation, which limits their effectiveness in this regard. To address this issue, we propose… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  22. arXiv:2408.04442  [pdf, other

    cs.LG cs.AI

    FedAD-Bench: A Unified Benchmark for Federated Unsupervised Anomaly Detection in Tabular Data

    Authors: Ahmed Anwar, Brian Moser, Dayananda Herurkar, Federico Raue, Vinit Hegiste, Tatjana Legler, Andreas Dengel

    Abstract: The emergence of federated learning (FL) presents a promising approach to leverage decentralized data while preserving privacy. Furthermore, the combination of FL and anomaly detection is particularly compelling because it allows for detecting rare and critical anomalies (usually also rare in locally gathered data) in sensitive data from multiple sources, such as cybersecurity and healthcare. Howe… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 8 pages, 1 figure

  23. arXiv:2407.15901  [pdf

    cs.LG cs.AI

    Enhancing Cognitive Workload Classification Using Integrated LSTM Layers and CNNs for fNIRS Data Analysis

    Authors: Mehshan Ahmed Khan, Houshyar Asadi, Mohammad Reza Chalak Qazani, Adetokunbo Arogbonlo, Siamak Pedrammehr, Adnan Anwar, Asim Bhatti, Saeid Nahavandi, Chee Peng Lim

    Abstract: Functional near-infrared spectroscopy (fNIRS) is employed as a non-invasive method to monitor functional brain activation by capturing changes in the concentrations of oxygenated haemoglobin (HbO) and deoxygenated haemo-globin (HbR). Various machine learning classification techniques have been utilized to distinguish cognitive states. However, conventional machine learning methods, although simple… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: conference

  24. arXiv:2407.15879  [pdf, other

    cs.CR cs.AI cs.DC cs.LG

    Decentralized Federated Anomaly Detection in Smart Grids: A P2P Gossip Approach

    Authors: Muhammad Akbar Husnoo, Adnan Anwar, Md Enamul Haque, A. N. Mahmood

    Abstract: The increasing security and privacy concerns in the Smart Grid sector have led to a significant demand for robust intrusion detection systems within critical smart grid infrastructure. To address the challenges posed by privacy preservation and decentralized power system zones with distinct data ownership, Federated Learning (FL) has emerged as a promising privacy-preserving solution which facilit… ▽ More

    Submitted 9 January, 2025; v1 submitted 20 July, 2024; originally announced July 2024.

  25. arXiv:2407.14418  [pdf, other

    cs.CV

    Improving classification of road surface conditions via road area extraction and contrastive learning

    Authors: Linh Trinh, Ali Anwar, Siegfried Mercelis

    Abstract: Maintaining roads is crucial to economic growth and citizen well-being because roads are a vital means of transportation. In various countries, the inspection of road surfaces is still done manually, however, to automate it, research interest is now focused on detecting the road surface defects via the visual data. While, previous research has been focused on deep learning methods which tend to pr… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 7 pages

  26. arXiv:2407.12065  [pdf, other

    cs.LG cs.AI

    Data selection method for assessment of autonomous vehicles

    Authors: Linh Trinh, Ali Anwar, Siegfried Mercelis

    Abstract: As the popularity of autonomous vehicles has grown, many standards and regulators, such as ISO, NHTSA, and Euro NCAP, require safety validation to ensure a sufficient level of safety before deploying them in the real world. Manufacturers gather a large amount of public road data for this purpose. However, the majority of these validation activities are done manually by humans. Furthermore, the dat… ▽ More

    Submitted 28 October, 2024; v1 submitted 16 July, 2024; originally announced July 2024.

    Comments: 7 pages

    Journal ref: 27th IEEE International Conference on Intelligent Transportation Systems 2024

  27. arXiv:2407.10197  [pdf, other

    cs.CV

    Multiple data sources and domain generalization learning method for road surface defect classification

    Authors: Linh Trinh, Ali Anwar, Siegfried Mercelis

    Abstract: Roads are an essential mode of transportation, and maintaining them is critical to economic growth and citizen well-being. With the continued advancement of AI, road surface inspection based on camera images has recently been extensively researched and can be performed automatically. However, because almost all of the deep learning methods for detecting road surface defects were optimized for a sp… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 6 pages

  28. arXiv:2407.08219  [pdf, other

    cs.CL cs.HC

    Generating Contextually-Relevant Navigation Instructions for Blind and Low Vision People

    Authors: Zain Merchant, Abrar Anwar, Emily Wang, Souti Chattopadhyay, Jesse Thomason

    Abstract: Navigating unfamiliar environments presents significant challenges for blind and low-vision (BLV) individuals. In this work, we construct a dataset of images and goals across different scenarios such as searching through kitchens or navigating outdoors. We then investigate how grounded instruction generation methods can provide contextually-relevant navigational guidance to users in these instance… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted as RO-MAN 2024 Late Breaking Report

  29. arXiv:2406.13636  [pdf, other

    cs.RO cs.LG

    Contrast Sets for Evaluating Language-Guided Robot Policies

    Authors: Abrar Anwar, Rohan Gupta, Jesse Thomason

    Abstract: Robot evaluations in language-guided, real world settings are time-consuming and often sample only a small space of potential instructions across complex scenes. In this work, we introduce contrast sets for robotics as an approach to make small, but specific, perturbations to otherwise independent, identically distributed (i.i.d.) test instances. We investigate the relationship between experimente… ▽ More

    Submitted 25 October, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepted to CoRL 2024

  30. arXiv:2406.12036  [pdf, other

    cs.CL cs.AI

    MedCalc-Bench: Evaluating Large Language Models for Medical Calculations

    Authors: Nikhil Khandekar, Qiao Jin, Guangzhi Xiong, Soren Dunn, Serina S Applebaum, Zain Anwar, Maame Sarfo-Gyamfi, Conrad W Safranek, Abid A Anwar, Andrew Zhang, Aidan Gilson, Maxwell B Singer, Amisha Dave, Andrew Taylor, Aidong Zhang, Qingyu Chen, Zhiyong Lu

    Abstract: As opposed to evaluating computation and logic-based reasoning, current benchmarks for evaluating large language models (LLMs) in medicine are primarily focused on question-answering involving domain knowledge and descriptive reasoning. While such qualitative capabilities are vital to medical diagnosis, in real-world scenarios, doctors frequently use clinical calculators that follow quantitative e… ▽ More

    Submitted 30 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: Github link: https://github.com/ncbi-nlp/MedCalc-Bench HuggingFace link: https://huggingface.co/datasets/nsk7153/MedCalc-Bench

  31. arXiv:2406.09976  [pdf, other

    cs.LG cs.AI

    Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model

    Authors: Siemen Herremans, Ali Anwar, Siegfried Mercelis

    Abstract: Reinforcement learning has demonstrated impressive performance in various challenging problems such as robotics, board games, and classical arcade games. However, its real-world applications can be hindered by the absence of robustness and safety in the learned policies. More specifically, an RL agent that trains in a certain Markov decision process (MDP) often struggles to perform well in nearly… ▽ More

    Submitted 1 July, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Will be presented at the RL Safety Workshop at RLC 2024

  32. arXiv:2404.17670  [pdf, other

    eess.IV cs.AI cs.CV cs.ET cs.LG

    Federated Learning for Blind Image Super-Resolution

    Authors: Brian B. Moser, Ahmed Anwar, Federico Raue, Stanislav Frolov, Andreas Dengel

    Abstract: Traditional blind image SR methods need to model real-world degradations precisely. Consequently, current research struggles with this dilemma by assuming idealized degradations, which leads to limited applicability to actual user data. Moreover, the ideal scenario - training models on data from the targeted user base - presents significant privacy concerns. To address both challenges, we propose… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  33. arXiv:2404.15369  [pdf, other

    q-bio.NC cs.AI cs.CY

    Can a Machine be Conscious? Towards Universal Criteria for Machine Consciousness

    Authors: Nur Aizaan Anwar, Cosmin Badea

    Abstract: As artificially intelligent systems become more anthropomorphic and pervasive, and their potential impact on humanity more urgent, discussions about the possibility of machine consciousness have significantly intensified, and it is sometimes seen as 'the holy grail'. Many concerns have been voiced about the ramifications of creating an artificial conscious entity. This is compounded by a marked la… ▽ More

    Submitted 30 April, 2024; v1 submitted 19 April, 2024; originally announced April 2024.

    Comments: This work was supported by the UKRI CDT in AI for Healthcare, http://ai4health.io (Grant No. EP/S023283/1)

  34. arXiv:2404.14933  [pdf, other

    cs.LG cs.AI

    Fin-Fed-OD: Federated Outlier Detection on Financial Tabular Data

    Authors: Dayananda Herurkar, Sebastian Palacio, Ahmed Anwar, Joern Hees, Andreas Dengel

    Abstract: Anomaly detection in real-world scenarios poses challenges due to dynamic and often unknown anomaly distributions, requiring robust methods that operate under an open-world assumption. This challenge is exacerbated in practical settings, where models are employed by private organizations, precluding data sharing due to privacy and competitive concerns. Despite potential benefits, the sharing of an… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  35. arXiv:2404.13844  [pdf, other

    cs.LG cs.AI

    ColA: Collaborative Adaptation with Gradient Learning

    Authors: Enmao Diao, Qi Le, Suya Wu, Xinran Wang, Ali Anwar, Jie Ding, Vahid Tarokh

    Abstract: A primary function of back-propagation is to compute both the gradient of hidden representations and parameters for optimization with gradient descent. Training large models requires high computational costs due to their vast parameter sizes. While Parameter-Efficient Fine-Tuning (PEFT) methods aim to train smaller auxiliary models to save computational space, they still present computational over… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  36. arXiv:2404.08755  [pdf, other

    cs.LG cs.AI cs.CV cs.HC

    Training a Vision Language Model as Smartphone Assistant

    Authors: Nicolai Dorka, Janusz Marecki, Ammar Anwar

    Abstract: Addressing the challenge of a digital assistant capable of executing a wide array of user tasks, our research focuses on the realm of instruction-based mobile device control. We leverage recent advancements in large language models (LLMs) and present a visual language model (VLM) that can fulfill diverse tasks on mobile devices. Our model functions by interacting solely with the user interface (UI… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: ICLR 2024 workshop on Generative Models for Decision Making

  37. arXiv:2403.15370  [pdf, other

    cs.CV cs.LG cs.RO

    Augmented Reality based Simulated Data (ARSim) with multi-view consistency for AV perception networks

    Authors: Aqeel Anwar, Tae Eun Choe, Zian Wang, Sanja Fidler, Minwoo Park

    Abstract: Detecting a diverse range of objects under various driving scenarios is essential for the effectiveness of autonomous driving systems. However, the real-world data collected often lacks the necessary diversity presenting a long-tail distribution. Although synthetic data has been utilized to overcome this issue by generating virtual scenes, it faces hurdles such as a significant domain gap and the… ▽ More

    Submitted 22 March, 2024; originally announced March 2024.

    Comments: 17 pages, 15 figures, 7 tables

  38. arXiv:2403.04067  [pdf, other

    cs.RO

    Feel the Bite: Robot-Assisted Inside-Mouth Bite Transfer using Robust Mouth Perception and Physical Interaction-Aware Control

    Authors: Rajat Kumar Jenamani, Daniel Stabile, Ziang Liu, Abrar Anwar, Katherine Dimitropoulou, Tapomayukh Bhattacharjee

    Abstract: Robot-assisted feeding can greatly enhance the lives of those with mobility limitations. Modern feeding systems can pick up and position food in front of a care recipient's mouth for a bite. However, many with severe mobility constraints cannot lean forward and need direct inside-mouth food placement. This demands precision, especially for those with restricted mouth openings, and appropriately re… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

    Comments: HRI 2024

  39. arXiv:2403.02694  [pdf, other

    cs.LG cs.AI cs.CL cs.CR cs.DC

    MeanCache: User-Centric Semantic Caching for LLM Web Services

    Authors: Waris Gill, Mohamed Elidrisi, Pallavi Kalapatapu, Ammar Ahmed, Ali Anwar, Muhammad Ali Gulzar

    Abstract: Large Language Models (LLMs) like ChatGPT and Llama have revolutionized natural language processing and search engine dynamics. However, these models incur exceptionally high computational costs. For instance, GPT-3 consists of 175 billion parameters, where inference demands billions of floating-point operations. Caching is a natural solution to reduce LLM inference costs on repeated queries, whic… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: Accepted at 2025 IEEE 39th International Parallel and Distributed Processing Symposium (IPDPS)

    ACM Class: I.2.7

  40. arXiv:2402.13429  [pdf, ps, other

    cs.DB cs.LG cs.OS

    Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask

    Authors: Zhaoyuan Su, Ammar Ahmed, Zirui Wang, Ali Anwar, Yue Cheng

    Abstract: As the number of pre-trained machine learning (ML) models is growing exponentially, data reduction tools are not catching up. Existing data reduction techniques are not specifically designed for pre-trained model (PTM) dataset files. This is largely due to a lack of understanding of the patterns and characteristics of these datasets, especially those relevant to data reduction and compressibility.… ▽ More

    Submitted 20 February, 2024; originally announced February 2024.

    Comments: This paper presents the first, exhaustive analysis to date of PTM datasets on storage compressibility. Motivated by our findings, we design ELF, a simple yet effective, error-bounded, lossy floating-point compression method

    ACM Class: H.2.7

  41. arXiv:2402.06023  [pdf, other

    cs.LG cs.AI cs.GT

    Decision Theory-Guided Deep Reinforcement Learning for Fast Learning

    Authors: Zelin Wan, Jin-Hee Cho, Mu Zhu, Ahmed H. Anwar, Charles Kamhoua, Munindar P. Singh

    Abstract: This paper introduces a novel approach, Decision Theory-guided Deep Reinforcement Learning (DT-guided DRL), to address the inherent cold start problem in DRL. By integrating decision theory principles, DT-guided DRL enhances agents' initial performance and robustness in complex environments, enabling more efficient and reliable convergence during learning. Our investigation encompasses two primary… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  42. arXiv:2312.13632  [pdf, other

    cs.LG cs.AI cs.CV cs.DC cs.SE

    TraceFL: Interpretability-Driven Debugging in Federated Learning via Neuron Provenance

    Authors: Waris Gill, Ali Anwar, Muhammad Ali Gulzar

    Abstract: In Federated Learning, clients train models on local data and send updates to a central server, which aggregates them into a global model using a fusion algorithm. This collaborative yet privacy-preserving training comes at a cost. FL developers face significant challenges in attributing global model predictions to specific clients. Localizing responsible clients is a crucial step towards (a) excl… ▽ More

    Submitted 17 January, 2025; v1 submitted 21 December, 2023; originally announced December 2023.

    Comments: Accepted at 2025 IEEE/ACM 47th International Conference on Software Engineering (ICSE)

  43. Autonomous Port Navigation With Ranging Sensors Using Model-Based Reinforcement Learning

    Authors: Siemen Herremans, Ali Anwar, Arne Troch, Ian Ravijts, Maarten Vangeneugden, Siegfried Mercelis, Peter Hellinckx

    Abstract: Autonomous shipping has recently gained much interest in the research community. However, little research focuses on inland - and port navigation, even though this is identified by countries such as Belgium and the Netherlands as an essential step towards a sustainable future. These environments pose unique challenges, since they can contain dynamic obstacles that do not broadcast their location,… ▽ More

    Submitted 17 November, 2023; originally announced December 2023.

    Comments: Presented at 42nd International Conference on Ocean, Offshore & Arctic Engineering. June 11 - 16, 2023. Melbourne, Australia

    Journal ref: Proceedings of the ASME 2023 42nd International Conference on Ocean, Offshore and Arctic Engineering. Volume 5: Ocean Engineering. Melbourne, Australia. June 11-16, 2023. V005T06A072. ASME

  44. arXiv:2311.11580  [pdf, other

    cs.CV

    SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles

    Authors: Linh Trinh, Ali Anwar, Siegfried Mercelis

    Abstract: Recently, there has been an upsurge in the research on maritime vision, where a lot of works are influenced by the application of computer vision for Unmanned Surface Vehicles (USVs). Various sensor modalities such as camera, radar, and lidar have been used to perform tasks such as object detection, segmentation, object tracking, and motion planning. A large subset of this research is focused on t… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: WACV 2024 conference

  45. Safety Aware Autonomous Path Planning Using Model Predictive Reinforcement Learning for Inland Waterways

    Authors: Astrid Vanneste, Simon Vanneste, Olivier Vasseur, Robin Janssens, Mattias Billast, Ali Anwar, Kevin Mets, Tom De Schepper, Siegfried Mercelis, Peter Hellinckx

    Abstract: In recent years, interest in autonomous shipping in urban waterways has increased significantly due to the trend of keeping cars and trucks out of city centers. Classical approaches such as Frenet frame based planning and potential field navigation often require tuning of many configuration parameters and sometimes even require a different configuration depending on the situation. In this paper, w… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Comments: \c{opyright} 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  46. arXiv:2311.06694  [pdf, other

    cs.CL cs.AI cs.CV cs.RO

    Which One? Leveraging Context Between Objects and Multiple Views for Language Grounding

    Authors: Chancharik Mitra, Abrar Anwar, Rodolfo Corona, Dan Klein, Trevor Darrell, Jesse Thomason

    Abstract: When connecting objects and their language referents in an embodied 3D environment, it is important to note that: (1) an object can be better characterized by leveraging comparative information between itself and other objects, and (2) an object's appearance can vary with camera position. As such, we present the Multi-view Approach to Grounding in Context (MAGiC), which selects an object referent… ▽ More

    Submitted 6 April, 2024; v1 submitted 11 November, 2023; originally announced November 2023.

    Journal ref: North American Chapter of the Association for Computational Linguistics (NAACL), 2024

  47. arXiv:2308.11817  [pdf, other

    cs.GT

    Honeypot Allocation for Cyber Deception in Dynamic Tactical Networks: A Game Theoretic Approach

    Authors: Md Abu Sayed, Ahmed H. Anwar, Christopher Kiekintveld, Charles Kamhoua

    Abstract: Honeypots play a crucial role in implementing various cyber deception techniques as they possess the capability to divert attackers away from valuable assets. Careful strategic placement of honeypots in networks should consider not only network aspects but also attackers' preferences. The allocation of honeypots in tactical networks under network mobility is of great interest. To achieve this obje… ▽ More

    Submitted 18 September, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: This paper accepted in 14th International Conference on Decision and Game Theory for Security, GameSec 2023

  48. Blockchain-Based and Fuzzy Logic-Enabled False Data Discovery for the Intelligent Autonomous Vehicular System

    Authors: Ziaur Rahman, Xun Yi, Ibrahim Khalil, Adnan Anwar, Shantanu Pal

    Abstract: Since the beginning of this decade, several incidents report that false data injection attacks targeting intelligent connected vehicles cause huge industrial damage and loss of lives. Data Theft, Flooding, Fuzzing, Hijacking, Malware Spoofing and Advanced Persistent Threats have been immensely growing attack that leads to end-user conflict by abolishing trust on autonomous vehicle. Looking after t… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 11 pages, 11 figures, 4 tables AsiaCCS conference 2023

    MSC Class: 11T71; 68T05 ACM Class: E.3.1; I.2.1

    Journal ref: ACM Symposium on Information, Computer and Communications Security (ASIA CCS 2023)

  49. arXiv:2307.13187  [pdf

    cs.HC cs.AI

    Digital Emotion Regulation on Social Media

    Authors: Akriti Verma, Shama Islam, Valeh Moghaddam, Adnan Anwar

    Abstract: Emotion regulation is the process of consciously altering one's affective state, that is the underlying emotional state such as happiness, confidence, guilt, anger etc. The ability to effectively regulate emotions is necessary for functioning efficiently in everyday life. Today, the pervasiveness of digital technology is being purposefully employed to modify our affective states, a process known a… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

  50. Cyber Deception against Zero-day Attacks: A Game Theoretic Approach

    Authors: Md Abu Sayed, Ahmed H. Anwar, Christopher Kiekintveld, Branislav Bosansky, Charles Kamhoua

    Abstract: Reconnaissance activities precedent other attack steps in the cyber kill chain. Zero-day attacks exploit unknown vulnerabilities and give attackers the upper hand against conventional defenses. Honeypots have been used to deceive attackers by misrepresenting the true state of the network. Existing work on cyber deception does not model zero-day attacks. In this paper, we address the question of "H… ▽ More

    Submitted 25 July, 2023; v1 submitted 24 July, 2023; originally announced July 2023.

    Comments: 20 pages, 1 citation

    Journal ref: International Conference on Decision and Game Theory for Security, pp. 44-63. Cham: Springer International Publishing, 2022