Skip to main content

Showing 1–20 of 20 results for author: Kompella, R R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.07778  [pdf, ps, other

    cs.CV

    Language-Vision Planner and Executor for Text-to-Visual Reasoning

    Authors: Yichang Xu, Gaowen Liu, Ramana Rao Kompella, Sihao Hu, Tiansheng Huang, Fatih Ilhan, Selim Furkan Tekin, Zachary Yahn, Ling Liu

    Abstract: The advancement in large language models (LLMs) and large vision models has fueled the rapid progress in multi-modal visual-text reasoning capabilities. However, existing vision-language models (VLMs) to date suffer from generalization performance. Inspired by recent development in LLMs for visual reasoning, this paper presents VLAgent, an AI system that can create a step-by-step visual reasoning… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  2. arXiv:2506.03117  [pdf, ps, other

    cs.CV

    Targeted Forgetting of Image Subgroups in CLIP Models

    Authors: Zeliang Zhang, Gaowen Liu, Charles Fleming, Ramana Rao Kompella, Chenliang Xu

    Abstract: Foundation models (FMs) such as CLIP have demonstrated impressive zero-shot performance across various tasks by leveraging large-scale, unsupervised pre-training. However, they often inherit harmful or unwanted knowledge from noisy internet-sourced datasets, compromising their reliability in real-world applications. Existing model unlearning methods either rely on access to pre-trained datasets or… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 12 Figures,5 Pages. The project page is \url{https://zhangaipi.github.io/forget_clip/}

  3. arXiv:2505.20573  [pdf, ps, other

    cs.RO cs.AI

    Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners

    Authors: Jiabao Ji, Yongchao Chen, Yang Zhang, Ramana Rao Kompella, Chuchu Fan, Gaowen Liu, Shiyu Chang

    Abstract: Large language models (LLMs) have demonstrated strong performance in various robot control tasks. However, their deployment in real-world applications remains constrained. Even state-ofthe-art LLMs, such as GPT-o4mini, frequently produce invalid action plans that violate physical constraints, such as directing a robot to an unreachable location or causing collisions between robots. This issue prim… ▽ More

    Submitted 3 June, 2025; v1 submitted 26 May, 2025; originally announced May 2025.

  4. arXiv:2503.13224  [pdf, other

    cs.CR cs.LG

    ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction

    Authors: Tong Zhou, Shijin Duan, Gaowen Liu, Charles Fleming, Ramana Rao Kompella, Shaolei Ren, Xiaolin Xu

    Abstract: Pre-trained models are valuable intellectual property, capturing both domain-specific and domain-invariant features within their weight spaces. However, model extraction attacks threaten these assets by enabling unauthorized source-domain inference and facilitating cross-domain transfer via the exploitation of domain-invariant features. In this work, we introduce **ProDiF**, a novel framework that… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: Accepted at the ICLR Workshop on Neural Network Weights as a New Data Modality 2025

  5. arXiv:2502.14075  [pdf, other

    cs.LG

    Towards Vector Optimization on Low-Dimensional Vector Symbolic Architecture

    Authors: Shijin Duan, Yejia Liu, Gaowen Liu, Ramana Rao Kompella, Shaolei Ren, Xiaolin Xu

    Abstract: Vector Symbolic Architecture (VSA) is emerging in machine learning due to its efficiency, but they are hindered by issues of hyperdimensionality and accuracy. As a promising mitigation, the Low-Dimensional Computing (LDC) method significantly reduces the vector dimension by ~100 times while maintaining accuracy, by employing a gradient-based optimization. Despite its potential, LDC optimization fo… ▽ More

    Submitted 15 March, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: 10 pages, 2 figures. Accepted in CPAL 2025

  6. arXiv:2410.13964  [pdf, ps, other

    cs.LG

    Sparse Mixture-of-Experts for Compositional Generalization: Empirical Evidence and Theoretical Foundations of Optimal Sparsity

    Authors: Jinze Zhao, Peihao Wang, Junjie Yang, Ruisi Cai, Gaowen Liu, Jayanth Srinivasa, Ramana Rao Kompella, Yingbin Liang, Zhangyang Wang

    Abstract: Sparse Mixture-of-Experts (SMoE) architectures have gained prominence for their ability to scale neural networks, particularly transformers, without a proportional increase in computational cost. Despite their success, their role in compositional generalization, i.e., adapting to novel combinations of known components, remains under-explored. This study challenges the assumption that minimal exper… ▽ More

    Submitted 14 June, 2025; v1 submitted 17 October, 2024; originally announced October 2024.

    Comments: 23 pages

  7. arXiv:2409.00340  [pdf, other

    cs.CR cs.CV

    LightPure: Realtime Adversarial Image Purification for Mobile Devices Using Diffusion Models

    Authors: Hossein Khalili, Seongbin Park, Vincent Li, Brandan Bright, Ali Payani, Ramana Rao Kompella, Nader Sehatbakhsh

    Abstract: Autonomous mobile systems increasingly rely on deep neural networks for perception and decision-making. While effective, these systems are vulnerable to adversarial machine learning attacks where minor input perturbations can significantly impact outcomes. Common countermeasures involve adversarial training and/or data or network transformation. These methods, though effective, require full access… ▽ More

    Submitted 30 August, 2024; originally announced September 2024.

  8. arXiv:2407.08980  [pdf, other

    cs.DC

    Enabling Elastic Model Serving with MultiWorld

    Authors: Myungjin Lee, Akshay Jajoo, Ramana Rao Kompella

    Abstract: Machine learning models have been exponentially growing in terms of their parameter size over the past few years. We are now seeing the rise of trillion-parameter models. The large models cannot fit into a single GPU and thus require partitioned deployment across GPUs and even hosts. A high-performance collective communication library (CCL) such as NCCL is essential to fully utilize expensive GPU… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  9. arXiv:2406.08607  [pdf, other

    cs.CL cs.AI

    Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference

    Authors: Jiabao Ji, Yujian Liu, Yang Zhang, Gaowen Liu, Ramana Rao Kompella, Sijia Liu, Shiyu Chang

    Abstract: As Large Language Models (LLMs) demonstrate extensive capability in learning from documents, LLM unlearning becomes an increasingly important research area to address concerns of LLMs in terms of privacy, copyright, etc. A conventional LLM unlearning task typically involves two goals: (1) The target LLM should forget the knowledge in the specified forget documents, and (2) it should retain the oth… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 21 pages, 11 figures

  10. arXiv:2405.14136  [pdf, other

    cs.CV

    Efficient Multitask Dense Predictor via Binarization

    Authors: Yuzhang Shang, Dan Xu, Gaowen Liu, Ramana Rao Kompella, Yan Yan

    Abstract: Multi-task learning for dense prediction has emerged as a pivotal area in computer vision, enabling simultaneous processing of diverse yet interrelated pixel-wise prediction tasks. However, the substantial computational demands of state-of-the-art (SoTA) models often limit their widespread deployment. This paper addresses this challenge by introducing network binarization to compress resource-inte… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: Accepted to CVPR'2024

  11. arXiv:2404.02039  [pdf, other

    cs.AI

    A Survey on Large Language Model-Based Game Agents

    Authors: Sihao Hu, Tiansheng Huang, Gaowen Liu, Ramana Rao Kompella, Fatih Ilhan, Selim Furkan Tekin, Yichang Xu, Zachary Yahn, Ling Liu

    Abstract: The development of game agents holds a critical role in advancing towards Artificial General Intelligence. The progress of Large Language Models (LLMs) offers an unprecedented opportunity to evolve and empower game agents with human-like decision-making capabilities in complex computer game environments. This paper provides a comprehensive overview of LLM-based game agents from a holistic viewpoin… ▽ More

    Submitted 30 March, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

  12. arXiv:2403.17287  [pdf, other

    cs.LG cs.DC

    Not All Federated Learning Algorithms Are Created Equal: A Performance Evaluation Study

    Authors: Gustav A. Baumgart, Jaemin Shin, Ali Payani, Myungjin Lee, Ramana Rao Kompella

    Abstract: Federated Learning (FL) emerged as a practical approach to training a model from decentralized data. The proliferation of FL led to the development of numerous FL algorithms and mechanisms. Many prior efforts have given their primary focus on accuracy of those approaches, but there exists little understanding of other aspects such as computational overheads, performance and training stability, etc… ▽ More

    Submitted 25 March, 2024; originally announced March 2024.

  13. arXiv:2403.11697  [pdf, other

    cs.CV

    Urban Scene Diffusion through Semantic Occupancy Map

    Authors: Junge Zhang, Qihang Zhang, Li Zhang, Ramana Rao Kompella, Gaowen Liu, Bolei Zhou

    Abstract: Generating unbounded 3D scenes is crucial for large-scale scene understanding and simulation. Urban scenes, unlike natural landscapes, consist of various complex man-made objects and structures such as roads, traffic signs, vehicles, and buildings. To create a realistic and detailed urban scene, it is crucial to accurately represent the geometry and semantics of the underlying objects, going beyon… ▽ More

    Submitted 19 March, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: The project website is https://metadriverse.github.io/urbandiff/

  14. arXiv:2402.11846  [pdf, other

    cs.CV

    UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion Models

    Authors: Yihua Zhang, Chongyu Fan, Yimeng Zhang, Yuguang Yao, Jinghan Jia, Jiancheng Liu, Gaoyuan Zhang, Gaowen Liu, Ramana Rao Kompella, Xiaoming Liu, Sijia Liu

    Abstract: The technological advancements in diffusion models (DMs) have demonstrated unprecedented capabilities in text-to-image generation and are widely used in diverse applications. However, they have also raised significant societal concerns, such as the generation of harmful content and copyright disputes. Machine unlearning (MU) has emerged as a promising solution, capable of removing undesired genera… ▽ More

    Submitted 29 October, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024 Dataset & Benchmark Track

  15. arXiv:2312.13119  [pdf, other

    cs.CR cs.CL cs.LG

    Graphene: Infrastructure Security Posture Analysis with AI-generated Attack Graphs

    Authors: Xin Jin, Charalampos Katsis, Fan Sang, Jiahao Sun, Elisa Bertino, Ramana Rao Kompella, Ashish Kundu

    Abstract: The rampant occurrence of cybersecurity breaches imposes substantial limitations on the progress of network infrastructures, leading to compromised data, financial losses, potential harm to individuals, and disruptions in essential services. The current security landscape demands the urgent development of a holistic security assessment solution that encompasses vulnerability analysis and investiga… ▽ More

    Submitted 30 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  16. arXiv:2311.02373  [pdf, other

    cs.LG

    From Trojan Horses to Castle Walls: Unveiling Bilateral Data Poisoning Effects in Diffusion Models

    Authors: Zhuoshi Pan, Yuguang Yao, Gaowen Liu, Bingquan Shen, H. Vicky Zhao, Ramana Rao Kompella, Sijia Liu

    Abstract: While state-of-the-art diffusion models (DMs) excel in image generation, concerns regarding their security persist. Earlier research highlighted DMs' vulnerability to data poisoning attacks, but these studies placed stricter requirements than conventional methods like `BadNets' in image classification. This is because the art necessitates modifications to the diffusion training and sampling proced… ▽ More

    Submitted 15 June, 2024; v1 submitted 4 November, 2023; originally announced November 2023.

    Comments: 9 pages, 5 figures, 4 tables

  17. arXiv:2305.11288  [pdf, other

    cs.LG

    Riemannian Multinomial Logistics Regression for SPD Neural Networks

    Authors: Ziheng Chen, Yue Song, Gaowen Liu, Ramana Rao Kompella, Xiaojun Wu, Nicu Sebe

    Abstract: Deep neural networks for learning Symmetric Positive Definite (SPD) matrices are gaining increasing attention in machine learning. Despite the significant progress, most existing SPD networks use traditional Euclidean classifiers on an approximated space rather than intrinsic classifiers that accurately capture the geometry of SPD manifolds. Inspired by Hyperbolic Neural Networks (HNNs), we propos… ▽ More

    Submitted 20 March, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Accepted to CVPR 2024

  18. arXiv:2305.05118  [pdf, other

    cs.LG cs.DC

    Flame: Simplifying Topology Extension in Federated Learning

    Authors: Harshit Daga, Jaemin Shin, Dhruv Garg, Ada Gavrilovska, Myungjin Lee, Ramana Rao Kompella

    Abstract: Distributed machine learning approaches, including a broad class of federated learning (FL) techniques, present a number of benefits when deploying machine learning applications over widely distributed infrastructures. The benefits are highly dependent on the details of the underlying machine learning topology, which specifies the functionality executed by the participating nodes, their dependenci… ▽ More

    Submitted 17 January, 2024; v1 submitted 8 May, 2023; originally announced May 2023.

  19. arXiv:2304.02806  [pdf, other

    cs.LG

    Graph Mixture of Experts: Learning on Large-Scale Graphs with Explicit Diversity Modeling

    Authors: Haotao Wang, Ziyu Jiang, Yuning You, Yan Han, Gaowen Liu, Jayanth Srinivasa, Ramana Rao Kompella, Zhangyang Wang

    Abstract: Graph neural networks (GNNs) have found extensive applications in learning from graph data. However, real-world graphs often possess diverse structures and comprise nodes and edges of varying types. To bolster the generalization capacity of GNNs, it has become customary to augment training graph structures through techniques like graph augmentations and large-scale pre-training on a wider array of… ▽ More

    Submitted 17 October, 2023; v1 submitted 5 April, 2023; originally announced April 2023.

    Comments: NeurIPS 2023

  20. arXiv:1712.08129  [pdf, other

    cs.NI

    Fault Localization in Large-Scale Network Policy Deployment

    Authors: Praveen Tammana, Chandra Nagarajan, Pavan Mamillapalli, Ramana Rao Kompella, Myungjin Lee

    Abstract: The recent advances in network management automation and Software-Defined Networking (SDN) are easing network policy management tasks. At the same time, these new technologies create a new mode of failure in the management cycle itself. Network policies are presented in an abstract model at a centralized controller and deployed as low-level rules across network devices. Thus, any software and hard… ▽ More

    Submitted 21 December, 2017; originally announced December 2017.

    Comments: 10 pages, 10 figures, IEEE format, Conference, SDN, Network Policy