Showing 1–11 of 11 results for author: Huzaifa, M

  1. arXiv:2412.00139  [pdf, other]

    cs.CV

    EFSA: Episodic Few-Shot Adaptation for Text-to-Image Retrieval

    Authors: Muhammad Huzaifa, Yova Kementchedjhieva

    Abstract: Text-to-image retrieval is a critical task for managing diverse visual content, but common benchmarks for the task rely on small, single-domain datasets that fail to capture real-world complexity. Pre-trained vision-language models tend to perform well with easy negatives but struggle with hard negatives--visually similar yet incorrect images--especially in open-domain scenarios. To address this,…

    Submitted 12 March, 2025; v1 submitted 28 November, 2024; originally announced December 2024.

  2. arXiv:2411.13205

    cs.RO cs.CV

    An Integrated Approach to Robotic Object Grasping and Manipulation

    Authors: Owais Ahmed, M Huzaifa, M Areeb, Hamza Ali Khan

    Abstract: In response to the growing challenges of manual labor and efficiency in warehouse operations, Amazon has embarked on a significant transformation by incorporating robotics to assist with various tasks. While a substantial number of robots have been successfully deployed for tasks such as item transportation within warehouses, the complex process of object picking from shelves remains a significant…

    Submitted 21 March, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

    Comments: I am making big changes in the paper and continuing its further development with another institution

  3. arXiv:2409.04018  [pdf, other]

    cs.CV

    Towards Energy-Efficiency by Navigating the Trilemma of Energy, Latency, and Accuracy

    Authors: Boyuan Tian, Yihan Pang, Muhammad Huzaifa, Shenlong Wang, Sarita Adve

    Abstract: Extended Reality (XR) enables immersive experiences through untethered headsets but suffers from stringent battery and resource constraints. Energy-efficient design is crucial to ensure both longevity and high performance in XR devices. However, latency and accuracy are often prioritized over energy, leading to a gap in achieving energy efficiency. This paper examines scene reconstruction, a key b…

    Submitted 6 September, 2024; originally announced September 2024.

    Comments: ISMAR 2024

  4. arXiv:2407.15913  [pdf, other]

    cs.CV

    Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models

    Authors: Raza Imam, Hanan Gani, Muhammad Huzaifa, Karthik Nandakumar

    Abstract: The conventional modus operandi for adapting pre-trained vision-language models (VLMs) during test-time involves tuning learnable prompts, i.e., test-time prompt tuning. This paper introduces Test-Time Low-rank adaptation (TTL) as an alternative to prompt tuning for zero-shot generalization of large-scale VLMs. Taking inspiration from recent advancements in efficiently fine-tuning large language mod…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Main paper: 11 pages, Supplementary material: 5 pages

  5. arXiv:2403.04701  [pdf, other]

    cs.CV cs.AI

    ObjectCompose: Evaluating Resilience of Vision-Based Models on Object-to-Background Compositional Changes

    Authors: Hashmat Shadab Malik, Muhammad Huzaifa, Muzammal Naseer, Salman Khan, Fahad Shahbaz Khan

    Abstract: Given the large-scale multi-modal training of recent vision-based models and their generalization capabilities, understanding the extent of their robustness is critical for their real-world deployment. In this work, we evaluate the resilience of current vision-based models against diverse object-to-background context variations. The majority of robustness evaluation methods have introduced synthet…

    Submitted 8 October, 2024; v1 submitted 7 March, 2024; originally announced March 2024.

    Journal ref: Asian Conference on Computer Vision - 2024

  6. arXiv:2402.07059  [pdf, other]

    cs.CV

    Domain Adaptable Fine-Tune Distillation Framework For Advancing Farm Surveillance

    Authors: Raza Imam, Muhammad Huzaifa, Nabil Mansour, Shaher Bano Mirza, Fouad Lamghari

    Abstract: In this study, we propose an automated framework for camel farm monitoring, introducing two key contributions: the Unified Auto-Annotation framework and the Fine-Tune Distillation framework. The Unified Auto-Annotation approach combines two models, GroundingDINO (GD), and Segment-Anything-Model (SAM), to automatically annotate raw datasets extracted from surveillance videos. Building upon this fou…

    Submitted 10 February, 2024; originally announced February 2024.

  7. arXiv:2305.08031  [pdf, other]

    cs.CV cs.AI

    On enhancing the robustness of Vision Transformers: Defensive Diffusion

    Authors: Raza Imam, Muhammad Huzaifa, Mohammed El-Amine Azz

    Abstract: Privacy and confidentiality of medical data are of utmost importance in healthcare settings. ViTs, the SOTA vision model, rely on large amounts of patient data for training, which raises concerns about data security and the potential for unauthorized access. Adversaries may exploit vulnerabilities in ViTs to extract sensitive patient information and compromise patient privacy. This work address…

    Submitted 13 May, 2023; originally announced May 2023.

    Comments: Our code is publicly available at https://github.com/Muhammad-Huzaifaa/Defensive_Diffusion

  8. arXiv:2207.13280  [pdf, other]

    cs.RO

    On-Device CPU Scheduling for Sense-React Systems

    Authors: Aditi Partap, Samuel Grayson, Muhammad Huzaifa, Sarita Adve, Brighten Godfrey, Saurabh Gupta, Kris Hauser, Radhika Mittal

    Abstract: Sense-react systems (e.g. robotics and AR/VR) have to take highly responsive real-time actions, driven by complex decisions involving a pipeline of sensing, perception, planning, and reaction tasks. These tasks must be scheduled on resource-constrained devices such that the performance goals and the requirements of the application are met. This is a difficult scheduling problem that requires handl…

    Submitted 14 August, 2022; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: 13 pages, 13 figures. This version of the paper extends a shorter version that has been accepted at IROS'22

  9. arXiv:2201.08603  [pdf, other]

    cs.AR

    Trireme: Exploring Hierarchical Multi-Level Parallelism for Domain Specific Hardware Acceleration

    Authors: Georgios Zacharopoulos, Adel Ejjeh, Ying Jing, En-Yu Yang, Tianyu Jia, Iulian Brumar, Jeremy Intan, Muhammad Huzaifa, Sarita Adve, Vikram Adve, Gu-Yeon Wei, David Brooks

    Abstract: The design of heterogeneous systems that include domain specific accelerators is a challenging and time-consuming process. While taking into account area constraints, designers must decide which parts of an application to accelerate in hardware and which to leave in software. Moreover, applications in domains such as Extended Reality (XR) offer opportunities for various forms of parallel execution…

    Submitted 21 January, 2022; originally announced January 2022.

    Comments: 20 pages

  10. arXiv:2004.04643  [pdf, other]

    cs.DC cs.ET

    Exploring Extended Reality with ILLIXR: A New Playground for Architecture Research

    Authors: Muhammad Huzaifa, Rishi Desai, Samuel Grayson, Xutao Jiang, Ying Jing, Jae Lee, Fang Lu, Yihan Pang, Joseph Ravichandran, Finn Sinclair, Boyuan Tian, Hengzhi Yuan, Jeffrey Zhang, Sarita V. Adve

    Abstract: As we enter the era of domain-specific architectures, systems researchers must understand the requirements of emerging application domains. Augmented and virtual reality (AR/VR) or extended reality (XR) is one such important domain. This paper presents ILLIXR, the first open source end-to-end XR system (1) with state-of-the-art components, (2) integrated with a modular and extensible multithreaded…

    Submitted 2 March, 2021; v1 submitted 25 March, 2020; originally announced April 2020.

  11. arXiv:2002.10245  [pdf, other]

    cs.DC

    Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics

    Authors: Giordano Salvador, Wesley H. Darvin, Muhammad Huzaifa, Johnathan Alsop, Matthew D. Sinclair, Sarita V. Adve

    Abstract: This work provides the first study to explore the interaction of update propagation with and without fine-grained synchronization (push vs. pull), emerging coherence protocols (GPU vs. DeNovo coherence), and software-centric consistency models (DRF0, DRF1, and DRFrlx) for graph workloads on emerging integrated GPU-CPU systems with native unified shared memory. We study 6 graph applications with 6…

    Submitted 25 February, 2020; v1 submitted 19 February, 2020; originally announced February 2020.