Showing 1–50 of 57 results for author: Javed, S

Searching in archive cs.
  1. arXiv:2504.18856  [pdf, other]

    cs.CV

    Multi-Resolution Pathology-Language Pre-training Model with Text-Guided Visual Representation

    Authors: Shahad Albastaki, Anabia Sohail, Iyyakutti Iyappan Ganapathi, Basit Alawode, Asim Khan, Sajid Javed, Naoufel Werghi, Mohammed Bennamoun, Arif Mahmood

    Abstract: In Computational Pathology (CPath), the introduction of Vision-Language Models (VLMs) has opened new avenues for research, focusing primarily on aligning image-text pairs at a single magnification level. However, this approach might not be sufficient for tasks like cancer subtype classification, tissue phenotyping, and survival analysis due to the limited level of detail that a single-resolution i…

    Submitted 26 April, 2025; originally announced April 2025.

  2. arXiv:2504.11482  [pdf, other]

    cs.CV cs.AI cs.PF cs.RO eess.IV

    snnTrans-DHZ: A Lightweight Spiking Neural Network Architecture for Underwater Image Dehazing

    Authors: Vidya Sudevan, Fakhreddine Zayer, Rizwana Kausar, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias

    Abstract: Underwater image dehazing is critical for vision-based marine operations because light scattering and absorption can severely reduce visibility. This paper introduces snnTrans-DHZ, a lightweight Spiking Neural Network (SNN) specifically designed for underwater dehazing. By leveraging the temporal dynamics of SNNs, snnTrans-DHZ efficiently processes time-dependent raw image sequences while maintain…

    Submitted 13 April, 2025; originally announced April 2025.

  3. arXiv:2503.20485  [pdf, other]

    eess.IV cs.AI cs.PF

    Underwater Image Enhancement by Convolutional Spiking Neural Networks

    Authors: Vidya Sudevan, Fakhreddine Zayer, Rizwana Kausar, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias

    Abstract: Underwater image enhancement (UIE) is fundamental for marine applications, including autonomous vision-based navigation. Deep learning methods using convolutional neural networks (CNN) and vision transformers advanced UIE performance. Recently, spiking neural networks (SNN) have gained attention for their lightweight design, energy efficiency, and scalability. This paper introduces UIE-SNN, the fi…

    Submitted 26 March, 2025; originally announced March 2025.

  4. arXiv:2503.05762  [pdf]

    cs.CY

    Driving Education Advancements of Novice Drivers: A Systematic Literature Review

    Authors: Anannya Ghosh Tusti, Anandi K Dutta, Syed Aaqib Javed, Subasish Das

    Abstract: Most novice drivers are teenagers since many individuals begin their driving journey during adolescence. Novice driver crashes remain a leading cause of death among adolescents, underscoring the necessity for effective education and training programs to improve safety. This systematic review examines advancements in teen driver education from 2000 to 2024, emphasizing the effectiveness of various…

    Submitted 23 February, 2025; originally announced March 2025.

    Comments: 31 pages, 5 figures

  5. arXiv:2502.01785  [pdf, other]

    cs.CV cs.AI

    AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis

    Authors: Basit Alawode, Iyyakutti Iyappan Ganapathi, Sajid Javed, Naoufel Werghi, Mohammed Bennamoun, Arif Mahmood

    Abstract: The preservation of aquatic biodiversity is critical in mitigating the effects of climate change. Aquatic scene understanding plays a pivotal role in aiding marine scientists in their decision-making processes. In this paper, we introduce AquaticCLIP, a novel contrastive language-image pre-training model tailored for aquatic scene understanding. AquaticCLIP presents a new unsupervised learning fra…

    Submitted 3 February, 2025; originally announced February 2025.

  6. arXiv:2501.11310  [pdf, other]

    cs.CV

    Anomaly Detection for Industrial Applications, Its Challenges, Solutions, and Future Directions: A Review

    Authors: Abdelrahman Alzarooni, Ehtesham Iqbal, Samee Ullah Khan, Sajid Javed, Brain Moyo, Yusra Abdulrahman

    Abstract: Anomaly detection from images captured using camera sensors is one of the mainstream applications at the industrial level. Particularly, it maintains the quality and optimizes the efficiency in production processes across diverse industrial tasks, including advanced manufacturing and aerospace engineering. Traditional anomaly detection workflow is based on a manual inspection by human operators, w…

    Submitted 20 January, 2025; originally announced January 2025.

  7. arXiv:2412.05700  [pdf, other]

    cs.CV cs.GR

    Temporally Compressed 3D Gaussian Splatting for Dynamic Scenes

    Authors: Saqib Javed, Ahmad Jarrar Khan, Corentin Dumery, Chen Zhao, Mathieu Salzmann

    Abstract: Recent advancements in high-fidelity dynamic scene reconstruction have leveraged dynamic 3D Gaussians and 4D Gaussian Splatting for realistic scene representation. However, to make these methods viable for real-time applications such as AR/VR, gaming, and rendering on low-power devices, substantial reductions in memory usage and improvements in rendering efficiency are required. While many state-o…

    Submitted 7 December, 2024; originally announced December 2024.

    Comments: Code will be released soon

  8. arXiv:2411.13988  [pdf, other]

    cs.RO

    Dehazing-aided Multi-Rate Multi-Modal Pose Estimation Framework for Mitigating Visual Disturbances in Extreme Underwater Domain

    Authors: Vidya Sudevan, Fakhreddine Zayer, Taimur Hassan, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias

    Abstract: This paper delves into the potential of DU-VIO, a dehazing-aided hybrid multi-rate multi-modal Visual-Inertial Odometry (VIO) estimation framework, designed to thrive in the challenging realm of extreme underwater environments. The cutting-edge DU-VIO framework is incorporating a GAN-based pre-processing module and a hybrid CNN-LSTM module for precise pose estimation, using visibility-enhanced und…

    Submitted 21 November, 2024; originally announced November 2024.

  9. arXiv:2411.13962  [pdf, other]

    cs.RO

    Hybrid-Neuromorphic Approach for Underwater Robotics Applications: A Conceptual Framework

    Authors: Vidya Sudevan, Fakhreddine Zayer, Sajid Javed, Hamad Karki, Giulia De Masi, Jorge Dias

    Abstract: This paper introduces the concept of employing neuromorphic methodologies for task-oriented underwater robotics applications. In contrast to the increasing computational demands of conventional deep learning algorithms, neuromorphic technology, leveraging spiking neural network architectures, promises sophisticated artificial intelligence with significantly reduced computational requirements and p…

    Submitted 21 November, 2024; originally announced November 2024.

  10. arXiv:2411.06078  [pdf, other]

    cs.LG

    A Survey on Kolmogorov-Arnold Network

    Authors: Shriyank Somvanshi, Syed Aaqib Javed, Md Monzurul Islam, Diwas Pandit, Subasish Das

    Abstract: This systematic review explores the theoretical foundations, evolution, applications, and future potential of Kolmogorov-Arnold Networks (KAN), a neural network model inspired by the Kolmogorov-Arnold representation theorem. KANs distinguish themselves from traditional neural networks by using learnable, spline-parameterized functions instead of fixed activation functions, allowing for flexible an…

    Submitted 9 November, 2024; originally announced November 2024.

  11. arXiv:2411.00144  [pdf, other]

    cs.CV cs.GR

    Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis

    Authors: Chen Zhao, Xuan Wang, Tong Zhang, Saqib Javed, Mathieu Salzmann

    Abstract: 3D Gaussian Splatting (3DGS) has demonstrated remarkable effectiveness in novel view synthesis (NVS). However, 3DGS tends to overfit when trained with sparse views, limiting its generalization to novel viewpoints. In this paper, we address this overfitting issue by introducing Self-Ensembling Gaussian Splatting (SE-GS). We achieve self-ensembling by incorporating an uncertainty-aware perturbation…

    Submitted 11 March, 2025; v1 submitted 31 October, 2024; originally announced November 2024.

  12. arXiv:2410.19820  [pdf, other]

    eess.IV cs.CV

    Advancing Histopathology with Deep Learning Under Data Scarcity: A Decade in Review

    Authors: Ahmad Obeid, Said Boumaraf, Anabia Sohail, Taimur Hassan, Sajid Javed, Jorge Dias, Mohammed Bennamoun, Naoufel Werghi

    Abstract: Recent years witnessed remarkable progress in computational histopathology, largely fueled by deep learning. This brought the clinical adoption of deep learning-based tools within reach, promising significant benefits to healthcare, offering a valuable second opinion on diagnoses, streamlining complex tasks, and mitigating the risks of inconsistency and bias in clinical decisions. However, a well-…

    Submitted 18 October, 2024; originally announced October 2024.

    Comments: 36 pages

  13. arXiv:2410.12034  [pdf, other]

    cs.LG cs.AI

    A Survey on Deep Tabular Learning

    Authors: Shriyank Somvanshi, Subasish Das, Syed Aaqib Javed, Gian Antariksa, Ahmed Hossain

    Abstract: Tabular data, widely used in industries like healthcare, finance, and transportation, presents unique challenges for deep learning due to its heterogeneous nature and lack of spatial structure. This survey reviews the evolution of deep learning models for tabular data, from early fully connected networks (FCNs) to advanced architectures like TabNet, SAINT, TabTranSELU, and MambaNet. These models i…

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: 43 pages, 18 figures, 3 tables

  14. arXiv:2410.06020  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO

    QT-DoG: Quantization-aware Training for Domain Generalization

    Authors: Saqib Javed, Hieu Le, Mathieu Salzmann

    Abstract: Domain Generalization (DG) aims to train models that perform well not only on the training (source) domains but also on novel, unseen target data distributions. A key challenge in DG is preventing overfitting to source domains, which can be mitigated by finding flatter minima in the loss landscape. In this work, we propose Quantization-aware Training for Domain Generalization (QT-DoG) and demonstr…

    Submitted 8 October, 2024; originally announced October 2024.

    Comments: Code will be released soon

  15. arXiv:2407.18970  [pdf, other]

    cs.CV

    Region Guided Attention Network for Retinal Vessel Segmentation

    Authors: Syed Javed, Tariq M. Khan, Abdul Qayyum, Arcot Sowmya, Imran Razzak

    Abstract: Retinal imaging has emerged as a promising method of addressing this challenge, taking advantage of the unique structure of the retina. The retina is an embryonic extension of the central nervous system, providing a direct in vivo window into neurological health. Recent studies have shown that specific structural changes in retinal vessels can not only serve as early indicators of various diseases…

    Submitted 20 September, 2024; v1 submitted 21 July, 2024; originally announced July 2024.

  16. arXiv:2407.15707  [pdf, other]

    cs.CV cs.AI eess.IV

    Predicting the Best of N Visual Trackers

    Authors: Basit Alawode, Sajid Javed, Arif Mahmood, Jiri Matas

    Abstract: We observe that the performance of SOTA visual trackers surprisingly strongly varies across different video attributes and datasets. No single tracker remains the best performer across all tracking attributes and datasets. To bridge this gap, for a given video sequence, we predict the "Best of the N Trackers", called the BofN meta-tracker. At its core, a Tracking Performance Prediction Network (TP…

    Submitted 22 July, 2024; originally announced July 2024.

  17. arXiv:2407.10785  [pdf, other]

    eess.IV cs.CV

    Learning biologically relevant features in a pathology foundation model using sparse autoencoders

    Authors: Nhat Minh Le, Ciyue Shen, Neel Patel, Chintan Shah, Darpan Sanghavi, Blake Martin, Alfred Eng, Daniel Shenker, Harshith Padigela, Raymond Biju, Syed Ashar Javed, Jennifer Hipp, John Abel, Harsha Pokkalla, Sean Grullon, Dinkar Juyal

    Abstract: Pathology plays an important role in disease diagnosis, treatment decision-making and drug development. Previous works on interpretability for machine learning models on pathology images have revolved around methods such as attention value visualization and deriving human-interpretable features from model heatmaps. Mechanistic interpretability is an emerging area of model interpretability that foc…

    Submitted 16 December, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  18. arXiv:2406.05205  [pdf, other]

    cs.CV cs.CL cs.LG cs.MM eess.IV

    CPLIP: Zero-Shot Learning for Histopathology with Comprehensive Vision-Language Alignment

    Authors: Sajid Javed, Arif Mahmood, Iyyakutti Iyappan Ganapathi, Fayaz Ali Dharejo, Naoufel Werghi, Mohammed Bennamoun

    Abstract: This paper proposes Comprehensive Pathology Language Image Pre-training (CPLIP), a new unsupervised technique designed to enhance the alignment of images and text in histopathology for tasks such as classification and segmentation. This methodology enriches vision-language models by leveraging extensive data without needing ground truth annotations. CPLIP involves constructing a pathology-specific…

    Submitted 7 June, 2024; originally announced June 2024.

  19. arXiv:2405.19387  [pdf, other]

    cs.CV

    Video Anomaly Detection in 10 Years: A Survey and Outlook

    Authors: Moshira Abdalla, Sajid Javed, Muaz Al Radi, Anwaar Ulhaq, Naoufel Werghi

    Abstract: Video anomaly detection (VAD) holds immense importance across diverse domains such as surveillance, healthcare, and environmental monitoring. While numerous surveys focus on conventional VAD methods, they often lack depth in exploring specific approaches and emerging trends. This survey explores deep learning-based VAD, expanding beyond traditional supervised training paradigms to encompass emergi…

    Submitted 30 June, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  20. arXiv:2405.17520  [pdf, other]

    eess.IV cs.CV

    Advancing Medical Image Segmentation with Mini-Net: A Lightweight Solution Tailored for Efficient Segmentation of Medical Images

    Authors: Syed Javed, Tariq M. Khan, Abdul Qayyum, Hamid Alinejad-Rokny, Arcot Sowmya, Imran Razzak

    Abstract: Accurate segmentation of anatomical structures and abnormalities in medical images is crucial for computer-aided diagnosis and analysis. While deep learning techniques excel at this task, their computational demands pose challenges. Additionally, some cutting-edge segmentation methods, though effective for general object segmentation, may not be optimised for medical images. To address these issue…

    Submitted 20 September, 2024; v1 submitted 27 May, 2024; originally announced May 2024.

  21. arXiv:2405.07905  [pdf, other]

    eess.IV cs.CV

    PLUTO: Pathology-Universal Transformer

    Authors: Dinkar Juyal, Harshith Padigela, Chintan Shah, Daniel Shenker, Natalia Harguindeguy, Yi Liu, Blake Martin, Yibo Zhang, Michael Nercessian, Miles Markey, Isaac Finberg, Kelsey Luu, Daniel Borders, Syed Ashar Javed, Emma Krause, Raymond Biju, Aashish Sood, Allen Ma, Jackson Nyman, John Shamshoian, Guillaume Chhor, Darpan Sanghavi, Marc Thibault, Limin Yu, Fedaa Najdawi, et al. (8 additional authors not shown)

    Abstract: Pathology is the study of microscopic inspection of tissue, and a pathology diagnosis is often the medical gold standard to diagnose disease. Pathology images provide a unique challenge for computer-vision-based analysis: a single pathology Whole Slide Image (WSI) is gigapixel-sized and often contains hundreds of thousands to millions of objects of interest across multiple resolutions. In this wor…

    Submitted 13 May, 2024; originally announced May 2024.

  22. arXiv:2404.10940  [pdf, other]

    cs.CV

    Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network

    Authors: Yusra Alkendi, Rana Azzam, Sajid Javed, Lakmal Seneviratne, Yahya Zweiri

    Abstract: Moving object segmentation is critical to interpret scene dynamics for robotic navigation systems in challenging environments. Neuromorphic vision sensors are tailored for motion perception due to their asynchronous nature, high temporal resolution, and reduced power consumption. However, their unconventional output requires novel perception paradigms to leverage their spatially sparse and tempora…

    Submitted 16 April, 2024; originally announced April 2024.

  23. arXiv:2401.01180  [pdf, other]

    cs.CV cs.AI eess.IV

    Accurate and Efficient Urban Street Tree Inventory with Deep Learning on Mobile Phone Imagery

    Authors: Asim Khan, Umair Nawaz, Anwaar Ulhaq, Iqbal Gondal, Sajid Javed

    Abstract: Deforestation, a major contributor to climate change, poses detrimental consequences such as agricultural sector disruption, global warming, flash floods, and landslides. Conventional approaches to urban street tree inventory suffer from inaccuracies and necessitate specialised equipment. To overcome these challenges, this paper proposes an innovative method that leverages deep learning techniques…

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: 8 Pages, 7 figures and 5 Tables

  24. arXiv:2311.18488  [pdf, other]

    cs.IT

    Low-Complexity Linear Programming Based Decoding of Quantum LDPC codes

    Authors: Sana Javed, Francisco Garcia-Herrero, Bane Vasic, Mark F. Flanagan

    Abstract: This paper proposes two approaches for reducing the impact of the error floor phenomenon when decoding quantum low-density parity-check codes with belief propagation based algorithms. First, a low-complexity syndrome-based linear programming (SB-LP) decoding algorithm is proposed, and second, the proposed SB-LP is applied as a post-processing step after syndrome-based min-sum (SB-MS) decoding. For…

    Submitted 19 January, 2024; v1 submitted 30 November, 2023; originally announced November 2023.

    Comments: Accepted for publication at the IEEE International Conference on Communications (ICC) 2024

  25. arXiv:2311.10651  [pdf]

    cs.CV

    3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning

    Authors: Iyyakutti Iyappan Ganapathi, Fayaz Ali, Sajid Javed, Syed Sadaf Ali, Naoufel Werghi

    Abstract: Analysis of the 3D Texture is indispensable for various tasks, such as retrieval, segmentation, classification, and inspection of sculptures, knitted fabrics, and biological tissues. A 3D texture is a locally repeated surface variation independent of the surface's overall shape and can be determined using the local neighborhood and its characteristics. Existing techniques typically employ computer…

    Submitted 17 November, 2023; originally announced November 2023.

    Comments: This paper is accepted in 3DV-2024

  26. arXiv:2309.15576  [pdf, other]

    cs.CV

    Learning Spatial-Temporal Regularized Tensor Sparse RPCA for Background Subtraction

    Authors: Basit Alawode, Sajid Javed

    Abstract: Video background subtraction is one of the fundamental problems in computer vision that aims to segment all moving objects. Robust principal component analysis has been identified as a promising unsupervised paradigm for background subtraction tasks in the last decade thanks to its competitive performance in a number of benchmark datasets. Tensor robust principal component analysis variations have…

    Submitted 27 September, 2023; originally announced September 2023.

    Comments: Under review

  27. arXiv:2308.15816  [pdf, other]

    cs.CV

    Improving Underwater Visual Tracking With a Large Scale Dataset and Image Enhancement

    Authors: Basit Alawode, Fayaz Ali Dharejo, Mehnaz Ummar, Yuhang Guo, Arif Mahmood, Naoufel Werghi, Fahad Shahbaz Khan, Jiri Matas, Sajid Javed

    Abstract: This paper presents a new dataset and general tracker enhancement method for Underwater Visual Object Tracking (UVOT). Despite its significance, underwater tracking has remained unexplored due to data inaccessibility. It poses distinct challenges; the underwater environment exhibits non-uniform lighting conditions, low visibility, lack of sharpness, low contrast, camouflage, and reflections from s…

    Submitted 31 August, 2023; v1 submitted 30 August, 2023; originally announced August 2023.

  28. arXiv:2308.04168  [pdf, other]

    cs.CV

    EFaR 2023: Efficient Face Recognition Competition

    Authors: Jan Niklas Kolf, Fadi Boutros, Jurek Elliesen, Markus Theuerkauf, Naser Damer, Mohamad Alansari, Oussama Abdul Hay, Sara Alansari, Sajid Javed, Naoufel Werghi, Klemen Grm, Vitomir Štruc, Fernando Alonso-Fernandez, Kevin Hernandez Diaz, Josef Bigun, Anjith George, Christophe Ecabert, Hatef Otroshi Shahreza, Ketan Kotwal, Sébastien Marcel, Iurii Medvedev, Bo Jin, Diogo Nunes, Ahmad Hassanpour, Pankaj Khatiwada, et al. (2 additional authors not shown)

    Abstract: This paper presents the summary of the Efficient Face Recognition Competition (EFaR) held at the 2023 International Joint Conference on Biometrics (IJCB 2023). The competition received 17 submissions from 6 different teams. To drive further development of efficient face recognition models, the submitted solutions are ranked based on a weighted score of the achieved verification accuracies on a div…

    Submitted 8 August, 2023; originally announced August 2023.

    Comments: Accepted at IJCB 2023

  29. arXiv:2305.02032  [pdf, other]

    cs.CV cs.LG

    Unsupervised Mutual Transformer Learning for Multi-Gigapixel Whole Slide Image Classification

    Authors: Sajid Javed, Arif Mahmood, Talha Qaiser, Naoufel Werghi, Nasir Rajpoot

    Abstract: Classification of gigapixel Whole Slide Images (WSIs) is an important prediction task in the emerging area of computational pathology. There has been a surge of research in deep learning models for WSI classification with clinical applications such as cancer detection or prediction of molecular mutations from WSIs. Most methods require expensive and labor-intensive manual annotations by expert pat…

    Submitted 3 May, 2023; originally announced May 2023.

  30. arXiv:2303.13405  [pdf, other]

    cs.CV cs.LG

    SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology

    Authors: Dinkar Juyal, Siddhant Shingi, Syed Ashar Javed, Harshith Padigela, Chintan Shah, Anand Sampat, Archit Khosla, John Abel, Amaro Taylor-Weiner

    Abstract: Multiple Instance learning (MIL) models have been extensively used in pathology to predict biomarkers and risk-stratify patients from gigapixel-sized images. Machine learning problems in medical imaging often deal with rare diseases, making it important for these models to work in a label-imbalanced setting. In pathology images, there is another level of imbalance, where given a positively labeled…

    Submitted 9 September, 2023; v1 submitted 23 March, 2023; originally announced March 2023.

  31. arXiv:2303.06753  [pdf, other]

    cs.CV cs.LG cs.RO

    Modular Quantization-Aware Training for 6D Object Pose Estimation

    Authors: Saqib Javed, Chengkun Li, Andrew Price, Yinlin Hu, Mathieu Salzmann

    Abstract: Edge applications, such as collaborative robotics and spacecraft rendezvous, demand efficient 6D object pose estimation on resource-constrained embedded platforms. Existing 6D pose estimation networks are often too large for such deployments, necessitating compression while maintaining reliable performance. To address this challenge, we introduce Modular Quantization-Aware Training (MQAT), an adap…

    Submitted 4 November, 2024; v1 submitted 12 March, 2023; originally announced March 2023.

    Comments: Accepted to Transactions on Machine Learning Research (TMLR), 2024

  32. arXiv:2302.14807  [pdf, other]

    cs.CV cs.RO

    DFR-FastMOT: Detection Failure Resistant Tracker for Fast Multi-Object Tracking Based on Sensor Fusion

    Authors: Mohamed Nagy, Majid Khonji, Jorge Dias, Sajid Javed

    Abstract: Persistent multi-object tracking (MOT) allows autonomous vehicles to navigate safely in highly dynamic environments. One of the well-known challenges in MOT is object occlusion when an object becomes unobservant for subsequent frames. The current MOT methods store objects information, like objects' trajectory, in internal memory to recover the objects after occlusions. However, they retain short-t…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: © 2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

  33. arXiv:2302.10505  [pdf, other]

    cs.LG eess.SP

    Higher-order Sparse Convolutions in Graph Neural Networks

    Authors: Jhony H. Giraldo, Sajid Javed, Arif Mahmood, Fragkiskos D. Malliaros, Thierry Bouwmans

    Abstract: Graph Neural Networks (GNNs) have been applied to many problems in computer sciences. Capturing higher-order relationships between nodes is crucial to increase the expressive power of GNNs. However, existing methods to capture these relationships could be infeasible for large-scale graphs. In this work, we introduce a new higher-order sparse convolution based on the Sobolev norm of graph signals.…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: Accepted in IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) 2023

  34. arXiv:2209.01274  [pdf]

    cs.CV

    Person Monitoring by Full Body Tracking in Uniform Crowd Environment

    Authors: Zhibo Zhang, Omar Alremeithi, Maryam Almheiri, Marwa Albeshr, Xiaoxiong Zhang, Sajid Javed, Naoufel Werghi

    Abstract: Full body trackers are utilized for surveillance and security purposes, such as person-tracking robots. In the Middle East, uniform crowd environments are the norm which challenges state-of-the-art trackers. Despite tremendous improvements in tracker technology documented in the past literature, these trackers have not been trained using a dataset that captures these environments. In this work, we…

    Submitted 2 September, 2022; originally announced September 2022.

    Comments: Accepted by the conference International Conference on Advances in Data-driven Computing and Intelligent Systems (ADCIS 2022), published in Scopus indexed Springer Book Series, 'Lecture Notes in Networks and Systems'

  35. arXiv:2208.10238  [pdf, other]

    cs.CV

    Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen an increased interest in establishing association between faces and voices of celebrities leveraging audio-visual information from YouTube. Prior works adopt metric learning methods to learn an embedding space that is amenable for associated matching and verification tasks. Albeit showing some progress, such formulations are, however, restrictive due to dependency on distanc…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Submitted: IEEE Transactions on Multimedia. arXiv admin note: substantial text overlap with arXiv:2112.10483

  36. Graph CNN for Moving Object Detection in Complex Environments from Unseen Videos

    Authors: Jhony H. Giraldo, Sajid Javed, Naoufel Werghi, Thierry Bouwmans

    Abstract: Moving Object Detection (MOD) is a fundamental step for many computer vision applications. MOD becomes very challenging when a video sequence captured from a static or moving camera suffers from the challenges: camouflage, shadow, dynamic backgrounds, and lighting variations, to name a few. Deep learning methods have been successfully applied to address MOD with competitive performance. However, i…

    Submitted 13 July, 2022; originally announced July 2022.

    Journal ref: Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) Workshops, 2021, pp. 225-233

  37. arXiv:2206.01794  [pdf, other]

    cs.CV cs.LG

    Additive MIL: Intrinsically Interpretable Multiple Instance Learning for Pathology

    Authors: Syed Ashar Javed, Dinkar Juyal, Harshith Padigela, Amaro Taylor-Weiner, Limin Yu, Aaditya Prakash

    Abstract: Multiple Instance Learning (MIL) has been widely applied in pathology towards solving critical problems such as automating cancer diagnosis and grading, predicting patient prognosis, and therapy response. Deploying these models in a clinical setting requires careful inspection of these black boxes during development and deployment to identify failures and maintain physician trust. In this work, we…

    Submitted 16 October, 2022; v1 submitted 3 June, 2022; originally announced June 2022.

  38. arXiv:2205.10553  [pdf, other]

    cs.CV cs.RO

    Robot Person Following in Uniform Crowd Environment

    Authors: Adarsh Ghimire, Xiaoxiong Zhang, Sajid Javed, Jorge Dias, Naoufel Werghi

    Abstract: Person-tracking robots have many applications, such as in security, elderly care, and socializing robots. Such a task is particularly challenging when the person is moving in a Uniform crowd. Also, despite significant progress of trackers reported in the literature, state-of-the-art trackers have hardly addressed person following in such scenarios. In this work, we focus on improving the perceptiv…

    Submitted 21 May, 2022; originally announced May 2022.

    Journal ref: ICRA Workshop 2022: ROBOTIC PERCEPTION AND MAPPING: EMERGING TECHNIQUES

  39. arXiv:2205.04213  [pdf, other]

    cs.RO

    Deep learning framework for robot for person detection and tracking

    Authors: Adarsh Ghimire, Xiaoxiong Zhang, Naoufel Werghi, Sajid Javed, Jorge Dias

    Abstract: Robustly tracking a person of interest in the crowd with a robotic platform is one of the cornerstones of human-robot interaction. The robot platform which is limited by the computational power, rapid movements, and occlusions of the target requires an efficient and robust framework to perform tracking. This paper proposes a deep learning framework for tracking a person using a mobile robot with a…

    Submitted 19 April, 2022; originally announced May 2022.

    Comments: Presented Conference Paper

    Journal ref: Graduate Students Research Conference 2021

  40. arXiv:2204.08978  [pdf, other]

    cs.CV

    Real-Time Face Recognition System

    Authors: Adarsh Ghimire, Naoufel Werghi, Sajid Javed, Jorge Dias

    Abstract: Over the past few decades, interest in algorithms for face recognition has been growing rapidly and has even surpassed human-level performance. Despite their accomplishments, their practical integration with a real-time performance-hungry system is not feasible due to high computational costs. So in this paper, we explore the recent, fast, and accurate face recognition system that can be easily in…

    Submitted 19 April, 2022; originally announced April 2022.

    Comments: Poster

    Journal ref: Graduate Students Research Conference 2022

  41. arXiv:2204.05205  [pdf, other]

    eess.IV cs.CV cs.LG

    Rethinking Machine Learning Model Evaluation in Pathology

    Authors: Syed Ashar Javed, Dinkar Juyal, Zahil Shanis, Shreya Chakraborty, Harsha Pokkalla, Aaditya Prakash

    Abstract: Machine Learning has been applied to pathology images in research and clinical practice with promising outcomes. However, standard ML models often lack the rigorous evaluation required for clinical decisions. Machine learning techniques for natural images are ill-equipped to deal with pathology images that are significantly large and noisy, require expensive labeling, are hard to interpret, and ar…

    Submitted 18 April, 2022; v1 submitted 11 April, 2022; originally announced April 2022.

    Comments: ICLR 2022 ML Evaluation Workshop

  42. arXiv:2204.04199  [pdf, other

    eess.IV cs.CV

    Underwater Image Enhancement Using Pre-trained Transformer

    Authors: Abderrahmene Boudiaf, Yuhang Guo, Adarsh Ghimire, Naoufel Werghi, Giulia De Masi, Sajid Javed, Jorge Dias

    Abstract: The goal of this work is to apply a denoising image transformer to remove the distortion from underwater images and compare it with other similar approaches. Automatic restoration of underwater images plays an important role since it allows increasing the quality of the images without the need for more expensive equipment. This is a critical example of the important role of the machine learning…

    Submitted 8 April, 2022; originally announced April 2022.

  43. Neuromorphic Camera Denoising using Graph Neural Network-driven Transformers

    Authors: Yusra Alkendi, Rana Azzam, Abdulla Ayyad, Sajid Javed, Lakmal Seneviratne, Yahya Zweiri

    Abstract: Neuromorphic vision is a bio-inspired technology that has triggered a paradigm shift in the computer-vision community and is serving as a key-enabler for a multitude of applications. This technology has offered significant advantages including reduced power consumption, reduced processing needs, and communication speed-ups. However, neuromorphic cameras suffer from significant amounts of measureme…

    Submitted 4 July, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

  44. arXiv:2112.02838  [pdf, other

    cs.CV

    Visual Object Tracking with Discriminative Filters and Siamese Networks: A Survey and Outlook

    Authors: Sajid Javed, Martin Danelljan, Fahad Shahbaz Khan, Muhammad Haris Khan, Michael Felsberg, Jiri Matas

    Abstract: Accurate and robust visual object tracking is one of the most challenging and fundamental computer vision problems. It entails estimating the trajectory of the target in an image sequence, given only its initial location, and segmentation, or its rough approximation in the form of a bounding box. Discriminative Correlation Filters (DCFs) and deep Siamese Networks (SNs) have emerged as dominating t…

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: Tracking Survey

  45. arXiv:2111.13656  [pdf, other

    cs.CV

    Towards Low-Cost and Efficient Malaria Detection

    Authors: Waqas Sultani, Wajahat Nawaz, Syed Javed, Muhammad Sohail Danish, Asma Saadia, Mohsen Ali

    Abstract: Malaria, a fatal but curable disease, claims hundreds of thousands of lives every year. Early and correct diagnosis is vital to avoid health complexities; however, it depends upon the availability of costly microscopes and trained experts to analyze blood-smear slides. Deep learning-based methods have the potential to not only decrease the burden of experts but also improve diagnostic accuracy on l…

    Submitted 16 April, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

  46. arXiv:1912.05636  [pdf, ps, other

    cs.CV cs.LG cs.MM

    CineFilter: Unsupervised Filtering for Real Time Autonomous Camera Systems

    Authors: Sudheer Achary, K L Bhanu Moorthy, Syed Ashar Javed, Nikita Shravan, Vineet Gandhi, Anoop Namboodiri

    Abstract: Autonomous camera systems are often subjected to an optimization/filtering operation to smoothen and stabilize the rough trajectory estimates. Most common filtering techniques do reduce the irregularities in data; however, they fail to mimic the behavior of a human cameraman. Global filtering methods modeling human camera operators have been successful; however, they are limited to offline setting…

    Submitted 27 May, 2020; v1 submitted 11 December, 2019; originally announced December 2019.

  47. arXiv:1910.01210  [pdf, other

    cs.CV cs.LG cs.RO

    Embodied Language Grounding with 3D Visual Feature Representations

    Authors: Mihir Prabhudesai, Hsiao-Yu Fish Tung, Syed Ashar Javed, Maximilian Sieb, Adam W. Harley, Katerina Fragkiadaki

    Abstract: We propose associating language utterances to 3D visual abstractions of the scene they describe. The 3D visual abstractions are encoded as 3-dimensional visual feature maps. We infer these 3D visual scene feature maps from RGB images of the scene via view prediction: when the generated 3D scene feature map is neurally projected from a camera viewpoint, it should match the corresponding RGB image.…

    Submitted 17 June, 2021; v1 submitted 2 October, 2019; originally announced October 2019.

    Journal ref: Conference on Computer Vision and Pattern Recognition. 2020, pp. 2220-2229

  48. arXiv:1812.07368  [pdf, other

    cs.CV

    Handcrafted and Deep Trackers: Recent Visual Object Tracking Approaches and Trends

    Authors: Mustansar Fiaz, Arif Mahmood, Sajid Javed, Soon Ki Jung

    Abstract: In recent years visual object tracking has become a very active research area. An increasing number of tracking algorithms are being proposed each year. It is because tracking has wide applications in various real world problems such as human-computer interaction, autonomous vehicles, robotics, surveillance and security just to name a few. In the current study, we review latest trends and advances…

    Submitted 11 February, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: 27 pages, 26 figures. arXiv admin note: substantial text overlap with arXiv:1802.03098

  49. arXiv:1811.05255  [pdf, ps, other

    cs.CV

    Deep Neural Network Concepts for Background Subtraction: A Systematic Review and Comparative Evaluation

    Authors: Thierry Bouwmans, Sajid Javed, Maryam Sultana, Soon Ki Jung

    Abstract: Conventional neural networks show a powerful framework for background subtraction in video acquired by static cameras. Indeed, the well-known SOBS method and its variants based on neural networks were the leading methods on the large-scale CDnet 2012 dataset for a long time. Recently, convolutional neural networks which belong to deep learning methods were employed with success for background ini…

    Submitted 13 November, 2018; originally announced November 2018.

    Comments: 46 pages, 4 figures, submitted to neural networks

  50. arXiv:1811.01526  [pdf, other

    cs.CV

    Unsupervised RGBD Video Object Segmentation Using GANs

    Authors: Maryam Sultana, Arif Mahmood, Sajid Javed, Soon Ki Jung

    Abstract: Video object segmentation is a fundamental step in many advanced vision applications. Most existing algorithms are based on handcrafted features such as HOG, super-pixel segmentation or texture-based techniques, while recently deep features have been found to be more efficient. Existing algorithms observe performance degradation in the presence of challenges such as illumination variations, shadow…

    Submitted 5 November, 2018; originally announced November 2018.

    Comments: 15 pages, 3 figures, ACCV workshop on RGB-D-sensing and understanding via combined colour and depth