Skip to main content

Showing 1–31 of 31 results for author: Khan, A M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17297  [pdf, ps, other

    cs.LG cs.AI

    SafeRL-Lite: A Lightweight, Explainable, and Constrained Reinforcement Learning Library

    Authors: Satyam Mishra, Phung Thao Vi, Shivam Mishra, Vishwanath Bijalwan, Vijay Bhaskar Semwal, Abdul Manan Khan

    Abstract: We introduce SafeRL-Lite, an open-source Python library for building reinforcement learning (RL) agents that are both constrained and explainable. Existing RL toolkits often lack native mechanisms for enforcing hard safety constraints or producing human-interpretable rationales for decisions. SafeRL-Lite provides modular wrappers around standard Gym environments and deep Q-learning agents to enabl… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 10 pages, 7 figures, open-source library, PyPI installable: pip install saferl-lite

    MSC Class: 68T05 ACM Class: I.2.6; I.2.8

  2. arXiv:2504.13242  [pdf, other

    cs.CV

    Dynamic Memory-enhanced Transformer for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Manuel Mazzara, Salvatore Distefano, Adil Mehmood Khan

    Abstract: Hyperspectral image (HSI) classification remains a challenging task due to the intricate spatial-spectral correlations. Existing transformer models excel in capturing long-range dependencies but often suffer from information redundancy and attention inefficiencies, limiting their ability to model fine-grained relationships crucial for HSI classification. To overcome these limitations, this work pr… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2502.06427  [pdf, other

    cs.CV

    Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Muhammad Hassaan Farooq Butt, Muhammad Usama, Manuel Mazzara, Salvatore Distefano, Adil Mehmood Khan, Danfeng Hong

    Abstract: Hyperspectral image (HSI) classification plays a pivotal role in domains such as environmental monitoring, agriculture, and urban planning. However, it faces significant challenges due to the high-dimensional nature of the data and the complex spectral-spatial relationships inherent in HSI. Traditional methods, including conventional machine learning and convolutional neural networks (CNNs), often… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  4. DiffFormer: a Differential Spatial-Spectral Transformer for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Manuel Mazzara, Salvatore Distefano, Adil Mehmood Khan, Silvia Liberata Ullo

    Abstract: Hyperspectral image classification (HSIC) has gained significant attention because of its potential in analyzing high-dimensional data with rich spectral and spatial information. In this work, we propose the Differential Spatial-Spectral Transformer (DiffFormer), a novel framework designed to address the inherent challenges of HSIC, such as spectral redundancy and spatial discontinuity. The DiffFo… ▽ More

    Submitted 23 December, 2024; originally announced December 2024.

    Report number: 10.1109/JSTARS.2025.3558889

    Journal ref: 10.1109/JSTARS.2025.3558889

  5. arXiv:2411.00833  [pdf, other

    cs.CV cs.LG

    Yoga Pose Classification Using Transfer Learning

    Authors: M. M. Akash, Rahul Deb Mohalder, Md. Al Mamun Khan, Laboni Paul, Ferdous Bin Ali

    Abstract: Yoga has recently become an essential aspect of human existence for maintaining a healthy body and mind. People find it tough to devote time to the gym for workouts as their lives get more hectic and they work from home. This kind of human pose estimation is one of the notable problems as it has to deal with locating body key points or joints. Yoga-82, a benchmark dataset for large-scale yoga pose… ▽ More

    Submitted 29 October, 2024; originally announced November 2024.

  6. Spatial and Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Muhammad Hassaan Farooq Butt, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Muhammad Usama, Swalpa Kumar Roy, Jocelyn Chanussot, Danfeng Hong

    Abstract: Recent advancements in transformers, specifically self-attention mechanisms, have significantly improved hyperspectral image (HSI) classification. However, these models often suffer from inefficiencies, as their computational complexity scales quadratically with sequence length. To address these challenges, we propose the morphological spatial mamba (SMM) and morphological spatial-spectral Mamba (… ▽ More

    Submitted 30 November, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

  7. arXiv:2407.04069  [pdf, other

    cs.CL cs.AI cs.LG

    A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations

    Authors: Md Tahmid Rahman Laskar, Sawsan Alqahtani, M Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee Wei Tan, Md Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang

    Abstract: Large Language Models (LLMs) have recently gained significant attention due to their remarkable capabilities in performing diverse tasks across various domains. However, a thorough evaluation of these models is crucial before deploying them in real-world applications to ensure they produce reliable performance. Despite the well-established importance of evaluating LLMs in the community, the comple… ▽ More

    Submitted 3 October, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted at EMNLP 2024 (Main Conference)

  8. A Comprehensive Survey for Hyperspectral Image Classification: The Evolution from Conventional to Transformers and Mamba Models

    Authors: Muhammad Ahmad, Salvatore Distifano, Adil Mehmood Khan, Manuel Mazzara, Chenyu Li, Hao Li, Jagannath Aryal, Yao Ding, Gemine Vivone, Danfeng Hong

    Abstract: Hyperspectral Image Classification (HSC) presents significant challenges owing to the high dimensionality and intricate nature of Hyperspectral (HS) data. While traditional Machine Learning (TML) approaches have demonstrated effectiveness, they often encounter substantial obstacles in real-world applications, including the variability of optimal feature sets, subjectivity in human-driven design, i… ▽ More

    Submitted 14 November, 2024; v1 submitted 23 April, 2024; originally announced April 2024.

    Report number: https://doi.org/10.1016/j.neucom.2025.130428

  9. arXiv:2307.07096  [pdf, other

    eess.AS cs.SD

    Low Rank Properties for Estimating Microphones Start Time and Sources Emission Time

    Authors: Faxian Cao, Yongqiang Cheng, Adil Mehmood Khan, Zhijing Yang, S. M. Ahsan Kazmiand Yingxiu Chang

    Abstract: Uncertainty in timing information pertaining to the start time of microphone recordings and sources' emission time pose significant challenges in various applications, such as joint microphones and sources localization. Traditional optimization methods, which directly estimate this unknown timing information (UTIm), often fall short compared to approaches exploiting the low-rank property (LRP). LR… ▽ More

    Submitted 21 July, 2023; v1 submitted 13 July, 2023; originally announced July 2023.

    Comments: 13 pages for main content; 9 pages for proof of proposed low rank properties; 13 figures

  10. arXiv:2306.14255  [pdf, other

    eess.IV cs.CV

    AttResDU-Net: Medical Image Segmentation Using Attention-based Residual Double U-Net

    Authors: Akib Mohammed Khan, Alif Ashrafee, Fahim Shahriar Khan, Md. Bakhtiar Hasan, Md. Hasanul Kabir

    Abstract: Manually inspecting polyps from a colonoscopy for colorectal cancer or performing a biopsy on skin lesions for skin cancer are time-consuming, laborious, and complex procedures. Automatic medical image segmentation aims to expedite this diagnosis process. However, numerous challenges exist due to significant variations in the appearance and sizes of objects with no distinct boundaries. This paper… ▽ More

    Submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted in 2023 International Joint Conference on Neural Networks (IJCNN 2023)

  11. arXiv:2305.11397  [pdf, other

    eess.AS cs.SD

    Are Microphone Signals Alone Sufficient for Self-Positioning?

    Authors: Faxian Cao, Yongqiang Cheng, Adil Mehmood Khan, Zhijing Yang

    Abstract: In an era where asynchronous environments pose challenges to traditional self-positioning methods, we propose a new transformation to the existing paradigm. Traditionally, time of arrival (TOA) measurements require both microphone and source signals, limiting their applicability in environments with unknown emission time of human voices or sources and unknown recording start time of independent mi… ▽ More

    Submitted 6 July, 2023; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: 1 figure, including 3 sub-figures

  12. arXiv:2305.11387  [pdf, other

    cs.LG cs.AI

    Justices for Information Bottleneck Theory

    Authors: Faxian Cao, Yongqiang Cheng, Adil Mehmood Khan, Zhijing Yang

    Abstract: This study comes as a timely response to mounting criticism of the information bottleneck (IB) theory, injecting fresh perspectives to rectify misconceptions and reaffirm its validity. Firstly, we introduce an auxiliary function to reinterpret the maximal coding rate reduction method as a special yet local optimal case of IB theory. Through this auxiliary function, we clarify the paradox of decrea… ▽ More

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 9 pages, 1 figures (4 subfigures)

  13. arXiv:2303.03004  [pdf, other

    cs.CL

    xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

    Authors: Mohammad Abdullah Matin Khan, M Saiful Bari, Xuan Long Do, Weishi Wang, Md Rizwan Parvez, Shafiq Joty

    Abstract: Recently, pre-trained large language models (LLMs) have shown impressive abilities in generating codes from natural language descriptions, repairing buggy codes, translating codes between languages, and retrieving relevant code segments. However, the evaluation of these models has often been performed in a scattered way on only one or two specific tasks, in a few languages, at a partial granularit… ▽ More

    Submitted 6 November, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Code & Data available at https://github.com/ntunlp/xCodeEval, https://huggingface.co/datasets/NTU-NLP-sg/xCodeEval. Evaluation framework available at https://github.com/ntunlp/execeval

  14. Rethinking Cooking State Recognition with Vision Transformers

    Authors: Akib Mohammed Khan, Alif Ashrafee, Reeshoon Sayera, Shahriar Ivan, Sabbir Ahmed

    Abstract: To ensure proper knowledge representation of the kitchen environment, it is vital for kitchen robots to recognize the states of the food items that are being cooked. Although the domain of object detection and recognition has been extensively studied, the task of object state classification has remained relatively unexplored. The high intra-class similarity of ingredients during different states o… ▽ More

    Submitted 24 December, 2022; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted in 25th ICCIT (6 pages, 5 Figures, 5 Tables)

    Report number: 10055869

    Journal ref: 2022 25th International Conference on Computer and Information Technology (ICCIT)

  15. arXiv:2201.01001  [pdf, other

    cs.CV eess.IV

    Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification

    Authors: Muhammad Ahmad, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Swalpa Kumar Roy, Xin Wu

    Abstract: Convolutional Neural Networks (CNN) are more suitable, indeed. However, fixed kernel sizes make traditional CNN too specific, neither flexible nor conducive to feature learning, thus impacting on the classification accuracy. The convolution of different kernel size networks may overcome this problem by capturing more discriminating and relevant information. In light of this, the proposed solution… ▽ More

    Submitted 4 January, 2022; originally announced January 2022.

  16. arXiv:2108.08339  [pdf, other

    cs.CV cs.AI

    Real-time Bangla License Plate Recognition System for Low Resource Video-based Applications

    Authors: Alif Ashrafee, Akib Mohammed Khan, Mohammad Sabik Irbaz, MD Abdullah Al Nasim

    Abstract: Automatic License Plate Recognition systems aim to provide a solution for detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the real world requires real-time performance in low-resource environments. In our paper, we propose a two-stage detection pipeline paired with Vision API that provides real-time inferen… ▽ More

    Submitted 14 November, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision - Real-World Surveillance 2022 (IEEE/CVF WACV RWS 2022)

  17. arXiv:2107.12826  [pdf, other

    cs.LG cs.AI

    Adversarial Stacked Auto-Encoders for Fair Representation Learning

    Authors: Patrik Joslin Kenfack, Adil Mehmood Khan, Rasheed Hussain, S. M. Ahsan Kazmi

    Abstract: Training machine learning models with the only accuracy as a final goal may promote prejudices and discriminatory behaviors embedded in the data. One solution is to learn latent representations that fulfill specific fairness metrics. Different types of learning methods are employed to map data into the fair representational space. The main purpose is to learn a latent representation of data that s… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: ICML2021 ML4data Workshop Paper

  18. arXiv:2103.10257  [pdf, other

    cs.LG cs.AI

    Domain Generalization using Ensemble Learning

    Authors: Yusuf Mesbah, Youssef Youssry Ibrahim, Adil Mehood Khan

    Abstract: Domain generalization is a sub-field of transfer learning that aims at bridging the gap between two different domains in the absence of any knowledge about the target domain. Our approach tackles the problem of a model's weak generalization when it is trained on a single source domain. From this perspective, we build an ensemble model on top of base deep learning models trained on a single source… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

    Comments: 11 pages, 3 figures, 4 tables, summited to IntelliSys 2021

  19. arXiv:2103.00950  [pdf, other

    cs.LG cs.CV

    On the Fairness of Generative Adversarial Networks (GANs)

    Authors: Patrik Joslin Kenfack, Daniil Dmitrievich Arapov, Rasheed Hussain, S. M. Ahsan Kazmi, Adil Mehmood Khan

    Abstract: Generative adversarial networks (GANs) are one of the greatest advances in AI in recent years. With their ability to directly learn the probability distribution of data, and then sample synthetic realistic data. Many applications have emerged, using GANs to solve classical problems in machine learning, such as data augmentation, class unbalance problems, and fair representation learning. In this p… ▽ More

    Submitted 21 May, 2021; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: Corrected typos, added results on CelibA dataset

  20. arXiv:2101.10532  [pdf, other

    cs.CV cs.LG eess.IV

    Hyperspectral Image Classification: Artifacts of Dimension Reduction on Hybrid CNN

    Authors: Muhammad Ahmad, Sidrah Shabbir, Rana Aamir Raza, Manuel Mazzara, Salvatore Distefano, Adil Mehmood Khan

    Abstract: Convolutional Neural Networks (CNN) has been extensively studied for Hyperspectral Image Classification (HSIC) more specifically, 2D and 3D CNN models have proved highly efficient in exploiting the spatial and spectral information of Hyperspectral Images. However, 2D CNN only considers the spatial information and ignores the spectral information whereas 3D CNN jointly exploits spatial-spectral inf… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

    Comments: 9 pages, 9 figures

    Report number: https://doi.org/10.1016/j.ijleo.2021.167757

    Journal ref: 2021

  21. Hyperspectral Image Classification-Traditional to Deep Models: A Survey for Future Prospects

    Authors: Muhammad Ahmad, Sidrah Shabbir, Swalpa Kumar Roy, Danfeng Hong, Xin Wu, Jing Yao, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Jocelyn Chanussot

    Abstract: Hyperspectral Imaging (HSI) has been extensively utilized in many real-life applications because it benefits from the detailed spectral information contained in each pixel. Notably, the complex characteristics i.e., the nonlinear relation among the captured spectral information and the corresponding object of HSI data make accurate classification challenging for traditional methods. In the last fe… ▽ More

    Submitted 27 April, 2022; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: https://ieeexplore.ieee.org/abstract/document/9645266

  22. arXiv:2008.06971  [pdf

    eess.SP cs.LG

    Physical Action Categorization using Signal Analysis and Machine Learning

    Authors: Asad Mansoor Khan, Ayesha Sadiq, Sajid Gul Khawaja, Norah Saleh Alghamdi, Muhammad Usman Akram, Ali Saeed

    Abstract: Daily life of thousands of individuals around the globe suffers due to physical or mental disability related to limb movement. The quality of life for such individuals can be made better by use of assistive applications and systems. In such scenario, mapping of physical actions from movement to a computer aided application can lead the way for solution. Surface Electromyography (sEMG) presents a n… ▽ More

    Submitted 1 February, 2022; v1 submitted 16 August, 2020; originally announced August 2020.

  23. HyperProv: Decentralized Resilient Data Provenance at the Edge with Blockchains

    Authors: Petter Tunstad, Amin M. Khan, Phuong Hoai Ha

    Abstract: Data provenance and lineage are critical for ensuring integrity and reproducibility of information in research and application. This is particularly challenging for distributed scenarios, where data may be originating from decentralized sources without any central control by a single trusted entity. We present HyperProv, a general framework for data provenance based on the permissioned blockchain… ▽ More

    Submitted 13 October, 2019; originally announced October 2019.

  24. HeTM: Transactional Memory for Heterogeneous Systems

    Authors: Daniel Castro, Paolo Romano, Aleksandar Ilic, Amin M. Khan

    Abstract: Modern heterogeneous computing architectures, which couple multi-core CPUs with discrete many-core GPUs (or other specialized hardware accelerators), enable unprecedented peak performance and energy efficiency levels. Unfortunately, though, developing applications that can take full advantage of the potential of heterogeneous systems is a notoriously hard task. This work takes a step towards reduc… ▽ More

    Submitted 2 September, 2019; v1 submitted 2 May, 2019; originally announced May 2019.

    Comments: The current work was accepted in the 28th International Conference on Parallel Architectures and Compilation Techniques (PACT'19)

  25. arXiv:1706.01739  [pdf, other

    cs.CR cs.CY

    Multi Sensor-based Implicit User Identification

    Authors: Muhammad Ahmad, Ali Kashif Bashir, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano, Shahzad Sarfraz

    Abstract: Smartphones have ubiquitously integrated into our home and work environments, however, users normally rely on explicit but inefficient identification processes in a controlled environment. Therefore, when a device is stolen, a thief can have access to the owner's personal information and services against the stored passwords. As a result of this potential scenario, this work proposes an automatic… ▽ More

    Submitted 24 September, 2020; v1 submitted 6 June, 2017; originally announced June 2017.

  26. arXiv:1706.01720  [pdf, ps, other

    cs.CY

    Seeking Optimum System Settings for Physical Activity Recognition on Smartwatches

    Authors: Muhammad Ahmad, Adil Mehmood Khan, Manuel Mazzara, Salvatore Distefano

    Abstract: Physical activity recognition (PAR) using wearable devices can provide valued information regarding an individual's degree of functional ability and lifestyle. In this regards, smartphone-based physical activity recognition is a well-studied area. Research on smartwatch-based PAR, on the other hand, is still in its infancy. Through a large-scale exploratory study, this work aims to investigate the… ▽ More

    Submitted 21 April, 2019; v1 submitted 6 June, 2017; originally announced June 2017.

    Comments: 15 pages, 2 figures, Accepted in CVC'19

    Journal ref: Computer Vision Conference (CVC'19), 2019

  27. Segmented and Non-Segmented Stacked Denoising Autoencoder for Hyperspectral Band Reduction

    Authors: Muhammad Ahmad, Asad Khan, Adil Mehmood Khan, Rasheed Hussain

    Abstract: Hyperspectral image analysis often requires selecting the most informative bands instead of processing the whole data without losing the key information. Existing band reduction (BR) methods have the capability to reveal the nonlinear properties exhibited in the data but at the expense of loosing its original representation. To cope with the said issue, an unsupervised non-linear segmented and non… ▽ More

    Submitted 22 April, 2018; v1 submitted 19 May, 2017; originally announced May 2017.

    Comments: 10 pages, 14 figures

    Journal ref: Optik-2019

  28. A Distributed Auctioneer for Resource Allocation in Decentralized Systems

    Authors: Amin M. Khan, Xavier Vilaça, Luís Rodrigues, Felix Freitag

    Abstract: In decentralized systems, nodes often need to coordinate to access shared resources in a fair manner. One approach to perform such arbitration is to rely on auction mechanisms. Although there is an extensive literature that studies auctions, most of these works assume the existence of a central, trusted auctioneer. Unfortunately, in fully decentralized systems, where the nodes that need to coopera… ▽ More

    Submitted 25 April, 2016; originally announced April 2016.

    Comments: 17 pages, 5 figures, 1 algorithm, published in ICDCS'16

    Journal ref: 36th IEEE International Conference on Distributed Computing Systems (ICDCS 2016). Nara, Japan. 27-30 June 2016

  29. Assessment of algorithms for mitosis detection in breast cancer histopathology images

    Authors: Mitko Veta, Paul J. van Diest, Stefan M. Willems, Haibo Wang, Anant Madabhushi, Angel Cruz-Roa, Fabio Gonzalez, Anders B. L. Larsen, Jacob S. Vestergaard, Anders B. Dahl, Dan C. Cireşan, Jürgen Schmidhuber, Alessandro Giusti, Luca M. Gambardella, F. Boray Tek, Thomas Walter, Ching-Wei Wang, Satoshi Kondo, Bogdan J. Matuszewski, Frederic Precioso, Violet Snell, Josef Kittler, Teofilo E. de Campos, Adnan M. Khan, Nasir M. Rajpoot , et al. (4 additional authors not shown)

    Abstract: The proliferative activity of breast tumors, which is routinely estimated by counting of mitotic figures in hematoxylin and eosin stained histology sections, is considered to be one of the most important prognostic markers. However, mitosis counting is laborious, subjective and may suffer from low inter-observer agreement. With the wider acceptance of whole slide images in pathology labs, automati… ▽ More

    Submitted 21 November, 2014; originally announced November 2014.

    Comments: 23 pages, 5 figures, accepted for publication in the journal Medical Image Analysis

  30. A Reusable Component for Communication and Data Synchronization in Mobile Distributed Interactive Applications

    Authors: Abdul Malik Khan, Sophie Chabridon, Antoine Beugnard

    Abstract: In Distributed Interactive Applications (DIA) such as multiplayer games, where many participants are involved in a same game session and communicate through a network, they may have an inconsistent view of the virtual world because of the communication delays across the network. This issue becomes even more challenging when communicating through a cellular network while executing the DIA client… ▽ More

    Submitted 14 October, 2010; originally announced October 2010.

    Comments: In Proceedings WCSI 2010, arXiv:1010.2337

    ACM Class: D.2.2; D.2.11

    Journal ref: EPTCS 37, 2010, pp. 86-100

  31. arXiv:1001.1966  [pdf

    cs.CV cs.CR

    A New Method to Extract Dorsal Hand Vein Pattern using Quadratic Inference Function

    Authors: Maleika Heenaye Mamode Khan, Naushad Ali Mamode Khan

    Abstract: Among all biometric, dorsal hand vein pattern is attracting the attention of researchers, of late. Extensive research is being carried out on various techniques in the hope of finding an efficient one which can be applied on dorsal hand vein pattern to improve its accuracy and matching time. One of the crucial step in biometric is the extraction of features. In this paper, we propose a method ba… ▽ More

    Submitted 12 January, 2010; originally announced January 2010.

    Comments: 5 pages IEEE format, International Journal of Computer Science and Information Security, IJCSIS December 2009, ISSN 1947 5500, http://sites.google.com/site/ijcsis/

    Report number: ISSN 1947 5500

    Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 6, No. 3, pp. 026-030, December 2009, USA