Skip to main content

Showing 1–5 of 5 results for author: Reza, M K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.17823  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Robust Multimodal Learning via Cross-Modal Proxy Tokens

    Authors: Md Kaykobad Reza, Ameya Patil, Mashhour Solh, M. Salman Asif

    Abstract: Multimodal models often experience a significant performance drop when one or more modalities are missing during inference. To address this challenge, we propose a simple yet effective approach that enhances robustness to missing modalities while maintaining strong performance when all modalities are available. Our method introduces cross-modal proxy tokens (CMPTs), which approximate the class tok… ▽ More

    Submitted 2 June, 2025; v1 submitted 29 January, 2025; originally announced January 2025.

    Comments: 21 Pages, 9 Figures, 6 Tables

  2. arXiv:2410.03010  [pdf, other

    cs.LG cs.CV

    MMP: Towards Robust Multi-Modal Learning with Masked Modality Projection

    Authors: Niki Nezakati, Md Kaykobad Reza, Ameya Patil, Mashhour Solh, M. Salman Asif

    Abstract: Multimodal learning seeks to combine data from multiple input sources to enhance the performance of different downstream tasks. In real-world scenarios, performance can degrade substantially if some input modalities are missing. Existing methods that can handle missing modalities involve custom training or adaptation steps for each input modality combination. These approaches are either tied to sp… ▽ More

    Submitted 7 October, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

  3. arXiv:2403.15937  [pdf, other

    cs.SI cs.IR

    Model, Analyze, and Comprehend User Interactions within a Social Media Platform

    Authors: Md Kaykobad Reza, S M Maksudul Alam, Yiran Luo, Youzhe Liu, Md Siam

    Abstract: In this study, we propose a novel graph-based approach to model, analyze and comprehend user interactions within a social media platform based on post-comment relationship. We construct a user interaction graph from social media data and analyze it to gain insights into community dynamics, user behavior, and content preferences. Our investigation reveals that while 56.05% of the active users are s… ▽ More

    Submitted 28 November, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: Accepted by 27th International Conference on Computer and Information Technology (ICCIT), 2024. 6 Pages, 6 Figures

  4. Robust Multimodal Learning with Missing Modalities via Parameter-Efficient Adaptation

    Authors: Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Multimodal learning seeks to utilize data from multiple sources to improve the overall performance of downstream tasks. It is desirable for redundancies in the data to make multimodal systems robust to missing or corrupted observations in some correlated modalities. However, we observe that the performance of several existing multimodal networks significantly deteriorates if one or multiple modali… ▽ More

    Submitted 7 October, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: Accepted by IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). 28 pages, 6 figures, 17 tables

  5. MMSFormer: Multimodal Transformer for Material and Semantic Segmentation

    Authors: Md Kaykobad Reza, Ashley Prater-Bennette, M. Salman Asif

    Abstract: Leveraging information across diverse modalities is known to enhance performance on multimodal segmentation tasks. However, effectively fusing information from different modalities remains challenging due to the unique characteristics of each modality. In this paper, we propose a novel fusion strategy that can effectively fuse information from different modality combinations. We also propose a new… ▽ More

    Submitted 7 April, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE Open Journal of Signal Processing. 15 pages, 3 figures, 9 tables