Showing 1–10 of 10 results for author: Shahgir, H S

Searching in archive cs.

  1. arXiv:2505.12358  [pdf, ps, other]

    cs.LG cs.AI

    AbFlowNet: Optimizing Antibody-Antigen Binding Energy via Diffusion-GFlowNet Fusion

    Authors: Abrar Rahman Abir, Haz Sameen Shahgir, Md Rownok Zahan Ratul, Md Toki Tahmid, Greg Ver Steeg, Yue Dong

    Abstract: Complementarity Determining Regions (CDRs) are critical segments of an antibody that facilitate binding to specific antigens. Current computational methods for CDR design utilize reconstruction losses and do not jointly optimize binding energy, a crucial metric for antibody efficacy. Rather, binding energy optimization is done through computationally expensive Online Reinforcement Learning (RL) pi…

    Submitted 18 May, 2025; originally announced May 2025.

  2. arXiv:2504.20896  [pdf, other]

    cs.SE

    LELANTE: LEveraging LLM for Automated ANdroid TEsting

    Authors: Shamit Fatin, Mehbubul Hasan Al-Quvi, Haz Sameen Shahgir, Sukarna Barua, Anindya Iqbal, Sadia Sharmin, Md. Mostofa Akbar, Kallol Kumar Pal, A. Asif Al Rashid

    Abstract: Given natural language test case description for an Android application, existing testing approaches require developers to manually write scripts using tools such as Appium and Espresso to execute the corresponding test case. This process is labor-intensive and demands significant effort to maintain as UI interfaces evolve throughout development. In this work, we introduce LELANTE, a novel framewo…

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: 6 pages, 4 figures, 29th International Conference on Evaluation and Assessment in Software Engineering (EASE)

  3. arXiv:2504.08202  [pdf, other]

    cs.CL

    Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

    Authors: Yu Fu, Haz Sameen Shahgir, Hui Liu, Xianfeng Tang, Qi He, Yue Dong

    Abstract: Recent advances in long-context models (LCMs), designed to handle extremely long input contexts, primarily focus on utilizing external contextual information, often leaving the influence of large language models' intrinsic knowledge underexplored. In this work, we investigate how this intrinsic knowledge affects content generation and demonstrate that its impact becomes increasingly pronounced as…

    Submitted 10 April, 2025; originally announced April 2025.

    Comments: 21 pages, 11 figures

  4. arXiv:2503.02948  [pdf, other]

    cs.CL cs.IR

    ExpertGenQA: Open-ended QA generation in Specialized Domains

    Authors: Haz Sameen Shahgir, Chansong Lim, Jia Chen, Evangelos E. Papalexakis, Yue Dong

    Abstract: Generating high-quality question-answer pairs for specialized technical domains remains challenging, with existing approaches facing a tradeoff between leveraging expert examples and achieving topical diversity. We present ExpertGenQA, a protocol that combines few-shot learning with structured topic and style categorization to generate comprehensive domain-specific QA pairs. Using U.S. Federal Rai…

    Submitted 4 March, 2025; originally announced March 2025.

  5. arXiv:2407.00416  [pdf, other]

    cs.CL

    Too Late to Train, Too Early To Use? A Study on Necessity and Viability of Low-Resource Bengali LLMs

    Authors: Tamzeed Mahfuz, Satak Kumar Dey, Ruwad Naswan, Hasnaen Adil, Khondker Salman Sayeed, Haz Sameen Shahgir

    Abstract: Each new generation of English-oriented Large Language Models (LLMs) exhibits enhanced cross-lingual transfer capabilities and significantly outperforms older LLMs on low-resource languages. This prompts the question: Is there a need for LLMs dedicated to a particular low-resource language? We aim to explore this question for Bengali, a low-to-moderate resource Indo-Aryan language native to the Be…

    Submitted 12 December, 2024; v1 submitted 29 June, 2024; originally announced July 2024.

  6. arXiv:2403.15952  [pdf, other]

    cs.CV cs.CL

    IllusionVQA: A Challenging Optical Illusion Dataset for Vision Language Models

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar

    Abstract: The advent of Vision Language Models (VLM) has allowed researchers to investigate the visual understanding of a neural network using natural language. Beyond object classification and detection, VLMs are capable of visual comprehension and common-sense reasoning. This naturally led to the question: How do VLMs respond when the image itself is inherently unreasonable? To this end, we present Illusi…

    Submitted 9 August, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

  7. arXiv:2401.12210  [pdf, other]

    cs.CV

    Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition

    Authors: Haz Sameen Shahgir, Khondker Salman Sayeed, Md Toki Tahmid, Tanjeem Azwad Zaman, Md. Zarif Ul Alam

    Abstract: Recent advances in Deep Learning and Computer Vision have been successfully leveraged to serve marginalized communities in various contexts. One such area is Sign Language - a primary means of communication for the deaf community. However, so far, the bulk of research efforts and investments have gone into American Sign Language, and research activity into low-resource sign languages - especially…

    Submitted 22 January, 2024; originally announced January 2024.

  8. arXiv:2312.14440  [pdf, other]

    cs.LG cs.CR

    Asymmetric Bias in Text-to-Image Generation with Adversarial Attacks

    Authors: Haz Sameen Shahgir, Xianghao Kong, Greg Ver Steeg, Yue Dong

    Abstract: The widespread use of Text-to-Image (T2I) models in content generation requires careful examination of their safety, including their robustness to adversarial attacks. Despite extensive research on adversarial attacks, the reasons for their effectiveness remain underexplored. This paper presents an empirical study on adversarial attacks against T2I models, focusing on analyzing factors associated…

    Submitted 17 July, 2024; v1 submitted 22 December, 2023; originally announced December 2023.

    Comments: camera-ready version

  9. arXiv:2310.14005  [pdf, ps, other]

    eess.IV cs.CV

    Leveraging Complementary Attention maps in vision transformers for OCT image analysis

    Authors: Haz Sameen Shahgir, Tanjeem Azwad Zaman, Khondker Salman Sayeed, Md. Asif Haider, Sheikh Saifur Rahman Jony, M. Sohel Rahman

    Abstract: Optical Coherence Tomography (OCT) scan yields all possible cross-section images of a retina for detecting biomarkers linked to optical defects. Due to the high volume of data generated, an automated and reliable biomarker detection pipeline is necessary as a primary screening stage. We outline our new state-of-the-art pipeline for identifying biomarkers from OCT scans. In collaboration with tra…

    Submitted 30 May, 2025; v1 submitted 21 October, 2023; originally announced October 2023.

    Comments: Accepted in 2025 IEEE International Conference on Image Processing

  10. arXiv:2303.09306  [pdf, ps, other]

    cs.CL cs.AI

    BanglaCoNER: Towards Robust Bangla Complex Named Entity Recognition

    Authors: HAZ Sameen Shahgir, Ramisa Alam, Md. Zarif Ul Alam

    Abstract: Named Entity Recognition (NER) is a fundamental task in natural language processing that involves identifying and classifying named entities in text. But much work hasn't been done for complex named entity recognition in Bangla, despite being the seventh most spoken language globally. CNER is a more challenging task than traditional NER as it involves identifying and classifying complex and compou…

    Submitted 17 March, 2023; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: Winning Solution for the Bangla Complex Named Entity Recognition Challenge