Showing 1–8 of 8 results for author: Ishmam, A

Searching in archive cs.
  1. arXiv:2411.15673

    cs.CV

    Semantic Shield: Defending Vision-Language Models Against Backdooring and Poisoning via Fine-grained Knowledge Alignment

    Authors: Alvi Md Ishmam, Christopher Thomas

    Abstract: In recent years there has been enormous interest in vision-language models trained using self-supervised objectives. However, the use of large-scale datasets scraped from the web for training also makes these models vulnerable to potential security threats, such as backdooring and poisoning attacks. In this paper, we propose a method for mitigating such attacks on contrastively trained vision-lang…

    Submitted 23 November, 2024; originally announced November 2024.

    Comments: CVPR 2024

  2. arXiv:2409.12953

    cs.CV cs.AI

    JourneyBench: A Challenging One-Stop Vision-Language Understanding Benchmark of Generated Images

    Authors: Zhecan Wang, Junzhang Liu, Chia-Wei Tang, Hani Alomari, Anushka Sivakumar, Rui Sun, Wenhao Li, Md. Atabuzzaman, Hammad Ayyubi, Haoxuan You, Alvi Ishmam, Kai-Wei Chang, Shih-Fu Chang, Chris Thomas

    Abstract: Existing vision-language understanding benchmarks largely consist of images of objects in their usual contexts. As a consequence, recent multimodal large language models can perform well with only a shallow visual understanding by relying on background language biases. Thus, strong performance on these benchmarks does not necessarily correlate with strong visual understanding. In this paper, we re…

    Submitted 9 January, 2025; v1 submitted 19 September, 2024; originally announced September 2024.

  3. arXiv:2304.00622

    cs.CV cs.LG

    Automatic Detection of Natural Disaster Effect on Paddy Field from Satellite Images using Deep Learning Techniques

    Authors: Tahmid Alavi Ishmam, Amin Ahsan Ali, Md Ahsraful Amin, A K M Mahbubur Rahman

    Abstract: This paper aims to detect rice field damage from natural disasters in Bangladesh using high-resolution satellite imagery. The authors developed ground truth data for rice field damage from the field level. At first, NDVI differences before and after the disaster are calculated to identify possible crop loss. The areas equal to and above the 0.33 threshold are marked as crop loss areas as significa…

    Submitted 2 April, 2023; originally announced April 2023.

    Comments: 6 pages, 13 figures. This paper has been accepted for presentation at the ICCRE2023 conference, held at Nagaoka University of Technology, Japan

  4. arXiv:2202.12250

    cs.CV cs.AI cs.NE

    BLPnet: A new DNN model and Bengali OCR engine for Automatic License Plate Recognition

    Authors: Md. Saif Hassan Onim, Hussain Nyeem, Koushik Roy, Mahmudul Hasan, Abtahi Ishmam, Md. Akiful Hoque Akif, Tareque Bashar Ovi

    Abstract: The development of the Automatic License Plate Recognition (ALPR) system has received much attention for the English license plate. However, despite being the sixth largest population around the world, no significant progress can be tracked in the Bengali language countries or states for the ALPR system addressing their more alarming traffic management with inadequate road-safety measures. This pa…

    Submitted 18 February, 2022; originally announced February 2022.

    Comments: Submitted to Neurocomputing (https://www.sciencedirect.com/journal/neurocomputing/about/aims-and-scope)

  5. arXiv:2112.04752

    cs.CV

    Modelling Lips-State Detection Using CNN for Non-Verbal Communications

    Authors: Abtahi Ishmam, Mahmudul Hasan, Md. Saif Hassan Onim, Koushik Roy, Md. Akiful Haque Akif, Hussain Nyeem

    Abstract: Vision-based deep learning models can be promising for speech-and-hearing-impaired and secret communications. While such non-verbal communications are primarily investigated with hand-gestures and facial expressions, no research endeavour is tracked so far for the lips state (i.e., open/close)-based interpretation/translation system. In support of this development, this paper reports two new Convo…

    Submitted 11 December, 2021; v1 submitted 9 December, 2021; originally announced December 2021.

  6. arXiv:2107.13653

    cs.LG

    Demand Forecasting in Smart Grid Using Long Short-Term Memory

    Authors: Koushik Roy, Abtahi Ishmam, Kazi Abu Taher

    Abstract: Demand forecasting in power sector has become an important part of modern demand management and response systems with the rise of smart metering enabled grids. Long Short-Term Memory (LSTM) shows promising results in predicting time series data which can also be applied to power load demand in smart grids. In this paper, an LSTM based model using neural network architecture is proposed to forecast…

    Submitted 28 July, 2021; originally announced July 2021.

    Comments: 5 pages, 6 figures, 2021 International Conference on Automation, Control and Mechatronics for Industry 4.0 (ACMI), 8-9 July 2021, Rajshahi, Bangladesh

  7. arXiv:2011.13238

    cs.CL cs.SI

    Towards Interpretable Multilingual Detection of Hate Speech against Immigrants and Women in Twitter at SemEval-2019 Task 5

    Authors: Alvi Md Ishmam

    Abstract: This paper describes our techniques to detect hate speech against women and immigrants on Twitter in multilingual contexts, particularly in English and Spanish. The challenge was designed by SemEval-2019 Task 5, where the participants need to design algorithms to detect hate speech in English and Spanish language with a given target (e.g., women or immigrants). Here, we have developed two deep neur…

    Submitted 26 November, 2020; originally announced November 2020.

  8. arXiv:2003.10504

    cs.CY cs.HC

    Challenges of Bridging the Gap between Mass People and Welfare Organizations in Bangladesh

    Authors: Alvi Md Ishmam, Md Raihan Mia

    Abstract: Computing for the development of marginalized communities is a big deal of challenges for researchers. Different social organizations are working to develop the conditions of a specialized marginalized community namely Street Children, one of the most underprivileged communities in Bangladesh. However, lack of proper engagement among different social welfare organizations, donors, and the mass com…

    Submitted 2 April, 2020; v1 submitted 23 March, 2020; originally announced March 2020.