Showing 1–50 of 74 results for author: Hasan, M A

Searching in archive cs.
  1. arXiv:2506.20415  [pdf, ps, other]

    cs.CR cs.AI cs.MA

    SV-LLM: An Agentic Approach for SoC Security Verification using Large Language Models

    Authors: Dipayan Saha, Shams Tarek, Hasan Al Shaikh, Khan Thamid Hasan, Pavan Sai Nalluri, Md. Ajoad Hasan, Nashmin Alam, Jingbo Zhou, Sujan Kumar Saha, Mark Tehranipoor, Farimah Farahmandi

    Abstract: Ensuring the security of complex system-on-chips (SoCs) designs is a critical imperative, yet traditional verification techniques struggle to keep pace due to significant challenges in automation, scalability, comprehensiveness, and adaptability. The advent of large language models (LLMs), with their remarkable capabilities in natural language understanding, code generation, and advanced reasoning…

    Submitted 25 June, 2025; originally announced June 2025.

  2. arXiv:2505.23944  [pdf, ps, other]

    cs.CL

    Retrieval Augmented Generation based Large Language Models for Causality Mining

    Authors: Thushara Manjari Naduvilakandy, Hyeju Jang, Mohammad Al Hasan

    Abstract: Causality detection and mining are important tasks in information retrieval due to their enormous use in information extraction, and knowledge graph construction. To solve these tasks, in existing literature there exist several solutions -- both unsupervised and supervised. However, the unsupervised methods suffer from poor performance and they often require significant human intervention for caus…

    Submitted 29 May, 2025; originally announced May 2025.

    Comments: 13 pages, 6 figures, published in knowledgeNLP-NAACL2025

  3. arXiv:2505.21673  [pdf]

    cs.SI

    Supervised Link Prediction in Co-Authorship Networks Based on Author Node-Based Features

    Authors: Doaa Hassan, Mohammad Al Hasan

    Abstract: Predicting the emergence of future research collaborations between authors in academic social networks (SNs) is a very effective example that demonstrates the link prediction problem. This problem refers to predicting the potential existence or absence of a link between a pair of nodes (authors) on the co-authorship network. Various similarity and aggregation metrics were proposed in the literatur…

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.19163  [pdf, ps, other]

    cs.CL cs.AI

    SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs

    Authors: Firoj Alam, Md Arid Hasan, Shammur Absar Chowdhury

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance across various disciplines and tasks. However, benchmarking their capabilities with multilingual spoken queries remains largely unexplored. In this study, we introduce SpokenNativQA, the first multilingual and culturally aligned spoken question-answering (SQA) dataset designed to evaluate LLMs in real-world conversational settin…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Spoken Question Answering, Multilingual LLMs, Speech-based Evaluation, Dialectal Speech, Low-resource Languages, Multimodal Benchmarking, Conversational AI, Speech-to-Text QA, Real-world Interaction, Natural Language Understanding

    MSC Class: 68T50 ACM Class: I.2.7

  5. arXiv:2504.13990  [pdf, other]

    cs.LG cs.AI eess.SY

    PC-DeepNet: A GNSS Positioning Error Minimization Framework Using Permutation-Invariant Deep Neural Network

    Authors: M. Humayun Kabir, Md. Ali Hasan, Md. Shafiqul Islam, Kyeongjun Ko, Wonjae Shin

    Abstract: Global navigation satellite systems (GNSS) face significant challenges in urban and sub-urban areas due to non-line-of-sight (NLOS) propagation, multipath effects, and low received power levels, resulting in highly non-linear and non-Gaussian measurement error distributions. In light of this, conventional model-based positioning approaches, which rely on Gaussian error approximations, struggle to…

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: 31 pages, 14 figures, 6 tables

  6. arXiv:2504.05995  [pdf, ps, other]

    cs.CL cs.AI

    NativQA Framework: Enabling LLMs with Native, Local, and Everyday Knowledge

    Authors: Firoj Alam, Md Arid Hasan, Sahinur Rahman Laskar, Mucahid Kutlu, Kareem Darwish, Shammur Absar Chowdhury

    Abstract: The rapid advancement of large language models (LLMs) has raised concerns about cultural bias, fairness, and their applicability in diverse linguistic and underrepresented regional contexts. To enhance and benchmark the capabilities of LLMs, there is a need to develop large-scale resources focused on multilingual, local, and cultural contexts. In this study, we propose the NativQA framework, which…

    Submitted 7 July, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: LLMs, Native, Multilingual, Language Diversity, Contextual Understanding, Minority Languages, Culturally Informed, Foundation Models, Large Language Models

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  7. arXiv:2503.23764  [pdf, other]

    cs.CV cs.AI

    WaveFormer: A 3D Transformer with Wavelet-Driven Feature Representation for Efficient Medical Image Segmentation

    Authors: Md Mahfuz Al Hasan, Mahdi Zaman, Abdul Jawad, Alberto Santamaria-Pang, Ho Hin Lee, Ivan Tarapov, Kyle See, Md Shah Imran, Antika Roy, Yaser Pourmohammadi Fallah, Navid Asadizanjani, Reza Forghani

    Abstract: Transformer-based architectures have advanced medical image analysis by effectively modeling long-range dependencies, yet they often struggle in 3D settings due to substantial memory overhead and insufficient capture of fine-grained local features. We address these limitations with WaveFormer, a novel 3D-transformer that: i) leverages the fundamental frequency-domain properties of features for con…

    Submitted 31 March, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

  8. arXiv:2502.16612  [pdf, other]

    cs.CL cs.AI

    MemeIntel: Explainable Detection of Propagandistic and Hateful Memes

    Authors: Mohamed Bayan Kmainasi, Abul Hasnat, Md Arid Hasan, Ali Ezzat Shahroor, Firoj Alam

    Abstract: The proliferation of multimodal content on social media presents significant challenges in understanding and moderating complex, context-dependent issues such as misinformation, hate speech, and propaganda. While efforts have been made to develop resources and propose new methods for automatic detection, limited attention has been given to label detection and the generation of explanation-based ra…

    Submitted 23 February, 2025; originally announced February 2025.

    Comments: disinformation, misinformation, factuality, harmfulness, fake news, propaganda, hateful meme, multimodality, text, images

    MSC Class: 68T50 ACM Class: I.2.7

  9. arXiv:2502.16550  [pdf, other]

    cs.CL

    Reasoning About Persuasion: Can LLMs Enable Explainable Propaganda Detection?

    Authors: Maram Hasanain, Md Arid Hasan, Mohamed Bayan Kmainasi, Elisa Sartori, Ali Ezzat Shahroor, Giovanni Da San Martino, Firoj Alam

    Abstract: There has been significant research on propagandistic content detection across different modalities and languages. However, most studies have primarily focused on detection, with little attention given to explanations justifying the predicted label. This is largely due to the lack of resources that provide explanations alongside annotated labels. To address this issue, we propose a multilingual (i…

    Submitted 23 February, 2025; originally announced February 2025.

  10. arXiv:2412.03581  [pdf, other]

    cs.IR cs.AI cs.LG

    A Survey on E-Commerce Learning to Rank

    Authors: Md. Ahsanul Kabir, Mohammad Al Hasan, Aritra Mandal, Daniel Tunkelang, Zhe Wu

    Abstract: In e-commerce, ranking the search results based on users' preference is the most important task. Commercial e-commerce platforms, such as, Amazon, Alibaba, eBay, Walmart, etc. perform extensive and relentless research to perfect their search result ranking algorithms because the quality of ranking drives a user's decision to purchase or not to purchase an item, directly affecting the profitability…

    Submitted 18 November, 2024; originally announced December 2024.

  11. arXiv:2411.18844  [pdf, ps, other]

    cs.CR cs.IT

    Sharing the Path: A Threshold Scheme from Isogenies and Error Correcting Codes

    Authors: Mohamadou Sall, M. Anwar Hasan

    Abstract: In 2022, a prominent supersingular isogeny-based cryptographic scheme, namely SIDH, was compromised by a key recovery attack. However, this attack does not undermine the isogeny path problem, which remains central to the security of isogeny-based cryptography. Following the attacks by Castryck and Decru, as well as Maino and Martindale, Robert gave a mature and polynomial-time algorithm that trans…

    Submitted 27 November, 2024; originally announced November 2024.

  12. arXiv:2411.15182  [pdf, other]

    cs.LG cs.AI

    Forecasting Application Counts in Talent Acquisition Platforms: Harnessing Multimodal Signals using LMs

    Authors: Md Ahsanul Kabir, Kareem Abdelfatah, Shushan He, Mohammed Korayem, Mohammad Al Hasan

    Abstract: As recruitment and talent acquisition have become more and more competitive, recruitment firms have become more sophisticated in using machine learning (ML) methodologies for optimizing their day to day activities. But, most of published ML based methodologies in this area have been limited to the tasks like candidate matching, job to skill matching, job classification and normalization. In this w…

    Submitted 18 November, 2024; originally announced November 2024.

  13. arXiv:2411.07544  [pdf, other]

    cs.CV

    Depthwise Separable Convolutions with Deep Residual Convolutions

    Authors: Md Arid Hasan, Krishno Dey

    Abstract: The recent advancement of edge computing enables researchers to optimize various deep learning architectures to employ them in edge devices. In this study, we aim to optimize Xception architecture which is one of the most popular deep learning algorithms for computer vision applications. The Xception architecture is highly effective for object detection tasks. However, it comes with a significant…

    Submitted 11 November, 2024; originally announced November 2024.

    Comments: Course Project Report

    ACM Class: I.2.7

  14. arXiv:2410.13153  [pdf, other]

    cs.CL

    Better to Ask in English: Evaluation of Large Language Models on English, Low-resource and Cross-Lingual Settings

    Authors: Krishno Dey, Prerona Tarannum, Md. Arid Hasan, Imran Razzak, Usman Naseem

    Abstract: Large Language Models (LLMs) are trained on massive amounts of data, enabling their application across diverse domains and tasks. Despite their remarkable performance, most LLMs are developed and evaluated primarily in English. Recently, a few multi-lingual LLMs have emerged, but their performance in low-resource languages, especially the most spoken languages in South Asia, is less explored. To a…

    Submitted 16 October, 2024; originally announced October 2024.

  15. arXiv:2409.11404  [pdf, other]

    cs.CL cs.AI

    AraDiCE: Benchmarks for Dialectal and Cultural Capabilities in LLMs

    Authors: Basel Mousi, Nadir Durrani, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain, Tameem Kabbani, Fahim Dalvi, Shammur Absar Chowdhury, Firoj Alam

    Abstract: Arabic, with its rich diversity of dialects, remains significantly underrepresented in Large Language Models, particularly in dialectal variations. We address this gap by introducing seven synthetic datasets in dialects alongside Modern Standard Arabic (MSA), created using Machine Translation (MT) combined with human post-editing. We present AraDiCE, a benchmark for Arabic Dialect and Cultural Eva…

    Submitted 17 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: Benchmarking, Culturally Informed, Large Language Models, Arabic NLP, LLMs, Arabic Dialect, Dialectal Benchmarking

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  16. arXiv:2409.10240  [pdf, other]

    eess.AS cs.SD

    oboVox Far Field Speaker Recognition: A Novel Data Augmentation Approach with Pretrained Models

    Authors: Muhammad Sudipto Siam Dip, Md Anik Hasan, Sapnil Sarker Bipro, Md Abdur Raiyan, Mohammod Abdul Motin

    Abstract: In this study, we address the challenge of speaker recognition using a novel data augmentation technique of adding noise to enrollment files. This technique efficiently aligns the sources of test and enrollment files, enhancing comparability. Various pre-trained models were employed, with the resnet model achieving the highest DCF of 0.84 and an EER of 13.44. The augmentation technique notably imp…

    Submitted 16 September, 2024; originally announced September 2024.

    Comments: 5 pages, 2 figures

  17. arXiv:2409.05026  [pdf, other]

    cs.IT

    A Double-Difference Doppler Shift-Based Positioning Framework with Ephemeris Error Correction of LEO Satellites

    Authors: Md. Ali Hasan, M. Humayun Kabir, Md. Shafiqul Islam, Sangmin Han, Wonjae Shin

    Abstract: In signals of opportunity (SOPs)-based positioning utilizing low Earth orbit (LEO) satellites, ephemeris data derived from two-line element files can introduce increasing error over time. To handle the erroneous measurement, an additional base receiver with a known position is often used to compensate for the effect of ephemeris error when positioning the user terminal (UT). However, this approach…

    Submitted 8 September, 2024; originally announced September 2024.

    Comments: 32 pages, 8 figures, 2 tables

  18. arXiv:2408.14111  [pdf, other]

    cs.CV

    Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model

    Authors: Abu Saleh Musa Miah, Md. Al Mehedi Hasan, Md Hadiuzzaman, Muhammad Nazrul Islam, Jungpil Shin

    Abstract: Hand gesture-based sign language recognition (SLR) is one of the most advanced applications of machine learning, and computer vision uses hand gestures. Although, in the past few years, many researchers have widely explored and studied how to address BSL problems, specific unaddressed issues remain, such as skeleton and transformer-based BSL recognition. In addition, the lack of evaluation of the…

    Submitted 26 August, 2024; originally announced August 2024.

  19. arXiv:2408.12211  [pdf, other]

    cs.CV

    Computer-Aided Fall Recognition Using a Three-Stream Spatial-Temporal GCN Model with Adaptive Feature Aggregation

    Authors: Jungpil Shin, Abu Saleh Musa Miah, Rei Egawa, Koki Hirooka, Md. Al Mehedi Hasan, Yoichi Tomioka, Yong Seok Hwang

    Abstract: The prevention of falls is paramount in modern healthcare, particularly for the elderly, as falls can lead to severe injuries or even fatalities. Additionally, the growing incidence of falls among the elderly, coupled with the urgent need to prevent suicide attempts resulting from medication overdose, underscores the critical importance of accurate and efficient fall detection methods. In this sce…

    Submitted 22 August, 2024; originally announced August 2024.

  20. arXiv:2408.10498  [pdf, other]

    eess.IV cs.CV

    Cervical Cancer Detection Using Multi-Branch Deep Learning Model

    Authors: Tatsuhiro Baba, Abu Saleh Musa Miah, Jungpil Shin, Md. Al Mehedi Hasan

    Abstract: Cervical cancer is a crucial global health concern for women, and the persistent infection of High-risk HPV mainly triggers this remains a global health challenge, with young women diagnosis rates soaring from 10% to 40% over three decades. While Pap smear screening is a prevalent diagnostic method, visual image analysis can be lengthy and often leads to mistakes. Early detection of the disease…

    Submitted 19 August, 2024; originally announced August 2024.

  21. arXiv:2408.02237  [pdf, other]

    cs.CL

    Do Large Language Models Speak All Languages Equally? A Comparative Study in Low-Resource Settings

    Authors: Md. Arid Hasan, Prerona Tarannum, Krishno Dey, Imran Razzak, Usman Naseem

    Abstract: Large language models (LLMs) have garnered significant interest in natural language processing (NLP), particularly their remarkable performance in various downstream tasks in resource-rich languages. Recent studies have highlighted the limitations of LLMs in low-resource languages, primarily focusing on binary classification tasks and giving minimal attention to South Asian languages. These limita…

    Submitted 5 August, 2024; originally announced August 2024.

    ACM Class: F.2.2; I.2.7

  22. arXiv:2407.09823  [pdf, ps, other]

    cs.CL cs.AI

    NativQA: Multilingual Culturally-Aligned Natural Query for LLMs

    Authors: Md. Arid Hasan, Maram Hasanain, Fatema Ahmad, Sahinur Rahman Laskar, Sunaya Upadhyay, Vrunda N Sukhadia, Mucahid Kutlu, Shammur Absar Chowdhury, Firoj Alam

    Abstract: Natural Question Answering (QA) datasets play a crucial role in evaluating the capabilities of large language models (LLMs), ensuring their effectiveness in real-world applications. Despite the numerous QA datasets that have been developed and some work has been done in parallel, there is a notable lack of a framework and large scale region-specific datasets queried by native users in their own la…

    Submitted 30 May, 2025; v1 submitted 13 July, 2024; originally announced July 2024.

    Comments: LLMs, Native, Multilingual, Language Diversity, Contextual Understanding, Minority Languages, Culturally Informed, Foundation Models, Large Language Models

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  23. arXiv:2407.05789  [pdf, other]

    cs.LG cs.AI

    CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

    Authors: Philipp Bordne, M. Asif Hasan, Eddie Bergman, Noor Awad, André Biedenkapp

    Abstract: High-dimensional action spaces remain a challenge for dynamic algorithm configuration (DAC). Interdependencies and varying importance between action dimensions are further known key characteristics of DAC problems. We argue that these Coupled Action Dimensions with Importance Differences (CANDID) represent aspects of the DAC problem that are not yet fully explored. To address this gap, we introduc…

    Submitted 17 September, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 5 pages main paper, 11 pages references and appendix, 9 figures, to be published in: Proceedings of the Third International Conference on Automated Machine Learning (AutoML 2024), Workshop Track

  24. arXiv:2407.04247  [pdf, other]

    cs.CL cs.AI cs.CV

    ArAIEval Shared Task: Propagandistic Techniques Detection in Unimodal and Multimodal Arabic Content

    Authors: Maram Hasanain, Md. Arid Hasan, Fatema Ahmed, Reem Suwaileh, Md. Rafiul Biswas, Wajdi Zaghouani, Firoj Alam

    Abstract: We present an overview of the second edition of the ArAIEval shared task, organized as part of the ArabicNLP 2024 conference co-located with ACL 2024. In this edition, ArAIEval offers two tasks: (i) detection of propagandistic textual spans with persuasion techniques identification in tweets and news articles, and (ii) distinguishing between propagandistic and non-propagandistic memes. A total of…

    Submitted 5 July, 2024; originally announced July 2024.

    Comments: propaganda, span detection, disinformation, misinformation, fake news, LLMs, GPT-4, multimodality, multimodal LLMs

    MSC Class: 68T50 ACM Class: F.2.2; I.2.7

  25. arXiv:2406.03916  [pdf, other]

    cs.CL cs.AI cs.CV

    ArMeme: Propagandistic Content in Arabic Memes

    Authors: Firoj Alam, Abul Hasnat, Fatema Ahmed, Md Arid Hasan, Maram Hasanain

    Abstract: With the rise of digital communication, memes have become a significant medium for cultural and political expression that is often used to mislead audiences. Identification of such misleading and persuasive multimodal content has become more important among various stakeholders, including social media platforms, policymakers, and the broader society as they often cause harm to individuals, organiz…

    Submitted 6 October, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: disinformation, misinformation, factuality, harmfulness, fake news, propaganda, multimodality, text, images

    MSC Class: 68T50 ACM Class: I.2.7

  26. arXiv:2404.10924  [pdf, other]

    cs.CL cs.AI

    Binder: Hierarchical Concept Representation through Order Embedding of Binary Vectors

    Authors: Croix Gyurek, Niloy Talukder, Mohammad Al Hasan

    Abstract: For natural language understanding and generation, embedding concepts using an order-based representation is an essential task. Unlike traditional point vector based representation, an order-based representation imposes geometric constraints on the representation vectors for explicitly capturing various semantic relationships that may exist between a pair of concepts. In existing literature, sever…

    Submitted 16 April, 2024; originally announced April 2024.

  27. arXiv:2403.06060  [pdf, other]

    cs.CL cs.LG

    Ensemble Language Models for Multilingual Sentiment Analysis

    Authors: Md Arid Hasan

    Abstract: The rapid advancement of social media enables us to analyze user opinions. In recent times, sentiment analysis has shown a prominent research gap in understanding human sentiment based on the content shared on social media. Although sentiment analysis for commonly spoken languages has advanced significantly, low-resource languages like Arabic continue to get little research due to resource limitat…

    Submitted 9 March, 2024; originally announced March 2024.

    Comments: This is one of my graduate course project reports and currently, I'm not planning to submit to any conferences

    ACM Class: I.2.7

  28. arXiv:2402.11544  [pdf, ps, other]

    cs.IT cs.CR

    On efficient normal bases over binary fields

    Authors: Mohamadou Sall, M. Anwar Hasan

    Abstract: Binary field extensions are fundamental to many applications, such as multivariate public key cryptography, code-based cryptography, and error-correcting codes. Their implementation requires a foundation in number theory and algebraic geometry and necessitates the utilization of efficient bases. The continuous increase in the power of computation, and the design of new (quantum) computers increase…

    Submitted 18 February, 2024; originally announced February 2024.

  29. arXiv:2312.10903  [pdf, other]

    cs.LG cs.AI

    Robust Node Representation Learning via Graph Variational Diffusion Networks

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: Node representation learning by using Graph Neural Networks (GNNs) has been widely explored. However, in recent years, compelling evidence has revealed that GNN-based node representation learning can be substantially deteriorated by delicately-crafted perturbations in a graph structure. To learn robust node representation in the presence of perturbations, various works have been proposed to safegu…

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: preprint, under review

  30. arXiv:2312.08656  [pdf, other]

    cs.LG cs.AI cs.DC

    MaxK-GNN: Extremely Fast GPU Kernel Design for Accelerating Graph Neural Networks Training

    Authors: Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David Kaeli, Caiwen Ding

    Abstract: In the acceleration of deep neural network training, the GPU has become the mainstream platform. GPUs face substantial challenges on GNNs, such as workload imbalance and memory access irregularities, leading to underutilized hardware. Existing solutions such as PyG, DGL with cuSPARSE, and GNNAdvisor frameworks partially address these challenges but memory traffic is still significant. We argue t…

    Submitted 18 March, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: ASPLOS 2024 accepted publication

    ACM Class: I.2; C.5

  31. BLP-2023 Task 2: Sentiment Analysis

    Authors: Md. Arid Hasan, Firoj Alam, Anika Anjum, Shudipta Das, Afiyat Anjum

    Abstract: We present an overview of the BLP Sentiment Shared Task, organized as part of the inaugural BLP 2023 workshop, co-located with EMNLP 2023. The task is defined as the detection of sentiment in a given piece of social media text. This task attracted interest from 71 participants, among whom 29 and 30 teams submitted systems during the development and evaluation phases, respectively. In total, partic…

    Submitted 21 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted in BLP Workshop at EMNLP-23

    ACM Class: I.2.7

  32. arXiv:2309.05865  [pdf, other]

    cs.LG

    Force-directed graph embedding with hops distance

    Authors: Hamidreza Lotfalizadeh, Mohammad Al Hasan

    Abstract: Graph embedding has become an increasingly important technique for analyzing graph-structured data. By representing nodes in a graph as vectors in a low-dimensional space, graph embedding enables efficient graph processing and analysis tasks like node classification, link prediction, and visualization. In this paper, we propose a novel force-directed graph embedding method that utilizes the steady…

    Submitted 11 September, 2023; originally announced September 2023.

  33. arXiv:2308.10783  [pdf, other]

    cs.CL cs.LG

    Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

    Authors: Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

    Abstract: The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the rec…

    Submitted 4 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at LREC-COLING 2024. Zero-Shot Prompting, Few-Shot Prompting, LLMs, Comparative Study, Fine-tuned Models, Bangla, Sentiment Analysis

    MSC Class: 68T50 ACM Class: I.2.7

  34. arXiv:2303.02567  [pdf, other]

    cs.CR cs.LO cs.SE

    Minimize Web Applications vulnerabilities through the early Detection of CRLF Injection

    Authors: MD Asibul Hasan, Md. Mijanur Rahman

    Abstract: Carriage return (CR) and line feed (LF), also known as CRLF injection is a type of vulnerability that allows a hacker to enter special characters into a web application, altering its operation or confusing the administrator. Log poisoning and HTTP response splitting are two prominent harmful uses of this technique. Additionally, CRLF injection can be used by an attacker to exploit other vulnerabil…

    Submitted 4 March, 2023; originally announced March 2023.

    Comments: under peer review

  35. Robust Node Classification on Graphs: Jointly from Bayesian Label Transition and Topology-based Label Propagation

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: Node classification using Graph Neural Networks (GNNs) has been widely applied in various real-world scenarios. However, in recent years, compelling evidence emerges that the performance of GNN-based node classification may deteriorate substantially by topological perturbation, such as random connections or adversarial attacks. Various solutions, such as topological denoising methods and mechanism…

    Submitted 20 August, 2022; originally announced August 2022.

    Comments: The paper is accepted for CIKM 2022

  36. arXiv:2207.09627  [pdf, other]

    cs.CR cs.AI cs.CV cs.LG eess.SY

    EVHA: Explainable Vision System for Hardware Testing and Assurance -- An Overview

    Authors: Md Mahfuz Al Hasan, Mohammad Tahsin Mostafiz, Thomas An Le, Jake Julia, Nidish Vashistha, Shayan Taheri, Navid Asadizanjani

    Abstract: Due to the ever-growing demands for electronic chips in different sectors the semiconductor companies have been mandated to offshore their manufacturing processes. This unwanted matter has made security and trustworthiness of their fabricated chips concerning and caused creation of hardware attacks. In this condition, different entities in the semiconductor supply chain can act maliciously and exe…

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: Please contact Dr. Shayan Taheri for any questions and/or comments regarding the paper arXiv submission at: "www.shayan-taheri.com". The Paper Initial Submission: The ACM Journal on Emerging Technologies in Computing Systems (JETC)

  37. arXiv:2207.07308  [pdf, other]

    cs.CL cs.LG

    Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text

    Authors: Prerona Tarannum, Firoj Alam, Md. Arid Hasan, Sheak Rashed Haider Noori

    Abstract: The wide use of social media and digital technologies facilitates sharing various news and information about events and activities. Despite sharing positive information misleading and false information is also spreading on social media. There have been efforts in identifying such misleading information both manually by human experts and automatic tools. Manual effort does not scale well due to the…

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted in CLEF 2022

    ACM Class: I.2.7

  38. arXiv:2203.06592  [pdf, other]

    cs.CL cs.LG

    Informative Causality Extraction from Medical Literature via Dependency-tree based Patterns

    Authors: Md. Ahsanul Kabir, AlJohara Almulhim, Xiao Luo, Mohammad Al Hasan

    Abstract: Extracting cause-effect entities from medical literature is an important task in medical information retrieval. A solution for solving this task can be used for compilation of various causality relations, such as, causality between disease and symptoms, between medications and side effects, between genes and diseases, etc. Existing solutions for extracting cause-effect entities work well for sente…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 22 pages without comment

    Journal ref: Journal of Healthcare Informatics Research 2022

  39. arXiv:2203.06591  [pdf, other]

    cs.LG

    ORDSIM: Ordinal Regression for E-Commerce Query Similarity Prediction

    Authors: Md. Ahsanul Kabir, Mohammad Al Hasan, Aritra Mandal, Daniel Tunkelang, Zhe Wu

    Abstract: Query similarity prediction task is generally solved by regression based models with square loss. Such a model is agnostic of absolute similarity values and it penalizes the regression error at all ranges of similarity values at the same scale. However, to boost e-commerce platform's monetization, it is important to predict high-level similarity more accurately than low-level similarity, as highly…

    Submitted 13 March, 2022; originally announced March 2022.

    Comments: 9 pages

    Journal ref: Proceedings of the International Workshop on Interactive and Scalable Information Retrieval methods for eCommerce (ISIR-eCom) 2022

  40. arXiv:2203.03762  [pdf, other]

    cs.LG

    Defending Graph Convolutional Networks against Dynamic Graph Perturbations via Bayesian Self-supervision

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: In recent years, plentiful evidence illustrates that Graph Convolutional Networks (GCNs) achieve extraordinary accomplishments on the node classification task. However, GCNs may be vulnerable to adversarial attacks on label-scarce dynamic graphs. Many existing works aim to strengthen the robustness of GCNs; for instance, adversarial training is used to shield GCNs against malicious perturbations.…

    Submitted 7 March, 2022; originally announced March 2022.

    Comments: The paper is accepted by AAAI 2022

  41. arXiv:2201.12291  [pdf]

    q-fin.ST cs.LG

    Simulating Using Deep Learning The World Trade Forecasting of Export-Import Exchange Rate Convergence Factor During COVID-19

    Authors: Effat Ara Easmin Lucky, Md. Mahadi Hasan Sany, Mumenunnesa Keya, Md. Moshiur Rahaman, Umme Habiba Happy, Sharun Akter Khushbu, Md. Arid Hasan

    Abstract: By trade we usually mean the exchange of goods between states and countries. International trade acts as a barometer of the economic prosperity index and every country is overly dependent on resources, so international trade is essential. Trade is significant to the global health crisis, saving lives and livelihoods. By collecting the dataset called "Effects of COVID19 on trade" from the state web…

    Submitted 23 January, 2022; originally announced January 2022.

    Comments: Accepted in ICDLAIR 2021

    MSC Class: 68T50 ACM Class: I.2.7

  42. arXiv:2112.07130  [pdf, ps, other]

    cs.CR cs.IT math.NT

    A code-based hybrid signcryption scheme

    Authors: Jean Belo Klamti, M. Anwar Hasan

    Abstract: A key encapsulation mechanism (KEM) that takes as input an arbitrary string, i.e., a tag, is known as tag-KEM, while a scheme that combines signature and encryption is called signcryption. In this paper, we present a code-based signcryption tag-KEM scheme. We utilize a code-based signature and an IND-CCA2 (adaptive chosen ciphertext attack) secure version of McEliece's encryption scheme. The propo…

    Submitted 21 March, 2023; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: We made some improvements to the paper

    MSC Class: 11T71

  43. arXiv:2108.12828  [pdf, other]

    cs.CV cs.CY cs.LG cs.SI

    MEDIC: A Multi-Task Learning Dataset for Disaster Image Classification

    Authors: Firoj Alam, Tanvirul Alam, Md. Arid Hasan, Abul Hasnat, Muhammad Imran, Ferda Ofli

    Abstract: Recent research in disaster informatics demonstrates a practical and important use case of artificial intelligence to save human lives and reduce suffering during natural disasters based on social media content (text and images). While notable progress has been made using texts, research on exploiting the images remains relatively under-explored. To advance image-based approaches, we propose MEDIC (Avai…

    Submitted 8 June, 2022; v1 submitted 29 August, 2021; originally announced August 2021.

    Comments: Multi-task Learning, Social media images, Image Classification, Natural disasters, Crisis Informatics, Deep learning, Dataset

    MSC Class: 68T50 ACM Class: I.2.7

    Journal ref: Neural Computing and Applications 35, 2609-2632 (2023)

  44. arXiv:2106.14344  [pdf, other]

    cs.LG

    Non-Exhaustive Learning Using Gaussian Mixture Generative Adversarial Networks

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: Supervised learning, while deployed in real-life scenarios, often encounters instances of unknown classes. Conventional algorithms for training a supervised learning model do not provide an option to detect such instances, so they misclassify such instances with 100% probability. Open Set Recognition (OSR) and Non-Exhaustive Learning (NEL) are potential solutions to overcome this problem. Most e…

    Submitted 2 July, 2021; v1 submitted 27 June, 2021; originally announced June 2021.

    Comments: Accepted by ECML-PKDD 2021

  45. arXiv:2106.14207  [pdf]

    eess.IV cs.CV cs.LG

    A Machine Learning Model for Early Detection of Diabetic Foot using Thermogram Images

    Authors: Amith Khandakar, Muhammad E. H. Chowdhury, Mamun Bin Ibne Reaz, Sawal Hamid Md Ali, Md Anwarul Hasan, Serkan Kiranyaz, Tawsifur Rahman, Rashad Alfkey, Ahmad Ashrif A. Bakar, Rayaz A. Malik

    Abstract: Diabetes foot ulceration (DFU) and amputation are a cause of significant morbidity. The prevention of DFU may be achieved by the identification of patients at risk of DFU and the institution of preventative measures through education and offloading. Several studies have reported that thermogram images may help to detect an increase in plantar temperature prior to DFU. However, the distribution of…

    Submitted 27 June, 2021; originally announced June 2021.

    Comments: 23 pages, 8 Figures

  46. arXiv:2104.01523  [pdf, other]

    cs.CL

    ASPER: Attention-based Approach to Extract Syntactic Patterns denoting Semantic Relations in Sentential Context

    Authors: Md. Ahsanul Kabir, Typer Phillips, Xiao Luo, Mohammad Al Hasan

    Abstract: Semantic relationships, such as hyponym-hypernym, cause-effect, and meronym-holonym, between a pair of entities in a sentence are usually reflected through syntactic patterns. Automatic extraction of such patterns benefits several downstream tasks, including entity extraction, ontology building, and question answering. Unfortunately, automatic extraction of such patterns has not yet received much…

    Submitted 3 April, 2021; originally announced April 2021.

  47. arXiv:2011.10106  [pdf]

    cs.CL cs.IR cs.LG

    Sentiment Classification in Bangla Textual Content: A Comparative Study

    Authors: Md. Arid Hasan, Jannatul Tajrin, Shammur Absar Chowdhury, Firoj Alam

    Abstract: Sentiment analysis has been widely used to understand our views on social and political agendas or user experiences over a product. It is one of the core, well-researched areas in NLP. However, for low-resource languages like Bangla, one of the prominent challenges is the lack of resources. Another important limitation in the current literature for Bangla is the absence of comparable results…

    Submitted 19 November, 2020; originally announced November 2020.

    Comments: Accepted at ICCIT-2020

    MSC Class: 68T50 ACM Class: I.2.7

  48. arXiv:2010.14121  [pdf, other]

    cs.LG cs.SI physics.soc-ph

    Deperturbation of Online Social Networks via Bayesian Label Transition

    Authors: Jun Zhuang, Mohammad Al Hasan

    Abstract: Online social networks (OSNs) classify users into different categories based on their online activities and interests, a task which is referred to as a node classification task. Such a task can be solved effectively using Graph Convolutional Networks (GCNs). However, a small number of users, so-called perturbators, may perform random activities on an OSN, which significantly deteriorate the performan…

    Submitted 18 January, 2022; v1 submitted 27 October, 2020; originally announced October 2020.

    Comments: TL;DR: GraphLT is the first model that adapts the Bayesian label transition method on GCNs for deperturbation in online social networks. Our work is accepted by SDM 2022

  49. arXiv:2004.11692  [pdf, other]

    cs.SI physics.soc-ph

    Dynamic topic modeling of the COVID-19 Twitter narrative among U.S. governors and cabinet executives

    Authors: Hao Sha, Mohammad Al Hasan, George Mohler, P. Jeffrey Brantingham

    Abstract: A combination of federal and state-level decision making has shaped the response to COVID-19 in the United States. In this paper we analyze the Twitter narratives around this decision making by applying a dynamic topic model to COVID-19 related tweets by U.S. Governors and Presidential cabinet members. We use a network Hawkes binomial topic model to track evolving sub-topics around risk, testing a…

    Submitted 19 April, 2020; originally announced April 2020.

  50. arXiv:1904.09763  [pdf, other]

    cs.CV

    Water-Filling: An Efficient Algorithm for Digitized Document Shadow Removal

    Authors: Seungjun Jung, Muhammad Abul Hasan, Changick Kim

    Abstract: In this paper, we propose a novel algorithm to rectify the illumination of digitized documents by eliminating shading artifacts. First, a topographic surface of an input digitized document is created using the luminance value of each pixel. Then the shading artifact on the document is estimated by simulating an immersion process. The simulation of the immersion process is modeled using a novel diffu…

    Submitted 2 May, 2019; v1 submitted 22 April, 2019; originally announced April 2019.

    Comments: Accepted at Asian Conference on Computer Vision (2018)