Skip to main content

Showing 1–50 of 67 results for author: Abdullah, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.14903  [pdf, ps, other

    cs.CV

    DETONATE: A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization

    Authors: Renjith Prasad, Abhilekh Borah, Hasnat Md Abdullah, Chathurangi Shyalika, Gurpreet Singh, Ritvik Garimella, Rajarshi Roy, Harshul Surana, Nasrin Imanpour, Suranjana Trivedy, Amit Sheth, Amitava Das

    Abstract: Alignment is crucial for text-to-image (T2I) models to ensure that generated images faithfully capture user intent while maintaining safety and fairness. Direct Preference Optimization (DPO), prominent in large language models (LLMs), is extending its influence to T2I systems. This paper introduces DPO-Kernels for T2I models, a novel extension enhancing alignment across three dimensions: (i) Hybri… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: 59 pages, 10 figures

  2. arXiv:2506.13901  [pdf, ps, other

    cs.CL cs.AI

    Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations

    Authors: Abhilekh Borah, Chhavi Sharma, Danush Khanna, Utkarsh Bhatt, Gurpreet Singh, Hasnat Md Abdullah, Raghav Kaushik Ravi, Vinija Jain, Jyoti Patel, Shubham Singh, Vasu Sharma, Arpita Vats, Rahul Raja, Aman Chadha, Amitava Das

    Abstract: Alignment is no longer a luxury, it is a necessity. As large language models (LLMs) enter high-stakes domains like education, healthcare, governance, and law, their behavior must reliably reflect human-aligned values and safety constraints. Yet current evaluations rely heavily on behavioral proxies such as refusal rates, G-Eval scores, and toxicity classifiers, all of which have critical blind spo… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2506.10983  [pdf

    cs.NE

    Is the Fitness Dependent Optimizer Ready for the Future of Optimization?

    Authors: Ardalan H. Awlla, Tarik A. Rashid, Ronak M. Abdullah

    Abstract: Metaheuristic algorithms are optimization methods that are inspired by real phenomena in nature or the behavior of living beings, e.g., animals, to be used for solving complex problems, as in engineering, energy optimization, health care, etc. One of them was the creation of the Fitness Dependent Optimizer (FDO) in 2019, which is based on bee-inspired swarm intelligence and provides efficient opti… ▽ More

    Submitted 23 January, 2025; originally announced June 2025.

    Comments: 21 pages

  4. arXiv:2506.09771  [pdf

    cs.CY

    Where Journalism Silenced Voices: Exploring Discrimination in the Representation of Indigenous Communities in Bangladesh

    Authors: Abhijit Paul, Adity Khisa, Zarif Masud, Sharif Md. Abdullah, Ahmedul Kabir, Shebuti Rayana

    Abstract: In this paper, we examine the intersections of indigeneity and media representation in shaping perceptions of indigenous communities in Bangladesh. Using a mixed-methods approach, we combine quantitative analysis of media data with qualitative insights from focus group discussions (FGD). First, we identify a total of 4,893 indigenous-related articles from our initial dataset of 2.2 million newspap… ▽ More

    Submitted 11 June, 2025; originally announced June 2025.

  5. arXiv:2506.02995  [pdf, ps, other

    cs.CL

    It's Not a Walk in the Park! Challenges of Idiom Translation in Speech-to-text Systems

    Authors: Iuliia Zaitova, Badr M. Abdullah, Wei Xue, Dietrich Klakow, Bernd Möbius, Tania Avgustinova

    Abstract: Idioms are defined as a group of words with a figurative meaning not deducible from their individual components. Although modern machine translation systems have made remarkable progress, translating idioms remains a major challenge, especially for speech-to-text systems, where research on this topic is notably sparse. In this paper, we systematically evaluate idiom translation as compared to conv… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 13 pages, 3 figures, ACL 2025

  6. arXiv:2505.24713  [pdf, other

    cs.CL cs.SD eess.AS

    Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification

    Authors: Badr M. Abdullah, Matthew Baas, Bernd Möbius, Dietrich Klakow

    Abstract: Arabic dialect identification (ADI) systems are essential for large-scale data collection pipelines that enable the development of inclusive speech technologies for Arabic language varieties. However, the reliability of current ADI systems is limited by poor generalization to out-of-domain speech. In this paper, we present an effective approach based on voice conversion for training ADI models tha… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: Accepted in Interspeech 2025

  7. arXiv:2505.17217  [pdf, ps, other

    cs.CL cs.AI cs.CY

    Mitigating Gender Bias via Fostering Exploratory Thinking in LLMs

    Authors: Kangda Wei, Hasnat Md Abdullah, Ruihong Huang

    Abstract: Large Language Models (LLMs) often exhibit gender bias, resulting in unequal treatment of male and female subjects across different contexts. To address this issue, we propose a novel data generation framework that fosters exploratory thinking in LLMs. Our approach prompts models to generate story pairs featuring male and female protagonists in structurally identical, morally ambiguous scenarios,… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

  8. arXiv:2505.06062  [pdf, other

    cs.CL

    Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

    Authors: Iuliia Zaitova, Vitalii Hirak, Badr M. Abdullah, Dietrich Klakow, Bernd Möbius, Tania Avgustinova

    Abstract: This study analyzes the attention patterns of fine-tuned encoder-only models based on the BERT architecture (BERT-based models) towards two distinct types of Multiword Expressions (MWEs): idioms and microsyntactic units (MSUs). Idioms present challenges in semantic non-compositionality, whereas MSUs demonstrate unconventional syntactic behavior that does not conform to standard grammatical categor… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 10 pages, 3 figures. Findings 2025

    Journal ref: In Findings of the Association for Computational Linguistics: NAACL 2025, pages 4083–4092, Albuquerque, New Mexico https://aclanthology.org/2025.findings-naacl.228/

  9. arXiv:2504.03906  [pdf, other

    cs.CL

    CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ)

    Authors: Abhilekh Borah, Hasnat Md Abdullah, Kangda Wei, Ruihong Huang

    Abstract: The rise of Large Language Models (LLMs) has raised questions about their ability to understand climate-related contexts. Though climate change dominates social media, analyzing its multimodal expressions is understudied, and current tools have failed to determine whether LLMs amplify credible solutions or spread unsubstantiated claims. To address this, we introduce CliME (Climate Change Multimoda… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: 16 pages, 9 figures

  10. arXiv:2504.02293  [pdf, other

    cs.CL cs.AI

    State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla

    Authors: Sharif Md. Abdullah, Abhijit Paul, Shebuti Rayana, Ahmedul Kabir, Zarif Masud

    Abstract: Despite a large deaf and dumb population of 1.7 million, Bangla Sign Language (BdSL) remains a understudied domain. Specifically, there are no works on Bangla text-to-gloss translation task. To address this gap, we begin by addressing the dataset problem. We take inspiration from grammatical rule based gloss generation used in Germany and American sign langauage (ASL) and adapt it for BdSL. We als… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: Initial Version

  11. arXiv:2503.03455  [pdf, other

    cs.SE

    Towards Continuous Experiment-driven MLOps

    Authors: Keerthiga Rajenthiram, Milad Abdullah, Ilias Gerostathopoulos, Petr Hnetynka, Tomáš Bureš, Gerard Pons, Besim Bilalli, Anna Queralt

    Abstract: Despite advancements in MLOps and AutoML, ML development still remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is lack of efficient mechanisms for continuous evolution of ML models which would leverage the knowledge gained in previous optimizations of the same or different models. We propose an expe… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  12. arXiv:2501.06602  [pdf, other

    cs.CV

    A Comparative Performance Analysis of Classification and Segmentation Models on Bangladeshi Pothole Dataset

    Authors: Antara Firoz Parsa, S. M. Abdullah, Anika Hasan Talukder, Md. Asif Shahidullah Kabbya, Shakib Al Hasan, Md. Farhadul Islam, Jannatun Noor

    Abstract: The study involves a comprehensive performance analysis of popular classification and segmentation models, applied over a Bangladeshi pothole dataset, being developed by the authors of this research. This custom dataset of 824 samples, collected from the streets of Dhaka and Bogura performs competitively against the existing industrial and custom datasets utilized in the present literature. The da… ▽ More

    Submitted 11 January, 2025; originally announced January 2025.

    Comments: 8 Tables, 7 Figures

  13. arXiv:2411.16754  [pdf, other

    cs.CV cs.AI

    Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)

    Authors: Nasrin Imanpour, Shashwat Bajpai, Subhankar Ghosh, Sainath Reddy Sankepally, Abhilekh Borah, Hasnat Md Abdullah, Nishoak Kosaraju, Shreyas Dixit, Ashhar Aziz, Shwetangshu Biswas, Vinija Jain, Aman Chadha, Amit Sheth, Amitava Das

    Abstract: The proliferation of AI techniques for image generation, coupled with their increasing accessibility, has raised significant concerns about the potential misuse of these images to spread misinformation. Recent AI-generated image detection (AGID) methods include CNNDetection, NPR, DM Image Detection, Fake Image Detection, DIRE, LASTED, GAN Image Detection, AIDE, SSP, DRCT, RINE, OCC-CLIP, De-Fake,… ▽ More

    Submitted 24 November, 2024; originally announced November 2024.

    Comments: 13 pages, 9 figures

  14. arXiv:2410.01180  [pdf, other

    cs.CV cs.CL

    UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark

    Authors: Hasnat Md Abdullah, Tian Liu, Kangda Wei, Shu Kong, Ruihong Huang

    Abstract: Localizing unusual activities, such as human errors or surveillance incidents, in videos holds practical significance. However, current video understanding models struggle with localizing these unusual events likely because of their insufficient representation in models' pretraining datasets. To explore foundation models' capability in localizing unusual activity, we introduce UAL-Bench, a compreh… ▽ More

    Submitted 1 October, 2024; originally announced October 2024.

    Journal ref: wacv(2025) 5801-5811

  15. On the Encoding of Gender in Transformer-based ASR Representations

    Authors: Aravind Krishnan, Badr M. Abdullah, Dietrich Klakow

    Abstract: While existing literature relies on performance differences to uncover gender biases in ASR models, a deeper analysis is essential to understand how gender is encoded and utilized during transcript generation. This work investigates the encoding and utilization of gender in the latent representations of two transformer-based ASR models, Wav2Vec2 and HuBERT. Using linear erasure, we demonstrate the… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted at Interspeech 2024

  16. arXiv:2404.17083  [pdf, other

    eess.IV cs.CV

    Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation

    Authors: Muhammad Abdullah, Anne Querfurth, Deepak Bhatia, Mahdi Mantash

    Abstract: This paper investigates the use of deep learning approaches to estimate the femur caput-collum-diaphyseal (CCD) angle from X-ray images. The CCD angle is an important measurement in the diagnosis of hip problems, and correct prediction can help in the planning of surgical procedures. Manual measurement of this angle, on the other hand, can be time-intensive and vulnerable to inter-observer variabi… ▽ More

    Submitted 26 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  17. arXiv:2404.16212  [pdf, other

    cs.CR cs.CV cs.LG

    An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape

    Authors: Sifat Muhammad Abdullah, Aravind Cheruvu, Shravya Kanchi, Taejoong Chung, Peng Gao, Murtuza Jadliwala, Bimal Viswanath

    Abstract: Deepfake or synthetic images produced using deep generative models pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developm… ▽ More

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: Accepted to IEEE S&P 2024; 19 pages, 10 figures

  18. arXiv:2402.15943  [pdf

    cs.SE cs.AI

    Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware

    Authors: Ahmed E. Hassan, Dayi Lin, Gopi Krishnan Rajbahadur, Keheliya Gallaba, Filipe R. Cogo, Boyuan Chen, Haoxiang Zhang, Kishanthan Thangarajah, Gustavo Ansaldi Oliva, Jiahuei Lin, Wali Mohammad Abdullah, Zhen Ming Jiang

    Abstract: Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized software development by enabling new use cases and business models. We refer to software built using FMs as FMware. The unique properties of FMware (e.g., prompts, agents, and the need for orchestration), coupled with the intrinsic limitations of FMs (e.g., hallucination) lead to a completely new set of software eng… ▽ More

    Submitted 3 March, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  19. arXiv:2312.07338  [pdf, other

    cs.CL cs.SD eess.AS

    Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification

    Authors: Mohammed Maqsood Shaik, Dietrich Klakow, Badr M. Abdullah

    Abstract: Pre-trained Transformer-based speech models have shown striking performance when fine-tuned on various downstream tasks such as automatic speech recognition and spoken language identification (SLID). However, the problem of domain mismatch remains a challenge in this area, where the domain of the pre-training data might differ from that of the downstream labeled data used for fine-tuning. In multi… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: Submitted to ICASSP 2024

  20. SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification

    Authors: S. M. Nabil Ashraf, Md. Adyelullahil Mamun, Hasnat Md. Abdullah, Md. Golam Rabiul Alam

    Abstract: Chest X-rays are widely used to diagnose thoracic diseases, but the lack of detailed information about these abnormalities makes it challenging to develop accurate automated diagnosis systems, which is crucial for early detection and effective treatment. To address this challenge, we employed deep learning techniques to identify patterns in chest X-rays that correspond to different diseases. We co… ▽ More

    Submitted 22 May, 2024; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: Published in International Conference on Computer and Information Technology (ICCIT) 2023

    ACM Class: I.4; I.5

    Journal ref: 2023 26th International Conference on Computer and Information Technology (ICCIT), Cox's Bazar, Bangladesh, 2023, pp. 1-6

  21. arXiv:2309.11646  [pdf, other

    cs.LG

    An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder

    Authors: Rownak Ara Rasul, Promy Saha, Diponkor Bala, S M Rakib Ul Karim, Md. Ibrahim Abdullah, Bishwajit Saha

    Abstract: Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify cru… ▽ More

    Submitted 28 December, 2023; v1 submitted 20 September, 2023; originally announced September 2023.

    Comments: 20 pages, 12 figures, 8 tables

  22. Ensemble-based modeling abstractions for modern self-optimizing systems

    Authors: Michal Töpfer, Milad Abdullah, Tomáš Bureš, Petr Hnětynka, Martin Kruliš

    Abstract: In this paper, we extend our ensemble-based component model DEECo with the capability to use machine-learning and optimization heuristics in establishing and reconfiguration of autonomic component ensembles. We show how to capture these concepts on the model level and give an example of how such a model can be beneficially used for modeling access-control related problem in the Industry 4.0 settin… ▽ More

    Submitted 11 September, 2023; originally announced September 2023.

    Comments: This is the authors' version of the paper - M. Töpfer, M. Abdullah, T. Bureš, P. Hnětynka, M. Kruliš: Ensemble-Based Modeling Abstractions for Modern Self-optimizing Systems, in Proceedings of ISOLA 2022, Rhodes, Greece, pp. 318-334, 2022. The final authenticated publication is available online at https://doi.org/10.1007/978-3-031-19759-8_20

  23. arXiv:2308.06303  [pdf

    econ.GN cs.LG

    A New Approach to Overcoming Zero Trade in Gravity Models to Avoid Indefinite Values in Linear Logarithmic Equations and Parameter Verification Using Machine Learning

    Authors: Mikrajuddin Abdullah

    Abstract: The presence of a high number of zero flow trades continues to provide a challenge in identifying gravity parameters to explain international trade using the gravity model. Linear regression with a logarithmic linear equation encounters an indefinite value on the logarithmic trade. Although several approaches to solving this problem have been proposed, the majority of them are no longer based on l… ▽ More

    Submitted 11 August, 2023; originally announced August 2023.

    Comments: 20 pages, 6 figures

  24. arXiv:2306.02405  [pdf, other

    cs.CL

    An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech

    Authors: Badr M. Abdullah, Mohammed Maqsood Shaik, Bernd Möbius, Dietrich Klakow

    Abstract: Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract phonetic categories such as phonemes. In this paper, we develop an information-theoretic framework whereby we represent each phonetic category as a dis… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Accepted in Interspeech 2023

  25. arXiv:2304.11046  [pdf

    cs.SD cs.AI cs.CL cs.HC cs.LG

    Affective social anthropomorphic intelligent system

    Authors: Md. Adyelullahil Mamun, Hasnat Md. Abdullah, Md. Golam Rabiul Alam, Muhammad Mehedi Hassan, Md. Zia Uddin

    Abstract: Human conversational styles are measured by the sense of humor, personality, and tone of voice. These characteristics have become essential for conversational intelligent virtual assistants. However, most of the state-of-the-art intelligent virtual assistants (IVAs) are failed to interpret the affective semantics of human voices. This research proposes an anthropomorphic intelligent system that ca… ▽ More

    Submitted 19 April, 2023; originally announced April 2023.

    Comments: Multimedia Tools and Applications (2023)

  26. arXiv:2303.03873  [pdf

    eess.SY cs.AI cs.LG

    Developing the Reliable Shallow Supervised Learning for Thermal Comfort using ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II

    Authors: Kanisius Karyono, Badr M. Abdullah, Alison J. Cotgrave, Ana Bras, Jeff Cullen

    Abstract: The artificial intelligence (AI) system designer for thermal comfort faces insufficient data recorded from the current user or overfitting due to unreliable training data. This work introduces the reliable data set for training the AI subsystem for thermal comfort. This paper presents the control algorithm based on shallow supervised learning, which is simple enough to be implemented in the Intern… ▽ More

    Submitted 3 March, 2023; originally announced March 2023.

    Comments: 15 pages with Appendix

    Report number: https://ieeexplore.ieee.org/document/10471265 MSC Class: 93 ACM Class: I.2.1; I.2.6

    Journal ref: 2024, Aug Vol 1

  27. arXiv:2302.05519  [pdf

    cs.NE

    Multi objective Fitness Dependent Optimizer Algorithm

    Authors: Jaza M. Abdullah, Tarik A. Rashid, Bestan B. Maaroof, Seyedali Mirjalili

    Abstract: This paper proposes the multi objective variant of the recently introduced fitness dependent optimizer (FDO). The algorithm is called a Multi objective Fitness Dependent Optimizer (MOFDO) and is equipped with all five types of knowledge (situational, normative, topographical, domain, and historical knowledge) as in FDO. MOFDO is tested on two standard benchmarks for the performance-proof purpose;… ▽ More

    Submitted 26 January, 2023; originally announced February 2023.

    Comments: 29 pages

  28. arXiv:2301.03012  [pdf, other

    cs.CL

    Analyzing the Representational Geometry of Acoustic Word Embeddings

    Authors: Badr M. Abdullah, Dietrich Klakow

    Abstract: Acoustic word embeddings (AWEs) are vector representations such that different acoustic exemplars of the same word are projected nearby in the embedding space. In addition to their use in speech technology applications such as spoken term discovery and keyword spotting, AWE models have been adopted as models of spoken-word processing in several cognitively motivated studies and have been shown to… ▽ More

    Submitted 8 January, 2023; originally announced January 2023.

    Comments: In BlackboxNLP workshop, EMNLP 2022 [ oral presentation ]

  29. Huruf: An Application for Arabic Handwritten Character Recognition Using Deep Learning

    Authors: Minhaz Kamal, Fairuz Shaiara, Chowdhury Mohammad Abdullah, Sabbir Ahmed, Tasnim Ahmed, Md. Hasanul Kabir

    Abstract: Handwriting Recognition has been a field of great interest in the Artificial Intelligence domain. Due to its broad use cases in real life, research has been conducted widely on it. Prominent work has been done in this field focusing mainly on Latin characters. However, the domain of Arabic handwritten character recognition is still relatively unexplored. The inherent cursive nature of the Arabic c… ▽ More

    Submitted 24 December, 2022; v1 submitted 16 December, 2022; originally announced December 2022.

    Comments: Accepted in 25th ICCIT (6 pages, 4 tables, 4 figures)

    Report number: 10054769

    Journal ref: 2022 25th International Conference on Computer and Information Technology (ICCIT)

  30. arXiv:2210.09421  [pdf, other

    cs.CR cs.CL cs.LG

    Deepfake Text Detection: Limitations and Opportunities

    Authors: Jiameng Pu, Zain Sarwar, Sifat Muhammad Abdullah, Abdullah Rehman, Yoonjin Kim, Parantapa Bhattacharya, Mobin Javed, Bimal Viswanath

    Abstract: Recent advances in generative models for language have enabled the creation of convincing synthetic text or deepfake text. Prior work has demonstrated the potential for misuse of deepfake text to mislead content consumers. Therefore, deepfake text detection, the task of discriminating between human and machine-generated text, is becoming increasingly critical. Several defenses have been proposed f… ▽ More

    Submitted 17 October, 2022; originally announced October 2022.

    Comments: Accepted to IEEE S&P 2023; First two authors contributed equally to this work; 18 pages, 7 figures

  31. arXiv:2209.06633  [pdf, other

    cs.CL eess.AS

    Integrating Form and Meaning: A Multi-Task Learning Model for Acoustic Word Embeddings

    Authors: Badr M. Abdullah, Bernd Möbius, Dietrich Klakow

    Abstract: Models of acoustic word embeddings (AWEs) learn to map variable-length spoken word segments onto fixed-dimensionality vector representations such that different acoustic exemplars of the same word are projected nearby in the embedding space. In addition to their speech technology applications, AWE models have been shown to predict human performance on a variety of auditory lexical processing tasks… ▽ More

    Submitted 18 September, 2022; v1 submitted 14 September, 2022; originally announced September 2022.

    Comments: Accepted in INTERSPEECH 2022

  32. Harmony Search: Current Studies and Uses on Healthcare Systems

    Authors: Maryam T. Abdulkhaleq, Tarik A. Rashid, Abeer Alsadoon, Bryar A. Hassan, Mokhtar Mohammadi, Jaza M. Abdullah, Amit Chhabra, Sazan L. Ali, Rawshan N. Othman, Hadil A. Hasan, Sara Azad, Naz A. Mahmood, Sivan S. Abdalrahman, Hezha O. Rasul, Nebojsa Bacanin, S. Vimal

    Abstract: One of the popular metaheuristic search algorithms is Harmony Search (HS). It has been verified that HS can find solutions to optimization problems due to its balanced exploratory and convergence behavior and its simple and flexible structure. This capability makes the algorithm preferable to be applied in several real-world applications in various fields, including healthcare systems, different e… ▽ More

    Submitted 19 July, 2022; originally announced July 2022.

    Comments: 37 pages

    Journal ref: Artificial Intelligence in Medicine, 2022

  33. arXiv:2207.04846  [pdf

    cs.NE

    Fitness Dependent Optimizer for IoT Healthcare using Adapted Parameters: A Case Study Implementation

    Authors: Aso M. Aladdin, Jaza M. Abdullah, Kazhan Othman Mohammed Salih, Tarik A. Rashid, Rafid Sagban, Abeer Alsaddon, Nebojsa Bacanin, Amit Chhabra, S. Vimal, Indradip Banerjee

    Abstract: This discusses a case study on Fitness Dependent Optimizer or so-called FDO and adapting its parameters to the Internet of Things (IoT) healthcare. The reproductive way is sparked by the bee swarm and the collaborative decision-making of FDO. As opposed to the honey bee or artificial bee colony algorithms, this algorithm has no connection to them. In FDO, the search agent's position is updated usi… ▽ More

    Submitted 18 May, 2022; originally announced July 2022.

    Comments: 17 pages

    Journal ref: -

  34. arXiv:2204.00547  [pdf, other

    cs.SE

    A Web-Based Tool for Comparative Process Mining

    Authors: Madhavi Bangalore Shankara Narayana, Elisabetta Benevento, Marco Pegoraro, Muhammad Abdullah, Rahim Bin Shahid, Qasim Sajid, Muhammad Usman Mansoor, Wil M. P. van der Aalst

    Abstract: Process mining techniques enable the analysis of a wide variety of processes using event data. Among the available process mining techniques, most consider a single process perspective at a time-in the shape of a model or log. In this paper, we have developed a tool that can compare and visualize the same process under different constraints, allowing to analyze multiple aspects of the process. We… ▽ More

    Submitted 4 April, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Comments: 2 pages, 2 figures, 6 references

  35. arXiv:2109.10179  [pdf, other

    cs.CL

    How Familiar Does That Sound? Cross-Lingual Representational Similarity Analysis of Acoustic Word Embeddings

    Authors: Badr M. Abdullah, Iuliia Zaitova, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: How do neural networks "perceive" speech sounds from unknown languages? Does the typological similarity between the model's training language (L1) and an unknown language (L2) have an impact on the model representations of L2 speech signals? To answer these questions, we present a novel experimental design based on representational similarity analysis (RSA) to analyze acoustic word embeddings (AWE… ▽ More

    Submitted 21 September, 2021; originally announced September 2021.

    Comments: BlackboxNLP 2021

  36. arXiv:2106.08686  [pdf, other

    cs.CL cs.SD eess.AS

    Do Acoustic Word Embeddings Capture Phonological Similarity? An Empirical Study

    Authors: Badr M. Abdullah, Marius Mosbach, Iuliia Zaitova, Bernd Möbius, Dietrich Klakow

    Abstract: Several variants of deep neural networks have been successfully employed for building parametric models that project variable-duration spoken word segments onto fixed-size vector representations, or acoustic word embeddings (AWEs). However, it remains unclear to what degree we can rely on the distance in the emerging AWE space as an estimate of word-form similarity. In this paper, we ask: does the… ▽ More

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: Accepted in Interspeech 2021

  37. arXiv:2106.03895  [pdf, other

    cs.CL cs.SD eess.AS

    SIGTYP 2021 Shared Task: Robust Spoken Language Identification

    Authors: Elizabeth Salesky, Badr M. Abdullah, Sabrina J. Mielke, Elena Klyachko, Oleg Serikov, Edoardo Ponti, Ritesh Kumar, Ryan Cotterell, Ekaterina Vylomova

    Abstract: While language identification is a fundamental speech and language processing task, for many languages and language families it remains a challenging task. For many low-resource and endangered languages this is in part due to resource availability: where larger datasets exist, they may be single-speaker or have different domains than desired application scenarios, demanding a need for domain and s… ▽ More

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: The first three authors contributed equally

  38. HEVC Watermarking Techniques for Authentication and Copyright Applications: Challenges and Opportunities

    Authors: Ali A. Elrowayati, Mohamed A. Alrshah, M. F. L. Abdullah, Rohaya Latip

    Abstract: Recently, High-Efficiency Video Coding (HEVC/H.265) has been chosen to replace previous video coding standards, such as H.263 and H.264. Despite the efficiency of HEVC, it still lacks reliable and practical functionalities to support authentication and copyright applications. In order to provide this support, several watermarking techniques have been proposed by many researchers during the last fe… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

    Comments: Review article, 20 pages

  39. arXiv:2011.00960  [pdf, other

    cs.CL

    A Closer Look at Linguistic Knowledge in Masked Language Models: The Case of Relative Clauses in American English

    Authors: Marius Mosbach, Stefania Degaetano-Ortlieb, Marie-Pauline Krielke, Badr M. Abdullah, Dietrich Klakow

    Abstract: Transformer-based language models achieve high performance on various tasks, but we still lack understanding of the kind of linguistic knowledge they learn and rely on. We evaluate three models (BERT, RoBERTa, and ALBERT), testing their grammatical and semantic knowledge by sentence-level probing, diagnostic cases, and masked prediction tasks. We focus on relative clauses (in American English) as… ▽ More

    Submitted 2 November, 2020; originally announced November 2020.

    Comments: Accepted to COLING 2020

  40. arXiv:2010.12913  [pdf, other

    cs.CV

    Classifying Eye-Tracking Data Using Saliency Maps

    Authors: Shafin Rahman, Sejuti Rahman, Omar Shahid, Md. Tahmeed Abdullah, Jubair Ahmed Sourov

    Abstract: A plethora of research in the literature shows how human eye fixation pattern varies depending on different factors, including genetics, age, social functioning, cognitive functioning, and so on. Analysis of these variations in visual attention has already elicited two potential research avenues: 1) determining the physiological or psychological state of the subject and 2) predicting the tasks ass… ▽ More

    Submitted 24 October, 2020; originally announced October 2020.

    Comments: Accepted in: International Conference on Pattern Recognition (ICPR)

  41. arXiv:2010.11973  [pdf, other

    cs.CL

    Rediscovering the Slavic Continuum in Representations Emerging from Neural Models of Spoken Language Identification

    Authors: Badr M. Abdullah, Jacek Kudera, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: Deep neural networks have been employed for various spoken language recognition tasks, including tasks that are multilingual by definition such as spoken language identification. In this paper, we present a neural model for Slavic language identification in speech signals and analyze its emergent representations to investigate whether they reflect objective measures of language relatedness and/or… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Accepted in VarDial 2020 Workshop

  42. arXiv:2009.02805  [pdf, other

    eess.IV cs.CV

    The 2ST-UNet for Pneumothorax Segmentation in Chest X-Rays using ResNet34 as a Backbone for U-Net

    Authors: Ayat Abedalla, Malak Abdullah, Mahmoud Al-Ayyoub, Elhadj Benkhelifa

    Abstract: Pneumothorax, also called a collapsed lung, refers to the presence of the air in the pleural space between the lung and chest wall. It can be small (no need for treatment), or large and causes death if it is not identified and treated on time. It is easily seen and identified by experts using a chest X-ray. Although this method is mostly error-free, it is time-consuming and needs expert radiologis… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

  43. arXiv:2008.00545  [pdf, other

    eess.AS cs.CL

    Cross-Domain Adaptation of Spoken Language Identification for Related Languages: The Curious Case of Slavic Languages

    Authors: Badr M. Abdullah, Tania Avgustinova, Bernd Möbius, Dietrich Klakow

    Abstract: State-of-the-art spoken language identification (LID) systems, which are based on end-to-end deep neural networks, have shown remarkable success not only in discriminating between distant languages but also between closely-related languages or even different spoken varieties of the same language. However, it is still unclear to what extent neural LID models generalize to speech samples with differ… ▽ More

    Submitted 6 August, 2020; v1 submitted 2 August, 2020; originally announced August 2020.

    Comments: To appear in INTERSPEECH 2020

  44. arXiv:2006.09436  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    SAMBA: Safe Model-Based & Active Reinforcement Learning

    Authors: Alexander I. Cowen-Rivers, Daniel Palenicek, Vincent Moens, Mohammed Abdullah, Aivar Sootla, Jun Wang, Haitham Ammar

    Abstract: In this paper, we propose SAMBA, a novel framework for safe reinforcement learning that combines aspects from probabilistic modelling, information theory, and statistics. Our method builds upon PILCO to enable active exploration using novel(semi-)metrics for out-of-sample Gaussian process evaluation optimised through a multi-objective problem that supports conditional-value-at-risk constraints. We… ▽ More

    Submitted 12 June, 2020; originally announced June 2020.

  45. arXiv:2006.07350  [pdf, other

    cs.CR cs.SE

    Exploiting ML algorithms for Efficient Detection and Prevention of JavaScript-XSS Attacks in Android Based Hybrid Applications

    Authors: Usama Khalid, Muhammad Abdullah, Kashif Inayat

    Abstract: The development and analysis of mobile applications in term of security have become an active research area from many years as many apps are vulnerable to different attacks. Especially the concept of hybrid applications has emerged in the last three years where applications are developed in both native and web languages because the use of web languages raises certain security risks in hybrid mobil… ▽ More

    Submitted 30 July, 2020; v1 submitted 12 June, 2020; originally announced June 2020.

  46. arXiv:2006.07345  [pdf, other

    cs.CV cs.CR

    Robust Baggage Detection and Classification Based on Local Tri-directional Pattern

    Authors: Shahbano, Muhammad Abdullah, Kashif Inayat

    Abstract: In recent decades, the automatic video surveillance system has gained significant importance in computer vision community. The crucial objective of surveillance is monitoring and security in public places. In the traditional Local Binary Pattern, the feature description is somehow inaccurate, and the feature size is large enough. Therefore, to overcome these shortcomings, our research proposed a d… ▽ More

    Submitted 31 January, 2021; v1 submitted 12 June, 2020; originally announced June 2020.

    Journal ref: International Journal of Internet Technology and Secured Transactions (2021)

  47. arXiv:2004.02543  [pdf

    cs.CR cs.DC

    SmartCoAuth: Smart-Contract privacy preservation mechanism on querying sensitive records in the cloud

    Authors: Muhammed Siraj, Mohd. Izuan Hafez Hj. Ninggal, Nur Izura Udzir, Muhammad Daniel Hafiz Abdullah, Aziah Asmawi

    Abstract: Sensitive records stored in the cloud such as healthcare records, private conversation and credit card information are targets of hackers and privacy abuse. Current information and record management systems have difficulties achieving privacy protection of such sensitive records in a secure, transparent, decentralized and trustless environment. The Blockchain technology is a nascent and a promisin… ▽ More

    Submitted 6 April, 2020; originally announced April 2020.

  48. arXiv:1912.00233  [pdf

    eess.IV cs.CV

    Convolutional neural networks model improvements using demographics and image processing filters on chest x-rays

    Authors: Mir Muhammad Abdullah, Mir Muhammad Abdur Rahman, Mir Mohammed Assadullah

    Abstract: Purpose: The purpose of this study was to observe change in accuracies of convolutional neural networks (CNN) models (ratio of correct classifications to total predictions) on thoracic radiological images by creating different binary classification models based on age, gender, and image pre-processing filters on 14 pathologies. Methodology: This is a quantitative research exploring variation in… ▽ More

    Submitted 30 November, 2019; originally announced December 2019.

    Comments: 27 pages

  49. arXiv:1907.13196  [pdf, other

    cs.LG cs.AI stat.ML

    Wasserstein Robust Reinforcement Learning

    Authors: Mohammed Amin Abdullah, Hang Ren, Haitham Bou Ammar, Vladimir Milenkovic, Rui Luo, Mingtian Zhang, Jun Wang

    Abstract: Reinforcement learning algorithms, though successful, tend to over-fit to training environments hampering their application to the real-world. This paper proposes $\text{W}\text{R}^{2}\text{L}$ -- a robust reinforcement learning algorithm with significant robust performance on low and high-dimensional control tasks. Our method formalises robust reinforcement learning as a novel min-max game with a… ▽ More

    Submitted 16 September, 2019; v1 submitted 30 July, 2019; originally announced July 2019.

  50. arXiv:1904.11033  [pdf, ps, other

    cs.IT cs.CR

    Optimal Downlink Transmission for Cell Free SWIPT Massive MIMO Systems with Active Eavesdropping

    Authors: Mahmoud Alageli, Aissa Ikhlef, Fahad Alsifiany, Mohammed A. M. Abdullah, Gaojie Chen, Jonathon Chambers

    Abstract: This paper considers secure simultaneous wireless information and power transfer (SWIPT) in cell-free massive multiple-input multiple-output (MIMO) systems. The system consists of a large number of randomly (Poisson-distributed) located access points (APs) serving multiple information users (IUs) and an information-untrusted dual-antenna active energy harvester (EH). The active EH uses one antenna… ▽ More

    Submitted 23 April, 2019; originally announced April 2019.