-
RAILS: Retrieval-Augmented Intelligence for Learning Software Development
Authors:
Wali Mohammad Abdullah,
Md. Morshedul Islam,
Devraj Parmar,
Happy Hasmukhbhai Patel,
Sindhuja Prabhakaran,
Baidya Saha
Abstract:
Large Language Models (LLMs) like GPT-3.5-Turbo are increasingly used to assist software development, yet they often produce incomplete code or incorrect imports, especially when lacking access to external or project-specific documentation. We introduce RAILS (Retrieval-Augmented Intelligence for Learning Software Development), a framework that augments LLM prompts with semantically retrieved cont…
▽ More
Large Language Models (LLMs) like GPT-3.5-Turbo are increasingly used to assist software development, yet they often produce incomplete code or incorrect imports, especially when lacking access to external or project-specific documentation. We introduce RAILS (Retrieval-Augmented Intelligence for Learning Software Development), a framework that augments LLM prompts with semantically retrieved context from curated Java resources using FAISS and OpenAI embeddings. RAILS incorporates an iterative validation loop guided by compiler feedback to refine suggestions. We evaluated RAILS on 78 real-world Java import error cases spanning standard libraries, GUI APIs, external tools, and custom utilities. Despite using the same LLM, RAILS outperforms baseline prompting by preserving intent, avoiding hallucinations, and surfacing correct imports even when libraries are unavailable locally. Future work will integrate symbolic filtering via PostgreSQL and extend support to other languages and IDEs.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
P4OMP: Retrieval-Augmented Prompting for OpenMP Parallelism in Serial Code
Authors:
Wali Mohammad Abdullah,
Azmain Kabir
Abstract:
We present P4OMP, a retrieval-augmented framework for transforming serial C/C++ code into OpenMP-annotated parallel code using large language models (LLMs). To our knowledge, this is the first system to apply retrieval-based prompting for OpenMP pragma correctness without model fine-tuning or compiler instrumentation. P4OMP leverages Retrieval-Augmented Generation (RAG) with structured instruction…
▽ More
We present P4OMP, a retrieval-augmented framework for transforming serial C/C++ code into OpenMP-annotated parallel code using large language models (LLMs). To our knowledge, this is the first system to apply retrieval-based prompting for OpenMP pragma correctness without model fine-tuning or compiler instrumentation. P4OMP leverages Retrieval-Augmented Generation (RAG) with structured instructional knowledge from OpenMP tutorials to improve the reliability of prompt-driven code generation. By grounding generation in the retrieved context, P4OMP improves syntactic correctness compared to baseline prompting with GPT-3.5-Turbo. We evaluate P4OMP against a baseline, GPT-3.5-Turbo without retrieval, on a comprehensive benchmark of 108 real-world C++ programs drawn from Stack Overflow, PolyBench, and NAS benchmark suites. P4OMP achieves 100% compilation success on all parallelizable cases, while the baseline fails to compile in 20 out of 108 cases. Six cases that rely on non-random-access iterators or thread-unsafe constructs are excluded due to fundamental OpenMP limitations. A detailed analysis demonstrates how P4OMP consistently avoids scoping errors, syntactic misuse, and invalid directive combinations that commonly affect baseline-generated code. We further demonstrate strong runtime scaling across seven compute-intensive benchmarks on an HPC cluster. P4OMP offers a robust, modular pipeline that significantly improves the reliability and applicability of LLM-generated OpenMP code.
△ Less
Submitted 27 June, 2025;
originally announced June 2025.
-
DETONATE: A Benchmark for Text-to-Image Alignment and Kernelized Direct Preference Optimization
Authors:
Renjith Prasad,
Abhilekh Borah,
Hasnat Md Abdullah,
Chathurangi Shyalika,
Gurpreet Singh,
Ritvik Garimella,
Rajarshi Roy,
Harshul Surana,
Nasrin Imanpour,
Suranjana Trivedy,
Amit Sheth,
Amitava Das
Abstract:
Alignment is crucial for text-to-image (T2I) models to ensure that generated images faithfully capture user intent while maintaining safety and fairness. Direct Preference Optimization (DPO), prominent in large language models (LLMs), is extending its influence to T2I systems. This paper introduces DPO-Kernels for T2I models, a novel extension enhancing alignment across three dimensions: (i) Hybri…
▽ More
Alignment is crucial for text-to-image (T2I) models to ensure that generated images faithfully capture user intent while maintaining safety and fairness. Direct Preference Optimization (DPO), prominent in large language models (LLMs), is extending its influence to T2I systems. This paper introduces DPO-Kernels for T2I models, a novel extension enhancing alignment across three dimensions: (i) Hybrid Loss, integrating embedding-based objectives with traditional probability-based loss for improved optimization; (ii) Kernelized Representations, employing Radial Basis Function (RBF), Polynomial, and Wavelet kernels for richer feature transformations and better separation between safe and unsafe inputs; and (iii) Divergence Selection, expanding beyond DPO's default Kullback-Leibler (KL) regularizer by incorporating Wasserstein and R'enyi divergences for enhanced stability and robustness. We introduce DETONATE, the first large-scale benchmark of its kind, comprising approximately 100K curated image pairs categorized as chosen and rejected. DETONATE encapsulates three axes of social bias and discrimination: Race, Gender, and Disability. Prompts are sourced from hate speech datasets, with images generated by leading T2I models including Stable Diffusion 3.5 Large, Stable Diffusion XL, and Midjourney. Additionally, we propose the Alignment Quality Index (AQI), a novel geometric measure quantifying latent-space separability of safe/unsafe image activations, revealing hidden vulnerabilities. Empirically, we demonstrate that DPO-Kernels maintain strong generalization bounds via Heavy-Tailed Self-Regularization (HT-SR). DETONATE and complete code are publicly released.
△ Less
Submitted 17 June, 2025;
originally announced June 2025.
-
Alignment Quality Index (AQI) : Beyond Refusals: AQI as an Intrinsic Alignment Diagnostic via Latent Geometry, Cluster Divergence, and Layer wise Pooled Representations
Authors:
Abhilekh Borah,
Chhavi Sharma,
Danush Khanna,
Utkarsh Bhatt,
Gurpreet Singh,
Hasnat Md Abdullah,
Raghav Kaushik Ravi,
Vinija Jain,
Jyoti Patel,
Shubham Singh,
Vasu Sharma,
Arpita Vats,
Rahul Raja,
Aman Chadha,
Amitava Das
Abstract:
Alignment is no longer a luxury, it is a necessity. As large language models (LLMs) enter high-stakes domains like education, healthcare, governance, and law, their behavior must reliably reflect human-aligned values and safety constraints. Yet current evaluations rely heavily on behavioral proxies such as refusal rates, G-Eval scores, and toxicity classifiers, all of which have critical blind spo…
▽ More
Alignment is no longer a luxury, it is a necessity. As large language models (LLMs) enter high-stakes domains like education, healthcare, governance, and law, their behavior must reliably reflect human-aligned values and safety constraints. Yet current evaluations rely heavily on behavioral proxies such as refusal rates, G-Eval scores, and toxicity classifiers, all of which have critical blind spots. Aligned models are often vulnerable to jailbreaking, stochasticity of generation, and alignment faking.
To address this issue, we introduce the Alignment Quality Index (AQI). This novel geometric and prompt-invariant metric empirically assesses LLM alignment by analyzing the separation of safe and unsafe activations in latent space. By combining measures such as the Davies-Bouldin Score (DBS), Dunn Index (DI), Xie-Beni Index (XBI), and Calinski-Harabasz Index (CHI) across various formulations, AQI captures clustering quality to detect hidden misalignments and jailbreak risks, even when outputs appear compliant. AQI also serves as an early warning signal for alignment faking, offering a robust, decoding invariant tool for behavior agnostic safety auditing.
Additionally, we propose the LITMUS dataset to facilitate robust evaluation under these challenging conditions. Empirical tests on LITMUS across different models trained under DPO, GRPO, and RLHF conditions demonstrate AQI's correlation with external judges and ability to reveal vulnerabilities missed by refusal metrics. We make our implementation publicly available to foster future research in this area.
△ Less
Submitted 16 June, 2025;
originally announced June 2025.
-
Is the Fitness Dependent Optimizer Ready for the Future of Optimization?
Authors:
Ardalan H. Awlla,
Tarik A. Rashid,
Ronak M. Abdullah
Abstract:
Metaheuristic algorithms are optimization methods that are inspired by real phenomena in nature or the behavior of living beings, e.g., animals, to be used for solving complex problems, as in engineering, energy optimization, health care, etc. One of them was the creation of the Fitness Dependent Optimizer (FDO) in 2019, which is based on bee-inspired swarm intelligence and provides efficient opti…
▽ More
Metaheuristic algorithms are optimization methods that are inspired by real phenomena in nature or the behavior of living beings, e.g., animals, to be used for solving complex problems, as in engineering, energy optimization, health care, etc. One of them was the creation of the Fitness Dependent Optimizer (FDO) in 2019, which is based on bee-inspired swarm intelligence and provides efficient optimization. This paper aims to introduce a comprehensive review of FDO, including its basic concepts, main variations, and applications from the beginning. It systematically gathers and examines every relevant paper, providing significant insights into the algorithm's pros and cons. The objective is to assess FDO's performance in several dimensions and to identify its strengths and weaknesses. This study uses a comparative analysis to show how well FDO and its variations work at solving real-world optimization problems, which helps us understand what they can do. Finally, this paper proposes future research directions that can help researchers further enhance the performance of FDO.
△ Less
Submitted 23 January, 2025;
originally announced June 2025.
-
Where Journalism Silenced Voices: Exploring Discrimination in the Representation of Indigenous Communities in Bangladesh
Authors:
Abhijit Paul,
Adity Khisa,
Zarif Masud,
Sharif Md. Abdullah,
Ahmedul Kabir,
Shebuti Rayana
Abstract:
In this paper, we examine the intersections of indigeneity and media representation in shaping perceptions of indigenous communities in Bangladesh. Using a mixed-methods approach, we combine quantitative analysis of media data with qualitative insights from focus group discussions (FGD). First, we identify a total of 4,893 indigenous-related articles from our initial dataset of 2.2 million newspap…
▽ More
In this paper, we examine the intersections of indigeneity and media representation in shaping perceptions of indigenous communities in Bangladesh. Using a mixed-methods approach, we combine quantitative analysis of media data with qualitative insights from focus group discussions (FGD). First, we identify a total of 4,893 indigenous-related articles from our initial dataset of 2.2 million newspaper articles, using a combination of keyword-based filtering and LLM, achieving 77% accuracy and an F1-score of 81.9\%. From manually inspecting 3 prominent Bangla newspapers, we identify 15 genres that we use as our topics for semi-supervised topic modeling using CorEx. Results show indigenous news articles have higher representation of culture and entertainment (19%, 10% higher than general news articles), and a disproportionate focus on conflict and protest (9%, 7% higher than general news). On the other hand, sentiment analysis reveals that 57% of articles on indigenous topics carry a negative tone, compared to 27% for non-indigenous related news. Drawing from communication studies, we further analyze framing, priming, and agenda-setting (frequency of themes) to support the case for discrimination in representation of indigenous news coverage. For the qualitative part of our analysis, we facilitated FGD, where participants further validated these findings. Participants unanimously expressed their feeling of being under-represented, and that critical issues affecting their communities (such as education, healthcare, and land rights) are systematically marginalized in news media coverage. By highlighting 8 cases of discrimination and media misrepresentation that were frequently mentioned by participants in the FGD, this study emphasizes the urgent need for more equitable media practices that accurately reflect the experiences and struggles of marginalized communities.
△ Less
Submitted 11 June, 2025;
originally announced June 2025.
-
It's Not a Walk in the Park! Challenges of Idiom Translation in Speech-to-text Systems
Authors:
Iuliia Zaitova,
Badr M. Abdullah,
Wei Xue,
Dietrich Klakow,
Bernd Möbius,
Tania Avgustinova
Abstract:
Idioms are defined as a group of words with a figurative meaning not deducible from their individual components. Although modern machine translation systems have made remarkable progress, translating idioms remains a major challenge, especially for speech-to-text systems, where research on this topic is notably sparse. In this paper, we systematically evaluate idiom translation as compared to conv…
▽ More
Idioms are defined as a group of words with a figurative meaning not deducible from their individual components. Although modern machine translation systems have made remarkable progress, translating idioms remains a major challenge, especially for speech-to-text systems, where research on this topic is notably sparse. In this paper, we systematically evaluate idiom translation as compared to conventional news translation in both text-to-text machine translation (MT) and speech-to-text translation (SLT) systems across two language pairs (German to English, Russian to English). We compare state-of-the-art end-to-end SLT systems (SeamlessM4T SLT-to-text, Whisper Large v3) with MT systems (SeamlessM4T SLT-to-text, No Language Left Behind), Large Language Models (DeepSeek, LLaMA) and cascaded alternatives. Our results reveal that SLT systems experience a pronounced performance drop on idiomatic data, often reverting to literal translations even in higher layers, whereas MT systems and Large Language Models demonstrate better handling of idioms. These findings underscore the need for idiom-specific strategies and improved internal representations in SLT architectures.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Deformed magic numbers at $N=$178 and $Z=$120, 124 in the 112 $\leq N \leq $ 190 superheavy region from Skyrme mean-field calculations
Authors:
W. Asous,
Mastura Syamimi Abdullah,
Meng-Hock Koh,
Kok-Siong Khoo
Abstract:
Background: Various motivations for exploration of superheavy region revolve around the question on whether 126 is a spherical proton magic number, as is the case for neutrons. In exploring this region, identification of nuclei with relatively longer half-life as compared to its neighbours is crucial for experimental studies. Such information is provided from theoretical predictions, which are how…
▽ More
Background: Various motivations for exploration of superheavy region revolve around the question on whether 126 is a spherical proton magic number, as is the case for neutrons. In exploring this region, identification of nuclei with relatively longer half-life as compared to its neighbours is crucial for experimental studies. Such information is provided from theoretical predictions, which are however, heavily dependent on the theoretical model used and observable quantities under investigation.
Purpose: Limiting ourselves to the Skyrme Hartree-Fock-plus-Bardeen-Cooper-Schrieffer approach, we aimed to analyse the appearance of a nuclear region with relatively high stability associated with emergence of spherical and deformed magic numbers in the region of $170 \le N \le 190$ ($112 \le Z \le 130$) based on various observables.
Methods: Three Skyrme parametrizations namely the SkM* frequently employed for fission calculations, and the SLy5 and SLy4 commonly used for superheavy region, are considered to provide comparisons within the Skyrme mean-field approach. We evaluated the variation of electric quadrupole deformation ($β_{20}$), binding energy per nucleon ($BE/A$), two-nucleon separation energy differential ($δS2_{q}$), alpha-decay energy ($Q_α$) and alpha-decay half-lives ($T_{1/2}$).
Conclusion: Our analyses suggest that neutron number $N = 178$ is candidate for deformed magic number around proton number $114 \le Z \le 118$. For protons, $Z = 120$ and $124$ appears to be a candidate for deformed magic number at around $N = 172 \sim 178$. Both sets of deformed magic numbers appear at oblate ground-state deformation.
△ Less
Submitted 3 June, 2025;
originally announced June 2025.
-
Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification
Authors:
Badr M. Abdullah,
Matthew Baas,
Bernd Möbius,
Dietrich Klakow
Abstract:
Arabic dialect identification (ADI) systems are essential for large-scale data collection pipelines that enable the development of inclusive speech technologies for Arabic language varieties. However, the reliability of current ADI systems is limited by poor generalization to out-of-domain speech. In this paper, we present an effective approach based on voice conversion for training ADI models tha…
▽ More
Arabic dialect identification (ADI) systems are essential for large-scale data collection pipelines that enable the development of inclusive speech technologies for Arabic language varieties. However, the reliability of current ADI systems is limited by poor generalization to out-of-domain speech. In this paper, we present an effective approach based on voice conversion for training ADI models that achieves state-of-the-art performance and significantly improves robustness in cross-domain scenarios. Evaluated on a newly collected real-world test set spanning four different domains, our approach yields consistent improvements of up to +34.1% in accuracy across domains. Furthermore, we present an analysis of our approach and demonstrate that voice conversion helps mitigate the speaker bias in the ADI dataset. We release our robust ADI model and cross-domain evaluation dataset to support the development of inclusive speech technologies for Arabic.
△ Less
Submitted 30 May, 2025;
originally announced May 2025.
-
A Unified Multi-Scale Attention-Based Network for Automatic 3D Segmentation of Lung Parenchyma & Nodules In Thoracic CT Images
Authors:
Muhammad Abdullah,
Furqan Shaukat
Abstract:
Lung cancer has been one of the major threats across the world with the highest mortalities. Computer-aided detection (CAD) can help in early detection and thus can help increase the survival rate. Accurate lung parenchyma segmentation (to include the juxta-pleural nodules) and lung nodule segmentation, the primary symptom of lung cancer, play a crucial role in the overall accuracy of the Lung CAD…
▽ More
Lung cancer has been one of the major threats across the world with the highest mortalities. Computer-aided detection (CAD) can help in early detection and thus can help increase the survival rate. Accurate lung parenchyma segmentation (to include the juxta-pleural nodules) and lung nodule segmentation, the primary symptom of lung cancer, play a crucial role in the overall accuracy of the Lung CAD pipeline. Lung nodule segmentation is quite challenging because of the diverse nodule types and other inhibit structures present within the lung lobes. Traditional machine/deep learning methods suffer from generalization and robustness. Recent Vision Language Models/Foundation Models perform well on the anatomical level, but they suffer on fine-grained segmentation tasks, and their semi-automatic nature limits their effectiveness in real-time clinical scenarios. In this paper, we propose a novel method for accurate 3D segmentation of lung parenchyma and lung nodules. The proposed architecture is an attention-based network with residual blocks at each encoder-decoder state. Max pooling is replaced by strided convolutions at the encoder, and trilinear interpolation is replaced by transposed convolutions at the decoder to maximize the number of learnable parameters. Dilated convolutions at each encoder-decoder stage allow the model to capture the larger context without increasing computational costs. The proposed method has been evaluated extensively on one of the largest publicly available datasets, namely LUNA16, and is compared with recent notable work in the domain using standard performance metrics like Dice score, IOU, etc. It can be seen from the results that the proposed method achieves better performance than state-of-the-art methods. The source code, datasets, and pre-processed data can be accessed using the link: https://github.com/EMeRALDsNRPU/Attention-Based-3D-ResUNet.
△ Less
Submitted 3 June, 2025; v1 submitted 23 May, 2025;
originally announced May 2025.
-
Mitigating Gender Bias via Fostering Exploratory Thinking in LLMs
Authors:
Kangda Wei,
Hasnat Md Abdullah,
Ruihong Huang
Abstract:
Large Language Models (LLMs) often exhibit gender bias, resulting in unequal treatment of male and female subjects across different contexts. To address this issue, we propose a novel data generation framework that fosters exploratory thinking in LLMs. Our approach prompts models to generate story pairs featuring male and female protagonists in structurally identical, morally ambiguous scenarios,…
▽ More
Large Language Models (LLMs) often exhibit gender bias, resulting in unequal treatment of male and female subjects across different contexts. To address this issue, we propose a novel data generation framework that fosters exploratory thinking in LLMs. Our approach prompts models to generate story pairs featuring male and female protagonists in structurally identical, morally ambiguous scenarios, then elicits and compares their moral judgments. When inconsistencies arise, the model is guided to produce balanced, gender-neutral judgments. These story-judgment pairs are used to fine-tune or optimize the models via Direct Preference Optimization (DPO). Experimental results show that our method significantly reduces gender bias while preserving or even enhancing general model capabilities. We will release the code and generated data.
△ Less
Submitted 22 May, 2025;
originally announced May 2025.
-
Dynamical Properties and Velocity Dispersion-Mass Relation of $z \sim 1$ Galaxy Clusters from the GOGREEN and GCLASS Surveys
Authors:
Shrouk Abdulshafy,
Mohamed H. Abdullah,
Gillian Wilson,
Michael L. Balogh,
Raouf H. Mabrouk
Abstract:
We investigate a sample of 14 galaxy clusters from the GOGREEN and GCLASS (GG) spectroscopic datasets within the redshift range $(0.87 \leq z \leq 1.37)$ and cluster masses $\mathrm{M}_{200} \gtrsim 2\times 10^{14}$ \hm. Using the highly effective GalWeight technique for cluster membership assignment developed by our own team, we derive the dynamical parameters of these clusters through the virial…
▽ More
We investigate a sample of 14 galaxy clusters from the GOGREEN and GCLASS (GG) spectroscopic datasets within the redshift range $(0.87 \leq z \leq 1.37)$ and cluster masses $\mathrm{M}_{200} \gtrsim 2\times 10^{14}$ \hm. Using the highly effective GalWeight technique for cluster membership assignment developed by our own team, we derive the dynamical parameters of these clusters through the virial mass estimator. We examine the velocity dispersion-cluster mass relation $(σ\mathrm{MR})$ for the GG cluster sample. We find, $\log{σ_{200}} = (2.94\pm0.02) + (0.37\pm0.07)\log{\mathrm{M}_{200}}$ with an intrinsic scatter of $(σ_\mathrm{int} = 0.02 \pm 0.02)$. Our results demonstrate that the $(σ\mathrm{MR})$ relation is consistent with predictions from cosmological simulations, highlighting the reliability of the GalWeight technique for cluster membership assignment. Furthermore, the $(σ\mathrm{MR})$ validates the robustness of the virial mass estimator in accurately recovering cluster masses and associated parameters. Importantly, our findings confirm that velocity dispersion can be used directly to estimate cluster mass without relying on dynamical mass estimators.
△ Less
Submitted 17 May, 2025;
originally announced May 2025.
-
Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax
Authors:
Iuliia Zaitova,
Vitalii Hirak,
Badr M. Abdullah,
Dietrich Klakow,
Bernd Möbius,
Tania Avgustinova
Abstract:
This study analyzes the attention patterns of fine-tuned encoder-only models based on the BERT architecture (BERT-based models) towards two distinct types of Multiword Expressions (MWEs): idioms and microsyntactic units (MSUs). Idioms present challenges in semantic non-compositionality, whereas MSUs demonstrate unconventional syntactic behavior that does not conform to standard grammatical categor…
▽ More
This study analyzes the attention patterns of fine-tuned encoder-only models based on the BERT architecture (BERT-based models) towards two distinct types of Multiword Expressions (MWEs): idioms and microsyntactic units (MSUs). Idioms present challenges in semantic non-compositionality, whereas MSUs demonstrate unconventional syntactic behavior that does not conform to standard grammatical categorizations. We aim to understand whether fine-tuning BERT-based models on specific tasks influences their attention to MWEs, and how this attention differs between semantic and syntactic tasks. We examine attention scores to MWEs in both pre-trained and fine-tuned BERT-based models. We utilize monolingual models and datasets in six Indo-European languages - English, German, Dutch, Polish, Russian, and Ukrainian. Our results show that fine-tuning significantly influences how models allocate attention to MWEs. Specifically, models fine-tuned on semantic tasks tend to distribute attention to idiomatic expressions more evenly across layers. Models fine-tuned on syntactic tasks show an increase in attention to MSUs in the lower layers, corresponding with syntactic processing requirements.
△ Less
Submitted 9 May, 2025;
originally announced May 2025.
-
Quantifying the Velocity Anisotropy Profile of Galaxy Clusters Using the Uchuu Cosmological Simulation
Authors:
Mohamed H. Abdullah,
Raouf H. Mabrouk,
Tomoaki Ishiyama,
Gillian Wilson,
Magdy Y. Amin
Abstract:
Galaxy clusters are powerful laboratories for studying both cosmic structure formation and galaxy evolution. We present a comprehensive analysis of the velocity anisotropy profile, beta(r), in galaxy clusters using the Uchuu-UniverseMachine mock galaxy catalog, which combines the large-volume Uchuu N-body simulation with the UniverseMachine galaxy formation model. Focusing on clusters with log(M20…
▽ More
Galaxy clusters are powerful laboratories for studying both cosmic structure formation and galaxy evolution. We present a comprehensive analysis of the velocity anisotropy profile, beta(r), in galaxy clusters using the Uchuu-UniverseMachine mock galaxy catalog, which combines the large-volume Uchuu N-body simulation with the UniverseMachine galaxy formation model. Focusing on clusters with log(M200) >= 13.9 [h^-1 M_sun] up to redshift z = 1.5, we investigate the behavior of beta(r) as a function of cluster-centric radius, mass, and redshift. We find that beta(r) exhibits a universal shape: it rises from isotropic values near the cluster core, peaks at approximately 1.7 R200, declines around 3.4 R200 due to orbital mixing, and increases again in the outskirts due to the dominance of first-infalling galaxies. Our results show that more massive clusters have higher radial anisotropy and larger peak beta values. Moreover, beta(r) evolves with redshift, with high-redshift clusters displaying more radially dominated orbits and enhanced infall motions. We further derive redshift-dependent power-law scaling relations between M200 and key physical radii: hydrostatic (R_hs), infall (R_inf), and turnaround (R_ta). These findings offer a robust theoretical framework for interpreting the dynamical properties of observed galaxy clusters and provide key insights into the evolution of their dynamical state over cosmic time.
△ Less
Submitted 6 April, 2025;
originally announced April 2025.
-
CliME: Evaluating Multimodal Climate Discourse on Social Media and the Climate Alignment Quotient (CAQ)
Authors:
Abhilekh Borah,
Hasnat Md Abdullah,
Kangda Wei,
Ruihong Huang
Abstract:
The rise of Large Language Models (LLMs) has raised questions about their ability to understand climate-related contexts. Though climate change dominates social media, analyzing its multimodal expressions is understudied, and current tools have failed to determine whether LLMs amplify credible solutions or spread unsubstantiated claims. To address this, we introduce CliME (Climate Change Multimoda…
▽ More
The rise of Large Language Models (LLMs) has raised questions about their ability to understand climate-related contexts. Though climate change dominates social media, analyzing its multimodal expressions is understudied, and current tools have failed to determine whether LLMs amplify credible solutions or spread unsubstantiated claims. To address this, we introduce CliME (Climate Change Multimodal Evaluation), a first-of-its-kind multimodal dataset, comprising 2579 Twitter and Reddit posts. The benchmark features a diverse collection of humorous memes and skeptical posts, capturing how these formats distill complex issues into viral narratives that shape public opinion and policy discussions. To systematically evaluate LLM performance, we present the Climate Alignment Quotient (CAQ), a novel metric comprising five distinct dimensions: Articulation, Evidence, Resonance, Transition, and Specificity. Additionally, we propose three analytical lenses: Actionability, Criticality, and Justice, to guide the assessment of LLM-generated climate discourse using CAQ. Our findings, based on the CAQ metric, indicate that while most evaluated LLMs perform relatively well in Criticality and Justice, they consistently underperform on the Actionability axis. Among the models evaluated, Claude 3.7 Sonnet achieves the highest overall performance. We publicly release our CliME dataset and code to foster further research in this domain.
△ Less
Submitted 4 April, 2025;
originally announced April 2025.
-
State-of-the-Art Translation of Text-to-Gloss using mBART : A case study of Bangla
Authors:
Sharif Md. Abdullah,
Abhijit Paul,
Shebuti Rayana,
Ahmedul Kabir,
Zarif Masud
Abstract:
Despite a large deaf and dumb population of 1.7 million, Bangla Sign Language (BdSL) remains a understudied domain. Specifically, there are no works on Bangla text-to-gloss translation task. To address this gap, we begin by addressing the dataset problem. We take inspiration from grammatical rule based gloss generation used in Germany and American sign langauage (ASL) and adapt it for BdSL. We als…
▽ More
Despite a large deaf and dumb population of 1.7 million, Bangla Sign Language (BdSL) remains a understudied domain. Specifically, there are no works on Bangla text-to-gloss translation task. To address this gap, we begin by addressing the dataset problem. We take inspiration from grammatical rule based gloss generation used in Germany and American sign langauage (ASL) and adapt it for BdSL. We also leverage LLM to generate synthetic data and use back-translation, text generation for data augmentation. With dataset prepared, we started experimentation. We fine-tuned pretrained mBART-50 and mBERT-multiclass-uncased model on our dataset. We also trained GRU, RNN and a novel seq-to-seq model with multi-head attention. We observe significant high performance (ScareBLEU=79.53) with fine-tuning pretrained mBART-50 multilingual model from Facebook. We then explored why we observe such high performance with mBART. We soon notice an interesting property of mBART -- it was trained on shuffled and masked text data. And as we know, gloss form has shuffling property. So we hypothesize that mBART is inherently good at text-to-gloss tasks. To find support against this hypothesis, we trained mBART-50 on PHOENIX-14T benchmark and evaluated it with existing literature. Our mBART-50 finetune demonstrated State-of-the-Art performance on PHOENIX-14T benchmark, far outperforming existing models in all 6 metrics (ScareBLEU = 63.89, BLEU-1 = 55.14, BLEU-2 = 38.07, BLEU-3 = 27.13, BLEU-4 = 20.68, COMET = 0.624). Based on the results, this study proposes a new paradigm for text-to-gloss task using mBART models. Additionally, our results show that BdSL text-to-gloss task can greatly benefit from rule-based synthetic dataset.
△ Less
Submitted 3 April, 2025;
originally announced April 2025.
-
Towards Continuous Experiment-driven MLOps
Authors:
Keerthiga Rajenthiram,
Milad Abdullah,
Ilias Gerostathopoulos,
Petr Hnetynka,
Tomáš Bureš,
Gerard Pons,
Besim Bilalli,
Anna Queralt
Abstract:
Despite advancements in MLOps and AutoML, ML development still remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is lack of efficient mechanisms for continuous evolution of ML models which would leverage the knowledge gained in previous optimizations of the same or different models. We propose an expe…
▽ More
Despite advancements in MLOps and AutoML, ML development still remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is lack of efficient mechanisms for continuous evolution of ML models which would leverage the knowledge gained in previous optimizations of the same or different models. We propose an experiment-driven MLOps approach which tackles these problems. Our approach relies on the concept of an experiment, which embodies a fully controllable optimization process. It introduces full traceability and repeatability to the optimization process, allows humans to be in full control of it, and enables continuous improvement of the ML system. Importantly, it also establishes knowledge, which is carried over and built across a series of experiments and allows for improving the efficiency of experimentation over time. We demonstrate our approach through its realization and application in the ExtremeXP1 project (Horizon Europe).
△ Less
Submitted 5 March, 2025;
originally announced March 2025.
-
Evidence for topological origin of large spin-shift current in antiferromagnetic Ti$_{4}$C$_{3}$
Authors:
Ali Sufyan,
Hasan M. Abdullah,
J. Andreas Larsson,
Alexander C. Tyner
Abstract:
The shift current is a non-linear photocurrent generally associated with the underlying quantum geometry. However, a topological origin for the shift photocurrent in non-centrosymmetric systems has recently been proposed. The corresponding topological classification goes beyond the ten-fold paradigm and is associated with the presence of a reverting Thouless pump (RTP). In this work we examine an…
▽ More
The shift current is a non-linear photocurrent generally associated with the underlying quantum geometry. However, a topological origin for the shift photocurrent in non-centrosymmetric systems has recently been proposed. The corresponding topological classification goes beyond the ten-fold paradigm and is associated with the presence of a reverting Thouless pump (RTP). In this work we examine an antiferromagnetic monolayer within the family of MXenes, Ti$_{4}$C$_{3}$. This material is centrosymmetric, however, magnetic ordering violates inversion symmetry. We demonstrate evidence of an RTP in each spin-sector which has been perturbed, destroying quantization of the invariant. Nevertheless, a giant spin-resolved shift current persists. We further investigate the mid-gap edge states and classification of the system as a fragile topological insulator to which trivial bands have been coupled.
△ Less
Submitted 18 February, 2025;
originally announced February 2025.
-
Resolving shortwave and longwave irradiation distributions across the human body in outdoor built environments
Authors:
Kambiz Sadeghi,
Shri H. Viswanathan,
Ankit Joshi,
Lyle Bartels,
Sylwester Wereski,
Cibin T. Jose,
Galina Mihaleva,
Muhammad Abdullah,
Ariane Middel,
Konrad Rykaczewski
Abstract:
Outdoor built environments can be designed to enhance thermal comfort, yet the relationship between the two is often assessed in whole-body terms, overlooking the asymmetric nature of thermal interactions between the human body and its surroundings. Moreover, the radiative component of heat exchange-dominant in hot and dry climates-is typically lumped into a single artificial metric, the mean radi…
▽ More
Outdoor built environments can be designed to enhance thermal comfort, yet the relationship between the two is often assessed in whole-body terms, overlooking the asymmetric nature of thermal interactions between the human body and its surroundings. Moreover, the radiative component of heat exchange-dominant in hot and dry climates-is typically lumped into a single artificial metric, the mean radiant temperature, rather than being resolved into its shortwave and longwave spectral components. The shortwave irradiation distribution on the human body is often highly anisotropic, causing localized thermal discomfort in outdoor environments. However, no existing methods effectively quantify shortwave and longwave irradiation distributions on the human body. To address this gap, we developed two methods to quantify these processes. The first approach uses an outdoor thermal manikin with a white-coated side, enabling the separation of spectral components by subtracting measurements from symmetrically corresponding surface zones of tan color. The second hybrid approach converts radiometer measurements in six directions into boundary conditions for computational thermal manikin simulations. We evaluated irradiation distributions for various body parts using both methods during outdoor measurements across sunny, partially shaded, and fully shaded sites under warm to extremely hot conditions. In most cases, the two methods produced closely aligned results, with divergences highlighting their respective strengths and limitations. Additionally, we used the manikin to quantify irradiation attenuation provided by five long-sleeve shirts with colors ranging from white to black. These advanced methods can be integrated with airflow and thermoregulatory modeling to optimize outdoor built environments for enhanced human thermal comfort.
△ Less
Submitted 12 March, 2025; v1 submitted 6 February, 2025;
originally announced February 2025.
-
A Comparative Performance Analysis of Classification and Segmentation Models on Bangladeshi Pothole Dataset
Authors:
Antara Firoz Parsa,
S. M. Abdullah,
Anika Hasan Talukder,
Md. Asif Shahidullah Kabbya,
Shakib Al Hasan,
Md. Farhadul Islam,
Jannatun Noor
Abstract:
The study involves a comprehensive performance analysis of popular classification and segmentation models, applied over a Bangladeshi pothole dataset, being developed by the authors of this research. This custom dataset of 824 samples, collected from the streets of Dhaka and Bogura performs competitively against the existing industrial and custom datasets utilized in the present literature. The da…
▽ More
The study involves a comprehensive performance analysis of popular classification and segmentation models, applied over a Bangladeshi pothole dataset, being developed by the authors of this research. This custom dataset of 824 samples, collected from the streets of Dhaka and Bogura performs competitively against the existing industrial and custom datasets utilized in the present literature. The dataset was further augmented four-fold for segmentation and ten-fold for classification evaluation. We tested nine classification models (CCT, CNN, INN, Swin Transformer, ConvMixer, VGG16, ResNet50, DenseNet201, and Xception) and four segmentation models (U-Net, ResU-Net, U-Net++, and Attention-Unet) over both the datasets. Among the classification models, lightweight models namely CCT, CNN, INN, Swin Transformer, and ConvMixer were emphasized due to their low computational requirements and faster prediction times. The lightweight models performed respectfully, oftentimes equating to the performance of heavyweight models. In addition, augmentation was found to enhance the performance of all the tested models. The experimental results exhibit that, our dataset performs on par or outperforms the similar classification models utilized in the existing literature, reaching accuracy and f1-scores over 99%. The dataset also performed on par with the existing datasets for segmentation, achieving model Dice Similarity Coefficient up to 67.54% and IoU scores up to 59.39%.
△ Less
Submitted 11 January, 2025;
originally announced January 2025.
-
Thermal Transport Properties of Magnons on the $α$-T$_3$ Lattice
Authors:
Luqman Saleem,
Hasan M. Abdullah,
Udo Schwingenschlogl,
Aurelien Manchon
Abstract:
We theoretically investigate magnons on the $α$-T$_3$ lattice. Atomistic spin dynamics simulations show that next-nearest neighbor hopping and easy-axis anisotropy stabilize ferromagnetic order in the presence of Dzyaloshinskii-Moriya interaction. We identify one topologically trivial magnon insulator phase and three magnon Chern insulator phases. The topologically trivial magnon insulator phase e…
▽ More
We theoretically investigate magnons on the $α$-T$_3$ lattice. Atomistic spin dynamics simulations show that next-nearest neighbor hopping and easy-axis anisotropy stabilize ferromagnetic order in the presence of Dzyaloshinskii-Moriya interaction. We identify one topologically trivial magnon insulator phase and three magnon Chern insulator phases. The topologically trivial magnon insulator phase exhibits a small but non-zero magnon thermal Hall conductivity, while in the magnon Chern insulator phases the Chern number of the lowest magnon band dominates the magnon thermal Hall conductivity. The sign of the magnon thermal Hall conductivity does not change at the topological phase boundaries, but distinct changes are observed in the magnitude.
△ Less
Submitted 7 January, 2025;
originally announced January 2025.
-
Visual Counter Turing Test (VCT^2): Discovering the Challenges for AI-Generated Image Detection and Introducing Visual AI Index (V_AI)
Authors:
Nasrin Imanpour,
Shashwat Bajpai,
Subhankar Ghosh,
Sainath Reddy Sankepally,
Abhilekh Borah,
Hasnat Md Abdullah,
Nishoak Kosaraju,
Shreyas Dixit,
Ashhar Aziz,
Shwetangshu Biswas,
Vinija Jain,
Aman Chadha,
Amit Sheth,
Amitava Das
Abstract:
The proliferation of AI techniques for image generation, coupled with their increasing accessibility, has raised significant concerns about the potential misuse of these images to spread misinformation. Recent AI-generated image detection (AGID) methods include CNNDetection, NPR, DM Image Detection, Fake Image Detection, DIRE, LASTED, GAN Image Detection, AIDE, SSP, DRCT, RINE, OCC-CLIP, De-Fake,…
▽ More
The proliferation of AI techniques for image generation, coupled with their increasing accessibility, has raised significant concerns about the potential misuse of these images to spread misinformation. Recent AI-generated image detection (AGID) methods include CNNDetection, NPR, DM Image Detection, Fake Image Detection, DIRE, LASTED, GAN Image Detection, AIDE, SSP, DRCT, RINE, OCC-CLIP, De-Fake, and Deep Fake Detection. However, we argue that the current state-of-the-art AGID techniques are inadequate for effectively detecting contemporary AI-generated images and advocate for a comprehensive reevaluation of these methods. We introduce the Visual Counter Turing Test (VCT^2), a benchmark comprising ~130K images generated by contemporary text-to-image models (Stable Diffusion 2.1, Stable Diffusion XL, Stable Diffusion 3, DALL-E 3, and Midjourney 6). VCT^2 includes two sets of prompts sourced from tweets by the New York Times Twitter account and captions from the MS COCO dataset. We also evaluate the performance of the aforementioned AGID techniques on the VCT$^2$ benchmark, highlighting their ineffectiveness in detecting AI-generated images. As image-generative AI models continue to evolve, the need for a quantifiable framework to evaluate these models becomes increasingly critical. To meet this need, we propose the Visual AI Index (V_AI), which assesses generated images from various visual perspectives, including texture complexity and object coherence, setting a new standard for evaluating image-generative AI models. To foster research in this domain, we make our https://huggingface.co/datasets/anonymous1233/COCO_AI and https://huggingface.co/datasets/anonymous1233/twitter_AI datasets publicly available.
△ Less
Submitted 24 November, 2024;
originally announced November 2024.
-
UAL-Bench: The First Comprehensive Unusual Activity Localization Benchmark
Authors:
Hasnat Md Abdullah,
Tian Liu,
Kangda Wei,
Shu Kong,
Ruihong Huang
Abstract:
Localizing unusual activities, such as human errors or surveillance incidents, in videos holds practical significance. However, current video understanding models struggle with localizing these unusual events likely because of their insufficient representation in models' pretraining datasets. To explore foundation models' capability in localizing unusual activity, we introduce UAL-Bench, a compreh…
▽ More
Localizing unusual activities, such as human errors or surveillance incidents, in videos holds practical significance. However, current video understanding models struggle with localizing these unusual events likely because of their insufficient representation in models' pretraining datasets. To explore foundation models' capability in localizing unusual activity, we introduce UAL-Bench, a comprehensive benchmark for unusual activity localization, featuring three video datasets: UAG-OOPS, UAG-SSBD, UAG-FunQA, and an instruction-tune dataset: OOPS-UAG-Instruct, to improve model capabilities. UAL-Bench evaluates three approaches: Video-Language Models (Vid-LLMs), instruction-tuned Vid-LLMs, and a novel integration of Vision-Language Models and Large Language Models (VLM-LLM). Our results show the VLM-LLM approach excels in localizing short-span unusual events and predicting their onset (start time) more accurately than Vid-LLMs. We also propose a new metric, R@1, TD <= p, to address limitations in existing evaluation methods. Our findings highlight the challenges posed by long-duration videos, particularly in autism diagnosis scenarios, and the need for further advancements in localization techniques. Our work not only provides a benchmark for unusual activity localization but also outlines the key challenges for existing foundation models, suggesting future research directions on this important task.
△ Less
Submitted 1 October, 2024;
originally announced October 2024.
-
Effect of ground-state deformation on the Isoscalar Giant Monopole Resonance and the first observation of overtones of the Isoscalar Giant Quadrupole Resonance in rare-earth Nd isotopes
Authors:
M. Abdullah,
S. Bagchi,
M. N. Harakeh,
H. Akimune,
D. Das,
T. Doi,
L. M. Donaldson,
Y. Fujikawa,
M. Fujiwara,
T. Furuno,
U. Garg,
Y. K. Gupta,
K. B. Howard,
Y. Hijikata,
K. Inaba,
S. Ishida,
M. Itoh,
N. Kalantar-Nayestanaki,
D. Kar,
T. Kawabata,
S. Kawashima,
K. Khokhar,
K. Kitamura,
N. Kobayashi,
Y. Matsuda
, et al. (11 additional authors not shown)
Abstract:
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two co…
▽ More
The strength distributions of the Isoscalar Giant Monopole Resonance (ISGMR) and Isoscalar Giant Quadrupole Resonance (ISGQR) in 142,146-150Nd have been determined via inelastic alpha-particle scattering with the Grand Raiden (GR) Spectrometer at the Research Center for Nuclear Physics (RCNP), Japan. In the deformed nuclei 146-150Nd, the ISGMR strength distributions exhibit a splitting into two components, while the nearly spherical nucleus 142Nd displays a single peak in the ISGMR strength distribution. A noteworthy achievement in this study is the first-time detection of overtones in the Isoscalar Giant Quadrupole Resonance (ISGQR) strength distributions within Nd isotopes at an excitation energy around 25 MeV obtained through Multipole Decomposition Analysis (MDA).
△ Less
Submitted 8 July, 2024;
originally announced July 2024.
-
Viscous Instabilities in Transversely Strained Channel Flows
Authors:
Muhammad Abdullah
Abstract:
We investigate here linear stability in a canonical three-dimensional boundary layer generated by the superposition of a spanwise pressure gradient upon an otherwise standard channel flow. As the main result, we introduce a simple coordinate transformation that enables the complete description of modal and non-modal stability using previous results on Poiseuille flow. We leverage this insight to d…
▽ More
We investigate here linear stability in a canonical three-dimensional boundary layer generated by the superposition of a spanwise pressure gradient upon an otherwise standard channel flow. As the main result, we introduce a simple coordinate transformation that enables the complete description of modal and non-modal stability using previous results on Poiseuille flow. We leverage this insight to derive closed forms for some relevant stability metrics. In particular, the critical Reynolds number for exponential-in-time growth is found to monotonically decrease with the strength of the cross-flow. A suitably chosen re-scaling, however, shows that the stability characteristics ultimately approach those of channel flow, despite the presence of a non-zero spanwise shear. Unstable eigenmodes akin to the Tollmien-Schlichting wave are found to propagate along the direction of the net flow. From a non-modal perspective, the maximal transient (algebraic) growth increases quadratically with the spanwise pressure differential and, similar to two-dimensional flows, is fueled by the lift-up effect. In this regard, the linear energy budget highlights a dramatic increase in energy production against the spanwise shear.
△ Less
Submitted 27 June, 2024;
originally announced June 2024.
-
On the Encoding of Gender in Transformer-based ASR Representations
Authors:
Aravind Krishnan,
Badr M. Abdullah,
Dietrich Klakow
Abstract:
While existing literature relies on performance differences to uncover gender biases in ASR models, a deeper analysis is essential to understand how gender is encoded and utilized during transcript generation. This work investigates the encoding and utilization of gender in the latent representations of two transformer-based ASR models, Wav2Vec2 and HuBERT. Using linear erasure, we demonstrate the…
▽ More
While existing literature relies on performance differences to uncover gender biases in ASR models, a deeper analysis is essential to understand how gender is encoded and utilized during transcript generation. This work investigates the encoding and utilization of gender in the latent representations of two transformer-based ASR models, Wav2Vec2 and HuBERT. Using linear erasure, we demonstrate the feasibility of removing gender information from each layer of an ASR model and show that such an intervention has minimal impacts on the ASR performance. Additionally, our analysis reveals a concentration of gender information within the first and last frames in the final layers, explaining the ease of erasing gender in these layers. Our findings suggest the prospect of creating gender-neutral embeddings that can be integrated into ASR frameworks without compromising their efficacy.
△ Less
Submitted 14 June, 2024;
originally announced June 2024.
-
Calculation of Femur Caput Collum Diaphyseal angle for X-Rays images using Semantic Segmentation
Authors:
Muhammad Abdullah,
Anne Querfurth,
Deepak Bhatia,
Mahdi Mantash
Abstract:
This paper investigates the use of deep learning approaches to estimate the femur caput-collum-diaphyseal (CCD) angle from X-ray images. The CCD angle is an important measurement in the diagnosis of hip problems, and correct prediction can help in the planning of surgical procedures. Manual measurement of this angle, on the other hand, can be time-intensive and vulnerable to inter-observer variabi…
▽ More
This paper investigates the use of deep learning approaches to estimate the femur caput-collum-diaphyseal (CCD) angle from X-ray images. The CCD angle is an important measurement in the diagnosis of hip problems, and correct prediction can help in the planning of surgical procedures. Manual measurement of this angle, on the other hand, can be time-intensive and vulnerable to inter-observer variability. In this paper, we present a deep-learning algorithm that can reliably estimate the femur CCD angle from X-ray images. To train and test the performance of our model, we employed an X-ray image dataset with associated femur CCD angle measurements. Furthermore, we built a prototype to display the resulting predictions and to allow the user to interact with the predictions. As this is happening in a sterile setting during surgery, we expanded our interface to the possibility of being used only by voice commands.
Our results show that our deep learning model predicts the femur CCD angle on X-ray images with great accuracy, with a mean absolute error of 4.3 degrees on the left femur and 4.9 degrees on the right femur on the test dataset. Our results suggest that deep learning has the potential to give a more efficient and accurate technique for predicting the femur CCD angle, which might have substantial therapeutic implications for the diagnosis and management of hip problems.
△ Less
Submitted 26 May, 2024; v1 submitted 25 April, 2024;
originally announced April 2024.
-
An Analysis of Recent Advances in Deepfake Image Detection in an Evolving Threat Landscape
Authors:
Sifat Muhammad Abdullah,
Aravind Cheruvu,
Shravya Kanchi,
Taejoong Chung,
Peng Gao,
Murtuza Jadliwala,
Bimal Viswanath
Abstract:
Deepfake or synthetic images produced using deep generative models pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developm…
▽ More
Deepfake or synthetic images produced using deep generative models pose serious risks to online platforms. This has triggered several research efforts to accurately detect deepfake images, achieving excellent performance on publicly available deepfake datasets. In this work, we study 8 state-of-the-art detectors and argue that they are far from being ready for deployment due to two recent developments. First, the emergence of lightweight methods to customize large generative models, can enable an attacker to create many customized generators (to create deepfakes), thereby substantially increasing the threat surface. We show that existing defenses fail to generalize well to such \emph{user-customized generative models} that are publicly available today. We discuss new machine learning approaches based on content-agnostic features, and ensemble modeling to improve generalization performance against user-customized models. Second, the emergence of \textit{vision foundation models} -- machine learning models trained on broad data that can be easily adapted to several downstream tasks -- can be misused by attackers to craft adversarial deepfakes that can evade existing defenses. We propose a simple adversarial attack that leverages existing foundation models to craft adversarial samples \textit{without adding any adversarial noise}, through careful semantic manipulation of the image content. We highlight the vulnerabilities of several defenses against our attack, and explore directions leveraging advanced foundation models and adversarial training to defend against this new threat.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Revisiting Crossflow-Based Stabilization in Channel Flows
Authors:
Muhammad Abdullah,
George Ilhwan Park
Abstract:
Stabilization schemes in wall-bounded flows often invoke fluid transpiration through porous boundaries. While these have been extensively validated for external flows, their efficacy in channels, particularly from the standpoint of non-modal perturbations, is yet to be demonstrated. Here, we show that crossflow strengths previously considered ``ideal'' for optimizing stability in channels in fact…
▽ More
Stabilization schemes in wall-bounded flows often invoke fluid transpiration through porous boundaries. While these have been extensively validated for external flows, their efficacy in channels, particularly from the standpoint of non-modal perturbations, is yet to be demonstrated. Here, we show that crossflow strengths previously considered ``ideal'' for optimizing stability in channels in fact admit strong non-modal energy amplification. We begin by supplementing existing modal calculations and then show via the resolvent that extremely strong and potentially unfeasible crossflows are required to suppress non-modal growth in linearly stable regimes. Investigation of unforced algebraic growth paints a similar picture. Here, a component-wise budget analysis reveals that energy redistribution through pressure-velocity correlations plays an important role in driving energy growth/decay. The superposition of a moving wall is also considered, and it is shown that while energy amplification generally worsens, it can potentially be suppressed beyond a regime shift in parameter space. However, these flows are marred by rapidly declining mass transport, rendering their ultimate utility questionable. Our results suggest that crossflow-based stabilization might not be useful in internal flows.
△ Less
Submitted 22 April, 2024;
originally announced April 2024.
-
A classification of $1$-homogeneous distance-regular graphs with positive intersection number $a_1$
Authors:
Jack H. Koolen,
Mamoon Abdullah,
Brhane Gebremichel,
Jae-Ho Lee
Abstract:
Let $Γ$ be a graph with diameter at least two. Then $Γ$ is said to be $1$-homogeneous (in the sense of Nomura) whenever for every pair of adjacent vertices $x$ and $y$ in $Γ$, the distance partition of the vertex set of $Γ$ with respect to both $x$ and $y$ is equitable, and the parameters corresponding to equitable partitions are independent of the choice of $x$ and $y$. Assume that $Γ$ is $1$-hom…
▽ More
Let $Γ$ be a graph with diameter at least two. Then $Γ$ is said to be $1$-homogeneous (in the sense of Nomura) whenever for every pair of adjacent vertices $x$ and $y$ in $Γ$, the distance partition of the vertex set of $Γ$ with respect to both $x$ and $y$ is equitable, and the parameters corresponding to equitable partitions are independent of the choice of $x$ and $y$. Assume that $Γ$ is $1$-homogeneous distance-regular with intersection number $a_1>0$ and diameter $D\geqslant 5$. Define $b=b_1/(θ_1+1)$, where $b_1$ is the intersection number and $θ_1$ is the second largest eigenvalue of $Γ$. We show that if intersection number $c_2$ is at least $2$, then $b\geqslant 1$ and one of the following (i)--(vi) holds: (i) $Γ$ is a regular near $2D$-gon, (ii) $Γ$ is a Johnson graph $J(2D,D)$, (iii) $Γ$ is a halved $\ell$-cube with $\ell \in \{2D,2D+1\}$, (iv) $Γ$ is a folded Johnson graph $\bar{J}(4D,2D)$, (v) $Γ$ is a folded halved $(4D)$-cube, (vi) the valency of $Γ$ is bounded by a function of $b$. Using this result, we characterize $1$-homogeneous graphs with classical parameters and $a_1>0$, as well as tight distance-regular graphs.
△ Less
Submitted 30 January, 2025; v1 submitted 1 April, 2024;
originally announced April 2024.
-
Rethinking Software Engineering in the Foundation Model Era: A Curated Catalogue of Challenges in the Development of Trustworthy FMware
Authors:
Ahmed E. Hassan,
Dayi Lin,
Gopi Krishnan Rajbahadur,
Keheliya Gallaba,
Filipe R. Cogo,
Boyuan Chen,
Haoxiang Zhang,
Kishanthan Thangarajah,
Gustavo Ansaldi Oliva,
Jiahuei Lin,
Wali Mohammad Abdullah,
Zhen Ming Jiang
Abstract:
Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized software development by enabling new use cases and business models. We refer to software built using FMs as FMware. The unique properties of FMware (e.g., prompts, agents, and the need for orchestration), coupled with the intrinsic limitations of FMs (e.g., hallucination) lead to a completely new set of software eng…
▽ More
Foundation models (FMs), such as Large Language Models (LLMs), have revolutionized software development by enabling new use cases and business models. We refer to software built using FMs as FMware. The unique properties of FMware (e.g., prompts, agents, and the need for orchestration), coupled with the intrinsic limitations of FMs (e.g., hallucination) lead to a completely new set of software engineering challenges. Based on our industrial experience, we identified 10 key SE4FMware challenges that have caused enterprise FMware development to be unproductive, costly, and risky. In this paper, we discuss these challenges in detail and state the path for innovation that we envision. Next, we present FMArts, which is our long-term effort towards creating a cradle-to-grave platform for the engineering of trustworthy FMware. Finally, we (i) show how the unique properties of FMArts enabled us to design and develop a complex FMware for a large customer in a timely manner and (ii) discuss the lessons that we learned in doing so. We hope that the disclosure of the aforementioned challenges and our associated efforts to tackle them will not only raise awareness but also promote deeper and further discussions, knowledge sharing, and innovative solutions across the software engineering discipline.
△ Less
Submitted 3 March, 2024; v1 submitted 24 February, 2024;
originally announced February 2024.
-
Linear Stability Analysis of Oblique Couette-Poiseuille flows
Authors:
Muhammad Abdullah,
George Ilhwan Park
Abstract:
We perform a detailed numerical study of modal and non-modal stability in oblique Couette-Poiseuille profiles, which are among the simplest examples of three-dimensional boundary layers. Through a comparison with the Orr-Sommerfeld operator for the aligned case, we show how an effective wall speed succinctly characterizes modal stability. Large-scale parameter sweeps reveal that the misalignment b…
▽ More
We perform a detailed numerical study of modal and non-modal stability in oblique Couette-Poiseuille profiles, which are among the simplest examples of three-dimensional boundary layers. Through a comparison with the Orr-Sommerfeld operator for the aligned case, we show how an effective wall speed succinctly characterizes modal stability. Large-scale parameter sweeps reveal that the misalignment between the pressure gradient and wall motion is, in general, destabilizing. For flows that are sufficiently oblique, the instability is found to depend exclusively on the direction of wall motion and not on its speed, a conclusion supported, in part, by the perturbation energy budget and the evolution of the critical layers. Closed forms for the critical parameters in this regime are derived using a simple analysis. Finally, a modified long-wavelength approximation is developed, and the resulting asymptotic eigenvalue problem is used to show that there is no cutoff wall speed for unconditional stability whenever the angle of wall motion is non-zero, in stark contrast to the aligned case. From a non-modal perspective, pseudo-resonance is examined through the resolvent and the $ε$-pseudospectra. An analysis of the unforced initial value problem shows that the maximum energy gain is highly dependent on both the magnitude and direction of the wall velocity. However, the strongest amplification is always achieved for configurations that are only weakly skewed. Finally, the optimal perturbations appear to develop via a lift-up effect induced by an Orr-like mechanism.
△ Less
Submitted 11 February, 2024;
originally announced February 2024.
-
Distance-regular graphs with a few $q$-distance eigenvalues
Authors:
Mamoon Abdullah,
Brhane Gebremichel,
Sakander Hayat,
Jack H. Koolen
Abstract:
In this paper we study when the $q$-distance matrix of a distance-regular graph has few distinct eigenvalues. We mainly concentrate on diameter 3.
In this paper we study when the $q$-distance matrix of a distance-regular graph has few distinct eigenvalues. We mainly concentrate on diameter 3.
△ Less
Submitted 10 January, 2024;
originally announced January 2024.
-
The Correlation Function and Detection of Baryon Acoustic Oscillation Peak from the Spectroscopic SDSS GalWCat Galaxy Cluster Catalogue
Authors:
Mohamed H. Abdullah,
Anatoly Klypin,
Francisco Prada,
Gillian Wilson,
Tomoaki Ishiyama,
Julia Ereza
Abstract:
We measure the two point correlation function (CF) of 1357 galaxy clusters with a mass of $\log_{10}{M_{200}}\geq 13.6$~\hm~and at a redshift of $z \leq 0.125$. This work differs from previous analyses in that it utilizes a spectroscopic cluster catalogue, $\mathtt{SDSS-GalWCat}$, to measure the CF and detect the baryon acoustic oscillation (BAO) signal. Unlike previous studies which use statistic…
▽ More
We measure the two point correlation function (CF) of 1357 galaxy clusters with a mass of $\log_{10}{M_{200}}\geq 13.6$~\hm~and at a redshift of $z \leq 0.125$. This work differs from previous analyses in that it utilizes a spectroscopic cluster catalogue, $\mathtt{SDSS-GalWCat}$, to measure the CF and detect the baryon acoustic oscillation (BAO) signal. Unlike previous studies which use statistical techniques, we compute covariance errors directly by generating a set of 1086 galaxy cluster lightcones from the GLAM $N$-body simulation. Fitting the CF with a power-law model of the form $ξ(s) = (s/s_0)^{-γ}$, we determine the best-fit correlation length and power-law index at three mass thresholds. We find that the correlation length increases with increasing the mass threshold while the power-law index is almost constant. For $\log_{10}{M_{200}}\geq 13.6$~\hm, we find $s_0 = 14.54\pm0.87$~\h~and $γ=1.97\pm0.11$. We detect the BAO signal at $s = 100$~\h~with a significance of $1.60 σ$. Fitting the CF with a $Λ$CDM model, we find $D_\mathrm{V}(z = 0.089)\mathrm{r}^{fid}_d/\mathrm{r}_d = 267.62 \pm 26$ \h, consistent with Planck 2015 cosmology. We present a set of 108 high-fidelity simulated galaxy cluster lightcones from the high-resolution \U~N-body simulation, employed for methodological validation. We find $D_\mathrm{V}(z = 0.089)/r_d = 2.666 \pm 0.129$, indicating that our method does not introduce any bias in the parameter estimation for this small sample of galaxy clusters.
△ Less
Submitted 20 December, 2023;
originally announced December 2023.
-
Self-supervised Adaptive Pre-training of Multilingual Speech Models for Language and Dialect Identification
Authors:
Mohammed Maqsood Shaik,
Dietrich Klakow,
Badr M. Abdullah
Abstract:
Pre-trained Transformer-based speech models have shown striking performance when fine-tuned on various downstream tasks such as automatic speech recognition and spoken language identification (SLID). However, the problem of domain mismatch remains a challenge in this area, where the domain of the pre-training data might differ from that of the downstream labeled data used for fine-tuning. In multi…
▽ More
Pre-trained Transformer-based speech models have shown striking performance when fine-tuned on various downstream tasks such as automatic speech recognition and spoken language identification (SLID). However, the problem of domain mismatch remains a challenge in this area, where the domain of the pre-training data might differ from that of the downstream labeled data used for fine-tuning. In multilingual tasks such as SLID, the pre-trained speech model may not support all the languages in the downstream task. To address this challenge, we propose self-supervised adaptive pre-training (SAPT) to adapt the pre-trained model to the target domain and languages of the downstream task. We apply SAPT to the XLSR-128 model and investigate the effectiveness of this approach for the SLID task. First, we demonstrate that SAPT improves XLSR performance on the FLEURS benchmark with substantial gains up to 40.1% for under-represented languages. Second, we apply SAPT on four different datasets in a few-shot learning setting, showing that our approach improves the sample efficiency of XLSR during fine-tuning. Our experiments provide strong empirical evidence that continual adaptation via self-supervision improves downstream performance for multilingual speech models.
△ Less
Submitted 12 December, 2023;
originally announced December 2023.
-
The Cluster Mass Function and the $σ_8$-tension
Authors:
Alexandros Papageorgiou,
Manolis Plionis,
Spyros Basilakos,
H. M. Abdullah
Abstract:
We use a large set of halo mass function (HMF) models in order to investigate their ability to represent the observational Cluster Mass Function (CMF), derived from the $\mathtt{GalWCat19}$ cluster catalogue, within the $Λ$CDM cosmology. We apply the $χ^2$ minimization procedure to constrain the free parameters of the models, namely $Ω_m$ and $σ_8$. We find that all HMF models fit well the observa…
▽ More
We use a large set of halo mass function (HMF) models in order to investigate their ability to represent the observational Cluster Mass Function (CMF), derived from the $\mathtt{GalWCat19}$ cluster catalogue, within the $Λ$CDM cosmology. We apply the $χ^2$ minimization procedure to constrain the free parameters of the models, namely $Ω_m$ and $σ_8$. We find that all HMF models fit well the observational CMF, while the Bocquet et. al. model provides the best fit, with the lowest $χ^2$ value. Utilizing the {\em Index of Inconsistency} (IOI) measure, we further test the possible inconsistency of the models with respect to a variety of {\em Planck 2018} $Λ$CDM cosmologies, resulting from the combination of different probes (CMB - BAO or CMB - DES). We find that the HMF models that fitted well the observed CMF provide consistent cosmological parameters with those of the {\em Planck} CMB analysis, except for the Press $\&$ Schechter, Yahagi et. al., and Despali et. al. models which return large IOI values. The inverse $χ_{\rm min}^2$-weighted average values of $Ω_m$ and $σ_8$, over all 23 theoretical HMF models are: ${\bar Ω_{m,0}}=0.313\pm 0.022$ and ${\bar σ_8}=0.798\pm0.040$, which are clearly consistent with the results of {\em Planck}-CMB, providing $S_8=σ_8\left(Ω_m/0.3\right)^{1/2}= 0.815\pm 0.05$. Within the $Λ$CDM paradigm and independently of the selected HMF model in the analysis, we find that the current CMF shows no $σ_8$-tension with the corresponding {\em Planck}-CMB results.
△ Less
Submitted 16 November, 2023;
originally announced November 2023.
-
SynthEnsemble: A Fusion of CNN, Vision Transformer, and Hybrid Models for Multi-Label Chest X-Ray Classification
Authors:
S. M. Nabil Ashraf,
Md. Adyelullahil Mamun,
Hasnat Md. Abdullah,
Md. Golam Rabiul Alam
Abstract:
Chest X-rays are widely used to diagnose thoracic diseases, but the lack of detailed information about these abnormalities makes it challenging to develop accurate automated diagnosis systems, which is crucial for early detection and effective treatment. To address this challenge, we employed deep learning techniques to identify patterns in chest X-rays that correspond to different diseases. We co…
▽ More
Chest X-rays are widely used to diagnose thoracic diseases, but the lack of detailed information about these abnormalities makes it challenging to develop accurate automated diagnosis systems, which is crucial for early detection and effective treatment. To address this challenge, we employed deep learning techniques to identify patterns in chest X-rays that correspond to different diseases. We conducted experiments on the "ChestX-ray14" dataset using various pre-trained CNNs, transformers, hybrid(CNN+Transformer) models and classical models. The best individual model was the CoAtNet, which achieved an area under the receiver operating characteristic curve (AUROC) of 84.2%. By combining the predictions of all trained models using a weighted average ensemble where the weight of each model was determined using differential evolution, we further improved the AUROC to 85.4%, outperforming other state-of-the-art methods in this field. Our findings demonstrate the potential of deep learning techniques, particularly ensemble deep learning, for improving the accuracy of automatic diagnosis of thoracic diseases from chest X-rays. Code available at:https://github.com/syednabilashraf/SynthEnsemble
△ Less
Submitted 22 May, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder
Authors:
Rownak Ara Rasul,
Promy Saha,
Diponkor Bala,
S M Rakib Ul Karim,
Md. Ibrahim Abdullah,
Bishwajit Saha
Abstract:
Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify cru…
▽ More
Autistic Spectrum Disorder (ASD) is a neurological disease characterized by difficulties with social interaction, communication, and repetitive activities. While its primary origin lies in genetics, early detection is crucial, and leveraging machine learning offers a promising avenue for a faster and more cost-effective diagnosis. This study employs diverse machine learning methods to identify crucial ASD traits, aiming to enhance and automate the diagnostic process. We study eight state-of-the-art classification models to determine their effectiveness in ASD detection. We evaluate the models using accuracy, precision, recall, specificity, F1-score, area under the curve (AUC), kappa, and log loss metrics to find the best classifier for these binary datasets. Among all the classification models, for the children dataset, the SVM and LR models achieve the highest accuracy of 100% and for the adult dataset, the LR model produces the highest accuracy of 97.14%. Our proposed ANN model provides the highest accuracy of 94.24% for the new combined dataset when hyperparameters are precisely tuned for each model. As almost all classification models achieve high accuracy which utilize true labels, we become interested in delving into five popular clustering algorithms to understand model behavior in scenarios without true labels. We calculate Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and Silhouette Coefficient (SC) metrics to select the best clustering models. Our evaluation finds that spectral clustering outperforms all other benchmarking clustering models in terms of NMI and ARI metrics while demonstrating comparability to the optimal SC achieved by k-means. The implemented code is available at GitHub.
△ Less
Submitted 28 December, 2023; v1 submitted 20 September, 2023;
originally announced September 2023.
-
Ensemble-based modeling abstractions for modern self-optimizing systems
Authors:
Michal Töpfer,
Milad Abdullah,
Tomáš Bureš,
Petr Hnětynka,
Martin Kruliš
Abstract:
In this paper, we extend our ensemble-based component model DEECo with the capability to use machine-learning and optimization heuristics in establishing and reconfiguration of autonomic component ensembles. We show how to capture these concepts on the model level and give an example of how such a model can be beneficially used for modeling access-control related problem in the Industry 4.0 settin…
▽ More
In this paper, we extend our ensemble-based component model DEECo with the capability to use machine-learning and optimization heuristics in establishing and reconfiguration of autonomic component ensembles. We show how to capture these concepts on the model level and give an example of how such a model can be beneficially used for modeling access-control related problem in the Industry 4.0 settings. We argue that incorporating machine-learning and optimization heuristics is a key feature for modern smart systems which are to learn over the time and optimize their behavior at runtime to deal with uncertainty in their environment.
△ Less
Submitted 11 September, 2023;
originally announced September 2023.
-
Theoretical foundation for the Pareto distribution of international trade strength and introduction of an equation for international trade forecasting
Authors:
Mikrajuddin Abdullah
Abstract:
I propose a new terminology, international trade strength, which is defined as the ratio of a country's total international trade to its GDP. This parameter represents a country's ability to generate international trade by utilizing its GDP. This figure is equivalent to GDP per capita, which represents a country's ability to use its population to generate GDP. Trade strength varies by country. The…
▽ More
I propose a new terminology, international trade strength, which is defined as the ratio of a country's total international trade to its GDP. This parameter represents a country's ability to generate international trade by utilizing its GDP. This figure is equivalent to GDP per capita, which represents a country's ability to use its population to generate GDP. Trade strength varies by country. The intriguing question is, what distribution function does the trade strength fulfill? In this paper, a theoretical foundation for predicting the distribution of trade strength and the rate of change of trade strength were developed. These two quantities were found to satisfy the Pareto distribution function. The equations were confirmed using data from the World Integrated Trade Solution (WITS) and the World Bank by comparing the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) to five types of distribution functions (exponential, lognormal, gamma, Pareto, and Weibull). I also discovered that the fitting Pareto power parameter is fairly close to the theoretical parameter. In addition, a formula for forecasting a country's total international trade in the following years was also developed.
△ Less
Submitted 19 August, 2023;
originally announced September 2023.
-
A New Approach to Overcoming Zero Trade in Gravity Models to Avoid Indefinite Values in Linear Logarithmic Equations and Parameter Verification Using Machine Learning
Authors:
Mikrajuddin Abdullah
Abstract:
The presence of a high number of zero flow trades continues to provide a challenge in identifying gravity parameters to explain international trade using the gravity model. Linear regression with a logarithmic linear equation encounters an indefinite value on the logarithmic trade. Although several approaches to solving this problem have been proposed, the majority of them are no longer based on l…
▽ More
The presence of a high number of zero flow trades continues to provide a challenge in identifying gravity parameters to explain international trade using the gravity model. Linear regression with a logarithmic linear equation encounters an indefinite value on the logarithmic trade. Although several approaches to solving this problem have been proposed, the majority of them are no longer based on linear regression, making the process of finding solutions more complex. In this work, we suggest a two-step technique for determining the gravity parameters: first, perform linear regression locally to establish a dummy value to substitute trade flow zero, and then estimating the gravity parameters. Iterative techniques are used to determine the optimum parameters. Machine learning is used to test the estimated parameters by analyzing their position in the cluster. We calculated international trade figures for 2004, 2009, 2014, and 2019. We just examine the classic gravity equation and discover that the powers of GDP and distance are in the same cluster and are both worth roughly one. The strategy presented here can be used to solve other problems involving log-linear regression.
△ Less
Submitted 11 August, 2023;
originally announced August 2023.
-
An Information-Theoretic Analysis of Self-supervised Discrete Representations of Speech
Authors:
Badr M. Abdullah,
Mohammed Maqsood Shaik,
Bernd Möbius,
Dietrich Klakow
Abstract:
Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract phonetic categories such as phonemes. In this paper, we develop an information-theoretic framework whereby we represent each phonetic category as a dis…
▽ More
Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract phonetic categories such as phonemes. In this paper, we develop an information-theoretic framework whereby we represent each phonetic category as a distribution over discrete units. We then apply our framework to two different self-supervised models (namely wav2vec 2.0 and XLSR) and use American English speech as a case study. Our study demonstrates that the entropy of phonetic distributions reflects the variability of the underlying speech sounds, with phonetically similar sounds exhibiting similar distributions. While our study confirms the lack of direct, one-to-one correspondence, we find an intriguing, indirect relationship between phonetic categories and discrete units.
△ Less
Submitted 4 June, 2023;
originally announced June 2023.
-
Distance-regular graphs with exactly one positive $q$-distance eigenvalue
Authors:
Jack H. Koolen,
Mamoon Abdullah,
Brhane Gebremichel,
Sakander Hayat
Abstract:
In this paper, we study the $q$-distance matrix for a distance-regular graph and show that the $q$-distance matrix of a distance-regular graph with classical parameters ($D, q, α, β$) has exactly three distinct eigenvalues, of which one is zero. Moreover, we study distance-regular graphs whose $q$-distance matrix has exactly one positive eigenvalue.
In this paper, we study the $q$-distance matrix for a distance-regular graph and show that the $q$-distance matrix of a distance-regular graph with classical parameters ($D, q, α, β$) has exactly three distinct eigenvalues, of which one is zero. Moreover, we study distance-regular graphs whose $q$-distance matrix has exactly one positive eigenvalue.
△ Less
Submitted 23 May, 2023;
originally announced May 2023.
-
Affective social anthropomorphic intelligent system
Authors:
Md. Adyelullahil Mamun,
Hasnat Md. Abdullah,
Md. Golam Rabiul Alam,
Muhammad Mehedi Hassan,
Md. Zia Uddin
Abstract:
Human conversational styles are measured by the sense of humor, personality, and tone of voice. These characteristics have become essential for conversational intelligent virtual assistants. However, most of the state-of-the-art intelligent virtual assistants (IVAs) are failed to interpret the affective semantics of human voices. This research proposes an anthropomorphic intelligent system that ca…
▽ More
Human conversational styles are measured by the sense of humor, personality, and tone of voice. These characteristics have become essential for conversational intelligent virtual assistants. However, most of the state-of-the-art intelligent virtual assistants (IVAs) are failed to interpret the affective semantics of human voices. This research proposes an anthropomorphic intelligent system that can hold a proper human-like conversation with emotion and personality. A voice style transfer method is also proposed to map the attributes of a specific emotion. Initially, the frequency domain data (Mel-Spectrogram) is created by converting the temporal audio wave data, which comprises discrete patterns for audio features such as notes, pitch, rhythm, and melody. A collateral CNN-Transformer-Encoder is used to predict seven different affective states from voice. The voice is also fed parallelly to the deep-speech, an RNN model that generates the text transcription from the spectrogram. Then the transcripted text is transferred to the multi-domain conversation agent using blended skill talk, transformer-based retrieve-and-generate generation strategy, and beam-search decoding, and an appropriate textual response is generated. The system learns an invertible mapping of data to a latent space that can be manipulated and generates a Mel-spectrogram frame based on previous Mel-spectrogram frames to voice synthesize and style transfer. Finally, the waveform is generated using WaveGlow from the spectrogram. The outcomes of the studies we conducted on individual models were auspicious. Furthermore, users who interacted with the system provided positive feedback, demonstrating the system's effectiveness.
△ Less
Submitted 19 April, 2023;
originally announced April 2023.
-
Revealing the higher-order spin nature of the Hall effect in non-collinear antiferromagnet $\mathrm{Mn_3Ni_{0.35}Cu_{0.65}N}$
Authors:
Adithya Rajan,
Tom G. Saunderson,
Fabian R. Lux,
Rocío Yanes Díaz,
Hasan M. Abdullah,
Arnab Bose,
Beatrice Bednarz,
Jun-Young Kim,
Dongwook Go,
Tetsuya Hajiri,
Gokaran Shukla,
Olena Gomonay,
Yugui Yao,
Wanxiang Feng,
Hidefumi Asano,
Udo Schwingenschlögl,
Luis López-Díaz,
Jairo Sinova,
Yuriy Mokrousov,
Aurélien Manchon,
Mathias Kläui
Abstract:
Ferromagnets generate an anomalous Hall effect even without the presence of a magnetic field, something that conventional antiferromagnets cannot replicate but noncollinear antiferromagnets can. The anomalous Hall effect governed by the resistivity tensor plays a crucial role in determining the presence of time reversal symmetry and the topology present in the system. In this work we reveal the co…
▽ More
Ferromagnets generate an anomalous Hall effect even without the presence of a magnetic field, something that conventional antiferromagnets cannot replicate but noncollinear antiferromagnets can. The anomalous Hall effect governed by the resistivity tensor plays a crucial role in determining the presence of time reversal symmetry and the topology present in the system. In this work we reveal the complex origin of the anomalous Hall effect arising in noncollinear antiferromagnets by performing Hall measurements with fields applied in selected directions in space with respect to the crystalline axes. Our coplanar magnetic field geometry goes beyond the conventional perpendicular field geometry used for ferromagnets and allows us to suppress any magnetic dipole contribution. It allows us to map the in-plane anomalous Hall contribution and we demonstrate a 120$^\circ$ symmetry which we find to be governed by the octupole moment at high fields. At low fields we subsequently discover a surprising topological Hall-like signature and, from a combination of theoretical techniques, we show that the spins can be recast into dipole, emergent octupole and noncoplanar effective magnetic moments. These co-existing orders enable magnetization dynamics unachievable in either ferromagnetic or conventional collinear antiferromagnetic materials.
△ Less
Submitted 21 April, 2023;
originally announced April 2023.
-
Buffalo Genome Projects: Current Situation and Future Perspective in Improving Breeding Programs
Authors:
Ahmed M. Mousbah,
Hesham M. Abdullah,
Waleed S. Mohammed,
Ali M. El-Refy,
Mohamed Helmy
Abstract:
Buffaloes are farm animals that contribute to food security by providing high quality meat and milk. They can better tolerate the adverse effects of global climate change on their meat and milk production. Despite their advantages, buffaloes are heavily neglected animals with fewer studies compared to other farm animals, hence, the real potential of buffaloes has never been realized. The complete…
▽ More
Buffaloes are farm animals that contribute to food security by providing high quality meat and milk. They can better tolerate the adverse effects of global climate change on their meat and milk production. Despite their advantages, buffaloes are heavily neglected animals with fewer studies compared to other farm animals, hence, the real potential of buffaloes has never been realized. The complete genome sequencing projects of buffaloes are essential to better understanding the buffalos biology and production since they allow scientists to identify important genes and understand how the gene networks interact to determine the critical features of buffaloes. The genome projects are also valuable for gaining better knowledge of growth, development, maintenance, and determining factors associated with increased meat and milk production. Furthermore, having access to a complete genome of high quality and comprehensive annotations provides a powerful tool in breeding programs. The current review surveyed the publicly available buffalo genome projects and studied the impact of incorporating genomic selection into the buffalo breeding program. Our survey of the publicly available buffalo genome projects showed the promise of genomic selection in developing water buffalo science and technology for food security on a global scale.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
Developing the Reliable Shallow Supervised Learning for Thermal Comfort using ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II
Authors:
Kanisius Karyono,
Badr M. Abdullah,
Alison J. Cotgrave,
Ana Bras,
Jeff Cullen
Abstract:
The artificial intelligence (AI) system designer for thermal comfort faces insufficient data recorded from the current user or overfitting due to unreliable training data. This work introduces the reliable data set for training the AI subsystem for thermal comfort. This paper presents the control algorithm based on shallow supervised learning, which is simple enough to be implemented in the Intern…
▽ More
The artificial intelligence (AI) system designer for thermal comfort faces insufficient data recorded from the current user or overfitting due to unreliable training data. This work introduces the reliable data set for training the AI subsystem for thermal comfort. This paper presents the control algorithm based on shallow supervised learning, which is simple enough to be implemented in the Internet of Things (IoT) system for residential usage using ASHRAE RP-884 and ASHRAE Global Thermal Comfort Database II. No training data for thermal comfort is available as reliable as this dataset, but the direct use of this data can lead to overfitting. This work offers the algorithm for data filtering and semantic data augmentation for the ASHRAE database for the supervised learning process. Overfitting always becomes a problem due to the psychological aspect involved in the thermal comfort decision. The method to check the AI system based on the psychrometric chart against overfitting is presented. This paper also assesses the most important parameters needed to achieve human thermal comfort. This method can support the development of reinforced learning for thermal comfort.
△ Less
Submitted 3 March, 2023;
originally announced March 2023.
-
Magnetic Lens Made of a Single Solenoid for Controlling Bending of Two-Dimensional Ion Beam
Authors:
Ardi Khalifah,
Riri Murniati,
Mikrajuddin Abdullah
Abstract:
The magnetic field inside an ideal solenoid cavity with an arbitrary cross-section is always constant, while it is always zero outside the solenoid. We can make a solenoid lens that can focus a parallel beam to a point behind it by adjusting the curvature of the solenoid circumference. In this paper, we discuss the design of magnetic lenses ranging from simple geometries to the general ones. We di…
▽ More
The magnetic field inside an ideal solenoid cavity with an arbitrary cross-section is always constant, while it is always zero outside the solenoid. We can make a solenoid lens that can focus a parallel beam to a point behind it by adjusting the curvature of the solenoid circumference. In this paper, we discuss the design of magnetic lenses ranging from simple geometries to the general ones. We discovered that there are an infinite number of curvatures that can be used to focus the parallel beam to a specific focal point. Using this property, we also present the concept of a simple mass spectrometer by measuring the intensity of the ion captured by a detector placed at the focal point. This result is expected to enrich learning material in undergraduate courses, especially for the topic of electricity and magnetism.
△ Less
Submitted 23 January, 2023;
originally announced February 2023.
-
Multi objective Fitness Dependent Optimizer Algorithm
Authors:
Jaza M. Abdullah,
Tarik A. Rashid,
Bestan B. Maaroof,
Seyedali Mirjalili
Abstract:
This paper proposes the multi objective variant of the recently introduced fitness dependent optimizer (FDO). The algorithm is called a Multi objective Fitness Dependent Optimizer (MOFDO) and is equipped with all five types of knowledge (situational, normative, topographical, domain, and historical knowledge) as in FDO. MOFDO is tested on two standard benchmarks for the performance-proof purpose;…
▽ More
This paper proposes the multi objective variant of the recently introduced fitness dependent optimizer (FDO). The algorithm is called a Multi objective Fitness Dependent Optimizer (MOFDO) and is equipped with all five types of knowledge (situational, normative, topographical, domain, and historical knowledge) as in FDO. MOFDO is tested on two standard benchmarks for the performance-proof purpose; classical ZDT test functions, which is a widespread test suite that takes its name from its authors Zitzler, Deb, and Thiele, and on IEEE Congress of Evolutionary Computation benchmark (CEC 2019) multi modal multi objective functions. MOFDO results are compared to the latest variant of multi objective particle swarm optimization (MOPSO), non-dominated sorting genetic algorithm third improvement (NSGA-III), and multi objective dragonfly algorithm (MODA). The comparative study shows the superiority of MOFDO in most cases and comparative results in other cases. Moreover, MOFDO is used for optimizing real-world engineering problems (e.g., welded beam design problems). It is observed that the proposed algorithm successfully provides a wide variety of well-distributed feasible solutions, which enable the decision-makers to have more applicable-comfort choices to consider.
△ Less
Submitted 26 January, 2023;
originally announced February 2023.
-
Turkish Inflation, Private Debt & how to overcome it
Authors:
Mahmood Abdullah
Abstract:
The thing about inflation is that it ravages your income if you don not keep up with it and you do not know when it will stop.
The thing about inflation is that it ravages your income if you don not keep up with it and you do not know when it will stop.
△ Less
Submitted 17 January, 2023;
originally announced January 2023.