Skip to main content

Showing 201–250 of 4,434 results for author: Mohammed

.
  1. Forecasting Empty Container availability for Vehicle Booking System Application

    Authors: Arthur Cartel Foahom Gouabou, Mohammed Al-Kharaz, Faouzi Hakimi, Tarek Khaled, Kenza Amzil

    Abstract: Container terminals, pivotal nodes in the network of empty container movement, hold significant potential for enhancing operational efficiency within terminal depots through effective collaboration between transporters and terminal operators. This collaboration is crucial for achieving optimization, leading to streamlined operations and reduced congestion, thereby benefiting both parties. Conseque… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Journal ref: Procedia Computer Science, 2024, 246, pp.3103-3112

  2. arXiv:2503.11700  [pdf

    stat.AP

    Comparative Study of the Median Based Unit Rayleigh and its Generalized Form the Generalized Odd Median Based Unit Rayleigh

    Authors: Iman Mohammed Attia

    Abstract: In the present paper, the author discusses the Generalized Odd Median Base Unit Rayleigh (GOMBUR) in relation to the Median Based Unit Rayleigh (MBUR) to evaluate the additive value of the new shape parameter on the estimation process as regards validity indices, goodness of fit statistics, estimated variances of the estimated parameters and their standard errors. This evaluation is conducted on r… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  3. arXiv:2503.11668  [pdf

    stat.AP math.PR

    The New Generalized Odd Median Based Unit Rayleigh with a New Shape Oscillating Hazard Rate Function

    Authors: Iman Attia

    Abstract: In this paper, the author presents the generalized form of the Median-Based Unit Rayleigh (MBUR) distribution, a novel statistical distribution that is specifically defined within the interval (0, 1) expressing oscillating hazard rate function. This generalization adds a new parameter to the MBUR distribution that significantly addresses the unique characteristics of data represented as ratios and… ▽ More

    Submitted 24 February, 2025; originally announced March 2025.

  4. mobilityDCAT-AP: a Metadata Specification for Enhanced Cross-border Mobility Data Sharing

    Authors: Mario Scrocca, Lina Molinas Comet, Benjamin Witsch, Daham Mohammed Mustafa, Christoph Lange, Marco Comerio, Peter Lubrich

    Abstract: Integrated and efficient mobility requires data sharing among the involved stakeholders. In this direction, regulators and transport authorities have been defining policies to foster the digitalisation and online publication of mobility data. However, the creation of several heterogeneous data portals for mobility data resulted in a fragmented ecosystem that challenges data accessibility. In this… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: Paper accepted for publication at the 22th Extended Semantic Web Conference (ESWC) 2025. This preprint has not undergone peer review or any post-submission improvements or corrections. The Version of Record of this contribution will be published in the conference proceedings

  5. Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy

    Authors: Sedir Mohammed, Felix Naumann, Hazar Harmouch

    Abstract: Data quality is crucial in machine learning (ML) applications, as errors in the data can significantly impact the prediction accuracy of the underlying ML model. Therefore, data cleaning is an integral component of any ML pipeline. However, in practical scenarios, data cleaning incurs significant costs, as it often involves domain experts for configuring and executing the cleaning process. Thus, e… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Journal ref: Proceedings 28th International Conference on Extending Database Technology (EDBT) 2025, Barcelona, Spain, March 25-28, 2025, 542-554

  6. arXiv:2503.10953  [pdf, ps, other

    eess.SY

    Safe Control of Second-Order Systems with Linear Constraints

    Authors: Mohammed Alyaseen, Nikolay Atanasov, Jorge Cortes

    Abstract: Control barrier functions (CBFs) offer a powerful tool for enforcing safety specifications in control synthesis. This paper deals with the problem of constructing valid CBFs. Given a second-order system and any desired safety set with linear boundaries in the position space, we construct a provably control-invariant subset of this desired safety set. The constructed subset does not sacrifice any p… ▽ More

    Submitted 13 March, 2025; originally announced March 2025.

  7. arXiv:2503.09293  [pdf, other

    cs.CV

    Better Together: Unified Motion Capture and 3D Avatar Reconstruction

    Authors: Arthur Moreau, Mohammed Brahimi, Richard Shaw, Athanasios Papaioannou, Thomas Tanay, Zhensong Zhang, Eduardo Pérez-Pellitero

    Abstract: We present Better Together, a method that simultaneously solves the human pose estimation problem while reconstructing a photorealistic 3D human avatar from multi-view videos. While prior art usually solves these problems separately, we argue that joint optimization of skeletal motion with a 3D renderable body model brings synergistic effects, i.e. yields more precise motion capture and improved v… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: 14 pages, 6 figures

  8. arXiv:2503.08358  [pdf, ps, other

    cs.RO

    DG16M: A Large-Scale Dataset for Dual-Arm Grasping with Force-Optimized Grasps

    Authors: Md Faizal Karim, Mohammed Saad Hashmi, Shreya Bollimuntha, Mahesh Reddy Tapeti, Gaurav Singh, Nagamanikandan Govindan, K Madhava Krishna

    Abstract: Dual-arm robotic grasping is crucial for handling large objects that require stable and coordinated manipulation. While single-arm grasping has been extensively studied, datasets tailored for dual-arm settings remain scarce. We introduce a large-scale dataset of 16 million dual-arm grasps, evaluated under improved force-closure constraints. Additionally, we develop a benchmark dataset containing 3… ▽ More

    Submitted 30 June, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  9. arXiv:2503.07912  [pdf, ps, other

    math.AP

    Non-homogeneous problem for the fractional wave equation with irregular coefficients and data

    Authors: Manel Bouguenna, Mohammed Elamine Sebih

    Abstract: In this paper, we consider the Cauchy problem for a non-homogeneous wave equation generated by the fractional Laplacian and involving different kinds of lower order terms. We allow the equation coefficients and data to be of distributional type or less regular, having in mind the Dirac delta function and its powers, and we prove that the problem is well-posed in the sense of the concept of very we… ▽ More

    Submitted 12 March, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: substantial text overlap with arXiv:2311.01639

    MSC Class: 35L81; 35L05; 35D30; 35A35

  10. arXiv:2503.07647  [pdf, other

    cs.LG physics.ao-ph physics.data-an

    On the Importance of Clearsky Model in Short-Term Solar Radiation Forecasting

    Authors: Cyril Voyant, Milan Despotovic, Gilles Notton, Yves-Marie Saint-Drenan, Mohammed Asloune, Luis Garcia-Gutierrez

    Abstract: Clearsky models are widely used in solar energy for many applications such as quality control, resource assessment, satellite-base irradiance estimation and forecasting. However, their use in forecasting and nowcasting is associated with a number of challenges. Synchronization errors, reliance on the Clearsky index (ratio of the global horizontal irradiance to its cloud-free counterpart) and high… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 20 pages, 10 Figures and 1 Table

  11. arXiv:2503.07450  [pdf, ps, other

    cs.AI

    From Idea to Implementation: Evaluating the Influence of Large Language Models in Software Development -- An Opinion Paper

    Authors: Sargam Yadav, Asifa Mehmood Qureshi, Abhishek Kaushik, Shubham Sharma, Roisin Loughran, Subramaniam Kazhuparambil, Andrew Shaw, Mohammed Sabry, Niamh St John Lynch, . Nikhil Singh, Padraic O'Hara, Pranay Jaiswal, Roshan Chandru, David Lillis

    Abstract: The introduction of transformer architecture was a turning point in Natural Language Processing (NLP). Models based on the transformer architecture such as Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-Trained Transformer (GPT) have gained widespread popularity in various applications such as software development and education. The availability of Large Language… ▽ More

    Submitted 13 June, 2025; v1 submitted 10 March, 2025; originally announced March 2025.

    Comments: The project is partially supported by the DkIT Postgraduate Scholarship, Research Ireland under Grant number 13/RC/2094_2, and Grant number 21/FFP-A/925

  12. arXiv:2503.06985  [pdf, other

    cs.LG

    Learning Decision Trees as Amortized Structure Inference

    Authors: Mohammed Mahfoud, Ghait Boukachab, Michał Koziarski, Alex Hernandez-Garcia, Stefan Bauer, Yoshua Bengio, Nikolay Malkin

    Abstract: Building predictive models for tabular data presents fundamental challenges, notably in scaling consistently, i.e., more resources translating to better performance, and generalizing systematically beyond the training data distribution. Designing decision tree models remains especially challenging given the intractably large search space, and most existing methods rely on greedy heuristics, while… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Code: $\href{https://github.com/GFNOrg/dt-gfn}{https://github.com/GFNOrg/dt-gfn}$

  13. arXiv:2503.05951  [pdf, other

    cs.AR cs.AI

    TPU-Gen: LLM-Driven Custom Tensor Processing Unit Generator

    Authors: Deepak Vungarala, Mohammed E. Elbtity, Sumiya Syed, Sakila Alam, Kartik Pandit, Arnob Ghosh, Ramtin Zand, Shaahin Angizi

    Abstract: The increasing complexity and scale of Deep Neural Networks (DNNs) necessitate specialized tensor accelerators, such as Tensor Processing Units (TPUs), to meet various computational and energy efficiency requirements. Nevertheless, designing optimal TPU remains challenging due to the high domain expertise level, considerable manual design time, and lack of high-quality, domain-specific datasets. T… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 8 Pages, 9 Figures, 5 Tables

  14. arXiv:2503.05706  [pdf, other

    cs.CY stat.AP

    The Impact of Building-Induced Visibility Restrictions on Intersection Accidents

    Authors: Hanlin Tian, Yuxiang Feng, Wei Zhou, Anupriya, Mohammed Quddus, Yiannis Demiris, Panagiotis Angeloudis

    Abstract: Traffic accidents, especially at intersections, are a major road safety concern. Previous research has extensively studied intersection-related accidents, but the effect of building-induced visibility restrictions at intersections on accident rates has been under-explored, particularly in urban contexts. Using OpenStreetMap data, the UK's geographic and accident datasets, and the UK Traffic Count… ▽ More

    Submitted 13 February, 2025; originally announced March 2025.

    Comments: TRBAM-24-02409

  15. arXiv:2503.05516  [pdf

    cs.CY cs.AI cs.CL cs.HC

    Cognitive Bias Detection Using Advanced Prompt Engineering

    Authors: Frederic Lemieux, Aisha Behr, Clara Kellermann-Bryant, Zaki Mohammed

    Abstract: Cognitive biases, systematic deviations from rationality in judgment, pose significant challenges in generating objective content. This paper introduces a novel approach for real-time cognitive bias detection in user-generated text using large language models (LLMs) and advanced prompt engineering techniques. The proposed system analyzes textual data to identify common cognitive biases such as con… ▽ More

    Submitted 7 March, 2025; originally announced March 2025.

    Comments: 17 pages. 6 Figures, 2 Tables

  16. arXiv:2503.05135  [pdf, other

    math.CO

    Characterizing the positive inertia index of connected signed graphs in terms of girth

    Authors: Suliman Khan, Sakander Hayat, Mohammed J. F. Alenazi

    Abstract: Let $G^σ=(G,σ)$ be a connected signed graph and $A(G^σ)$ be its adjacency matrix. The positive inertia index of $G^σ$, denoted by $p^{+}(G^σ)$, is defined as the number of positive eigenvalues of $A(G^σ)$. Assume that $G^σ$ contains at least one cycle, and let $g_{r}$ be its girth. In this paper, we prove $p^{+}(G^σ) \geq \lceil \frac {g_{r}}{2} \rceil-1$ for a signed graph $G^σ$. The extremal sig… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 10 pages, 2 figures

    MSC Class: 05C22; 05C50

  17. arXiv:2503.05002  [pdf, other

    cond-mat.stat-mech cond-mat.str-el

    Magnetic Phase Transitions and Mixed Spin in Double Perovskite $Sr_{2}FeMoO_{6}$

    Authors: Said Khaireddine, Redouane Assad, Mohammed El Falaki, Rachid Ahl Lamara, Lalla Btissam Drissi

    Abstract: The magnetic properties of the double perovskite oxide $Sr_{2}$FeMo$O_{6}$ are analyzed using a mixed-spin Ising model with spins $\left( \frac{1}{2},\frac{5}{2}\right) $ in the presence of a random crystal field $Δ$ and exchange interactions $ J $ on a three-dimensional (3D) cubic lattice. The study employs both the Mean-Field Approximation (MFA) based on the Bogoliubov inequality for Gibbs free… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  18. arXiv:2503.04748  [pdf

    cs.CY

    Large Language Models in Healthcare

    Authors: Mohammed Al-Garadi, Tushar Mungle, Abdulaziz Ahmed, Abeed Sarker, Zhuqi Miao, Michael E. Matheny

    Abstract: Large language models (LLMs) hold promise for transforming healthcare, from streamlining administrative and clinical workflows to enriching patient engagement and advancing clinical decision-making. However, their successful integration requires rigorous development, adaptation, and evaluation strategies tailored to clinical needs. In this Review, we highlight recent advancements, explore emerging… ▽ More

    Submitted 2 April, 2025; v1 submitted 6 February, 2025; originally announced March 2025.

  19. arXiv:2503.04724  [pdf, other

    cs.CL

    LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

    Authors: Sambal Shikhar, Mohammed Irfan Kurpath, Sahal Shaji Mullappilly, Jean Lahoud, Fahad Khan, Rao Muhammad Anwer, Salman Khan, Hisham Cholakkal

    Abstract: Recent advancements in speech-to-speech dialogue systems leverage LLMs for multimodal interactions, yet they remain hindered by fine-tuning requirements, high computational overhead, and text-speech misalignment. Existing speech-enabled LLMs often degrade conversational quality by modifying the LLM, thereby compromising its linguistic capabilities. In contrast, we propose LLMVoX, a lightweight 30M… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  20. arXiv:2503.03929  [pdf, ps, other

    math.OC math.FA

    Kantorovich duality for optimal transport on completely regular Hausdorff spaces

    Authors: Mohammed Bachir

    Abstract: We introduce a new intermediate optimization problem situated between Kantorovich's primal and dual formulations. This new problem extends Kantorovich's duality to separable Baire measures, which are strictly more general than tight (or Radon) measures in completely regular Hausdorff spaces. In the special case where the measures are Radon, our intermediate problem aligns with the classical Kantor… ▽ More

    Submitted 19 June, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    MSC Class: 49Q22; 46N10; 49N15; 90C46; 28C05

  21. arXiv:2503.03391  [pdf, other

    cs.LG cs.AI

    Multi-Agent DRL for Queue-Aware Task Offloading in Hierarchical MEC-Enabled Air-Ground Networks

    Authors: Muhammet Hevesli, Abegaz Mohammed Seid, Aiman Erbad, Mohamed Abdallah

    Abstract: Mobile edge computing (MEC)-enabled air-ground networks are a key component of 6G, employing aerial base stations (ABSs) such as unmanned aerial vehicles (UAVs) and high-altitude platform stations (HAPS) to provide dynamic services to ground IoT devices (IoTDs). These IoTDs support real-time applications (e.g., multimedia and Metaverse services) that demand high computational resources and strict… ▽ More

    Submitted 5 March, 2025; originally announced March 2025.

  22. arXiv:2503.03132  [pdf, other

    cs.CV

    Dynamic Neural Surfaces for Elastic 4D Shape Representation and Analysis

    Authors: Awais Nizamani, Hamid Laga, Guanjin Wang, Farid Boussaid, Mohammed Bennamoun, Anuj Srivastava

    Abstract: We propose a novel framework for the statistical analysis of genus-zero 4D surfaces, i.e., 3D surfaces that deform and evolve over time. This problem is particularly challenging due to the arbitrary parameterizations of these surfaces and their varying deformation speeds, necessitating effective spatiotemporal registration. Traditionally, 4D surfaces are discretized, in space and time, before comp… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: 22 pages, 23 figures, conference paper

    Journal ref: CVPR 2025

  23. arXiv:2503.02968  [pdf, other

    cs.LG cs.CR

    Privacy-Preserving Fair Synthetic Tabular Data

    Authors: Fatima J. Sarmin, Atiquer R. Rahman, Christopher J. Henry, Noman Mohammed

    Abstract: Sharing of tabular data containing valuable but private information is limited due to legal and ethical issues. Synthetic data could be an alternative solution to this sharing problem, as it is artificially generated by machine learning algorithms and tries to capture the underlying data distribution. However, machine learning models are not free from memorization and may introduce biases, as they… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

  24. arXiv:2503.02860  [pdf, other

    hep-ex

    PileUp Mitigation at the HL-LHC Using Attention for Event-Wide Context

    Authors: Luke Vaughan, Mohammed Rakib, Shivang Patel, Flera Rizatdinova, Alexander Khanov, Arunkumar Bagavathi

    Abstract: The Large Hadron Collider, LHC, collides bunches of protons resulting in multiple interactions that occur practically simultaneously. This creates a pileup effect that distorts physics measurements due to the products of pileup collisions. In order to improve the discovery potential of the LHC, it is necessary to mitigate the effect of pileup interactions on the processes of interest. In this pape… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: Accepted to PAKDD2025. 12 Pages. 6 Figures

  25. Branching fraction measurement of the decay $B^+ \to ψ(2S) φ(1020) K^+$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis, L. An , et al. (1128 additional authors not shown)

    Abstract: The branching fraction of the decay $B^+\to ψ(2S)φ(1020)K^+$, relative to the topologically similar decay $B^+\to J/ψφ(1020) K^+$, is measured using proton-proton collision data collected by the LHCb experiment at center-of-mass energies of 7, 8, and 13 TeV, corresponding to an integrated luminosity of $9\,\mathrm{fb}^{-1}$. The ratio is found to be $0.061 \pm 0.004 \pm 0.009$, where the first unc… ▽ More

    Submitted 14 May, 2025; v1 submitted 4 March, 2025; originally announced March 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3320/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-039, CERN-EP-2025-011

    Journal ref: Phys. Rev. D 111 (2025) 092008

  26. Remote Sensing Image Classification Using Convolutional Neural Network (CNN) and Transfer Learning Techniques

    Authors: Mustafa Majeed Abd Zaid, Ahmed Abed Mohammed, Putra Sumari

    Abstract: This study investigates the classification of aerial images depicting transmission towers, forests, farmland, and mountains. To complete the classification job, features are extracted from input photos using a Convolutional Neural Network (CNN) architecture. Then, the images are classified using Softmax. To test the model, we ran it for ten epochs using a batch size of 90, the Adam optimizer, and… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: This paper is published in Journal of Computer Science, Volume 21 No. 3, 2025. It contains 635-645 pages

    Journal ref: J. Comput. Sci., 21(3), 635-645, 2025

  27. arXiv:2503.02152  [pdf, other

    cs.LG cs.CL

    Tabby: Tabular Data Synthesis with Language Models

    Authors: Sonia Cromp, Satya Sai Srinath Namburi GNVV, Mohammed Alkhudhayri, Catherine Cao, Samuel Guo, Nicholas Roberts, Frederic Sala

    Abstract: While advances in large language models (LLMs) have greatly improved the quality of synthetic text data in recent years, synthesizing tabular data has received relatively less attention. We address this disparity with Tabby, a simple but powerful post-training modification to the standard Transformer language model architecture, enabling its use for tabular dataset synthesis. Tabby enables the rep… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 21 pages, 8 figures

    ACM Class: I.2.6

  28. PhishVQC: Optimizing Phishing URL Detection with Correlation Based Feature Selection and Variational Quantum Classifier

    Authors: Md. Farhan Shahriyar, Gazi Tanbhir, Abdullah Md Raihan Chy, Mohammed Abdul Al Arafat Tanzin, Md. Jisan Mashrafi

    Abstract: Phishing URL detection is crucial in cybersecurity as malicious websites disguise themselves to steal sensitive infor mation. Traditional machine learning techniques struggle to per form well in complex real-world scenarios due to large datasets and intricate patterns. Motivated by quantum computing, this paper proposes using Variational Quantum Classifiers (VQC) to enhance phishing URL detection.… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: This paper has been accepted and presented at the 3rd International Conference on Intelligent Systems Advanced Computing and Communication (ISACC 2025)

  29. arXiv:2503.01650  [pdf, other

    cs.LG cs.RO

    CAPS: Context-Aware Priority Sampling for Enhanced Imitation Learning in Autonomous Driving

    Authors: Hamidreza Mirkhani, Behzad Khamidehi, Ehsan Ahmadi, Fazel Arasteh, Mohammed Elmahgiubi, Weize Zhang, Umar Rajguru, Kasra Rezaee

    Abstract: In this paper, we introduce CAPS (Context-Aware Priority Sampling), a novel method designed to enhance data efficiency in learning-based autonomous driving systems. CAPS addresses the challenge of imbalanced training datasets in imitation learning by leveraging Vector Quantized Variational Autoencoders (VQ-VAEs). The use of VQ-VAE provides a structured and interpretable data representation, which… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  30. arXiv:2503.01493  [pdf, other

    cs.CL

    Llama-3.1-Sherkala-8B-Chat: An Open Large Language Model for Kazakh

    Authors: Fajri Koto, Rituraj Joshi, Nurdaulet Mukhituly, Yuxia Wang, Zhuohan Xie, Rahul Pal, Daniil Orel, Parvez Mullah, Diana Turmakhan, Maiya Goloburda, Mohammed Kamran, Samujjwal Ghosh, Bokang Jia, Jonibek Mansurov, Mukhammed Togmanov, Debopriyo Banerjee, Nurkhan Laiyk, Akhmed Sakip, Xudong Han, Ekaterina Kochmar, Alham Fikri Aji, Aaryamonvikram Singh, Alok Anil Jadhav, Satheesh Katipomu, Samta Kamboj , et al. (10 additional authors not shown)

    Abstract: Llama-3.1-Sherkala-8B-Chat, or Sherkala-Chat (8B) for short, is a state-of-the-art instruction-tuned open generative large language model (LLM) designed for Kazakh. Sherkala-Chat (8B) aims to enhance the inclusivity of LLM advancements for Kazakh speakers. Adapted from the LLaMA-3.1-8B model, Sherkala-Chat (8B) is trained on 45.3B tokens across Kazakh, English, Russian, and Turkish. With 8 billion… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Technical Report

  31. arXiv:2503.00151  [pdf, other

    cs.CL cs.AI

    Palm: A Culturally Inclusive and Linguistically Diverse Dataset for Arabic LLMs

    Authors: Fakhraddin Alwajih, Abdellah El Mekki, Samar Mohamed Magdy, Abdelrahim A. Elmadany, Omer Nacar, El Moatez Billah Nagoudi, Reem Abdel-Salam, Hanin Atwany, Youssef Nafea, Abdulfattah Mohammed Yahya, Rahaf Alhamouri, Hamzah A. Alsayadi, Hiba Zayed, Sara Shatnawi, Serry Sibaee, Yasir Ech-Chammakhy, Walid Al-Dhabyani, Marwa Mohamed Ali, Imen Jarraya, Ahmed Oumar El-Shangiti, Aisha Alraeesi, Mohammed Anwar Al-Ghrawi, Abdulrahman S. Al-Batati, Elgizouli Mohamed, Noha Taha Elgindi , et al. (19 additional authors not shown)

    Abstract: As large language models (LLMs) become increasingly integrated into daily life, ensuring their cultural sensitivity and inclusivity is paramount. We introduce our dataset, a year-long community-driven project covering all 22 Arab countries. The dataset includes instructions (input, response pairs) in both Modern Standard Arabic (MSA) and dialectal Arabic (DA), spanning 20 diverse topics. Built by… ▽ More

    Submitted 28 February, 2025; originally announced March 2025.

    Comments: More information about our dataset is available at our project page: https://github.com/UBC-NLP/palm

  32. arXiv:2502.20573  [pdf

    cs.CV cs.CL

    Visual Reasoning at Urban Intersections: FineTuning GPT-4o for Traffic Conflict Detection

    Authors: Sari Masri, Huthaifa I. Ashqar, Mohammed Elhenawy

    Abstract: Traffic control in unsignalized urban intersections presents significant challenges due to the complexity, frequent conflicts, and blind spots. This study explores the capability of leveraging Multimodal Large Language Models (MLLMs), such as GPT-4o, to provide logical and visual reasoning by directly using birds-eye-view videos of four-legged intersections. In this proposed method, GPT-4o acts as… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  33. arXiv:2502.20572  [pdf

    cs.CV cs.CL

    HazardNet: A Small-Scale Vision Language Model for Real-Time Traffic Safety Detection at Edge Devices

    Authors: Mohammad Abu Tami, Mohammed Elhenawy, Huthaifa I. Ashqar

    Abstract: Traffic safety remains a vital concern in contemporary urban settings, intensified by the increase of vehicles and the complicated nature of road networks. Traditional safety-critical event detection systems predominantly rely on sensor-based approaches and conventional machine learning algorithms, necessitating extensive data collection and complex training processes to adhere to traffic safety r… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  34. arXiv:2502.20245  [pdf, other

    cs.CL

    From Retrieval to Generation: Comparing Different Approaches

    Authors: Abdelrahman Abdallah, Jamshid Mozafari, Bhawna Piryani, Mohammed Ali, Adam Jatowt

    Abstract: Knowledge-intensive tasks, particularly open-domain question answering (ODQA), document reranking, and retrieval-augmented language modeling, require a balance between retrieval accuracy and generative flexibility. Traditional retrieval models such as BM25 and Dense Passage Retrieval (DPR), efficiently retrieve from large corpora but often lack semantic depth. Generative models like GPT-4-o provid… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: work on progress

  35. arXiv:2502.20027  [pdf

    cs.NE math.OC

    Modified FOX Optimizer for Solving optimization problems

    Authors: Dler O. Hasan, Hardi M. Mohammed, Zrar Khalid Abdul

    Abstract: The FOX optimizer, inspired by red fox hunting behavior, is a powerful algorithm for solving real-world and engineering problems. However, despite balancing exploration and exploitation, it can prematurely converge to local optima, as agent positions are updated solely based on the current best-known position, causing all agents to converge on one location. This study proposes the modified FOX opt… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 39 pages, 11 figures, 16 tables

  36. arXiv:2502.19048  [pdf

    cs.CV

    An Improved 3D Skeletons UP-Fall Dataset: Enhancing Data Quality for Efficient Impact Fall Detection

    Authors: Tresor Y. Koffi, Youssef Mourchid, Mohammed Hindawi, Yohan Dupuis

    Abstract: Detecting impact where an individual makes contact with the ground within a fall event is crucial in fall detection systems, particularly for elderly care where prompt intervention can prevent serious injuries. The UP-Fall dataset, a key resource in fall detection research, has proven valuable but suffers from limitations in data accuracy and comprehensiveness. These limitations cause confusion in… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 17th International Conference on Machine Vision (ICMV 2024) will take place in Edinburgh, UK during October 10-13, 2024

  37. arXiv:2502.19004  [pdf, other

    cs.NI cs.AI cs.GT

    A Multi-Agent DRL-Based Framework for Optimal Resource Allocation and Twin Migration in the Multi-Tier Vehicular Metaverse

    Authors: Nahom Abishu Hayla, A. Mohammed Seid, Aiman Erbad, Tilahun M. Getu, Ala Al-Fuqaha, Mohsen Guizani

    Abstract: Although multi-tier vehicular Metaverse promises to transform vehicles into essential nodes -- within an interconnected digital ecosystem -- using efficient resource allocation and seamless vehicular twin (VT) migration, this can hardly be achieved by the existing techniques operating in a highly dynamic vehicular environment, since they can hardly balance multi-objective optimization problems suc… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 15 pages, 16 figures

  38. arXiv:2502.18987  [pdf, other

    hep-ex

    Observation of a new charmed baryon decaying to $Ξ_c^+ π^- π^+$

    Authors: LHCb collaboration, R. Aaij, A. S. W. Abdelmotteleb, C. Abellan Beteta, F. Abudinén, T. Ackernley, A. A. Adefisoye, B. Adeva, M. Adinolfi, P. Adlarson, C. Agapopoulou, C. A. Aidala, Z. Ajaltouni, S. Akar, K. Akiba, P. Albicocco, J. Albrecht, F. Alessio, M. Alexander, Z. Aliouche, P. Alvarez Cartelle, R. Amalric, S. Amato, J. L. Amey, Y. Amhis , et al. (1135 additional authors not shown)

    Abstract: The $Ξ_c^+ π^- π^+$ spectrum is investigated using proton-proton collisions at a center-of-mass energy of 13TeV, corresponding to an integrated luminosity of 5.4fb$^{-1}$, collected by the LHCb experiment during 2016--2018. Four states are observed with high significance, and their masses and widths are measured to be \begin{align*} m[Ξ_c(2815)^{+}] &= 2816.65 \pm 0.03 \pm 0.03 \pm 0.23 ~\text{M… ▽ More

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: All figures and tables, along with machine-readable versions and any supplementary material and additional information, are available at https://lbfence.cern.ch/alcm/public/analysis/full-details/3080/ (LHCb public pages)

    Report number: LHCb-PAPER-2024-055, CERN-EP-2025-019

  39. arXiv:2502.17512  [pdf

    cs.LG physics.flu-dyn

    Learning multi-phase flow and transport in fractured porous media with auto-regressive and recurrent graph neural networks

    Authors: Mohammed Al Kobaisi, Wenjuan Zhang, Waleed Diab, Hadi Hajibeygi

    Abstract: In the past three decades, a wide array of computational methodologies and simulation frameworks has emerged to address the complexities of modeling multi-phase flow and transport processes in fractured porous media. The conformal mesh approaches which explicitly align the computational grid with fracture surfaces are considered by many to be the most accurate. However, such methods require excess… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  40. arXiv:2502.16797  [pdf, other

    cs.LG

    Forecasting Rare Language Model Behaviors

    Authors: Erik Jones, Meg Tong, Jesse Mu, Mohammed Mahfoud, Jan Leike, Roger Grosse, Jared Kaplan, William Fithian, Ethan Perez, Mrinank Sharma

    Abstract: Standard language model evaluations can fail to capture risks that emerge only at deployment scale. For example, a model may produce safe responses during a small-scale beta test, yet reveal dangerous information when processing billions of requests at deployment. To remedy this, we introduce a method to forecast potential risks across orders of magnitude more queries than we test during evaluatio… ▽ More

    Submitted 23 February, 2025; originally announced February 2025.

  41. arXiv:2502.16316  [pdf

    cs.SE

    Enhancing Collaboration for Software Engineers through Matching

    Authors: Nayaab Azim, Sadath Ullah Khan Mohammed, Evan Phaup, Adeyemi Aina

    Abstract: In recent years, the field of software engineering has experienced a considerable increase in demand for competent experts, resulting in an increased demand for platforms that connect software engineers and facilitate collaboration. In response to this necessity, in this paper we present a project to solve the lack of a proper one-stop connection platform for software engineers and promoting colla… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

    Comments: 5 pages

  42. arXiv:2502.15722  [pdf

    cs.IR cs.HC

    Open-Source Retrieval Augmented Generation Framework for Retrieving Accurate Medication Insights from Formularies for African Healthcare Workers

    Authors: Axum AI, :, J. Owoyemi, S. Abubakar, A. Owoyemi, T. O. Togunwa, F. C. Madubuko, S. Oyatoye, Z. Oyetolu, K. Akyea, A. O. Mohammed, A. Adebakin

    Abstract: Accessing accurate medication insights is vital for enhancing patient safety, minimizing errors, and supporting clinical decision-making. However, healthcare professionals in Africa often rely on manual and time-consuming processes to retrieve drug information, exacerbated by limited access to pharmacists due to brain drain and healthcare disparities. This paper presents "Drug Insights," an open-s… ▽ More

    Submitted 27 January, 2025; originally announced February 2025.

    Comments: 4 pages, 2 tables and 3 figures

  43. arXiv:2502.15698  [pdf, other

    cs.IR

    Developing an Artificial Intelligence Tool for Personalized Breast Cancer Treatment Plans based on the NCCN Guidelines

    Authors: Abdul M. Mohammed, Iqtidar Mansoor, Sarah Blythe, Dennis Trujillo

    Abstract: Cancer treatments require personalized approaches based on a patient's clinical condition, medical history, and evidence-based guidelines. The National Comprehensive Cancer Network (NCCN) provides frequently updated, complex guidelines through visuals like flowcharts and diagrams, which can be time consuming for oncologists to stay current with treatment protocols. This study presents an AI (Artif… ▽ More

    Submitted 5 January, 2025; originally announced February 2025.

  44. On-demand generation of entangled photons pairs in the telecom O-band from nanowire quantum dots

    Authors: Mohammed K. Alqedra, Chiao-Tzu Huang, Edith Yeung, Wen-Hao Chang, Sofiane Haffouz, Philip J. Poole, Dan Dalacu, Ali W. Elshaari, Val Zwiller

    Abstract: On-demand entangled photon pairs at telecom wavelengths are crucial for quantum communication, distributed quantum computing, and quantum-enhanced sensing and metrology. The O-band is particularly advantageous because of its minimal chromatic dispersion and transmission loss in optical fibers, making it well-suited for long-distance quantum networks. Site-controlled nanowire quantum dots have emer… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Journal ref: NanoLett.5c01130 (2025)

  45. arXiv:2502.13791  [pdf, ps, other

    cs.CL

    From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

    Authors: Nathanaël Carraz Rakotonirina, Mohammed Hamdy, Jon Ander Campos, Lucas Weber, Alberto Testoni, Marzieh Fadaee, Sandro Pezzelle, Marco Del Tredici

    Abstract: Large Language Models (LLMs) are increasingly used in working environments for a wide range of tasks, excelling at solving individual problems in isolation. However, are they also able to effectively collaborate over long-term interactions? To investigate this, we introduce MemoryCode, a synthetic multi-session dataset designed to test LLMs' ability to track and execute simple coding instructions… ▽ More

    Submitted 6 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Published as conference paper at ACL 2025

  46. arXiv:2502.13595  [pdf, ps, other

    cs.CL cs.AI cs.IR

    MMTEB: Massive Multilingual Text Embedding Benchmark

    Authors: Kenneth Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzemiński, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çağatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa , et al. (61 additional authors not shown)

    Abstract: Text embeddings are typically evaluated on a limited set of tasks, which are constrained by language, domain, and task diversity. To address these limitations and provide a more comprehensive evaluation, we introduce the Massive Multilingual Text Embedding Benchmark (MMTEB) - a large-scale, community-driven expansion of MTEB, covering over 500 quality-controlled evaluation tasks across 250+ langua… ▽ More

    Submitted 8 June, 2025; v1 submitted 19 February, 2025; originally announced February 2025.

    Comments: Accepted for ICLR: https://openreview.net/forum?id=zl3pfz4VCV

  47. Towards a Robust Quality Assurance Framework for Cloud Computing Environments

    Authors: Mohammed Alharbi, RJ Qureshi

    Abstract: Trends such as cloud computing raise issues regarding stable and uniform quality assurance and validation of software requirements. Current QA frameworks are poorly defined, often not automated, and lack the flexibility needed for on-demand, cloud based environments. These gaps lead to inconsistencies in service delivery, challenges in scaling organizational capacity, and internal and external ine… ▽ More

    Submitted 19 February, 2025; originally announced February 2025.

    Comments: 13 Pages

    Journal ref: International Journal of Software Engineering & Applications (IJSEA), online Feb. 2025; Vol. 16, No. 1, 2025, pp. 39-51

  48. arXiv:2502.13277  [pdf, other

    cs.LG cs.AI

    HyperGCL: Multi-Modal Graph Contrastive Learning via Learnable Hypergraph Views

    Authors: Khaled Mohammed Saifuddin, Shihao Ji, Esra Akbas

    Abstract: Recent advancements in Graph Contrastive Learning (GCL) have demonstrated remarkable effectiveness in improving graph representations. However, relying on predefined augmentations (e.g., node dropping, edge perturbation, attribute masking) may result in the loss of task-relevant information and a lack of adaptability to diverse input data. Furthermore, the selection of negative samples remains rar… ▽ More

    Submitted 26 February, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 9 pages, 2 figures

  49. arXiv:2502.12639  [pdf, other

    cond-mat.mes-hall

    Effect of laser field and magnetic flux on scattering in graphene quantum dots

    Authors: Mohammed El Azar, Ahmed Bouhlal, Hocine Bahlouli, Ahmed Jellal

    Abstract: We show how Dirac electrons interact with a graphene quantum dots (GQDs) when exposed to both a magnetic flux and circularly polarized light. After obtaining the solutions of the energy spectrum, we compute the scattering coefficients. These allow us to show how efficiently the electrons diffuse and how their probability density is distributed in space. Our results show that light polarization is… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

    Comments: 13 pages, 8 figures

  50. arXiv:2502.12458  [pdf, other

    cs.CL

    An Empirical Evaluation of Encoder Architectures for Fast Real-Time Long Conversational Understanding

    Authors: Annamalai Senthilnathan, Kristjan Arumae, Mohammed Khalilia, Zhengzheng Xing, Aaron R. Colak

    Abstract: Analyzing long text data such as customer call transcripts is a cost-intensive and tedious task. Machine learning methods, namely Transformers, are leveraged to model agent-customer interactions. Unfortunately, Transformers adhere to fixed-length architectures and their self-attention mechanism scales quadratically with input length. Such limitations make it challenging to leverage traditional Tra… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.