Search | arXiv e-print repository

Improving compiler support for SIMD offload using Arm Streaming SVE

Authors: Mohamed Husain Noor Mohamed, Adarsh Patil, Latchesar Ionkov, Eric Van Hensbergen

Abstract: The wider adoption of tightly coupled core-adjacent accelerators, such as Arm Scalable Matrix Extension (SME), hinges on lowering software programming complexity. In this paper, we focus on enabling the use of SME architecture in Streaming Scalable Vector Extension (SSVE) mode for workloads written in C/C++. While current compilers optimize loops for all types of SIMD instructions, these techniques primarily target vector units within the core and falter when applied to disaggregated, core-adjacent SIMD accelerators. Our goal is to enable the compiler to automatically generate code for such accelerators only when profitable. To this end, we investigate a path towards performant, precise, and repeatable computation offloading through two compiler ecosystems. We revisit LLVM compiler passes, MLIR transforms and their associated cost models, and heuristics. We hope that these insights can provide directions for evolving compiler capabilities towards automatic code generation for this next-generation vector processing paradigm.

Submitted 2 June, 2025; originally announced June 2025.

arXiv:2505.23595 [pdf]

DeepChest: Dynamic Gradient-Free Task Weighting for Effective Multi-Task Learning in Chest X-ray Classification

Authors: Youssef Mohamed, Noran Mohamed, Khaled Abouhashad, Feilong Tang, Sara Atito, Shoaib Jameel, Imran Razzak, Ahmed B. Zaky

Abstract: While Multi-Task Learning (MTL) offers inherent advantages in complex domains such as medical imaging by enabling shared representation learning, effectively balancing task contributions remains a significant challenge. This paper addresses this critical issue by introducing DeepChest, a novel, computationally efficient and effective dynamic task-weighting framework specifically designed for multi-label chest X-ray (CXR) classification. Unlike existing heuristic or gradient-based methods that often incur substantial overhead, DeepChest leverages a performance-driven weighting mechanism based on effective analysis of task-specific loss trends. Given a network architecture (e.g., ResNet18), our model-agnostic approach adaptively adjusts task importance without requiring gradient access, thereby significantly reducing memory usage and achieving a threefold increase in training speed. It can be easily applied to improve various state-of-the-art methods. Extensive experiments on a large-scale CXR dataset demonstrate that DeepChest not only outperforms state-of-the-art MTL methods by 7% in overall accuracy but also yields substantial reductions in individual task losses, indicating improved generalization and effective mitigation of negative transfer. The efficiency and performance gains of DeepChest pave the way for more practical and robust deployment of deep learning in critical medical diagnostic applications. The code is publicly available at https://github.com/youssefkhalil320/DeepChest-MTL

Submitted 29 May, 2025; originally announced May 2025.

arXiv:2501.13890 [pdf, ps, other]

Federated Granger Causality Learning for Interdependent Clients with State Space Representation

Authors: Ayush Mohanty, Nazal Mohamed, Paritosh Ramanan, Nagi Gebraeel

Abstract: Advanced sensors and IoT devices have improved the monitoring and control of complex industrial enterprises. They have also created an interdependent fabric of geographically distributed process operations (clients) across these enterprises. Granger causality is an effective approach to detect and quantify interdependencies by examining how one client's state affects others over time. Understanding these interdependencies captures how localized events, such as faults and disruptions, can propagate throughout the system, possibly causing widespread operational impacts. However, the large volume and complexity of industrial data pose challenges in modeling these interdependencies. This paper develops a federated approach to learning Granger causality. We utilize a linear state space system framework that leverages low-dimensional state estimates to analyze interdependencies. This addresses bandwidth limitations and the computational burden commonly associated with centralized data processing. We propose augmenting the client models with the Granger causality information learned by the server through a Machine Learning (ML) function. We examine the co-dependence between the augmented client and server models and reformulate the framework as a standalone ML algorithm providing conditions for its sublinear and linear convergence rates. We also study the convergence of the framework to a centralized oracle model. Moreover, we include a differential privacy analysis to ensure data security while preserving causal insights.
Using synthetic data, we conduct comprehensive experiments to demonstrate the robustness of our approach to perturbations in causality, the scalability to the size of communication, number of clients, and the dimensions of raw data. We also evaluate the performance on two real-world industrial control system datasets by reporting the volume of data saved by decentralization.

Submitted 29 May, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

Comments: Published as a conference paper at International Conference on Learning Representations (ICLR) 2025

arXiv:2501.13132 [pdf, other]

A Hierarchical Reinforcement Learning Framework for Multi-UAV Combat Using Leader-Follower Strategy

Authors: Jinhui Pang, Jinglin He, Noureldin Mohamed Abdelaal Ahmed Mohamed, Changqing Lin, Zhihui Zhang, Xiaoshuai Hao

Abstract: Multi-UAV air combat is a complex task involving multiple autonomous UAVs, an evolving field in both aerospace and artificial intelligence. This paper aims to enhance adversarial performance through collaborative strategies. Previous approaches predominantly discretize the action space into predefined actions, limiting UAV maneuverability and complex strategy implementation. Others simplify the problem to 1v1 combat, neglecting the cooperative dynamics among multiple UAVs. To address the high-dimensional challenges inherent in six-degree-of-freedom space and improve cooperation, we propose a hierarchical framework utilizing the Leader-Follower Multi-Agent Proximal Policy Optimization (LFMAPPO) strategy. Specifically, the framework is structured into three levels. The top level conducts a macro-level assessment of the environment and guides execution policy. The middle level determines the angle of the desired action. The bottom level generates precise action commands for the high-dimensional action space. Moreover, we optimize the state-value functions by assigning distinct roles with the leader-follower strategy to train the top-level policy; followers estimate the leader's utility, promoting effective cooperation among agents. Additionally, the incorporation of a target selector, aligned with the UAVs' posture, assesses the threat level of targets. Finally, simulation experiments validate the effectiveness of our proposed method.

Submitted 21 January, 2025; originally announced January 2025.

arXiv:2412.12143 [pdf, other]

Harnessing Transfer Learning from Swahili: Advancing Solutions for Comorian Dialects

Authors: Naira Abdou Mohamed, Zakarya Erraji, Abdessalam Bahafid, Imade Benelallam

Abstract: If today some African languages like Swahili have enough resources to develop high-performing Natural Language Processing (NLP) systems, many other languages spoken on the continent are still lacking such support. For these languages, still in their infancy, several possibilities exist to address this critical lack of data. Among them is Transfer Learning, which allows low-resource languages to benefit from the good representation of other languages that are similar to them. In this work, we adopt a similar approach, aiming to pioneer NLP technologies for Comorian, a group of four languages or dialects belonging to the Bantu family. Our approach is initially motivated by the hypothesis that if a human can understand a different language from their native language with little or no effort, it would be entirely possible to model this process on a machine. To achieve this, we consider ways to construct Comorian datasets mixed with Swahili. One thing to note here is that in terms of Swahili data, we only focus on elements that are closest to Comorian by calculating lexical distances between candidate and source data. We empirically test this hypothesis in two use cases: Automatic Speech Recognition (ASR) and Machine Translation (MT). Our MT model achieved ROUGE-1, ROUGE-2, and ROUGE-L scores of 0.6826, 0.42, and 0.6532, respectively, while our ASR system recorded a WER of 39.50% and a CER of 13.76%.
This research is crucial for advancing NLP in underrepresented languages, with potential to preserve and promote Comorian linguistic heritage in the digital age.

Submitted 9 December, 2024; originally announced December 2024.

Comments: This paper was presented at the 6th Deep Learning Indaba Conference (DLI 2024)

arXiv:2411.06511 [pdf, other]

Time-delayed Dynamic Mode Decomposition for families of periodic trajectories in Cislunar Space

Authors: Sriram Narayanan, Mohamed Naveed Gul Mohamed, Indranil Nayak, Suman Chakravorty, Mrinal Kumar

Abstract: In recent years, the development of the Lunar Gateway and Artemis missions has renewed interest in lunar exploration, including both manned and unmanned missions. This interest necessitates accurate initial orbit determination (IOD) and orbit prediction (OP) in this domain, which faces significant challenges such as severe nonlinearity, sensitivity to initial conditions, large state-space volume, and sparse, faint, and unreliable measurements. This paper explores the capability of data-driven Koopman operator-based approximations for OP in these scenarios. Three stable periodic trajectories from distinct cislunar families are analyzed. The analysis includes theoretical justification for using a linear time-invariant system as the data-driven surrogate. This theoretical framework is supported by experimental validation. Furthermore, the accuracy is assessed by comparing the spectral content captured to period estimates derived from the fast Fourier transform (FFT) and Poincare-like sections.

Submitted 10 November, 2024; originally announced November 2024.

Comments: arXiv admin note: text overlap with arXiv:2401.13784

arXiv:2406.18125 [pdf, other]

ResumeAtlas: Revisiting Resume Classification with Large-Scale Datasets and Large Language Models

Authors: Ahmed Heakl, Youssef Mohamed, Noran Mohamed, Aly Elsharkawy, Ahmed Zaky

Abstract: The increasing reliance on online recruitment platforms coupled with the adoption of AI technologies has highlighted the critical need for efficient resume classification methods. However, challenges such as small datasets, lack of standardized resume templates, and privacy concerns hinder the accuracy and effectiveness of existing classification models. In this work, we address these challenges by presenting a comprehensive approach to resume classification. We curated a large-scale dataset of 13,389 resumes from diverse sources and employed Large Language Models (LLMs) such as BERT and Gemma1.1 2B for classification. Our results demonstrate significant improvements over traditional machine learning approaches, with our best model achieving a top-1 accuracy of 92% and a top-5 accuracy of 97.5%. These findings underscore the importance of dataset quality and advanced model architectures in enhancing the accuracy and robustness of resume classification systems, thus advancing the field of online recruitment practices.

Submitted 12 July, 2024; v1 submitted 26 June, 2024; originally announced June 2024.

Comments: 8 pages, 6 figures, 1 table, 6th International Conference on AI in Computational Linguistics

arXiv:2404.08621 [pdf, other]

Convexity in Optimal Control Problems

Authors: Abhijeet, Mohamed Naveed Gul Mohamed, Aayushman Sharma, Suman Chakravorty

Abstract: This paper investigates the central role played by the Hamiltonian in continuous-time nonlinear optimal control problems. We show that the strict convexity of the Hamiltonian in the control variable is a sufficient condition for the existence of a unique optimal trajectory, and the nonlinearity/non-convexity of the dynamics and the cost are immaterial. The analysis is extended to discrete-time problems, revealing that discretization destroys the convex Hamiltonian structure, leading to multiple spurious optima, unless the time discretization is sufficiently small. We present simulated results comparing the "indirect" Iterative Linear Quadratic Regulator (iLQR) and the "direct" Sequential Quadratic Programming (SQP) approach for solving the optimal control problem for the cartpole and pendulum models to validate the theoretical analysis. Results show that, given multiple random initial guesses and a time discretization that is insufficiently small, iLQR always converges to the "globally" optimum solution while the SQP approach gets stuck in spurious minima; both converge to the same unique solution if the discretization is sufficiently small.

Submitted 12 April, 2024; originally announced April 2024.

arXiv:2403.16979 [pdf, other]

An Optimal Solution to Infinite Horizon Nonlinear Control Problems: Part II

Authors: Mohamed Naveed Gul Mohamed, Aayushman Sharma, Raman Goyal, Suman Chakravorty

Abstract: This paper considers the infinite horizon optimal control problem for nonlinear systems. Under the condition of nonlinear controllability of the system to any terminal set containing the origin and forward invariance of the terminal set, we establish a regularized solution approach consisting of a "finite free final time" optimal transfer problem to the terminal set which renders the set globally asymptotically stable. Further, we show that the approximations converge to the optimal infinite horizon cost as the size of the terminal set decreases to zero. We also perform the analysis for the discounted problem and show that the terminal set is asymptotically stable only for a subset of the state space and not globally. The theory is empirically evaluated on various nonholonomic robotic systems to show that the cost of our approximate problem converges and the transfer time into the terminal set is dependent on the initial state of the system, necessitating the free final time formulation.

Submitted 25 March, 2024; originally announced March 2024.

arXiv:2401.13784 [pdf, other]

On the Predictive Capability of Dynamic Mode Decomposition for Nonlinear Periodic Systems with Focus on Orbital Mechanics

Authors: Sriram Narayanan, Mohamed Naveed Gul Mohamed, Indranil Nayak, Suman Chakravorty, Mrinal Kumar

Abstract: This paper discusses the predictive capability of Dynamic Mode Decomposition (DMD) in the context of orbital mechanics. The focus is specifically on the Hankel variant of DMD which uses a stacked set of time-delayed observations for system identification and subsequent prediction. A theory on the minimum number of time delays required for accurate reconstruction of periodic trajectories of nonlinear systems is presented and corroborated using experimental analysis. In addition, the window size for training and prediction regions, respectively, is presented. The need for a meticulous approach while using DMD is emphasized by drawing comparisons between its performance on two candidate satellites, the ISS and MOLNIYA-3-50.

Submitted 24 January, 2024; originally announced January 2024.

arXiv:2309.05204 [pdf, other]

Accelerated Proximal Iterative re-Weighted $\ell_1$ Alternating Minimization for Image Deblurring

Authors: Tarmizi Adam, Alexander Malyshev, Mohd Fikree Hassan, Nur Syarafina Mohamed, Md Sah Hj Salam

Abstract: The quadratic penalty alternating minimization (AM) method is widely used for solving the convex $\ell_1$ total variation (TV) image deblurring problem. However, quadratic penalty AM for solving the nonconvex nonsmooth $\ell_p$, $0 < p < 1$ TV image deblurring problems is less studied. In this paper, we propose two algorithms, namely proximal iterative re-weighted $\ell_1$ AM (PIRL1-AM) and its accelerated version, accelerated proximal iterative re-weighted $\ell_1$ AM (APIRL1-AM) for solving the nonconvex nonsmooth $\ell_p$ TV image deblurring problem. The proposed algorithms are derived from the proximal iterative re-weighted $\ell_1$ (IRL1) algorithm and the proximal gradient algorithm. Numerical results show that PIRL1-AM is effective in retaining sharp edges in image deblurring while APIRL1-AM can further provide convergence speed up in terms of the number of algorithm iterations and computational time.

Submitted 10 September, 2023; originally announced September 2023.

arXiv:2308.14693 [pdf, other]

Hybrid PLS-ML Authentication Scheme for V2I Communication Networks

Authors: Hala Amin, Jawaher Kaldari, Nora Mohamed, Waqas Aman, Saif Al-Kuwari

Abstract: Vehicular communication networks are rapidly emerging as vehicles become smarter. However, these networks are increasingly susceptible to various attacks. The situation is exacerbated by the rise in automated vehicles, emphasizing the need for security and authentication measures to ensure safe and effective traffic management. In this paper, we propose a novel hybrid physical layer security (PLS)-machine learning (ML) authentication scheme by exploiting the position of the transmitter vehicle as a device fingerprint. We use a time-of-arrival (ToA) based localization mechanism where the ToA is estimated at roadside units (RSUs), and the coordinates of the transmitter vehicle are extracted at the base station (BS). Furthermore, to track the mobility of the moving legitimate vehicle, we use an ML model trained on several system parameters. We try two ML models for this purpose, i.e., support vector regression and decision tree. To evaluate our scheme, we conduct binary hypothesis testing on the estimated positions with the help of the ground truths provided by the ML model, which classifies the transmitter node as legitimate or malicious. Moreover, we consider the probability of false alarm and the probability of missed detection as performance metrics resulting from the binary hypothesis testing, and mean absolute error (MAE), mean square error (MSE), and coefficient of determination $\text{R}^2$ to further evaluate the ML models. We also compare our scheme with a baseline scheme that exploits the angle of arrival at RSUs for authentication.
We observe that our proposed position-based mechanism outperforms the baseline scheme significantly in terms of missed detections.

Submitted 28 August, 2023; originally announced August 2023.

Comments: Accepted for Publication following Presentation at IEEE ISNCC-23

arXiv:2305.13568 [pdf, ps, other]

Anticipated BSDEs driven by fractional Brownian motion with time-delayed generator

Authors: Pei Zhang, Nur Anisah Mohamed, Adriana Irawati Nur Ibrahim

Abstract: This paper discusses a new type of anticipated backward stochastic differential equation with a time-delayed generator (DABSDEs, for short) driven by fractional Brownian motion, also known as fractional BSDEs, with Hurst parameter $H\in(1/2,1)$. It extends the results on anticipated backward stochastic differential equations to the case where the driving process is fractional Brownian motion instead of a standard Brownian motion, and in which the generator considers not only the present and future times but also the past time. By using the fixed point theorem, we will demonstrate the existence and uniqueness of the solutions to these equations. Moreover, we shall establish a comparison theorem for the solutions.

Submitted 22 May, 2023; originally announced May 2023.

arXiv:2304.00375 [pdf, other]

An Optimal Solution to Infinite Horizon Nonlinear Control Problems

Authors: Mohamed Naveed Gul Mohamed, Raman Goyal, Suman Chakravorty

Abstract: In this paper, we consider the infinite horizon optimal control problem for nonlinear systems. Under the conditions of controllability of the linearized system around the origin, and nonlinear controllability of the system to a terminal set containing the origin, we establish an approximate regularized solution approach consisting of a "finite free final time" optimal transfer problem to the terminal set, and an infinite horizon linear regulation problem within the terminal set, that is shown to render the origin globally asymptotically stable. Further, we show that the approximations converge to the true optimal cost function as the size of the terminal set decreases to zero. The approach is empirically evaluated on the pendulum and cart-pole swing-up problems to show that the finite time transfer is far shorter than the effective horizon required to solve the infinite horizon problem without the proposed regularization.

Submitted 1 April, 2023; originally announced April 2023.

arXiv:2211.10583 [pdf, other]

An Information-State Based Approach to Linear Time Varying System Identification and Control

Authors: Mohamed Naveed Gul Mohamed, Raman Goyal, Suman Chakravorty, Ran Wang

Abstract: This paper considers the problem of system identification for linear time varying systems. We propose a new system realization approach that uses an "information-state" as the state vector, where the "information-state" is composed of a finite number of past inputs and outputs. The system identification algorithm uses input-output data to fit an autoregressive moving average model (ARMA) to represent the current output in terms of finite past inputs and outputs. This information-state-based approach allows us to directly realize a state-space model using the estimated time varying ARMA parameters for linear time varying (LTV) systems. The paper develops the theoretical foundation for using ARMA parameters-based system representation using only the concept of linear observability, details the reasoning for exact output modeling using only the finite history, and shows that there is no need to separate the free and the forced response for identification. The paper also discusses the implications of using the information-state system for optimal output feedback control and shows that the solution obtained using a suitably posed information state problem is optimal for the original problem. The proposed approach is tested on various systems, and the performance is compared with state-of-the-art LTV system identification techniques.

Submitted 5 April, 2024; v1 submitted 18 November, 2022; originally announced November 2022.

arXiv:2210.17092 [pdf, other]

Confidence-Nets: A Step Towards better Prediction Intervals for regression Neural Networks on small datasets

Authors: Mohamedelmujtaba Altayeb, Abdelrahman M. Elamin, Hozaifa Ahmed, Eithar Elfatih Elfadil Ibrahim, Omer Haydar, Saba Abdulaziz, Najlaa H. M. Mohamed

Abstract: The recent decade has seen an enormous rise in the popularity of deep learning and neural networks. These algorithms have broken many previous records and achieved remarkable results. Their outstanding performance has significantly sped up the progress of AI, and so far various milestones have been achieved earlier than expected. However, in the case of relatively small datasets, the performance of Deep Neural Networks (DNN) may suffer from reduced accuracy compared to other Machine Learning models. Furthermore, it is difficult to construct prediction intervals or evaluate the uncertainty of predictions when dealing with regression tasks. In this paper, we propose an ensemble method that attempts to estimate the uncertainty of predictions, increase their accuracy and provide an interval for the expected variation. Compared with traditional DNNs that only provide a prediction, our proposed method can output a prediction interval by combining DNNs, extreme gradient boosting (XGBoost) and dissimilarity computation techniques. Despite its simple design, this approach significantly increases accuracy on small datasets and does not introduce much complexity to the architecture of the neural network. The proposed method is tested on various datasets, and a significant improvement in the performance of the neural network model is seen. The model's prediction interval can include the ground truth value at an average rate of 71% and 78% across training sizes of 90% and 55%, respectively.
Finally, we highlight other aspects and applications of the approach in experimental error estimation, and the application of transfer learning.

Submitted 31 October, 2022; originally announced October 2022.

arXiv:2208.06026 [pdf, ps, other]

Backward Stochastic Differential Equations (BSDEs) Using Infinite-dimensional Martingales with Subdifferential Operator

Authors: Pei Zhang, Adriana Irawati Nur Ibrahim, Nur Anisah Mohamed

Abstract: In this paper, we focus on a family of backward stochastic differential equations (BSDEs) with sub-differential operators that are driven by infinite-dimensional martingales which involve symmetry, that is, the process involves a positive definite nuclear operator Q. We shall show that the solution to such infinite-dimensional BSDEs exists and is unique. The existence of the solution is established using Yosida approximations, and the uniqueness is proved using the fixed point theorem. Furthermore, as an application of the main result, we shall show that the backward stochastic partial differential equation driven by infinite-dimensional martingales with a continuous linear operator has a unique solution under the condition that the function F equals zero.

Submitted 11 August, 2022; originally announced August 2022.

arXiv:2204.05683 [pdf, other]

Brittle fracture studied by ultra-high speed synchrotron X-ray diffraction imaging

Authors: Antoine Petit, Sylvia Pokam, Frederic Mazen, Samuel Tardif, Didier Landru, Oleg Kononchuk, Nadia Ben Mohamed, Margie P. Olbinado, Alexander Rack, Francois Rieutord

Abstract: Ever since the very first human-made knapped tools, the control of fracture propagation in brittle materials has been a vector of technological development. Nowadays, a broad range of applications relies on crack propagation control, from the mitigation of damages, e.g., from impacts in glass screens or windshields, to industrial processes harnessing fracture to achieve clean cuts over large distances. Yet, studying the fracture in real time is a challenging task, since cracks can propagate up to a few km/s in materials that are often opaque. Here, we report on the in situ investigation of cracks propagating at up to 2.5 km/s along a (001) plane of a silicon single crystal, using X-ray diffraction megahertz imaging with intense and time-structured synchrotron radiation. The studied system is based on the Smart Cut process, where a buried layer in a material (typically Si) is weakened by micro-cracks and then used to drive a macroscopic crack (0.1 m) in a plane parallel to the surface with minimal deviation (1 nm). The results we report here provide the first direct confirmation that the shape of the crack front is not affected by the distribution of the micro-cracks, which had been a postulate for previous studies based on post-fracture results. We further measured instantaneous crack velocities over the centimeter-wide field-of-view, which had only been previously inferred from sparse point measurements, and evidence the effect of local heating by the X-ray beam. Finally, we also observed the post-crack movements of the separated wafer parts, which can be explained using pneumatics and elasticity. Thus, this study provides a comprehensive view of controlled fracture propagation in a crystalline material, paving the way for the in situ measurement of ultra-fast strain field propagation.

Submitted 12 April, 2022; originally announced April 2022.

Comments: 4 figures

arXiv:2107.08086 [pdf, other]

An Information-state based Approach to the Optimal Output Feedback Control of Nonlinear Systems

Authors: Raman Goyal, Ran Wang, Mohamed Naveed Gul Mohamed, Aayushman Sharma, Suman Chakravorty

Abstract: This paper develops a data-based approach to the closed-loop output feedback control of nonlinear dynamical systems with a partial nonlinear observation model. We propose an information state based approach to rigorously transform the partially observed problem into a fully observed problem where the information state consists of the past several observations and control inputs. We further show the equivalence of the transformed and the initial partially observed optimal control problems and provide the conditions to solve for the deterministic optimal solution. We develop a data based generalization of the iterative Linear Quadratic Regulator (iLQR) to partially observed systems using a local linear time varying model of the information state dynamics approximated by an Autoregressive moving average (ARMA) model, that is generated using only the input-output data. This open-loop trajectory optimization solution is then used to design a local feedback control law, and the composite law then provides an optimum solution to the partially observed feedback design problem. The efficacy of the developed method is shown by controlling complex high dimensional nonlinear dynamical systems in the presence of model and sensing uncertainty.

Submitted 5 October, 2023; v1 submitted 16 July, 2021; originally announced July 2021.

arXiv:2011.10829 [pdf, other]

On the Convergence of Reinforcement Learning in Nonlinear Continuous State Space Problems

Authors: Raman Goyal, Suman Chakravorty, Ran Wang, Mohamed Naveed Gul Mohamed

Abstract: We consider the problem of Reinforcement Learning for nonlinear stochastic dynamical systems. We show that in the RL setting, there is an inherent "Curse of Variance" in addition to Bellman's infamous "Curse of Dimensionality"; in particular, we show that the variance in the solution grows factorial-exponentially in the order of the approximation. A fundamental consequence is that this precludes the search for anything other than "local" feedback solutions in RL, in order to control the explosive variance growth, and thus, ensure accuracy. We further show that the deterministic optimal control has a perturbation structure, in that the higher order terms do not affect the calculation of lower order terms, which can be utilized in RL to get accurate local solutions.

Submitted 28 July, 2021; v1 submitted 21 November, 2020; originally announced November 2020.

arXiv:2004.01041 [pdf, other]

On the Feedback Law in Stochastic Optimal Nonlinear Control

Authors: Mohamed Naveed Gul Mohamed, Suman Chakravorty, Raman Goyal, Ran Wang

Abstract: We consider the problem of nonlinear stochastic optimal control. This problem is thought to be fundamentally intractable owing to Bellman's "curse of dimensionality". We present a result that shows that repeatedly solving an open-loop deterministic problem from the current state with progressively shorter horizons, similar to Model Predictive Control (MPC), results in a feedback policy that is $O(ε^4)$ near to the true global stochastic optimal policy, where $ε$ is a perturbation parameter modulating the noise. We also show that the optimal deterministic feedback problem has a perturbation structure such that higher-order terms of the feedback law do not affect lower-order terms and that this structure is lost in the optimal stochastic feedback problem. Consequently, solving the Stochastic Dynamic Programming problem is highly susceptible to noise, even in low dimensional problems, and in practice, the MPC-type feedback law offers superior performance even for high noise levels.

Submitted 10 October, 2024; v1 submitted 1 April, 2020; originally announced April 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:2002.10505, arXiv:2002.09478

arXiv:2002.10505 [pdf, other]

Experiments with Tractable Feedback in Robotic Planning under Uncertainty: Insights over a wide range of noise regimes (Extended Report)

Authors: Mohamed Naveed Gul Mohamed, Suman Chakravorty, Dylan A. Shell

Abstract: We consider the problem of robotic planning under uncertainty. This problem may be posed as a stochastic optimal control problem, a complete solution to which is fundamentally intractable owing to the infamous curse of dimensionality. We report the results of an extensive simulation study in which we have compared two methods, both of which aim to salvage tractability by using alternative, albeit inexact, means for treating feedback. The first is a recently proposed method based on a near-optimal "decoupling principle" for tractable feedback design, wherein a nominal open-loop problem is solved, followed by a linear feedback design around the open-loop. The second is Model Predictive Control (MPC), a widely-employed method that uses repeated re-computation of the nominal open-loop problem during execution to correct for noise, though when interpreted as feedback, this can only be said to be an implicit form. We examine a much wider range of noise levels than have been previously reported and empirical evidence suggests that the decoupling method allows for tractable planning over a wide range of uncertainty conditions without unduly sacrificing performance.

Submitted 18 July, 2020; v1 submitted 20 February, 2020; originally announced February 2020.

Comments: arXiv admin note: substantial text overlap with arXiv:1909.08585, arXiv:2002.09478

arXiv:1911.02942 [pdf]

Efficient Numerical Scheme for Solving (1+1), (2+1)-dimensional and Coupled Burgers Equation

Authors: N. A. Mohamed, A. S. Rashed

Abstract: A numerical scheme based on the backward differentiation formula (BDF) and the generalized differential quadrature method (GDQM) has been developed. The proposed scheme has been employed to investigate three cases of the Burgers equation: one-dimensional, two-dimensional and two-dimensional coupled models. The results showed effective accuracy in terms of absolute error and error norms.

Submitted 4 November, 2019; originally announced November 2019.

arXiv:1909.08585 [pdf, other]

Decoupling stochastic optimal control problems for efficient solution: insights from experiments across a wide range of noise regimes

Authors: Mohamed Naveed Gul Mohamed, Suman Chakravorty, Dylan A. Shell

Abstract: We consider the problem of robotic planning under uncertainty in this paper. This problem may be posed as a stochastic optimal control problem, a solution to which is fundamentally intractable owing to the infamous "curse of dimensionality". Hence, we consider the extension of a "decoupling principle" that was recently proposed by some of the authors, wherein a nominal open-loop problem is solved followed by a linear feedback design around the open-loop, and which was shown to be near-optimal to second order in terms of a "small noise" parameter, to a much wider range of noise levels. Our empirical evidence suggests that this allows for tractable planning over a wide range of uncertainty conditions without unduly sacrificing performance.

Submitted 18 September, 2019; originally announced September 2019.

arXiv:1606.06953 [pdf]

Anisotropic Optical Properties of Layered Germanium Sulfide

Authors: Dezhi Tan, Hong En Lim, Feijiu Wang, Nur Baizura Mohamed, Shinichiro Mouri, Koirala Sandhaya, Wenjing Zhang, Yuhei Miyauchi, Mari Ohfuchi, Kazunari Matsuda

Abstract: Two-dimensional (2D) layered materials, transition metal dichalcogenides and black phosphorus, have attracted much interest from the viewpoints of fundamental physics and device applications. The establishment of new functionalities in anisotropic layered 2D materials is a challenging but rewarding frontier, owing to their remarkable optical properties and prospects for new devices. Here, we report the anisotropic optical properties of layered 2D monochalcogenide of germanium sulfide (GeS). Three Raman scattering peaks corresponding to the B3g, A1g, and A2g modes with strong polarization dependence are demonstrated in the GeS flakes, which validates polarized Raman spectroscopy as an effective method for identifying the crystal orientation of anisotropic layered GeS. Photoluminescence (PL) is observed with a peak at around 1.66 eV that originates from the direct optical transition in GeS at room temperature. Moreover, determination of the polarization dependent characteristics of the PL and absorption reveals an anisotropic optical transition near the band edge of GeS, which is also supported by the density functional theory calculations. This anisotropic layered GeS presents the opportunities for the discovery of new physical phenomena and will find applications that exploit its anisotropic properties.

Submitted 20 June, 2016; originally announced June 2016.

arXiv:1509.01074 [pdf]

A Novice Guide towards Human Motion Analysis and Understanding

Authors: Ahmed Nabil Mohamed

Abstract: Human motion analysis and understanding has been, and is still, the focus of attention of many disciplines, which is considered an obvious indicator of the wide and massive importance of the subject. The purpose of this article is to shed some light on this very important subject, so it can be a good insight for a novice computer vision researcher in this field by providing him/her with a wealth of knowledge about the subject covering many directions. There are two main contributions of this article. The first one investigates various aspects of some disciplines (e.g., arts, philosophy, psychology, and neuroscience) that are interested in the subject and reviews some of their contributions, stressing those that can be useful for computer vision researchers. Moreover, many examples are illustrated to indicate the benefits of integrating concepts and results among different disciplines. The second contribution is concerned with the subject from the computer vision aspect where we discuss the following issues. First, we explore many demanding and promising applications to reveal the wide and massive importance of the field. Second, we list various types of sensors that may be used for acquiring various data. Third, we review different taxonomies used for classifying motions. Fourth, we review various processes involved in motion analysis. Fifth, we exhibit how different surveys are structured. Sixth, we examine many of the most cited and recent reviews in the field that have been published during the past two decades to reveal various approaches used for implementing different stages of the problem and refer to various algorithms and their suitability for different situations. Moreover, we provide a long list of public datasets and discuss briefly some examples of these datasets. Finally, we provide a general discussion of the subject from the aspect of computer vision.

Submitted 3 September, 2015; originally announced September 2015.

Comments: 35 Pages

arXiv:1407.0699 [pdf]

Enumeration of Spanning Trees Using Edge Exchange with Minimal Partitioning

Authors: Nasr Mohamed

Abstract: In this thesis, the Minimal Partitioning (MP) algorithm, an innovative algorithm for enumerating all the spanning trees in an undirected graph, is presented. While the MP algorithm uses a computational tree graph to traverse all possible spanning trees by the edge exchange technique, it has two unique properties compared to previous algorithms. In the first place, the algorithm maintains a state of minimal partition size in the spanning tree due to edge deletion. This is realized by swapping peripheral edges, more precisely leaf edges, in most edge exchange operations. Consequently, the main structure of the spanning trees is preserved during the steps of the enumeration process. This extra constraint proves to be advantageous in many applications where the partition size is a factor in the solution cost. Secondly, we introduce, and utilize, the new concept of edge promotion: the exchanged edges always share one end. Practically, and as a result of this property, the interface between the two partitions of the spanning tree during edge exchange has to be maintained from one side only. For a graph $G(V,E)$, the MP algorithm requires $O(\log V + E/V)$ expected time and $O(V \log V)$ worst case time for generating each spanning tree. The MP algorithm requires a total expected space limit of $O(E \log V)$ with a worst case limit of $O(EV)$. Like all edge exchange algorithms, the MP algorithm retains the advantage of compacted output of $O(1)$ per spanning tree by listing the relative differences only. Three sample real-world applications of spanning tree enumeration are explored and the effects of using the MP algorithm are studied, namely: construction of nets of polyhedra, multi-robot spanning tree routing, and computing the electric current in edges of a network. We report that the MP algorithm outperforms other algorithms by $O(V)$ in time complexity.

Submitted 2 July, 2014; originally announced July 2014.

Comments: Master Thesis

ACM Class: G.2.2; I.3.5

arXiv:0908.1057 [pdf]

Transmission Performance Analysis of Digital Wire and Wireless Optical Links in Local and Wide Areas Optical Networks

Authors: Abd El Naser A. Mohamed, Mohamed M. E. El Halawany, Ahmed Nabih Zaki Rashed, Amina E. M. El Nabawy

Abstract: In the present paper, the transmission performance of digital wire and wireless optical links in local and wide area optical networks has been modeled and parametrically investigated over a wide range of the affecting parameters. Moreover, we have analyzed the basic equations of the comparative study of the performance of digital fiber optic links with wire and wireless optical links. The development of optical wireless communication systems is accelerating as a highly cost-effective alternative to wired fiber optic links. The optical wireless technology is used mostly in wide bandwidth data transmission applications. Finally, we have investigated the maximum transmission distance and data transmission bit rates that can be achieved within digital wire and wireless optical links for local and wide area optical network applications.

Submitted 7 August, 2009; originally announced August 2009.

Comments: 10 pages IEEE Format, International Journal of Computer Science and Information Security, IJCSIS 2009, ISSN 1947 5500, Impact Factor 0.423

Report number: ISSN 1947 5500

Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 3, No.1, July 2009, USA

arXiv:0908.1056 [pdf]

Recent Applications of Optical Parametric Amplifiers in Hybrid WDM TDM Local Area Optical Networks

Authors: Abd El Naser A. Mohamed, Mohamed M. E. El-Halawany, Ahmed Nabih Zaki Rashed, Mahmoud M. A. Eid

Abstract: In the present paper, the recent applications of optical parametric amplifiers (OPAs) in hybrid wavelength division multiplexing (WDM)/time division multiplexing (TDM) local area passive optical networks have been modeled and parametrically investigated over a wide range of the affecting parameters. Moreover, we have analyzed the ability of the hybrid WDM/TDM passive optical networks to handle a triple play solution, offering voice, video, and data services to multiple users. Finally, we have investigated the maximum time division multiplexing (MTDM) bit rates for optical network units (ONUs) for the maximum number of supported users with the optical parametric amplifier technique across single mode fiber (SMF) or highly nonlinear fiber (HNLF) cables to achieve both maximum network reach and quality of service (QOS).

Submitted 7 August, 2009; originally announced August 2009.

Comments: 10 pages IEEE Format, International Journal of Computer Science and Information Security, IJCSIS 2009, ISSN 1947 5500, Impact Factor 0.423

Report number: ISSN 1947 5500

Journal ref: International Journal of Computer Science and Information Security, IJCSIS, Vol. 3, No.1, July 2009, USA

Showing 1–29 of 29 results for author: Mohamed, N