-
Least-Squares-Embedded Optimization for Accelerated Convergence of PINNs in Acoustic Wavefield Simulations
Authors:
Mohammad Mahdi Abedi,
David Pardo,
Tariq Alkhalifah
Abstract:
Physics-Informed Neural Networks (PINNs) have shown promise in solving partial differential equations (PDEs), including the frequency-domain Helmholtz equation. However, standard training of PINNs using gradient descent (GD) suffers from slow convergence and instability, particularly for high-frequency wavefields. For scattered acoustic wavefield simulation based on Helmholtz equation, we derive a…
▽ More
Physics-Informed Neural Networks (PINNs) have shown promise in solving partial differential equations (PDEs), including the frequency-domain Helmholtz equation. However, standard training of PINNs using gradient descent (GD) suffers from slow convergence and instability, particularly for high-frequency wavefields. For scattered acoustic wavefield simulation based on Helmholtz equation, we derive a hybrid optimization framework that accelerates training convergence by embedding a least-squares (LS) solver directly into the GD loss function. This formulation enables optimal updates for the linear output layer. Our method is applicable with or without perfectly matched layers (PML), and we provide practical tensor-based implementations for both scenarios. Numerical experiments on benchmark velocity models demonstrate that our approach achieves faster convergence, higher accuracy, and improved stability compared to conventional PINN training. In particular, our results show that the LS-enhanced method converges rapidly even in cases where standard GD-based training fails. The LS solver operates on a small normal matrix, ensuring minimal computational overhead and making the method scalable for large-scale wavefield simulations.
△ Less
Submitted 23 April, 2025;
originally announced April 2025.
-
Gabor-Enhanced Physics-Informed Neural Networks for Fast Simulations of Acoustic Wavefields
Authors:
Mohammad Mahdi Abedi,
David Pardo,
Tariq Alkhalifah
Abstract:
Physics-Informed Neural Networks (PINNs) have gained increasing attention for solving partial differential equations, including the Helmholtz equation, due to their flexibility and mesh-free formulation. However, their low-frequency bias limits their accuracy and convergence speed for high-frequency wavefield simulations. To alleviate these problems, we propose a simplified PINN framework that inc…
▽ More
Physics-Informed Neural Networks (PINNs) have gained increasing attention for solving partial differential equations, including the Helmholtz equation, due to their flexibility and mesh-free formulation. However, their low-frequency bias limits their accuracy and convergence speed for high-frequency wavefield simulations. To alleviate these problems, we propose a simplified PINN framework that incorporates Gabor functions, designed to capture the oscillatory and localized nature of wavefields more effectively. Unlike previous attempts that rely on auxiliary networks to learn Gabor parameters, we redefine the network's task to map input coordinates to a custom Gabor coordinate system, simplifying the training process without increasing the number of trainable parameters compared to a simple PINN. We validate the proposed method across multiple velocity models, including the complex Marmousi and Overthrust models, and demonstrate its superior accuracy, faster convergence, and better robustness features compared to both traditional PINNs and earlier Gabor-based PINNs. Additionally, we propose an efficient integration of a Perfectly Matched Layer (PML) to enhance wavefield behavior near the boundaries. These results suggest that our approach offers an efficient and accurate alternative for scattered wavefield modeling and lays the groundwork for future improvements in PINN-based seismic applications.
△ Less
Submitted 28 February, 2025; v1 submitted 24 February, 2025;
originally announced February 2025.
-
Radio Resource Management and Path Planning in Intelligent Transportation Systems via Reinforcement Learning for Environmental Sustainability
Authors:
S. Norouzi,
N. Azarasa,
M. R. Abedi,
N. Mokari,
S. E. Seyedabrishami,
H. Saeedi,
E. A. Jorswieck
Abstract:
Efficient and dynamic path planning has become an important topic for urban areas with larger density of connected vehicles (CV) which results in reduction of travel time and directly contributes to environmental sustainability through reducing energy consumption. CVs exploit the cellular wireless vehicle-to-everything (C-V2X) communication technology to disseminate the vehicle-to-infrastructure (…
▽ More
Efficient and dynamic path planning has become an important topic for urban areas with larger density of connected vehicles (CV) which results in reduction of travel time and directly contributes to environmental sustainability through reducing energy consumption. CVs exploit the cellular wireless vehicle-to-everything (C-V2X) communication technology to disseminate the vehicle-to-infrastructure (V2I) messages to the Base-station (BS) to improve situation awareness on urban roads. In this paper, we investigate radio resource management (RRM) in such a framework to minimize the age of information (AoI) so as to enhance path planning results. We use the fact that V2I messages with lower AoI value result in less error in estimating the road capacity and more accurate path planning. Through simulations, we compare road travel times and volume over capacity (V/C) against different levels of AoI and demonstrate the promising performance of the proposed framework.
△ Less
Submitted 13 May, 2024;
originally announced May 2024.
-
Beyond Traditional Teaching: The Potential of Large Language Models and Chatbots in Graduate Engineering Education
Authors:
Mahyar Abedi,
Ibrahem Alshybani,
Muhammad Rubayat Bin Shahadat,
Michael S. Murillo
Abstract:
In the rapidly evolving landscape of education, digital technologies have repeatedly disrupted traditional pedagogical methods. This paper explores the latest of these disruptions: the potential integration of large language models (LLMs) and chatbots into graduate engineering education. We begin by tracing historical and technological disruptions to provide context and then introduce key terms su…
▽ More
In the rapidly evolving landscape of education, digital technologies have repeatedly disrupted traditional pedagogical methods. This paper explores the latest of these disruptions: the potential integration of large language models (LLMs) and chatbots into graduate engineering education. We begin by tracing historical and technological disruptions to provide context and then introduce key terms such as machine learning and deep learning and the underlying mechanisms of recent advancements, namely attention/transformer models and graphics processing units. The heart of our investigation lies in the application of an LLM-based chatbot in a graduate fluid mechanics course. We developed a question bank from the course material and assessed the chatbot's ability to provide accurate, insightful responses. The results are encouraging, demonstrating not only the bot's ability to effectively answer complex questions but also the potential advantages of chatbot usage in the classroom, such as the promotion of self-paced learning, the provision of instantaneous feedback, and the reduction of instructors' workload. The study also examines the transformative effect of intelligent prompting on enhancing the chatbot's performance. Furthermore, we demonstrate how powerful plugins like Wolfram Alpha for mathematical problem-solving and code interpretation can significantly extend the chatbot's capabilities, transforming it into a comprehensive educational tool. While acknowledging the challenges and ethical implications surrounding the use of such AI models in education, we advocate for a balanced approach. The use of LLMs and chatbots in graduate education can be greatly beneficial but requires ongoing evaluation and adaptation to ensure ethical and efficient use.
△ Less
Submitted 19 December, 2023; v1 submitted 9 September, 2023;
originally announced September 2023.
-
Improving the Performance of DNN-based Software Services using Automated Layer Caching
Authors:
Mohammadamin Abedi,
Yanni Iouannou,
Pooyan Jamshidi,
Hadi Hemmati
Abstract:
Deep Neural Networks (DNNs) have become an essential component in many application domains including web-based services. A variety of these services require high throughput and (close to) real-time features, for instance, to respond or react to users' requests or to process a stream of incoming data on time. However, the trend in DNN design is toward larger models with many layers and parameters t…
▽ More
Deep Neural Networks (DNNs) have become an essential component in many application domains including web-based services. A variety of these services require high throughput and (close to) real-time features, for instance, to respond or react to users' requests or to process a stream of incoming data on time. However, the trend in DNN design is toward larger models with many layers and parameters to achieve more accurate results. Although these models are often pre-trained, the computational complexity in such large models can still be relatively significant, hindering low inference latency. Implementing a caching mechanism is a typical systems engineering solution for speeding up a service response time. However, traditional caching is often not suitable for DNN-based services. In this paper, we propose an end-to-end automated solution to improve the performance of DNN-based services in terms of their computational complexity and inference latency. Our caching method adopts the ideas of self-distillation of DNN models and early exits. The proposed solution is an automated online layer caching mechanism that allows early exiting of a large model during inference time if the cache model in one of the early exits is confident enough for final prediction. One of the main contributions of this paper is that we have implemented the idea as an online caching, meaning that the cache models do not need access to training data and perform solely based on the incoming data at run-time, making it suitable for applications using pre-trained models. Our experiments results on two downstream tasks (face and object classification) show that, on average, caching can reduce the computational complexity of those services up to 58\% (in terms of FLOPs count) and improve their inference latency up to 46\% with low to zero reduction in accuracy.
△ Less
Submitted 18 September, 2022;
originally announced September 2022.
-
SWIPT-Enabled Multiple Access Channel: Effects of Decoding Cost and Non-linear EH Model
Authors:
Pouria Nezhadmohammad,
Mohsen Abedi,
Mohammad Javad Emadi,
Risto Wichman
Abstract:
We studied power splitting-based simultaneous wireless information and power transfer (PS-SWIPT) in multiple access channels (MAC), considering the decoding cost and non-linear energy harvesting (EH) constraints at the receiving nodes to study practical limitations of an EH communication system. Under these restrictions, we formulated and analyzed the achievable rate and maximum departure regions…
▽ More
We studied power splitting-based simultaneous wireless information and power transfer (PS-SWIPT) in multiple access channels (MAC), considering the decoding cost and non-linear energy harvesting (EH) constraints at the receiving nodes to study practical limitations of an EH communication system. Under these restrictions, we formulated and analyzed the achievable rate and maximum departure regions in two well-studied scenarios, i.e., a classical PS-SWIPT MAC and a PS-SWIPT MAC with user cooperation. In the classical PS-SWIPT MAC setting, closed-form expressions for the optimal values of the PS factors are derived for two fundamental decoding schemes: simultaneous decoding and successive interference cancellation. In the PS-SWIPT MAC with user cooperation, the joint optimal power allocation for users as well as the optimal PS factor are derived. This reveals that one decoding scheme outperforms the other in the classical PS-SWIPT MAC, depending on the function type of the decoding cost. Finally, it is shown that the cooperation between users can potentially boost the performance of a PS-SWIPT MAC under decoding cost and non-linear EH constraints. Moreover, effects of the decoding cost functions, non-linear EH model and channel quality between the users are studied, and performance characteristics of the system are discussed.
△ Less
Submitted 11 November, 2021;
originally announced November 2021.
-
AI-Based and Mobility-Aware Energy Efficient Resource Allocation and Trajectory Design for NFV Enabled Aerial Networks
Authors:
Mohsen Pourghasemian,
Mohammad Reza Abedi,
Shima Salarhosseini,
Nader Mokari,
Mohammad Reza Javan,
Eduard A. Jorswieck
Abstract:
In this paper, we propose a novel joint intelligent trajectory design and resource allocation algorithm based on user's mobility and their requested services for unmanned aerial vehicles (UAVs) assisted networks, where UAVs act as nodes of a network function virtualization (NFV) enabled network. Our objective is to maximize energy efficiency and minimize the average delay on all services by alloca…
▽ More
In this paper, we propose a novel joint intelligent trajectory design and resource allocation algorithm based on user's mobility and their requested services for unmanned aerial vehicles (UAVs) assisted networks, where UAVs act as nodes of a network function virtualization (NFV) enabled network. Our objective is to maximize energy efficiency and minimize the average delay on all services by allocating the limited radio and NFV resources. In addition, due to the traffic conditions and mobility of users, we let some Virtual Network Functions (VNFs) to migrate from their current locations to other locations to satisfy the Quality of Service requirements. We formulate our problem to find near-optimal locations of UAVs, transmit power, subcarrier assignment, placement, and scheduling the requested service's functions over the UAVs and perform suitable VNF migration. Then we propose a novel Hierarchical Hybrid Continuous and Discrete Action (HHCDA) deep reinforcement learning method to solve our problem. Finally, the convergence and computational complexity of the proposed algorithm and its performance analyzed for different parameters. Simulation results show that our proposed HHCDA method decreases the request reject rate and average delay by 31.5% and 20% and increases the energy efficiency by 40% compared to DDPG method.
△ Less
Submitted 21 May, 2021;
originally announced May 2021.
-
Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning
Authors:
Mohammad Akbari,
Mohammad Reza Abedi,
Roghayeh Joda,
Mohsen Pourghasemian,
Nader Mokari,
Melike Erol-Kantarci
Abstract:
In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and schedul…
▽ More
In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.
△ Less
Submitted 10 May, 2021;
originally announced May 2021.
-
Dynamic Frame Structure for Next Generation Wireless Networks
Authors:
Mohammad R. Abedi,
Mohammad R. Javan,
Nader Mokari,
Eduard . A. Jorswieck
Abstract:
In this paper, we devise a novel radio resource block (RB) structure named dynamic resource block structure (D-RBS) which can handle low latency traffics and large fluctuations in data rates by exploiting smart time and frequency duplexing. In our framework, the main resource block with a predefined bandwidth and time duration is divided into several small blocks with the same bandwidth and time d…
▽ More
In this paper, we devise a novel radio resource block (RB) structure named dynamic resource block structure (D-RBS) which can handle low latency traffics and large fluctuations in data rates by exploiting smart time and frequency duplexing. In our framework, the main resource block with a predefined bandwidth and time duration is divided into several small blocks with the same bandwidth and time duration. Depending on the service requirements, e.g., data rate and latency, the users are assigned to some these small blocks which could be noncontiguous both in frequency and time. This is in contrast to the previously introduced static resource block structure (S-RBS) where the size of each RB is predetermined and fixed. We provide resource allocation frameworks for this RB structure and formulate the optimization problems whose solutions are obtained by alternate search method (ASM) based on successive convex approximation approach (SCA). We provide a global optimal solution by exploiting the monotonic optimization method. By simulation we study the performance of our proposed scheme with S-RBS scheme and show it has 26% gain compared to the S-RBS scheme.
△ Less
Submitted 2 February, 2020;
originally announced February 2020.
-
Single or Multiple Frames Content Delivery for Next-Generation Networks?
Authors:
Mohammad R. Abedi,
Nader Mokari,
Mohammad R. Javan,
Eduard. A. Jorswieck
Abstract:
This paper addresses the four enabling technologies, namely multi-user sparse code multiple access (SCMA), content caching, energy harvesting, and physical layer security for proposing an energy and spectral efficient resource allocation algorithm for the access and backhaul links in heterogeneous cellular networks. Although each of the above mentioned issues could be a topic of research, in a rea…
▽ More
This paper addresses the four enabling technologies, namely multi-user sparse code multiple access (SCMA), content caching, energy harvesting, and physical layer security for proposing an energy and spectral efficient resource allocation algorithm for the access and backhaul links in heterogeneous cellular networks. Although each of the above mentioned issues could be a topic of research, in a real situation, we would face a complicated scenario where they should be considered jointly, and hence, our target is to consider these technologies jointly in a unified framework. Moreover, we propose two novel content delivery scenarios: 1) single frame content delivery (SFCD), and 2) multiple frames content delivery (MFCD), where the time duration of serving user requests is divided into several frames. In the first scenario, the requested content by each user is served over one frame. However, in the second scenario, the requested content by each user can be delivered over several frames. We formulate the resource allocation for the proposed scenarios as optimization problems where our main aim is to maximize the energy efficiency of access links subject to the transmit power and rate constraints of access and backhaul links, caching and energy harvesting constraints, and SCMA codebook allocation limitations. Due to the practical limitations, we assume that the channel state information values between eavesdroppers and base stations are uncertain and design the network for the worst case scenario. Since the corresponding optimization problems are mixed integer non-linear and nonconvex programming, NP-hard, and intractable, we propose an iterative algorithm based on the well-known alternate and successive convex approximation methods.
△ Less
Submitted 14 August, 2018; v1 submitted 19 February, 2018;
originally announced February 2018.
-
Optimal Energy Management for Energy Harvesting Transmitter and Receiver with Helper
Authors:
Mohsen Abedi,
Mohammad Javad Emadi,
Behzad Shahrasbi
Abstract:
We study energy harvesting (EH) transmitter and receiver, where the receiver decodes data using the harvested energy from the nature and from an independent EH node, named helper. Helper cooperates with the receiver by transferring its harvested energy to the receiver over an orthogonal fading channel. We study an offline optimal power management policy to maximize the reliable information rate. T…
▽ More
We study energy harvesting (EH) transmitter and receiver, where the receiver decodes data using the harvested energy from the nature and from an independent EH node, named helper. Helper cooperates with the receiver by transferring its harvested energy to the receiver over an orthogonal fading channel. We study an offline optimal power management policy to maximize the reliable information rate. The harvested energy in all three nodes are assumed to be known. We consider four different scenarios; First, for the case that both transmitter and the receiver have batteries, we show that the optimal policy is transferring the helper harvested energy to the receiver, immediately. Next, for the case of non-battery receiver and full power transmitter, we model a virtual EH receiver with minimum energy constraint to achieve an optimal policy. Then, we consider a non-battery EH receiver and EH transmitter with battery. Finally, we derive optimal power management wherein neither the transmitter nor the receiver have batteries. We propose three iterative algorithms to compute optimal energy management policies. Numerical results are presented to corroborate the advantage of employing the helper.
△ Less
Submitted 17 April, 2016;
originally announced April 2016.
-
Automatic ECG Beat Arrhythmia Detection
Authors:
M. Bazarghan,
Y. Jaberi,
R. Amandi,
M. Abedi
Abstract:
Background: In recent years automated data analysis techniques have drawn great attention and are used in almost every field of research including biomedical. Artificial Neural Networks (ANNs) are one of the Computer- Aided- Diagnosis tools which are used extensively by advances in computer hardware technology. The application of these techniques for disease diagnosis has made great progress and i…
▽ More
Background: In recent years automated data analysis techniques have drawn great attention and are used in almost every field of research including biomedical. Artificial Neural Networks (ANNs) are one of the Computer- Aided- Diagnosis tools which are used extensively by advances in computer hardware technology. The application of these techniques for disease diagnosis has made great progress and is widely used by physicians. An Electrocardiogram carries vital information about heart activity and physicians use this signal for cardiac disease diagnosis which was the great motivation towards our study. Methods: In this study we are using Probabilistic Neural Networks (PNN) as an automatic technique for ECG signal analysis along with a Genetic Algorithm (GA). As every real signal recorded by the equipment can have different artifacts, we need to do some preprocessing steps before feeding it to the ANN. Wavelet transform is used for extracting the morphological parameters and median filter for data reduction of the ECG signal. The subset of morphological parameters are chosen and optimized using GA. We had two approaches in our investigation, the first one uses the whole signal with 289 normalized and de-noised data points as input to the ANN. In the second approach after applying all the preprocessing steps the signal is reduced to 29 data points and also their important parameters extracted to form the ANN input with 35 data points. Results: The outcome of the two approaches for 8 types of arrhythmia shows that the second approach is superior than the first one with an average accuracy of %99.42.
△ Less
Submitted 7 November, 2012; v1 submitted 2 September, 2012;
originally announced September 2012.