-
Microstructural Studies Using Generative Adversarial Network (GAN): a Case Study
Authors:
Owais Ahmad,
Vishal Panwar,
Kaushik Das,
Rajdip Mukherjee,
Somnath Bhowmick
Abstract:
The generative adversarial network (GAN) is one of the most widely used deep generative models for synthesizing high-quality images with the same statistics as the training set. Finite element method (FEM) based property prediction often relies on synthetically generated microstructures. The phase-field model is a computational method of generating realistic microstructures considering the underly…
▽ More
The generative adversarial network (GAN) is one of the most widely used deep generative models for synthesizing high-quality images with the same statistics as the training set. Finite element method (FEM) based property prediction often relies on synthetically generated microstructures. The phase-field model is a computational method of generating realistic microstructures considering the underlying thermodynamics and kinetics of the material. Due to the expensive nature of the simulations, it is not always feasible to use phase-field for synthetic microstructure generation. In this work, we train a GAN with microstructures generated from the phase-field simulations. Mechanical properties calculated using the finite element method on synthetic and actual phase field microstructures show excellent agreement. Since the GAN model generates thousands of images within seconds, it has the potential to improve the quality of synthetic microstructures needed for FEM calculations or any other applications requiring a large number of realistic synthetic images at minimal computational cost.
△ Less
Submitted 6 June, 2025;
originally announced June 2025.
-
ProMi: An Efficient Prototype-Mixture Baseline for Few-Shot Segmentation with Bounding-Box Annotations
Authors:
Florent Chiaroni,
Ali Ayub,
Ola Ahmad
Abstract:
In robotics applications, few-shot segmentation is crucial because it allows robots to perform complex tasks with minimal training data, facilitating their adaptation to diverse, real-world environments. However, pixel-level annotations of even small amount of images is highly time-consuming and costly. In this paper, we present a novel few-shot binary segmentation method based on bounding-box ann…
▽ More
In robotics applications, few-shot segmentation is crucial because it allows robots to perform complex tasks with minimal training data, facilitating their adaptation to diverse, real-world environments. However, pixel-level annotations of even small amount of images is highly time-consuming and costly. In this paper, we present a novel few-shot binary segmentation method based on bounding-box annotations instead of pixel-level labels. We introduce, ProMi, an efficient prototype-mixture-based method that treats the background class as a mixture of distributions. Our approach is simple, training-free, and effective, accommodating coarse annotations with ease. Compared to existing baselines, ProMi achieves the best results across different datasets with significant gains, demonstrating its effectiveness. Furthermore, we present qualitative experiments tailored to real-world mobile robot tasks, demonstrating the applicability of our approach in such scenarios. Our code: https://github.com/ThalesGroup/promi.
△ Less
Submitted 18 May, 2025;
originally announced May 2025.
-
Robust and Noise-resilient Long-Term Prediction of Spatiotemporal Data Using Variational Mode Graph Neural Networks with 3D Attention
Authors:
Osama Ahmad,
Zubair Khalid
Abstract:
This paper focuses on improving the robustness of spatiotemporal long-term prediction using a variational mode graph convolutional network (VMGCN) by introducing 3D channel attention. The deep learning network for this task relies on historical data inputs, yet real-time data can be corrupted by sensor noise, altering its distribution. We model this noise as independent and identically distributed…
▽ More
This paper focuses on improving the robustness of spatiotemporal long-term prediction using a variational mode graph convolutional network (VMGCN) by introducing 3D channel attention. The deep learning network for this task relies on historical data inputs, yet real-time data can be corrupted by sensor noise, altering its distribution. We model this noise as independent and identically distributed (i.i.d.) Gaussian noise and incorporate it into the LargeST traffic volume dataset, resulting in data with both inherent and additive noise components. Our approach involves decomposing the corrupted signal into modes using variational mode decomposition, followed by feeding the data into a learning pipeline for prediction. We integrate a 3D attention mechanism encompassing spatial, temporal, and channel attention. The spatial and temporal attention modules learn their respective correlations, while the channel attention mechanism is used to suppress noise and highlight the significant modes in the spatiotemporal signals. Additionally, a learnable soft thresholding method is implemented to exclude unimportant modes from the feature vector, and a feature reduction method based on the signal-to-noise ratio (SNR) is applied. We compare the performance of our approach against baseline models, demonstrating that our method achieves superior long-term prediction accuracy, robustness to noise, and improved performance with mode truncation compared to the baseline models. The code of the paper is available at https://github.com/OsamaAhmad369/VMGCN.
△ Less
Submitted 9 April, 2025;
originally announced April 2025.
-
Continuous Boostlet Transform and Associated Uncertainty Principles
Authors:
Owais Ahmad,
Jasifa Fayaz
Abstract:
The Continuous Boostlet Transform (CBT) is introduced as a powerful tool for analyzing spatiotemporal signals, particularly acoustic wavefields. Overcoming the limitations of classical wavelets, the CBT leverages the Poincaré group and isotropic dilations to capture sparse features of natural acoustic fields. This paper presents the mathematical framework of the CBT, including its definition, fund…
▽ More
The Continuous Boostlet Transform (CBT) is introduced as a powerful tool for analyzing spatiotemporal signals, particularly acoustic wavefields. Overcoming the limitations of classical wavelets, the CBT leverages the Poincaré group and isotropic dilations to capture sparse features of natural acoustic fields. This paper presents the mathematical framework of the CBT, including its definition, fundamental properties, and associated uncertainty principles, such as Heisenberg's, logarithmic, Pitt's, and Nazarov's inequalities. These results illuminate the trade-offs between time and frequency localization in the boostlet domain. Practical examples with constant and exponential functions highlight the CBT's adaptability. With applications in radar, communications, audio processing, and seismic analysis, the CBT offers flexible time-frequency resolution, making it ideal for non-stationary and transient signals, and a valuable tool for modern signal processing.
△ Less
Submitted 21 March, 2025;
originally announced April 2025.
-
Deep Learning Assisted Denoising of Experimental Micrographs
Authors:
Owais Ahmad,
Albert Linda,
Saumya Ranjan Jha,
Somnath Bhowmick
Abstract:
Microstructure imaging is crucial in materials science, but experimental images often introduce noise that obscures critical structural details. This study presents a novel deep learning approach for robust microstructure image denoising, combining phase-field simulations, Fourier transform techniques, and an attention-based neural network. The innovative framework addresses dataset limitations by…
▽ More
Microstructure imaging is crucial in materials science, but experimental images often introduce noise that obscures critical structural details. This study presents a novel deep learning approach for robust microstructure image denoising, combining phase-field simulations, Fourier transform techniques, and an attention-based neural network. The innovative framework addresses dataset limitations by synthetically generating training data by combining computational phase-field microstructures with experimental optical micrographs. The neural network architecture features an attention mechanism that dynamically focuses on important microstructural features while systematically eliminating noise types like scratches and surface imperfections. Testing on a FeMnNi alloy system demonstrated the model's exceptional performance across multiple magnifications. By successfully removing diverse noise patterns while maintaining grain boundary integrity, the research provides a generalizable deep-learning framework for microstructure image enhancement with broad applicability in materials science.
△ Less
Submitted 9 June, 2025; v1 submitted 23 March, 2025;
originally announced March 2025.
-
ECG-EmotionNet: Nested Mixture of Expert (NMoE) Adaptation of ECG-Foundation Model for Driver Emotion Recognition
Authors:
Nastaran Mansourian,
Arash Mohammadi,
M. Omair Ahmad,
M. N. S. Swamy
Abstract:
Driver emotion recognition plays a crucial role in driver monitoring systems, enhancing human-autonomy interactions and the trustworthiness of Autonomous Driving (AD). Various physiological and behavioural modalities have been explored for this purpose, with Electrocardiogram (ECG) emerging as a standout choice for real-time emotion monitoring, particularly in dynamic and unpredictable driving con…
▽ More
Driver emotion recognition plays a crucial role in driver monitoring systems, enhancing human-autonomy interactions and the trustworthiness of Autonomous Driving (AD). Various physiological and behavioural modalities have been explored for this purpose, with Electrocardiogram (ECG) emerging as a standout choice for real-time emotion monitoring, particularly in dynamic and unpredictable driving conditions. Existing methods, however, often rely on multi-channel ECG signals recorded under static conditions, limiting their applicability in real-world dynamic driving scenarios. To address this limitation, the paper introduces ECG-EmotionNet, a novel architecture designed specifically for emotion recognition in dynamic driving environments. ECG-EmotionNet is constructed by adapting a recently introduced ECG Foundation Model (FM) and uniquely employs single-channel ECG signals, ensuring both robust generalizability and computational efficiency. Unlike conventional adaptation methods such as full fine-tuning, linear probing, or low-rank adaptation, we propose an intuitively pleasing alternative, referred to as the nested Mixture of Experts (MoE) adaptation. More precisely, each transformer layer of the underlying FM is treated as a separate expert, with embeddings extracted from these experts fused using trainable weights within a gating mechanism. This approach enhances the representation of both global and local ECG features, leading to a 6% improvement in accuracy and a 7% increase in the F1 score, all while maintaining computational efficiency. The effectiveness of the proposed ECG-EmotionNet architecture is evaluated using a recently introduced and challenging driver emotion monitoring dataset.
△ Less
Submitted 3 March, 2025;
originally announced March 2025.
-
Learning Backbones: Sparsifying Graphs through Zero Forcing for Effective Graph-Based Learning
Authors:
Obaid Ullah Ahmad,
Anwar Said,
Mudassir Shabbir,
Xenofon Koutsoukos,
Waseem Abbas
Abstract:
This paper introduces a novel framework for graph sparsification that preserves the essential learning attributes of original graphs, improving computational efficiency and reducing complexity in learning algorithms. We refer to these sparse graphs as "learning backbones". Our approach leverages the zero-forcing (ZF) phenomenon, a dynamic process on graphs with applications in network control. The…
▽ More
This paper introduces a novel framework for graph sparsification that preserves the essential learning attributes of original graphs, improving computational efficiency and reducing complexity in learning algorithms. We refer to these sparse graphs as "learning backbones". Our approach leverages the zero-forcing (ZF) phenomenon, a dynamic process on graphs with applications in network control. The key idea is to generate a tree from the original graph that retains critical dynamical properties. By correlating these properties with learning attributes, we construct effective learning backbones. We evaluate the performance of our ZF-based backbones in graph classification tasks across eight datasets and six baseline models. The results demonstrate that our method outperforms existing techniques. Additionally, we explore extensions using node distance metrics to further enhance the framework's utility.
△ Less
Submitted 24 February, 2025;
originally announced February 2025.
-
Spatiotemporal Air Quality Mapping in Urban Areas Using Sparse Sensor Data, Satellite Imagery, Meteorological Factors, and Spatial Features
Authors:
Osama Ahmad,
Zubair Khalid,
Muhammad Tahir,
Momin Uppal
Abstract:
Monitoring air pollution is crucial for protecting human health from exposure to harmful substances. Traditional methods of air quality monitoring, such as ground-based sensors and satellite-based remote sensing, face limitations due to high deployment costs, sparse sensor coverage, and environmental interferences. To address these challenges, this paper proposes a framework for high-resolution sp…
▽ More
Monitoring air pollution is crucial for protecting human health from exposure to harmful substances. Traditional methods of air quality monitoring, such as ground-based sensors and satellite-based remote sensing, face limitations due to high deployment costs, sparse sensor coverage, and environmental interferences. To address these challenges, this paper proposes a framework for high-resolution spatiotemporal Air Quality Index (AQI) mapping using sparse sensor data, satellite imagery, and various spatiotemporal factors. By leveraging Graph Neural Networks (GNNs), we estimate AQI values at unmonitored locations based on both spatial and temporal dependencies. The framework incorporates a wide range of environmental features, including meteorological data, road networks, points of interest (PoIs), population density, and urban green spaces, which enhance prediction accuracy. We illustrate the use of our approach through a case study in Lahore, Pakistan, where multi-resolution data is used to generate the air quality index map at a fine spatiotemporal scale.
△ Less
Submitted 19 January, 2025;
originally announced January 2025.
-
LatentQGAN: A Hybrid QGAN with Classical Convolutional Autoencoder
Authors:
Alexis Vieloszynski,
Soumaya Cherkaoui,
Ola Ahmad,
Jean-Frédéric Laprade,
Oliver Nahman-Lévesque,
Abdallah Aaraba,
Shengrui Wang
Abstract:
Quantum machine learning consists in taking advantage of quantum computations to generate classical data. A potential application of quantum machine learning is to harness the power of quantum computers for generating classical data, a process essential to a multitude of applications such as enriching training datasets, anomaly detection, and risk management in finance. Given the success of Genera…
▽ More
Quantum machine learning consists in taking advantage of quantum computations to generate classical data. A potential application of quantum machine learning is to harness the power of quantum computers for generating classical data, a process essential to a multitude of applications such as enriching training datasets, anomaly detection, and risk management in finance. Given the success of Generative Adversarial Networks in classical image generation, the development of its quantum versions has been actively conducted. However, existing implementations on quantum computers often face significant challenges, such as scalability and training convergence issues. To address these issues, we propose LatentQGAN, a novel quantum model that uses a hybrid quantum-classical GAN coupled with an autoencoder. Although it was initially designed for image generation, the LatentQGAN approach holds potential for broader application across various practical data generation tasks. Experimental outcomes on both classical simulators and noisy intermediate scale quantum computers have demonstrated significant performance enhancements over existing quantum methods, alongside a significant reduction in quantum resources overhead.
△ Less
Submitted 19 November, 2024; v1 submitted 22 September, 2024;
originally announced September 2024.
-
Variational Mode-Driven Graph Convolutional Network for Spatiotemporal Traffic Forecasting
Authors:
Osama Ahmad,
Zubair Khalid
Abstract:
This paper focuses on spatiotemporal (ST) traffic prediction using graph neural networks. Given that ST data consists of non-stationary and complex time events, interpreting and predicting such trends is comparatively complicated. Representation of ST data in modes helps us to infer behavior and assess the impact of noise on prediction applications. We propose a framework that decomposes ST data i…
▽ More
This paper focuses on spatiotemporal (ST) traffic prediction using graph neural networks. Given that ST data consists of non-stationary and complex time events, interpreting and predicting such trends is comparatively complicated. Representation of ST data in modes helps us to infer behavior and assess the impact of noise on prediction applications. We propose a framework that decomposes ST data into modes using the variational mode decomposition (VMD) method, which is then fed into the neural network for forecasting future states. This hybrid approach is known as a variational mode graph convolutional network (VMGCN). Instead of exhaustively searching for the number of modes, they are determined using the reconstruction loss from the real-time application data. We also study the significance of each mode and the impact of bandwidth constraints on different horizon predictions in traffic flow data. We evaluate the performance of our proposed network on the LargeST dataset for both short and long-term predictions. Our framework yields better results compared to state-of-the-art methods.
△ Less
Submitted 15 October, 2024; v1 submitted 28 August, 2024;
originally announced August 2024.
-
QuaCK-TSF: Quantum-Classical Kernelized Time Series Forecasting
Authors:
Abdallah Aaraba,
Soumaya Cherkaoui,
Ola Ahmad,
Jean-Frédéric Laprade,
Olivier Nahman-Lévesque,
Alexis Vieloszynski,
Shengrui Wang
Abstract:
Forecasting in probabilistic time series is a complex endeavor that extends beyond predicting future values to also quantifying the uncertainty inherent in these predictions. Gaussian process regression stands out as a Bayesian machine learning technique adept at addressing this multifaceted challenge. This paper introduces a novel approach that blends the robustness of this Bayesian technique wit…
▽ More
Forecasting in probabilistic time series is a complex endeavor that extends beyond predicting future values to also quantifying the uncertainty inherent in these predictions. Gaussian process regression stands out as a Bayesian machine learning technique adept at addressing this multifaceted challenge. This paper introduces a novel approach that blends the robustness of this Bayesian technique with the nuanced insights provided by the kernel perspective on quantum models, aimed at advancing quantum kernelized probabilistic forecasting. We incorporate a quantum feature map inspired by Ising interactions and demonstrate its effectiveness in capturing the temporal dependencies critical for precise forecasting. The optimization of our model's hyperparameters circumvents the need for computationally intensive gradient descent by employing gradient-free Bayesian optimization. Comparative benchmarks against established classical kernel models are provided, affirming that our quantum-enhanced approach achieves competitive performance.
△ Less
Submitted 21 August, 2024;
originally announced August 2024.
-
DarSwin-Unet: Distortion Aware Encoder-Decoder Architecture
Authors:
Akshaya Athwale,
Ichrak Shili,
Émile Bergeron,
Ola Ahmad,
Jean-François Lalonde
Abstract:
Wide-angle fisheye images are becoming increasingly common for perception tasks in applications such as robotics, security, and mobility (e.g. drones, avionics). However, current models often either ignore the distortions in wide-angle images or are not suitable to perform pixel-level tasks. In this paper, we present an encoder-decoder model based on a radial transformer architecture that adapts t…
▽ More
Wide-angle fisheye images are becoming increasingly common for perception tasks in applications such as robotics, security, and mobility (e.g. drones, avionics). However, current models often either ignore the distortions in wide-angle images or are not suitable to perform pixel-level tasks. In this paper, we present an encoder-decoder model based on a radial transformer architecture that adapts to distortions in wide-angle lenses by leveraging the physical characteristics defined by the radial distortion profile. In contrast to the original model, which only performs classification tasks, we introduce a U-Net architecture, DarSwin-Unet, designed for pixel level tasks. Furthermore, we propose a novel strategy that minimizes sparsity when sampling the image for creating its input tokens. Our approach enhances the model capability to handle pixel-level tasks in wide-angle fisheye images, making it more effective for real-world applications. Compared to other baselines, DarSwin-Unet achieves the best results across different datasets, with significant gains when trained on bounded levels of distortions (very low, low, medium, and high) and tested on all, including out-of-distribution distortions. We demonstrate its performance on depth estimation and show through extensive experiments that DarSwin-Unet can perform zero-shot adaptation to unseen distortions of different wide-angle lenses.
△ Less
Submitted 17 February, 2025; v1 submitted 24 July, 2024;
originally announced July 2024.
-
Biquaternion Windowed Linear Canonical Transform
Authors:
Owais Ahmad,
Aijaz Ahmad Dar
Abstract:
In this paper, we introduce the notion of windowed linear canonical transform in biquaternion setting namely Biquaternion Windowed Linear Canonical Transform (BiQWLCT) and various properties of BiQWLCT, such as linearity, shift, parity, orthogonality relation, inversion formula, Plancherel theorem are established. Heisenberg uncertainty principle associated with the Biquaternion Windowed Linear Ca…
▽ More
In this paper, we introduce the notion of windowed linear canonical transform in biquaternion setting namely Biquaternion Windowed Linear Canonical Transform (BiQWLCT) and various properties of BiQWLCT, such as linearity, shift, parity, orthogonality relation, inversion formula, Plancherel theorem are established. Heisenberg uncertainty principle associated with the Biquaternion Windowed Linear Canonical Transform is also derived. Towards the culmination, an example and some potential applications are presented.
△ Less
Submitted 25 June, 2024;
originally announced June 2024.
-
PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images
Authors:
Parastoo Sotoudeh Sharifi,
M. Omair Ahmad,
M. N. S. Swamy
Abstract:
Recent advancements in deep learning (DL) have significantly advanced medical image analysis. In the field of medical image processing, particularly in histopathology image analysis, the variation in staining protocols and differences in scanners present significant domain shift challenges, undermine the generalization capabilities of models to the data from unseen domains, prompting the need for…
▽ More
Recent advancements in deep learning (DL) have significantly advanced medical image analysis. In the field of medical image processing, particularly in histopathology image analysis, the variation in staining protocols and differences in scanners present significant domain shift challenges, undermine the generalization capabilities of models to the data from unseen domains, prompting the need for effective domain generalization (DG) strategies to improve the consistency and reliability of automated cancer detection tools in diagnostic decision-making. In this paper, we introduce Pathology Weight Averaging (PathoWAve), a multi-source DG strategy for addressing domain shift phenomenon of DL models in histopathology image analysis. Integrating specific weight averaging technique with parallel training trajectories and a strategically combination of regular augmentations with histopathology-specific data augmentation methods, PathoWAve enables a comprehensive exploration and precise convergence within the loss landscape. This method significantly enhanced generalization capabilities of DL models across new, unseen histopathology domains. To the best of our knowledge, PathoWAve is the first proposed weight averaging method for DG in histopathology image analysis. Our quantitative results on Camelyon17 WILDS dataset demonstrate PathoWAve's superiority over previous proposed methods to tackle the domain shift phenomenon in histopathology image processing. Our code is available at \url{https://github.com/ParastooSotoudeh/PathoWAve}.
△ Less
Submitted 21 June, 2024;
originally announced June 2024.
-
Neural Active Learning Meets the Partial Monitoring Framework
Authors:
Maxime Heuillet,
Ola Ahmad,
Audrey Durand
Abstract:
We focus on the online-based active learning (OAL) setting where an agent operates over a stream of observations and trades-off between the costly acquisition of information (labelled observations) and the cost of prediction errors. We propose a novel foundation for OAL tasks based on partial monitoring, a theoretical framework specialized in online learning from partially informative actions. We…
▽ More
We focus on the online-based active learning (OAL) setting where an agent operates over a stream of observations and trades-off between the costly acquisition of information (labelled observations) and the cost of prediction errors. We propose a novel foundation for OAL tasks based on partial monitoring, a theoretical framework specialized in online learning from partially informative actions. We show that previously studied binary and multi-class OAL tasks are instances of partial monitoring. We expand the real-world potential of OAL by introducing a new class of cost-sensitive OAL tasks. We propose NeuralCBP, the first PM strategy that accounts for predictive uncertainty with deep neural networks. Our extensive empirical evaluation on open source datasets shows that NeuralCBP has favorable performance against state-of-the-art baselines on multiple binary, multi-class and cost-sensitive OAL tasks.
△ Less
Submitted 14 May, 2024;
originally announced May 2024.
-
Improving Graph Machine Learning Performance Through Feature Augmentation Based on Network Control Theory
Authors:
Anwar Said,
Obaid Ullah Ahmad,
Waseem Abbas,
Mudassir Shabbir,
Xenofon Koutsoukos
Abstract:
Network control theory (NCT) offers a robust analytical framework for understanding the influence of network topology on dynamic behaviors, enabling researchers to decipher how certain patterns of external control measures can steer system dynamics towards desired states. Distinguished from other structure-function methodologies, NCT's predictive capabilities can be coupled with deploying Graph Ne…
▽ More
Network control theory (NCT) offers a robust analytical framework for understanding the influence of network topology on dynamic behaviors, enabling researchers to decipher how certain patterns of external control measures can steer system dynamics towards desired states. Distinguished from other structure-function methodologies, NCT's predictive capabilities can be coupled with deploying Graph Neural Networks (GNNs), which have demonstrated exceptional utility in various network-based learning tasks. However, the performance of GNNs heavily relies on the expressiveness of node features, and the lack of node features can greatly degrade their performance. Furthermore, many real-world systems may lack node-level information, posing a challenge for GNNs.To tackle this challenge, we introduce a novel approach, NCT-based Enhanced Feature Augmentation (NCT-EFA), that assimilates average controllability, along with other centrality indices, into the feature augmentation pipeline to enhance GNNs performance. Our evaluation of NCT-EFA, on six benchmark GNN models across two experimental setting. solely employing average controllability and in combination with additional centrality metrics. showcases an improved performance reaching as high as 11%. Our results demonstrate that incorporating NCT into feature enrichment can substantively extend the applicability and heighten the performance of GNNs in scenarios where node-level information is unavailable.
△ Less
Submitted 3 May, 2024;
originally announced May 2024.
-
Trajectory Planning of Robotic Manipulator in Dynamic Environment Exploiting DRL
Authors:
Osama Ahmad,
Zawar Hussain,
Hammad Naeem
Abstract:
This study is about the implementation of a reinforcement learning algorithm in the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick and place the randomly placed block at a random target point in an unknown environment. The obstacle is randomly moving which creates a hurdle in picking the object. The objective of the robot is to avoid the obstacle and pick the block with c…
▽ More
This study is about the implementation of a reinforcement learning algorithm in the trajectory planning of manipulators. We have a 7-DOF robotic arm to pick and place the randomly placed block at a random target point in an unknown environment. The obstacle is randomly moving which creates a hurdle in picking the object. The objective of the robot is to avoid the obstacle and pick the block with constraints to a fixed timestamp. In this literature, we have applied a deep deterministic policy gradient (DDPG) algorithm and compared the model's efficiency with dense and sparse rewards.
△ Less
Submitted 25 March, 2024;
originally announced March 2024.
-
Radiative lifetime of the A 2Π1/2 state in RaF with relevance to laser cooling
Authors:
M. Athanasakis-Kaklamanakis,
S. G. Wilkins,
P. Lassègues,
L. Lalanne,
J. R. Reilly,
O. Ahmad,
M. Au,
S. W. Bai,
J. Berbalk,
C. Bernerd,
A. Borschevsky,
A. A. Breier,
K. Chrysalidis,
T. E. Cocolios,
R. P. de Groote,
C. M. Fajardo-Zambrano,
K. T. Flanagan,
S. Franchoo,
R. F. Garcia Ruiz,
D. Hanstorp,
R. Heinke,
P. Imgram,
A. Koszorús,
A. A. Kyuberis,
J. Lim
, et al. (16 additional authors not shown)
Abstract:
The radiative lifetime of the $A$ $^2 Π_{1/2}$ (v=0) state in radium monofluoride (RaF) is measured to be 35(1) ns. The lifetime of this state and the related decay rate $Γ= 2.86(8) \times 10^7$ $s^{-1}$ are of relevance to the laser cooling of RaF via the optically closed $A$ $^2 Π_{1/2} \leftarrow X$ $^2Σ_{1/2}$ transition, which makes the molecule a promising probe to search for new physics. Ra…
▽ More
The radiative lifetime of the $A$ $^2 Π_{1/2}$ (v=0) state in radium monofluoride (RaF) is measured to be 35(1) ns. The lifetime of this state and the related decay rate $Γ= 2.86(8) \times 10^7$ $s^{-1}$ are of relevance to the laser cooling of RaF via the optically closed $A$ $^2 Π_{1/2} \leftarrow X$ $^2Σ_{1/2}$ transition, which makes the molecule a promising probe to search for new physics. RaF is found to have a comparable photon-scattering rate to homoelectronic laser-coolable molecules. Thanks to its highly diagonal Franck-Condon matrix, it is expected to scatter an order of magnitude more photons than other molecules when using just 3 cooling lasers, before it decays to a dark state. The lifetime measurement in RaF is benchmarked by measuring the lifetime of the $8P_{3/2}$ state in Fr to be 83(3) ns, in agreement with literature.
△ Less
Submitted 6 June, 2024; v1 submitted 14 March, 2024;
originally announced March 2024.
-
Control-based Graph Embeddings with Data Augmentation for Contrastive Learning
Authors:
Obaid Ullah Ahmad,
Anwar Said,
Mudassir Shabbir,
Waseem Abbas,
Xenofon Koutsoukos
Abstract:
In this paper, we study the problem of unsupervised graph representation learning by harnessing the control properties of dynamical networks defined on graphs. Our approach introduces a novel framework for contrastive learning, a widely prevalent technique for unsupervised representation learning. A crucial step in contrastive learning is the creation of 'augmented' graphs from the input graphs. T…
▽ More
In this paper, we study the problem of unsupervised graph representation learning by harnessing the control properties of dynamical networks defined on graphs. Our approach introduces a novel framework for contrastive learning, a widely prevalent technique for unsupervised representation learning. A crucial step in contrastive learning is the creation of 'augmented' graphs from the input graphs. Though different from the original graphs, these augmented graphs retain the original graph's structural characteristics. Here, we propose a unique method for generating these augmented graphs by leveraging the control properties of networks. The core concept revolves around perturbing the original graph to create a new one while preserving the controllability properties specific to networks and graphs. Compared to the existing methods, we demonstrate that this innovative approach enhances the effectiveness of contrastive learning frameworks, leading to superior results regarding the accuracy of the classification tasks. The key innovation lies in our ability to decode the network structure using these control properties, opening new avenues for unsupervised graph representation learning.
△ Less
Submitted 17 April, 2024; v1 submitted 7 March, 2024;
originally announced March 2024.
-
Improvising Age Verification Technologies in Canada: Technical, Regulatory and Social Dynamics
Authors:
Azfar Adib,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
Age verification, which is a mandatory legal requirement for delivering certain age-appropriate services or products, has recently been emphasized around the globe to ensure online safety for children. The rapid advancement of artificial intelligence has facilitated the recent development of some cutting-edge age-verification technologies, particularly using biometrics. However, successful deploym…
▽ More
Age verification, which is a mandatory legal requirement for delivering certain age-appropriate services or products, has recently been emphasized around the globe to ensure online safety for children. The rapid advancement of artificial intelligence has facilitated the recent development of some cutting-edge age-verification technologies, particularly using biometrics. However, successful deployment and mass acceptance of these technologies are significantly dependent on the corresponding socio-economic and regulatory context. This paper reviews such key dynamics for improvising age-verification technologies in Canada. It is particularly essential for such technologies to be inclusive, transparent, adaptable, privacy-preserving, and secure. Effective collaboration between academia, government, and industry entities can help to meet the growing demands for age-verification services in Canada while maintaining a user-centric approach.
△ Less
Submitted 15 February, 2024;
originally announced February 2024.
-
Randomized Confidence Bounds for Stochastic Partial Monitoring
Authors:
Maxime Heuillet,
Ola Ahmad,
Audrey Durand
Abstract:
The partial monitoring (PM) framework provides a theoretical formulation of sequential learning problems with incomplete feedback. On each round, a learning agent plays an action while the environment simultaneously chooses an outcome. The agent then observes a feedback signal that is only partially informative about the (unobserved) outcome. The agent leverages the received feedback signals to se…
▽ More
The partial monitoring (PM) framework provides a theoretical formulation of sequential learning problems with incomplete feedback. On each round, a learning agent plays an action while the environment simultaneously chooses an outcome. The agent then observes a feedback signal that is only partially informative about the (unobserved) outcome. The agent leverages the received feedback signals to select actions that minimize the (unobserved) cumulative loss. In contextual PM, the outcomes depend on some side information that is observable by the agent before selecting the action on each round. In this paper, we consider the contextual and non-contextual PM settings with stochastic outcomes. We introduce a new class of PM strategies based on the randomization of deterministic confidence bounds. We also extend regret guarantees to settings where existing stochastic strategies are not applicable. Our experiments show that the proposed RandCBP and RandCBPsidestar strategies have favorable performance against state-of-the-art baselines in multiple PM games. To advocate for the adoption of the PM framework, we design a use case on the real-world problem of monitoring the error rate of any deployed classification system.
△ Less
Submitted 15 May, 2024; v1 submitted 7 February, 2024;
originally announced February 2024.
-
Successive Data Injection in Conditional Quantum GAN Applied to Time Series Anomaly Detection
Authors:
Benjamin Kalfon,
Soumaya Cherkaoui,
Jean-Frédéric Laprade,
Ola Ahmad,
Shengrui Wang
Abstract:
Classical GAN architectures have shown interesting results for solving anomaly detection problems in general and for time series anomalies in particular, such as those arising in communication networks. In recent years, several quantum GAN architectures have been proposed in the literature. When detecting anomalies in time series using QGANs, huge challenges arise due to the limited number of qubi…
▽ More
Classical GAN architectures have shown interesting results for solving anomaly detection problems in general and for time series anomalies in particular, such as those arising in communication networks. In recent years, several quantum GAN architectures have been proposed in the literature. When detecting anomalies in time series using QGANs, huge challenges arise due to the limited number of qubits compared to the size of the data. To address these challenges, we propose a new high-dimensional encoding approach, named Successive Data Injection (SuDaI). In this approach, we explore a larger portion of the quantum state than that in the conventional angle encoding, the method used predominantly in the literature, through repeated data injections into the quantum state. SuDaI encoding allows us to adapt the QGAN for anomaly detection with network data of a much higher dimensionality than with the existing known QGANs implementations. In addition, SuDaI encoding applies to other types of high-dimensional time series and can be used in contexts beyond anomaly detection and QGANs, opening up therefore multiple fields of application.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Mending of Spatio-Temporal Dependencies in Block Adjacency Matrix
Authors:
Osama Ahmad,
Omer Abdul Jalil,
Usman Nazir,
Murtaza Taj
Abstract:
In the realm of applications where data dynamically evolves across spatial and temporal dimensions, Graph Neural Networks (GNNs) are often complemented by sequence modeling architectures, such as RNNs and transformers, to effectively model temporal changes. These hybrid models typically arrange the spatial and temporal learning components in series. A pioneering effort to jointly model the spatio-…
▽ More
In the realm of applications where data dynamically evolves across spatial and temporal dimensions, Graph Neural Networks (GNNs) are often complemented by sequence modeling architectures, such as RNNs and transformers, to effectively model temporal changes. These hybrid models typically arrange the spatial and temporal learning components in series. A pioneering effort to jointly model the spatio-temporal dependencies using only GNNs was the introduction of the Block Adjacency Matrix \(\mathbf{A_B}\) \cite{1}, which was constructed by diagonally concatenating adjacency matrices from graphs at different time steps. This approach resulted in a single graph encompassing complete spatio-temporal data; however, the graphs from different time steps remained disconnected, limiting GNN message-passing to spatially connected nodes only. Addressing this critical challenge, we propose a novel end-to-end learning architecture specifically designed to mend the temporal dependencies, resulting in a well-connected graph. Thus, we provide a framework for the learnable representation of spatio-temporal data as graphs. Our methodology demonstrates superior performance on benchmark datasets, such as SurgVisDom and C2D2, surpassing existing state-of-the-art graph models in terms of accuracy. Our model also achieves significantly lower computational complexity, having far fewer parameters than methods reliant on CLIP and 3D CNN architectures.
△ Less
Submitted 30 August, 2024; v1 submitted 4 October, 2023;
originally announced October 2023.
-
Controllability Backbone in Networks
Authors:
Obaid Ullah Ahmad,
Waseem Abbas,
Mudassir Shabbir
Abstract:
This paper studies the controllability backbone problem in dynamical networks defined over graphs. The main idea of the controllability backbone is to identify a small subset of edges in a given network such that any subnetwork containing those edges/links has at least the same network controllability as the original network while assuming the same set of input/leader vertices. We consider the str…
▽ More
This paper studies the controllability backbone problem in dynamical networks defined over graphs. The main idea of the controllability backbone is to identify a small subset of edges in a given network such that any subnetwork containing those edges/links has at least the same network controllability as the original network while assuming the same set of input/leader vertices. We consider the strong structural controllability (SSC) in our work, which is useful but computationally challenging. Thus, we utilize two lower bounds on the network's SSC based on the zero forcing notion and graph distances. We provide algorithms to compute controllability backbones while preserving these lower bounds. We thoroughly analyze the proposed algorithms and compute the number of edges in the controllability backbones. Finally, we compare and numerically evaluate our methods on random graphs.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Pinning down electron correlations in RaF via spectroscopy of excited states and high-accuracy relativistic quantum chemistry
Authors:
M. Athanasakis-Kaklamanakis,
S. G. Wilkins,
L. V. Skripnikov,
A. Koszorús,
A. A. Breier,
O. Ahmad,
M. Au,
S. W. Bai,
I. Belošević,
J. Berbalk,
R. Berger,
C. Bernerd,
M. L. Bissell,
A. Borschevsky,
A. Brinson,
K. Chrysalidis,
T. E. Cocolios,
R. P. de Groote,
A. Dorne,
C. M. Fajardo-Zambrano,
R. W. Field,
K. T. Flanagan,
S. Franchoo,
R. F. Garcia Ruiz,
K. Gaul
, et al. (31 additional authors not shown)
Abstract:
We report the spectroscopy of the 14 lowest excited electronic states in the radioactive molecule radium monofluoride (RaF). The observed excitation energies are compared with fully relativistic state-of-the-art Fock-space coupled cluster (FS-RCC) calculations, which achieve an agreement of >=99.64% (within ~12 meV) with experiment for all states. Guided by theory, a firm assignment of the angular…
▽ More
We report the spectroscopy of the 14 lowest excited electronic states in the radioactive molecule radium monofluoride (RaF). The observed excitation energies are compared with fully relativistic state-of-the-art Fock-space coupled cluster (FS-RCC) calculations, which achieve an agreement of >=99.64% (within ~12 meV) with experiment for all states. Guided by theory, a firm assignment of the angular momentum and term symbol is made for 10 states and a tentative assignment for 4 states. The role of high-order electron correlation and quantum electrodynamics effects in the excitation energy of excited states is studied, found to be important for all states. Establishing the simultaneous accuracy and precision of calculations is an important step for research at the intersection of particle, nuclear, and chemical physics, including searches of physics beyond the Standard Model, for which RaF is a promising probe.
△ Less
Submitted 20 December, 2024; v1 submitted 28 August, 2023;
originally announced August 2023.
-
MoP-CLIP: A Mixture of Prompt-Tuned CLIP Models for Domain Incremental Learning
Authors:
Julien Nicolas,
Florent Chiaroni,
Imtiaz Ziko,
Ola Ahmad,
Christian Desrosiers,
Jose Dolz
Abstract:
Despite the recent progress in incremental learning, addressing catastrophic forgetting under distributional drift is still an open and important problem. Indeed, while state-of-the-art domain incremental learning (DIL) methods perform satisfactorily within known domains, their performance largely degrades in the presence of novel domains. This limitation hampers their generalizability, and restri…
▽ More
Despite the recent progress in incremental learning, addressing catastrophic forgetting under distributional drift is still an open and important problem. Indeed, while state-of-the-art domain incremental learning (DIL) methods perform satisfactorily within known domains, their performance largely degrades in the presence of novel domains. This limitation hampers their generalizability, and restricts their scalability to more realistic settings where train and test data are drawn from different distributions. To address these limitations, we present a novel DIL approach based on a mixture of prompt-tuned CLIP models (MoP-CLIP), which generalizes the paradigm of S-Prompting to handle both in-distribution and out-of-distribution data at inference. In particular, at the training stage we model the features distribution of every class in each domain, learning individual text and visual prompts to adapt to a given domain. At inference, the learned distributions allow us to identify whether a given test sample belongs to a known domain, selecting the correct prompt for the classification task, or from an unseen domain, leveraging a mixture of the prompt-tuned CLIP models. Our empirical evaluation reveals the poor performance of existing DIL methods under domain shift, and suggests that the proposed MoP-CLIP performs competitively in the standard DIL settings while outperforming state-of-the-art methods in OOD scenarios. These results demonstrate the superiority of MoP-CLIP, offering a robust and general solution to the problem of domain incremental learning.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Causal Analysis for Robust Interpretability of Neural Networks
Authors:
Ola Ahmad,
Nicolas Bereux,
Loïc Baret,
Vahid Hashemi,
Freddy Lecue
Abstract:
Interpreting the inner function of neural networks is crucial for the trustworthy development and deployment of these black-box models. Prior interpretability methods focus on correlation-based measures to attribute model decisions to individual examples. However, these measures are susceptible to noise and spurious correlations encoded in the model during the training phase (e.g., biased inputs,…
▽ More
Interpreting the inner function of neural networks is crucial for the trustworthy development and deployment of these black-box models. Prior interpretability methods focus on correlation-based measures to attribute model decisions to individual examples. However, these measures are susceptible to noise and spurious correlations encoded in the model during the training phase (e.g., biased inputs, model overfitting, or misspecification). Moreover, this process has proven to result in noisy and unstable attributions that prevent any transparent understanding of the model's behavior. In this paper, we develop a robust interventional-based method grounded by causal analysis to capture cause-effect mechanisms in pre-trained neural networks and their relation to the prediction. Our novel approach relies on path interventions to infer the causal mechanisms within hidden layers and isolate relevant and necessary information (to model prediction), avoiding noisy ones. The result is task-specific causal explanatory graphs that can audit model behavior and express the actual causes underlying its performance. We apply our method to vision models trained on classification tasks. On image classification tasks, we provide extensive quantitative experiments to show that our approach can capture more stable and faithful explanations than standard attribution-based methods. Furthermore, the underlying causal graphs reveal the neural interactions in the model, making it a valuable tool in other applications (e.g., model repair).
△ Less
Submitted 20 June, 2023; v1 submitted 15 May, 2023;
originally announced May 2023.
-
Accelerating microstructure modelling via machine learning: a new method combining Autoencoder and ConvLSTM
Authors:
Owais Ahmad,
Naveen Kumar,
Rajdip Mukherjee,
Somnath Bhowmick
Abstract:
Phase-field modeling is an elegant and versatile computation tool to predict microstructure evolution in materials in the mesoscale regime. However, these simulations require rigorous numerical solutions of differential equations, which are accurate but computationally expensive. To overcome this difficulty, we combine two popular machine learning techniques, autoencoder and convolutional long sho…
▽ More
Phase-field modeling is an elegant and versatile computation tool to predict microstructure evolution in materials in the mesoscale regime. However, these simulations require rigorous numerical solutions of differential equations, which are accurate but computationally expensive. To overcome this difficulty, we combine two popular machine learning techniques, autoencoder and convolutional long short-term memory (ConvLSTM), to accelerate the study of microstructural evolution without compromising the resolution of the microstructural representation. After training with phase-field generated microstructures of ten known compositions, the model can accurately predict the microstructure for the future nth frames based on previous m frames for an unknown composition. Replacing n phase-field steps with machine-learned microstructures can significantly accelerate the in silico study of microstructure evolution.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
DarSwin: Distortion Aware Radial Swin Transformer
Authors:
Akshaya Athwale,
Arman Afrasiyabi,
Justin Lagüe,
Ichrak Shili,
Ola Ahmad,
Jean-François Lalonde
Abstract:
Wide-angle lenses are commonly used in perception tasks requiring a large field of view. Unfortunately, these lenses produce significant distortions, making conventional models that ignore the distortion effects unable to adapt to wide-angle images. In this paper, we present a novel transformer-based model that automatically adapts to the distortion produced by wide-angle lenses. Our proposed imag…
▽ More
Wide-angle lenses are commonly used in perception tasks requiring a large field of view. Unfortunately, these lenses produce significant distortions, making conventional models that ignore the distortion effects unable to adapt to wide-angle images. In this paper, we present a novel transformer-based model that automatically adapts to the distortion produced by wide-angle lenses. Our proposed image encoder architecture, dubbed DarSwin, leverages the physical characteristics of such lenses analytically defined by the radial distortion profile. In contrast to conventional transformer-based architectures, DarSwin comprises a radial patch partitioning, a distortion-based sampling technique for creating token embeddings, and an angular position encoding for radial patch merging. Compared to other baselines, DarSwin achieves the best results on different datasets with significant gains when trained on bounded levels of distortions (very low, low, medium, and high) and tested on all, including out-of-distribution distortions. While the base DarSwin architecture requires knowledge of the radial distortion profile, we show it can be combined with a self-calibration network that estimates such a profile from the input image itself, resulting in a completely uncalibrated pipeline. Finally, we also present DarSwin-Unet, which extends DarSwin, to an encoder-decoder architecture suitable for pixel-level tasks. We demonstrate its performance on depth estimation and show through extensive experiments that DarSwin-Unet can perform zero-shot adaptation to unseen distortions of different wide-angle lenses. The code and models are publicly available at https://lvsn.github.io/darswin/
△ Less
Submitted 24 July, 2024; v1 submitted 19 April, 2023;
originally announced April 2023.
-
An initial Theory to Understand and Manage Requirements Engineering Debt in Practice
Authors:
Julian Frattini,
Davide Fucci,
Daniel Mendez,
Rodrigo Spinola,
Vladimir Mandic,
Nebojsa Tausan,
Muhammad Ovais Ahmad,
Javier Gonzalez-Huerta
Abstract:
Context: Advances in technical debt research demonstrate the benefits of applying the financial debt metaphor to support decision-making in software development activities. Although decision-making during requirements engineering has significant consequences, the debt metaphor in requirements engineering is inadequately explored. Objective: We aim to conceptualize how the debt metaphor applies to…
▽ More
Context: Advances in technical debt research demonstrate the benefits of applying the financial debt metaphor to support decision-making in software development activities. Although decision-making during requirements engineering has significant consequences, the debt metaphor in requirements engineering is inadequately explored. Objective: We aim to conceptualize how the debt metaphor applies to requirements engineering by organizing concepts related to practitioners' understanding and managing of requirements engineering debt (RED). Method: We conducted two in-depth expert interviews to identify key requirements engineering debt concepts and construct a survey instrument. We surveyed 69 practitioners worldwide regarding their perception of the concepts and developed an initial analytical theory. Results: We propose a RED theory that aligns key concepts from technical debt research but emphasizes the specific nature of requirements engineering. In particular, the theory consists of 23 falsifiable propositions derived from the literature, the interviews, and survey results. Conclusions: The concepts of requirements engineering debt are perceived to be similar to their technical debt counterpart. Nevertheless, measuring and tracking requirements engineering debt are immature in practice. Our proposed theory serves as the first guide toward further research in this area.
△ Less
Submitted 8 March, 2023; v1 submitted 11 November, 2022;
originally announced November 2022.
-
Team performance and large scale agile software development
Authors:
Muhammad Ovais Ahmad,
Hadi Ghanbari,
Tomas Gustavsson
Abstract:
Software development is a team work and largely dependent on open social interaction and continuous learning of individuals. Drawing on well established theoretical concepts proposed by social psychology and organizational science disciplines, we develop a theoretical framework proposing that team climate has a significant influence on team learning and ultimately affects team performance. Our stu…
▽ More
Software development is a team work and largely dependent on open social interaction and continuous learning of individuals. Drawing on well established theoretical concepts proposed by social psychology and organizational science disciplines, we develop a theoretical framework proposing that team climate has a significant influence on team learning and ultimately affects team performance. Our study consists of two goals. First to understand the preconditions of team learning and second to investigate the relationship between team learning, psychological safety, and team performance in large scale agile software development projects. We plan to conduct a survey with software professionals in Sweden from three companies partners in pur large-scale agile research project.
△ Less
Submitted 16 September, 2022;
originally announced September 2022.
-
DiffeoRaptor: Diffeomorphic Inter-modal Image Registration using RaPTOR
Authors:
Nima Masoumi,
Hassan Rivaz,
M. Omair Ahmad,
Yiming Xiao
Abstract:
Purpose: Diffeomorphic image registration is essential in many medical imaging applications. Several registration algorithms of such type have been proposed, but primarily for intra-contrast alignment. Currently, efficient inter-modal/contrast diffeomorphic registration, which is vital in numerous applications, remains a challenging task. Methods: We proposed a novel inter-modal/contrast registrat…
▽ More
Purpose: Diffeomorphic image registration is essential in many medical imaging applications. Several registration algorithms of such type have been proposed, but primarily for intra-contrast alignment. Currently, efficient inter-modal/contrast diffeomorphic registration, which is vital in numerous applications, remains a challenging task. Methods: We proposed a novel inter-modal/contrast registration algorithm that leverages Robust PaTch-based cOrrelation Ratio (RaPTOR) metric to allow inter-modal/contrast image alignment and bandlimited geodesic shooting demonstrated in Fourier Approximated Lie Algebras (FLASH) algorithm for fast diffeomorphic registration. Results: The proposed algorithm, named DiffeoRaptor, was validated with three public databases for the tasks of brain and abdominal image registration while comparing the results against three state-of-the-art techniques, including FLASH, NiftyReg, and Symmetric image normalization (SyN). Conclusions: Our results demonstrated that DiffeoRaptor offered comparable or better registration performance in terms of registration accuracy. Moreover, DiffeoRaptor produces smoother deformations than SyN in inter-modal and contrast registration. The code for DiffeoRaptor is publicly available at https://github.com/nimamasoumi/DiffeoRaptor.
△ Less
Submitted 12 September, 2022;
originally announced September 2022.
-
FisheyeHDK: Hyperbolic Deformable Kernel Learning for Ultra-Wide Field-of-View Image Recognition
Authors:
Ola Ahmad,
Freddy Lecue
Abstract:
Conventional convolution neural networks (CNNs) trained on narrow Field-of-View (FoV) images are the state-of-the-art approaches for object recognition tasks. Some methods proposed the adaptation of CNNs to ultra-wide FoV images by learning deformable kernels. However, they are limited by the Euclidean geometry and their accuracy degrades under strong distortions caused by fisheye projections. In…
▽ More
Conventional convolution neural networks (CNNs) trained on narrow Field-of-View (FoV) images are the state-of-the-art approaches for object recognition tasks. Some methods proposed the adaptation of CNNs to ultra-wide FoV images by learning deformable kernels. However, they are limited by the Euclidean geometry and their accuracy degrades under strong distortions caused by fisheye projections. In this work, we demonstrate that learning the shape of convolution kernels in non-Euclidean spaces is better than existing deformable kernel methods. In particular, we propose a new approach that learns deformable kernel parameters (positions) in hyperbolic space. FisheyeHDK is a hybrid CNN architecture combining hyperbolic and Euclidean convolution layers for positions and features learning. First, we provide an intuition of hyperbolic space for wide FoV images. Using synthetic distortion profiles, we demonstrate the effectiveness of our approach. We select two datasets - Cityscapes and BDD100K 2020 - of perspective images which we transform to fisheye equivalents at different scaling factors (analog to focal lengths). Finally, we provide an experiment on data collected by a real fisheye camera. Validations and experiments show that our approach improves existing deformable kernel methods for CNN adaptation on fisheye images.
△ Less
Submitted 14 March, 2022;
originally announced March 2022.
-
SEMOUR: A Scripted Emotional Speech Repository for Urdu
Authors:
Nimra Zaheer,
Obaid Ullah Ahmad,
Ammar Ahmed,
Muhammad Shehryar Khan,
Mudassir Shabbir
Abstract:
Designing reliable Speech Emotion Recognition systems is a complex task that inevitably requires sufficient data for training purposes. Such extensive datasets are currently available in only a few languages, including English, German, and Italian. In this paper, we present SEMOUR, the first scripted database of emotion-tagged speech in the Urdu language, to design an Urdu Speech Recognition Syste…
▽ More
Designing reliable Speech Emotion Recognition systems is a complex task that inevitably requires sufficient data for training purposes. Such extensive datasets are currently available in only a few languages, including English, German, and Italian. In this paper, we present SEMOUR, the first scripted database of emotion-tagged speech in the Urdu language, to design an Urdu Speech Recognition System. Our gender-balanced dataset contains 15,040 unique instances recorded by eight professional actors eliciting a syntactically complex script. The dataset is phonetically balanced, and reliably exhibits a varied set of emotions as marked by the high agreement scores among human raters in experiments. We also provide various baseline speech emotion prediction scores on the database, which could be used for various applications like personalized robot assistants, diagnosis of psychological disorders, and getting feedback from a low-tech-enabled population, etc. On a random test sample, our model correctly predicts an emotion with a state-of-the-art 92% accuracy.
△ Less
Submitted 19 May, 2021;
originally announced May 2021.
-
Adaptable Deformable Convolutions for Semantic Segmentation of Fisheye Images in Autonomous Driving Systems
Authors:
Clément Playout,
Ola Ahmad,
Freddy Lecue,
Farida Cheriet
Abstract:
Advanced Driver-Assistance Systems rely heavily on perception tasks such as semantic segmentation where images are captured from large field of view (FoV) cameras. State-of-the-art works have made considerable progress toward applying Convolutional Neural Network (CNN) to standard (rectilinear) images. However, the large FoV cameras used in autonomous vehicles produce fisheye images characterized…
▽ More
Advanced Driver-Assistance Systems rely heavily on perception tasks such as semantic segmentation where images are captured from large field of view (FoV) cameras. State-of-the-art works have made considerable progress toward applying Convolutional Neural Network (CNN) to standard (rectilinear) images. However, the large FoV cameras used in autonomous vehicles produce fisheye images characterized by strong geometric distortion. This work demonstrates that a CNN trained on standard images can be readily adapted to fisheye images, which is crucial in real-world applications where time-consuming real-time data transformation must be avoided. Our adaptation protocol mainly relies on modifying the support of the convolutions by using their deformable equivalents on top of pre-existing layers. We prove that tuning an optimal support only requires a limited amount of labeled fisheye images, as a small number of training samples is sufficient to significantly improve an existing model's performance on wide-angle images. Furthermore, we show that finetuning the weights of the network is not necessary to achieve high performance once the deformable components are learned. Finally, we provide an in-depth analysis of the effect of the deformable convolutions, bringing elements of discussion on the behavior of CNN models.
△ Less
Submitted 19 February, 2021;
originally announced February 2021.
-
Novel Special Affine Wavelet Transform and Associated Uncertainity Inequalities
Authors:
Owais Ahmad,
Neyaz Ahmad Sheikh
Abstract:
{.2in} {\small {\bf Abstract.} Due to the extra degrees of freedom, special affine Fourier transform (SAFT) has achieved a respectable status within a short span and got versatile applicability in the areas of signal processing, image processing,sampling theory, quantum mechanics. However, due to its global kernel, SAFT fails to obtain local information of non-transient signals. To overcome this,…
▽ More
{.2in} {\small {\bf Abstract.} Due to the extra degrees of freedom, special affine Fourier transform (SAFT) has achieved a respectable status within a short span and got versatile applicability in the areas of signal processing, image processing,sampling theory, quantum mechanics. However, due to its global kernel, SAFT fails to obtain local information of non-transient signals. To overcome this, we in this paper introduce the concept of novel special affine wavelet transform (NSAWT) and extend key harmonic analysis results to NSAWT analogous to those for the wavelet transform. We first establish some fundamental properties including Moyal's principle, Inversion formula and the range theorem. Some Heisenberg type inequalities and Pitt's inequality are established for SAFT and consequently Heisenberg uncertainity principle is derived for NSAWT.
△ Less
Submitted 25 September, 2020;
originally announced September 2020.
-
Fractional Multiresolution Analysis and Associated Scaling Functions in $L^2(\mathbb R)$
Authors:
Owais Ahmad,
Neyaz A. Sheikh,
Firdous A. Shah
Abstract:
In this paper, we show how to construct an orthonormal basis from Riesz basis by assuming that the fractional translates of a single function in the core subspace of the fractional multiresolution analysis form a Riesz basis instead of an orthonormal basis. In the definition of fractional multiresolution analysis, we show that the intersection triviality condition follows from the other conditions…
▽ More
In this paper, we show how to construct an orthonormal basis from Riesz basis by assuming that the fractional translates of a single function in the core subspace of the fractional multiresolution analysis form a Riesz basis instead of an orthonormal basis. In the definition of fractional multiresolution analysis, we show that the intersection triviality condition follows from the other conditions. Furthermore, we show that the union density condition also follows under the assumption that the fractional Fourier transform of the scaling function is continuous at $0$. At the culmination, we provide the complete characterization of the scaling functions associated with fractional multiresolutrion analysis.
△ Less
Submitted 20 August, 2020;
originally announced August 2020.
-
Fractional Biorthogonal wavelets in $L^2(\mathbb R)$
Authors:
Owais Ahmad,
Neyaz A. Sheikh,
Firdous A. Shah
Abstract:
The fractional Fourier transform (FrFT), which is a generalization of the Fourier transform, has become the focus of many research papers in recent years because of its applications in electrical engineering and optics. In this paper, we introduce the notion of fractional biorthogonal wavelets on $\mathbb{R}$ and obtain the necessary and sufficient conditions for the translates of a single functio…
▽ More
The fractional Fourier transform (FrFT), which is a generalization of the Fourier transform, has become the focus of many research papers in recent years because of its applications in electrical engineering and optics. In this paper, we introduce the notion of fractional biorthogonal wavelets on $\mathbb{R}$ and obtain the necessary and sufficient conditions for the translates of a single function to form the fractional Riesz bases for their closed linear span. We also provide a complete characterization for the fractional biorthogonality of the translates of fractional scaling functions of two fractional MRAs and the associated fractional biorthogonal wavelet families. Moreover, under mild assumptions on the fractional scaling functions and the corresponding fractional wavelets, we show that the fractional wavelets can generate Reisz bases for $L^2(\mathbb R).$.
△ Less
Submitted 20 August, 2020;
originally announced August 2020.
-
Multi-Site Infant Brain Segmentation Algorithms: The iSeg-2019 Challenge
Authors:
Yue Sun,
Kun Gao,
Zhengwang Wu,
Zhihao Lei,
Ying Wei,
Jun Ma,
Xiaoping Yang,
Xue Feng,
Li Zhao,
Trung Le Phan,
Jitae Shin,
Tao Zhong,
Yu Zhang,
Lequan Yu,
Caizi Li,
Ramesh Basnet,
M. Omair Ahmad,
M. N. S. Swamy,
Wenao Ma,
Qi Dou,
Toan Duc Bui,
Camilo Bermudez Noguera,
Bennett Landman,
Ian H. Gotlib,
Kathryn L. Humphreys
, et al. (8 additional authors not shown)
Abstract:
To better understand early brain growth patterns in health and disorder, it is critical to accurately segment infant brain magnetic resonance (MR) images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). Deep learning-based methods have achieved state-of-the-art performance; however, one of major limitations is that the learning-based methods may suffer from the multi-site i…
▽ More
To better understand early brain growth patterns in health and disorder, it is critical to accurately segment infant brain magnetic resonance (MR) images into white matter (WM), gray matter (GM), and cerebrospinal fluid (CSF). Deep learning-based methods have achieved state-of-the-art performance; however, one of major limitations is that the learning-based methods may suffer from the multi-site issue, that is, the models trained on a dataset from one site may not be applicable to the datasets acquired from other sites with different imaging protocols/scanners. To promote methodological development in the community, iSeg-2019 challenge (http://iseg2019.web.unc.edu) provides a set of 6-month infant subjects from multiple sites with different protocols/scanners for the participating methods. Training/validation subjects are from UNC (MAP) and testing subjects are from UNC/UMN (BCP), Stanford University, and Emory University. By the time of writing, there are 30 automatic segmentation methods participating in iSeg-2019. We review the 8 top-ranked teams by detailing their pipelines/implementations, presenting experimental results and evaluating performance in terms of the whole brain, regions of interest, and gyral landmark curves. We also discuss their limitations and possible future directions for the multi-site issue. We hope that the multi-site dataset in iSeg-2019 and this review article will attract more researchers on the multi-site issue.
△ Less
Submitted 11 July, 2020; v1 submitted 4 July, 2020;
originally announced July 2020.
-
Design and Hardware Implementation of a Separable Image Steganographic Scheme Using Public-key Cryptosystem
Authors:
Salah Harb,
M. Omair Ahmad,
M. N. S Swamy
Abstract:
In this paper, a novel and efficient hardware implementation of steganographic cryptosystem based on a public-key cryptography is proposed. Digital images are utilized as carriers of secret data between sender and receiver parties in the communication channel. The proposed public-key cryptosystem offers a separable framework that allows to embed or extract secret data and encrypt or decrypt the ca…
▽ More
In this paper, a novel and efficient hardware implementation of steganographic cryptosystem based on a public-key cryptography is proposed. Digital images are utilized as carriers of secret data between sender and receiver parties in the communication channel. The proposed public-key cryptosystem offers a separable framework that allows to embed or extract secret data and encrypt or decrypt the carrier using the public-private key pair, independently. Paillier cryptographic system is adopted to encrypt and decrypt pixels of the digital image. To achieve efficiency, a proposed efficient parallel montgomery exponentiation core is designed and implemented for performing the underlying field operations in the Paillier cryptosystem. The hardware implementation results of the proposed steganographic cryptosystem show an efficiency in terms of area (resources), performance (speed) and power consumption. Our steganographic cryptosystem represents a small footprint making it well-suited for the embedded systems and real-time processing engines in applications such as medical scanning devices, autopilot cars and drones.
△ Less
Submitted 4 June, 2020;
originally announced June 2020.
-
Applying r-spatiogram in object tracking for occlusion handling
Authors:
Niloufar Salehi Dastjerdi,
M. Omair Ahmad
Abstract:
Object tracking is one of the most important problems in computer vision. The aim of video tracking is to extract the trajectories of a target or object of interest, i.e. accurately locate a moving target in a video sequence and discriminate target from non-targets in the feature space of the sequence. So, feature descriptors can have significant effects on such discrimination. In this paper, we u…
▽ More
Object tracking is one of the most important problems in computer vision. The aim of video tracking is to extract the trajectories of a target or object of interest, i.e. accurately locate a moving target in a video sequence and discriminate target from non-targets in the feature space of the sequence. So, feature descriptors can have significant effects on such discrimination. In this paper, we use the basic idea of many trackers which consists of three main components of the reference model, i.e., object modeling, object detection and localization, and model updating. However, there are major improvements in our system. Our forth component, occlusion handling, utilizes the r-spatiogram to detect the best target candidate. While spatiogram contains some moments upon the coordinates of the pixels, r-spatiogram computes region-based compactness on the distribution of the given feature in the image that captures richer features to represent the objects. The proposed research develops an efficient and robust way to keep tracking the object throughout video sequences in the presence of significant appearance variations and severe occlusions. The proposed method is evaluated on the Princeton RGBD tracking dataset considering sequences with different challenges and the obtained results demonstrate the effectiveness of the proposed method.
△ Less
Submitted 17 March, 2020;
originally announced March 2020.
-
Fault-Tolerant Metric Dimension of $P(n,2)$ with Prism Graph
Authors:
Z. Ahmad,
M. O. Ahmad,
A. Q. Baig,
M. Naeem
Abstract:
Let $G$ be a connected graph and $d(a,b)$ be the distance between the vertices $a$ and $b$. A subset $U =\{u_1,u_2,\cdots,u_k\}$ of the vertices is called a resolving set for $G$ if for every two distinct vertices $a,b \in V(G)$, there is a vertex $u_ξ\in U$ such that $d(a,u_ξ)\neq d(b,u_ξ)$. A resolving set containing a minimum number of vertices is called a metric basis for $G$ and the number of…
▽ More
Let $G$ be a connected graph and $d(a,b)$ be the distance between the vertices $a$ and $b$. A subset $U =\{u_1,u_2,\cdots,u_k\}$ of the vertices is called a resolving set for $G$ if for every two distinct vertices $a,b \in V(G)$, there is a vertex $u_ξ\in U$ such that $d(a,u_ξ)\neq d(b,u_ξ)$. A resolving set containing a minimum number of vertices is called a metric basis for $G$ and the number of vertices in a metric basis is its metric dimension denoted by $dim(G)$. A resolving set $U$ for $G$ is fault-tolerant if $U \setminus \{u\}$ is also a resolving set, for each $u \in U$, and the fault-tolerant metric dimension of $G$ is the minimum cardinality of such a set. In this paper we introduce the study of the fault-tolerant metric dimension of $P(n,2)$ with prism graph.
△ Less
Submitted 14 November, 2018;
originally announced November 2018.
-
A Variational Step for Reduction of Mixed Gaussian-Impulse Noise from Images
Authors:
Mohammad Tariqul Islam,
Dipayan Saha,
S. M. Mahbubur Rahman,
M. Omair Ahmad,
M. N. S. Swamy
Abstract:
Reduction of mixed noise is an ill posed problem for the occurrence of contrasting distributions of noise in the image. The mixed noise that is usually encountered is the simultaneous presence of additive white Gaussian noise (AWGN) and impulse noise (IN). A standard approach to denoise an image with such corruption is to apply a rank order filter (ROF) followed by an efficient linear filter to re…
▽ More
Reduction of mixed noise is an ill posed problem for the occurrence of contrasting distributions of noise in the image. The mixed noise that is usually encountered is the simultaneous presence of additive white Gaussian noise (AWGN) and impulse noise (IN). A standard approach to denoise an image with such corruption is to apply a rank order filter (ROF) followed by an efficient linear filter to remove the residual noise. However, ROF cannot completely remove the heavy tail of the noise distribution originating from the IN and thus the denoising performance can be suboptimal. In this paper, we present a variational step to remove the heavy tail of the noise distribution. Through experiments, it is shown that this approach can significantly improve the denoising performance of mixed AWGN-IN using well-established methods.
△ Less
Submitted 1 November, 2018;
originally announced November 2018.
-
Interpretable Fully Convolutional Classification of Intrapapillary Capillary Loops for Real-Time Detection of Early Squamous Neoplasia
Authors:
Luis C. Garcia-Peraza-Herrera,
Martin Everson,
Wenqi Li,
Inmanol Luengo,
Lorenz Berger,
Omer Ahmad,
Laurence Lovat,
Hsiu-Po Wang,
Wen-Lun Wang,
Rehan Haidry,
Danail Stoyanov,
Tom Vercauteren,
Sebastien Ourselin
Abstract:
In this work, we have concentrated our efforts on the interpretability of classification results coming from a fully convolutional neural network. Motivated by the classification of oesophageal tissue for real-time detection of early squamous neoplasia, the most frequent kind of oesophageal cancer in Asia, we present a new dataset and a novel deep learning method that by means of deep supervision…
▽ More
In this work, we have concentrated our efforts on the interpretability of classification results coming from a fully convolutional neural network. Motivated by the classification of oesophageal tissue for real-time detection of early squamous neoplasia, the most frequent kind of oesophageal cancer in Asia, we present a new dataset and a novel deep learning method that by means of deep supervision and a newly introduced concept, the embedded Class Activation Map (eCAM), focuses on the interpretability of results as a design constraint of a convolutional network. We present a new approach to visualise attention that aims to give some insights on those areas of the oesophageal tissue that lead a network to conclude that the images belong to a particular class and compare them with those visual features employed by clinicians to produce a clinical diagnosis. In comparison to a baseline method which does not feature deep supervision but provides attention by grafting Class Activation Maps, we improve the F1-score from 87.3% to 92.7% and provide more detailed attention maps.
△ Less
Submitted 2 May, 2018;
originally announced May 2018.
-
Speech Enhancement in Adverse Environments Based on Non-stationary Noise-driven Spectral Subtraction and SNR-dependent Phase Compensation
Authors:
Md Tauhidul Islam,
Asaduzzaman,
Celia Shahnaz,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
A two-step enhancement method based on spectral subtraction and phase spectrum compensation is presented in this paper for noisy speeches in adverse environments involving non-stationary noise and medium to low levels of SNR. The magnitude of the noisy speech spectrum is modified in the first step of the proposed method by a spectral subtraction approach, where a new noise estimation method based…
▽ More
A two-step enhancement method based on spectral subtraction and phase spectrum compensation is presented in this paper for noisy speeches in adverse environments involving non-stationary noise and medium to low levels of SNR. The magnitude of the noisy speech spectrum is modified in the first step of the proposed method by a spectral subtraction approach, where a new noise estimation method based on the low frequency information of the noisy speech is introduced. We argue that this method of noise estimation is capable of estimating the non-stationary noise accurately. The phase spectrum of the noisy speech is modified in the second step consisting of phase spectrum compensation, where an SNR-dependent approach is incorporated to determine the amount of compensation to be imposed on the phase spectrum. A modified complex spectrum is obtained by aggregating the magnitude from the spectral subtraction step and modified phase spectrum from the phase compensation step, which is found to be a better representation of enhanced speech spectrum. Speech files available in the NOIZEUS database are used to carry extensive simulations for evaluation of the proposed method.
△ Less
Submitted 18 February, 2018;
originally announced March 2018.
-
Enhancement of Noisy Speech Exploiting an Exponential Model Based Threshold and a Custom Thresholding Function in Perceptual Wavelet Packet Domain
Authors:
Md Tauhidul Islam,
Celia Shahnaz,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
For enhancement of noisy speech, a method of threshold determination based on modeling of Teager energy (TE) operated perceptual wavelet packet (PWP) coefficients of the noisy speech by exponential distribution is presented. A custom thresholding function based on the combination of mu-law and semisoft thresholding functions is designed and exploited to apply the statistically derived threshold up…
▽ More
For enhancement of noisy speech, a method of threshold determination based on modeling of Teager energy (TE) operated perceptual wavelet packet (PWP) coefficients of the noisy speech by exponential distribution is presented. A custom thresholding function based on the combination of mu-law and semisoft thresholding functions is designed and exploited to apply the statistically derived threshold upon the PWP coefficients. The effectiveness of the proposed method is evaluated for car and multi-talker babble noise corrupted speech signals through performing extensive simulations using the NOIZEUS database. The proposed method outperforms some of the state-of-the-art speech enhancement methods both at high and low levels of SNRs in terms of the standard objective measures and the subjective evaluations including formal listening tests.
△ Less
Submitted 14 February, 2018;
originally announced February 2018.
-
Enhancement of Noisy Speech with Low Speech Distortion Based on Probabilistic Geometric Spectral Subtraction
Authors:
Md Tauhidul Islam,
Celia Shahnaz,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
A speech enhancement method based on probabilistic geometric approach to spectral subtraction (PGA) performed on short time magnitude spectrum is presented in this paper. A confidence parameter of noise estimation is introduced in the gain function of the proposed method to prevent subtraction of the overestimated and underestimated noise, which not only removes the noise efficiently but also prev…
▽ More
A speech enhancement method based on probabilistic geometric approach to spectral subtraction (PGA) performed on short time magnitude spectrum is presented in this paper. A confidence parameter of noise estimation is introduced in the gain function of the proposed method to prevent subtraction of the overestimated and underestimated noise, which not only removes the noise efficiently but also prevents the speech distortion. The noise compensated magnitude spectrum is then recombined with the unchanged phase spectrum to produce a modified complex spectrum prior to synthesize an enhanced frame. Extensive simulations are carried out using the speech files available in the NOIZEUS database in order to evaluate the performance of the proposed method.
△ Less
Submitted 12 February, 2018;
originally announced February 2018.
-
Modeling of Teager Energy Operated Perceptual Wavelet Packet Coefficients with an Erlang-2 PDF for Real Time Enhancement of Noisy Speech
Authors:
Md Tauhidul Islam,
Celia Shahnaz,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
In this paper, for real time enhancement of noisy speech, a method of threshold determination based on modeling of Teager energy (TE) operated perceptual wavelet packet (PWP) coefficients of the noisy speech and noise by an Erlang-2 PDF is presented. The proposed method is computationally much faster than the existing wavelet packet based thresholding methods. A custom thresholding function based…
▽ More
In this paper, for real time enhancement of noisy speech, a method of threshold determination based on modeling of Teager energy (TE) operated perceptual wavelet packet (PWP) coefficients of the noisy speech and noise by an Erlang-2 PDF is presented. The proposed method is computationally much faster than the existing wavelet packet based thresholding methods. A custom thresholding function based on a combination of mu-law and semisoft thresholding functions is designed and exploited to apply the statistically derived threshold upon the PWP coefficients. The proposed custom thresholding function works as a mu-law or a semisoft thresholding function or their combination based on the probability of speech presence and absence in a subband of the PWP transformed noisy speech. By using the speech files available in NOIZEUS database, a number of simulations are performed to evaluate the performance of the proposed method for speech signals in the presence of Gaussian white and street noises. The proposed method outperforms some of the state-of-the-art speech enhancement methods both at high and low levels of SNRs in terms of standard objective measures and subjective evaluations including formal listening tests.
△ Less
Submitted 9 February, 2018;
originally announced February 2018.
-
A Divide and Conquer Strategy for Musical Noise-free Speech Enhancement in Adverse Environments
Authors:
Md Tauhidul Islam,
Celia Shahnaz,
Wei-Ping Zhu,
M. Omair Ahmad
Abstract:
A divide and conquer strategy for enhancement of noisy speeches in adverse environments involving lower levels of SNR is presented in this paper, where the total system of speech enhancement is divided into two separate steps. The first step is based on noise compensation on short time magnitude and the second step is based on phase compensation. The magnitude spectrum is compensated based on a mo…
▽ More
A divide and conquer strategy for enhancement of noisy speeches in adverse environments involving lower levels of SNR is presented in this paper, where the total system of speech enhancement is divided into two separate steps. The first step is based on noise compensation on short time magnitude and the second step is based on phase compensation. The magnitude spectrum is compensated based on a modified spectral subtraction method where the cross-terms containing spectra of noise and clean speech are taken into consideration, which are neglected in the traditional spectral subtraction methods. By employing the modified magnitude and unchanged phase, a procedure is formulated to compensate the overestimation or underestimation of noise by phase compensation method based on the probability of speech presence. A modified complex spectrum based on these two steps are obtained to synthesize a musical noise free enhanced speech. Extensive simulations are carried out using the speech files available in the NOIZEUS database in order to evaluate the performance of the proposed method. It is shown in terms of the objective measures, spectrogram analysis and formal subjective listening tests that the proposed method consistently outperforms some of the state-of-the-art methods of speech enhancement for noisy speech corrupted by street or babble noise at very low as well as medium levels of SNR.
△ Less
Submitted 7 February, 2018;
originally announced February 2018.
-
Construction of $J^{\text{th}}$-stage Nonuniform Wavelets on Local Fields
Authors:
Owais Ahmad,
F. A. Shah
Abstract:
Shah and Abdullah [Complex Analysis Operator Theory, 9 (2015), 1589-1608] have introduced a generalized notion of nonuniform multiresolution analysis (NUMRA) on local field $K$ of positive characteristic in which the translation set $Λ$ acting on the scaling function to generate the core space $V_{0}$ is no longer a group, but is the union of ${\mathcal Z}$ and a translate of ${\mathcal Z}$, given…
▽ More
Shah and Abdullah [Complex Analysis Operator Theory, 9 (2015), 1589-1608] have introduced a generalized notion of nonuniform multiresolution analysis (NUMRA) on local field $K$ of positive characteristic in which the translation set $Λ$ acting on the scaling function to generate the core space $V_{0}$ is no longer a group, but is the union of ${\mathcal Z}$ and a translate of ${\mathcal Z}$, given by $Λ=\left\{0,u(r)/N \right\}+{\mathcal Z}$, where $N \ge 1$ is an integer and $r$ is an odd integer such that $r$ and $N$ are relatively prime, and ${\mathcal Z}=\{u(n): n\in\mathbb N_{0}\}$ is a complete list of distinct cosets of the unit disc $\mathfrak D$ in $K^+.$ In this paper, we focus on the extension of nonuniform continuous wavelets to the construction of $J^{\text{th}}$-stage nonuniform discrete wavelets on local fields. We establish some general characterizations for the $J^{\text{th}}$-stage nonuniform discrete wavelet systems to be orthornormal bases in $L^2(Λ)$. Moreover, we establish a relation between the continuous wavelets of $L^2(K)$ and their discrete counterparts of $l^2(Λ)$.
△ Less
Submitted 1 January, 2018;
originally announced January 2018.