-
Unstable cosmic-ray nuclei constrain low-diffusion zones in the Galactic disk
Authors:
Hanno Jacobs,
Philipp Mertsch,
Vo Hong Minh Phan
Abstract:
Observations of the vicinity of a variety of galactic gamma-ray sources have indicated a local suppression of diffusivity of cosmic rays by up to three orders of magnitude. However, the impact of these low-diffusion zones on \emph{global} properties of cosmic-ray transport is however only poorly understood. Here, we argue that cosmic-ray nuclear ratios, like the boron-to-carbon ratio and relative…
▽ More
Observations of the vicinity of a variety of galactic gamma-ray sources have indicated a local suppression of diffusivity of cosmic rays by up to three orders of magnitude. However, the impact of these low-diffusion zones on \emph{global} properties of cosmic-ray transport is however only poorly understood. Here, we argue that cosmic-ray nuclear ratios, like the boron-to-carbon ratio and relative abundances of Beryllium isotopes are sensitive to the filling fraction of such low-diffusion zones and hence their measurements can be used to constrain the typical sizes and ages of such regions. We have performed a careful parameter study of a cosmic-ray transport model that allows for different diffusion coefficients $κ_{\mathrm{disk}}$ and $κ_{\mathrm{halo}}$ in the galactic disk and halo, respectively. Making use of preliminary data from the AMS-02 experiment on the ratio of Beryllium isotopes, we find a $3.5 σ$ preference for a suppression of the diffusion coefficient in the disk with a best-fit value of $κ_{\mathrm{disk}}/κ_{\mathrm{halo}} = 0.20^{+0.10}_{-0.06}$. We forecast that with upcoming data from the HELIX balloon experiment, the significance could increase to $6.8 σ$. Adopting a coarse-graining approach, we find that such a strong suppression could be realised if the filling fraction of low-diffusion zones in the disk was $\sim 66 \, \%$. We conclude that the impact of regions of suppressed diffusion might be larger than usually assumed and ought to be taken into account in models of Galactic cosmic ray transport.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Sharpness & Shift-Aware Self-Supervised Learning
Authors:
Ngoc N. Tran,
Son Duong,
Hoang Phan,
Tung Pham,
Dinh Phung,
Trung Le
Abstract:
Self-supervised learning aims to extract meaningful features from unlabeled data for further downstream tasks. In this paper, we consider classification as a downstream task in phase 2 and develop rigorous theories to realize the factors that implicitly influence the general loss of this classification task. Our theories signify that sharpness-aware feature extractors benefit the classification ta…
▽ More
Self-supervised learning aims to extract meaningful features from unlabeled data for further downstream tasks. In this paper, we consider classification as a downstream task in phase 2 and develop rigorous theories to realize the factors that implicitly influence the general loss of this classification task. Our theories signify that sharpness-aware feature extractors benefit the classification task in phase 2 and the existing data shift between the ideal (i.e., the ideal one used in theory development) and practical (i.e., the practical one used in implementation) distributions to generate positive pairs also remarkably affects this classification task. Further harvesting these theoretical findings, we propose to minimize the sharpness of the feature extractor and a new Fourier-based data augmentation technique to relieve the data shift in the distributions generating positive pairs, reaching Sharpness & Shift-Aware Contrastive Learning (SSA-CLR). We conduct extensive experiments to verify our theoretical findings and demonstrate that sharpness & shift-aware contrastive learning can remarkably boost the performance as well as obtaining more robust extracted features compared with the baselines.
△ Less
Submitted 17 May, 2023;
originally announced May 2023.
-
Image Reconstruction using Superpixel Clustering and Tensor Completion
Authors:
Maame G. Asante-Mensah,
Anh Huy Phan,
Salman Ahmadi-Asl,
Zaher Al Aghbari,
Andrzej Cichocki
Abstract:
This paper presents a pixel selection method for compact image representation based on superpixel segmentation and tensor completion. Our method divides the image into several regions that capture important textures or semantics and selects a representative pixel from each region to store. We experiment with different criteria for choosing the representative pixel and find that the centroid pixel…
▽ More
This paper presents a pixel selection method for compact image representation based on superpixel segmentation and tensor completion. Our method divides the image into several regions that capture important textures or semantics and selects a representative pixel from each region to store. We experiment with different criteria for choosing the representative pixel and find that the centroid pixel performs the best. We also propose two smooth tensor completion algorithms that can effectively reconstruct different types of images from the selected pixels. Our experiments show that our superpixel-based method achieves better results than uniform sampling for various missing ratios.
△ Less
Submitted 16 May, 2023;
originally announced May 2023.
-
Stochastic Variance-Reduced Majorization-Minimization Algorithms
Authors:
Duy-Nhat Phan,
Sedi Bartz,
Nilabja Guha,
Hung M. Phan
Abstract:
We study a class of nonconvex nonsmooth optimization problems in which the objective is a sum of two functions: One function is the average of a large number of differentiable functions, while the other function is proper, lower semicontinuous and has a surrogate function that satisfies standard assumptions. Such problems arise in machine learning and regularized empirical risk minimization applic…
▽ More
We study a class of nonconvex nonsmooth optimization problems in which the objective is a sum of two functions: One function is the average of a large number of differentiable functions, while the other function is proper, lower semicontinuous and has a surrogate function that satisfies standard assumptions. Such problems arise in machine learning and regularized empirical risk minimization applications. However, nonconvexity and the large-sum structure are challenging for the design of new algorithms. Consequently, effective algorithms for such scenarios are scarce. We introduce and study three stochastic variance-reduced majorization-minimization (MM) algorithms, combining the general MM principle with new variance-reduced techniques. We provide almost surely subsequential convergence of the generated sequence to a stationary point. We further show that our algorithms possess the best-known complexity bounds in terms of gradient evaluations. We demonstrate the effectiveness of our algorithms on sparse binary classification problems, sparse multi-class logistic regressions, and neural networks by employing several widely-used and publicly available data sets.
△ Less
Submitted 11 May, 2023;
originally announced May 2023.
-
Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation
Authors:
Le Chen,
Quazi Ishtiaque Mahmud,
Hung Phan,
Nesreen K. Ahmed,
Ali Jannesari
Abstract:
Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of machine learning in natural language processing. However, applying machine learning techniques to parallelism detection presents several challeng…
▽ More
Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of machine learning in natural language processing. However, applying machine learning techniques to parallelism detection presents several challenges, such as the lack of an adequate dataset for training, an effective code representation with rich information, and a suitable machine learning model to learn the latent features of code for diverse analyses. To address these challenges, we propose a novel graph-based learning approach called Graph2Par that utilizes a heterogeneous augmented abstract syntax tree (Augmented-AST) representation for code. The proposed approach primarily focused on loop-level parallelization with OpenMP. Moreover, we create an OMP\_Serial dataset with 18598 parallelizable and 13972 non-parallelizable loops to train the machine learning models. Our results show that our proposed approach achieves the accuracy of parallelizable code region detection with 85\% accuracy and outperforms the state-of-the-art token-based machine learning approach. These results indicate that our approach is competitive with state-of-the-art tools and capable of handling loops with complex structures that other tools may overlook.
△ Less
Submitted 9 May, 2023;
originally announced May 2023.
-
Adaptive Cross Tubal Tensor Approximation
Authors:
Salman Ahmadi-Asl,
Anh Huy Phan,
Andrzej Cichocki,
Anastasia Sozykina,
Zaher Al Aghbari,
Jun Wang,
Ivan Oseledets
Abstract:
In this paper, we propose a new adaptive cross algorithm for computing a low tubal rank approximation of third-order tensors, with less memory and lower computational complexity than the truncated tensor SVD (t-SVD). This makes it applicable for decomposing large-scale tensors. We conduct numerical experiments on synthetic and real-world datasets to confirm the efficiency and feasibility of the pr…
▽ More
In this paper, we propose a new adaptive cross algorithm for computing a low tubal rank approximation of third-order tensors, with less memory and lower computational complexity than the truncated tensor SVD (t-SVD). This makes it applicable for decomposing large-scale tensors. We conduct numerical experiments on synthetic and real-world datasets to confirm the efficiency and feasibility of the proposed algorithm. The simulation results show more than one order of magnitude acceleration in the computation of low tubal rank (t-SVD) for large-scale tensors. An application to pedestrian attribute recognition is also presented.
△ Less
Submitted 11 May, 2023; v1 submitted 8 May, 2023;
originally announced May 2023.
-
Development of a Vision System to Enhance the Reliability of the Pick-and-Place Robot for Autonomous Testing of Camera Module used in Smartphones
Authors:
Hoang-Anh Phan,
Duy Nam Bui,
Tuan Nguyen Dinh,
Bao-Anh Hoang,
An Nguyen Ngoc,
Dong Tran Huu Quoc,
Ha Tran Thi Thuy,
Tung Thanh Bui,
Van Nguyen Thi Thanh
Abstract:
Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions…
▽ More
Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions inspection. In this paper, we proposed an effective vision system including hardware and algorithm to enhance the reliability of the pick-and-place robot for autonomous testing memory of camera modules. With limited hardware based on camera and raspberry PI and using simplify image processing algorithm based on histogram information, the vision system can confirm the presence of the camera modules in feeding tray and the placement accuracy of the camera module in test socket. Through that, the system can work with more flexibility and avoid damaging the device under test. The system was experimentally quantified through testing approximately 2000 camera modules in a stable light condition. Experimental results demonstrate that the system achieves accuracy of more than 99.92%. With its simplicity and effectiveness, the proposed vision system can be considered as a useful solution for using in pick-and-place systems in industry.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
A sensor fusion approach for improving implementation speed and accuracy of RTAB-Map algorithm based indoor 3D mapping
Authors:
Hoang-Anh Phan,
Phuc Vinh Nguyen,
Thu Hang Thi Khuat,
Hieu Dang Van,
Dong Huu Quoc Tran,
Bao Lam Dang,
Tung Thanh Bui,
Van Nguyen Thi Thanh,
Trinh Chu Duc
Abstract:
In recent years, 3D mapping for indoor environments has undergone considerable research and improvement because of its effective applications in various fields, including robotics, autonomous navigation, and virtual reality. Building an accurate 3D map for indoor environment is challenging due to the complex nature of the indoor space, the problem of real-time embedding and positioning errors of t…
▽ More
In recent years, 3D mapping for indoor environments has undergone considerable research and improvement because of its effective applications in various fields, including robotics, autonomous navigation, and virtual reality. Building an accurate 3D map for indoor environment is challenging due to the complex nature of the indoor space, the problem of real-time embedding and positioning errors of the robot system. This study proposes a method to improve the accuracy, speed, and quality of 3D indoor mapping by fusing data from the Inertial Measurement System (IMU) of the Intel Realsense D435i camera, the Ultrasonic-based Indoor Positioning System (IPS), and the encoder of the robot's wheel using the extended Kalman filter (EKF) algorithm. The merged data is processed using a Real-time Image Based Mapping algorithm (RTAB-Map), with the processing frequency updated in synch with the position frequency of the IPS device. The results suggest that fusing IMU and IPS data significantly improves the accuracy, mapping time, and quality of 3D maps. Our study highlights the proposed method's potential to improve indoor mapping in various fields, indicating that the fusion of multiple data sources can be a valuable tool in creating high-quality 3D indoor maps.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
An Enhanced Sampling-Based Method With Modified Next-Best View Strategy For 2D Autonomous Robot Exploration
Authors:
Dong Huu Quoc Tran,
Hoang-Anh Phan,
Hieu Dang Van,
Tan Van Duong,
Tung Thanh Bui,
Van Nguyen Thi Thanh
Abstract:
Autonomous exploration is a new technology in the field of robotics that has found widespread application due to its objective to help robots independently localize, scan maps, and navigate any terrain without human control. Up to present, the sampling-based exploration strategies have been the most effective for aerial and ground vehicles equipped with depth sensors producing three-dimensional po…
▽ More
Autonomous exploration is a new technology in the field of robotics that has found widespread application due to its objective to help robots independently localize, scan maps, and navigate any terrain without human control. Up to present, the sampling-based exploration strategies have been the most effective for aerial and ground vehicles equipped with depth sensors producing three-dimensional point clouds. Those methods utilize the sampling task to choose random points or make samples based on Rapidly-exploring Random Trees (RRT). Then, they decide on frontiers or Next Best Views (NBV) with useful volumetric information. However, most state-of-the-art sampling-based methodology is challenging to implement in two-dimensional robots due to the lack of environmental knowledge, thus resulting in a bad volumetric gain for evaluating random destinations. This study proposed an enhanced sampling-based solution for indoor robot exploration to decide Next Best View (NBV) in 2D environments. Our method makes RRT until have the endpoints as frontiers and evaluates those with the enhanced utility function. The volumetric information obtained from environments was estimated using non-uniform distribution to determine cells that are occupied and have an uncertain probability. Compared to the sampling-based Frontier Detection and Receding Horizon NBV approaches, the methodology executed performed better in Gazebo platform-simulated environments, achieving a significantly larger explored area, with the average distance and time traveled being reduced. Moreover, the operated proposed method on an author-built 2D robot exploring the entire natural environment confirms that the method is effective and applicable in real-world scenarios.
△ Less
Submitted 8 May, 2023;
originally announced May 2023.
-
One-loop formulas for off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge and its applications
Authors:
Khiem Hong Phan,
Dzung Tri Tran,
Anh Thu Nguyen
Abstract:
We present analytic results for one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge within Standard Model framework. In numerical results, we show off-shell decay rate and its corrections with varying off-shell Higgs mass. The results show that the corrections are of $10\%$ contributions to total decay rates. Furthermore, we study the impacts of one…
▽ More
We present analytic results for one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge within Standard Model framework. In numerical results, we show off-shell decay rate and its corrections with varying off-shell Higgs mass. The results show that the corrections are of $10\%$ contributions to total decay rates. Furthermore, we study the impacts of one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in Higgs processes at future colliders. The signal processes such as $e^-e^+\rightarrow ZH^*\rightarrow Z(WW)$ with including the initial beam polarizations and $e^-e^+\rightarrow ν_e\barν_e H^* \rightarrow ν_e\barν_e (WW)$ and $e^-γ\rightarrow e^-H^* \rightarrow e^-WW$ are examined. We find that the effects are visible impacts and these should be taken into account at future colliders.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
One-loop expressions for $h\rightarrow l\bar{l}γ$ in Higgs extensions of the Standard Model
Authors:
L. T. Hue,
Dzung Tri Tran,
Thanh Huy Nguyen,
Khiem Hong Phan
Abstract:
A systematic study of one-loop contributions to the decay channels $h\rightarrow l\bar{l}γ$ with $l=ν_{e,μ, τ}, e, μ$, performed in Higgs extended versions of the Standard Model, is presented in the 't Hooft-Veltman gauge. Analytic formulas for one-loop form factors are expressed in terms of the logarithm and di-logarithmic functions. As a result, these form factors can be reduced to those relatin…
▽ More
A systematic study of one-loop contributions to the decay channels $h\rightarrow l\bar{l}γ$ with $l=ν_{e,μ, τ}, e, μ$, performed in Higgs extended versions of the Standard Model, is presented in the 't Hooft-Veltman gauge. Analytic formulas for one-loop form factors are expressed in terms of the logarithm and di-logarithmic functions. As a result, these form factors can be reduced to those relating to the loop-induced decay processes $h\rightarrow γγ, Zγ$, confirming not only previous results using different approaches but also close relations between the three kinds of the loop-induced Higgs decay rates. For phenomenological study, we focus on the two observables, namely the enhancement factors defined as ratios of the decay rates calculated between the Higgs extended versions and the standard model, and the forward-backward asymmetries of fermions, which can be used to search for Higgs extensions of the SM. We show that direct effects of mixing between neutral Higgs bosons and indirect contributions of charged Higg boson exchanges can be probed at future colliders.
△ Less
Submitted 6 May, 2023;
originally announced May 2023.
-
Making Atomic-Level Magnetism Tunable with Light at Room Temperature
Authors:
V. O. Jimenez,
Y. T. H. Pham,
D. Zhou,
M. Z. Liu,
F. A. Nugera,
V. Kalappattil,
T. Eggers,
K. Hoang,
D. L. Duong,
M. Terrones,
H. R. Gutierrez,
M. H. Phan
Abstract:
The capacity to manipulate magnetization in two-dimensional dilute magnetic semiconductors (2D-DMSs) using light, specifically in magnetically doped transition metal dichalcogenide (TMD) monolayers (M-doped TX2, where M = V, Fe, Cr; T = W, Mo; X = S, Se, Te), may lead to innovative applications in spintronics, spin-caloritronics, valleytronics, and quantum computation. This Perspective paper explo…
▽ More
The capacity to manipulate magnetization in two-dimensional dilute magnetic semiconductors (2D-DMSs) using light, specifically in magnetically doped transition metal dichalcogenide (TMD) monolayers (M-doped TX2, where M = V, Fe, Cr; T = W, Mo; X = S, Se, Te), may lead to innovative applications in spintronics, spin-caloritronics, valleytronics, and quantum computation. This Perspective paper explores the mediation of magnetization by light under ambient conditions in 2D-TMD DMSs and heterostructures. By combining magneto-LC resonance (MLCR) experiments with density functional theory (DFT) calculations, we show that the magnetization can be enhanced using light in V-doped TMD monolayers (e.g., V-WS2, V-WSe2, V-MoS2). This phenomenon is attributed to excess holes in the conduction and valence bands, as well as carriers trapped in magnetic doping states, which together mediate the magnetization of the semiconducting layer. In 2D-TMD heterostructures such as VSe2/WS2 and VSe2/MoS2, we demonstrate the significance of proximity, charge-transfer, and confinement effects in amplifying light-mediated magnetism. This effect is attributed to photon absorption at the TMD layer (e.g., WS2, MoS2) that generates electron-hole pairs mediating the magnetization of the heterostructure. These findings will encourage further research in the field of 2D magnetism and establish a novel direction for designing 2D-TMDs and heterostructures with optically tunable magnetic functionalities, paving the way for next-generation magneto-optic nanodevices.
△ Less
Submitted 1 May, 2023;
originally announced May 2023.
-
CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to Imperfect Modalities
Authors:
Konstantinos Kontras,
Christos Chatzichristos,
Huy Phan,
Johan Suykens,
Maarten De Vos
Abstract:
Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be pa…
▽ More
Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be particularly beneficial when the EEG recordings are noisy or even missing completely. In this paper, we propose CoRe-Sleep, a Coordinated Representation multimodal fusion network that is particularly focused on improving the robustness of signal analysis on imperfect data. We demonstrate how appropriately handling multimodal information can be the key to achieving such robustness. CoRe-Sleep tolerates noisy or missing modalities segments, allowing training on incomplete data. Additionally, it shows state-of-the-art performance when testing on both multimodal and unimodal data using a single model on SHHS-1, the largest publicly available study that includes sleep stage labels. The results indicate that training the model on multimodal data does positively influence performance when tested on unimodal data. This work aims at bridging the gap between automated analysis tools and their clinical utility.
△ Less
Submitted 27 March, 2023;
originally announced April 2023.
-
An Inception-Residual-Based Architecture with Multi-Objective Loss for Detecting Respiratory Anomalies
Authors:
Dat Ngo,
Lam Pham,
Huy Phan,
Minh Tran,
Delaram Jarchi,
Sefki Kolozali
Abstract:
This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system in…
▽ More
This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system integrates Inception-residual-based backbone models combined with multi-head attention and multi-objective loss to classify respiratory anomalies. Instead of applying a simple concatenation approach by combining results from various spectrograms, we propose a Linear combination, which has the ability to regulate equally the contribution of each individual spectrogram throughout the training process. To evaluate the performance, we conducted experiments over the benchmark dataset of SPRSound (The Open-Source SJTU Paediatric Respiratory Sound) proposed by the IEEE BioCAS 2022 challenge. As regards the Score computed by an average between the average score and harmonic score, our proposed system gained significant improvements of 9.7%, 15.8%, 17.8%, and 16.1% in Task 1-1, Task 1-2, Task 2-1, and Task 2-2, respectively, compared to the challenge baseline system. Notably, we achieved the Top-1 performance in Task 2-1 and Task 2-2 with the highest Score of 74.5% and 53.9%, respectively.
△ Less
Submitted 19 June, 2023; v1 submitted 7 March, 2023;
originally announced March 2023.
-
Flat Seeking Bayesian Neural Networks
Authors:
Van-Anh Nguyen,
Tung-Long Vuong,
Hoang Phan,
Thanh-Toan Do,
Dinh Phung,
Trung Le
Abstract:
Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with l…
▽ More
Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with lower sharpness have better generalization ability. However, existing posterior inferences are not aware of sharpness/flatness in terms of formulation, possibly leading to high sharpness for the models sampled from them. In this paper, we develop theories, the Bayesian setting, and the variational inference approach for the sharpness-aware posterior. Specifically, the models sampled from our sharpness-aware posterior, and the optimal approximate posterior estimating this sharpness-aware posterior, have better flatness, hence possibly possessing higher generalization ability. We conduct experiments by leveraging the sharpness-aware posterior with state-of-the-art Bayesian Neural Networks, showing that the flat-seeking counterparts outperform their baselines in all metrics of interest.
△ Less
Submitted 6 November, 2023; v1 submitted 6 February, 2023;
originally announced February 2023.
-
cross-modal fusion techniques for utterance-level emotion recognition from text and speech
Authors:
Jiachen Luo,
Huy Phan,
Joshua Reiss
Abstract:
Multimodal emotion recognition (MER) is a fundamental complex research problem due to the uncertainty of human emotional expression and the heterogeneity gap between different modalities. Audio and text modalities are particularly important for a human participant in understanding emotions. Although many successful attempts have been designed multimodal representations for MER, there still exist m…
▽ More
Multimodal emotion recognition (MER) is a fundamental complex research problem due to the uncertainty of human emotional expression and the heterogeneity gap between different modalities. Audio and text modalities are particularly important for a human participant in understanding emotions. Although many successful attempts have been designed multimodal representations for MER, there still exist multiple challenges to be addressed: 1) bridging the heterogeneity gap between multimodal features and model inter- and intra-modal interactions of multiple modalities; 2) effectively and efficiently modelling the contextual dynamics in the conversation sequence. In this paper, we propose Cross-Modal RoBERTa (CM-RoBERTa) model for emotion detection from spoken audio and corresponding transcripts. As the core unit of the CM-RoBERTa, parallel self- and cross- attention is designed to dynamically capture inter- and intra-modal interactions of audio and text. Specially, the mid-level fusion and residual module are employed to model long-term contextual dependencies and learn modality-specific patterns. We evaluate the approach on the MELD dataset and the experimental results show the proposed approach achieves the state-of-art performance on the dataset.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
deep learning of segment-level feature representation for speech emotion recognition in conversations
Authors:
Jiachen Luo,
Huy Phan,
Joshua Reiss
Abstract:
Accurately detecting emotions in conversation is a necessary yet challenging task due to the complexity of emotions and dynamics in dialogues. The emotional state of a speaker can be influenced by many different factors, such as interlocutor stimulus, dialogue scene, and topic. In this work, we propose a conversational speech emotion recognition method to deal with capturing attentive contextual d…
▽ More
Accurately detecting emotions in conversation is a necessary yet challenging task due to the complexity of emotions and dynamics in dialogues. The emotional state of a speaker can be influenced by many different factors, such as interlocutor stimulus, dialogue scene, and topic. In this work, we propose a conversational speech emotion recognition method to deal with capturing attentive contextual dependency and speaker-sensitive interactions. First, we use a pretrained VGGish model to extract segment-based audio representation in individual utterances. Second, an attentive bi-directional gated recurrent unit (GRU) models contextual-sensitive information and explores intra- and inter-speaker dependencies jointly in a dynamic manner. The experiments conducted on the standard conversational dataset MELD demonstrate the effectiveness of the proposed method when compared against state-of the-art methods.
△ Less
Submitted 5 February, 2023;
originally announced February 2023.
-
Rig Inversion by Training a Differentiable Rig Function
Authors:
Mathieu Marquis Bolduc,
Hau Nghiep Phan
Abstract:
Rig inversion is the problem of creating a method that can find the rig parameter vector that best approximates a given input mesh. In this paper we propose to solve this problem by first obtaining a differentiable rig function by training a multi layer perceptron to approximate the rig function. This differentiable rig function can then be used to train a deep learning model of rig inversion.
Rig inversion is the problem of creating a method that can find the rig parameter vector that best approximates a given input mesh. In this paper we propose to solve this problem by first obtaining a differentiable rig function by training a multi layer perceptron to approximate the rig function. This differentiable rig function can then be used to train a deep learning model of rig inversion.
△ Less
Submitted 11 January, 2023;
originally announced January 2023.
-
Internal and external alignment of carbonaceous grains within the radiative torque paradigm
Authors:
Thiem Hoang,
Vo Hong Minh Phan,
Le Ngoc Tram
Abstract:
We study the internal and external alignment of carbonaceous grains, including graphite and hydrogenated amorphous carbon (HAC), in the interstellar medium (ISM) within the RAdiative Torque (RAT) paradigm. For internal alignment (IA), we find that HAC grains having nuclear paramagnetism due to hydrogen protons can have efficient nuclear relaxation, whereas both HAC and graphite grains can have eff…
▽ More
We study the internal and external alignment of carbonaceous grains, including graphite and hydrogenated amorphous carbon (HAC), in the interstellar medium (ISM) within the RAdiative Torque (RAT) paradigm. For internal alignment (IA), we find that HAC grains having nuclear paramagnetism due to hydrogen protons can have efficient nuclear relaxation, whereas both HAC and graphite grains can have efficient inelastic relaxation for grains aligned both at low$-J$ and high$-J$ attractors. For external alignment, HAC and graphite grains can align with the radiation direction ($k$-RAT) at low$-J$ attractors but cannot have stable alignment at high$-J$ attractors due to the suppression of radiative precession. HAC also has slow Larmor precession compared to the randomization by gas collisions and cannot be aligned with the magnetic field ($B$-RAT). Small HAC grains of $a<0.05μ$m drifting through the diffuse ISM can be weakly aligned along the induced electric field ($E$-RAT) at high$-J$ attractors due to its fast precession. Paramagnetic relaxation by nuclear magnetism is found inefficient for HAC grains due to the rapid suppression of nuclear susceptibility when grains rotate at high$-J$ attractors. We then study the alignment of carbon dust in the envelope of a typical C-rich Asymptotic Giant Branch star, IRC+10216. We find that grains aligned at low$-J$ attractors can occur via $k$-RAT with the wrong IA in the inner region but via $B$-RAT in the outermost region. However, grains aligned at high$-J$ attractors have the right IA alignment via $k$-RAT due to efficient inelastic relaxation. The polarization pattern observed toward IRC+10216 by SOFIA/HAWC+ can reproduced when only grains at low$-J$ attractors are present due to removal of grains at high$-J$ attractors by the RAT disruption.
△ Less
Submitted 18 January, 2023;
originally announced January 2023.
-
L-SeqSleepNet: Whole-cycle Long Sequence Modelling for Automatic Sleep Staging
Authors:
Huy Phan,
Kristian P. Lorenzen,
Elisabeth Heremans,
Oliver Y. Chén,
Minh C. Tran,
Philipp Koch,
Alfred Mertins,
Mathias Baumert,
Kaare Mikkelsen,
Maarten De Vos
Abstract:
Human sleep is cyclical with a period of approximately 90 minutes, implying long temporal dependency in the sleep data. Yet, exploring this long-term dependency when developing sleep staging models has remained untouched. In this work, we show that while encoding the logic of a whole sleep cycle is crucial to improve sleep staging performance, the sequential modelling approach in existing state-of…
▽ More
Human sleep is cyclical with a period of approximately 90 minutes, implying long temporal dependency in the sleep data. Yet, exploring this long-term dependency when developing sleep staging models has remained untouched. In this work, we show that while encoding the logic of a whole sleep cycle is crucial to improve sleep staging performance, the sequential modelling approach in existing state-of-the-art deep learning models are inefficient for that purpose. We thus introduce a method for efficient long sequence modelling and propose a new deep learning model, L-SeqSleepNet, which takes into account whole-cycle sleep information for sleep staging. Evaluating L-SeqSleepNet on four distinct databases of various sizes, we demonstrate state-of-the-art performance obtained by the model over three different EEG setups, including scalp EEG in conventional Polysomnography (PSG), in-ear EEG, and around-the-ear EEG (cEEGrid), even with a single EEG channel input. Our analyses also show that L-SeqSleepNet is able to alleviate the predominance of N2 sleep (the major class in terms of classification) to bring down errors in other sleep stages. Moreover the network becomes much more robust, meaning that for all subjects where the baseline method had exceptionally poor performance, their performance are improved significantly. Finally, the computation time only grows at a sub-linear rate when the sequence length increases.
△ Less
Submitted 4 August, 2023; v1 submitted 9 January, 2023;
originally announced January 2023.
-
Learning from Taxonomy: Multi-label Few-Shot Classification for Everyday Sound Recognition
Authors:
Jinhua Liang,
Huy Phan,
Emmanouil Benetos
Abstract:
Everyday sound recognition aims to infer types of sound events in audio streams. While many works succeeded in training models with high performance in a fully-supervised manner, they are still restricted to the demand of large quantities of labelled data and the range of predefined classes. To overcome these drawbacks, this work firstly curates a new database named FSD-FS for multi-label few-shot…
▽ More
Everyday sound recognition aims to infer types of sound events in audio streams. While many works succeeded in training models with high performance in a fully-supervised manner, they are still restricted to the demand of large quantities of labelled data and the range of predefined classes. To overcome these drawbacks, this work firstly curates a new database named FSD-FS for multi-label few-shot audio classification. It then explores how to incorporate audio taxonomy in few-shot learning. Specifically, this work proposes label-dependent prototypical networks (LaD-protonet) to exploit parent-children relationships between labels. Plus, it applies taxonomy-aware label smoothing techniques to boost model performance. Experiments demonstrate that LaD-protonet outperforms original prototypical networks as well as other state-of-the-art methods. Moreover, its performance can be further boosted when combined with taxonomy-aware label smoothing.
△ Less
Submitted 17 December, 2022;
originally announced December 2022.
-
Improving trajectory localization accuracy via direction-of-arrival derivative estimation
Authors:
Ruchi Pandey,
Shreyas Jaiswal,
Huy Phan,
Santosh Nannuru
Abstract:
Sound source localization is crucial in acoustic sensing and monitoring-related applications. In this paper, we do a comprehensive analysis of improvement in sound source localization by combining the direction of arrivals (DOAs) with their derivatives which quantify the changes in the positions of sources over time. This study uses the SALSA-Lite feature with a convolutional recurrent neural netw…
▽ More
Sound source localization is crucial in acoustic sensing and monitoring-related applications. In this paper, we do a comprehensive analysis of improvement in sound source localization by combining the direction of arrivals (DOAs) with their derivatives which quantify the changes in the positions of sources over time. This study uses the SALSA-Lite feature with a convolutional recurrent neural network (CRNN) model for predicting DOAs and their first-order derivatives. An update rule is introduced to combine the predicted DOAs with the estimated derivatives to obtain the final DOAs. The experimental validation is done using TAU-NIGENS Spatial Sound Events (TNSSE) 2021 dataset. We compare the performance of the networks predicting DOAs with derivative vs. the one predicting only the DOAs at low SNR levels. The results show that combining the derivatives with the DOAs improves the localization accuracy of moving sources.
△ Less
Submitted 10 December, 2022; v1 submitted 7 December, 2022;
originally announced December 2022.
-
CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness
Authors:
Huy Phan,
Miao Yin,
Yang Sui,
Bo Yuan,
Saman Zonouz
Abstract:
Model compression and model defense for deep neural networks (DNNs) have been extensively and individually studied. Considering the co-importance of model compactness and robustness in practical applications, several prior works have explored to improve the adversarial robustness of the sparse neural networks. However, the structured sparse models obtained by the exiting works suffer severe perfor…
▽ More
Model compression and model defense for deep neural networks (DNNs) have been extensively and individually studied. Considering the co-importance of model compactness and robustness in practical applications, several prior works have explored to improve the adversarial robustness of the sparse neural networks. However, the structured sparse models obtained by the exiting works suffer severe performance degradation for both benign and robust accuracy, thereby causing a challenging dilemma between robustness and structuredness of the compact DNNs. To address this problem, in this paper, we propose CSTAR, an efficient solution that can simultaneously impose the low-rankness-based Compactness, high STructuredness and high Adversarial Robustness on the target DNN models. By formulating the low-rankness and robustness requirement within the same framework and globally determining the ranks, the compressed DNNs can simultaneously achieve high compression performance and strong adversarial robustness. Evaluations for various DNN models on different datasets demonstrate the effectiveness of CSTAR. Compared with the state-of-the-art robust structured pruning methods, CSTAR shows consistently better performance. For instance, when compressing ResNet-18 on CIFAR-10, CSTAR can achieve up to 20.07% and 11.91% improvement for benign accuracy and robust accuracy, respectively. For compressing ResNet-18 with 16x compression ratio on Imagenet, CSTAR can obtain 8.58% benign accuracy gain and 4.27% robust accuracy gain compared to the existing robust structured pruning method.
△ Less
Submitted 17 February, 2023; v1 submitted 4 December, 2022;
originally announced December 2022.
-
Continual Learning with Optimal Transport based Mixture Model
Authors:
Quyen Tran,
Hoang Phan,
Khoat Than,
Dinh Phung,
Trung Le
Abstract:
Online Class Incremental learning (CIL) is a challenging setting in Continual Learning (CL), wherein data of new tasks arrive in incoming streams and online learning models need to handle incoming data streams without revisiting previous ones. Existing works used a single centroid adapted with incoming data streams to characterize a class. This approach possibly exposes limitations when the incomi…
▽ More
Online Class Incremental learning (CIL) is a challenging setting in Continual Learning (CL), wherein data of new tasks arrive in incoming streams and online learning models need to handle incoming data streams without revisiting previous ones. Existing works used a single centroid adapted with incoming data streams to characterize a class. This approach possibly exposes limitations when the incoming data stream of a class is naturally multimodal. To address this issue, in this work, we first propose an online mixture model learning approach based on nice properties of the mature optimal transport theory (OT-MM). Specifically, the centroids and covariance matrices of the mixture model are adapted incrementally according to incoming data streams. The advantages are two-fold: (i) we can characterize more accurately complex data streams and (ii) by using centroids for each class produced by OT-MM, we can estimate the similarity of an unseen example to each class more reasonably when doing inference. Moreover, to combat the catastrophic forgetting in the CIL scenario, we further propose Dynamic Preservation. Particularly, after performing the dynamic preservation technique across data streams, the latent representations of the classes in the old and new tasks become more condensed themselves and more separate from each other. Together with a contraction feature extractor, this technique facilitates the model in mitigating the catastrophic forgetting. The experimental results on real-world datasets show that our proposed method can significantly outperform the current state-of-the-art baselines.
△ Less
Submitted 5 December, 2022; v1 submitted 30 November, 2022;
originally announced November 2022.
-
One-loop formulas for $H\rightarrow Z ν_l\barν_l$ for $l = e,μ, τ$ in 't Hooft-Veltman gauge
Authors:
Dzung Tri Tran,
Khiem Hong Phan
Abstract:
In this paper, we present analytical results for one-loop contributing to the decay processes $H\rightarrow Z ν_l\barν_l$ (for $l = e, μ, τ$). The calculations are performed within the Standard Model framework in 't Hooft-Veltman gauge. One-loop form factors are then written in terms of scalar one-loop functions in the standard notations of {\tt LoopTools}. As a result, one-loop decay rates for th…
▽ More
In this paper, we present analytical results for one-loop contributing to the decay processes $H\rightarrow Z ν_l\barν_l$ (for $l = e, μ, τ$). The calculations are performed within the Standard Model framework in 't Hooft-Veltman gauge. One-loop form factors are then written in terms of scalar one-loop functions in the standard notations of {\tt LoopTools}. As a result, one-loop decay rates for the decay channels can be evaluated numerically by using the package. Furthermore, we analyse the signals of $H\rightarrow Z ν_l\barν_l$ via the production processes $e^-e^+ \rightarrow ZH^* \rightarrow Z (H^* \rightarrow Z ν_l\barν_l)$ including the initial beam polarizations at future lepton collider. The Standard Model background such as the processes $e^-e^+ \rightarrow ν_l\barν_l ZZ$ are also examined in this study. In numerical results, we find that one-loop corrections are about $10\%$ contributions to the decay rates. They are sizeable contributions and should be taken into account at future colliders. We show that the signals $H\rightarrow Zν_l\barν_l$ are clearly visible at center-of-mass energy $\sqrt{s}=250$ GeV and it is hard to probe at higher-energy regions due to the dominant of the background.
△ Less
Submitted 6 March, 2023; v1 submitted 28 November, 2022;
originally announced November 2022.
-
Improving Multi-task Learning via Seeking Task-based Flat Regions
Authors:
Hoang Phan,
Lam Tran,
Quyen Tran,
Ngoc N. Tran,
Tuan Truong,
Qi Lei,
Nhat Ho,
Dinh Phung,
Trung Le
Abstract:
Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a…
▽ More
Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a variety of applications, ranging from computer vision to natural language processing and speech recognition. Among them, there is an emerging line of work in MTL that focuses on manipulating the task gradient to derive an ultimate gradient descent direction to benefit all tasks. Despite achieving impressive results on many benchmarks, directly applying these approaches without using appropriate regularization techniques might lead to suboptimal solutions on real-world problems. In particular, standard training that minimizes the empirical loss on the training data can easily suffer from overfitting to low-resource tasks or be spoiled by noisy-labeled ones, which can cause negative transfer between tasks and overall performance drop. To alleviate such problems, we propose to leverage a recently introduced training method, named Sharpness-aware Minimization, which can enhance model generalization ability on single-task learning. Accordingly, we present a novel MTL training methodology, encouraging the model to find task-based flat minima for coherently improving its generalization capability on all tasks. Finally, we conduct comprehensive experiments on a variety of applications to demonstrate the merit of our proposed approach to existing gradient-based MTL methods, as suggested by our developed theory.
△ Less
Submitted 23 May, 2025; v1 submitted 24 November, 2022;
originally announced November 2022.
-
A Comparative Study of Question Answering over Knowledge Bases
Authors:
Khiem Vinh Tran,
Hao Phu Phan,
Khang Nguyen Duc Quach,
Ngan Luu-Thuy Nguyen,
Jun Jo,
Thanh Tam Nguyen
Abstract:
Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, propert…
▽ More
Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, properties, languages, and domains to provide insights on where existing systems struggle. On top of that, we propose an advanced mapping algorithm to aid existing models in achieving superior results. Moreover, we also develop a multilingual corpus COVID-KGQA, which encourages COVID-19 research and multilingualism for the diversity of future AI. Finally, we discuss the key findings and their implications as well as performance guidelines and some future improvements. Our source code is available at \url{https://github.com/tamlhp/kbqa}.
△ Less
Submitted 15 November, 2022;
originally announced November 2022.
-
Field line subdiffusion and cosmic ray perpendicular transport in isotropic turbulence
Authors:
Marco Kuhlen,
Vo Hong Minh Phan,
Philipp Mertsch
Abstract:
The transport of high-energy particles in the presence of small-scale, turbulent magnetic fields is a long-standing issue in astrophysics. Analytical theories disagree with numerical simulations at rigidities where the particles' gyroradii are slightly smaller than the correlation length of turbulence. At the same time, extending the numerical simulations to lower rigidities has proven computation…
▽ More
The transport of high-energy particles in the presence of small-scale, turbulent magnetic fields is a long-standing issue in astrophysics. Analytical theories disagree with numerical simulations at rigidities where the particles' gyroradii are slightly smaller than the correlation length of turbulence. At the same time, extending the numerical simulations to lower rigidities has proven computationally prohibitive. In this letter, we provide a solution to the problem of perpendicular transport in isotropic turbulence at both, high and low rigidities. We also clarify the relation between the perpendicular diffusion of particles and the transport of magnetic field lines. To this end, we have run a large suite of test particle simulations at unprecedentedly low rigidites, making extensive use of graphical processing units (GPUs). We have also developed an analytical model, based on (1) initial particle transport along field lines, (2) the transport of field lines and (3) the eventual decorrelation of particles from field lines. Our numerical results exhibit a non-standard rigidity-dependence for the perpendicular diffusion coefficient at intermediate rigidites. At the lowest rigidities, the standard rigidity-dependence is recovered. The simulated diffusion coefficients are nicely reproduced by our analytical model. We have traced the non-standard rigidity-dependence to a subdiffusive phase in the field line transport. Our study has important implications for the transport of Galactic cosmic rays, acceleration at perpendicular shocks and for high-energy particles in the heliosphere.
△ Less
Submitted 30 October, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Diffusion of relativistic charged particles and field lines in isotropic turbulence
Authors:
Marco Kuhlen,
Vo Hong Minh Phan,
Philipp Mertsch
Abstract:
The transport of non-thermal particles across a large-scale magnetic field in the presence of magnetised turbulence has been a long-standing issue in high-energy astrophysics. Of particular interest is the dependence of the parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$ on rigidity $\mathcal{R}$. We have revisited this important issue with a view to applications from th…
▽ More
The transport of non-thermal particles across a large-scale magnetic field in the presence of magnetised turbulence has been a long-standing issue in high-energy astrophysics. Of particular interest is the dependence of the parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$ on rigidity $\mathcal{R}$. We have revisited this important issue with a view to applications from the transport of Galactic cosmic rays to acceleration at astrophysical shocks. We have run test particle simulations of cosmic ray transport in synthetic, isotropic Kolmogorov turbulence at unprecedentedly low reduced rigidites $r_g/L_c \simeq 10^{-4}$, corresponding to $\mathcal{R} \simeq 10 \, \text{TV}$ for a turbulent magnetic field of $B_{rms} = 4 \, μ\text{G}$ and correlation length $L_c = 30 \, \text{pc}$. Extracting the (asymptotic) parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$, we have found $λ_{\parallel} \propto (r_g/L_c)^{1/3}$ as expected for a Kolmogorov turbulence spectrum. In contrast, $λ_{\perp}$ has a faster dependence on $r_g/L_c$ for $10^{-2} \lesssim r_g/L_c \lesssim 1$, but for $r_g/L_c \ll 10^{-2}$, also $λ_{\perp} \propto (r_g/L_c)^{1/3}$. Our results have important implications for the transport of Galactic cosmic rays.
△ Less
Submitted 24 November, 2023; v1 submitted 10 November, 2022;
originally announced November 2022.
-
Modelling black-box audio effects with time-varying feature modulation
Authors:
Marco Comunità,
Christian J. Steinmetz,
Huy Phan,
Joshua D. Reiss
Abstract:
Deep learning approaches for black-box modelling of audio effects have shown promise, however, the majority of existing work focuses on nonlinear effects with behaviour on relatively short time-scales, such as guitar amplifiers and distortion. While recurrent and convolutional architectures can theoretically be extended to capture behaviour at longer time scales, we show that simply scaling the wi…
▽ More
Deep learning approaches for black-box modelling of audio effects have shown promise, however, the majority of existing work focuses on nonlinear effects with behaviour on relatively short time-scales, such as guitar amplifiers and distortion. While recurrent and convolutional architectures can theoretically be extended to capture behaviour at longer time scales, we show that simply scaling the width, depth, or dilation factor of existing architectures does not result in satisfactory performance when modelling audio effects such as fuzz and dynamic range compression. To address this, we propose the integration of time-varying feature-wise linear modulation into existing temporal convolutional backbones, an approach that enables learnable adaptation of the intermediate activations. We demonstrate that our approach more accurately captures long-range dependencies for a range of fuzz and compressor implementations across both time and frequency domain metrics. We provide sound examples, source code, and pretrained models to faciliate reproducibility.
△ Less
Submitted 9 May, 2023; v1 submitted 1 November, 2022;
originally announced November 2022.
-
One-loop calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model
Authors:
Khiem Hong Phan,
Anh Thu Nguyen,
Dzung Tri Tran
Abstract:
In this paper, we present the calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model. Analytic results for one-loop form factors in the decay process are expressed in terms of the scalar one-loop Passarino$-$Veltman functions in the conventions of {\tt LoopTools}. Therefore, the decay rates can be evaluated numerically by using this package. In phenomenological…
▽ More
In this paper, we present the calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model. Analytic results for one-loop form factors in the decay process are expressed in terms of the scalar one-loop Passarino$-$Veltman functions in the conventions of {\tt LoopTools}. Therefore, the decay rates can be evaluated numerically by using this package. In phenomenological results, we show the differential decay rates with respect to invariant mass of fermion pair $m_{ff}$, new neutral gauge mass $M_{Z'}$ and the coupling $g'$ of $U(1)_{B-L}$ gauge group. We find that the contributions of the $U(1)_{B-L}$ extension for Standard Model are visible effects and they must be taken into account at future colliders.
△ Less
Submitted 27 September, 2022;
originally announced September 2022.
-
One-loop on-shell and off-shell decay $H^*\rightarrow VV$ at future $e^-e^-$ collider
Authors:
Anh Thu Nguyen,
Dzung Tri Tran,
Khiem Hong Phan
Abstract:
One-loop on-shell and off-shell decays $H\rightarrow VV$ with $VV=γγ, Zγ, ZZ$ are presented in this paper. The effects of one-loop on-shell and off-shell Higgs decays in Higgs productions at $e^-e^-$ collisions are also then examined. We find that the impacts of one-loop Higgs decays are significant and they are must be taken into account at $e^-e^-$ collision.
One-loop on-shell and off-shell decays $H\rightarrow VV$ with $VV=γγ, Zγ, ZZ$ are presented in this paper. The effects of one-loop on-shell and off-shell Higgs decays in Higgs productions at $e^-e^-$ collisions are also then examined. We find that the impacts of one-loop Higgs decays are significant and they are must be taken into account at $e^-e^-$ collision.
△ Less
Submitted 27 September, 2022; v1 submitted 27 September, 2022;
originally announced September 2022.
-
One-loop off-shell decay $H^* \rightarrow ZZ$ at future colliders
Authors:
Khiem Hong Phan,
Dzung Tri Tran,
Anh Thu Nguyen
Abstract:
We present one-loop formulas for contributing to the $HZZ$ vertex in 't Hooft-Veltman gauge within Standard Model framework. One-loop off-shell Higgs decay rates to $Z$-pair are investigated in both unpolarized and longitudinal polarization for $Z$ bosons in final state. The corrections are range of $7\%$ to $8.4\%$ when we vary the off-shell Higgs mass from $200$ GeV to $500$ GeV. In applications…
▽ More
We present one-loop formulas for contributing to the $HZZ$ vertex in 't Hooft-Veltman gauge within Standard Model framework. One-loop off-shell Higgs decay rates to $Z$-pair are investigated in both unpolarized and longitudinal polarization for $Z$ bosons in final state. The corrections are range of $7\%$ to $8.4\%$ when we vary the off-shell Higgs mass from $200$ GeV to $500$ GeV. In applications, we study off-shell Higgs decay $H^* \rightarrow ZZ$ in the Higgs productions at future colliders such as the signal processes $γ^*(Q^2)γ\rightarrow H^* \rightarrow ZZ$ and $e^-γ\rightarrow e^-H^* \rightarrow e^- ZZ$ are analyzed.
△ Less
Submitted 6 May, 2023; v1 submitted 26 September, 2022;
originally announced September 2022.
-
Stochasticity of Cosmic Rays from Supernova Remnants and the Ionization Rates in Molecular Clouds
Authors:
Vo Hong Minh Phan,
Sarah Recchia,
Philipp Mertsch,
Stefano Gabici
Abstract:
Cosmic rays are the only agent able to penetrate into the interior of dense molecular clouds. Depositing (part of) their energy through ionisation, cosmic rays play an essential role in determining the physical and chemical evolution of star-forming regions. To a first approximation their effect can be quantified by the cosmic-ray induced ionization rate. Interestingly, theoretical estimates of th…
▽ More
Cosmic rays are the only agent able to penetrate into the interior of dense molecular clouds. Depositing (part of) their energy through ionisation, cosmic rays play an essential role in determining the physical and chemical evolution of star-forming regions. To a first approximation their effect can be quantified by the cosmic-ray induced ionization rate. Interestingly, theoretical estimates of the ionization rate assuming the cosmic-ray spectra observed in the local interstellar medium result in an ionization rate that is one to two orders of magnitude below the values inferred from observations. However, due to the discrete nature of sources, the local spectra of MeV cosmic rays are in general not representative for the spectra elsewhere in the Galaxy. Such stochasticity effects have the potential of reconciling modelled ionization rates with measured ones. Here, we model the distribution of low-energy cosmic-ray spectra expected from a statistical population of supernova remnants in the Milky Way. The corresponding distribution for the ionization rate is derived and confronted with data. We find that the stochastic uncertainty helps with explaining the surprisingly high ionization rates observed in many molecular clouds.
△ Less
Submitted 25 May, 2023; v1 submitted 21 September, 2022;
originally announced September 2022.
-
Personalized Longitudinal Assessment of Multiple Sclerosis Using Smartphones
Authors:
Oliver Y. Chén,
Florian Lipsmeier,
Huy Phan,
Frank Dondelinger,
Andrew Creagh,
Christian Gossens,
Michael Lindemann,
Maarten de Vos
Abstract:
Personalized longitudinal disease assessment is central to quickly diagnosing, appropriately managing, and optimally adapting the therapeutic strategy of multiple sclerosis (MS). It is also important for identifying the idiosyncratic subject-specific disease profiles. Here, we design a novel longitudinal model to map individual disease trajectories in an automated way using sensor data that may co…
▽ More
Personalized longitudinal disease assessment is central to quickly diagnosing, appropriately managing, and optimally adapting the therapeutic strategy of multiple sclerosis (MS). It is also important for identifying the idiosyncratic subject-specific disease profiles. Here, we design a novel longitudinal model to map individual disease trajectories in an automated way using sensor data that may contain missing values. First, we collect digital measurements related to gait and balance, and upper extremity functions using sensor-based assessments administered on a smartphone. Next, we treat missing data via imputation. We then discover potential markers of MS by employing a generalized estimation equation. Subsequently, parameters learned from multiple training datasets are ensembled to form a simple, unified longitudinal predictive model to forecast MS over time in previously unseen people with MS. To mitigate potential underestimation for individuals with severe disease scores, the final model incorporates additional subject-specific fine-tuning using data from the first day. The results show that the proposed model is promising to achieve personalized longitudinal MS assessment; they also suggest that features related to gait and balance as well as upper extremity function, remotely collected from sensor-based assessments, may be useful digital markers for predicting MS over time.
△ Less
Submitted 20 September, 2022;
originally announced September 2022.
-
RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN
Authors:
Huy Phan,
Cong Shi,
Yi Xie,
Tianfang Zhang,
Zhuohang Li,
Tianming Zhao,
Jian Liu,
Yan Wang,
Yingying Chen,
Bo Yuan
Abstract:
Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models. To date, most of the existing studies focus on backdoor attack against the uncompressed model; while the vulnerability of compressed DNNs, which are widely used in the practical applications, is little exploited yet. In this paper, we propose to study and develop Robust and Imperceptible Bac…
▽ More
Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models. To date, most of the existing studies focus on backdoor attack against the uncompressed model; while the vulnerability of compressed DNNs, which are widely used in the practical applications, is little exploited yet. In this paper, we propose to study and develop Robust and Imperceptible Backdoor Attack against Compact DNN models (RIBAC). By performing systematic analysis and exploration on the important design knobs, we propose a framework that can learn the proper trigger patterns, model parameters and pruning masks in an efficient way. Thereby achieving high trigger stealthiness, high attack success rate and high model efficiency simultaneously. Extensive evaluations across different datasets, including the test against the state-of-the-art defense mechanisms, demonstrate the high robustness, stealthiness and model efficiency of RIBAC. Code is available at https://github.com/huyvnphan/ECCV2022-RIBAC
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
Heterogeneous Graph Neural Networks for Software Effort Estimation
Authors:
Hung Phan,
Ali Jannesari
Abstract:
Software effort can be measured by story point [35]. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem which required expensive embedding models. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We select GPT2SP [12] and D…
▽ More
Software effort can be measured by story point [35]. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem which required expensive embedding models. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We select GPT2SP [12] and Deep-SE [8] as the baselines for comparison. First, from the analysis of the story point dataset [8], we conclude that software issues are actually a mixture of natural language sentences with quoted code snippets and have problems related to large-size vocabulary. Second, we provide a module to normalize the input text including words and code tokens of the software issues. Third, we design an algorithm to convert an input software issue to a graph with different types of nodes and edges. Fourth, we construct a heterogeneous graph neural networks model with the support of fastText [6] for constructing initial node embedding to learn and predict the story points of new issues. We did the comparison over three scenarios of estimation, including within project, cross-project within the repository, and cross-project cross repository with our baseline approaches. We achieve the average Mean Absolute Error (MAE) as 2.38, 2.61, and 2.63 for three scenarios. We outperform GPT2SP in 2/3 of the scenarios while outperforming Deep-SE in the most challenging scenario with significantly less amount of running time. We also compare our approaches with different homogeneous graph neural network models and the results show that the heterogeneous graph neural networks model outperforms the homogeneous models in story point estimation. For time performance, we achieve about 570 seconds as the time performance in both three processes: node embedding initialization, model construction, and story point estimation.
△ Less
Submitted 30 June, 2022; v1 submitted 22 June, 2022;
originally announced June 2022.
-
Nanoscale mineralogy and organic structure in Orgueil (CI) and EET 92042 (CR) carbonaceous chondrites studied with AFM-IR spectroscopy
Authors:
Van T. H. Phan,
Rolando Rebois,
Pierre Beck,
Eric Quirico,
Lydie Bonal,
Takaaki Noguchi
Abstract:
Meteorite matrices from primitive chondrites are an interplay of ingredients at the sub-micron scale, which requires analytical techniques with the nanometer spatial resolution to decipher the composition of individual components in their petrographic context. Infrared spectroscopy is an effective method that enables to probe of vibrations at the molecule-atomic scale of organic and inorganic comp…
▽ More
Meteorite matrices from primitive chondrites are an interplay of ingredients at the sub-micron scale, which requires analytical techniques with the nanometer spatial resolution to decipher the composition of individual components in their petrographic context. Infrared spectroscopy is an effective method that enables to probe of vibrations at the molecule-atomic scale of organic and inorganic compounds but is often limited to a few micrometers in spatial resolution. To efficiently distinguish spectral signatures of the different constituents, we apply here nano-IR spectroscopy (AFM-IR), based on the combination of infrared and atomic force microscopy, having a spatial resolution beyond the diffraction limits. Our study aims to characterize two chosen meteorite samples to investigate primitive material in terms of bulk chemistry (the CI chondrite Orgueil) and organic composition (the CR chondrite EET 92042). We confirm that this technique allows unmixing the IR signatures of organics and minerals to assess the variability of organic structure within these samples. We report an investigation of the impact of the widely used chemical HF/HCl (Hydrogen Fluoride/Hydrochloric) extraction on the nature of refractory organics (Insoluble Organic Matter, IOM) and provide insights on the mineralogy of meteorites matrices from these two samples by comparing to reference (extra)terrestrial materials. These findings are discussed with a perspective toward understanding the impact of post-accretional aqueous alteration and thermal metamorphism on the composition of chondrites. Last, we highlight that the heterogeneity of organic matter within meteoritic materials extends down to the nanoscale, and by comparison with IOMs, oxygenated chemical groups are not affected by acid extractions.
△ Less
Submitted 20 June, 2022;
originally announced June 2022.
-
Stochastic Multiple Target Sampling Gradient Descent
Authors:
Hoang Phan,
Ngoc Tran,
Trung Le,
Toan Tran,
Nhat Ho,
Dinh Phung
Abstract:
Sampling from an unnormalized target distribution is an essential problem with many applications in probabilistic inference. Stein Variational Gradient Descent (SVGD) has been shown to be a powerful method that iteratively updates a set of particles to approximate the distribution of interest. Furthermore, when analysing its asymptotic properties, SVGD reduces exactly to a single-objective optimiz…
▽ More
Sampling from an unnormalized target distribution is an essential problem with many applications in probabilistic inference. Stein Variational Gradient Descent (SVGD) has been shown to be a powerful method that iteratively updates a set of particles to approximate the distribution of interest. Furthermore, when analysing its asymptotic properties, SVGD reduces exactly to a single-objective optimization problem and can be viewed as a probabilistic version of this single-objective optimization problem. A natural question then arises: "Can we derive a probabilistic version of the multi-objective optimization?". To answer this question, we propose Stochastic Multiple Target Sampling Gradient Descent (MT-SGD), enabling us to sample from multiple unnormalized target distributions. Specifically, our MT-SGD conducts a flow of intermediate distributions gradually orienting to multiple target distributions, which allows the sampled particles to move to the joint high-likelihood region of the target distributions. Interestingly, the asymptotic analysis shows that our approach reduces exactly to the multiple-gradient descent algorithm for multi-objective optimization, as expected. Finally, we conduct comprehensive experiments to demonstrate the merit of our approach to multi-task learning.
△ Less
Submitted 10 February, 2023; v1 submitted 4 June, 2022;
originally announced June 2022.
-
DPER: Dynamic Programming for Exist-Random Stochastic SAT
Authors:
Vu H. N. Phan,
Moshe Y. Vardi
Abstract:
In Bayesian inference, the maximum a posteriori (MAP) problem combines the most probable explanation (MPE) and marginalization (MAR) problems. The counterpart in propositional logic is the exist-random stochastic satisfiability (ER-SSAT) problem, which combines the satisfiability (SAT) and weighted model counting (WMC) problems. Both MAP and ER-SSAT have the form…
▽ More
In Bayesian inference, the maximum a posteriori (MAP) problem combines the most probable explanation (MPE) and marginalization (MAR) problems. The counterpart in propositional logic is the exist-random stochastic satisfiability (ER-SSAT) problem, which combines the satisfiability (SAT) and weighted model counting (WMC) problems. Both MAP and ER-SSAT have the form $\operatorname{argmax}_X \sum_Y f(X, Y)$, where $f$ is a real-valued function over disjoint sets $X$ and $Y$ of variables. These two optimization problems request a value assignment for the $X$ variables that maximizes the weighted sum of $f(X, Y)$ over all value assignments for the $Y$ variables. ER-SSAT has been shown to be a promising approach to formally verify fairness in supervised learning. Recently, dynamic programming on graded project-join trees has been proposed to solve weighted projected model counting (WPMC), a related problem that has the form $\sum_X \max_Y f(X, Y)$. We extend this WPMC framework to exactly solve ER-SSAT and implement a dynamic-programming solver named DPER. Our empirical evaluation indicates that DPER contributes to the portfolio of state-of-the-art ER-SSAT solvers (DC-SSAT and erSSAT) through competitive performance on low-width problem instances.
△ Less
Submitted 19 May, 2022;
originally announced May 2022.
-
DPO: Dynamic-Programming Optimization on Hybrid Constraints
Authors:
Vu H. N. Phan,
Moshe Y. Vardi
Abstract:
In Bayesian inference, the most probable explanation (MPE) problem requests a variable instantiation with the highest probability given some evidence. Since a Bayesian network can be encoded as a literal-weighted CNF formula $\varphi$, we study Boolean MPE, a more general problem that requests a model $τ$ of $\varphi$ with the highest weight, where the weight of $τ$ is the product of weights of li…
▽ More
In Bayesian inference, the most probable explanation (MPE) problem requests a variable instantiation with the highest probability given some evidence. Since a Bayesian network can be encoded as a literal-weighted CNF formula $\varphi$, we study Boolean MPE, a more general problem that requests a model $τ$ of $\varphi$ with the highest weight, where the weight of $τ$ is the product of weights of literals satisfied by $τ$. It is known that Boolean MPE can be solved via reduction to (weighted partial) MaxSAT. Recent work proposed DPMC, a dynamic-programming model counter that leverages graph-decomposition techniques to construct project-join trees. A project-join tree is an execution plan that specifies how to conjoin clauses and project out variables. We build on DPMC and introduce DPO, a dynamic-programming optimizer that exactly solves Boolean MPE. By using algebraic decision diagrams (ADDs) to represent pseudo-Boolean (PB) functions, DPO is able to handle disjunctive clauses as well as XOR clauses. (Cardinality constraints and PB constraints may also be compactly represented by ADDs, so one can further extend DPO's support for hybrid inputs.) To test the competitiveness of DPO, we generate random XOR-CNF formulas. On these hybrid benchmarks, DPO significantly outperforms MaxHS, UWrMaxSat, and GaussMaxHS, which are state-of-the-art exact solvers for MaxSAT.
△ Less
Submitted 17 May, 2022;
originally announced May 2022.
-
On Internal and External Alignment of Dust Grains in Protostellar Environments
Authors:
Thiem Hoang,
Le Ngoc Tram,
Vo Hong Minh Phan,
Nguyen Chau Giang,
Nguyen Thi Phuong,
Nguyen Duc Dieu
Abstract:
We study the physical processes inducing the alignment of the grain axis of maximum inertia moment with the angular momentum (${\bf J}$, i.e., internal alignment) and of ${\bf J}$ with the magnetic field (i.e., external alignment) of very large grains (VLGs, of radius $a>10μ$m) using the grain alignment framework based on radiative torques (RATs) and mechanical torques (METs). We derive analytical…
▽ More
We study the physical processes inducing the alignment of the grain axis of maximum inertia moment with the angular momentum (${\bf J}$, i.e., internal alignment) and of ${\bf J}$ with the magnetic field (i.e., external alignment) of very large grains (VLGs, of radius $a>10μ$m) using the grain alignment framework based on radiative torques (RATs) and mechanical torques (METs). We derive analytical formulae for critical sizes of grain alignment, assuming that grains are aligned at both low$-J$ and high$-J$ attractors by RATs (METs). For protostellar cores, we find that super-Barnett relaxation can induce efficient internal alignment for VLGs with large iron inclusions aligned at high$-J$ attractors by RATs (METs). In contrast, inelastic relaxation can be efficient for VLGs made of any composition. For external alignment, we find that VLGs with iron inclusions aligned at high$-J$ attractors can have magnetic alignment by RATs ($B-$RAT) or METs ($B-$ MET), enabling dust polarization as a reliable tracer of magnetic fields in such dense regions. Still, grains at low$-J$ attractors or grains without iron inclusions have alignment along the radiation direction ($k-$RAT) or gas flow ($v-$MET). For protostellar disks, we find that super-Barnett relaxation can be efficient for grains with large iron inclusions in the outer disk thanks to spinup by METs, but inelastic relaxation is inefficient. VLGs aligned at low-J attractors can have $k-$RAT ($v-$MET) alignment, but grains aligned at high$-J$ attractors have likely $B-$RAT ($B-$MET) alignment. Grain alignment by METs appears to be more important than RATs in protostellar disks.
△ Less
Submitted 17 October, 2022; v1 submitted 4 May, 2022;
originally announced May 2022.
-
HybridNets: End-to-End Perception Network
Authors:
Dat Vu,
Bao Ngo,
Hung Phan
Abstract:
End-to-end Network has become increasingly important in multi-tasking. One prominent example of this is the growing significance of a driving perception system in autonomous driving. This paper systematically studies an end-to-end perception network for multi-tasking and proposes several key optimizations to improve accuracy. First, the paper proposes efficient segmentation head and box/class pred…
▽ More
End-to-end Network has become increasingly important in multi-tasking. One prominent example of this is the growing significance of a driving perception system in autonomous driving. This paper systematically studies an end-to-end perception network for multi-tasking and proposes several key optimizations to improve accuracy. First, the paper proposes efficient segmentation head and box/class prediction networks based on weighted bidirectional feature network. Second, the paper proposes automatically customized anchor for each level in the weighted bidirectional feature network. Third, the paper proposes an efficient training loss function and training strategy to balance and optimize network. Based on these optimizations, we have developed an end-to-end perception network to perform multi-tasking, including traffic object detection, drivable area segmentation and lane detection simultaneously, called HybridNets, which achieves better accuracy than prior art. In particular, HybridNets achieves 77.3 mean Average Precision on Berkeley DeepDrive Dataset, outperforms lane detection with 31.6 mean Intersection Over Union with 12.83 million parameters and 15.6 billion floating-point operations. In addition, it can perform visual perception tasks in real-time and thus is a practical and accurate solution to the multi-tasking problem. Code is available at https://github.com/datvuthanh/HybridNets.
△ Less
Submitted 16 March, 2022;
originally announced March 2022.
-
Story Point Effort Estimation by Text Level Graph Neural Network
Authors:
Hung Phan,
Ali Jannesari
Abstract:
Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency Inverse Document Frequency (TFIDF) as the traditional approach for this pr…
▽ More
Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency Inverse Document Frequency (TFIDF) as the traditional approach for this problem. Graph Neural Network is a new approach that has been applied in Natural Language Processing for text classification. The advantages of Graph Neural Network are based on the ability to learn information via graph data structure, which has more representations such as the relationships between words compared to approaches of vectorizing sequence of words. In this paper, we show the potential and possible challenges of Graph Neural Network text classification in story point level estimation. By the experiments, we show that the GNN Text Level Classification can achieve as high accuracy as about 80 percent for story points level classification, which is comparable to the traditional approach. We also analyze the GNN approach and point out several current disadvantages that the GNN approach can improve for this problem or other problems in software engineering.
△ Less
Submitted 14 March, 2022; v1 submitted 6 March, 2022;
originally announced March 2022.
-
Global-Local Regularization Via Distributional Robustness
Authors:
Hoang Phan,
Trung Le,
Trung Phung,
Tuan Anh Bui,
Nhat Ho,
Dinh Phung
Abstract:
Despite superior performance in many situations, deep neural networks are often vulnerable to adversarial examples and distribution shifts, limiting model generalization ability in real-world applications. To alleviate these problems, recent approaches leverage distributional robustness optimization (DRO) to find the most challenging distribution, and then minimize loss function over this most cha…
▽ More
Despite superior performance in many situations, deep neural networks are often vulnerable to adversarial examples and distribution shifts, limiting model generalization ability in real-world applications. To alleviate these problems, recent approaches leverage distributional robustness optimization (DRO) to find the most challenging distribution, and then minimize loss function over this most challenging distribution. Regardless of achieving some improvements, these DRO approaches have some obvious limitations. First, they purely focus on local regularization to strengthen model robustness, missing a global regularization effect which is useful in many real-world applications (e.g., domain adaptation, domain generalization, and adversarial machine learning). Second, the loss functions in the existing DRO approaches operate in only the most challenging distribution, hence decouple with the original distribution, leading to a restrictive modeling capability. In this paper, we propose a novel regularization technique, following the veins of Wasserstein-based DRO framework. Specifically, we define a particular joint distribution and Wasserstein-based uncertainty, allowing us to couple the original and most challenging distributions for enhancing modeling capability and applying both local and global regularizations. Empirical studies on different learning problems demonstrate that our proposed approach significantly outperforms the existing regularization approaches in various domains: semi-supervised learning, domain adaptation, domain generalization, and adversarial machine learning.
△ Less
Submitted 12 February, 2023; v1 submitted 1 March, 2022;
originally announced March 2022.
-
Emergence of Intergranular Tunneling Dominated Negative Magnetoresistance in Helimagnetic Manganese Phosphide Nanorod Thin Films
Authors:
B. Muchharla,
R. P. Madhogaria,
D. DeTellem,
C. M. Hung,
A. Chanda,
A. T. Duong,
P. T. Huy,
M. T. Trinh,
S. Cho,
S. Witanachchi,
M. H. Phan
Abstract:
Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod film…
▽ More
Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod films of 100 nm thickness were grown on Si substrates at 500 oC using molecular beam epitaxy. The temperature dependent resistivity data exhibits a metallic behavior over the entire measured temperature range. However, large negative magnetoresistance of up to 12% is observed below 50 K at which the system enters a stable helical (screw) magnetic state. In this temperature regime, the MR(H,T) dependence seems to show a magnetic field manipulated phase coexistence. The observed magnetoresistance is dominantly governed by the intergranular spin dependent tunneling mechanism. These findings pinpoint a correlation between the transport and magnetism in this helimagnetic system.
△ Less
Submitted 16 February, 2022;
originally announced February 2022.
-
Bayesian inference of three-dimensional gas maps: II. Galactic HI
Authors:
Philipp Mertsch,
Vo Hong Minh Phan
Abstract:
The 21-cm emission from atomic hydrogen (HI) is one of the most important tracers of the structure and dynamics of the interstellar medium. Thanks to Galactic rotation, the line is Doppler shifted and, assuming a model for the velocity field, data from gas line surveys can be deprojected along the line of sight. However, given our vantage point in the Galaxy, such a reconstruction suffers from a n…
▽ More
The 21-cm emission from atomic hydrogen (HI) is one of the most important tracers of the structure and dynamics of the interstellar medium. Thanks to Galactic rotation, the line is Doppler shifted and, assuming a model for the velocity field, data from gas line surveys can be deprojected along the line of sight. However, given our vantage point in the Galaxy, such a reconstruction suffers from a number of ambiguities. Here, we argue that those can be cured by exploiting the spatial coherence of the gas density that is implied by the physical processes shaping it. We have adopted a Bayesian inference framework that allows reconstructing the three-dimensional map of HI and quantifying its uncertainty. We employ data from the HI4PI compilation to produce three-dimensional maps of Galactic HI. The reconstructed density shows structure on a variety of scales. In particular, some spurs and spiral arms can be identified with ease. We discuss the morphology of the surface mass density and the radial and vertical profiles. The reconstructed three-dimensional HI densities are available at https://doi.org/10.5281/zenodo.5956696.
△ Less
Submitted 25 May, 2023; v1 submitted 4 February, 2022;
originally announced February 2022.
-
Polyphonic audio event detection: multi-label or multi-class multi-task classification problem?
Authors:
Huy Phan,
Thi Ngoc Tho Nguyen,
Philipp Koch,
Alfred Mertins
Abstract:
Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better ha…
▽ More
Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better handle polyphonic mixtures, we propose to frame the task as a multi-class classification problem by considering each possible label combination as one class. To circumvent the large number of arising classes due to combinatorial explosion, we divide the event categories into multiple groups and construct a multi-task problem in a divide-and-conquer fashion, where each of the tasks is a multi-class classification problem. A network architecture is then devised for multi-class multi-task modelling. The network is composed of a backbone subnet and multiple task-specific subnets. The task-specific subnets are designed to learn time-frequency and channel attention masks to extract features for the task at hand from the common feature maps learned by the backbone. Experiments on the TUT-SED-Synthetic-2016 with high degree of event overlap show that the proposed approach results in more favorable performance than the common multi-label approach.
△ Less
Submitted 29 January, 2022;
originally announced January 2022.
-
Feature matching as improved transfer learning technique for wearable EEG
Authors:
Elisabeth R. M. Heremans,
Huy Phan,
Amir H. Ansari,
Pascal Borzée,
Bertien Buyse,
Dries Testelmans,
Maarten De Vos
Abstract:
Objective: With the rapid rise of wearable sleep monitoring devices with non-conventional electrode configurations, there is a need for automated algorithms that can perform sleep staging on configurations with small amounts of labeled data. Transfer learning has the ability to adapt neural network weights from a source modality (e.g. standard electrode configuration) to a new target modality (e.g…
▽ More
Objective: With the rapid rise of wearable sleep monitoring devices with non-conventional electrode configurations, there is a need for automated algorithms that can perform sleep staging on configurations with small amounts of labeled data. Transfer learning has the ability to adapt neural network weights from a source modality (e.g. standard electrode configuration) to a new target modality (e.g. non-conventional electrode configuration). Methods: We propose feature matching, a new transfer learning strategy as an alternative to the commonly used finetuning approach. This method consists of training a model with larger amounts of data from the source modality and few paired samples of source and target modality. For those paired samples, the model extracts features of the target modality, matching these to the features from the corresponding samples of the source modality. Results: We compare feature matching to finetuning for three different target domains, with two different neural network architectures, and with varying amounts of training data. Particularly on small cohorts (i.e. 2 - 5 labeled recordings in the non-conventional recording setting), feature matching systematically outperforms finetuning with mean relative differences in accuracy ranging from 0.4% to 4.7% for the different scenarios and datasets. Conclusion: Our findings suggest that feature matching outperforms finetuning as a transfer learning approach, especially in very low data regimes. Significance: As such, we conclude that feature matching is a promising new method for wearable sleep staging with novel devices.
△ Less
Submitted 29 December, 2021;
originally announced January 2022.
-
Self-confinement of low-energy cosmic rays around supernova remnants
Authors:
Hanno Jacobs,
Philipp Mertsch,
Vo Hong Minh Phan
Abstract:
Supernova remnants have long been considered as a promising candidate for sources of Galactic cosmic rays. However, modelling cosmic-ray transport around these sources is complicated by the fact that the overdensity of cosmic rays close to their acceleration site can lead to self-confinement, that is the generation of turbulence upon which these particles scatter. Such a highly non-linear problem…
▽ More
Supernova remnants have long been considered as a promising candidate for sources of Galactic cosmic rays. However, modelling cosmic-ray transport around these sources is complicated by the fact that the overdensity of cosmic rays close to their acceleration site can lead to self-confinement, that is the generation of turbulence upon which these particles scatter. Such a highly non-linear problem can be addressed by numerically solving the coupled differential equations describing the evolution in space and time of the escaping particles and of the turbulent plasma waves. In this work, we focus essentially on the propagation of cosmic rays from supernova remnants in the warm ionized and warm neutral phases of the interstellar medium and propose an extended framework to take into account also the effect of energy loss relevant for cosmic rays of energy below 10 GeV. Interestingly, the diffusion coefficient of low-energy cosmic rays could be suppressed by up to 2 orders of magnitude for several tens of kiloyears after the escape from the shock. The cosmic-ray spectrum outside the supernova remnant flattens below 1 GeV at a sufficiently late time reminiscient of the spectral behaviour observed by Voyager. We also find the grammage accumulated around the source to be non-negligible, with important implications for precision fitting of the cosmic-ray spectra.
△ Less
Submitted 17 December, 2021;
originally announced December 2021.