Search | arXiv e-print repository

Unstable cosmic-ray nuclei constrain low-diffusion zones in the Galactic disk

Authors: Hanno Jacobs, Philipp Mertsch, Vo Hong Minh Phan

Abstract: Observations of the vicinity of a variety of galactic gamma-ray sources have indicated a local suppression of diffusivity of cosmic rays by up to three orders of magnitude. However, the impact of these low-diffusion zones on \emph{global} properties of cosmic-ray transport is however only poorly understood. Here, we argue that cosmic-ray nuclear ratios, like the boron-to-carbon ratio and relative… ▽ More Observations of the vicinity of a variety of galactic gamma-ray sources have indicated a local suppression of diffusivity of cosmic rays by up to three orders of magnitude. However, the impact of these low-diffusion zones on \emph{global} properties of cosmic-ray transport is however only poorly understood. Here, we argue that cosmic-ray nuclear ratios, like the boron-to-carbon ratio and relative abundances of Beryllium isotopes are sensitive to the filling fraction of such low-diffusion zones and hence their measurements can be used to constrain the typical sizes and ages of such regions. We have performed a careful parameter study of a cosmic-ray transport model that allows for different diffusion coefficients $κ_{\mathrm{disk}}$ and $κ_{\mathrm{halo}}$ in the galactic disk and halo, respectively. Making use of preliminary data from the AMS-02 experiment on the ratio of Beryllium isotopes, we find a $3.5 σ$ preference for a suppression of the diffusion coefficient in the disk with a best-fit value of $κ_{\mathrm{disk}}/κ_{\mathrm{halo}} = 0.20^{+0.10}_{-0.06}$. We forecast that with upcoming data from the HELIX balloon experiment, the significance could increase to $6.8 σ$. Adopting a coarse-graining approach, we find that such a strong suppression could be realised if the filling fraction of low-diffusion zones in the disk was $\sim 66 \, \%$. We conclude that the impact of regions of suppressed diffusion might be larger than usually assumed and ought to be taken into account in models of Galactic cosmic ray transport. △ Less

Submitted 17 May, 2023; originally announced May 2023.

Comments: 15 pages,10 figures, prepared for submission to MNRAS

arXiv:2305.10252 [pdf, other]

Sharpness & Shift-Aware Self-Supervised Learning

Authors: Ngoc N. Tran, Son Duong, Hoang Phan, Tung Pham, Dinh Phung, Trung Le

Abstract: Self-supervised learning aims to extract meaningful features from unlabeled data for further downstream tasks. In this paper, we consider classification as a downstream task in phase 2 and develop rigorous theories to realize the factors that implicitly influence the general loss of this classification task. Our theories signify that sharpness-aware feature extractors benefit the classification ta… ▽ More Self-supervised learning aims to extract meaningful features from unlabeled data for further downstream tasks. In this paper, we consider classification as a downstream task in phase 2 and develop rigorous theories to realize the factors that implicitly influence the general loss of this classification task. Our theories signify that sharpness-aware feature extractors benefit the classification task in phase 2 and the existing data shift between the ideal (i.e., the ideal one used in theory development) and practical (i.e., the practical one used in implementation) distributions to generate positive pairs also remarkably affects this classification task. Further harvesting these theoretical findings, we propose to minimize the sharpness of the feature extractor and a new Fourier-based data augmentation technique to relieve the data shift in the distributions generating positive pairs, reaching Sharpness & Shift-Aware Contrastive Learning (SSA-CLR). We conduct extensive experiments to verify our theoretical findings and demonstrate that sharpness & shift-aware contrastive learning can remarkably boost the performance as well as obtaining more robust extracted features compared with the baselines. △ Less

Submitted 17 May, 2023; originally announced May 2023.

arXiv:2305.09564 [pdf, other]

Image Reconstruction using Superpixel Clustering and Tensor Completion

Authors: Maame G. Asante-Mensah, Anh Huy Phan, Salman Ahmadi-Asl, Zaher Al Aghbari, Andrzej Cichocki

Abstract: This paper presents a pixel selection method for compact image representation based on superpixel segmentation and tensor completion. Our method divides the image into several regions that capture important textures or semantics and selects a representative pixel from each region to store. We experiment with different criteria for choosing the representative pixel and find that the centroid pixel… ▽ More This paper presents a pixel selection method for compact image representation based on superpixel segmentation and tensor completion. Our method divides the image into several regions that capture important textures or semantics and selects a representative pixel from each region to store. We experiment with different criteria for choosing the representative pixel and find that the centroid pixel performs the best. We also propose two smooth tensor completion algorithms that can effectively reconstruct different types of images from the selected pixels. Our experiments show that our superpixel-based method achieves better results than uniform sampling for various missing ratios. △ Less

Submitted 16 May, 2023; originally announced May 2023.

arXiv:2305.06848 [pdf, other]

Stochastic Variance-Reduced Majorization-Minimization Algorithms

Authors: Duy-Nhat Phan, Sedi Bartz, Nilabja Guha, Hung M. Phan

Abstract: We study a class of nonconvex nonsmooth optimization problems in which the objective is a sum of two functions: One function is the average of a large number of differentiable functions, while the other function is proper, lower semicontinuous and has a surrogate function that satisfies standard assumptions. Such problems arise in machine learning and regularized empirical risk minimization applic… ▽ More We study a class of nonconvex nonsmooth optimization problems in which the objective is a sum of two functions: One function is the average of a large number of differentiable functions, while the other function is proper, lower semicontinuous and has a surrogate function that satisfies standard assumptions. Such problems arise in machine learning and regularized empirical risk minimization applications. However, nonconvexity and the large-sum structure are challenging for the design of new algorithms. Consequently, effective algorithms for such scenarios are scarce. We introduce and study three stochastic variance-reduced majorization-minimization (MM) algorithms, combining the general MM principle with new variance-reduced techniques. We provide almost surely subsequential convergence of the generated sequence to a stationary point. We further show that our algorithms possess the best-known complexity bounds in terms of gradient evaluations. We demonstrate the effectiveness of our algorithms on sparse binary classification problems, sparse multi-class logistic regressions, and neural networks by employing several widely-used and publicly available data sets. △ Less

Submitted 11 May, 2023; originally announced May 2023.

MSC Class: 90C26; 65K05

arXiv:2305.05779 [pdf, other]

Learning to Parallelize with OpenMP by Augmented Heterogeneous AST Representation

Authors: Le Chen, Quazi Ishtiaque Mahmud, Hung Phan, Nesreen K. Ahmed, Ali Jannesari

Abstract: Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of machine learning in natural language processing. However, applying machine learning techniques to parallelism detection presents several challeng… ▽ More Detecting parallelizable code regions is a challenging task, even for experienced developers. Numerous recent studies have explored the use of machine learning for code analysis and program synthesis, including parallelization, in light of the success of machine learning in natural language processing. However, applying machine learning techniques to parallelism detection presents several challenges, such as the lack of an adequate dataset for training, an effective code representation with rich information, and a suitable machine learning model to learn the latent features of code for diverse analyses. To address these challenges, we propose a novel graph-based learning approach called Graph2Par that utilizes a heterogeneous augmented abstract syntax tree (Augmented-AST) representation for code. The proposed approach primarily focused on loop-level parallelization with OpenMP. Moreover, we create an OMP\_Serial dataset with 18598 parallelizable and 13972 non-parallelizable loops to train the machine learning models. Our results show that our proposed approach achieves the accuracy of parallelizable code region detection with 85\% accuracy and outperforms the state-of-the-art token-based machine learning approach. These results indicate that our approach is competitive with state-of-the-art tools and capable of handling loops with complex structures that other tools may overlook. △ Less

Submitted 9 May, 2023; originally announced May 2023.

arXiv:2305.05030 [pdf, ps, other]

Adaptive Cross Tubal Tensor Approximation

Authors: Salman Ahmadi-Asl, Anh Huy Phan, Andrzej Cichocki, Anastasia Sozykina, Zaher Al Aghbari, Jun Wang, Ivan Oseledets

Abstract: In this paper, we propose a new adaptive cross algorithm for computing a low tubal rank approximation of third-order tensors, with less memory and lower computational complexity than the truncated tensor SVD (t-SVD). This makes it applicable for decomposing large-scale tensors. We conduct numerical experiments on synthetic and real-world datasets to confirm the efficiency and feasibility of the pr… ▽ More In this paper, we propose a new adaptive cross algorithm for computing a low tubal rank approximation of third-order tensors, with less memory and lower computational complexity than the truncated tensor SVD (t-SVD). This makes it applicable for decomposing large-scale tensors. We conduct numerical experiments on synthetic and real-world datasets to confirm the efficiency and feasibility of the proposed algorithm. The simulation results show more than one order of magnitude acceleration in the computation of low tubal rank (t-SVD) for large-scale tensors. An application to pedestrian attribute recognition is also presented. △ Less

Submitted 11 May, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

arXiv:2305.04605 [pdf]

doi 10.1109/ICEET53442.2021.9659578

Development of a Vision System to Enhance the Reliability of the Pick-and-Place Robot for Autonomous Testing of Camera Module used in Smartphones

Authors: Hoang-Anh Phan, Duy Nam Bui, Tuan Nguyen Dinh, Bao-Anh Hoang, An Nguyen Ngoc, Dong Tran Huu Quoc, Ha Tran Thi Thuy, Tung Thanh Bui, Van Nguyen Thi Thanh

Abstract: Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions… ▽ More Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions inspection. In this paper, we proposed an effective vision system including hardware and algorithm to enhance the reliability of the pick-and-place robot for autonomous testing memory of camera modules. With limited hardware based on camera and raspberry PI and using simplify image processing algorithm based on histogram information, the vision system can confirm the presence of the camera modules in feeding tray and the placement accuracy of the camera module in test socket. Through that, the system can work with more flexibility and avoid damaging the device under test. The system was experimentally quantified through testing approximately 2000 camera modules in a stable light condition. Experimental results demonstrate that the system achieves accuracy of more than 99.92%. With its simplicity and effectiveness, the proposed vision system can be considered as a useful solution for using in pick-and-place systems in industry. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: Published to 2021 International Conference on Engineering and Emerging Technologies (ICEET 2021). 6 pages

arXiv:2305.04594 [pdf, other]

A sensor fusion approach for improving implementation speed and accuracy of RTAB-Map algorithm based indoor 3D mapping

Authors: Hoang-Anh Phan, Phuc Vinh Nguyen, Thu Hang Thi Khuat, Hieu Dang Van, Dong Huu Quoc Tran, Bao Lam Dang, Tung Thanh Bui, Van Nguyen Thi Thanh, Trinh Chu Duc

Abstract: In recent years, 3D mapping for indoor environments has undergone considerable research and improvement because of its effective applications in various fields, including robotics, autonomous navigation, and virtual reality. Building an accurate 3D map for indoor environment is challenging due to the complex nature of the indoor space, the problem of real-time embedding and positioning errors of t… ▽ More In recent years, 3D mapping for indoor environments has undergone considerable research and improvement because of its effective applications in various fields, including robotics, autonomous navigation, and virtual reality. Building an accurate 3D map for indoor environment is challenging due to the complex nature of the indoor space, the problem of real-time embedding and positioning errors of the robot system. This study proposes a method to improve the accuracy, speed, and quality of 3D indoor mapping by fusing data from the Inertial Measurement System (IMU) of the Intel Realsense D435i camera, the Ultrasonic-based Indoor Positioning System (IPS), and the encoder of the robot's wheel using the extended Kalman filter (EKF) algorithm. The merged data is processed using a Real-time Image Based Mapping algorithm (RTAB-Map), with the processing frequency updated in synch with the position frequency of the IPS device. The results suggest that fusing IMU and IPS data significantly improves the accuracy, mapping time, and quality of 3D maps. Our study highlights the proposed method's potential to improve indoor mapping in various fields, indicating that the fusion of multiple data sources can be a valuable tool in creating high-quality 3D indoor maps. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: Accepted to 20th International Joint Conference on Computer Science and Software Engineering (JCSSE 2023). 5 pages

arXiv:2305.04576 [pdf, other]

An Enhanced Sampling-Based Method With Modified Next-Best View Strategy For 2D Autonomous Robot Exploration

Authors: Dong Huu Quoc Tran, Hoang-Anh Phan, Hieu Dang Van, Tan Van Duong, Tung Thanh Bui, Van Nguyen Thi Thanh

Abstract: Autonomous exploration is a new technology in the field of robotics that has found widespread application due to its objective to help robots independently localize, scan maps, and navigate any terrain without human control. Up to present, the sampling-based exploration strategies have been the most effective for aerial and ground vehicles equipped with depth sensors producing three-dimensional po… ▽ More Autonomous exploration is a new technology in the field of robotics that has found widespread application due to its objective to help robots independently localize, scan maps, and navigate any terrain without human control. Up to present, the sampling-based exploration strategies have been the most effective for aerial and ground vehicles equipped with depth sensors producing three-dimensional point clouds. Those methods utilize the sampling task to choose random points or make samples based on Rapidly-exploring Random Trees (RRT). Then, they decide on frontiers or Next Best Views (NBV) with useful volumetric information. However, most state-of-the-art sampling-based methodology is challenging to implement in two-dimensional robots due to the lack of environmental knowledge, thus resulting in a bad volumetric gain for evaluating random destinations. This study proposed an enhanced sampling-based solution for indoor robot exploration to decide Next Best View (NBV) in 2D environments. Our method makes RRT until have the endpoints as frontiers and evaluates those with the enhanced utility function. The volumetric information obtained from environments was estimated using non-uniform distribution to determine cells that are occupied and have an uncertain probability. Compared to the sampling-based Frontier Detection and Receding Horizon NBV approaches, the methodology executed performed better in Gazebo platform-simulated environments, achieving a significantly larger explored area, with the average distance and time traveled being reduced. Moreover, the operated proposed method on an author-built 2D robot exploring the entire natural environment confirms that the method is effective and applicable in real-world scenarios. △ Less

Submitted 8 May, 2023; originally announced May 2023.

Comments: Accepted to 20th International Joint Conference on Computer Science and Software Engineering (JCSSE 2023). 6 pages

arXiv:2305.04009 [pdf, ps, other]

One-loop formulas for off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge and its applications

Authors: Khiem Hong Phan, Dzung Tri Tran, Anh Thu Nguyen

Abstract: We present analytic results for one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge within Standard Model framework. In numerical results, we show off-shell decay rate and its corrections with varying off-shell Higgs mass. The results show that the corrections are of $10\%$ contributions to total decay rates. Furthermore, we study the impacts of one… ▽ More We present analytic results for one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in 't Hooft-Veltman gauge within Standard Model framework. In numerical results, we show off-shell decay rate and its corrections with varying off-shell Higgs mass. The results show that the corrections are of $10\%$ contributions to total decay rates. Furthermore, we study the impacts of one-loop radiative corrections to off-shell decay $H^* \rightarrow W^+W^-$ in Higgs processes at future colliders. The signal processes such as $e^-e^+\rightarrow ZH^*\rightarrow Z(WW)$ with including the initial beam polarizations and $e^-e^+\rightarrow ν_e\barν_e H^* \rightarrow ν_e\barν_e (WW)$ and $e^-γ\rightarrow e^-H^* \rightarrow e^-WW$ are examined. We find that the effects are visible impacts and these should be taken into account at future colliders. △ Less

Submitted 6 May, 2023; originally announced May 2023.

Comments: Submit to Communications in Physics in VietNam

Report number: DTU2023-02

arXiv:2305.04002 [pdf, ps, other]

doi 10.1093/ptep/ptad106

One-loop expressions for $h\rightarrow l\bar{l}γ$ in Higgs extensions of the Standard Model

Authors: L. T. Hue, Dzung Tri Tran, Thanh Huy Nguyen, Khiem Hong Phan

Abstract: A systematic study of one-loop contributions to the decay channels $h\rightarrow l\bar{l}γ$ with $l=ν_{e,μ, τ}, e, μ$, performed in Higgs extended versions of the Standard Model, is presented in the 't Hooft-Veltman gauge. Analytic formulas for one-loop form factors are expressed in terms of the logarithm and di-logarithmic functions. As a result, these form factors can be reduced to those relatin… ▽ More A systematic study of one-loop contributions to the decay channels $h\rightarrow l\bar{l}γ$ with $l=ν_{e,μ, τ}, e, μ$, performed in Higgs extended versions of the Standard Model, is presented in the 't Hooft-Veltman gauge. Analytic formulas for one-loop form factors are expressed in terms of the logarithm and di-logarithmic functions. As a result, these form factors can be reduced to those relating to the loop-induced decay processes $h\rightarrow γγ, Zγ$, confirming not only previous results using different approaches but also close relations between the three kinds of the loop-induced Higgs decay rates. For phenomenological study, we focus on the two observables, namely the enhancement factors defined as ratios of the decay rates calculated between the Higgs extended versions and the standard model, and the forward-backward asymmetries of fermions, which can be used to search for Higgs extensions of the SM. We show that direct effects of mixing between neutral Higgs bosons and indirect contributions of charged Higg boson exchanges can be probed at future colliders. △ Less

Submitted 6 May, 2023; originally announced May 2023.

Comments: 39 pages, 9 Figures, 11 Tables of data

Report number: DTU2023-01

Journal ref: Prog Theor Exp Phys (2023)

arXiv:2305.01026 [pdf]

Making Atomic-Level Magnetism Tunable with Light at Room Temperature

Authors: V. O. Jimenez, Y. T. H. Pham, D. Zhou, M. Z. Liu, F. A. Nugera, V. Kalappattil, T. Eggers, K. Hoang, D. L. Duong, M. Terrones, H. R. Gutierrez, M. H. Phan

Abstract: The capacity to manipulate magnetization in two-dimensional dilute magnetic semiconductors (2D-DMSs) using light, specifically in magnetically doped transition metal dichalcogenide (TMD) monolayers (M-doped TX2, where M = V, Fe, Cr; T = W, Mo; X = S, Se, Te), may lead to innovative applications in spintronics, spin-caloritronics, valleytronics, and quantum computation. This Perspective paper explo… ▽ More The capacity to manipulate magnetization in two-dimensional dilute magnetic semiconductors (2D-DMSs) using light, specifically in magnetically doped transition metal dichalcogenide (TMD) monolayers (M-doped TX2, where M = V, Fe, Cr; T = W, Mo; X = S, Se, Te), may lead to innovative applications in spintronics, spin-caloritronics, valleytronics, and quantum computation. This Perspective paper explores the mediation of magnetization by light under ambient conditions in 2D-TMD DMSs and heterostructures. By combining magneto-LC resonance (MLCR) experiments with density functional theory (DFT) calculations, we show that the magnetization can be enhanced using light in V-doped TMD monolayers (e.g., V-WS2, V-WSe2, V-MoS2). This phenomenon is attributed to excess holes in the conduction and valence bands, as well as carriers trapped in magnetic doping states, which together mediate the magnetization of the semiconducting layer. In 2D-TMD heterostructures such as VSe2/WS2 and VSe2/MoS2, we demonstrate the significance of proximity, charge-transfer, and confinement effects in amplifying light-mediated magnetism. This effect is attributed to photon absorption at the TMD layer (e.g., WS2, MoS2) that generates electron-hole pairs mediating the magnetization of the heterostructure. These findings will encourage further research in the field of 2D magnetism and establish a novel direction for designing 2D-TMDs and heterostructures with optically tunable magnetic functionalities, paving the way for next-generation magneto-optic nanodevices. △ Less

Submitted 1 May, 2023; originally announced May 2023.

arXiv:2304.06485 [pdf, ps, other]

CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to Imperfect Modalities

Authors: Konstantinos Kontras, Christos Chatzichristos, Huy Phan, Johan Suykens, Maarten De Vos

Abstract: Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be pa… ▽ More Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be particularly beneficial when the EEG recordings are noisy or even missing completely. In this paper, we propose CoRe-Sleep, a Coordinated Representation multimodal fusion network that is particularly focused on improving the robustness of signal analysis on imperfect data. We demonstrate how appropriately handling multimodal information can be the key to achieving such robustness. CoRe-Sleep tolerates noisy or missing modalities segments, allowing training on incomplete data. Additionally, it shows state-of-the-art performance when testing on both multimodal and unimodal data using a single model on SHHS-1, the largest publicly available study that includes sleep stage labels. The results indicate that training the model on multimodal data does positively influence performance when tested on unimodal data. This work aims at bridging the gap between automated analysis tools and their clinical utility. △ Less

Submitted 27 March, 2023; originally announced April 2023.

Comments: 10 pages, 4 figures, 2 tables, journal

arXiv:2303.04104 [pdf, other]

An Inception-Residual-Based Architecture with Multi-Objective Loss for Detecting Respiratory Anomalies

Authors: Dat Ngo, Lam Pham, Huy Phan, Minh Tran, Delaram Jarchi, Sefki Kolozali

Abstract: This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system in… ▽ More This paper presents a deep learning system applied for detecting anomalies from respiratory sound recordings. Initially, our system begins with audio feature extraction using Gammatone and Continuous Wavelet transformation. This step aims to transform the respiratory sound input into a two-dimensional spectrogram where both spectral and temporal features are presented. Then, our proposed system integrates Inception-residual-based backbone models combined with multi-head attention and multi-objective loss to classify respiratory anomalies. Instead of applying a simple concatenation approach by combining results from various spectrograms, we propose a Linear combination, which has the ability to regulate equally the contribution of each individual spectrogram throughout the training process. To evaluate the performance, we conducted experiments over the benchmark dataset of SPRSound (The Open-Source SJTU Paediatric Respiratory Sound) proposed by the IEEE BioCAS 2022 challenge. As regards the Score computed by an average between the average score and harmonic score, our proposed system gained significant improvements of 9.7%, 15.8%, 17.8%, and 16.1% in Task 1-1, Task 1-2, Task 2-1, and Task 2-2, respectively, compared to the challenge baseline system. Notably, we achieved the Top-1 performance in Task 2-1 and Task 2-2 with the highest Score of 74.5% and 53.9%, respectively. △ Less

Submitted 19 June, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

arXiv:2302.02713 [pdf, other]

Flat Seeking Bayesian Neural Networks

Authors: Van-Anh Nguyen, Tung-Long Vuong, Hoang Phan, Thanh-Toan Do, Dinh Phung, Trung Le

Abstract: Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with l… ▽ More Bayesian Neural Networks (BNNs) provide a probabilistic interpretation for deep learning models by imposing a prior distribution over model parameters and inferring a posterior distribution based on observed data. The model sampled from the posterior distribution can be used for providing ensemble predictions and quantifying prediction uncertainty. It is well-known that deep learning models with lower sharpness have better generalization ability. However, existing posterior inferences are not aware of sharpness/flatness in terms of formulation, possibly leading to high sharpness for the models sampled from them. In this paper, we develop theories, the Bayesian setting, and the variational inference approach for the sharpness-aware posterior. Specifically, the models sampled from our sharpness-aware posterior, and the optimal approximate posterior estimating this sharpness-aware posterior, have better flatness, hence possibly possessing higher generalization ability. We conduct experiments by leveraging the sharpness-aware posterior with state-of-the-art Bayesian Neural Networks, showing that the flat-seeking counterparts outperform their baselines in all metrics of interest. △ Less

Submitted 6 November, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

Comments: Accepted at NeurIPS 2023

Journal ref: Advances in Neural Information Processing Systems, 2023

arXiv:2302.02447 [pdf, other]

cross-modal fusion techniques for utterance-level emotion recognition from text and speech

Authors: Jiachen Luo, Huy Phan, Joshua Reiss

Abstract: Multimodal emotion recognition (MER) is a fundamental complex research problem due to the uncertainty of human emotional expression and the heterogeneity gap between different modalities. Audio and text modalities are particularly important for a human participant in understanding emotions. Although many successful attempts have been designed multimodal representations for MER, there still exist m… ▽ More Multimodal emotion recognition (MER) is a fundamental complex research problem due to the uncertainty of human emotional expression and the heterogeneity gap between different modalities. Audio and text modalities are particularly important for a human participant in understanding emotions. Although many successful attempts have been designed multimodal representations for MER, there still exist multiple challenges to be addressed: 1) bridging the heterogeneity gap between multimodal features and model inter- and intra-modal interactions of multiple modalities; 2) effectively and efficiently modelling the contextual dynamics in the conversation sequence. In this paper, we propose Cross-Modal RoBERTa (CM-RoBERTa) model for emotion detection from spoken audio and corresponding transcripts. As the core unit of the CM-RoBERTa, parallel self- and cross- attention is designed to dynamically capture inter- and intra-modal interactions of audio and text. Specially, the mid-level fusion and residual module are employed to model long-term contextual dependencies and learn modality-specific patterns. We evaluate the approach on the MELD dataset and the experimental results show the proposed approach achieves the state-of-art performance on the dataset. △ Less

Submitted 5 February, 2023; originally announced February 2023.

Comments: 6 pages, 2 figures

arXiv:2302.02419 [pdf, other]

deep learning of segment-level feature representation for speech emotion recognition in conversations

Authors: Jiachen Luo, Huy Phan, Joshua Reiss

Abstract: Accurately detecting emotions in conversation is a necessary yet challenging task due to the complexity of emotions and dynamics in dialogues. The emotional state of a speaker can be influenced by many different factors, such as interlocutor stimulus, dialogue scene, and topic. In this work, we propose a conversational speech emotion recognition method to deal with capturing attentive contextual d… ▽ More Accurately detecting emotions in conversation is a necessary yet challenging task due to the complexity of emotions and dynamics in dialogues. The emotional state of a speaker can be influenced by many different factors, such as interlocutor stimulus, dialogue scene, and topic. In this work, we propose a conversational speech emotion recognition method to deal with capturing attentive contextual dependency and speaker-sensitive interactions. First, we use a pretrained VGGish model to extract segment-based audio representation in individual utterances. Second, an attentive bi-directional gated recurrent unit (GRU) models contextual-sensitive information and explores intra- and inter-speaker dependencies jointly in a dynamic manner. The experiments conducted on the standard conversational dataset MELD demonstrate the effectiveness of the proposed method when compared against state-of the-art methods. △ Less

Submitted 5 February, 2023; originally announced February 2023.

Comments: 6 pages, 4 figures

arXiv:2301.09567 [pdf, other]

doi 10.1145/3550340.3564218

Rig Inversion by Training a Differentiable Rig Function

Authors: Mathieu Marquis Bolduc, Hau Nghiep Phan

Abstract: Rig inversion is the problem of creating a method that can find the rig parameter vector that best approximates a given input mesh. In this paper we propose to solve this problem by first obtaining a differentiable rig function by training a multi layer perceptron to approximate the rig function. This differentiable rig function can then be used to train a deep learning model of rig inversion. Rig inversion is the problem of creating a method that can find the rig parameter vector that best approximates a given input mesh. In this paper we propose to solve this problem by first obtaining a differentiable rig function by training a multi layer perceptron to approximate the rig function. This differentiable rig function can then be used to train a deep learning model of rig inversion. △ Less

Submitted 11 January, 2023; originally announced January 2023.

Comments: Presented at Siggraph Asia '22 in Daegu, South Korea

Journal ref: SA '22: SIGGRAPH Asia 2022 Technical Communications, December 2022, Article No.: 15

arXiv:2301.07832 [pdf, other]

Internal and external alignment of carbonaceous grains within the radiative torque paradigm

Authors: Thiem Hoang, Vo Hong Minh Phan, Le Ngoc Tram

Abstract: We study the internal and external alignment of carbonaceous grains, including graphite and hydrogenated amorphous carbon (HAC), in the interstellar medium (ISM) within the RAdiative Torque (RAT) paradigm. For internal alignment (IA), we find that HAC grains having nuclear paramagnetism due to hydrogen protons can have efficient nuclear relaxation, whereas both HAC and graphite grains can have eff… ▽ More We study the internal and external alignment of carbonaceous grains, including graphite and hydrogenated amorphous carbon (HAC), in the interstellar medium (ISM) within the RAdiative Torque (RAT) paradigm. For internal alignment (IA), we find that HAC grains having nuclear paramagnetism due to hydrogen protons can have efficient nuclear relaxation, whereas both HAC and graphite grains can have efficient inelastic relaxation for grains aligned both at low$-J$ and high$-J$ attractors. For external alignment, HAC and graphite grains can align with the radiation direction ($k$-RAT) at low$-J$ attractors but cannot have stable alignment at high$-J$ attractors due to the suppression of radiative precession. HAC also has slow Larmor precession compared to the randomization by gas collisions and cannot be aligned with the magnetic field ($B$-RAT). Small HAC grains of $a<0.05μ$m drifting through the diffuse ISM can be weakly aligned along the induced electric field ($E$-RAT) at high$-J$ attractors due to its fast precession. Paramagnetic relaxation by nuclear magnetism is found inefficient for HAC grains due to the rapid suppression of nuclear susceptibility when grains rotate at high$-J$ attractors. We then study the alignment of carbon dust in the envelope of a typical C-rich Asymptotic Giant Branch star, IRC+10216. We find that grains aligned at low$-J$ attractors can occur via $k$-RAT with the wrong IA in the inner region but via $B$-RAT in the outermost region. However, grains aligned at high$-J$ attractors have the right IA alignment via $k$-RAT due to efficient inelastic relaxation. The polarization pattern observed toward IRC+10216 by SOFIA/HAWC+ can reproduced when only grains at low$-J$ attractors are present due to removal of grains at high$-J$ attractors by the RAT disruption. △ Less

Submitted 18 January, 2023; originally announced January 2023.

Comments: 24 pages, 15 figures

arXiv:2301.03441 [pdf, ps, other]

L-SeqSleepNet: Whole-cycle Long Sequence Modelling for Automatic Sleep Staging

Authors: Huy Phan, Kristian P. Lorenzen, Elisabeth Heremans, Oliver Y. Chén, Minh C. Tran, Philipp Koch, Alfred Mertins, Mathias Baumert, Kaare Mikkelsen, Maarten De Vos

Abstract: Human sleep is cyclical with a period of approximately 90 minutes, implying long temporal dependency in the sleep data. Yet, exploring this long-term dependency when developing sleep staging models has remained untouched. In this work, we show that while encoding the logic of a whole sleep cycle is crucial to improve sleep staging performance, the sequential modelling approach in existing state-of… ▽ More Human sleep is cyclical with a period of approximately 90 minutes, implying long temporal dependency in the sleep data. Yet, exploring this long-term dependency when developing sleep staging models has remained untouched. In this work, we show that while encoding the logic of a whole sleep cycle is crucial to improve sleep staging performance, the sequential modelling approach in existing state-of-the-art deep learning models are inefficient for that purpose. We thus introduce a method for efficient long sequence modelling and propose a new deep learning model, L-SeqSleepNet, which takes into account whole-cycle sleep information for sleep staging. Evaluating L-SeqSleepNet on four distinct databases of various sizes, we demonstrate state-of-the-art performance obtained by the model over three different EEG setups, including scalp EEG in conventional Polysomnography (PSG), in-ear EEG, and around-the-ear EEG (cEEGrid), even with a single EEG channel input. Our analyses also show that L-SeqSleepNet is able to alleviate the predominance of N2 sleep (the major class in terms of classification) to bring down errors in other sleep stages. Moreover the network becomes much more robust, meaning that for all subjects where the baseline method had exceptionally poor performance, their performance are improved significantly. Finally, the computation time only grows at a sub-linear rate when the sequence length increases. △ Less

Submitted 4 August, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

Comments: This article has been published in IEEE Journal of Biomedical and Health Informatics (JBHI). Source code is available at http://github.com/pquochuy/l-seqsleepnet

arXiv:2212.08952 [pdf, other]

Learning from Taxonomy: Multi-label Few-Shot Classification for Everyday Sound Recognition

Authors: Jinhua Liang, Huy Phan, Emmanouil Benetos

Abstract: Everyday sound recognition aims to infer types of sound events in audio streams. While many works succeeded in training models with high performance in a fully-supervised manner, they are still restricted to the demand of large quantities of labelled data and the range of predefined classes. To overcome these drawbacks, this work firstly curates a new database named FSD-FS for multi-label few-shot… ▽ More Everyday sound recognition aims to infer types of sound events in audio streams. While many works succeeded in training models with high performance in a fully-supervised manner, they are still restricted to the demand of large quantities of labelled data and the range of predefined classes. To overcome these drawbacks, this work firstly curates a new database named FSD-FS for multi-label few-shot audio classification. It then explores how to incorporate audio taxonomy in few-shot learning. Specifically, this work proposes label-dependent prototypical networks (LaD-protonet) to exploit parent-children relationships between labels. Plus, it applies taxonomy-aware label smoothing techniques to boost model performance. Experiments demonstrate that LaD-protonet outperforms original prototypical networks as well as other state-of-the-art methods. Moreover, its performance can be further boosted when combined with taxonomy-aware label smoothing. △ Less

Submitted 17 December, 2022; originally announced December 2022.

Comments: submitted to ICASSP2023

arXiv:2212.03470 [pdf, other]

Improving trajectory localization accuracy via direction-of-arrival derivative estimation

Authors: Ruchi Pandey, Shreyas Jaiswal, Huy Phan, Santosh Nannuru

Abstract: Sound source localization is crucial in acoustic sensing and monitoring-related applications. In this paper, we do a comprehensive analysis of improvement in sound source localization by combining the direction of arrivals (DOAs) with their derivatives which quantify the changes in the positions of sources over time. This study uses the SALSA-Lite feature with a convolutional recurrent neural netw… ▽ More Sound source localization is crucial in acoustic sensing and monitoring-related applications. In this paper, we do a comprehensive analysis of improvement in sound source localization by combining the direction of arrivals (DOAs) with their derivatives which quantify the changes in the positions of sources over time. This study uses the SALSA-Lite feature with a convolutional recurrent neural network (CRNN) model for predicting DOAs and their first-order derivatives. An update rule is introduced to combine the predicted DOAs with the estimated derivatives to obtain the final DOAs. The experimental validation is done using TAU-NIGENS Spatial Sound Events (TNSSE) 2021 dataset. We compare the performance of the networks predicting DOAs with derivative vs. the one predicting only the DOAs at low SNR levels. The results show that combining the derivatives with the DOAs improves the localization accuracy of moving sources. △ Less

Submitted 10 December, 2022; v1 submitted 7 December, 2022; originally announced December 2022.

arXiv:2212.01957 [pdf, other]

CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness

Authors: Huy Phan, Miao Yin, Yang Sui, Bo Yuan, Saman Zonouz

Abstract: Model compression and model defense for deep neural networks (DNNs) have been extensively and individually studied. Considering the co-importance of model compactness and robustness in practical applications, several prior works have explored to improve the adversarial robustness of the sparse neural networks. However, the structured sparse models obtained by the exiting works suffer severe perfor… ▽ More Model compression and model defense for deep neural networks (DNNs) have been extensively and individually studied. Considering the co-importance of model compactness and robustness in practical applications, several prior works have explored to improve the adversarial robustness of the sparse neural networks. However, the structured sparse models obtained by the exiting works suffer severe performance degradation for both benign and robust accuracy, thereby causing a challenging dilemma between robustness and structuredness of the compact DNNs. To address this problem, in this paper, we propose CSTAR, an efficient solution that can simultaneously impose the low-rankness-based Compactness, high STructuredness and high Adversarial Robustness on the target DNN models. By formulating the low-rankness and robustness requirement within the same framework and globally determining the ranks, the compressed DNNs can simultaneously achieve high compression performance and strong adversarial robustness. Evaluations for various DNN models on different datasets demonstrate the effectiveness of CSTAR. Compared with the state-of-the-art robust structured pruning methods, CSTAR shows consistently better performance. For instance, when compressing ResNet-18 on CIFAR-10, CSTAR can achieve up to 20.07% and 11.91% improvement for benign accuracy and robust accuracy, respectively. For compressing ResNet-18 with 16x compression ratio on Imagenet, CSTAR can obtain 8.58% benign accuracy gain and 4.27% robust accuracy gain compared to the existing robust structured pruning method. △ Less

Submitted 17 February, 2023; v1 submitted 4 December, 2022; originally announced December 2022.

Comments: AAAI-23

arXiv:2211.16780 [pdf, other]

Continual Learning with Optimal Transport based Mixture Model

Authors: Quyen Tran, Hoang Phan, Khoat Than, Dinh Phung, Trung Le

Abstract: Online Class Incremental learning (CIL) is a challenging setting in Continual Learning (CL), wherein data of new tasks arrive in incoming streams and online learning models need to handle incoming data streams without revisiting previous ones. Existing works used a single centroid adapted with incoming data streams to characterize a class. This approach possibly exposes limitations when the incomi… ▽ More Online Class Incremental learning (CIL) is a challenging setting in Continual Learning (CL), wherein data of new tasks arrive in incoming streams and online learning models need to handle incoming data streams without revisiting previous ones. Existing works used a single centroid adapted with incoming data streams to characterize a class. This approach possibly exposes limitations when the incoming data stream of a class is naturally multimodal. To address this issue, in this work, we first propose an online mixture model learning approach based on nice properties of the mature optimal transport theory (OT-MM). Specifically, the centroids and covariance matrices of the mixture model are adapted incrementally according to incoming data streams. The advantages are two-fold: (i) we can characterize more accurately complex data streams and (ii) by using centroids for each class produced by OT-MM, we can estimate the similarity of an unseen example to each class more reasonably when doing inference. Moreover, to combat the catastrophic forgetting in the CIL scenario, we further propose Dynamic Preservation. Particularly, after performing the dynamic preservation technique across data streams, the latent representations of the classes in the old and new tasks become more condensed themselves and more separate from each other. Together with a contraction feature extractor, this technique facilitates the model in mitigating the catastrophic forgetting. The experimental results on real-world datasets show that our proposed method can significantly outperform the current state-of-the-art baselines. △ Less

Submitted 5 December, 2022; v1 submitted 30 November, 2022; originally announced November 2022.

arXiv:2211.15116 [pdf, ps, other]

doi 10.1088/1674-1137/acc1cf

One-loop formulas for $H\rightarrow Z ν_l\barν_l$ for $l = e,μ, τ$ in 't Hooft-Veltman gauge

Authors: Dzung Tri Tran, Khiem Hong Phan

Abstract: In this paper, we present analytical results for one-loop contributing to the decay processes $H\rightarrow Z ν_l\barν_l$ (for $l = e, μ, τ$). The calculations are performed within the Standard Model framework in 't Hooft-Veltman gauge. One-loop form factors are then written in terms of scalar one-loop functions in the standard notations of {\tt LoopTools}. As a result, one-loop decay rates for th… ▽ More In this paper, we present analytical results for one-loop contributing to the decay processes $H\rightarrow Z ν_l\barν_l$ (for $l = e, μ, τ$). The calculations are performed within the Standard Model framework in 't Hooft-Veltman gauge. One-loop form factors are then written in terms of scalar one-loop functions in the standard notations of {\tt LoopTools}. As a result, one-loop decay rates for the decay channels can be evaluated numerically by using the package. Furthermore, we analyse the signals of $H\rightarrow Z ν_l\barν_l$ via the production processes $e^-e^+ \rightarrow ZH^* \rightarrow Z (H^* \rightarrow Z ν_l\barν_l)$ including the initial beam polarizations at future lepton collider. The Standard Model background such as the processes $e^-e^+ \rightarrow ν_l\barν_l ZZ$ are also examined in this study. In numerical results, we find that one-loop corrections are about $10\%$ contributions to the decay rates. They are sizeable contributions and should be taken into account at future colliders. We show that the signals $H\rightarrow Zν_l\barν_l$ are clearly visible at center-of-mass energy $\sqrt{s}=250$ GeV and it is hard to probe at higher-energy regions due to the dominant of the background. △ Less

Submitted 6 March, 2023; v1 submitted 28 November, 2022; originally announced November 2022.

Comments: 28 pages, references are added

Report number: DTU2022-04

arXiv:2211.13723 [pdf, other]

Improving Multi-task Learning via Seeking Task-based Flat Regions

Authors: Hoang Phan, Lam Tran, Quyen Tran, Ngoc N. Tran, Tuan Truong, Qi Lei, Nhat Ho, Dinh Phung, Trung Le

Abstract: Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a… ▽ More Multi-Task Learning (MTL) is a widely-used and powerful learning paradigm for training deep neural networks that allows learning more than one objective by a single backbone. Compared to training tasks separately, MTL significantly reduces computational costs, improves data efficiency, and potentially enhances model performance by leveraging knowledge across tasks. Hence, it has been adopted in a variety of applications, ranging from computer vision to natural language processing and speech recognition. Among them, there is an emerging line of work in MTL that focuses on manipulating the task gradient to derive an ultimate gradient descent direction to benefit all tasks. Despite achieving impressive results on many benchmarks, directly applying these approaches without using appropriate regularization techniques might lead to suboptimal solutions on real-world problems. In particular, standard training that minimizes the empirical loss on the training data can easily suffer from overfitting to low-resource tasks or be spoiled by noisy-labeled ones, which can cause negative transfer between tasks and overall performance drop. To alleviate such problems, we propose to leverage a recently introduced training method, named Sharpness-aware Minimization, which can enhance model generalization ability on single-task learning. Accordingly, we present a novel MTL training methodology, encouraging the model to find task-based flat minima for coherently improving its generalization capability on all tasks. Finally, we conduct comprehensive experiments on a variety of applications to demonstrate the merit of our proposed approach to existing gradient-based MTL methods, as suggested by our developed theory. △ Less

Submitted 23 May, 2025; v1 submitted 24 November, 2022; originally announced November 2022.

Comments: 35 pages, 17 figures, 7 tables

arXiv:2211.08170 [pdf, other]

A Comparative Study of Question Answering over Knowledge Bases

Authors: Khiem Vinh Tran, Hao Phu Phan, Khang Nguyen Duc Quach, Ngan Luu-Thuy Nguyen, Jun Jo, Thanh Tam Nguyen

Abstract: Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, propert… ▽ More Question answering over knowledge bases (KBQA) has become a popular approach to help users extract information from knowledge bases. Although several systems exist, choosing one suitable for a particular application scenario is difficult. In this article, we provide a comparative study of six representative KBQA systems on eight benchmark datasets. In that, we study various question types, properties, languages, and domains to provide insights on where existing systems struggle. On top of that, we propose an advanced mapping algorithm to aid existing models in achieving superior results. Moreover, we also develop a multilingual corpus COVID-KGQA, which encourages COVID-19 research and multilingualism for the diversity of future AI. Finally, we discuss the key findings and their implications as well as performance guidelines and some future improvements. Our source code is available at \url{https://github.com/tamlhp/kbqa}. △ Less

Submitted 15 November, 2022; originally announced November 2022.

arXiv:2211.05882 [pdf, other]

Field line subdiffusion and cosmic ray perpendicular transport in isotropic turbulence

Authors: Marco Kuhlen, Vo Hong Minh Phan, Philipp Mertsch

Abstract: The transport of high-energy particles in the presence of small-scale, turbulent magnetic fields is a long-standing issue in astrophysics. Analytical theories disagree with numerical simulations at rigidities where the particles' gyroradii are slightly smaller than the correlation length of turbulence. At the same time, extending the numerical simulations to lower rigidities has proven computation… ▽ More The transport of high-energy particles in the presence of small-scale, turbulent magnetic fields is a long-standing issue in astrophysics. Analytical theories disagree with numerical simulations at rigidities where the particles' gyroradii are slightly smaller than the correlation length of turbulence. At the same time, extending the numerical simulations to lower rigidities has proven computationally prohibitive. In this letter, we provide a solution to the problem of perpendicular transport in isotropic turbulence at both, high and low rigidities. We also clarify the relation between the perpendicular diffusion of particles and the transport of magnetic field lines. To this end, we have run a large suite of test particle simulations at unprecedentedly low rigidites, making extensive use of graphical processing units (GPUs). We have also developed an analytical model, based on (1) initial particle transport along field lines, (2) the transport of field lines and (3) the eventual decorrelation of particles from field lines. Our numerical results exhibit a non-standard rigidity-dependence for the perpendicular diffusion coefficient at intermediate rigidites. At the lowest rigidities, the standard rigidity-dependence is recovered. The simulated diffusion coefficients are nicely reproduced by our analytical model. We have traced the non-standard rigidity-dependence to a subdiffusive phase in the field line transport. Our study has important implications for the transport of Galactic cosmic rays, acceleration at perpendicular shocks and for high-energy particles in the heliosphere. △ Less

Submitted 30 October, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

Comments: 13 pages, 5 figures; alternative derivations and discussion significantly extended

Report number: TTK-22-36

arXiv:2211.05881 [pdf, other]

Diffusion of relativistic charged particles and field lines in isotropic turbulence

Authors: Marco Kuhlen, Vo Hong Minh Phan, Philipp Mertsch

Abstract: The transport of non-thermal particles across a large-scale magnetic field in the presence of magnetised turbulence has been a long-standing issue in high-energy astrophysics. Of particular interest is the dependence of the parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$ on rigidity $\mathcal{R}$. We have revisited this important issue with a view to applications from th… ▽ More The transport of non-thermal particles across a large-scale magnetic field in the presence of magnetised turbulence has been a long-standing issue in high-energy astrophysics. Of particular interest is the dependence of the parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$ on rigidity $\mathcal{R}$. We have revisited this important issue with a view to applications from the transport of Galactic cosmic rays to acceleration at astrophysical shocks. We have run test particle simulations of cosmic ray transport in synthetic, isotropic Kolmogorov turbulence at unprecedentedly low reduced rigidites $r_g/L_c \simeq 10^{-4}$, corresponding to $\mathcal{R} \simeq 10 \, \text{TV}$ for a turbulent magnetic field of $B_{rms} = 4 \, μ\text{G}$ and correlation length $L_c = 30 \, \text{pc}$. Extracting the (asymptotic) parallel and perpendicular mean free paths $λ_{\parallel}$ and $λ_{\perp}$, we have found $λ_{\parallel} \propto (r_g/L_c)^{1/3}$ as expected for a Kolmogorov turbulence spectrum. In contrast, $λ_{\perp}$ has a faster dependence on $r_g/L_c$ for $10^{-2} \lesssim r_g/L_c \lesssim 1$, but for $r_g/L_c \ll 10^{-2}$, also $λ_{\perp} \propto (r_g/L_c)^{1/3}$. Our results have important implications for the transport of Galactic cosmic rays. △ Less

Submitted 24 November, 2023; v1 submitted 10 November, 2022; originally announced November 2022.

Comments: 22 pages, 16 figures; references added, conclusions unchanged

Report number: TTK-22-35

arXiv:2211.00497 [pdf, other]

doi 10.1109/ICASSP49357.2023.10097173

Modelling black-box audio effects with time-varying feature modulation

Authors: Marco Comunità, Christian J. Steinmetz, Huy Phan, Joshua D. Reiss

Abstract: Deep learning approaches for black-box modelling of audio effects have shown promise, however, the majority of existing work focuses on nonlinear effects with behaviour on relatively short time-scales, such as guitar amplifiers and distortion. While recurrent and convolutional architectures can theoretically be extended to capture behaviour at longer time scales, we show that simply scaling the wi… ▽ More Deep learning approaches for black-box modelling of audio effects have shown promise, however, the majority of existing work focuses on nonlinear effects with behaviour on relatively short time-scales, such as guitar amplifiers and distortion. While recurrent and convolutional architectures can theoretically be extended to capture behaviour at longer time scales, we show that simply scaling the width, depth, or dilation factor of existing architectures does not result in satisfactory performance when modelling audio effects such as fuzz and dynamic range compression. To address this, we propose the integration of time-varying feature-wise linear modulation into existing temporal convolutional backbones, an approach that enables learnable adaptation of the intermediate activations. We demonstrate that our approach more accurately captures long-range dependencies for a range of fuzz and compressor implementations across both time and frequency domain metrics. We provide sound examples, source code, and pretrained models to faciliate reproducibility. △ Less

Submitted 9 May, 2023; v1 submitted 1 November, 2022; originally announced November 2022.

arXiv:2209.13755 [pdf, ps, other]

One-loop calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model

Authors: Khiem Hong Phan, Anh Thu Nguyen, Dzung Tri Tran

Abstract: In this paper, we present the calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model. Analytic results for one-loop form factors in the decay process are expressed in terms of the scalar one-loop Passarino$-$Veltman functions in the conventions of {\tt LoopTools}. Therefore, the decay rates can be evaluated numerically by using this package. In phenomenological… ▽ More In this paper, we present the calculations for $H\rightarrow f\bar{f}γ$ in the $U(1)_{B-L}$ extension for Standard Model. Analytic results for one-loop form factors in the decay process are expressed in terms of the scalar one-loop Passarino$-$Veltman functions in the conventions of {\tt LoopTools}. Therefore, the decay rates can be evaluated numerically by using this package. In phenomenological results, we show the differential decay rates with respect to invariant mass of fermion pair $m_{ff}$, new neutral gauge mass $M_{Z'}$ and the coupling $g'$ of $U(1)_{B-L}$ gauge group. We find that the contributions of the $U(1)_{B-L}$ extension for Standard Model are visible effects and they must be taken into account at future colliders. △ Less

Submitted 27 September, 2022; originally announced September 2022.

Comments: 7 pages, 47th Vietnam Conference on Theoretical Physics (VCTP-47)

Report number: DTU2022-03

arXiv:2209.13153 [pdf, ps, other]

One-loop on-shell and off-shell decay $H^*\rightarrow VV$ at future $e^-e^-$ collider

Authors: Anh Thu Nguyen, Dzung Tri Tran, Khiem Hong Phan

Abstract: One-loop on-shell and off-shell decays $H\rightarrow VV$ with $VV=γγ, Zγ, ZZ$ are presented in this paper. The effects of one-loop on-shell and off-shell Higgs decays in Higgs productions at $e^-e^-$ collisions are also then examined. We find that the impacts of one-loop Higgs decays are significant and they are must be taken into account at $e^-e^-$ collision. One-loop on-shell and off-shell decays $H\rightarrow VV$ with $VV=γγ, Zγ, ZZ$ are presented in this paper. The effects of one-loop on-shell and off-shell Higgs decays in Higgs productions at $e^-e^-$ collisions are also then examined. We find that the impacts of one-loop Higgs decays are significant and they are must be taken into account at $e^-e^-$ collision. △ Less

Submitted 27 September, 2022; v1 submitted 27 September, 2022; originally announced September 2022.

Comments: 9 pages, 47th Vietnam Conference on Theoretical Physics (VCTP-47)

Report number: DTU2022-02

arXiv:2209.12410 [pdf, ps, other]

One-loop off-shell decay $H^* \rightarrow ZZ$ at future colliders

Authors: Khiem Hong Phan, Dzung Tri Tran, Anh Thu Nguyen

Abstract: We present one-loop formulas for contributing to the $HZZ$ vertex in 't Hooft-Veltman gauge within Standard Model framework. One-loop off-shell Higgs decay rates to $Z$-pair are investigated in both unpolarized and longitudinal polarization for $Z$ bosons in final state. The corrections are range of $7\%$ to $8.4\%$ when we vary the off-shell Higgs mass from $200$ GeV to $500$ GeV. In applications… ▽ More We present one-loop formulas for contributing to the $HZZ$ vertex in 't Hooft-Veltman gauge within Standard Model framework. One-loop off-shell Higgs decay rates to $Z$-pair are investigated in both unpolarized and longitudinal polarization for $Z$ bosons in final state. The corrections are range of $7\%$ to $8.4\%$ when we vary the off-shell Higgs mass from $200$ GeV to $500$ GeV. In applications, we study off-shell Higgs decay $H^* \rightarrow ZZ$ in the Higgs productions at future colliders such as the signal processes $γ^*(Q^2)γ\rightarrow H^* \rightarrow ZZ$ and $e^-γ\rightarrow e^-H^* \rightarrow e^- ZZ$ are analyzed. △ Less

Submitted 6 May, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

Comments: Submit to Communications in Physics

Report number: DTU2022-01

arXiv:2209.10581 [pdf, other]

doi 10.1103/PhysRevD.107.123006

Stochasticity of Cosmic Rays from Supernova Remnants and the Ionization Rates in Molecular Clouds

Authors: Vo Hong Minh Phan, Sarah Recchia, Philipp Mertsch, Stefano Gabici

Abstract: Cosmic rays are the only agent able to penetrate into the interior of dense molecular clouds. Depositing (part of) their energy through ionisation, cosmic rays play an essential role in determining the physical and chemical evolution of star-forming regions. To a first approximation their effect can be quantified by the cosmic-ray induced ionization rate. Interestingly, theoretical estimates of th… ▽ More Cosmic rays are the only agent able to penetrate into the interior of dense molecular clouds. Depositing (part of) their energy through ionisation, cosmic rays play an essential role in determining the physical and chemical evolution of star-forming regions. To a first approximation their effect can be quantified by the cosmic-ray induced ionization rate. Interestingly, theoretical estimates of the ionization rate assuming the cosmic-ray spectra observed in the local interstellar medium result in an ionization rate that is one to two orders of magnitude below the values inferred from observations. However, due to the discrete nature of sources, the local spectra of MeV cosmic rays are in general not representative for the spectra elsewhere in the Galaxy. Such stochasticity effects have the potential of reconciling modelled ionization rates with measured ones. Here, we model the distribution of low-energy cosmic-ray spectra expected from a statistical population of supernova remnants in the Milky Way. The corresponding distribution for the ionization rate is derived and confronted with data. We find that the stochastic uncertainty helps with explaining the surprisingly high ionization rates observed in many molecular clouds. △ Less

Submitted 25 May, 2023; v1 submitted 21 September, 2022; originally announced September 2022.

Comments: Galactic cosmic-ray maps at different energies added for illustration

Report number: TTK-22-29

arXiv:2209.09692 [pdf, other]

Personalized Longitudinal Assessment of Multiple Sclerosis Using Smartphones

Authors: Oliver Y. Chén, Florian Lipsmeier, Huy Phan, Frank Dondelinger, Andrew Creagh, Christian Gossens, Michael Lindemann, Maarten de Vos

Abstract: Personalized longitudinal disease assessment is central to quickly diagnosing, appropriately managing, and optimally adapting the therapeutic strategy of multiple sclerosis (MS). It is also important for identifying the idiosyncratic subject-specific disease profiles. Here, we design a novel longitudinal model to map individual disease trajectories in an automated way using sensor data that may co… ▽ More Personalized longitudinal disease assessment is central to quickly diagnosing, appropriately managing, and optimally adapting the therapeutic strategy of multiple sclerosis (MS). It is also important for identifying the idiosyncratic subject-specific disease profiles. Here, we design a novel longitudinal model to map individual disease trajectories in an automated way using sensor data that may contain missing values. First, we collect digital measurements related to gait and balance, and upper extremity functions using sensor-based assessments administered on a smartphone. Next, we treat missing data via imputation. We then discover potential markers of MS by employing a generalized estimation equation. Subsequently, parameters learned from multiple training datasets are ensembled to form a simple, unified longitudinal predictive model to forecast MS over time in previously unseen people with MS. To mitigate potential underestimation for individuals with severe disease scores, the final model incorporates additional subject-specific fine-tuning using data from the first day. The results show that the proposed model is promising to achieve personalized longitudinal MS assessment; they also suggest that features related to gait and balance as well as upper extremity function, remotely collected from sensor-based assessments, may be useful digital markers for predicting MS over time. △ Less

Submitted 20 September, 2022; originally announced September 2022.

MSC Class: 62P10; 62P30; 62H12; 62J02; 62D10

arXiv:2208.10608 [pdf, other]

RIBAC: Towards Robust and Imperceptible Backdoor Attack against Compact DNN

Authors: Huy Phan, Cong Shi, Yi Xie, Tianfang Zhang, Zhuohang Li, Tianming Zhao, Jian Liu, Yan Wang, Yingying Chen, Bo Yuan

Abstract: Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models. To date, most of the existing studies focus on backdoor attack against the uncompressed model; while the vulnerability of compressed DNNs, which are widely used in the practical applications, is little exploited yet. In this paper, we propose to study and develop Robust and Imperceptible Bac… ▽ More Recently backdoor attack has become an emerging threat to the security of deep neural network (DNN) models. To date, most of the existing studies focus on backdoor attack against the uncompressed model; while the vulnerability of compressed DNNs, which are widely used in the practical applications, is little exploited yet. In this paper, we propose to study and develop Robust and Imperceptible Backdoor Attack against Compact DNN models (RIBAC). By performing systematic analysis and exploration on the important design knobs, we propose a framework that can learn the proper trigger patterns, model parameters and pruning masks in an efficient way. Thereby achieving high trigger stealthiness, high attack success rate and high model efficiency simultaneously. Extensive evaluations across different datasets, including the test against the state-of-the-art defense mechanisms, demonstrate the high robustness, stealthiness and model efficiency of RIBAC. Code is available at https://github.com/huyvnphan/ECCV2022-RIBAC △ Less

Submitted 22 August, 2022; originally announced August 2022.

Comments: Code is available at https://github.com/huyvnphan/ECCV2022-RIBAC

Journal ref: European Conference on Computer Vision (ECCV 2022)

arXiv:2206.11023 [pdf, other]

doi 10.1145/3544902.3546248

Heterogeneous Graph Neural Networks for Software Effort Estimation

Authors: Hung Phan, Ali Jannesari

Abstract: Software effort can be measured by story point [35]. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem which required expensive embedding models. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We select GPT2SP [12] and D… ▽ More Software effort can be measured by story point [35]. Current approaches for automatically estimating story points focus on applying pre-trained embedding models and deep learning for text regression to solve this problem which required expensive embedding models. We propose HeteroSP, a tool for estimating story points from textual input of Agile software project issues. We select GPT2SP [12] and Deep-SE [8] as the baselines for comparison. First, from the analysis of the story point dataset [8], we conclude that software issues are actually a mixture of natural language sentences with quoted code snippets and have problems related to large-size vocabulary. Second, we provide a module to normalize the input text including words and code tokens of the software issues. Third, we design an algorithm to convert an input software issue to a graph with different types of nodes and edges. Fourth, we construct a heterogeneous graph neural networks model with the support of fastText [6] for constructing initial node embedding to learn and predict the story points of new issues. We did the comparison over three scenarios of estimation, including within project, cross-project within the repository, and cross-project cross repository with our baseline approaches. We achieve the average Mean Absolute Error (MAE) as 2.38, 2.61, and 2.63 for three scenarios. We outperform GPT2SP in 2/3 of the scenarios while outperforming Deep-SE in the most challenging scenario with significantly less amount of running time. We also compare our approaches with different homogeneous graph neural network models and the results show that the heterogeneous graph neural networks model outperforms the homogeneous models in story point estimation. For time performance, we achieve about 570 seconds as the time performance in both three processes: node embedding initialization, model construction, and story point estimation. △ Less

Submitted 30 June, 2022; v1 submitted 22 June, 2022; originally announced June 2022.

Comments: Accepted in the Technical Papers Track of the 16th International Symposium on Empirical Software Engineering and Measurement, 2022 (ESEM 2022)

arXiv:2206.09953 [pdf]

doi 10.1111/maps.13773

Nanoscale mineralogy and organic structure in Orgueil (CI) and EET 92042 (CR) carbonaceous chondrites studied with AFM-IR spectroscopy

Authors: Van T. H. Phan, Rolando Rebois, Pierre Beck, Eric Quirico, Lydie Bonal, Takaaki Noguchi

Abstract: Meteorite matrices from primitive chondrites are an interplay of ingredients at the sub-micron scale, which requires analytical techniques with the nanometer spatial resolution to decipher the composition of individual components in their petrographic context. Infrared spectroscopy is an effective method that enables to probe of vibrations at the molecule-atomic scale of organic and inorganic comp… ▽ More Meteorite matrices from primitive chondrites are an interplay of ingredients at the sub-micron scale, which requires analytical techniques with the nanometer spatial resolution to decipher the composition of individual components in their petrographic context. Infrared spectroscopy is an effective method that enables to probe of vibrations at the molecule-atomic scale of organic and inorganic compounds but is often limited to a few micrometers in spatial resolution. To efficiently distinguish spectral signatures of the different constituents, we apply here nano-IR spectroscopy (AFM-IR), based on the combination of infrared and atomic force microscopy, having a spatial resolution beyond the diffraction limits. Our study aims to characterize two chosen meteorite samples to investigate primitive material in terms of bulk chemistry (the CI chondrite Orgueil) and organic composition (the CR chondrite EET 92042). We confirm that this technique allows unmixing the IR signatures of organics and minerals to assess the variability of organic structure within these samples. We report an investigation of the impact of the widely used chemical HF/HCl (Hydrogen Fluoride/Hydrochloric) extraction on the nature of refractory organics (Insoluble Organic Matter, IOM) and provide insights on the mineralogy of meteorites matrices from these two samples by comparing to reference (extra)terrestrial materials. These findings are discussed with a perspective toward understanding the impact of post-accretional aqueous alteration and thermal metamorphism on the composition of chondrites. Last, we highlight that the heterogeneity of organic matter within meteoritic materials extends down to the nanoscale, and by comparison with IOMs, oxygenated chemical groups are not affected by acid extractions. △ Less

Submitted 20 June, 2022; originally announced June 2022.

arXiv:2206.01934 [pdf, other]

Stochastic Multiple Target Sampling Gradient Descent

Authors: Hoang Phan, Ngoc Tran, Trung Le, Toan Tran, Nhat Ho, Dinh Phung

Abstract: Sampling from an unnormalized target distribution is an essential problem with many applications in probabilistic inference. Stein Variational Gradient Descent (SVGD) has been shown to be a powerful method that iteratively updates a set of particles to approximate the distribution of interest. Furthermore, when analysing its asymptotic properties, SVGD reduces exactly to a single-objective optimiz… ▽ More Sampling from an unnormalized target distribution is an essential problem with many applications in probabilistic inference. Stein Variational Gradient Descent (SVGD) has been shown to be a powerful method that iteratively updates a set of particles to approximate the distribution of interest. Furthermore, when analysing its asymptotic properties, SVGD reduces exactly to a single-objective optimization problem and can be viewed as a probabilistic version of this single-objective optimization problem. A natural question then arises: "Can we derive a probabilistic version of the multi-objective optimization?". To answer this question, we propose Stochastic Multiple Target Sampling Gradient Descent (MT-SGD), enabling us to sample from multiple unnormalized target distributions. Specifically, our MT-SGD conducts a flow of intermediate distributions gradually orienting to multiple target distributions, which allows the sampled particles to move to the joint high-likelihood region of the target distributions. Interestingly, the asymptotic analysis shows that our approach reduces exactly to the multiple-gradient descent algorithm for multi-objective optimization, as expected. Finally, we conduct comprehensive experiments to demonstrate the merit of our approach to multi-task learning. △ Less

Submitted 10 February, 2023; v1 submitted 4 June, 2022; originally announced June 2022.

Comments: Accepted to Advances in Neural Information Processing Systems (NeurIPS) 2022. 27 pages, 10 figures, 5 tables

arXiv:2205.09826 [pdf, other]

DPER: Dynamic Programming for Exist-Random Stochastic SAT

Authors: Vu H. N. Phan, Moshe Y. Vardi

Abstract: In Bayesian inference, the maximum a posteriori (MAP) problem combines the most probable explanation (MPE) and marginalization (MAR) problems. The counterpart in propositional logic is the exist-random stochastic satisfiability (ER-SSAT) problem, which combines the satisfiability (SAT) and weighted model counting (WMC) problems. Both MAP and ER-SSAT have the form… ▽ More In Bayesian inference, the maximum a posteriori (MAP) problem combines the most probable explanation (MPE) and marginalization (MAR) problems. The counterpart in propositional logic is the exist-random stochastic satisfiability (ER-SSAT) problem, which combines the satisfiability (SAT) and weighted model counting (WMC) problems. Both MAP and ER-SSAT have the form $\operatorname{argmax}_X \sum_Y f(X, Y)$, where $f$ is a real-valued function over disjoint sets $X$ and $Y$ of variables. These two optimization problems request a value assignment for the $X$ variables that maximizes the weighted sum of $f(X, Y)$ over all value assignments for the $Y$ variables. ER-SSAT has been shown to be a promising approach to formally verify fairness in supervised learning. Recently, dynamic programming on graded project-join trees has been proposed to solve weighted projected model counting (WPMC), a related problem that has the form $\sum_X \max_Y f(X, Y)$. We extend this WPMC framework to exactly solve ER-SSAT and implement a dynamic-programming solver named DPER. Our empirical evaluation indicates that DPER contributes to the portfolio of state-of-the-art ER-SSAT solvers (DC-SSAT and erSSAT) through competitive performance on low-width problem instances. △ Less

Submitted 19 May, 2022; originally announced May 2022.

Comments: arXiv admin note: substantial text overlap with arXiv:2205.08632

arXiv:2205.08632 [pdf, other]

DPO: Dynamic-Programming Optimization on Hybrid Constraints

Authors: Vu H. N. Phan, Moshe Y. Vardi

Abstract: In Bayesian inference, the most probable explanation (MPE) problem requests a variable instantiation with the highest probability given some evidence. Since a Bayesian network can be encoded as a literal-weighted CNF formula $\varphi$, we study Boolean MPE, a more general problem that requests a model $τ$ of $\varphi$ with the highest weight, where the weight of $τ$ is the product of weights of li… ▽ More In Bayesian inference, the most probable explanation (MPE) problem requests a variable instantiation with the highest probability given some evidence. Since a Bayesian network can be encoded as a literal-weighted CNF formula $\varphi$, we study Boolean MPE, a more general problem that requests a model $τ$ of $\varphi$ with the highest weight, where the weight of $τ$ is the product of weights of literals satisfied by $τ$. It is known that Boolean MPE can be solved via reduction to (weighted partial) MaxSAT. Recent work proposed DPMC, a dynamic-programming model counter that leverages graph-decomposition techniques to construct project-join trees. A project-join tree is an execution plan that specifies how to conjoin clauses and project out variables. We build on DPMC and introduce DPO, a dynamic-programming optimizer that exactly solves Boolean MPE. By using algebraic decision diagrams (ADDs) to represent pseudo-Boolean (PB) functions, DPO is able to handle disjunctive clauses as well as XOR clauses. (Cardinality constraints and PB constraints may also be compactly represented by ADDs, so one can further extend DPO's support for hybrid inputs.) To test the competitiveness of DPO, we generate random XOR-CNF formulas. On these hybrid benchmarks, DPO significantly outperforms MaxHS, UWrMaxSat, and GaussMaxHS, which are state-of-the-art exact solvers for MaxSAT. △ Less

Submitted 17 May, 2022; originally announced May 2022.

arXiv:2205.02334 [pdf, other]

doi 10.3847/1538-3881/ac9af5

On Internal and External Alignment of Dust Grains in Protostellar Environments

Authors: Thiem Hoang, Le Ngoc Tram, Vo Hong Minh Phan, Nguyen Chau Giang, Nguyen Thi Phuong, Nguyen Duc Dieu

Abstract: We study the physical processes inducing the alignment of the grain axis of maximum inertia moment with the angular momentum (${\bf J}$, i.e., internal alignment) and of ${\bf J}$ with the magnetic field (i.e., external alignment) of very large grains (VLGs, of radius $a>10μ$m) using the grain alignment framework based on radiative torques (RATs) and mechanical torques (METs). We derive analytical… ▽ More We study the physical processes inducing the alignment of the grain axis of maximum inertia moment with the angular momentum (${\bf J}$, i.e., internal alignment) and of ${\bf J}$ with the magnetic field (i.e., external alignment) of very large grains (VLGs, of radius $a>10μ$m) using the grain alignment framework based on radiative torques (RATs) and mechanical torques (METs). We derive analytical formulae for critical sizes of grain alignment, assuming that grains are aligned at both low$-J$ and high$-J$ attractors by RATs (METs). For protostellar cores, we find that super-Barnett relaxation can induce efficient internal alignment for VLGs with large iron inclusions aligned at high$-J$ attractors by RATs (METs). In contrast, inelastic relaxation can be efficient for VLGs made of any composition. For external alignment, we find that VLGs with iron inclusions aligned at high$-J$ attractors can have magnetic alignment by RATs ($B-$RAT) or METs ($B-$ MET), enabling dust polarization as a reliable tracer of magnetic fields in such dense regions. Still, grains at low$-J$ attractors or grains without iron inclusions have alignment along the radiation direction ($k-$RAT) or gas flow ($v-$MET). For protostellar disks, we find that super-Barnett relaxation can be efficient for grains with large iron inclusions in the outer disk thanks to spinup by METs, but inelastic relaxation is inefficient. VLGs aligned at low-J attractors can have $k-$RAT ($v-$MET) alignment, but grains aligned at high$-J$ attractors have likely $B-$RAT ($B-$MET) alignment. Grain alignment by METs appears to be more important than RATs in protostellar disks. △ Less

Submitted 17 October, 2022; v1 submitted 4 May, 2022; originally announced May 2022.

Comments: 43 pages, 17 figures, Table of notations is added. Version accepted

arXiv:2203.09035 [pdf]

HybridNets: End-to-End Perception Network

Authors: Dat Vu, Bao Ngo, Hung Phan

Abstract: End-to-end Network has become increasingly important in multi-tasking. One prominent example of this is the growing significance of a driving perception system in autonomous driving. This paper systematically studies an end-to-end perception network for multi-tasking and proposes several key optimizations to improve accuracy. First, the paper proposes efficient segmentation head and box/class pred… ▽ More End-to-end Network has become increasingly important in multi-tasking. One prominent example of this is the growing significance of a driving perception system in autonomous driving. This paper systematically studies an end-to-end perception network for multi-tasking and proposes several key optimizations to improve accuracy. First, the paper proposes efficient segmentation head and box/class prediction networks based on weighted bidirectional feature network. Second, the paper proposes automatically customized anchor for each level in the weighted bidirectional feature network. Third, the paper proposes an efficient training loss function and training strategy to balance and optimize network. Based on these optimizations, we have developed an end-to-end perception network to perform multi-tasking, including traffic object detection, drivable area segmentation and lane detection simultaneously, called HybridNets, which achieves better accuracy than prior art. In particular, HybridNets achieves 77.3 mean Average Precision on Berkeley DeepDrive Dataset, outperforms lane detection with 31.6 mean Intersection Over Union with 12.83 million parameters and 15.6 billion floating-point operations. In addition, it can perform visual perception tasks in real-time and thus is a practical and accurate solution to the multi-tasking problem. Code is available at https://github.com/datvuthanh/HybridNets. △ Less

Submitted 16 March, 2022; originally announced March 2022.

arXiv:2203.03062 [pdf, other]

Story Point Effort Estimation by Text Level Graph Neural Network

Authors: Hung Phan, Ali Jannesari

Abstract: Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency Inverse Document Frequency (TFIDF) as the traditional approach for this pr… ▽ More Estimating the software projects' efforts developed by agile methods is important for project managers or technical leads. It provides a summary as a first view of how many hours and developers are required to complete the tasks. There are research works on automatic predicting the software efforts, including Term Frequency Inverse Document Frequency (TFIDF) as the traditional approach for this problem. Graph Neural Network is a new approach that has been applied in Natural Language Processing for text classification. The advantages of Graph Neural Network are based on the ability to learn information via graph data structure, which has more representations such as the relationships between words compared to approaches of vectorizing sequence of words. In this paper, we show the potential and possible challenges of Graph Neural Network text classification in story point level estimation. By the experiments, we show that the GNN Text Level Classification can achieve as high accuracy as about 80 percent for story points level classification, which is comparable to the traditional approach. We also analyze the GNN approach and point out several current disadvantages that the GNN approach can improve for this problem or other problems in software engineering. △ Less

Submitted 14 March, 2022; v1 submitted 6 March, 2022; originally announced March 2022.

Comments: accepted at The 1st International Workshop on Natural Language-based Software Engineering (to appear)

arXiv:2203.00553 [pdf, other]

Global-Local Regularization Via Distributional Robustness

Authors: Hoang Phan, Trung Le, Trung Phung, Tuan Anh Bui, Nhat Ho, Dinh Phung

Abstract: Despite superior performance in many situations, deep neural networks are often vulnerable to adversarial examples and distribution shifts, limiting model generalization ability in real-world applications. To alleviate these problems, recent approaches leverage distributional robustness optimization (DRO) to find the most challenging distribution, and then minimize loss function over this most cha… ▽ More Despite superior performance in many situations, deep neural networks are often vulnerable to adversarial examples and distribution shifts, limiting model generalization ability in real-world applications. To alleviate these problems, recent approaches leverage distributional robustness optimization (DRO) to find the most challenging distribution, and then minimize loss function over this most challenging distribution. Regardless of achieving some improvements, these DRO approaches have some obvious limitations. First, they purely focus on local regularization to strengthen model robustness, missing a global regularization effect which is useful in many real-world applications (e.g., domain adaptation, domain generalization, and adversarial machine learning). Second, the loss functions in the existing DRO approaches operate in only the most challenging distribution, hence decouple with the original distribution, leading to a restrictive modeling capability. In this paper, we propose a novel regularization technique, following the veins of Wasserstein-based DRO framework. Specifically, we define a particular joint distribution and Wasserstein-based uncertainty, allowing us to couple the original and most challenging distributions for enhancing modeling capability and applying both local and global regularizations. Empirical studies on different learning problems demonstrate that our proposed approach significantly outperforms the existing regularization approaches in various domains: semi-supervised learning, domain adaptation, domain generalization, and adversarial machine learning. △ Less

Submitted 12 February, 2023; v1 submitted 1 March, 2022; originally announced March 2022.

Comments: Accepted to International Conference on Artificial Intelligence and Statistics (AISTATS 2023)

arXiv:2202.07915 [pdf]

Emergence of Intergranular Tunneling Dominated Negative Magnetoresistance in Helimagnetic Manganese Phosphide Nanorod Thin Films

Authors: B. Muchharla, R. P. Madhogaria, D. DeTellem, C. M. Hung, A. Chanda, A. T. Duong, P. T. Huy, M. T. Trinh, S. Cho, S. Witanachchi, M. H. Phan

Abstract: Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod film… ▽ More Helical magnets are emerging as a novel class of materials for spintronics and sensor applications; however, research on their charge and spin transport properties in a thin film form is less explored. Herein, we report the temperature and magnetic field dependent charge transport properties of a highly crystalline MnP nanorod thin film over a wide temperature range (2-350 K). The MnP nanorod films of 100 nm thickness were grown on Si substrates at 500 oC using molecular beam epitaxy. The temperature dependent resistivity data exhibits a metallic behavior over the entire measured temperature range. However, large negative magnetoresistance of up to 12% is observed below 50 K at which the system enters a stable helical (screw) magnetic state. In this temperature regime, the MR(H,T) dependence seems to show a magnetic field manipulated phase coexistence. The observed magnetoresistance is dominantly governed by the intergranular spin dependent tunneling mechanism. These findings pinpoint a correlation between the transport and magnetism in this helimagnetic system. △ Less

Submitted 16 February, 2022; originally announced February 2022.

arXiv:2202.02341 [pdf, other]

doi 10.1051/0004-6361/202243326

Bayesian inference of three-dimensional gas maps: II. Galactic HI

Authors: Philipp Mertsch, Vo Hong Minh Phan

Abstract: The 21-cm emission from atomic hydrogen (HI) is one of the most important tracers of the structure and dynamics of the interstellar medium. Thanks to Galactic rotation, the line is Doppler shifted and, assuming a model for the velocity field, data from gas line surveys can be deprojected along the line of sight. However, given our vantage point in the Galaxy, such a reconstruction suffers from a n… ▽ More The 21-cm emission from atomic hydrogen (HI) is one of the most important tracers of the structure and dynamics of the interstellar medium. Thanks to Galactic rotation, the line is Doppler shifted and, assuming a model for the velocity field, data from gas line surveys can be deprojected along the line of sight. However, given our vantage point in the Galaxy, such a reconstruction suffers from a number of ambiguities. Here, we argue that those can be cured by exploiting the spatial coherence of the gas density that is implied by the physical processes shaping it. We have adopted a Bayesian inference framework that allows reconstructing the three-dimensional map of HI and quantifying its uncertainty. We employ data from the HI4PI compilation to produce three-dimensional maps of Galactic HI. The reconstructed density shows structure on a variety of scales. In particular, some spurs and spiral arms can be identified with ease. We discuss the morphology of the surface mass density and the radial and vertical profiles. The reconstructed three-dimensional HI densities are available at https://doi.org/10.5281/zenodo.5956696. △ Less

Submitted 25 May, 2023; v1 submitted 4 February, 2022; originally announced February 2022.

Comments: Appendix with discussions on dust and gas correlations added

Report number: TTK-22-07

Journal ref: A&A 671, A54 (2023)

arXiv:2201.12557 [pdf, ps, other]

Polyphonic audio event detection: multi-label or multi-class multi-task classification problem?

Authors: Huy Phan, Thi Ngoc Tho Nguyen, Philipp Koch, Alfred Mertins

Abstract: Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better ha… ▽ More Polyphonic events are the main error source of audio event detection (AED) systems. In deep-learning context, the most common approach to deal with event overlaps is to treat the AED task as a multi-label classification problem. By doing this, we inherently consider multiple one-vs.-rest classification problems, which are jointly solved by a single (i.e. shared) network. In this work, to better handle polyphonic mixtures, we propose to frame the task as a multi-class classification problem by considering each possible label combination as one class. To circumvent the large number of arising classes due to combinatorial explosion, we divide the event categories into multiple groups and construct a multi-task problem in a divide-and-conquer fashion, where each of the tasks is a multi-class classification problem. A network architecture is then devised for multi-class multi-task modelling. The network is composed of a backbone subnet and multiple task-specific subnets. The task-specific subnets are designed to learn time-frequency and channel attention masks to extract features for the task at hand from the common feature maps learned by the backbone. Experiments on the TUT-SED-Synthetic-2016 with high degree of event overlap show that the proposed approach results in more favorable performance than the common multi-label approach. △ Less

Submitted 29 January, 2022; originally announced January 2022.

Comments: This paper has been accepted to IEEE ICASSP 2022

arXiv:2201.00644 [pdf, ps, other]

Feature matching as improved transfer learning technique for wearable EEG

Authors: Elisabeth R. M. Heremans, Huy Phan, Amir H. Ansari, Pascal Borzée, Bertien Buyse, Dries Testelmans, Maarten De Vos

Abstract: Objective: With the rapid rise of wearable sleep monitoring devices with non-conventional electrode configurations, there is a need for automated algorithms that can perform sleep staging on configurations with small amounts of labeled data. Transfer learning has the ability to adapt neural network weights from a source modality (e.g. standard electrode configuration) to a new target modality (e.g… ▽ More Objective: With the rapid rise of wearable sleep monitoring devices with non-conventional electrode configurations, there is a need for automated algorithms that can perform sleep staging on configurations with small amounts of labeled data. Transfer learning has the ability to adapt neural network weights from a source modality (e.g. standard electrode configuration) to a new target modality (e.g. non-conventional electrode configuration). Methods: We propose feature matching, a new transfer learning strategy as an alternative to the commonly used finetuning approach. This method consists of training a model with larger amounts of data from the source modality and few paired samples of source and target modality. For those paired samples, the model extracts features of the target modality, matching these to the features from the corresponding samples of the source modality. Results: We compare feature matching to finetuning for three different target domains, with two different neural network architectures, and with varying amounts of training data. Particularly on small cohorts (i.e. 2 - 5 labeled recordings in the non-conventional recording setting), feature matching systematically outperforms finetuning with mean relative differences in accuracy ranging from 0.4% to 4.7% for the different scenarios and datasets. Conclusion: Our findings suggest that feature matching outperforms finetuning as a transfer learning approach, especially in very low data regimes. Significance: As such, we conclude that feature matching is a promising new method for wearable sleep staging with novel devices. △ Less

Submitted 29 December, 2021; originally announced January 2022.

Comments: 14 pages, 6 figues

arXiv:2112.09708 [pdf, other]

doi 10.1088/1475-7516/2022/05/024

Self-confinement of low-energy cosmic rays around supernova remnants

Authors: Hanno Jacobs, Philipp Mertsch, Vo Hong Minh Phan

Abstract: Supernova remnants have long been considered as a promising candidate for sources of Galactic cosmic rays. However, modelling cosmic-ray transport around these sources is complicated by the fact that the overdensity of cosmic rays close to their acceleration site can lead to self-confinement, that is the generation of turbulence upon which these particles scatter. Such a highly non-linear problem… ▽ More Supernova remnants have long been considered as a promising candidate for sources of Galactic cosmic rays. However, modelling cosmic-ray transport around these sources is complicated by the fact that the overdensity of cosmic rays close to their acceleration site can lead to self-confinement, that is the generation of turbulence upon which these particles scatter. Such a highly non-linear problem can be addressed by numerically solving the coupled differential equations describing the evolution in space and time of the escaping particles and of the turbulent plasma waves. In this work, we focus essentially on the propagation of cosmic rays from supernova remnants in the warm ionized and warm neutral phases of the interstellar medium and propose an extended framework to take into account also the effect of energy loss relevant for cosmic rays of energy below 10 GeV. Interestingly, the diffusion coefficient of low-energy cosmic rays could be suppressed by up to 2 orders of magnitude for several tens of kiloyears after the escape from the shock. The cosmic-ray spectrum outside the supernova remnant flattens below 1 GeV at a sufficiently late time reminiscient of the spectral behaviour observed by Voyager. We also find the grammage accumulated around the source to be non-negligible, with important implications for precision fitting of the cosmic-ray spectra. △ Less

Submitted 17 December, 2021; originally announced December 2021.

Comments: 20 pages, 5 figures

Report number: TTK-21-56

Showing 101–150 of 322 results for author: Phan, H