-
Tailoring the Electronic Configurations of YPc$_2$ on Cu(111): Decoupling Strategies for Molecular Spin Qubits Platforms
Authors:
Soyoung Oh,
Franklin. H. Cho,
We-hyo Soe,
Jisoo Yu,
Hong Bui,
Lukas Spree,
Caroline Hommel,
Wonjun Jang,
Soo-hyon Phark,
Luciano Colazzo,
Christoph Wolf,
Fabio Donati
Abstract:
Among molecular spin qubit candidates, yttrium phthalocyanine double-decker (YPc$_2$) features a diamagnetic metal ion core that stabilizes the molecular structure, while its magnetic properties arise primarily from an unpaired electron (S = 1/2) delocalized over the phthalocyanine (Pc) ligand. Understanding its properties in the proximity of metal electrodes is crucial to assess its potential use…
▽ More
Among molecular spin qubit candidates, yttrium phthalocyanine double-decker (YPc$_2$) features a diamagnetic metal ion core that stabilizes the molecular structure, while its magnetic properties arise primarily from an unpaired electron (S = 1/2) delocalized over the phthalocyanine (Pc) ligand. Understanding its properties in the proximity of metal electrodes is crucial to assess its potential use in molecular spin qubits architectures. Here, we investigated the morphology and electronic structure of this molecule adsorbed on a metal Cu(111) surface using scanning tunneling microscopy. On that surface, YPc$_2$ adsorbs flat, with isolated molecules showing a preferred orientation along the (111) crystal axes. We observed two different types of self-assembly molecular packing when growing the molecular patches on Cu(111). For YPc$_2$ in direct contact with Cu(111), scanning tunneling spectroscopy revealed a widely separated highest occupied (HOMO) and lowest unoccupied molecular orbitals (LUMO), suggesting the quenching of the unpaired spin. Conversely, when the YPc$_2$ is separated from the substrate by a few-layer thick diamagnetic ZnPc layer, we find the HOMO to split into singly occupied (SOMO) and singly unoccupied molecular orbitals (SUMO). We find that more than 2 layers of ZnPc are needed to avoid intermixing between the two molecules and spin quenching in the YPc$_2$. Density functional theory reveals the spin quenching to be due to the hybridization between YPc$_2$ and Cu(111) states, confirming the importance of using suitable decoupling layers to preserve the unpaired molecular spin. Our results suggest the potential of YPc$_2$/ZnPc heterostructures as a stable and effective molecular spin qubit platform and validates the possibility of integrating this molecular spin qubit candidate in future quantum logic devices.
△ Less
Submitted 21 May, 2025;
originally announced May 2025.
-
Data Encryption Battlefield: A Deep Dive into the Dynamic Confrontations in Ransomware Attacks
Authors:
Arash Mahboubi,
Hamed Aboutorab,
Seyit Camtepe,
Hang Thanh Bui,
Khanh Luong,
Keyvan Ansari,
Shenlu Wang,
Bazara Barry
Abstract:
In the rapidly evolving landscape of cybersecurity threats, ransomware represents a significant challenge. Attackers increasingly employ sophisticated encryption methods, such as entropy reduction through Base64 encoding, and partial or intermittent encryption to evade traditional detection methods. This study explores the dynamic battle between adversaries who continuously refine encryption strat…
▽ More
In the rapidly evolving landscape of cybersecurity threats, ransomware represents a significant challenge. Attackers increasingly employ sophisticated encryption methods, such as entropy reduction through Base64 encoding, and partial or intermittent encryption to evade traditional detection methods. This study explores the dynamic battle between adversaries who continuously refine encryption strategies and defenders developing advanced countermeasures to protect vulnerable data. We investigate the application of online incremental machine learning algorithms designed to predict file encryption activities despite adversaries evolving obfuscation techniques. Our analysis utilizes an extensive dataset of 32.6 GB, comprising 11,928 files across multiple formats, including Microsoft Word documents (doc), PowerPoint presentations (ppt), Excel spreadsheets (xlsx), image formats (jpg, jpeg, png, tif, gif), PDFs (pdf), audio (mp3), and video (mp4) files. These files were encrypted by 75 distinct ransomware families, facilitating a robust empirical evaluation of machine learning classifiers effectiveness against diverse encryption tactics. Results highlight the Hoeffding Tree algorithms superior incremental learning capability, particularly effective in detecting traditional and AES-Base64 encryption methods employed to lower entropy. Conversely, the Random Forest classifier with warm-start functionality excels at identifying intermittent encryption methods, demonstrating the necessity of tailored machine learning solutions to counter sophisticated ransomware strategies.
△ Less
Submitted 29 April, 2025;
originally announced April 2025.
-
Implicit Sub-stepping Scheme for Critical State Soil Models
Authors:
Hoang Giang Bui,
Jelena Ninic,
Günther Meschke
Abstract:
The stress integration of critical soil model is usually based on implicit Euler algorithm, where the stress predictor is corrected by employing a return mapping algorithm. In the case of large load step, the solution of local nonlinear system to compute the plastic multiplier may not be attained. To overcome this problem, a sub-stepping scheme shall be used to improve the convergence of the local…
▽ More
The stress integration of critical soil model is usually based on implicit Euler algorithm, where the stress predictor is corrected by employing a return mapping algorithm. In the case of large load step, the solution of local nonlinear system to compute the plastic multiplier may not be attained. To overcome this problem, a sub-stepping scheme shall be used to improve the convergence of the local nonlin- ear system solution strategy. Nevertheless, the complexity of the tangent operator of the sub-stepping scheme is high. This complicates the use of Newton-Raphson algorithm to obtain global quadratic convergence. In this paper, a formulation for consistent tangent operator is developed for implicit sub-stepping integration for the modified Cam-Clay model and unified Clay and Sand model. This formulation is highly efficient and can be used with problem involving large load step, such as tun- nel simulation.
△ Less
Submitted 24 April, 2025;
originally announced April 2025.
-
Transformer Encoder and Multi-features Time2Vec for Financial Prediction
Authors:
Nguyen Kim Hai Bui,
Nguyen Duy Chien,
Péter Kovács,
Gergő Bognár
Abstract:
Financial prediction is a complex and challenging task of time series analysis and signal processing, expected to model both short-term fluctuations and long-term temporal dependencies. Transformers have remarkable success mostly in natural language processing using attention mechanism, which also influenced the time series community. The ability to capture both short and long-range dependencies h…
▽ More
Financial prediction is a complex and challenging task of time series analysis and signal processing, expected to model both short-term fluctuations and long-term temporal dependencies. Transformers have remarkable success mostly in natural language processing using attention mechanism, which also influenced the time series community. The ability to capture both short and long-range dependencies helps to understand the financial market and to recognize price patterns, leading to successful applications of Transformers in stock prediction. Although, the previous research predominantly focuses on individual features and singular predictions, that limits the model's ability to understand broader market trends. In reality, within sectors such as finance and technology, companies belonging to the same industry often exhibit correlated stock price movements.
In this paper, we develop a novel neural network architecture by integrating Time2Vec with the Encoder of the Transformer model. Based on the study of different markets, we propose a novel correlation feature selection method. Through a comprehensive fine-tuning of multiple hyperparameters, we conduct a comparative analysis of our results against benchmark models. We conclude that our method outperforms other state-of-the-art encoding methods such as positional encoding, and we also conclude that selecting correlation features enhance the accuracy of predicting multiple stock prices.
△ Less
Submitted 18 April, 2025;
originally announced April 2025.
-
Counting Number of Triangulations of Point Sets: Reinterpreting and Generalizing the Triangulation Polynomials
Authors:
Hong Duc Bui
Abstract:
We describe a framework that unifies the two types of polynomials introduced respectively by Bacher and Mouton and by Rutschmann and Wettstein to analyze the number of triangulations of point sets. Using this insight, we generalize the triangulation polynomials of chains to a wider class of near-edges, enabling efficient computation of the number of triangulations of certain families of point sets…
▽ More
We describe a framework that unifies the two types of polynomials introduced respectively by Bacher and Mouton and by Rutschmann and Wettstein to analyze the number of triangulations of point sets. Using this insight, we generalize the triangulation polynomials of chains to a wider class of near-edges, enabling efficient computation of the number of triangulations of certain families of point sets. We use the framework to try to improve the result in Rutschmann and Wettstein without success, suggesting that their result is close to optimal.
△ Less
Submitted 13 April, 2025;
originally announced April 2025.
-
Square Packing with Asymptotically Smallest Waste Only Needs Good Squares
Authors:
Hong Duc Bui
Abstract:
We consider the problem of packing a large square with nonoverlapping unit squares. Let $W(x)$ be the minimum wasted area when a large square of side length $x$ is packed with unit squares. In Roth and Vaughan's paper that proves the lower bound $W(x) \notin o(x^{1/2})$, a good square is defined to be a square with inclination at most $10^{-10}$ with respect to the large square. In this article, w…
▽ More
We consider the problem of packing a large square with nonoverlapping unit squares. Let $W(x)$ be the minimum wasted area when a large square of side length $x$ is packed with unit squares. In Roth and Vaughan's paper that proves the lower bound $W(x) \notin o(x^{1/2})$, a good square is defined to be a square with inclination at most $10^{-10}$ with respect to the large square. In this article, we prove that in calculating the asymptotic growth of the wasted space, it suffices to only consider packings with only good squares. This allows the lower bound proof in Roth and Vaughan's paper to be simplified by not having to handle bad squares.
△ Less
Submitted 13 April, 2025;
originally announced April 2025.
-
MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation
Authors:
Khai Le-Duc,
Tuyen Tran,
Bach Phan Tat,
Nguyen Kim Hai Bui,
Quan Dang,
Hung-Phong Tran,
Thanh-Thuy Nguyen,
Ly Nguyen,
Tuan-Minh Phan,
Thi Thu Phuong Tran,
Chris Ngo,
Nguyen X. Khanh,
Thanh Nguyen-Tang
Abstract:
Multilingual speech translation (ST) in the medical domain enhances patient care by enabling efficient communication across language barriers, alleviating specialized workforce shortages, and facilitating improved diagnosis and treatment, particularly during pandemics. In this work, we present the first systematic study on medical ST, to our best knowledge, by releasing MultiMed-ST, a large-scale…
▽ More
Multilingual speech translation (ST) in the medical domain enhances patient care by enabling efficient communication across language barriers, alleviating specialized workforce shortages, and facilitating improved diagnosis and treatment, particularly during pandemics. In this work, we present the first systematic study on medical ST, to our best knowledge, by releasing MultiMed-ST, a large-scale ST dataset for the medical domain, spanning all translation directions in five languages: Vietnamese, English, German, French, Traditional Chinese and Simplified Chinese, together with the models. With 290,000 samples, our dataset is the largest medical machine translation (MT) dataset and the largest many-to-many multilingual ST among all domains. Secondly, we present the most extensive analysis study in ST research to date, including: empirical baselines, bilingual-multilingual comparative study, end-to-end vs. cascaded comparative study, task-specific vs. multi-task sequence-to-sequence (seq2seq) comparative study, code-switch analysis, and quantitative-qualitative error analysis. All code, data, and models are available online: https://github.com/leduckhai/MultiMed-ST.
△ Less
Submitted 4 April, 2025;
originally announced April 2025.
-
Corner-Grasp: Multi-Action Grasp Detection and Active Gripper Adaptation for Grasping in Cluttered Environments
Authors:
Yeong Gwang Son,
Seunghwan Um,
Juyong Hong,
Tat Hieu Bui,
Hyouk Ryeol Choi
Abstract:
Robotic grasping is an essential capability, playing a critical role in enabling robots to physically interact with their surroundings. Despite extensive research, challenges remain due to the diverse shapes and properties of target objects, inaccuracies in sensing, and potential collisions with the environment. In this work, we propose a method for effectively grasping in cluttered bin-picking en…
▽ More
Robotic grasping is an essential capability, playing a critical role in enabling robots to physically interact with their surroundings. Despite extensive research, challenges remain due to the diverse shapes and properties of target objects, inaccuracies in sensing, and potential collisions with the environment. In this work, we propose a method for effectively grasping in cluttered bin-picking environments where these challenges intersect. We utilize a multi-functional gripper that combines both suction and finger grasping to handle a wide range of objects. We also present an active gripper adaptation strategy to minimize collisions between the gripper hardware and the surrounding environment by actively leveraging the reciprocating suction cup and reconfigurable finger motion. To fully utilize the gripper's capabilities, we built a neural network that detects suction and finger grasp points from a single input RGB-D image. This network is trained using a larger-scale synthetic dataset generated from simulation. In addition to this, we propose an efficient approach to constructing a real-world dataset that facilitates grasp point detection on various objects with diverse characteristics. Experiment results show that the proposed method can grasp objects in cluttered bin-picking scenarios and prevent collisions with environmental constraints such as a corner of the bin. Our proposed method demonstrated its effectiveness in the 9th Robotic Grasping and Manipulation Competition (RGMC) held at ICRA 2024.
△ Less
Submitted 2 April, 2025;
originally announced April 2025.
-
Asymptotically accurate and locking-free finite element implementation of the refined shell theory
Authors:
Khanh Chau Le,
Hoang-Giang Bui
Abstract:
A formulation of the 2D refined shell theory incorporating transverse shear in the rescaled coordinates and angles of rotation is considered. This novel approach provides the first asymptotically accurate and inherently locking-free finite element implementation. Numerical simulations of semi-cylindrical shells demonstrate excellent agreement between the analytical solution, the 2D refined shell t…
▽ More
A formulation of the 2D refined shell theory incorporating transverse shear in the rescaled coordinates and angles of rotation is considered. This novel approach provides the first asymptotically accurate and inherently locking-free finite element implementation. Numerical simulations of semi-cylindrical shells demonstrate excellent agreement between the analytical solution, the 2D refined shell theory, and three-dimensional elasticity theory, validating the effectiveness and accuracy of the method.
△ Less
Submitted 30 March, 2025;
originally announced March 2025.
-
Convex Analysis in Spectral Decomposition Systems
Authors:
Hòa T. Bùi,
Minh N. Bùi,
Christian Clason
Abstract:
This work is concerned with convex analysis of so-called spectral functions of matrices that only depend on eigenvalues of the matrix. An abstract framework of spectral decomposition systems is proposed that covers a wide range of previously studied settings, including eigenvalue decomposition of Hermitian matrices and singular value decomposition of rectangular matrices and allows deriving new re…
▽ More
This work is concerned with convex analysis of so-called spectral functions of matrices that only depend on eigenvalues of the matrix. An abstract framework of spectral decomposition systems is proposed that covers a wide range of previously studied settings, including eigenvalue decomposition of Hermitian matrices and singular value decomposition of rectangular matrices and allows deriving new results in more general settings such as Euclidean Jordan algebras. The main results characterize convexity, lower semicontinuity, Fenchel conjugates, convex subdifferentials, and Bregman proximity operators of spectral functions in terms of the reduced functions. As a byproduct, a generalization of the Ky Fan majorization theorem is obtained.
△ Less
Submitted 19 March, 2025;
originally announced March 2025.
-
A-SCoRe: Attention-based Scene Coordinate Regression for wide-ranging scenarios
Authors:
Huy-Hoang Bui,
Bach-Thuan Bui,
Quang-Vinh Tran,
Yasuyuki Fujii,
Joo-Ho Lee
Abstract:
Visual localization is considered to be one of the crucial parts in many robotic and vision systems. While state-of-the art methods that relies on feature matching have proven to be accurate for visual localization, its requirements for storage and compute are burdens. Scene coordinate regression (SCR) is an alternative approach that remove the barrier for storage by learning to map 2D pixels to 3…
▽ More
Visual localization is considered to be one of the crucial parts in many robotic and vision systems. While state-of-the art methods that relies on feature matching have proven to be accurate for visual localization, its requirements for storage and compute are burdens. Scene coordinate regression (SCR) is an alternative approach that remove the barrier for storage by learning to map 2D pixels to 3D scene coordinates. Most popular SCR use Convolutional Neural Network (CNN) to extract 2D descriptor, which we would argue that it miss the spatial relationship between pixels. Inspired by the success of vision transformer architecture, we present a new SCR architecture, called A-ScoRe, an Attention-based model which leverage attention on descriptor map level to produce meaningful and high-semantic 2D descriptors. Since the operation is performed on descriptor map, our model can work with multiple data modality whether it is a dense or sparse from depth-map, SLAM to Structure-from-Motion (SfM). This versatility allows A-SCoRe to operate in different kind of environments, conditions and achieve the level of flexibility that is important for mobile robots. Results show our methods achieve comparable performance with State-of-the-art methods on multiple benchmark while being light-weighted and much more flexible. Code and pre-trained models are public in our repository: https://github.com/ais-lab/A-SCoRe.
△ Less
Submitted 18 March, 2025;
originally announced March 2025.
-
Improved 3D Point-Line Mapping Regression for Camera Relocalization
Authors:
Bach-Thuan Bui,
Huy-Hoang Bui,
Yasuyuki Fujii,
Dinh-Tuan Tran,
Joo-Ho Lee
Abstract:
In this paper, we present a new approach for improving 3D point and line mapping regression for camera re-localization. Previous methods typically rely on feature matching (FM) with stored descriptors or use a single network to encode both points and lines. While FM-based methods perform well in large-scale environments, they become computationally expensive with a growing number of mapping points…
▽ More
In this paper, we present a new approach for improving 3D point and line mapping regression for camera re-localization. Previous methods typically rely on feature matching (FM) with stored descriptors or use a single network to encode both points and lines. While FM-based methods perform well in large-scale environments, they become computationally expensive with a growing number of mapping points and lines. Conversely, approaches that learn to encode mapping features within a single network reduce memory footprint but are prone to overfitting, as they may capture unnecessary correlations between points and lines. We propose that these features should be learned independently, each with a distinct focus, to achieve optimal accuracy. To this end, we introduce a new architecture that learns to prioritize each feature independently before combining them for localization. Experimental results demonstrate that our approach significantly enhances the 3D map point and line regression performance for camera re-localization. The implementation of our method will be publicly available at: https://github.com/ais-lab/pl2map/.
△ Less
Submitted 28 February, 2025;
originally announced February 2025.
-
RLSA-PFL: Robust Lightweight Secure Aggregation with Model Inconsistency Detection in Privacy-Preserving Federated Learning
Authors:
Nazatul H. Sultan,
Yan Bo,
Yansong Gao,
Seyit Camtepe,
Arash Mahboubi,
Hang Thanh Bui,
Aufeef Chauhan,
Hamed Aboutorab,
Michael Bewong,
Dineshkumar Singh,
Praveen Gauravaram,
Rafiqul Islam,
Sharif Abuadbba
Abstract:
Federated Learning (FL) allows users to collaboratively train a global machine learning model by sharing local model only, without exposing their private data to a central server. This distributed learning is particularly appealing in scenarios where data privacy is crucial, and it has garnered substantial attention from both industry and academia. However, studies have revealed privacy vulnerabil…
▽ More
Federated Learning (FL) allows users to collaboratively train a global machine learning model by sharing local model only, without exposing their private data to a central server. This distributed learning is particularly appealing in scenarios where data privacy is crucial, and it has garnered substantial attention from both industry and academia. However, studies have revealed privacy vulnerabilities in FL, where adversaries can potentially infer sensitive information from the shared model parameters. In this paper, we present an efficient masking-based secure aggregation scheme utilizing lightweight cryptographic primitives to mitigate privacy risks. Our scheme offers several advantages over existing methods. First, it requires only a single setup phase for the entire FL training session, significantly reducing communication overhead. Second, it minimizes user-side overhead by eliminating the need for user-to-user interactions, utilizing an intermediate server layer and a lightweight key negotiation method. Third, the scheme is highly resilient to user dropouts, and the users can join at any FL round. Fourth, it can detect and defend against malicious server activities, including recently discovered model inconsistency attacks. Finally, our scheme ensures security in both semi-honest and malicious settings. We provide security analysis to formally prove the robustness of our approach. Furthermore, we implemented an end-to-end prototype of our scheme. We conducted comprehensive experiments and comparisons, which show that it outperforms existing solutions in terms of communication and computation overhead, functionality, and security.
△ Less
Submitted 16 April, 2025; v1 submitted 13 February, 2025;
originally announced February 2025.
-
Multi-Agent Path Finding under Limited Communication Range Constraint via Dynamic Leading
Authors:
Hoang-Dung Bui,
Erion Plaku,
Gregoy J. Stein
Abstract:
This paper proposes a novel framework to handle a multi-agent path finding problem under a limited communication range constraint, where all agents must have a connected communication channel to the rest of the team. Many existing approaches to multi-agent path finding (e.g., leader-follower platooning) overcome computational challenges of planning in this domain by planning one agent at a time in…
▽ More
This paper proposes a novel framework to handle a multi-agent path finding problem under a limited communication range constraint, where all agents must have a connected communication channel to the rest of the team. Many existing approaches to multi-agent path finding (e.g., leader-follower platooning) overcome computational challenges of planning in this domain by planning one agent at a time in a fixed order. However, fixed leader-follower approaches can become stuck during planning, limiting their practical utility in dense-clutter environments. To overcome this limitation, we develop dynamic leading multi-agent path finding, which allows for dynamic reselection of the leading agent during path planning whenever progress cannot be made. The experiments show the efficiency of our framework, which can handle up to 25 agents with more than 90% success-rate across five environment types where baselines routinely fail.
△ Less
Submitted 5 February, 2025; v1 submitted 6 January, 2025;
originally announced January 2025.
-
Micrometer-resolution fluorescence and lifetime mappings of CsPbBr$_3$ nanocrystal films coupled with a TiO$_2$ grating
Authors:
Viet Anh Nguyen,
Linh Thi Dieu Nguyen,
Thi Thu Ha Do,
Ye Wu,
Aleksandr A. Sergeev,
Ding Zhu,
Vytautas Valuckas,
Duong Pham,
Hai Xuan Son Bui,
Duy Mai Hoang,
Son Tung Bui,
Xuan Khuyen Bui,
Binh Thanh Nguyen,
Hai Son Nguyen,
Lam Dinh Vu,
Andrey Rogach,
Son Tung Ha,
Quynh Le-Van
Abstract:
Enhancing light emission from perovskite nanocrystal (NC) films is essential in light-emitting devices, as their conventional stacks often restrict the escape of emitted light. This work addresses this challenge by employing a TiO$_2$ grating to enhance light extraction and shape the emission of CsPbBr$_3$ nanocrystal films. Angle-resolved photoluminescence (PL) demonstrated a tenfold increase in…
▽ More
Enhancing light emission from perovskite nanocrystal (NC) films is essential in light-emitting devices, as their conventional stacks often restrict the escape of emitted light. This work addresses this challenge by employing a TiO$_2$ grating to enhance light extraction and shape the emission of CsPbBr$_3$ nanocrystal films. Angle-resolved photoluminescence (PL) demonstrated a tenfold increase in emission intensity by coupling the Bloch resonances of the grating with the spontaneous emission of the perovskite NCs. Fluorescence lifetime imaging microscopy (FLIM) provided micrometer-resolution mapping of both PL intensity and lifetime across a large area, revealing a decrease in PL lifetime from 8.2 ns for NC films on glass to 6.1 ns on the TiO$_2$ grating. Back focal plane (BFP) spectroscopy confirmed how the Bloch resonances transformed the unpolarized, spatially incoherent emission of NCs into polarized and directed light. These findings provide further insights into the interactions between dielectric nanostructures and perovskite NC films, offering possible pathways for designing better performing perovskite optoelectronic devices.
△ Less
Submitted 19 November, 2024;
originally announced November 2024.
-
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
Authors:
Ha Manh Bui,
Enrique Mallada,
Anqi Liu
Abstract:
By leveraging the representation power of deep neural networks, neural upper confidence bound (UCB) algorithms have shown success in contextual bandits. To further balance the exploration and exploitation, we propose Neural-$σ^2$-LinearUCB, a variance-aware algorithm that utilizes $σ^2_t$, i.e., an upper bound of the reward noise variance at round $t$, to enhance the uncertainty quantification qua…
▽ More
By leveraging the representation power of deep neural networks, neural upper confidence bound (UCB) algorithms have shown success in contextual bandits. To further balance the exploration and exploitation, we propose Neural-$σ^2$-LinearUCB, a variance-aware algorithm that utilizes $σ^2_t$, i.e., an upper bound of the reward noise variance at round $t$, to enhance the uncertainty quantification quality of the UCB, resulting in a regret performance improvement. We provide an oracle version for our algorithm characterized by an oracle variance upper bound $σ^2_t$ and a practical version with a novel estimation for this variance bound. Theoretically, we provide rigorous regret analysis for both versions and prove that our oracle algorithm achieves a better regret guarantee than other neural-UCB algorithms in the neural contextual bandits setting. Empirically, our practical method enjoys a similar computational efficiency, while outperforming state-of-the-art techniques by having a better calibration and lower regret across multiple standard settings, including on the synthetic, UCI, MNIST, and CIFAR-10 datasets.
△ Less
Submitted 10 March, 2025; v1 submitted 8 November, 2024;
originally announced November 2024.
-
Observation of a Halo Trimer in an Ultracold Bose-Fermi Mixture
Authors:
Alexander Y. Chuang,
Huan Q. Bui,
Arthur Christianen,
Yiming Zhang,
Yiqi Ni,
Denise Ahmed-Braun,
Carsten Robens,
Martin W. Zwierlein
Abstract:
The quantum mechanics of three interacting particles gives rise to interesting universal phenomena, such as the staircase of Efimov trimers predicted in the context of nuclear physics and observed in ultracold gases. Here, we observe a novel type of halo trimer using radiofrequency spectroscopy in an ultracold mixture of $^{23}$Na and $^{40}$K atoms. The trimers consist of two light bosons and one…
▽ More
The quantum mechanics of three interacting particles gives rise to interesting universal phenomena, such as the staircase of Efimov trimers predicted in the context of nuclear physics and observed in ultracold gases. Here, we observe a novel type of halo trimer using radiofrequency spectroscopy in an ultracold mixture of $^{23}$Na and $^{40}$K atoms. The trimers consist of two light bosons and one heavy fermion, and have the structure of a Feshbach dimer weakly bound to one additional boson. We find that the trimer peak closely follows the dimer resonance over the entire range of explored interaction strengths across an order of magnitude variation of the dimer energy, as reproduced by our theoretical analysis. The presence of this halo trimer is of direct relevance for many-body physics in ultracold mixtures and the association of ultracold molecules.
△ Less
Submitted 7 November, 2024;
originally announced November 2024.
-
Optimization and Characterization of Thermoelectric Properties in Selenium-Doped Bismuth Telluride Ultra Thin Films
Authors:
Kien Trung Nguyen,
Lan Anh Dong,
Hien Thi Dinh,
Thi Huyen Trang Bui,
Son Truong Chu,
Thuat Nguyen-Tran,
Chi Hieu Hoang,
Hung Quoc Nguyen
Abstract:
Thermoelectricity in telluride materials is often improved by replacing telluride with selenium in its crystal. Most work, however, focuses on bulk crystal and leaves the 2D thin films intact. In this paper, we optimize the fabrication of selenium-doped bismuth telluride (Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$) thin films using a 3-source thermal co-evaporation. Thermoelectric properties, including th…
▽ More
Thermoelectricity in telluride materials is often improved by replacing telluride with selenium in its crystal. Most work, however, focuses on bulk crystal and leaves the 2D thin films intact. In this paper, we optimize the fabrication of selenium-doped bismuth telluride (Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$) thin films using a 3-source thermal co-evaporation. Thermoelectric properties, including the Seebeck coefficient and electrical resistivity, are systematically characterized to evaluate the material's performance for thermoelectric applications near room temperature. The thin films were deposited under carefully controlled conditions, with the evaporation rates of bismuth, tellurium, and selenium precisely monitored to achieve the desired stoichiometry and crystalline phase. Finally, thermoelectricity in Bi$_2$Te$_{3-\rm{x}}$Se$_{\rm{x}}$ at the ultra-thin regime is investigated. We consistently obtain films with thickness near 30 nm with a Seebeck coefficient of 400 $μ$V/K and a power factor of 1 mW/mK$^2$.
△ Less
Submitted 27 October, 2024;
originally announced October 2024.
-
Specific Nucleic Acid Detection Using a Nanoparticle Hybridization Assay
Authors:
A. A. Aldakheel,
C. B. Raub,
H. T. Bui
Abstract:
Simple methods to detect biomolecules including specific nucleic acid sequences have received renewed attention since the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus pandemic. Notably, biomolecule detection that uses some form of signal amplification will have some form of amplification-related error, which in the polymerase chain reaction involves mispriming and subsequent…
▽ More
Simple methods to detect biomolecules including specific nucleic acid sequences have received renewed attention since the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) virus pandemic. Notably, biomolecule detection that uses some form of signal amplification will have some form of amplification-related error, which in the polymerase chain reaction involves mispriming and subsequent signal amplification in the no template control, ultimately providing a limit of detection. To demonstrate the feasibility of the detection of a DNA target sequence without molecular or chemical signal amplification that avoids amplification errors, a gold nanoparticle aggregation assay was developed and tested. Two primers bracketing a 94 base pair target sequence from SARS-CoV-2 were conjugated to 10 nm diameter gold nanoparticles by the salt aging method, with conjugation and primer-target hybridization confirmed by agarose gel electrophoresis and nanospectrophotometry. Upon mixing of both conjugated nanoparticles with target, a surface plasmon resonance shift of 6 nm was observed, and lower electrophoretic mobility of a band containing both DNA fluorescence and gold absorption signals. This did not occur in the presence of a control DNA molecule of the same size and composition as the target but with a randomly scrambled base position. Nanoparticle tracking at 30 frames per second using a sensitive darkfield microscope revealed a lower measured diffusion coefficient of scattering objects in the target mixture than in the control mixture or with bare gold nanoparticles.
△ Less
Submitted 5 September, 2024;
originally announced September 2024.
-
Efficient driving of a spin-qubit using single-atom magnets
Authors:
Jose Reina-Gálvez,
Hoang-Anh Le,
Hong Thi Bui,
Soo-hyon Phark,
Nicolás Lorente,
Christoph Wolf
Abstract:
The realization of electron-spin resonance at the single-atom level using scanning tunneling microscopy has opened new avenues for coherent quantum sensing and quantum state manipulation at the ultimate size limit. This allows to build many-body Hamiltonians and the study of their complex physical behavior. Recently, a novel qubit platform has emerged from this field, raising questions about the d…
▽ More
The realization of electron-spin resonance at the single-atom level using scanning tunneling microscopy has opened new avenues for coherent quantum sensing and quantum state manipulation at the ultimate size limit. This allows to build many-body Hamiltonians and the study of their complex physical behavior. Recently, a novel qubit platform has emerged from this field, raising questions about the driving mechanism from single-atom magnets. In this work, we demonstrate how single-atom magnets can be used to drive a nearby single spin qubit efficiently. We show that the modulation of exchange coupling is the primary driving force, which successfully reproduces Rabi rates in the tens of MHz range, consistent with experimental data, while also addressing critical aspects related to the optimization of experimental parameters.
△ Less
Submitted 13 May, 2025; v1 submitted 14 August, 2024;
originally announced August 2024.
-
State estimation for a class of nonlinear time-varying uncertain system under multiharmonic disturbance
Authors:
Alexey A. Margun,
Van H. Bui,
Alexey A. Bobtsov,
Denis V. Efimov
Abstract:
The paper considers the observer synthesis for nonlinear, time-varying plants with uncertain parameters under multiharmonic disturbance. It is assumed that the relative degree of the plant is known, the regressor linearly depends on the state vector and may have a nonlinear relationship with the output signal. The proposed solution consists of three steps. Initially, an unknown input state observe…
▽ More
The paper considers the observer synthesis for nonlinear, time-varying plants with uncertain parameters under multiharmonic disturbance. It is assumed that the relative degree of the plant is known, the regressor linearly depends on the state vector and may have a nonlinear relationship with the output signal. The proposed solution consists of three steps. Initially, an unknown input state observer is synthesized. This observer, however, necessitates the measurement of output derivatives equal to the plant's relative degree. To relax this limitation, an alternative representation of the observer is introduced. Further, based on this observer, the unknown parameters and disturbances are reconstructed using an autoregression model and the dynamic regressor extension and mixing (DREM) approach. This approach allows the estimates to be obtained in a finite time. Finally, based on these estimates, an observer has been constructed that does not require measurements of the output derivatives. The effectiveness and efficiency of this solution are demonstrated through a computer simulation.
△ Less
Submitted 25 July, 2024;
originally announced July 2024.
-
Design of Targeted Community-Based Resource Allocation in the Presence of Vaccine Hesitancy via a Data-Driven Compartmental Stochastic Optimization Model
Authors:
Hieu Bui,
Sandra Eksioglu,
Ruben Proano,
Haoming Shen
Abstract:
Vaccines have proven effective in mitigating the threat of severe infections and deaths during outbreaks of infectious diseases. However, vaccine hesitancy (VH) complicates disease spread prediction and healthcare resource assessment across regions and populations. We propose a modeling framework that integrates an epidemiological compartmental model that captures the spread of an infectious disea…
▽ More
Vaccines have proven effective in mitigating the threat of severe infections and deaths during outbreaks of infectious diseases. However, vaccine hesitancy (VH) complicates disease spread prediction and healthcare resource assessment across regions and populations. We propose a modeling framework that integrates an epidemiological compartmental model that captures the spread of an infectious disease within a multi-stage stochastic program (MSP) that determines the allocation of critical resources under uncertainty. The proposed compartmental MSP model adaptively manages the allocation of resources to account for changes in population behavior toward vaccines (i.e., variability in VH), the unique patterns of disease spread, and the availability of healthcare resources over time and space. The compartmental MSP model allowed us to analyze the price of fairness in resource allocation. Using real COVID-19 vaccination and healthcare resource data from Arkansas, U.S. (January-May 2021), our findings include: (i) delaying the initial deployment of additional ventilators by one month could lead to an average increase in the expected number of deaths by 285.41/month, highlighting the importance of prompt action; (ii) each additional ventilator in the initial stockpile and in supply leads to a decrease in the expected number of deaths by 1.09/month and 0.962/month, respectively, emphasizing the importance of maintaining a large stockpile and scalable production response; (iii) the cost of ensuring equitable resource allocation varies over time and location, peaking during the peak of a disease outbreak and in densely populated areas. This study emphasizes the importance of flexible, informed public health decision-making and preparedness, providing a model for effective resource allocation in public health emergencies.
△ Less
Submitted 8 May, 2024;
originally announced May 2024.
-
Improved All-Pairs Approximate Shortest Paths in Congested Clique
Authors:
Hong Duc Bui,
Shashwat Chandra,
Yi-Jun Chang,
Michal Dory,
Dean Leitersdorf
Abstract:
In this paper, we present new algorithms for approximating All-Pairs Shortest Paths (APSP) in the Congested Clique model. We present randomized algorithms for weighted undirected graphs.
Our first contribution is an $O(1)$-approximate APSP algorithm taking just $O(\log \log \log n)$ rounds. Prior to our work, the fastest algorithms that give an $O(1)$-approximation for APSP take…
▽ More
In this paper, we present new algorithms for approximating All-Pairs Shortest Paths (APSP) in the Congested Clique model. We present randomized algorithms for weighted undirected graphs.
Our first contribution is an $O(1)$-approximate APSP algorithm taking just $O(\log \log \log n)$ rounds. Prior to our work, the fastest algorithms that give an $O(1)$-approximation for APSP take $\operatorname{poly}(\log{n})$ rounds in weighted undirected graphs, and $\operatorname{poly}(\log \log n)$ rounds in unweighted undirected graphs.
If we terminate the execution of the algorithm early, we obtain an $O(t)$-round algorithm that yields an $O \big( (\log n)^{1/2^t} \big) $ distance approximation for a parameter $t$. The trade-off between $t$ and the approximation quality provides flexibility for different scenarios, allowing the algorithm to adapt to specific requirements. In particular, we can get an $O \big( (\log n)^{1/2^t} \big) $-approximation for any constant $t$ in $O(1)$-rounds. Such result was previously known only for the special case that $t=0$.
A key ingredient in our algorithm is a lemma that allows to improve an $O(a)$-approximation for APSP to an $O(\sqrt{a})$-approximation for APSP in $O(1)$ rounds. To prove the lemma, we develop several new tools, including $O(1)$-round algorithms for computing the $k$ closest nodes, a certain type of hopset, and skeleton graphs.
△ Less
Submitted 4 May, 2024;
originally announced May 2024.
-
Correspondence theorems for infinite Hopf-Galois extensions
Authors:
Hoan-Phung Bui,
Joost Vercruysse,
Gabor Wiese
Abstract:
This paper extends Hopf-Galois theory to infinite field extensions and provides a natural definition of subextensions. For separable (possibly infinite) Hopf-Galois extensions, it provides a Galois correspondence. This correspondence also is a refinement of what was known in the case of finite separable Hopf-Galois extensions.
This paper extends Hopf-Galois theory to infinite field extensions and provides a natural definition of subextensions. For separable (possibly infinite) Hopf-Galois extensions, it provides a Galois correspondence. This correspondence also is a refinement of what was known in the case of finite separable Hopf-Galois extensions.
△ Less
Submitted 10 April, 2024;
originally announced April 2024.
-
Leveraging Neural Radiance Field in Descriptor Synthesis for Keypoints Scene Coordinate Regression
Authors:
Huy-Hoang Bui,
Bach-Thuan Bui,
Dinh-Tuan Tran,
Joo-Ho Lee
Abstract:
Classical structural-based visual localization methods offer high accuracy but face trade-offs in terms of storage, speed, and privacy. A recent innovation, keypoint scene coordinate regression (KSCR) named D2S addresses these issues by leveraging graph attention networks to enhance keypoint relationships and predict their 3D coordinates using a simple multilayer perceptron (MLP). Camera pose is t…
▽ More
Classical structural-based visual localization methods offer high accuracy but face trade-offs in terms of storage, speed, and privacy. A recent innovation, keypoint scene coordinate regression (KSCR) named D2S addresses these issues by leveraging graph attention networks to enhance keypoint relationships and predict their 3D coordinates using a simple multilayer perceptron (MLP). Camera pose is then determined via PnP+RANSAC, using established 2D-3D correspondences. While KSCR achieves competitive results, rivaling state-of-the-art image-retrieval methods like HLoc across multiple benchmarks, its performance is hindered when data samples are limited due to the deep learning model's reliance on extensive data. This paper proposes a solution to this challenge by introducing a pipeline for keypoint descriptor synthesis using Neural Radiance Field (NeRF). By generating novel poses and feeding them into a trained NeRF model to create new views, our approach enhances the KSCR's generalization capabilities in data-scarce environments. The proposed system could significantly improve localization accuracy by up to 50% and cost only a fraction of time for data synthesis. Furthermore, its modular design allows for the integration of multiple NeRFs, offering a versatile and efficient solution for visual localization. The implementation is publicly available at: https://github.com/ais-lab/DescriptorSynthesis4Feat2Map.
△ Less
Submitted 19 March, 2024; v1 submitted 15 March, 2024;
originally announced March 2024.
-
Empirical Band-Gap Correction for LDA-Derived Atomic Effective Pseudopotentials
Authors:
Surender Kumar,
Hanh Bui,
Gabriel Bester
Abstract:
Atomic effective pseudopotentials enable atomistic calculations at the level of accuracy of density functional theory for semiconductor nanostructures with up to fifty thousand atoms. Since they are directly derived from ab-initio calculations performed in the local density approximation (LDA), they inherit the typical underestimated band gaps and effective masses. We propose an empirical correcti…
▽ More
Atomic effective pseudopotentials enable atomistic calculations at the level of accuracy of density functional theory for semiconductor nanostructures with up to fifty thousand atoms. Since they are directly derived from ab-initio calculations performed in the local density approximation (LDA), they inherit the typical underestimated band gaps and effective masses. We propose an empirical correction based on the modification of the non-local part of the pseudopotential and demonstrate good performance for bulk binary materials (InP, ZnS, HgTe, GaAs) and quantum dots (InP, CdSe, GaAs) with diameters ranging from 1.0 nm to 4.45 nm. Additionally, we provide a simple analytic expression to obtain accurate quasiparticle and optical band gaps for InP, CdSe, and GaAs QDs, from standard LDA calculation.
△ Less
Submitted 13 March, 2024;
originally announced March 2024.
-
Evaluating the Impact of Vaccine Hesitancy on the Allocation of Vital Resources During COVID-19 Pandemic
Authors:
Hieu Bui,
Sandra Eksioglu,
Ruben Proano
Abstract:
The COVID-19 pandemic highlighted significant challenges in the allocation of vital healthcare resources. Existing epidemiological models, specifically compartmental models, aimed to predict the spread of the COVID-19 virus and its impact on the population, but they overlooked the influence of \ac{VH} on disease dynamics, including the expected number of hospitalizations and fatalities. We propose…
▽ More
The COVID-19 pandemic highlighted significant challenges in the allocation of vital healthcare resources. Existing epidemiological models, specifically compartmental models, aimed to predict the spread of the COVID-19 virus and its impact on the population, but they overlooked the influence of \ac{VH} on disease dynamics, including the expected number of hospitalizations and fatalities. We propose improvements to the \ac{SEIR} model for COVID-19 by incorporating the influence of vaccination, \ac{VH}, and resource availability on the disease dynamics. We collect publicly available data and perform data analysis to capture \ac{VH} dynamic changes over time and develop scenario paths for \ac{VH}. We simulate the proposed compartmental model for each \ac{VH} path to explain the impacts of public attitudes toward vaccination, the impacts of healthcare resources on patient outcomes, and the timing of vaccination rollout on the progression and severity of the epidemic. Our analysis demonstrates that reducing \ac{VH} improves health outcomes, reinforcing the importance of addressing \ac{VH} to curb the spread of infectious diseases. Our results show that adequate levels of critical healthcare resources are crucial for minimizing fatalities and also highlight the life-saving impact of timely and effective vaccination programs.
△ Less
Submitted 12 March, 2024;
originally announced March 2024.
-
Density-Regression: Efficient and Distance-Aware Deep Regressor for Uncertainty Estimation under Distribution Shifts
Authors:
Ha Manh Bui,
Anqi Liu
Abstract:
Morden deep ensembles technique achieves strong uncertainty estimation performance by going through multiple forward passes with different models. This is at the price of a high storage space and a slow speed in the inference (test) time. To address this issue, we propose Density-Regression, a method that leverages the density function in uncertainty estimation and achieves fast inference by a sin…
▽ More
Morden deep ensembles technique achieves strong uncertainty estimation performance by going through multiple forward passes with different models. This is at the price of a high storage space and a slow speed in the inference (test) time. To address this issue, we propose Density-Regression, a method that leverages the density function in uncertainty estimation and achieves fast inference by a single forward pass. We prove it is distance aware on the feature space, which is a necessary condition for a neural network to produce high-quality uncertainty estimation under distribution shifts. Empirically, we conduct experiments on regression tasks with the cubic toy dataset, benchmark UCI, weather forecast with time series, and depth estimation under real-world shifted applications. We show that Density-Regression has competitive uncertainty estimation performance under distribution shifts with modern deep regressors while using a lower model size and a faster inference speed.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Enhancing Biomechanical Simulations Based on A Posteriori Error Estimates: The Potential of Dual Weighted Residual-Driven Adaptive Mesh Refinement
Authors:
Huu Phuoc Bui,
Michel Duprez,
Pierre-Yves Rohan,
Arnaud Lejeune,
Stephane P. A. Bordas,
Marek Bucki,
Franz Chouly
Abstract:
The Finite Element Method (FEM) is a well-established procedure for computing approximate solutions to deterministic engineering problems described by partial differential equations. FEM produces discrete approximations of the solution with a discretisation error that can be an be quantified with \emph{a posteriori} error estimates. The practical relevance of error estimates for biomechanics probl…
▽ More
The Finite Element Method (FEM) is a well-established procedure for computing approximate solutions to deterministic engineering problems described by partial differential equations. FEM produces discrete approximations of the solution with a discretisation error that can be an be quantified with \emph{a posteriori} error estimates. The practical relevance of error estimates for biomechanics problems, especially for soft tissue where the response is governed by large strains, is rarely addressed. In this contribution, we propose an implementation of \emph{a posteriori} error estimates targeting a user-defined quantity of interest, using the Dual Weighted Residual (DWR) technique tailored to biomechanics. The proposed method considers a general setting that encompasses three-dimensional geometries and model non-linearities, which appear in hyperelastic soft tissues. We take advantage of the automatic differentiation capabilities embedded in modern finite element software, which allows the error estimates to be computed generically for a large class of models and constitutive laws. First we validate our methodology using experimental measurements from silicone samples, and then illustrate its applicability for patient-specific computations of pressure ulcers on a human heel.
△ Less
Submitted 1 March, 2024;
originally announced March 2024.
-
Representing 3D sparse map points and lines for camera relocalization
Authors:
Bach-Thuan Bui,
Huy-Hoang Bui,
Dinh-Tuan Tran,
Joo-Ho Lee
Abstract:
Recent advancements in visual localization and mapping have demonstrated considerable success in integrating point and line features. However, expanding the localization framework to include additional mapping components frequently results in increased demand for memory and computational resources dedicated to matching tasks. In this study, we show how a lightweight neural network can learn to rep…
▽ More
Recent advancements in visual localization and mapping have demonstrated considerable success in integrating point and line features. However, expanding the localization framework to include additional mapping components frequently results in increased demand for memory and computational resources dedicated to matching tasks. In this study, we show how a lightweight neural network can learn to represent both 3D point and line features, and exhibit leading pose accuracy by harnessing the power of multiple learned mappings. Specifically, we utilize a single transformer block to encode line features, effectively transforming them into distinctive point-like descriptors. Subsequently, we treat these point and line descriptor sets as distinct yet interconnected feature sets. Through the integration of self- and cross-attention within several graph layers, our method effectively refines each feature before regressing 3D maps using two simple MLPs. In comprehensive experiments, our indoor localization findings surpass those of Hloc and Limap across both point-based and line-assisted configurations. Moreover, in outdoor scenarios, our method secures a significant lead, marking the most considerable enhancement over state-of-the-art learning-based methodologies. The source code and demo videos of this work are publicly available at: https://thpjp.github.io/pl2map/
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Stability of asymptotic waves in the Fisher-Stefan equation
Authors:
T. T. H. Bui,
P. van Heijster,
R. Marangell
Abstract:
We establish spectral, linear, and nonlinear stability of the vanishing and slow-moving travelling waves that arise as time asymptotic solutions to the Fisher-Stefan equation. Nonlinear stability is in terms of the limiting equations that the asymptotic waves satisfy.
We establish spectral, linear, and nonlinear stability of the vanishing and slow-moving travelling waves that arise as time asymptotic solutions to the Fisher-Stefan equation. Nonlinear stability is in terms of the limiting equations that the asymptotic waves satisfy.
△ Less
Submitted 14 March, 2024; v1 submitted 15 February, 2024;
originally announced February 2024.
-
All-electrical driving and probing of dressed states in a single spin
Authors:
Hong T. Bui,
Christoph Wolf,
Yu Wang,
Masahiro Haze,
Arzhang Ardavan,
Andreas J. Heinrich,
Soo-hyon Phark
Abstract:
The sub-nanometer distance between tip and sample in a scanning tunneling microscope (STM) enables the application of very large electric fields with a strength as high as ~ 1 GV/m. This has allowed for efficient electrical driving of Rabi oscillations of a single spin on a surface at a moderate radio-frequency (RF) voltage of the order of tens of millivolts. Here, we demonstrate the creation of d…
▽ More
The sub-nanometer distance between tip and sample in a scanning tunneling microscope (STM) enables the application of very large electric fields with a strength as high as ~ 1 GV/m. This has allowed for efficient electrical driving of Rabi oscillations of a single spin on a surface at a moderate radio-frequency (RF) voltage of the order of tens of millivolts. Here, we demonstrate the creation of dressed states of a single electron spin localized in the STM tunnel junction by using resonant RF driving voltages. The read-out of these dressed states was achieved all-electrical by a weakly coupled probe spin. Our work highlights the strength of the atomic-scale geometry inherent to the STM that facilitates creation and control of dressed states, which are promising for a design of atomically well-defined single spin quantum devices on surfaces.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset
Authors:
Hung Bui,
Harikrishna Warrier,
Yogesh Gupta
Abstract:
Electronic health record (EHR) is more and more popular, and it comes with applying machine learning solutions to resolve various problems in the domain. This growing research area also raises the need for EHRs accessibility. Medical Information Mart for Intensive Care (MIMIC) dataset is a popular, public, and free EHR dataset in a raw format that has been used in numerous studies. However, despit…
▽ More
Electronic health record (EHR) is more and more popular, and it comes with applying machine learning solutions to resolve various problems in the domain. This growing research area also raises the need for EHRs accessibility. Medical Information Mart for Intensive Care (MIMIC) dataset is a popular, public, and free EHR dataset in a raw format that has been used in numerous studies. However, despite of its popularity, it is lacking benchmarking work, especially with recent state of the art works in the field of deep learning with time-series tabular data. The aim of this work is to fill this lack by providing a benchmark for latest version of MIMIC dataset, MIMIC-IV. We also give a detailed literature survey about studies that has been already done for MIIMIC-III.
△ Less
Submitted 26 January, 2024;
originally announced January 2024.
-
A Proposed Artificial Neural Network based Approach for Molecules Bitter Prediction
Authors:
Huynh Quoc Anh Bui,
Trong Hop Do,
Thanh Binh Nguyen
Abstract:
In recent years, the development of Artificial Intelligence (AI) has offered the possibility to tackle many interdisciplinary problems, and the field of chemistry is not an exception. Drug analysis is crucial in drug discovery, playing an important role in human life. However, this task encounters many difficulties due to the wide range of computational chemistry methods. Drug analysis also involv…
▽ More
In recent years, the development of Artificial Intelligence (AI) has offered the possibility to tackle many interdisciplinary problems, and the field of chemistry is not an exception. Drug analysis is crucial in drug discovery, playing an important role in human life. However, this task encounters many difficulties due to the wide range of computational chemistry methods. Drug analysis also involves a massive amount of work, including determining taste. Thus, applying deep learning to predict a molecule's bitterness is inevitable to accelerate innovation in drug analysis by reducing the time spent. This paper proposes an artificial neural network (ANN) based approach (EC-ANN) for the molecule's bitter prediction. Our approach took the SMILE (Simplified molecular-input line-entry system) string of a molecule as the input data for the prediction, and the 256-bit ECFP descriptor is the input vector for our network. It showed impressive results compared to state-of-the-art, with a higher performance on two out of three test sets according to the experiences on three popular test sets: Phyto-Dictionary, Unimi, and Bitter-new set [1]. For the Phyto-Dictionary test set, our model recorded 0.95 and 0.983 in F1-score and AUPR, respectively, depicted as the highest score in F1-score. For the Unimi test set, our model achieved 0.88 in F1-score and 0.88 in AUPR, which is roughly 12.3% higher than the peak of previous models [1, 2, 3, 4, 5].
△ Less
Submitted 14 November, 2023;
originally announced November 2023.
-
PhoGPT: Generative Pre-training for Vietnamese
Authors:
Dat Quoc Nguyen,
Linh The Nguyen,
Chi Tran,
Dung Ngoc Nguyen,
Dinh Phung,
Hung Bui
Abstract:
We open-source a state-of-the-art 4B-parameter generative model series for Vietnamese, which includes the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. The base model, PhoGPT-4B, with exactly 3.7B parameters, is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length, employing a vocabulary of 20480 token types. The chat vari…
▽ More
We open-source a state-of-the-art 4B-parameter generative model series for Vietnamese, which includes the base pre-trained monolingual model PhoGPT-4B and its chat variant, PhoGPT-4B-Chat. The base model, PhoGPT-4B, with exactly 3.7B parameters, is pre-trained from scratch on a Vietnamese corpus of 102B tokens, with an 8192 context length, employing a vocabulary of 20480 token types. The chat variant, PhoGPT-4B-Chat, is the modeling output obtained by fine-tuning PhoGPT-4B on a dataset of 70K instructional prompts and their responses, along with an additional 290K conversations. In addition, we also demonstrate its superior performance compared to previous open-source models. Our PhoGPT models are available at: https://github.com/VinAIResearch/PhoGPT
△ Less
Submitted 22 March, 2024; v1 submitted 6 November, 2023;
originally announced November 2023.
-
Asymptotically accurate and locking-free finite element implementation of first order shear deformation theory for plates
Authors:
Khanh Chau Le,
Hoang Giang Bui
Abstract:
A formulation of the asymptotically exact first-order shear deformation theory for linear-elastic homogeneous plates in the rescaled coordinates and rotation angles is considered. This allows the development of its asymptotically accurate and shear-locking-free finite element implementation. As applications, numerical simulations are performed for circular and rectangular plates, showing complete…
▽ More
A formulation of the asymptotically exact first-order shear deformation theory for linear-elastic homogeneous plates in the rescaled coordinates and rotation angles is considered. This allows the development of its asymptotically accurate and shear-locking-free finite element implementation. As applications, numerical simulations are performed for circular and rectangular plates, showing complete agreement between the analytical solution and the numerical solutions based on two-dimensional theory and three-dimensional elasticity theory.
△ Less
Submitted 16 April, 2024; v1 submitted 30 October, 2023;
originally announced October 2023.
-
Reinforcement Learning for Reduced-order Models of Legged Robots
Authors:
Yu-Ming Chen,
Hien Bui,
Michael Posa
Abstract:
Model-based approaches for planning and control for bipedal locomotion have a long history of success. It can provide stability and safety guarantees while being effective in accomplishing many locomotion tasks. Model-free reinforcement learning, on the other hand, has gained much popularity in recent years due to computational advancements. It can achieve high performance in specific tasks, but i…
▽ More
Model-based approaches for planning and control for bipedal locomotion have a long history of success. It can provide stability and safety guarantees while being effective in accomplishing many locomotion tasks. Model-free reinforcement learning, on the other hand, has gained much popularity in recent years due to computational advancements. It can achieve high performance in specific tasks, but it lacks physical interpretability and flexibility in re-purposing the policy for a different set of tasks. For instance, we can initially train a neural network (NN) policy using velocity commands as inputs. However, to handle new task commands like desired hand or footstep locations at a desired walking velocity, we must retrain a new NN policy. In this work, we attempt to bridge the gap between these two bodies of work on a bipedal platform. We formulate a model-based reinforcement learning problem to learn a reduced-order model (ROM) within a model predictive control (MPC). Results show a 49% improvement in viable task region size and a 21% reduction in motor torque cost. All videos and code are available at https://sites.google.com/view/ymchen/research/rl-for-roms.
△ Less
Submitted 15 October, 2023;
originally announced October 2023.
-
Enhancing Task Performance of Learned Simplified Models via Reinforcement Learning
Authors:
Hien Bui,
Michael Posa
Abstract:
In contact-rich tasks, the hybrid, multi-modal nature of contact dynamics poses great challenges in model representation, planning, and control. Recent efforts have attempted to address these challenges via data-driven methods, learning dynamical models in combination with model predictive control. Those methods, while effective, rely solely on minimizing forward prediction errors to hope for bett…
▽ More
In contact-rich tasks, the hybrid, multi-modal nature of contact dynamics poses great challenges in model representation, planning, and control. Recent efforts have attempted to address these challenges via data-driven methods, learning dynamical models in combination with model predictive control. Those methods, while effective, rely solely on minimizing forward prediction errors to hope for better task performance with MPC controllers. This weak correlation can result in data inefficiency as well as limitations to overall performance. In response, we propose a novel strategy: using a policy gradient algorithm to find a simplified dynamics model that explicitly maximizes task performance. Specifically, we parameterize the stochastic policy as the perturbed output of the MPC controller, thus, the learned model representation can directly associate with the policy or task performance. We apply the proposed method to contact-rich tasks where a three-fingered robotic hand manipulates previously unknown objects. Our method significantly enhances task success rate by up to 15% in manipulating diverse objects compared to the existing method while sustaining data efficiency. Our method can solve some tasks with success rates of 70% or higher using under 30 minutes of data. All videos and codes are available at https://sites.google.com/view/lcs-rl.
△ Less
Submitted 7 March, 2024; v1 submitted 14 October, 2023;
originally announced October 2023.
-
A Survey of Multi-Robot Motion Planning
Authors:
Hoang-Dung Bui
Abstract:
Multi-robot Motion Planning (MRMP) is an active research field which has gained attention over the years. MRMP has significant roles to improve the efficiency and reliability of multi-robot system in a wide range of applications from delivery robots to collaborative assembly lines. This survey provides an overview of MRMP taxonomy, state-of-the-art algorithms, and approaches which have been develo…
▽ More
Multi-robot Motion Planning (MRMP) is an active research field which has gained attention over the years. MRMP has significant roles to improve the efficiency and reliability of multi-robot system in a wide range of applications from delivery robots to collaborative assembly lines. This survey provides an overview of MRMP taxonomy, state-of-the-art algorithms, and approaches which have been developed for multi-robot systems. This study also discusses the strengths and limitations of each algorithm and their applications in various scenarios. Moreover, based on this, we can draw out open problems for future research.
△ Less
Submitted 29 October, 2023; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Robust Adaptive Compensation of External Disturbances for Multi-Channel Linear Systems
Authors:
V. H. Bui,
A. A. Margun
Abstract:
This paper proposes a new algorithm for compensating external disturbances for class of multi-channel linear systems. The solution to this problem is based on the use of the internal model principle and the extended error adaptation algorithm. It is assumed that the disturbance is the output of an autonomous linear generator with unknown parameters. At the first stage, a full-order observer with u…
▽ More
This paper proposes a new algorithm for compensating external disturbances for class of multi-channel linear systems. The solution to this problem is based on the use of the internal model principle and the extended error adaptation algorithm. It is assumed that the disturbance is the output of an autonomous linear generator with unknown parameters. At the first stage, a full-order observer with unknown input signals (Unknown Input Observer - UIO) is synthesized to solve the problem of estimating the state vector of this plant. Then a new observer of external disturbance is formed on the basis of state vector estimations. At the last stage, based on the new observer's estimations, a system with an extended state vector is formed for which a regulator providing compensation of disturbance is constructed. The performance of the obtained results is confirmed using computer simulation in MATLAB Simulink.
△ Less
Submitted 11 October, 2023;
originally announced October 2023.
-
Negative discrete moments of the derivative of the Riemann zeta-function
Authors:
Hung M. Bui,
Alexandra Florea,
Micah B. Milinovich
Abstract:
We obtain conditional upper bounds for negative discrete moments of the derivative of the Riemann zeta-function averaged over a subfamily of zeros of the zeta function which is expected to have full density inside the set of all zeros. For $k\leq 1/2$, our bounds for the $2k$-th moments are expected to be almost optimal. Assuming a conjecture about the maximum size of the argument of the zeta func…
▽ More
We obtain conditional upper bounds for negative discrete moments of the derivative of the Riemann zeta-function averaged over a subfamily of zeros of the zeta function which is expected to have full density inside the set of all zeros. For $k\leq 1/2$, our bounds for the $2k$-th moments are expected to be almost optimal. Assuming a conjecture about the maximum size of the argument of the zeta function on the critical line, we obtain upper bounds for these negative moments of the same strength while summing over a larger subfamily of zeta zeros.
△ Less
Submitted 5 October, 2023;
originally announced October 2023.
-
Cutting Plane Algorithms are Exact for Euclidean Max-Sum Problems
Authors:
Hoa T. Bui,
Sandy Spiers,
Ryan Loxton
Abstract:
This paper studies binary quadratic programs in which the objective is defined by a Euclidean distance matrix, subject to a general polyhedral constraint set. This class of nonconcave maximisation problems includes the capacitated, generalised and bi-level diversity problems as special cases. We introduce two exact cutting plane algorithms to solve this class of optimisation problems. The new algo…
▽ More
This paper studies binary quadratic programs in which the objective is defined by a Euclidean distance matrix, subject to a general polyhedral constraint set. This class of nonconcave maximisation problems includes the capacitated, generalised and bi-level diversity problems as special cases. We introduce two exact cutting plane algorithms to solve this class of optimisation problems. The new algorithms remove the need for a concave reformulation, which is known to significantly slow down convergence. We establish exactness of the new algorithms by examining the concavity of the quadratic objective in a given direction, a concept we refer to as directional concavity. Numerical results show that the algorithms outperform other exact methods for benchmark diversity problems (capacitated, generalised and bi-level), and can easily solve problems of up to three thousand variables.
△ Less
Submitted 17 September, 2023;
originally announced September 2023.
-
D2S: Representing sparse descriptors and 3D coordinates for camera relocalization
Authors:
Bach-Thuan Bui,
Huy-Hoang Bui,
Dinh-Tuan Tran,
Joo-Ho Lee
Abstract:
State-of-the-art visual localization methods mostly rely on complex procedures to match local descriptors and 3D point clouds. However, these procedures can incur significant costs in terms of inference, storage, and updates over time. In this study, we propose a direct learning-based approach that utilizes a simple network named D2S to represent complex local descriptors and their scene coordinat…
▽ More
State-of-the-art visual localization methods mostly rely on complex procedures to match local descriptors and 3D point clouds. However, these procedures can incur significant costs in terms of inference, storage, and updates over time. In this study, we propose a direct learning-based approach that utilizes a simple network named D2S to represent complex local descriptors and their scene coordinates. Our method is characterized by its simplicity and cost-effectiveness. It solely leverages a single RGB image for localization during the testing phase and only requires a lightweight model to encode a complex sparse scene. The proposed D2S employs a combination of a simple loss function and graph attention to selectively focus on robust descriptors while disregarding areas such as clouds, trees, and several dynamic objects. This selective attention enables D2S to effectively perform a binary-semantic classification for sparse descriptors. Additionally, we propose a simple outdoor dataset to evaluate the capabilities of visual localization methods in scene-specific generalization and self-updating from unlabeled observations. Our approach outperforms the previous regression-based methods in both indoor and outdoor environments. It demonstrates the ability to generalize beyond training data, including scenarios involving transitions from day to night and adapting to domain shifts. The source code, trained models, dataset, and demo videos are available at the following link: https://thpjp.github.io/d2s.
△ Less
Submitted 22 October, 2024; v1 submitted 27 July, 2023;
originally announced July 2023.
-
Elasto-plastic large deformation analysis of multi-patch thin shells by isogeometric approach
Authors:
Giang Huynh,
Xiaoying Zhuang,
Hoang-Giang Bui,
G. Meschke,
Hung Nguyen-Xuan
Abstract:
This paper studies elasto-plastic large deformation behavior of thin shell structures using the isogeometric computational approach with the main focus on the efficiency in modelling the multi-patches and arbitrary material formulations. In terms of modelling, we employ the bending strip method to connect the patches in the structure. The incorporation of bending strips allows to eliminate the str…
▽ More
This paper studies elasto-plastic large deformation behavior of thin shell structures using the isogeometric computational approach with the main focus on the efficiency in modelling the multi-patches and arbitrary material formulations. In terms of modelling, we employ the bending strip method to connect the patches in the structure. The incorporation of bending strips allows to eliminate the strict demand of the C1 continuity condition, which is postulated in the Kirchhoff-Love theory for thin shell, and therefore it enables us to use the standard multi-patch structure even with C0 continuity along the patch boundaries. Furthermore, arbitrary nonlinear material models such as hyperelasticity and finite strain plasticity are embedded in the shell formulation, from which a unified thin shell formulation can be achieved. In terms of analysis, the Bezier decomposition concept is used to retain the local support of the traditional finite element. The performance of the presented approach is verified through several numerical benchmarks.
△ Less
Submitted 11 July, 2023;
originally announced July 2023.
-
Application of Multivariate Selective Bandwidth Kernel Density Estimation for Data Correction
Authors:
Hai Bui,
Mostafa Bakhoday-Paskyabi
Abstract:
This paper presents an intuitive application of multivariate kernel density estimation (KDE) for data correction. The method utilizes the expected value of the conditional probability density function (PDF) and a credible interval to quantify correction uncertainty. A selective KDE factor is proposed to adjust both kernel size and shape, determined through least-squares cross-validation (LSCV) or…
▽ More
This paper presents an intuitive application of multivariate kernel density estimation (KDE) for data correction. The method utilizes the expected value of the conditional probability density function (PDF) and a credible interval to quantify correction uncertainty. A selective KDE factor is proposed to adjust both kernel size and shape, determined through least-squares cross-validation (LSCV) or mean conditional squared error (MCSE) criteria. The selective bandwidth method can be used in combination with the adaptive method to potentially improve accuracy. Two examples, involving a hypothetical dataset and a realistic dataset, demonstrate the efficacy of the method. The selective bandwidth methods consistently outperform non-selective methods, while the adaptive bandwidth methods improve results for the hypothetical dataset but not for the realistic dataset. The MCSE criterion minimizes root mean square error but may yield under-smoothed distributions, whereas the LSCV criterion strikes a balance between PDF fitness and low RMSE.
△ Less
Submitted 28 June, 2023;
originally announced June 2023.
-
Guided Sampling-Based Motion Planning with Dynamics in Unknown Environments
Authors:
Abhish Khanal,
Hoang-Dung Bui,
Gregory J. Stein,
Erion Plaku
Abstract:
Despite recent progress improving the efficiency and quality of motion planning, planning collision-free and dynamically-feasible trajectories in partially-mapped environments remains challenging, since constantly replanning as unseen obstacles are revealed during navigation both incurs significant computational expense and can introduce problematic oscillatory behavior. To improve the quality of…
▽ More
Despite recent progress improving the efficiency and quality of motion planning, planning collision-free and dynamically-feasible trajectories in partially-mapped environments remains challenging, since constantly replanning as unseen obstacles are revealed during navigation both incurs significant computational expense and can introduce problematic oscillatory behavior. To improve the quality of motion planning in partial maps, this paper develops a framework that augments sampling-based motion planning to leverage a high-level discrete layer and prior solutions to guide motion-tree expansion during replanning, affording both (i) faster planning and (ii) improved solution coherence. Our framework shows significant improvements in runtime and solution distance when compared with other sampling-based motion planners.
△ Less
Submitted 15 June, 2023;
originally announced June 2023.
-
ViMQ: A Vietnamese Medical Question Dataset for Healthcare Dialogue System Development
Authors:
Ta Duc Huy,
Nguyen Anh Tu,
Tran Hoang Vu,
Nguyen Phuc Minh,
Nguyen Phan,
Trung H. Bui,
Steven Q. H. Truong
Abstract:
Existing medical text datasets usually take the form of question and answer pairs that support the task of natural language generation, but lacking the composite annotations of the medical terms. In this study, we publish a Vietnamese dataset of medical questions from patients with sentence-level and entity-level annotations for the Intent Classification and Named Entity Recognition tasks. The tag…
▽ More
Existing medical text datasets usually take the form of question and answer pairs that support the task of natural language generation, but lacking the composite annotations of the medical terms. In this study, we publish a Vietnamese dataset of medical questions from patients with sentence-level and entity-level annotations for the Intent Classification and Named Entity Recognition tasks. The tag sets for two tasks are in medical domain and can facilitate the development of task-oriented healthcare chatbots with better comprehension of queries from patients. We train baseline models for the two tasks and propose a simple self-supervised training strategy with span-noise modelling that substantially improves the performance. Dataset and code will be published at https://github.com/tadeephuy/ViMQ
△ Less
Submitted 27 April, 2023;
originally announced April 2023.
-
A note on the zeros of the derivatives of Hardy's function $Z(t)$
Authors:
Hung M. Bui,
R. R. Hall
Abstract:
Using the twisted fourth moment of the Riemann zeta-function we study large gaps between consecutive zeros of the derivatives of Hardy's function $Z(t)$, improving upon previous results of Conrey and Ghosh [J. London Math. Soc. 32 (1985), 193--202], and of the second named author [Acta Arith. 111 (2004), 125--140]. We also exhibit small distances between the zeros of $Z(t)$ and the zeros of…
▽ More
Using the twisted fourth moment of the Riemann zeta-function we study large gaps between consecutive zeros of the derivatives of Hardy's function $Z(t)$, improving upon previous results of Conrey and Ghosh [J. London Math. Soc. 32 (1985), 193--202], and of the second named author [Acta Arith. 111 (2004), 125--140]. We also exhibit small distances between the zeros of $Z(t)$ and the zeros of $Z^{(2k)}(t)$ for every $k\in\mathbb{N}$, in support of our numerical observation that the zeros of $Z^{(k)}(t)$ and $Z^{(\ell)}(t)$, when $k$ and $\ell$ have the same parity, seem to come in pairs which are very close to each other. The latter result is obtained using the mollified discrete second moment of the Riemann zeta-function.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
On the derivatives of Hardy's function $Z(t)$
Authors:
Hung M. Bui,
R. R. Hall
Abstract:
Let $Z^{(k)}(t)$ be the $k$-th derivative of Hardy's $Z$-function. The numerics seem to suggest that if $k$ and $\ell$ have the same parity, then the zeros of $Z^{(k)}(t)$ and $Z^{(\ell)}(t)$ come in pairs which are very close to each other. That is to say that $Z^{(k)}(t)Z^{(\ell)}(t)$ has constant sign for the majority, if not almost all, of values $t$. In this paper we show that this is true a…
▽ More
Let $Z^{(k)}(t)$ be the $k$-th derivative of Hardy's $Z$-function. The numerics seem to suggest that if $k$ and $\ell$ have the same parity, then the zeros of $Z^{(k)}(t)$ and $Z^{(\ell)}(t)$ come in pairs which are very close to each other. That is to say that $Z^{(k)}(t)Z^{(\ell)}(t)$ has constant sign for the majority, if not almost all, of values $t$. In this paper we show that this is true a positive proportion of times. We also study the sign of the product of four derivatives of Hardy's function, $Z^{(k)}(t)Z^{(\ell)}(t)Z^{(m)}(t)Z^{(n)}(t)$.
△ Less
Submitted 11 April, 2023;
originally announced April 2023.
-
Negative moments of the Riemann zeta-function
Authors:
Hung M. Bui,
Alexandra Florea
Abstract:
Assuming the Riemann Hypothesis we study negative moments of the Riemann zeta-function and obtain asymptotic formulas in certain ranges of the shift in $ζ(s)$. For example, integrating $|ζ(1/2+α+it)|^{-2k}$ with respect to $t$ from $T$ to $2T$, we obtain an asymptotic formula when the shift $α$ is roughly bigger than $\frac{1}{\log T}$ and $k < 1/2$. We also obtain non-trivial upper bounds for muc…
▽ More
Assuming the Riemann Hypothesis we study negative moments of the Riemann zeta-function and obtain asymptotic formulas in certain ranges of the shift in $ζ(s)$. For example, integrating $|ζ(1/2+α+it)|^{-2k}$ with respect to $t$ from $T$ to $2T$, we obtain an asymptotic formula when the shift $α$ is roughly bigger than $\frac{1}{\log T}$ and $k < 1/2$. We also obtain non-trivial upper bounds for much smaller shifts, as long as $\log\frac{1}α \ll \log \log T$. This provides partial progress towards a conjecture of Gonek on negative moments of the Riemann zeta-function, and settles the conjecture in certain ranges. As an application, we also obtain an upper bound for the average of the generalized Möbius function.
△ Less
Submitted 14 February, 2023;
originally announced February 2023.