-
Interpretable contour level selection for heat maps for gridded data
Authors:
Tarn Duong
Abstract:
Gridded data formats, where the observed multivariate data are aggregated into grid cells, ensure confidentiality and reduce storage requirements, with the trade-off that access to the underlying point data is lost. Heat maps are a highly pertinent visualisation for gridded data, and heat maps with a small number of well-selected contour levels offer improved interpretability over continuous contour levels. There are many possible contour level choices. Amongst them, density contour levels are highly suitable in many cases, and their probabilistic interpretation forms a rigorous statistical basis for further quantitative data analyses. Current methods for computing density contour levels require access to the observed point data, so they are not applicable to gridded data. To remedy this, we introduce an approximation of density contour levels for gridded data. We then compare our proposed method to existing contour level selection methods, and conclude that our proposal provides improved interpretability for synthetic and experimental gridded data.
Submitted 22 May, 2025;
originally announced May 2025.
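To make the notion of a density contour level on gridded data concrete, the sketch below computes levels whose upper level sets hold a given probability mass, using only cell counts. It is a generic highest-density-region calculation on a piecewise-constant density, not necessarily the approximation proposed in the paper; the function name `grid_density_levels` and its parameters are hypothetical.

```python
import numpy as np

def grid_density_levels(counts, cell_area, probs=(0.25, 0.5, 0.75)):
    """Approximate density contour levels from gridded counts.

    For each probability p, return the largest density level whose upper
    level set (cells with density >= level) holds at least mass p.
    Assumption: the density is treated as constant within each cell.
    """
    counts = np.asarray(counts, dtype=float)
    density = counts / (counts.sum() * cell_area)  # piecewise-constant estimate
    flat = np.sort(density.ravel())[::-1]          # cell densities, highest first
    mass = np.cumsum(flat * cell_area)             # mass of the top-k cells
    levels = {}
    for p in probs:
        # First cell at which the accumulated mass reaches p
        # (small tolerance guards against floating-point rounding).
        idx = np.searchsorted(mass, p - 1e-12)
        levels[p] = flat[min(idx, flat.size - 1)]
    return levels

levels = grid_density_levels([[1, 1], [2, 4]], cell_area=1.0)
```

Each returned level can be passed as a contour break to a heat-map routine, so that the region above the level for `p = 0.5` is read as "the smallest set of cells containing about half of the data".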
-
IQBench: How "Smart" Are Vision-Language Models? A Study with Human IQ Tests
Authors:
Tan-Hanh Pham,
Phu-Vinh Nguyen,
Dang The Hung,
Bui Trong Duong,
Vu Nguyen Thanh,
Chris Ngo,
Tri Quang Truong,
Truong-Son Hy
Abstract:
Although large Vision-Language Models (VLMs) have demonstrated remarkable performance in a wide range of multimodal tasks, their true reasoning capabilities on human IQ tests remain underexplored. To advance research on the fluid intelligence of VLMs, we introduce **IQBench**, a new benchmark designed to evaluate VLMs on standardized visual IQ tests. We focus on evaluating the reasoning capabilities of VLMs, which we argue are more important than the accuracy of the final prediction. **Our benchmark is visually centric, minimizing the dependence on unnecessary textual content**, thus encouraging models to derive answers primarily from image-based information rather than learned textual knowledge. To this end, we manually collected and annotated 500 visual IQ questions to **prevent unintentional data leakage during training**. Unlike prior work that focuses primarily on the accuracy of the final answer, we evaluate the reasoning ability of the models by assessing their explanations and the patterns used to solve each problem, along with the accuracy of the final prediction and human evaluation. Our experiments show that there are substantial performance disparities between tasks, with models such as `o4-mini`, `gemini-2.5-flash`, and `claude-3.7-sonnet` achieving the highest average accuracies of 0.615, 0.578, and 0.548, respectively. However, all models struggle with 3D spatial and anagram reasoning tasks, highlighting significant limitations in current VLMs' general reasoning abilities. In terms of reasoning scores, `o4-mini`, `gemini-2.5-flash`, and `claude-3.7-sonnet` achieved top averages of 0.696, 0.586, and 0.516, respectively. These results highlight inconsistencies between the reasoning processes of the models and their final answers, emphasizing the importance of evaluating the accuracy of the reasoning in addition to the final predictions.
Submitted 17 May, 2025;
originally announced May 2025.
-
Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM
Authors:
Thang Duong,
Minglai Yang,
Chicheng Zhang
Abstract:
We investigate the use of Large Language Models (LLMs) in collecting high-quality data to warm-start Reinforcement Learning (RL) algorithms for learning in some classical Markov Decision Process (MDP) environments. In this work, we focus on using an LLM to generate an off-policy dataset that sufficiently covers the state-actions visited by optimal policies, and then using an RL algorithm to explore the environment and improve the policy suggested by the LLM. Our algorithm, LORO, can both converge to an optimal policy and achieve high sample efficiency thanks to the LLM's good starting policy. On multiple OpenAI Gym environments, such as CartPole and Pendulum, we empirically demonstrate that LORO outperforms baseline algorithms such as pure LLM-based policies, pure RL, and a naive combination of the two, achieving up to $4 \times$ the cumulative rewards of the pure RL baseline.
Submitted 16 May, 2025;
originally announced May 2025.
-
Learned IMU Bias Prediction for Invariant Visual Inertial Odometry
Authors:
Abdullah Altawaitan,
Jason Stanley,
Sambaran Ghosal,
Thai Duong,
Nikolay Atanasov
Abstract:
Autonomous mobile robots operating in novel environments depend critically on accurate state estimation, often utilizing visual and inertial measurements. Recent work has shown that an invariant formulation of the extended Kalman filter improves the convergence and robustness of visual-inertial odometry by utilizing the Lie group structure of a robot's position, velocity, and orientation states. However, inertial sensors also require measurement bias estimation, yet introducing the bias in the filter state breaks the Lie group symmetry. In this paper, we design a neural network to predict the bias of an inertial measurement unit (IMU) from a sequence of previous IMU measurements. This allows us to use an invariant filter for visual inertial odometry, relying on the learned bias prediction rather than introducing the bias in the filter state. We demonstrate that an invariant multi-state constraint Kalman filter (MSCKF) with learned bias predictions achieves robust visual-inertial odometry in real experiments, even when visual information is unavailable for extended periods and the system needs to rely solely on IMU measurements.
Submitted 10 May, 2025;
originally announced May 2025.
-
Qimax: Efficient quantum simulation via GPU-accelerated extended stabilizer formalism
Authors:
Vu Tuan Hai,
Bui Cao Doanh,
Le Vu Trung Duong,
Pham Hoai Luan,
Yasuhiko Nakashima
Abstract:
Simulating Clifford and near-Clifford circuits using the extended stabilizer formalism has become increasingly popular, particularly in quantum error correction. Compared to the state-vector approach, the extended stabilizer formalism can solve the same problems with fewer computational resources, as it operates on stabilizers rather than full state vectors. Most existing studies on near-Clifford circuits focus on balancing the trade-off between the number of ancilla qubits and simulation accuracy, often overlooking performance considerations. Furthermore, in the presence of high-rank stabilizers, performance is limited by the sequential property of the stabilizer formalism. In this work, we introduce a parallelized version of the extended stabilizer formalism, enabling efficient execution on multi-core devices such as GPUs. Experimental results demonstrate that, in certain scenarios, our Python-based implementation outperforms state-of-the-art simulators such as Qiskit and Pennylane.
Submitted 6 May, 2025;
originally announced May 2025.
-
AI-based CSI Feedback with Digital Twins: Real-World Validation and Insights
Authors:
Tzu-Hao Huang,
Chao-Kai Wen,
Shang-Ho Tsai,
Trung Q. Duong
Abstract:
Deep learning (DL) has shown great potential for enhancing channel state information (CSI) feedback in multiple-input multiple-output (MIMO) communication systems, a subject currently under study by the 3GPP standards body. Digital twins (DTs) have emerged as an effective means to generate site-specific datasets for training DL-based CSI feedback models. However, most existing studies rely solely on simulations, leaving the effectiveness of DTs in reducing DL training costs yet to be validated through realistic experimental setups. This paper addresses this gap by establishing a real-world (RW) environment and corresponding virtual channels using ray tracing with replicated 3D models and accurate antenna properties. We evaluate whether models trained in DT environments can effectively operate in RW scenarios and quantify the benefits of online learning (OL) for performance enhancement. Results show that a dedicated DT remains essential even with OL to achieve satisfactory performance in RW scenarios.
Submitted 2 May, 2025; v1 submitted 1 May, 2025;
originally announced May 2025.
-
The Cauchy--Szegö Projection for domains in $\mathbb C^n$ with minimal smoothness: weighted theory
Authors:
Xuan Thinh Duong,
Loredana Lanzani,
Ji Li,
Brett D. Wick
Abstract:
Let $D\subset\mathbb C^n$ be a bounded, strongly pseudoconvex domain whose boundary $bD$ satisfies the minimal regularity condition of class $C^2$. A 2017 result of Lanzani & Stein states that the Cauchy--Szegö projection $S_ω$, defined with respect to a bounded, positive continuous multiple $ω$ of induced Lebesgue measure, maps $L^p(bD, ω)$ to $L^p(bD, ω)$ continuously for any $1<p<\infty$. Here we show that $S_ω$ satisfies explicit quantitative bounds in $L^p(bD, Ω)$, for any $1<p<\infty$ and for any $Ω$ in the maximal class of $A_p$-measures, that is, for $Ω_p = ψ_pσ$ where $ψ_p$ is a Muckenhoupt $A_p$-weight and $σ$ is the induced Lebesgue measure (with $ω$'s as above being a sub-class). Earlier results rely upon an asymptotic expansion and subsequent pointwise estimates of the Cauchy--Szegö kernel, but these are unavailable in our setting of minimal regularity of $bD$; at the same time, more recent techniques that allow one to handle domains with minimal regularity (Lanzani--Stein 2017) are not applicable to $A_p$-measures. It turns out that the method of quantitative extrapolation is an appropriate replacement for the missing tools. To finish, we identify a class of holomorphic Hardy spaces defined with respect to $A_p$-measures for which a meaningful notion of Cauchy--Szegö projection can be defined when $p=2$.
Submitted 24 April, 2025;
originally announced April 2025.
-
Hardy spaces and Campanato spaces associated with Laguerre expansions and higher order Riesz transforms
Authors:
The Anh Bui,
Xuan Thinh Duong
Abstract:
Let \(\mathcal{L}_ν\) be the Laguerre differential operator which is the self-adjoint extension of the differential operator \[ L_ν:= \sum_{i=1}^n \left[-\frac{\partial^2}{\partial x_i^2} + x_i^2 + \frac{1}{x_i^2} \left(ν_i^2 - \frac{1}{4} \right) \right] \] initially defined on \(C_c^\infty(\mathbb{R}_+^n)\) as its natural domain, where \(ν\in [-1/2,\infty)^n\), \(n \geq 1\). In this paper, we first develop the theory of Hardy spaces \(H^p_{\mathcal{L}_ν}\) associated with \(\mathcal{L}_ν\) for the full range \(p \in (0,1]\). Then we investigate the corresponding BMO-type spaces and establish that they coincide with the dual spaces of \(H^p_{\mathcal{L}_ν}\). Finally, we show boundedness of higher-order Riesz transforms on Lebesgue spaces, as well as on our new Hardy and BMO-type spaces.
Submitted 14 April, 2025;
originally announced April 2025.
-
Parameterized Attenuated Exchange for Generalized TDHF@$v_W$ Applications
Authors:
Barry Y. Li,
Tim Duong,
Tucker Allen,
Nadine C. Bradbury,
Justin R. Caram,
Daniel Neuhauser
Abstract:
Building upon our previously developed time-dependent Hartree-Fock (TDHF)@$v_W$ method, based on many-body perturbation theory and specifically the Bethe-Salpeter Equation (BSE), we introduce a parameterization scheme for the attenuated exchange kernel, $v_W(|r - r'|)$. In the original method, $v_W$ was determined individually for each system via an efficient stochastic short-time TD Hartree propagation for the screened Coulomb interaction, $W(r,r')$. The new parameterization leverages photochemical similarities in exciton binding energies (or exchange interaction attenuation) among molecules with comparable static dielectric responses. We parameterize the inverse dielectric function using a low-order polynomial with error function apodization, calibrated on a few representative molecules, each with its own $v_W$. Using only 7 parameters, the parameterized $v_W$ is fully grid-independent and broadly applicable within a family of molecules. This enables TDHF@$v_W$ that retains BSE-level accuracy, achieving a mean absolute error of $\sim0.1$ eV compared to experimental optical gaps and representing a five- to ten-fold improvement over conventional TD density functional theory or TDHF while reducing the cost to that of standard TDHF.
Submitted 1 April, 2025;
originally announced April 2025.
-
Geo2ComMap: Deep Learning-Based MIMO Throughput Prediction Using Geographic Data
Authors:
Fan-Hao Lin,
Tzu-Hao Huang,
Chao-Kai Wen,
Trung Q. Duong
Abstract:
Accurate communication performance prediction is crucial for wireless applications such as network deployment and resource management. Unlike conventional systems with a single transmit and receive antenna, throughput (Tput) estimation in antenna array-based multiple-input multiple-output (MIMO) systems is computationally intensive, as it requires analysis of channel matrices, rank conditions, and spatial channel quality. These calculations impose significant computational and time burdens. This paper introduces Geo2ComMap, a deep learning-based framework that leverages geographic databases to efficiently estimate multiple communication metrics across an entire area in MIMO systems using only sparse measurements. To mitigate extreme prediction errors, we propose a sparse sampling strategy. Extensive evaluations demonstrate that Geo2ComMap accurately predicts full-area communication metrics, achieving a median absolute error of 27.35 Mbps for Tput values ranging from 0 to 1900 Mbps.
Submitted 31 March, 2025;
originally announced April 2025.
-
Efficient Plane-Wave Approach to Generalized Kohn-Sham Density-Functional Theory of Solids with Mixed Deterministic/Stochastic Exchange
Authors:
Tucker Allen,
Barry Y. Li,
Tim Duong,
Daniel Neuhauser
Abstract:
An efficient mixed deterministic/sparse-stochastic plane-wave approach is developed for bandstructure calculations of large supercell periodic generalized-Kohn-Sham density functional theory, for any hybrid-exchange density functional. The method works for very large elementary cells and supercells, and we benchmark it on covalently bonded solids and molecular crystals with nonbonded interactions, for supercells of up to 33,000 atoms. Memory and CPU requirements scale with supercell size quasi-linearly.
Submitted 10 March, 2025;
originally announced March 2025.
-
Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations
Authors:
Khoi Anh Nguyen,
Linh Yen Vu,
Thang Dinh Duong,
Thuan Nguyen Duong,
Huy Thanh Nguyen,
Vinh Quang Dinh
Abstract:
Visual Question Answering (VQA) is a multimodal task requiring reasoning across textual and visual inputs, which becomes particularly challenging in low-resource languages like Vietnamese due to linguistic variability and the lack of high-quality datasets. Traditional methods often rely heavily on extensive annotated datasets, computationally expensive pipelines, and large pre-trained models, particularly in the domain of Vietnamese VQA, limiting their applicability in such scenarios. To address these limitations, we propose a training framework that combines a paraphrase-based feature augmentation module with a dynamic curriculum learning strategy. Explicitly, augmented samples are considered "easy" while raw samples are regarded as "hard". The framework then utilizes a mechanism that dynamically adjusts the ratio of easy to hard samples during training, progressively modifying the same dataset to increase its difficulty level. By enabling gradual adaptation to task complexity, this approach helps the Vietnamese VQA model generalize well, thus improving overall performance. Experimental results show consistent improvements on the OpenViVQA dataset and mixed outcomes on the ViVQA dataset, highlighting both the potential and challenges of our approach in advancing VQA for the Vietnamese language.
Submitted 6 March, 2025; v1 submitted 5 March, 2025;
originally announced March 2025.
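The easy-to-hard ratio adjustment described in this abstract can be pictured with a small scheduling sketch. The linear schedule, the 0.8 and 0.2 endpoints, and the names `easy_fraction` and `sample_batch` are all assumptions made for illustration; the abstract does not specify the actual mechanism.

```python
import random

def easy_fraction(epoch, total_epochs, start=0.8, end=0.2):
    """Share of easy (augmented) samples, decayed linearly over training.

    Assumption: a linear schedule from `start` to `end`.
    """
    if total_epochs <= 1:
        return end
    # Clamp the epoch into range, then map it to t in [0, 1].
    t = min(max(epoch, 0), total_epochs - 1) / (total_epochs - 1)
    return start + (end - start) * t

def sample_batch(easy, hard, batch_size, epoch, total_epochs, rng=random):
    """Draw a batch mixing easy and hard (raw) samples at the scheduled ratio."""
    n_easy = min(round(batch_size * easy_fraction(epoch, total_epochs)), batch_size)
    # Sampling is with replacement from each pool.
    batch = rng.choices(easy, k=n_easy) + rng.choices(hard, k=batch_size - n_easy)
    rng.shuffle(batch)
    return batch

rng = random.Random(0)
first = sample_batch(["easy"] * 50, ["hard"] * 50, 10, epoch=0, total_epochs=5, rng=rng)
last = sample_batch(["easy"] * 50, ["hard"] * 50, 10, epoch=4, total_epochs=5, rng=rng)
```

With these assumed endpoints, early batches are dominated by augmented samples and late batches by raw ones, which is the sense in which the same dataset becomes progressively harder.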
-
SemViQA: A Semantic Question Answering System for Vietnamese Information Fact-Checking
Authors:
Dien X. Tran,
Nam V. Nguyen,
Thanh T. Tran,
Anh T. Hoang,
Tai V. Duong,
Di T. Le,
Phuc-Lu Le
Abstract:
The rise of misinformation, exacerbated by Large Language Models (LLMs) like GPT and Gemini, demands robust fact-checking solutions, especially for low-resource languages like Vietnamese. Existing methods struggle with semantic ambiguity, homonyms, and complex linguistic structures, often trading accuracy for efficiency. We introduce SemViQA, a novel Vietnamese fact-checking framework integrating Semantic-based Evidence Retrieval (SER) and Two-step Verdict Classification (TVC). Our approach balances precision and speed, achieving state-of-the-art results with 78.97% strict accuracy on ISE-DSC01 and 80.82% on ViWikiFC, securing 1st place in the UIT Data Science Challenge. Additionally, SemViQA Faster improves inference speed 7x while maintaining competitive accuracy. SemViQA sets a new benchmark for Vietnamese fact verification, advancing the fight against misinformation. The source code is available at: https://github.com/DAVID-NGUYEN-S16/SemViQA.
Submitted 11 May, 2025; v1 submitted 2 March, 2025;
originally announced March 2025.
-
Parallelizing the stabilizer formalism for quantum machine learning applications
Authors:
Vu Tuan Hai,
Le Vu Trung Duong,
Pham Hoai Luan,
Yasuhiko Nakashima
Abstract:
Quantum machine learning is emerging as a new paradigm that merges quantum computing and machine learning. Simulating very deep quantum machine learning models requires substantial resources, which grow exponentially with the number of qubits and polynomially with the depth. Almost all related works use state-vector-based simulators due to their parallelization and scalability. Extended stabilizer formalism simulators solve the same problem with fewer computations because they act on stabilizers rather than long vectors. However, the sequential nature of gate application has made them less popular and slower in practice. In this work, we parallelize the process, making it feasible to deploy on multi-core devices. The results show that the proposed Python implementation is 4.23 times faster than Qiskit, the current fastest simulator, in the case of 4 qubits and 60.2K gates.
Submitted 15 February, 2025;
originally announced February 2025.
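For readers unfamiliar with what "acting on stabilizers rather than long vectors" means, the sketch below applies Hadamard and CNOT gates to a stabilizer tableau using the standard Aaronson-Gottesman update rules. It covers only the plain (Clifford) stabilizer formalism, not the extended formalism or the authors' implementation; the class name `StabilizerTableau` is hypothetical. Each gate updates every stabilizer row independently, which NumPy expresses here as a column-wise vector operation.

```python
import numpy as np

class StabilizerTableau:
    """Stabilizer generators of an n-qubit state, stored as bit matrices.

    Row i encodes one Pauli generator: x[i, q] and z[i, q] flag an X or Z
    factor on qubit q, and r[i] is the sign bit (1 means a minus sign).
    """

    def __init__(self, n):
        self.n = n
        self.x = np.zeros((n, n), dtype=np.uint8)
        self.z = np.eye(n, dtype=np.uint8)   # |0...0> is stabilized by Z_1..Z_n
        self.r = np.zeros(n, dtype=np.uint8)

    def h(self, a):
        """Hadamard on qubit a: swap the X and Z columns, flip sign where Y."""
        self.r ^= self.x[:, a] & self.z[:, a]
        self.x[:, a], self.z[:, a] = self.z[:, a].copy(), self.x[:, a].copy()

    def cnot(self, a, b):
        """CNOT with control a and target b, applied to all rows at once."""
        self.r ^= self.x[:, a] & self.z[:, b] & (self.x[:, b] ^ self.z[:, a] ^ 1)
        self.x[:, b] ^= self.x[:, a]
        self.z[:, a] ^= self.z[:, b]

# Prepare a Bell pair; its stabilizer generators are XX and ZZ.
t = StabilizerTableau(2)
t.h(0)
t.cnot(0, 1)
```

The tableau needs on the order of n^2 bits, compared with 2^n amplitudes for a state vector, and the per-row independence of each update is what leaves room for parallel execution.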
-
Lightweight Authenticated Task Offloading in 6G-Cloud Vehicular Twin Networks
Authors:
Sarah Al-Shareeda,
Fusun Ozguner,
Keith Redmill,
Trung Q. Duong,
Berk Canberk
Abstract:
Task offloading management in 6G vehicular networks is crucial for maintaining network efficiency, particularly as vehicles generate substantial data. Integrating secure communication through authentication introduces additional computational and communication overhead, significantly impacting offloading efficiency and latency. This paper presents a unified framework incorporating lightweight Identity-Based Cryptographic (IBC) authentication into task offloading within cloud-based 6G Vehicular Twin Networks (VTNs). Utilizing Proximal Policy Optimization (PPO) in Deep Reinforcement Learning (DRL), our approach optimizes authenticated offloading decisions to minimize latency and enhance resource allocation. Performance evaluation under varying network sizes, task sizes, and data rates reveals that IBC authentication can reduce offloading efficiency by up to 50% due to the added overhead. Moreover, increasing network size and task size can further reduce offloading efficiency by up to 91.7%. As a countermeasure, increasing the transmission data rate can improve the offloading performance by as much as 63%, even in the presence of authentication overhead. The code for the simulations and experiments detailed in this paper is available on GitHub for further reference and reproducibility [1].
Submitted 5 February, 2025;
originally announced February 2025.
-
On subordinated semigroups and Hardy spaces associated to fractional powers of operators
Authors:
The Anh Bui,
Michael G. Cowling,
Xuan Thinh Duong
Abstract:
Let $L$ be a positive self-adjoint operator on $L^2(X)$, where $X$ is a $σ$-finite metric measure space. When $α\in (0,1)$, the subordinated semigroup $\{\exp(-tL^α):t \in \mathbb{R}^+\}$ can be defined on $L^2(X)$ and extended to $L^p(X)$. We prove various results about the semigroup $\{\exp(-tL^α):t \in \mathbb{R}^+\}$, under different assumptions on $L$. These include the weak type $(1,1)$ boundedness of the maximal operator $f \mapsto \sup _{t\in \mathbb{R}^+}\exp(-tL^α)f$ and characterisations of Hardy spaces associated to the operator $L$ by the area integral and vertical square function.
Submitted 3 February, 2025;
originally announced February 2025.
-
Beyond Task Diversity: Provable Representation Transfer for Sequential Multi-Task Linear Bandits
Authors:
Thang Duong,
Zhi Wang,
Chicheng Zhang
Abstract:
We study lifelong learning in linear bandits, where a learner interacts with a sequence of linear bandit tasks whose parameters lie in an $m$-dimensional subspace of $\mathbb{R}^d$, thereby sharing a low-rank representation. Current literature typically assumes that the tasks are diverse, i.e., their parameters uniformly span the $m$-dimensional subspace. This assumption allows the low-rank representation to be learned before all tasks are revealed, which can be unrealistic in real-world applications. In this work, we present the first nontrivial result for sequential multi-task linear bandits without the task diversity assumption. We develop an algorithm that efficiently learns and transfers low-rank representations. When facing $N$ tasks, each played over $τ$ rounds, our algorithm achieves a regret guarantee of $\tilde{O}\big(Nm\sqrt{τ} + N^{\frac{2}{3}} τ^{\frac{2}{3}} d m^{\frac{1}{3}} + Nd^2 + τm d\big)$ under the ellipsoid action set assumption. This result can significantly improve upon the baseline of $\tilde{O}\big(Nd\sqrt{τ}\big)$ that does not leverage the low-rank structure when the number of tasks $N$ is sufficiently large and $m \ll d$. We also demonstrate empirically on synthetic data that our algorithm outperforms baseline algorithms, which rely on the task diversity assumption.
Submitted 23 January, 2025;
originally announced January 2025.
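To see when the stated guarantee beats the baseline, the two expressions can be evaluated numerically. The sketch below drops the constants and logarithmic factors hidden by the $\tilde{O}$ notation, so it indicates only the trend; the function names and the sample values of $N$, $τ$, $d$ and $m$ are chosen purely for illustration.

```python
import math

def low_rank_bound(N, tau, d, m):
    """N*m*sqrt(tau) + N^(2/3)*tau^(2/3)*d*m^(1/3) + N*d^2 + tau*m*d.

    Constants and log factors from the O-tilde notation are omitted.
    """
    return (N * m * math.sqrt(tau)
            + N ** (2 / 3) * tau ** (2 / 3) * d * m ** (1 / 3)
            + N * d ** 2
            + tau * m * d)

def baseline_bound(N, tau, d):
    """N*d*sqrt(tau): the bound that ignores the low-rank structure."""
    return N * d * math.sqrt(tau)

# Illustrative values with m much smaller than d.
d, m, tau = 20, 2, 10_000
for N in (10, 10_000):
    better = low_rank_bound(N, tau, d, m) < baseline_bound(N, tau, d)
    print(f"N={N}: low-rank expression smaller than baseline? {better}")
```

For a small number of tasks the one-off $τmd$ term dominates and the baseline expression is smaller, while for large $N$ the low-rank expression wins, matching the "sufficiently large $N$ and $m \ll d$" condition in the abstract.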
-
Investigating Market Strength Prediction with CNNs on Candlestick Chart Images
Authors:
Thanh Nam Duong,
Trung Kien Hoang,
Quoc Khanh Duong,
Quoc Dat Dinh,
Duc Hoan Le,
Huy Tuan Nguyen,
Xuan Bach Nguyen,
Quy Ban Tran
Abstract:
This paper investigates predicting market strength solely from candlestick chart images to assist investment decisions. The core research problem is developing an effective computer vision-based model using raw candlestick visuals without time-series data. We specifically analyze the impact of incorporating candlestick patterns detected by YOLOv8. The study implements two approaches: a pure CNN on chart images and a Decomposer architecture that detects patterns. Experiments utilize diverse financial datasets spanning stocks, cryptocurrencies, and forex assets. Our key finding is that detected candlestick patterns do not improve model performance over image data alone. This result highlights the limitations of candlestick image signals. Performance peaked at approximately 0.7 accuracy, below more complex time-series models. Outcomes reveal challenges in distilling sufficient predictive power from visual shapes alone, motivating the incorporation of other data modalities. This research clarifies how purely image-based models can inform trading while confirming that patterns add little value over raw charts. The paper is organized into distinct sections, each making its own contribution while remaining linked to the others. Note that the examples discussed are not limited to the scope, applicability, or knowledge outlined in the paper.
Submitted 21 January, 2025;
originally announced January 2025.
-
GenSC-6G: A Prototype Testbed for Integrated Generative AI, Quantum, and Semantic Communication
Authors:
Brian E. Arfeto,
Shehbaz Tariq,
Uman Khalid,
Trung Q. Duong,
Hyundong Shin
Abstract:
We introduce a prototyping testbed, GenSC-6G, developed to generate a comprehensive dataset that supports the integration of generative artificial intelligence (AI), quantum computing, and semantic communication for emerging sixth-generation (6G) applications. The GenSC-6G dataset is designed with noise-augmented synthetic data optimized for semantic decoding, classification, and localization tasks, significantly enhancing flexibility for diverse AI-driven communication applications. This adaptable prototype supports seamless modifications across baseline models, communication modules, and goal-oriented decoders. Case studies demonstrate its application in lightweight classification, semantic upsampling, and edge-based language inference under noise conditions. The GenSC-6G dataset serves as a scalable and robust resource for developing goal-oriented communication systems tailored to the growing demands of 6G networks.
Submitted 16 January, 2025;
originally announced January 2025.
-
A new rotation-free isogeometric thin shell formulation and a corresponding continuity constraint for patch boundaries
Authors:
Thang Xuan Duong,
Farshad Roohbakhshan,
Roger Andrew Sauer
Abstract:
This paper presents a general non-linear computational formulation for rotation-free thin shells based on isogeometric finite elements. It is a displacement-based formulation that admits general material models. The formulation allows for a wide range of constitutive laws, including both shell models that are extracted from existing 3D continua using numerical integration and those that are directly formulated in 2D manifold form, like the Koiter, Canham and Helfrich models. Further, a unified approach to enforce the $G^1$-continuity between patches, fix the angle between surface folds, enforce symmetry conditions and prescribe rotational Dirichlet boundary conditions, is presented using penalty and Lagrange multiplier methods. The formulation is fully described in the natural curvilinear coordinate system of the finite element description, which facilitates an efficient computational implementation. It contains existing isogeometric thin shell formulations as special cases. Several classical numerical benchmark examples are considered to demonstrate the robustness and accuracy of the proposed formulation. The presented constitutive models, in particular the simple mixed Koiter model that does not require any thickness integration, show excellent performance, even for large deformations.
Submitted 8 January, 2025;
originally announced January 2025.
-
Navigation Variable-based Multi-objective Particle Swarm Optimization for UAV Path Planning with Kinematic Constraints
Authors:
Thi Thuy Ngan Duong,
Duy-Nam Bui,
Manh Duong Phung
Abstract:
Path planning is essential for unmanned aerial vehicles (UAVs) as it determines the path that the UAV needs to follow to complete a task. This work addresses this problem by introducing a new algorithm called navigation variable-based multi-objective particle swarm optimization (NMOPSO). It first models path planning as an optimization problem via the definition of a set of objective functions that include optimality and safety requirements for UAV operation. The NMOPSO is then used to minimize those functions through Pareto optimal solutions. The algorithm features a new path representation based on navigation variables to include kinematic constraints and exploit the maneuverable characteristics of the UAV. It also includes an adaptive mutation mechanism to enhance the diversity of the swarm for better solutions. Comparisons with various algorithms have been carried out to benchmark the proposed approach. The results indicate that the NMOPSO performs better than not only other particle swarm optimization variants but also other state-of-the-art multi-objective and metaheuristic optimization algorithms. Experiments have also been conducted with real UAVs to confirm the validity of the approach for practical flights. The source code of the algorithm is available at https://github.com/ngandng/NMOPSO.
Submitted 3 January, 2025;
originally announced January 2025.
-
A finite strain model for fiber angle plasticity of textile fabrics based on isogeometric shell finite elements
Authors:
Thang Xuan Duong,
Roger Andrew Sauer
Abstract:
This work presents a shear elastoplasticity model for textile fabrics within the theoretical framework of anisotropic Kirchhoff-Love shells with bending of embedded fibers proposed by Duong et al. (2023). The plasticity model aims at capturing the rotational inter-ply frictional sliding between fiber families in textile composites undergoing large deformation. Such effects are usually dominant in dry textile fabrics such as woven and non-crimp fabrics. The model explicitly uses relative angles between fiber families as strain measures for the kinematics. The plasticity model is formulated directly with surface invariants without resorting to thickness integration. Motivated by experimental observations from the picture frame test, a yield function is proposed with isotropic hardening and a simple evolution equation. A classical return mapping algorithm is employed to solve the elastoplastic problem within the isogeometric finite shell element formulation of Duong et al. (2022). The verification of the implementation is facilitated by the analytical solution for the picture frame test. The proposed plasticity model is calibrated from the picture frame test and is then validated by the bias extension test, considering available experimental data for different samples from the literature. Good agreement between model prediction and experimental data is obtained. Finally, the applicability of the elastoplasticity model to 3D shell problems is demonstrated.
Submitted 5 May, 2025; v1 submitted 28 December, 2024;
originally announced December 2024.
-
Hardy spaces, Besov spaces and Triebel--Lizorkin spaces associated with a discrete Laplacian and applications
Authors:
The Anh Bui,
Xuan Thinh Duong
Abstract:
Consider the discrete Laplacian $Δ_d$ defined on the set of integers $\mathbb Z$ by
\[
Δ_d f(n) = -f(n+1) + 2f(n) -f(n-1), \ \ \ \ n\in \mathbb Z,
\]
where $f$ is a function defined on $\mathbb Z$. In this paper, we define Hardy spaces, Besov spaces and Triebel--Lizorkin spaces associated with $Δ_d$ and then show that these function spaces coincide with the classical function spaces defined on $\mathbb Z$. As applications, we prove the boundedness of the spectral multipliers and the Riesz transforms associated with $Δ_d$ on these function spaces.
Submitted 28 November, 2024;
originally announced November 2024.
-
Equivalence of Sobolev norms for Kolmogorov operators with scaling-critical drift
Authors:
The Anh Bui,
Xuan Thinh Duong,
Konstantin Merz
Abstract:
We consider the ordinary or fractional Laplacian plus a homogeneous, scaling-critical drift term. This operator is non-symmetric but homogeneous, and generates scales of $L^p$-Sobolev spaces which we compare with the ordinary homogeneous Sobolev spaces. Unlike in previous studies concerning Hardy operators, i.e., ordinary or fractional Laplacians plus scaling-critical scalar perturbations, handling the drift term requires an additional, possibly technical, restriction on the range of comparable Sobolev spaces, which is related to the unavailability of gradient bounds for the associated semigroup.
Submitted 30 September, 2024;
originally announced October 2024.
-
The Discovery of Giant Positive Magnetoresistance in Proximity to Helimagnetic Order in Manganese Phosphide Nanostructured Films
Authors:
Nivarthana W. Y. A. Y. Mudiyanselage,
Derick DeTellem,
Amit Chanda,
Anh Tuan Duong,
Tzung-En Hsieh,
Johannes Frisch,
Marcus Bär,
Richa Pokharel Madhogaria,
Shirin Mozaffari,
Hasitha Suriya Arachchige,
David Mandrus,
Hariharan Srikanth,
Sarath Witanachchi,
Manh-Huong Phan
Abstract:
The study of magnetoresistance (MR) phenomena has been pivotal in advancing magnetic sensors and spintronic devices. Helimagnets present an intriguing avenue for spintronics research. Theoretical predictions suggest that MR magnitude in the helimagnetic (HM) regime surpasses that in the ferromagnetic (FM) regime by over an order of magnitude. However, in metallic helimagnets like manganese phosphide, MR in the HM phase remains modest (10%), limiting its application in MR devices. Here, a groundbreaking approach is presented to achieve a giant low field MR effect in nanostructured manganese phosphide films by leveraging confinement and strain effects along with spin helicity. Unlike the modest MR observed in bulk manganese phosphide single crystals and large grain polycrystalline films, which exhibit a small negative MR in the FM region (2%) increasing to 8% in the HM region across 10-300 K, a grain size-dependent giant positive MR (90%) is discovered near FM to HM transition temperature (110 K), followed by a rapid decline to a negative MR below 55 K in manganese phosphide nanocrystalline films. These findings illuminate a novel strain-mediated spin helicity phenomenon in nanostructured helimagnets, presenting a promising pathway for the development of high-performance MR sensors and spintronic devices through the strategic utilization of confinement and strain effects.
Submitted 28 September, 2024;
originally announced September 2024.
-
Variational Autoencoder for Anomaly Detection: A Comparative Study
Authors:
Huy Hoang Nguyen,
Cuong Nhat Nguyen,
Xuan Tung Dao,
Quoc Trung Duong,
Dzung Pham Thi Kim,
Minh-Tan Pham
Abstract:
This paper aims to conduct a comparative analysis of contemporary Variational Autoencoder (VAE) architectures employed in anomaly detection, elucidating their performance and behavioral characteristics within this specific task. The architectural configurations under consideration encompass the original VAE baseline, the VAE with a Gaussian Random Field prior (VAE-GRF), and the VAE incorporating a vision transformer (ViT-VAE). The findings reveal that ViT-VAE exhibits exemplary performance across various scenarios, whereas VAE-GRF may necessitate more intricate hyperparameter tuning to attain its optimal performance state. Additionally, to mitigate the propensity for over-reliance on results derived from the widely used MVTec dataset, this paper leverages the recently public MiAD dataset for benchmarking. This deliberate inclusion seeks to enhance result competitiveness by alleviating the impact of domain-specific models tailored exclusively for MVTec, thereby contributing to a more robust evaluation framework. Code is available at https://github.com/endtheme123/VAE-compare.git.
Submitted 24 August, 2024;
originally announced August 2024.
-
Variable-Frequency Model Learning and Predictive Control for Jumping Maneuvers on Legged Robots
Authors:
Chuong Nguyen,
Abdullah Altawaitan,
Thai Duong,
Nikolay Atanasov,
Quan Nguyen
Abstract:
Achieving both target accuracy and robustness in dynamic maneuvers with long flight phases, such as high or long jumps, has been a significant challenge for legged robots. To address this challenge, we propose a novel learning-based control approach consisting of model learning and model predictive control (MPC) utilizing a variable-frequency scheme. Compared to existing MPC techniques, we learn a model directly from experiments, accounting not only for leg dynamics but also for modeling errors and unknown dynamics mismatch in hardware and during contact. Additionally, learning the model with variable frequency allows us to cover the entire flight phase and final jumping target, enhancing the prediction accuracy of the jumping trajectory. Using the learned model, we also design a variable-frequency MPC to effectively leverage different jumping phases and track the target accurately. In a total of 92 jumps on Unitree A1 robot hardware, we verify that our approach outperforms other MPCs using a fixed frequency or a nominal model, reducing the jumping distance error by a factor of 2 to 8. We also achieve jumping distance errors of less than 3 percent during continuous jumping on uneven terrain with randomly placed perturbations of random heights (up to 4 cm or 27 percent of the robot's standing height). Our approach obtains distance errors of 1 to 2 cm on 34 single and continuous jumps with different jumping targets and model uncertainties. Code is available at https://github.com/DRCL-USC/Learning MPC Jumping.
△ Less
Submitted 6 December, 2024; v1 submitted 20 July, 2024;
originally announced July 2024.
-
Using iterated local alignment to aggregate trajectory data into a traffic flow map
Authors:
Tarn Duong
Abstract:
Vehicle trajectories, with their detailed geolocations, are a promising data source to compute traffic flow maps at scales ranging from the city/regional level to the road level. The main obstacle is that trajectory data are prone to measurement noise. While this is negligible for city level large-scale flow aggregation, it poses substantial difficulties for road level small-scale aggregation. To overcome these difficulties, we introduce innovative local alignment algorithms, where we infer road segments to serve as local reference segments, and proceed to align nearby road segments to them. We deploy these algorithms in an iterative workflow to compute locally aligned flow maps. By applying this workflow to synthetic and empirical trajectories, we verify that our locally aligned flow maps provide high levels of accuracy and spatial resolution of flow aggregation at multiple scales for static and interactive maps.
Submitted 9 May, 2025; v1 submitted 25 June, 2024;
originally announced June 2024.
-
Building a temperature forecasting model for the city with the regression neural network (RNN)
Authors:
Nguyen Phuc Tran,
Duy Thanh Tran,
Thi Thuy Nga Duong
Abstract:
In recent years, a study by environmental organizations in Vietnam and around the world shows that weather change is quite complex. Global warming has become a serious problem in the modern world and is a concern for scientists. In the last century, it was difficult to forecast the weather due to a lack of weather monitoring stations and technological limitations. This made it hard to collect data for building predictive models to make accurate simulations. In Vietnam, research on weather forecast models is a recent development, having only begun around 2000. Along with advancements in computer science, mathematical models are being built and applied with machine learning techniques to create more accurate and reliable predictive models. This article summarizes the research and solutions for applying recurrent neural networks to forecast urban temperatures.
Submitted 27 May, 2024;
originally announced May 2024.
-
Global-in-time maximal regularity for the Cauchy problem of the heat equation in BMO and applications
Authors:
Xuan Thinh Duong,
Ji Li,
Liangchuan Wu,
Lixin Yan
Abstract:
In this article, we establish global-in-time maximal regularity for the Cauchy problem of the classical heat equation $\partial_t u(x,t)-Δu(x,t)=f(x,t)$ with $u(x,0)=0$ in a certain $\rm BMO$ setting, which improves the local-in-time result initially proposed by Ogawa and Shimizu in \cite{OS, OS2}. In further developing our method originally formulated for the heat equation, we obtain analogous global ${\rm BMO}$-maximal regularity associated to the Schrödinger operator $\mathcal L=-Δ+V$, where the nonnegative potential $V$ belongs to the reverse Hölder class ${\rm RH}_q$ for some $q> n/2$. This extension includes several inhomogeneous estimates as ingredients, such as Carleson-type estimates for the external forces.
Our new methodology is to exploit elaborate heat kernel estimates, along with matched space-time decomposition on the involving integral-type structure of maximal operators, as well as some global techniques such as those from de Simon's work and Schur's lemma. One crucial trick is to utilize the mean oscillation therein to contribute a higher and necessary decay order for global-in-time estimates.
Submitted 2 May, 2024;
originally announced May 2024.
-
What-if Analysis Framework for Digital Twins in 6G Wireless Network Management
Authors:
Elif Ak,
Berk Canberk,
Vishal Sharma,
Octavia A. Dobre,
Trung Q. Duong
Abstract:
This study explores implementing a digital twin network (DTN) for efficient 6G wireless network management, aligning with the fault, configuration, accounting, performance, and security (FCAPS) model. The DTN architecture comprises the Physical Twin Layer, implemented using NS-3, and the Service Layer, featuring machine learning and reinforcement learning for optimizing carrier sensitivity threshold and transmit power control in wireless networks. We introduce a robust "What-if Analysis" module, utilizing conditional tabular generative adversarial network (CTGAN) for synthetic data generation to mimic various network scenarios. These scenarios assess four network performance metrics: throughput, latency, packet loss, and coverage. Our findings demonstrate the efficiency of the proposed what-if analysis framework in managing complex network conditions, highlighting the importance of the scenario-maker step and the impact of twinning intervals on network performance.
Submitted 24 April, 2024; v1 submitted 17 April, 2024;
originally announced April 2024.
-
Multi-target and multi-stage liver lesion segmentation and detection in multi-phase computed tomography scans
Authors:
Abdullah F. Al-Battal,
Soan T. M. Duong,
Van Ha Tang,
Quang Duc Tran,
Steven Q. H. Truong,
Chien Phan,
Truong Q. Nguyen,
Cheolhong An
Abstract:
Multi-phase computed tomography (CT) scans use contrast agents to highlight different anatomical structures within the body to improve the probability of identifying and detecting anatomical structures of interest and abnormalities such as liver lesions. Yet, detecting these lesions remains a challenging task as they vary significantly in their size, shape, texture, and contrast with respect to surrounding tissue. Therefore, radiologists need extensive experience to be able to identify and detect these lesions. Segmentation-based neural networks can assist radiologists with this task. Current state-of-the-art lesion segmentation networks use the encoder-decoder design paradigm based on the UNet architecture where the multi-phase CT scan volume is fed to the network as a multi-channel input. Although this approach utilizes information from all the phases and outperforms single-phase segmentation networks, we demonstrate that its performance is not optimal and can be further improved by incorporating the learning from models trained on each single phase individually. Our approach comprises three stages. The first stage identifies the regions within the liver where there might be lesions at three different scales (4, 8, and 16 mm). The second stage includes the main segmentation model trained using all the phases as well as a segmentation model trained on each of the phases individually. The third stage uses the multi-phase CT volumes together with the predictions from each of the segmentation models to generate the final segmentation map. Overall, our approach improves relative liver lesion segmentation performance by 1.6% while reducing performance variability across subjects by 8% when compared to the current state-of-the-art models.
Submitted 17 April, 2024;
originally announced April 2024.
-
Decay characterization of solutions to semi-linear structurally damped $σ$-evolution equations with time-dependent damping
Authors:
Cung The Anh,
Phan Duc An,
Pham Trieu Duong
Abstract:
In this paper, we study the Cauchy problem to the linear damped $σ$-evolution equation with time-dependent damping in the effective cases \begin{equation*} u_{t t}+(-Δ)^σu+b(t)(-Δ)^δu_t=0, \end{equation*} and investigate the decay rates of the solution and its derivatives that are expressed in terms of the decay character of the initial data $u_0(x)=u(0, x)$ and $u_1(x)=u_t(0, x)$. We are interested also in the existence and decay rate of the global in time solution with small data for the corresponding semi-linear problem with the nonlinear term of power type $||D|^γu|^p$. The blow-up results for solutions to the semi-linear problem in the case $γ=0$ are presented to show the sharpness of the exponent $p$.
Submitted 10 April, 2024;
originally announced April 2024.
-
Surface-based parcellation and vertex-wise analysis of ultra high-resolution ex vivo 7 tesla MRI in Alzheimer's disease and related dementias
Authors:
Pulkit Khandelwal,
Michael Tran Duong,
Lisa Levorse,
Constanza Fuentes,
Amanda Denning,
Winifred Trotman,
Ranjit Ittyerah,
Alejandra Bahena,
Theresa Schuck,
Marianna Gabrielyan,
Karthik Prabhakaran,
Daniel Ohm,
Gabor Mizsei,
John Robinson,
Monica Munoz,
John Detre,
Edward Lee,
David Irwin,
Corey McMillan,
M. Dylan Tisdall,
Sandhitsu Das,
David Wolk,
Paul A. Yushkevich
Abstract:
Magnetic resonance imaging (MRI) is the standard modality to understand human brain structure and function in vivo (antemortem). Decades of research in human neuroimaging have led to the widespread development of methods and tools to provide automated volume-based segmentations and surface-based parcellations which help localize brain functions to specialized anatomical regions. Recently, ex vivo (postmortem) imaging of the brain has opened up avenues to study brain structure at sub-millimeter ultra high-resolution, revealing details not possible to observe with in vivo MRI. Unfortunately, there has been limited methodological development in ex vivo MRI, primarily due to a lack of datasets and the limited number of centers with such imaging resources. Therefore, in this work, we present a one-of-its-kind dataset of 82 ex vivo T2w whole brain hemispheres MRI at 0.3 mm isotropic resolution spanning Alzheimer's disease and related dementias. We adapted and developed a fast and easy-to-use automated surface-based pipeline to parcellate, for the first time, ultra high-resolution ex vivo brain tissue at the native subject space resolution using the Desikan-Killiany-Tourville (DKT) brain atlas. This allows us to perform vertex-wise analysis in the template space and thereby link morphometry measures with pathology measurements derived from histology. We will open-source our dataset, Docker container, and Jupyter notebooks on the project webpage, providing a ready-to-use, out-of-the-box set of tools and command-line options to advance ex vivo MRI clinical brain imaging research.
Submitted 2 July, 2024; v1 submitted 28 March, 2024;
originally announced March 2024.
-
Overlapping community detection algorithms using Modularity and the cosine
Authors:
Do Duy Hieu,
Phan Thi Ha Duong
Abstract:
The issue of network community detection has been extensively studied across many fields. Most community detection methods assume that nodes belong to only one community. However, in many cases, nodes can belong to multiple communities simultaneously. This paper presents two overlapping network community detection algorithms that build on the two-step approach, using the extended modularity and cosine function. The applicability of our algorithms extends to both undirected and directed graph structures. To demonstrate the feasibility and effectiveness of these algorithms, we conducted experiments using real data.
Submitted 12 March, 2024;
originally announced March 2024.
-
Response to David Steigmann's discussion of our paper
Authors:
Thang X. Duong,
Mikhail Itskov,
Roger A. Sauer
Abstract:
We respond to David Steigmann's discussion of our paper "A general theory for anisotropic Kirchhoff-Love shells with in-plane bending of embedded fibers, Math. Mech. Solids, 28(5):1274-1317" (arXiv:2101.03122). His discussion allows us to clarify two misleading statements in our original paper, and confirm that its formulation is fully consistent with the formulation of Steigmann. We also demonstrate that some of our original statements criticized by Steigmann are not wrong.
Submitted 29 February, 2024;
originally announced February 2024.
-
Ant Colony Optimization for Cooperative Inspection Path Planning Using Multiple Unmanned Aerial Vehicles
Authors:
Duy Nam Bui,
Thuy Ngan Duong,
Manh Duong Phung
Abstract:
This paper presents a new swarm intelligence-based approach to deal with the cooperative path planning problem of unmanned aerial vehicles (UAVs), which is essential for the automatic inspection of infrastructure. The approach uses a 3D model of the structure to generate viewpoints for the UAVs. The calculation of the viewpoints considers the constraints related to the UAV formation model, camera parameters, and requirements for data post-processing. The viewpoints are then used as input to formulate the path planning as an extended traveling salesman problem with a new cost function. Ant colony optimization is finally used to solve the problem to yield optimal inspection paths. Experiments with 3D models of real structures have been conducted to evaluate the performance of the proposed approach. The results show that our system is capable not only of generating feasible inspection paths for UAVs but also of reducing the path length by 29.47% for complex structures when compared with another heuristic approach. The source code of the algorithm can be found at https://github.com/duynamrcv/aco_3d_ipp.
Submitted 13 February, 2024;
originally announced February 2024.
-
Port-Hamiltonian Neural ODE Networks on Lie Groups For Robot Dynamics Learning and Control
Authors:
Thai Duong,
Abdullah Altawaitan,
Jason Stanley,
Nikolay Atanasov
Abstract:
Accurate models of robot dynamics are critical for safe and stable control and generalization to novel operational conditions. Hand-designed models, however, may be insufficiently accurate, even after careful parameter tuning. This motivates the use of machine learning techniques to approximate the robot dynamics over a training set of state-control trajectories. The dynamics of many robots are described in terms of their generalized coordinates on a matrix Lie group, e.g. on $SE(3)$ for ground, aerial, and underwater vehicles, and generalized velocity, and satisfy conservation of energy principles. This paper proposes a port-Hamiltonian formulation over a Lie group of the structure of a neural ordinary differential equation (ODE) network to approximate the robot dynamics. In contrast to a black-box ODE network, our formulation embeds the energy conservation principle and the Lie group constraints in the dynamics model and explicitly accounts for energy-dissipation effects such as friction and drag forces. We develop energy shaping and damping injection control for the learned, potentially under-actuated Hamiltonian dynamics to enable a unified approach for stabilization and trajectory tracking with various robot platforms.
Submitted 11 June, 2024; v1 submitted 17 January, 2024;
originally announced January 2024.
-
Physics-Informed Multi-Agent Reinforcement Learning for Distributed Multi-Robot Problems
Authors:
Eduardo Sebastian,
Thai Duong,
Nikolay Atanasov,
Eduardo Montijano,
Carlos Sagues
Abstract:
The networked nature of multi-robot systems presents challenges in the context of multi-agent reinforcement learning. Centralized control policies do not scale with increasing numbers of robots, whereas independent control policies do not exploit the information provided by other robots, exhibiting poor performance in cooperative-competitive tasks. In this work we propose a physics-informed reinforcement learning approach able to learn distributed multi-robot control policies that are both scalable and make use of all the information available to each robot. Our approach has three key characteristics. First, it imposes a port-Hamiltonian structure on the policy representation, respecting energy conservation properties of physical robot systems and the networked nature of robot team interactions. Second, it uses self-attention to ensure a sparse policy representation able to handle time-varying information at each robot from the interaction graph. Third, we present a soft actor-critic reinforcement learning algorithm parameterized by our self-attention port-Hamiltonian control policy, which accounts for the correlation among robots during training while overcoming the need for value function factorization. Extensive simulations in different multi-robot scenarios demonstrate the success of the proposed approach, surpassing previous multi-robot reinforcement learning solutions in scalability, while achieving similar or superior performance (with averaged cumulative reward up to x2 greater than the state-of-the-art with robot teams x6 larger than the number of robots at training time). We also validate our approach on multiple real robots in the Georgia Tech Robotarium under imperfect communication, demonstrating zero-shot sim-to-real transfer and scalability across the number of robots.
Submitted 24 March, 2025; v1 submitted 30 December, 2023;
originally announced January 2024.
-
Performance of Distributed File Systems on Cloud Computing Environment: An Evaluation for Small-File Problem
Authors:
Thanh Duong,
Quoc Luu,
Hung Nguyen
Abstract:
Various performance characteristics of distributed file systems have been well studied. However, the performance efficiency of distributed file systems on small-file problems with complex machine learning algorithms scenarios is not well addressed. In addition, demands for unified storage of big data processing and high-performance computing have been crucial. Hence, developing a solution combining high-performance computing and big data with shared storage is very important. This paper focuses on the performance efficiency of distributed file systems with small-file datasets. We propose an architecture combining both high-performance computing and big data with shared storage and perform a series of experiments to investigate the performance of these distributed file systems. The result of the experiments confirms the applicability of the proposed architecture in terms of complex machine learning algorithms.
Submitted 29 December, 2023;
originally announced December 2023.
-
Multi-Tier Computing-Enabled Digital Twin in 6G Networks
Authors:
Kunlun Wang,
Yongyi Tang,
Trung Q. Duong,
Saeed R. Khosravirad,
Octavia A. Dobre,
George K. Karagiannidis
Abstract:
Digital twin (DT) is the recurrent and common feature in discussions about future technologies, bringing together advanced communication, computation, and artificial intelligence, to name a few. In the context of Industry 4.0, industries such as manufacturing, automotive, and healthcare are rapidly adopting DT-based development. The main challenges to date have been the high demands on communication and computing resources, as well as privacy and security concerns, arising from the large volumes of data exchanges. To achieve low latency and high security services in the emerging DT, multi-tier computing has been proposed by combining edge/fog computing and cloud computing. Specifically, low latency data transmission, efficient resource allocation, and validated security strategies of multi-tier computing systems are used to solve the operational problems of the DT system. In this paper, we introduce the architecture and applications of DT using examples from manufacturing, the Internet-of-Vehicles and healthcare. At the same time, the architecture and technology of multi-tier computing systems are studied to support DT. This paper will provide valuable reference and guidance for the theory, algorithms, and applications in collaborative multi-tier computing and DT.
Submitted 28 December, 2023;
originally announced December 2023.
-
Time-Dependent Density Functional Theory with the Orthogonal Projector Augmented Wave Method
Authors:
Minh Nguyen,
Tim Duong,
Daniel Neuhauser
Abstract:
The projector augmented wave (PAW) method of Blöchl linearly maps smooth pseudo wavefunctions to the highly oscillatory all-electron DFT orbitals. Compared to norm-conserving pseudopotentials (NCPP), PAW has the advantage of lower kinetic energy cutoffs and larger grid spacings at the cost of having to solve for non-orthogonal wavefunctions. We earlier developed orthogonal PAW (OPAW) to allow the use of PAW when orthogonal wavefunctions are required. In OPAW, the pseudo wavefunctions are transformed through the efficient application of powers of the PAW overlap operator with essentially no extra cost compared to NCPP methods. Previously, we applied OPAW to DFT. Here, we take the first step to make OPAW viable for post-DFT methods by implementing it in real-time time-dependent (TD) DFT. Using fourth-order Runge-Kutta for the time-propagation, we compare calculations of absorption spectra for various organic and biological molecules and show that very large grid spacings are sufficient, 0.6-0.8 Bohr in OPAW-TDDFT rather than the 0.4-0.5 Bohr used in traditional NCPP-TDDFT calculations. This reduces the memory and propagation costs by up to a factor of 5. Our method would be directly applicable to any post-DFT methods that require time-dependent propagations such as GW and BSE.
Submitted 18 December, 2023;
originally announced December 2023.
-
Low-resource classification of mobility functioning information in clinical sentences using large language models
Authors:
Tuan Dung Le,
Thanh Duong,
Thanh Thieu
Abstract:
Objective: Function is increasingly recognized as an important indicator of whole-person health. This study evaluates the ability of publicly available large language models (LLMs) to accurately identify the presence of functioning information from clinical notes. We explore various strategies to improve the performance on this task. Materials and Methods: We collect a balanced binary classification dataset of 1000 sentences from the Mobility NER dataset, which was curated from n2c2 clinical notes. For evaluation, we construct zero-shot and few-shot prompts to query the LLMs whether a given sentence contains mobility functioning information. Two sampling techniques, random sampling and k-nearest neighbor (kNN)-based sampling, are used to select the few-shot examples. Furthermore, we apply a parameter-efficient prompt-based fine-tuning method to the LLMs and evaluate their performance under various training settings. Results: Flan-T5-xxl outperforms all other models in both zero-shot and few-shot settings, achieving an F1 score of 0.865 with a single demonstrative example selected by kNN sampling. In prompt-based fine-tuning experiments, this foundation model also demonstrates superior performance across all low-resource settings, particularly achieving an impressive F1 score of 0.922 using the full training dataset. The smaller model, Flan-T5-xl, requires fine-tuning with only 2.3M additional parameters to achieve comparable performance to the fully fine-tuned Gatortron-base model, both surpassing 0.9 F1 score. Conclusion: Open-source instruction-tuned LLMs demonstrate impressive in-context learning capability in the mobility functioning classification task. The performance of these models can be further improved by continuing fine-tuning on a task-specific dataset.
Submitted 15 December, 2023;
originally announced December 2023.
-
Evaluating the Convergence Limit of Quantum Neural Tangent Kernel
Authors:
Trong Duong
Abstract:
Quantum variational algorithms have been one of the major applications of quantum computing with current quantum devices. There are recent attempts to establish the foundation for these algorithms. A possible approach is to characterize the training dynamics with the quantum neural tangent kernel. In this work, we construct the kernel for two models, Quantum Ensemble and Quantum Neural Network, and show the convergence of these models in the limit of infinitely many qubits. We also show applications of the kernel limit in regression tasks.
Submitted 4 December, 2023;
originally announced December 2023.
-
The impact of the Russia-Ukraine conflict on the extreme risk spillovers between agricultural futures and spots
Authors:
Wei-Xing Zhou,
Yun-Shi Dai,
Kiet Tuan Duong,
Peng-Fei Dai
Abstract:
The ongoing Russia-Ukraine conflict between two major agricultural powers has posed significant threats and challenges to the global food system and world food security. Focusing on the impact of the conflict on the global agricultural market, we propose a new analytical framework for tail dependence, and combine the Copula-CoVaR method with the ARMA-GARCH-skewed Student-t model to examine the tail dependence structure and extreme risk spillover between agricultural futures and spots over the pre- and post-outbreak periods. Our results indicate that the tail dependence structures in the futures-spot markets of soybean, maize, wheat, and rice have all reacted to the Russia-Ukraine conflict. Furthermore, the outbreak of the conflict has intensified risks of the four agricultural markets in varying degrees, with the wheat market being affected the most. Additionally, all the agricultural futures markets exhibit significant downside and upside risk spillovers to their corresponding spot markets before and after the outbreak of the conflict, whereas the strengths of these extreme risk spillover effects demonstrate significant asymmetries at the directional (downside versus upside) and temporal (pre-outbreak versus post-outbreak) levels.
Submitted 24 October, 2023;
originally announced October 2023.
-
Digital Twin-Enabled Intelligent DDoS Detection Mechanism for Autonomous Core Networks
Authors:
Yagmur Yigit,
Bahadir Bal,
Aytac Karameseoglu,
Trung Q. Duong,
Berk Canberk
Abstract:
Existing distributed denial of service attack (DDoS) solutions cannot handle highly aggregated data rates; thus, they are unsuitable for Internet service provider (ISP) core networks. This article proposes a digital twin-enabled intelligent DDoS detection mechanism using an online learning method for autonomous systems. Our contributions are three-fold: we first design a DDoS detection architecture based on the digital twin for ISP core networks. We implemented a Yet Another Next Generation (YANG) model and an automated feature selection (AutoFS) module to handle core network data. We used an online learning approach to update the model instantly and efficiently, improve the learning model quickly, and ensure accurate predictions. Finally, we reveal that our proposed solution successfully detects DDoS attacks and updates the feature selection method and learning model with a true classification rate of ninety-seven percent. Our proposed solution can estimate the attack within approximately fifteen minutes after the DDoS attack starts.
Submitted 25 October, 2023; v1 submitted 19 October, 2023;
originally announced October 2023.
-
TwinPot: Digital Twin-assisted Honeypot for Cyber-Secure Smart Seaports
Authors:
Yagmur Yigit,
Omer Kemal Kinaci,
Trung Q. Duong,
Berk Canberk
Abstract:
The idea of next-generation ports has become more apparent in the last ten years in response to the challenge posed by the rising demand for efficiency and the ever-increasing volume of goods. In this new era of intelligent infrastructure and facilities, it is evident that cyber-security has recently received the most significant attention from the seaport and maritime authorities, and it is a primary concern on the agenda of most ports. Traditional security solutions can be applied to safeguard IoT and Cyber-Physical Systems (CPS) from harmful entities. Nevertheless, security researchers can only watch, examine, and learn about the behaviors of attackers if these solutions operate more transparently. Herein, honeypots are potential solutions since they offer valuable information about the attackers. They can be virtual or physical. Virtual honeypots must be more realistic to entice attackers, necessitating higher fidelity. To this end, Digital Twin (DT) technology can be employed to increase the complexity and simulation fidelity of the honeypots. Seaports can be attacked from both their existing devices and external devices at the same time. Existing mechanisms are insufficient to detect external attacks; therefore, the current systems cannot handle attacks at the desired level. DT and honeypot technologies can be used together to tackle them. Consequently, we suggest a DT-assisted honeypot, called TwinPot, for external attacks in smart seaports. Moreover, we propose an intelligent attack detection mechanism to handle different attack types using DT for internal attacks. Finally, we build an extensive smart seaport dataset for internal and external attacks using the MANSIM tool and two existing datasets to test the performance of our system. We show that under simultaneous internal and external attacks on the system, our solution successfully detects internal and external attacks.
Submitted 25 October, 2023; v1 submitted 19 October, 2023;
originally announced October 2023.
-
Circular-Line Trajectory Tracking Controller for Mobile Robot using Multi-Pixy2 Sensors
Authors:
Xuan Quang Ngo,
Tri Duc Tran,
Huy Hung Nguyen,
Van Dong Nguyen,
Van Tu Duong,
Tan Tien Nguyen
Abstract:
This study suggests a novel tracking method that employs three Pixy2 sensors to identify the desired line trajectories instead of traditional perceiving means. Firstly, the kinematic model of the mobile robot is derived from the information gathered by three Pixy2 sensors. Secondly, the sliding mode controller is implemented to regulate the tracking error. Finally, simulation results are analyzed to show the effectiveness of the proposed method.
Submitted 12 August, 2023;
originally announced September 2023.
-
Energy-Efficient Precoding Designs for Multi-User Visible Light Communication Systems with Confidential Messages
Authors:
Son T. Duong,
Thanh V. Pham,
Chuyen T. Nguyen,
Anh T. Pham
Abstract:
This paper studies energy-efficient precoding designs for multi-user visible light communication (VLC) systems from the perspective of physical layer security where users' messages must be kept mutually confidential. For such systems, we first derive a lower bound on the achievable secrecy rate of each user. Next, the total power consumption for illumination and data transmission is thoroughly analyzed. We then tackle the problem of maximizing energy efficiency, given that each user's secrecy rate satisfies a certain threshold. The design problem is shown to be non-convex fractional programming, which renders finding the optimal solution computationally prohibitive. Our aim in this paper is, therefore, to find sub-optimal yet low complexity solutions. For this purpose, the traditional Dinkelbach algorithm is first employed to reformulate the original problem to a non-fractional parameterized one. Two different approaches based on the convex-concave procedure (CCCP) and Semidefinite Relaxation (SDR) are utilized to solve the non-convex parameterized problem. In addition, to further reduce the complexity, we investigate a design using the zero-forcing (ZF) technique. Numerical results are presented to show the feasibility, convergence, and performance of the proposed algorithms depending on different parameters of the system.
Submitted 27 September, 2023;
originally announced September 2023.
-
Optimal Scene Graph Planning with Large Language Model Guidance
Authors:
Zhirui Dai,
Arash Asgharivaskasi,
Thai Duong,
Shusen Lin,
Maria-Elizabeth Tzes,
George Pappas,
Nikolay Atanasov
Abstract:
Recent advances in metric, semantic, and topological mapping have equipped autonomous robots with semantic concept grounding capabilities to interpret natural language tasks. This work aims to leverage these new capabilities with an efficient task planning algorithm for hierarchical metric-semantic models. We consider a scene graph representation of the environment and utilize a large language model (LLM) to convert a natural language task into a linear temporal logic (LTL) automaton. Our main contribution is to enable optimal hierarchical LTL planning with LLM guidance over scene graphs. To achieve efficiency, we construct a hierarchical planning domain that captures the attributes and connectivity of the scene graph and the task automaton, and provide semantic guidance via an LLM heuristic function. To guarantee optimality, we design an LTL heuristic function that is provably consistent and supplements the potentially inadmissible LLM guidance in multi-heuristic planning. We demonstrate efficient planning of complex natural language tasks in scene graphs of virtualized real environments.
Submitted 10 January, 2024; v1 submitted 17 September, 2023;
originally announced September 2023.