Skip to main content

Showing 1–19 of 19 results for author: Nguyen-Tang, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2507.00669  [pdf, ps, other

    cs.LG cs.AI cs.CV cs.RO

    Audio-3DVG: Unified Audio - Point Cloud Fusion for 3D Visual Grounding

    Authors: Duc Cao-Dinh, Khai Le-Duc, Anh Dao, Bach Phan Tat, Chris Ngo, Duy M. H. Nguyen, Nguyen X. Khanh, Thanh Nguyen-Tang

    Abstract: 3D Visual Grounding (3DVG) involves localizing target objects in 3D point clouds based on natural language. While prior work has made strides using textual descriptions, leveraging spoken language-known as Audio-based 3D Visual Grounding-remains underexplored and challenging. Motivated by advances in automatic speech recognition (ASR) and speech representation learning, we propose Audio-3DVG, a si… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Work in progress, 42 pages

  2. arXiv:2505.15009  [pdf, ps, other

    cs.LG cs.AI

    One-Layer Transformers are Provably Optimal for In-context Reasoning and Distributional Association Learning in Next-Token Prediction Tasks

    Authors: Quan Nguyen, Thanh Nguyen-Tang

    Abstract: We study the approximation capabilities and on-convergence behaviors of one-layer transformers on the noiseless and noisy in-context reasoning of next-token prediction. Existing theoretical results focus on understanding the in-context reasoning behaviors for either the first gradient step or when the number of samples is infinite. Furthermore, no convergence rates nor generalization abilities wer… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 27 pages

  3. arXiv:2504.03546  [pdf, other

    cs.CL cs.AI cs.LG cs.SD eess.AS

    MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation

    Authors: Khai Le-Duc, Tuyen Tran, Bach Phan Tat, Nguyen Kim Hai Bui, Quan Dang, Hung-Phong Tran, Thanh-Thuy Nguyen, Ly Nguyen, Tuan-Minh Phan, Thi Thu Phuong Tran, Chris Ngo, Nguyen X. Khanh, Thanh Nguyen-Tang

    Abstract: Multilingual speech translation (ST) in the medical domain enhances patient care by enabling efficient communication across language barriers, alleviating specialized workforce shortages, and facilitating improved diagnosis and treatment, particularly during pandemics. In this work, we present the first systematic study on medical ST, to our best knowledge, by releasing MultiMed-ST, a large-scale… ▽ More

    Submitted 4 April, 2025; originally announced April 2025.

    Comments: Preprint, 122 pages

  4. arXiv:2503.01329  [pdf, other

    cs.LG cs.AI

    Neural ODE Transformers: Analyzing Internal Dynamics and Adaptive Fine-tuning

    Authors: Anh Tong, Thanh Nguyen-Tang, Dongeun Lee, Duc Nguyen, Toan Tran, David Hall, Cheongwoong Kang, Jaesik Choi

    Abstract: Recent advancements in large language models (LLMs) based on transformer architectures have sparked significant interest in understanding their inner workings. In this paper, we introduce a novel approach to modeling transformer architectures using highly flexible non-autonomous neural ordinary differential equations (ODEs). Our proposed model parameterizes all weights of attention and feed-forwar… ▽ More

    Submitted 16 April, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

    Comments: ICLR 2025

  5. arXiv:2501.06339  [pdf, other

    cs.LG cs.AI stat.ML

    On The Statistical Complexity of Offline Decision-Making

    Authors: Thanh Nguyen-Tang, Raman Arora

    Abstract: We study the statistical complexity of offline decision-making with function approximation, establishing (near) minimax-optimal rates for stochastic contextual bandits and Markov decision processes. The performance limits are captured by the pseudo-dimension of the (value) function class and a new characterization of the behavior policy that \emph{strictly} subsumes all the previous notions of dat… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: arXiv version for the ICML'24 paper

  6. arXiv:2411.00707  [pdf, ps, other

    cs.LG cs.AI cs.GT stat.ML

    Learning in Markov Games with Adaptive Adversaries: Policy Regret, Fundamental Barriers, and Efficient Algorithms

    Authors: Thanh Nguyen-Tang, Raman Arora

    Abstract: We study learning in a dynamically evolving environment modeled as a Markov game between a learner and a strategic opponent that can adapt to the learner's strategies. While most existing works in Markov games focus on external regret as the learning objective, external regret becomes inadequate when the adversaries are adaptive. In this work, we focus on \emph{policy regret} -- a counterfactual n… ▽ More

    Submitted 9 December, 2024; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: NeurIPS'24; fix typos

  7. arXiv:2409.14074  [pdf, other

    cs.CL cs.SD eess.AS

    MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder

    Authors: Khai Le-Duc, Phuc Phan, Tan-Hanh Pham, Bach Phan Tat, Minh-Huong Ngo, Chris Ngo, Thanh Nguyen-Tang, Truong-Son Hy

    Abstract: Multilingual automatic speech recognition (ASR) in the medical domain serves as a foundational task for various downstream applications such as speech translation, spoken language understanding, and voice-activated assistants. This technology improves patient care by enabling efficient communication across language barriers, alleviating specialized workforce shortages, and facilitating improved di… ▽ More

    Submitted 15 May, 2025; v1 submitted 21 September, 2024; originally announced September 2024.

    Comments: ACL 2025, 38 pages

  8. arXiv:2407.10825  [pdf, other

    cs.LG cs.CR cs.CV

    Wicked Oddities: Selectively Poisoning for Effective Clean-Label Backdoor Attacks

    Authors: Quang H. Nguyen, Nguyen Ngoc-Hieu, The-Anh Ta, Thanh Nguyen-Tang, Kok-Seng Wong, Hoang Thanh-Tung, Khoa D. Doan

    Abstract: Deep neural networks are vulnerable to backdoor attacks, a type of adversarial attack that poisons the training data to manipulate the behavior of models trained on such data. Clean-label attacks are a more stealthy form of backdoor attacks that can perform the attack without changing the labels of poisoned data. Early works on clean-label attacks added triggers to a random subset of the training… ▽ More

    Submitted 16 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  9. arXiv:2403.11574  [pdf, other

    cs.LG

    Offline Multitask Representation Learning for Reinforcement Learning

    Authors: Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

    Abstract: We study offline multitask representation learning in reinforcement learning (RL), where a learner is provided with an offline dataset from different tasks that share a common representation and is asked to learn the shared representation. We theoretically investigate offline multitask low-rank RL, and propose a new algorithm called MORL for offline multitask representation learning. Furthermore,… ▽ More

    Submitted 31 October, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted to 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  10. arXiv:2401.03301  [pdf, other

    cs.LG cs.AI stat.ML

    On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond

    Authors: Thanh Nguyen-Tang, Raman Arora

    Abstract: We seek to understand what facilitates sample-efficient learning from historical datasets for sequential decision-making, a problem that is popularly known as offline reinforcement learning (RL). Further, we are interested in algorithms that enjoy sample efficiency while leveraging (value) function approximation. In this paper, we address these fundamental questions by (i) proposing a notion of da… ▽ More

    Submitted 6 February, 2024; v1 submitted 6 January, 2024; originally announced January 2024.

    Comments: NeurIPS'23; Arxiv is the authors' preferred version; v2: add a missing related work

  11. SigFormer: Signature Transformers for Deep Hedging

    Authors: Anh Tong, Thanh Nguyen-Tang, Dongeun Lee, Toan Tran, Jaesik Choi

    Abstract: Deep hedging is a promising direction in quantitative finance, incorporating models and techniques from deep learning research. While giving excellent hedging strategies, models inherently requires careful treatment in designing architectures for neural networks. To mitigate such difficulties, we introduce SigFormer, a novel deep learning model that combines the power of path signatures and transf… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: ICAIF 2023

  12. arXiv:2306.14920  [pdf, other

    cs.LG

    A Cosine Similarity-based Method for Out-of-Distribution Detection

    Authors: Nguyen Ngoc-Hieu, Nguyen Hung-Quang, The-Anh Ta, Thanh Nguyen-Tang, Khoa D Doan, Hoang Thanh-Tung

    Abstract: The ability to detect OOD data is a crucial aspect of practical machine learning applications. In this work, we show that cosine similarity between the test feature and the typical ID feature is a good indicator of OOD data. We propose Class Typical Matching (CTM), a post hoc OOD detection algorithm that uses a cosine similarity scoring function. Extensive experiments on multiple benchmarks show t… ▽ More

    Submitted 23 June, 2023; originally announced June 2023.

    Comments: Accepted paper at ICML 2023 Workshop on Spurious Correlations, Invariance, and Stability. 10 pages (4 main + appendix)

  13. arXiv:2302.12780  [pdf, other

    cs.LG

    VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation

    Authors: Thanh Nguyen-Tang, Raman Arora

    Abstract: We propose a novel algorithm for offline reinforcement learning called Value Iteration with Perturbed Rewards (VIPeR), which amalgamates the pessimism principle with random perturbations of the value function. Most current offline RL algorithms explicitly construct statistical confidence regions to obtain pessimism via lower confidence bounds (LCB), which cannot easily scale to complex problems wh… ▽ More

    Submitted 3 March, 2023; v1 submitted 24 February, 2023; originally announced February 2023.

    Comments: top-25%-noble ICLR'23; code: https://github.com/thanhnguyentang/neural-offline-rl; v2: change title

  14. arXiv:2211.13208  [pdf, other

    cs.LG

    On Instance-Dependent Bounds for Offline Reinforcement Learning with Linear Function Approximation

    Authors: Thanh Nguyen-Tang, Ming Yin, Sunil Gupta, Svetha Venkatesh, Raman Arora

    Abstract: Sample-efficient offline reinforcement learning (RL) with linear function approximation has recently been studied extensively. Much of prior work has yielded the minimax-optimal bound of $\tilde{\mathcal{O}}(\frac{1}{\sqrt{K}})$, with $K$ being the number of episodes in the offline data. In this work, we seek to understand instance-dependent bounds for offline RL with function approximation. We pr… ▽ More

    Submitted 27 January, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

    Comments: AAAI'23

  15. arXiv:2206.14648  [pdf, other

    cs.IR cs.LG

    Two-Stage Neural Contextual Bandits for Personalised News Recommendation

    Authors: Mengyan Zhang, Thanh Nguyen-Tang, Fangzhao Wu, Zhenyu He, Xing Xie, Cheng Soon Ong

    Abstract: We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in recommendation, which leads to biased feedback loops and hurt recommendation quality in the long term. We build on contextual bandits recommendation strategies which natural… ▽ More

    Submitted 26 June, 2022; originally announced June 2022.

  16. arXiv:2203.01758  [pdf, other

    cs.LG cs.AI stat.ML

    On Practical Reinforcement Learning: Provable Robustness, Scalability, and Statistical Efficiency

    Authors: Thanh Nguyen-Tang

    Abstract: This thesis rigorously studies fundamental reinforcement learning (RL) methods in modern practical considerations, including robust RL, distributional RL, and offline RL with neural function approximation. The thesis first prepares the readers with an overall overview of RL and key technical background in statistics and optimization. In each of the settings, the thesis motivates the problems to be… ▽ More

    Submitted 3 March, 2022; originally announced March 2022.

    Comments: Ph.D. thesis, 209 pages

  17. arXiv:2111.13807  [pdf, other

    cs.LG cs.AI stat.ML

    Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization

    Authors: Thanh Nguyen-Tang, Sunil Gupta, A. Tuan Nguyen, Svetha Venkatesh

    Abstract: Offline policy learning (OPL) leverages existing data collected a priori for policy optimization without any active exploration. Despite the prevalence and recent interest in this problem, its theoretical and algorithmic foundations in function approximation settings remain under-developed. In this paper, we consider this problem on the axes of distributional shift, optimization, and generalizatio… ▽ More

    Submitted 13 March, 2022; v1 submitted 26 November, 2021; originally announced November 2021.

    Comments: A full version at ICLR'22; a preliminary version at Offline RL Workshop at NeurIPS'21; code: https://github.com/thanhnguyentang/offline_neural_bandits

    Journal ref: ICLR 2022

  18. arXiv:2107.11533  [pdf, other

    stat.ML cs.LG

    Combining Online Learning and Offline Learning for Contextual Bandits with Deficient Support

    Authors: Hung Tran-The, Sunil Gupta, Thanh Nguyen-Tang, Santu Rana, Svetha Venkatesh

    Abstract: We address policy learning with logged data in contextual bandits. Current offline-policy learning algorithms are mostly based on inverse propensity score (IPS) weighting requiring the logging policy to have \emph{full support} i.e. a non-zero probability for any context/action of the evaluation policy. However, many real-world systems do not guarantee such logging policies, especially when the ac… ▽ More

    Submitted 24 July, 2021; originally announced July 2021.

  19. arXiv:2103.06671  [pdf, ps, other

    stat.ML cs.LG

    Sample Complexity of Offline Reinforcement Learning with Deep ReLU Networks

    Authors: Thanh Nguyen-Tang, Sunil Gupta, Hung Tran-The, Svetha Venkatesh

    Abstract: Offline reinforcement learning (RL) leverages previously collected data for policy optimization without any further active exploration. Despite the recent interest in this problem, its theoretical results in neural network function approximation settings remain elusive. In this paper, we study the statistical theory of offline RL with deep ReLU network function approximation. In particular, we est… ▽ More

    Submitted 13 December, 2022; v1 submitted 11 March, 2021; originally announced March 2021.

    Comments: https://openreview.net/forum?id=LdEm0umNcv

    Journal ref: Transactions on Machine Learning Research, 2022