Showing 1–23 of 23 results for author: Nguyen, H M

Searching in archive cs.
  1. arXiv:2507.02671  [pdf, ps, other]

    cs.LG cs.CV eess.IV

    Embedding-Based Federated Data Sharing via Differentially Private Conditional VAEs

    Authors: Francesco Di Salvo, Hanh Huyen My Nguyen, Christian Ledig

    Abstract: Deep Learning (DL) has revolutionized medical imaging, yet its adoption is constrained by data scarcity and privacy regulations, limiting access to diverse datasets. Federated Learning (FL) enables decentralized training but suffers from high communication costs and is often restricted to a single downstream task, reducing flexibility. We propose a data-sharing method via Differentially Private (D…

    Submitted 3 July, 2025; originally announced July 2025.

    Comments: Accepted to MICCAI 2025

  2. arXiv:2506.08681  [pdf, ps, other]

    cs.LG

    Mitigating Reward Over-optimization in Direct Alignment Algorithms with Importance Sampling

    Authors: Phuc Minh Nguyen, Ngoc-Hieu Nguyen, Duy H. M. Nguyen, Anji Liu, An Mai, Binh T. Nguyen, Daniel Sonntag, Khoa D. Doan

    Abstract: Direct Alignment Algorithms (DAAs) such as Direct Preference Optimization (DPO) have emerged as alternatives to the standard Reinforcement Learning from Human Feedback (RLHF) for aligning large language models (LLMs) with human values. However, these methods are more susceptible to over-optimization, in which the model drifts away from the reference policy, leading to degraded performance as train…

    Submitted 11 June, 2025; v1 submitted 10 June, 2025; originally announced June 2025.

    Comments: First version

  3. arXiv:2505.19080  [pdf, ps, other]

    cs.RO

    ReFineVLA: Reasoning-Aware Teacher-Guided Transfer Fine-Tuning

    Authors: Tuan Van Vo, Tan Quang Nguyen, Khang Minh Nguyen, Duy Ho Minh Nguyen, Minh Nhat Vu

    Abstract: Vision-Language-Action (VLA) models have gained much attention from the research community thanks to their strength in translating multimodal observations with linguistic instructions into robotic actions. Despite their recent advancements, VLAs often overlook the explicit reasoning and only learn the functional input-action mappings, omitting these crucial logical steps for interpretability and g…

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: 10 pages

  4. arXiv:2505.03770  [pdf, other]

    cs.AI

    Proceedings of 1st Workshop on Advancing Artificial Intelligence through Theory of Mind

    Authors: Mouad Abrini, Omri Abend, Dina Acklin, Henny Admoni, Gregor Aichinger, Nitay Alon, Zahra Ashktorab, Ashish Atreja, Moises Auron, Alexander Aufreiter, Raghav Awasthi, Soumya Banerjee, Joe M. Barnby, Rhea Basappa, Severin Bergsmann, Djallel Bouneffouf, Patrick Callaghan, Marc Cavazza, Thierry Chaminade, Sonia Chernova, Mohamed Chetouan, Moumita Choudhury, Axel Cleeremans, Jacek B. Cywinski, Fabio Cuzzolin, et al. (83 additional authors not shown)

    Abstract: This volume includes a selection of papers presented at the Workshop on Advancing Artificial Intelligence through Theory of Mind held at AAAI 2025 in Philadelphia US on 3rd March 2025. The purpose of this volume is to provide an open access and curated anthology for the ToM and AI research community.

    Submitted 28 April, 2025; originally announced May 2025.

    Comments: workshop proceedings

  5. arXiv:2504.14898  [pdf, other]

    stat.ML cs.LG

    Expected Free Energy-based Planning as Variational Inference

    Authors: Bert de Vries, Wouter Nuijten, Thijs van de Laar, Wouter Kouw, Sepideh Adamiat, Tim Nisslbeck, Mykola Lukashchuk, Hoang Minh Huu Nguyen, Marco Hidalgo Araya, Raphael Tresor, Thijs Jenneskens, Ivana Nikoloska, Raaja Ganapathy Subramanian, Bart van Erp, Dmitry Bagaev, Albert Podusenko

    Abstract: We address the problem of planning under uncertainty, where an agent must choose actions that not only achieve desired outcomes but also reduce uncertainty. Traditional methods often treat exploration and exploitation as separate objectives, lacking a unified inferential foundation. Active inference, grounded in the Free Energy Principle, provides such a foundation by minimizing Expected Free Ener…

    Submitted 23 April, 2025; v1 submitted 21 April, 2025; originally announced April 2025.

    Comments: 18 pages

  6. arXiv:2503.12722  [pdf, other]

    cs.AI cs.CL cs.GT cs.MA

    Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering

    Authors: Kenneth J. K. Ong, Lye Jia Jun, Hieu Minh "Jord" Nguyen, Seong Hah Cho, Natalia Pérez-Campanero Antolín

    Abstract: As Large Language Models (LLMs) gain autonomous capabilities, their coordination in multi-agent settings becomes increasingly important. However, they often struggle with cooperation, leading to suboptimal outcomes. Inspired by Axelrod's Iterated Prisoner's Dilemma (IPD) tournaments, we explore how personality traits influence LLM cooperation. Using representation engineering, we steer Big Five tr…

    Submitted 16 March, 2025; originally announced March 2025.

    Comments: Poster, Technical AI Safety Conference 2025

  7. arXiv:2503.10728  [pdf, other]

    cs.CL cs.AI cs.CY

    DarkBench: Benchmarking Dark Patterns in Large Language Models

    Authors: Esben Kran, Hieu Minh "Jord" Nguyen, Akash Kundu, Sami Jawhar, Jinsuk Park, Mateusz Maria Jurewicz

    Abstract: We introduce DarkBench, a comprehensive benchmark for detecting dark design patterns--manipulative techniques that influence user behavior--in interactions with large language models (LLMs). Our benchmark comprises 660 prompts across six categories: brand bias, user retention, sycophancy, anthropomorphism, harmful generation, and sneaking. We evaluate models from five leading companies (OpenAI, An…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Accepted as an Oral paper at ICLR 2025

  8. arXiv:2502.14412  [pdf, other]

    cs.CV cs.CR cs.LG

    Evaluating Precise Geolocation Inference Capabilities of Vision Language Models

    Authors: Neel Jay, Hieu Minh Nguyen, Trung Dung Hoang, Jacob Haimes

    Abstract: The prevalence of Vision-Language Models (VLMs) raises important questions about privacy in an era where visual information is increasingly available. While foundation VLMs demonstrate broad knowledge and learned capabilities, we specifically investigate their ability to infer geographic location from previously unseen image data. This paper introduces a benchmark dataset collected from Google Str…

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: AAAI 2025 Workshop DATASAFE

  9. arXiv:2502.06470  [pdf, ps, other]

    cs.CL cs.AI

    A Survey of Theory of Mind in Large Language Models: Evaluations, Representations, and Safety Risks

    Authors: Hieu Minh "Jord" Nguyen

    Abstract: Theory of Mind (ToM), the ability to attribute mental states to others and predict their behaviour, is fundamental to social intelligence. In this paper, we survey studies evaluating behavioural and representational ToM in Large Language Models (LLMs), identify important safety risks from advanced LLM ToM capabilities, and suggest several research directions for effective evaluation and mitigation…

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Advancing Artificial Intelligence through Theory of Mind Workshop, AAAI 2025

  10. arXiv:2502.02118  [pdf, other]

    cs.LG cs.CV

    BRIDLE: Generalized Self-supervised Learning with Quantization

    Authors: Hoang M. Nguyen, Satya N. Shukla, Qiang Zhang, Hanchao Yu, Sreya D. Roy, Taipeng Tian, Lingjiong Zhu, Yuchen Liu

    Abstract: Self-supervised learning has been a powerful approach for learning meaningful representations from unlabeled data across various domains, reducing the reliance on large labeled datasets. Inspired by BERT's success in capturing deep bidirectional contexts in natural language processing, similar frameworks have been adapted to other modalities such as audio, with models like BEATs extending the bidi…

    Submitted 4 February, 2025; originally announced February 2025.

  11. arXiv:2406.13997  [pdf, other]

    cs.CL cs.CE

    "Global is Good, Local is Bad?": Understanding Brand Bias in LLMs

    Authors: Mahammed Kamruzzaman, Hieu Minh Nguyen, Gene Louis Kim

    Abstract: Many recent studies have investigated social biases in LLMs but brand bias has received little attention. This research examines the biases exhibited by LLMs towards different brands, a significant concern given the widespread use of LLMs in affected use cases such as product recommendation and market analysis. Biased models may perpetuate societal inequalities, unfairly favoring established globa…

    Submitted 27 September, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: Accepted at EMNLP-2024 (main)

  12. arXiv:2404.02949  [pdf, other]

    cs.LG cs.AI

    The SaTML '24 CNN Interpretability Competition: New Innovations for Concept-Level Interpretability

    Authors: Stephen Casper, Jieun Yun, Joonhyuk Baek, Yeseong Jung, Minhwan Kim, Kiwan Kwon, Saerom Park, Hayden Moore, David Shriver, Marissa Connor, Keltin Grimes, Angus Nicolson, Arush Tagade, Jessica Rumbelow, Hieu Minh Nguyen, Dylan Hadfield-Menell

    Abstract: Interpretability techniques are valuable for helping humans understand and oversee AI systems. The SaTML 2024 CNN Interpretability Competition solicited novel methods for studying convolutional neural networks (CNNs) at the ImageNet scale. The objective of the competition was to help human crowd-workers identify trojans in CNNs. This report showcases the methods and results of four featured compet…

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: Competition for SaTML 2024

  13. arXiv:2312.07784  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG eess.SP

    Robust MRI Reconstruction by Smoothed Unrolling (SMUG)

    Authors: Shijun Liang, Van Hoang Minh Nguyen, Jinghan Jia, Ismail Alkhouri, Sijia Liu, Saiprasad Ravishankar

    Abstract: As the popularity of deep learning (DL) in the field of magnetic resonance imaging (MRI) continues to rise, recent research has indicated that DL-based MRI reconstruction models might be excessively sensitive to minor input disturbances, including worst-case additive perturbations. This sensitivity often leads to unstable, aliased images. This raises the question of how to devise DL techniques for…

    Submitted 19 August, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  14. arXiv:2311.11003  [pdf, other]

    cs.LG math.PR stat.ML

    Wasserstein Convergence Guarantees for a General Class of Score-Based Generative Models

    Authors: Xuefeng Gao, Hoang M. Nguyen, Lingjiong Zhu

    Abstract: Score-based generative models (SGMs) is a recent class of deep generative models with state-of-the-art performance in many applications. In this paper, we establish convergence guarantees for a general class of SGMs in 2-Wasserstein distance, assuming accurate score estimates and smooth log-concave data distribution. We specialize our result to several concrete SGMs with specific choices of forwar…

    Submitted 15 February, 2025; v1 submitted 18 November, 2023; originally announced November 2023.

  15. arXiv:2211.07166  [pdf, other]

    cs.LG cs.CR cs.DC

    Optimal Privacy Preserving for Federated Learning in Mobile Edge Computing

    Authors: Hai M. Nguyen, Nam H. Chu, Diep N. Nguyen, Dinh Thai Hoang, Van-Dinh Nguyen, Minh Hoang Ha, Eryk Dutkiewicz, Marwan Krunz

    Abstract: Federated Learning (FL) with quantization and deliberately added noise over wireless networks is a promising approach to preserve user differential privacy (DP) while reducing wireless resources. Specifically, an FL process can be fused with quantized Binomial mechanism-based updates contributed by multiple users. However, optimizing quantization parameters, communication resources (e.g., transmit…

    Submitted 20 May, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

    Comments: 16 pages, 10 figures

  16. SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

    Authors: Bao Hieu Tran, Thanh Le-Cong, Huu Manh Nguyen, Duc Anh Le, Thanh Hung Nguyen, Phi Le Nguyen

    Abstract: In the last decades, scene text recognition has gained worldwide attention from both the academic community and actual users due to its importance in a wide range of applications. Despite achievements in optical character recognition, scene text recognition remains challenging due to inherent problems such as distortions or irregular layout. Most of the existing approaches mainly leverage recurren…

    Submitted 1 January, 2022; originally announced January 2022.

    Comments: Accepted to ICMLA 2020

    Journal ref: 2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA)

  17. arXiv:2106.05190  [pdf, ps, other]

    stat.ML cs.CE cs.LG

    DPER: Efficient Parameter Estimation for Randomly Missing Data

    Authors: Thu Nguyen, Khoi Minh Nguyen-Duy, Duy Ho Minh Nguyen, Binh T. Nguyen, Bruce Alan Wade

    Abstract: The missing data problem has been broadly studied in the last few decades and has various applications in different areas such as statistics or bioinformatics. Even though many methods have been developed to tackle this challenge, most of those are imputation techniques that require multiple iterations through the data before yielding convergence. In addition, such approaches may introduce extra b…

    Submitted 6 June, 2021; originally announced June 2021.

    Comments: 28 pages, 3 tables, 40 references

  18. arXiv:2009.11360  [pdf, other]

    cs.LG stat.ML

    EPEM: Efficient Parameter Estimation for Multiple Class Monotone Missing Data

    Authors: Thu Nguyen, Duy H. M. Nguyen, Huy Nguyen, Binh T. Nguyen, Bruce A. Wade

    Abstract: The problem of monotone missing data has been broadly studied during the last two decades and has many applications in different fields such as bioinformatics or statistics. Commonly used imputation techniques require multiple iterations through the data before yielding convergence. Moreover, those approaches may introduce extra noises and biases to the subsequent modeling. In this work, we derive…

    Submitted 23 September, 2020; originally announced September 2020.

    Comments: version 1

  19. Multiple Visual-Semantic Embedding for Video Retrieval from Query Sentence

    Authors: Huy Manh Nguyen, Tomo Miyazaki, Yoshihiro Sugaya, Shinichiro Omachi

    Abstract: Visual-semantic embedding aims to learn a joint embedding space where related video and sentence instances are located close to each other. Most existing methods put instances in a single embedding space. However, they struggle to embed instances due to the difficulty of matching visual dynamics in videos to textual features in sentences. A single space is not enough to accommodate various videos…

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: 8 pages, 5 figures

    Journal ref: Applied Sciences, 2021

  20. arXiv:1905.06509  [pdf, other]

    cs.CV

    TRk-CNN: Transferable Ranking-CNN for image classification of glaucoma, glaucoma suspect, and normal eyes

    Authors: Tae Joon Jun, Youngsub Eom, Dohyeun Kim, Cherry Kim, Ji-Hye Park, Hoang Minh Nguyen, Daeyoung Kim

    Abstract: In this paper, we proposed Transferable Ranking Convolutional Neural Network (TRk-CNN) that can be effectively applied when the classes of images to be classified show a high correlation with each other. The multi-class classification method based on the softmax function, which is generally used, is not effective in this case because the inter-class relationship is ignored. Although there is a Ran…

    Submitted 15 May, 2019; originally announced May 2019.

    Comments: 49 pages, 12 figures

  21. arXiv:1805.05727  [pdf, other]

    cs.CV

    2sRanking-CNN: A 2-stage ranking-CNN for diagnosis of glaucoma from fundus images using CAM-extracted ROI as an intermediate input

    Authors: Tae Joon Jun, Dohyeun Kim, Hoang Minh Nguyen, Daeyoung Kim, Youngsub Eom

    Abstract: Glaucoma is a disease in which the optic nerve is chronically damaged by the elevation of the intra-ocular pressure, resulting in visual field defect. Therefore, it is important to monitor and treat suspected patients before they are confirmed with glaucoma. In this paper, we propose a 2-stage ranking-CNN that classifies fundus images as normal, suspicious, and glaucoma. Furthermore, we propose a…

    Submitted 4 July, 2018; v1 submitted 15 May, 2018; originally announced May 2018.

    Comments: Accepted at BMVC 2018

  22. arXiv:1804.06812  [pdf, other]

    cs.CV

    ECG arrhythmia classification using a 2-D convolutional neural network

    Authors: Tae Joon Jun, Hoang Minh Nguyen, Daeyoun Kang, Dohyeun Kim, Daeyoung Kim, Young-Hak Kim

    Abstract: In this paper, we propose an effective electrocardiogram (ECG) arrhythmia classification method using a deep two-dimensional convolutional neural network (CNN) which recently shows outstanding performance in the field of pattern recognition. Every ECG beat was transformed into a two-dimensional grayscale image as an input data for the CNN classifier. Optimization of the proposed CNN classifier inc…

    Submitted 18 April, 2018; originally announced April 2018.

    Comments: Submitted to journal

  23. ASMCNN: An Efficient Brain Extraction Using Active Shape Model and Convolutional Neural Networks

    Authors: Duy H. M. Nguyen, Duy M. Nguyen, Mai T. N. Truong, Thu Nguyen, Khanh T. Tran, Nguyen A. Triet, Pham T. Bao, Binh T. Nguyen

    Abstract: Brain extraction (skull stripping) is a challenging problem in neuroimaging. It is due to the variability in conditions from data acquisition or abnormalities in images, making brain morphology and intensity characteristics changeable and complicated. In this paper, we propose an algorithm for skull stripping in Magnetic Resonance Imaging (MRI) scans, namely ASMCNN, by combining the Active Shape M…

    Submitted 27 January, 2022; v1 submitted 5 February, 2018; originally announced February 2018.

    Comments: 47 pages, 20 figures

    MSC Class: 68T10