Skip to main content

Showing 1–50 of 145 results for author: Nguyen, T N

.
  1. arXiv:2506.04013  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion

    Authors: Seymanur Akti, Tuan Nam Nguyen, Alexander Waibel

    Abstract: Expressive voice conversion aims to transfer both speaker identity and expressive attributes from a target speech to a given source speech. In this work, we improve over a self-supervised, non-autoregressive framework with a conditional variational autoencoder, focusing on reducing source timbre leakage and improving linguistic-acoustic disentanglement for better style transfer. To minimize style… ▽ More

    Submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to Interspeech 2025

  2. arXiv:2505.19679  [pdf, ps, other

    cs.CL cs.AI

    KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization

    Authors: Zhaolin Li, Yining Liu, Danni Liu, Tuan Nam Nguyen, Enes Yavuz Ugan, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

    Abstract: This paper presents KIT's submissions to the IWSLT 2025 low-resource track. We develop both cascaded systems, consisting of Automatic Speech Recognition (ASR) and Machine Translation (MT) models, and end-to-end (E2E) Speech Translation (ST) systems for three language pairs: Bemba, North Levantine Arabic, and Tunisian Arabic into English. Building upon pre-trained models, we fine-tune our systems w… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  3. arXiv:2505.14759  [pdf, ps, other

    cs.SE cs.LG

    LEANCODE: Understanding Models Better for Code Simplification of Pre-trained Large Language Models

    Authors: Yan Wang, Ling Ding, Tien N Nguyen, Shaohua Wang, Yanan Zheng

    Abstract: Large Language Models for code often entail significant computational complexity, which grows significantly with the length of the input code sequence. We propose LeanCode for code simplification to reduce training and prediction time, leveraging code contexts in utilizing attention scores to represent the tokens' importance. We advocate for the selective removal of tokens based on the average con… ▽ More

    Submitted 8 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

    Comments: Accepted to ACL 2025 main conference

  4. arXiv:2505.00831  [pdf, other

    cs.RO cs.CL

    SmallPlan: Leverage Small Language Models for Sequential Path Planning with Simulation-Powered, LLM-Guided Distillation

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Nhi H. Doan, Cuong A. Pham, Kentaro Inui, Dezhen Song

    Abstract: Efficient path planning in robotics, particularly within large-scale, dynamic environments, remains a significant hurdle. While Large Language Models (LLMs) offer strong reasoning capabilities, their high computational cost and limited adaptability in dynamic scenarios hinder real-time deployment on edge devices. We present SmallPlan -- a novel framework leveraging LLMs as teacher models to train… ▽ More

    Submitted 11 May, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

    Comments: Paper is under review

  5. arXiv:2504.19175  [pdf, other

    physics.med-ph

    A tissue-informed deep learning-based method for positron range correction in preclinical 68Ga PET imaging

    Authors: Nerea Encina-Baranda, Robert J. Paneque-Yunta, Javier Lopez-Rodriguez, Edwin C. Pratt, Trong Nghia Nguyen, Jan Grimm, Alejandro Lopez-Montes, Joaquin L. Herraiz

    Abstract: Positron range (PR) limits spatial resolution and quantitative accuracy in PET imaging, particularly for high-energy positron-emitting radionuclides like 68Ga. We propose a deep learning method using 3D residual encoder-decoder convolutional neural networks (3D RED-CNNs), incorporating tissue-dependent anatomical information through a u-map-dependent loss function. Models were trained with realist… ▽ More

    Submitted 27 April, 2025; originally announced April 2025.

    Comments: Submitted to EJNMMI Physics

  6. arXiv:2504.17287  [pdf, other

    cs.SE

    Combining Static and Dynamic Approaches for Mining and Testing Constraints for RESTful API Testing

    Authors: Hieu Huynh, Tri Le, Vu Nguyen, Tien N. Nguyen

    Abstract: In API testing, deriving logical constraints on API response bodies is crucial in generating the test cases to cover various aspects of RESTful APIs. However, existing approaches are limited to dynamic analysis in which constraints are extracted from the execution of APIs as part of the system under test. The key limitation of such a dynamic approach is its under-estimation in which inputs in API… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  7. arXiv:2504.16797  [pdf, other

    math.NA

    The extended adjoint state and nonlinearity in correlation-based passive imaging

    Authors: Tram Thi Ngoc Nguyen

    Abstract: This articles investigates physics-based passive imaging problem, wherein one infers an unknown medium using ambient noise and correlation of the noise signal. We develop a general backpropagation framework via the so-called extended adjoint state, suitable for any linear PDE; crucially, this approach reduces by half the number of required PDE solves. Applications to several different PDE models d… ▽ More

    Submitted 23 April, 2025; originally announced April 2025.

    MSC Class: 65M32; 65J22; 35R30

  8. arXiv:2504.15917  [pdf, other

    cs.SE

    Towards Test Generation from Task Description for Mobile Testing with Multi-modal Reasoning

    Authors: Hieu Huynh, Hai Phung, Hao Pham, Tien N. Nguyen, Vu Nguyen

    Abstract: In Android GUI testing, generating an action sequence for a task that can be replayed as a test script is common. Generating sequences of actions and respective test scripts from task goals described in natural language can eliminate the need for manually writing test scripts. However, existing approaches based on large language models (LLM) often struggle with identifying the final action, and ei… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: Under review for a conference

  9. arXiv:2504.14757  [pdf, other

    cs.SE cs.AI

    SWE-Synth: Synthesizing Verifiable Bug-Fix Data to Enable Large Language Models in Resolving Real-World Bugs

    Authors: Minh V. T. Pham, Huy N. Phan, Hoang N. Phan, Cuong Le Chi, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Large language models (LLMs) are transforming automated program repair (APR) through agent-based approaches that localize bugs, generate patches, and verify fixes. However, the lack of high-quality, scalable training datasets, especially those with verifiable outputs and intermediate reasoning traces-limits progress, particularly for open-source models. In this work, we present SWE-Synth, a framew… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: Work in progress

  10. arXiv:2504.14336  [pdf, other

    cs.SE

    Toward Generation of Test Cases from Task Descriptions via History-aware Planning

    Authors: Duy Cao, Phu Nguyen, Vy Le, Tien N. Nguyen, Vu Nguyen

    Abstract: In automated web testing, generating test scripts from natural language task descriptions is crucial for enhancing the test generation process. This activity involves creating the correct sequences of actions to form test scripts for future testing activities. Current state-of-the-art approaches are limited in generating these action sequences, as they either demand substantial manual effort for h… ▽ More

    Submitted 19 April, 2025; originally announced April 2025.

    Comments: Under review

  11. arXiv:2504.10603  [pdf, other

    cs.CR

    Demo: ViolentUTF as An Accessible Platform for Generative AI Red Teaming

    Authors: Tam n. Nguyen

    Abstract: The rapid integration of Generative AI (GenAI) into various applications necessitates robust risk management strategies which includes Red Teaming (RT) - an evaluation method for simulating adversarial attacks. Unfortunately, RT for GenAI is often hindered by technical complexity, lack of user-friendly interfaces, and inadequate reporting features. This paper introduces Violent UTF - an accessible… ▽ More

    Submitted 29 April, 2025; v1 submitted 14 April, 2025; originally announced April 2025.

    Comments: 3 pages, 1 figure, 1 table. This is a demo paper for CyberWarrior2025. The video demo is at https://youtu.be/c-UCYXq0rfY. Codes will be shared when the competition concludes in June 2025 due to embargo requirements

  12. arXiv:2504.05748  [pdf, other

    cs.CV cs.HC

    When Less Is More: A Sparse Facial Motion Structure For Listening Motion Learning

    Authors: Tri Tung Nguyen Nguyen, Quang Tien Dam, Dinh Tuan Tran, Joo-Ho Lee

    Abstract: Effective human behavior modeling is critical for successful human-robot interaction. Current state-of-the-art approaches for predicting listening head behavior during dyadic conversations employ continuous-to-discrete representations, where continuous facial motion sequence is converted into discrete latent tokens. However, non-verbal facial motion presents unique challenges owing to its temporal… ▽ More

    Submitted 8 April, 2025; originally announced April 2025.

  13. arXiv:2504.05747  [pdf, other

    cs.CL

    SEA-LION: Southeast Asian Languages in One Network

    Authors: Raymond Ng, Thanh Ngan Nguyen, Yuli Huang, Ngee Chia Tai, Wai Yi Leong, Wei Qi Leong, Xianbin Yong, Jian Gang Ngui, Yosephine Susanto, Nicholas Cheng, Hamsawardhini Rengarajan, Peerat Limkonchotiwat, Adithya Venkatadri Hulagadri, Kok Wai Teng, Yeo Yeow Tong, Bryan Siow, Wei Yi Teo, Wayne Lau, Choon Meng Tan, Brandon Ong, Zhi Hao Ong, Jann Railey Montalan, Adwin Chan, Sajeban Antonyrex, Ren Lee , et al. (6 additional authors not shown)

    Abstract: Recently, Large Language Models (LLMs) have dominated much of the artificial intelligence scene with their ability to process and generate natural languages. However, the majority of LLM research and development remains English-centric, leaving low-resource languages such as those in the Southeast Asian (SEA) region under-represented. To address this representation gap, we introduce Llama-SEA-LION… ▽ More

    Submitted 15 April, 2025; v1 submitted 8 April, 2025; originally announced April 2025.

    Comments: We released our model at https://huggingface.co/collections/aisingapore/sea-lionv3-672589a39cdadd6a5b199581

  14. arXiv:2504.01112  [pdf

    physics.optics

    Soft X-ray high-harmonic generation in an anti-resonant hollow core fiber driven by a 3 $μ$m ultrafast laser

    Authors: Drew Morrill, Will Hettel, Daniel Carlson, Benjamin Shearer, Clay Klein, Jeremy Thurston, Grzegorz Golba, Rae Larsen, Gabriella Seifert, James Uhrich, Daniel Lesko, Tin Nghia Nguyen, Gunnar Arisholm, Jonathan Knight, Scott Diddams, Margaret Murnane, Henry Kapteyn, Michaƫl Hemmer

    Abstract: High-harmonic upconversion driven by a mid-infrared femtosecond laser can generate coherent soft X-ray beams in a tabletop-scale setup. Here, we report on a compact ytterbium-pumped optical parametric chirped pulse amplifier (OPCPA) laser system seeded by an all-fiber front-end and employing periodically-poled lithium niobate (PPLN) nonlinear media operated near the pulse fluence limits of current… ▽ More

    Submitted 1 April, 2025; originally announced April 2025.

    Comments: 10 pages, 5 figures, under review

  15. arXiv:2503.20934  [pdf, other

    cs.SE

    Leveraging LLMs, IDEs, and Semantic Embeddings for Automated Move Method Refactoring

    Authors: Fraol Batole, Abhiram Bellur, Malinda Dilhara, Mohammed Raihan Ullah, Yaroslav Zharov, Timofey Bryksin, Kai Ishikawa, Haifeng Chen, Masaharu Morimoto, Shota Motoura, Takeo Hosomi, Tien N. Nguyen, Hridesh Rajan, Nikolaos Tsantalis, Danny Dig

    Abstract: MOVEMETHOD is a hallmark refactoring. Despite a plethora of research tools that recommend which methods to move and where, these recommendations do not align with how expert developers perform MOVEMETHOD. Given the extensive training of Large Language Models and their reliance upon naturalness of code, they should expertly recommend which methods are misplaced in a given class and which classes ar… ▽ More

    Submitted 26 March, 2025; originally announced March 2025.

    Comments: 12 pages, 2 figures

  16. arXiv:2503.11588  [pdf, other

    eess.IV

    Generalization performance of neural mapping schemes for the space-time interpolation of satellite-derived ocean colour datasets

    Authors: Thi Thuy Nga Nguyen, ClƩment Dorffer, FrƩdƩric Jourdin, Ronan Fablet

    Abstract: Neural mapping schemes have become appealing approaches to deliver gap-free satellite-derived products for sea surface tracers. The generalization performance of these learning-based approaches naturally arises as a key challenge. This is particularly true for satellite-derived ocean colour products given the variety of bio-optical variables of interest, as well as the diversity of processes and s… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 9 pages, 6 figures. Submitted to IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing

    ACM Class: I.2.10; I.4.5

  17. arXiv:2503.11532  [pdf, other

    eess.IV

    Observation-only learning of neural mapping schemes for gappy satellite-derived ocean colour parameters

    Authors: ClƩment Dorffer, FrƩdƩric Jourdin, Thi Thuy Nga Nguyen, Rodolphe Devillers, David Mouillot, Ronan Fablet

    Abstract: Monitoring optical properties of coastal and open ocean waters is crucial to assessing the health of marine ecosystems. Deep learning offers a promising approach to address these ecosystem dynamics, especially in scenarios where gap-free ground-truth data is lacking, which poses a challenge for designing effective training frameworks. Using an advanced neural variational data assimilation scheme (… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

    Comments: 10 pages, 9 figures, submitted to IEEE Transactions on Geoscience and Remote Sensing

    ACM Class: I.4.5; I.2.10

  18. arXiv:2502.08085  [pdf, other

    cs.GR

    Interactive Holographic Visualization for 3D Facial Avatar

    Authors: Tri Tung Nguyen Nguyen, Fujii Yasuyuki, Dinh Tuan Tran, Joo-Ho Lee

    Abstract: Traditional methods for visualizing dynamic human expressions, particularly in medical training, often rely on flat-screen displays or static mannequins, which have proven inefficient for realistic simulation. In response, we propose a platform that leverages a 3D interactive facial avatar capable of displaying non-verbal feedback, including pain signals. This avatar is projected onto a stereoscop… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

  19. arXiv:2412.08250  [pdf, other

    cs.IT

    Fast Beam Placement for Ultra-Dense LEO Networks

    Authors: Trinh Van Chien, Nguyen Minh Quan, Tri Nhu Do, Cuong Le, Tan N. Nguyen, Symeon Chatzinotas

    Abstract: Low Earth orbit (LEO) satellites has brought about significant improvements in wireless communications, characterized by low latency and reduced transmission loss compared to geostationary orbit (GSO) satellites. Ultra-dense LEO satellites can serve many users by generating active beams effective to their locations. The beam placement problem is challenging but important for efficiently allocating… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 5 pages, 3 figures. Accepted by IEEE WCL

  20. arXiv:2411.19917  [pdf, other

    math.NA

    Traction force microscopy for linear and nonlinear elastic materials as a parameter identification inverse problem

    Authors: Gesa Sarnighausen, Tram Thi Ngoc Nguyen, Thorsten Hohage, Mangalika Sinha, Sarah Koester, Timo Betz, Ulrich Sebastian Schwarz, Anne Wald

    Abstract: Traction force microscopy is a method widely used in biophysics and cell biology to determine forces that biological cells apply to their environment. In the experiment, the cells adhere to a soft elastic substrate, which is then deformed in response to cellular traction forces. The inverse problem consists in computing the traction stress applied by the cell from microscopy measurements of the su… ▽ More

    Submitted 29 November, 2024; originally announced November 2024.

    Comments: 28 pages, 9 figures

    MSC Class: 92-08; 35Q92; 35R30

  21. arXiv:2411.10509  [pdf, other

    cs.CV cs.LG

    TESGNN: Temporal Equivariant Scene Graph Neural Networks for Efficient and Robust Multi-View 3D Scene Understanding

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Lan C. Ngo, Truong Do, Dezhen Song, Truong-Son Hy

    Abstract: Scene graphs have proven to be highly effective for various scene understanding tasks due to their compact and explicit representation of relational information. However, current methods often overlook the critical importance of preserving symmetry when generating scene graphs from 3D point clouds, which can lead to reduced accuracy and robustness, particularly when dealing with noisy, multi-view… ▽ More

    Submitted 2 March, 2025; v1 submitted 15 November, 2024; originally announced November 2024.

    Comments: arXiv admin note: text overlap with arXiv:2407.00609

  22. arXiv:2411.01808  [pdf, other

    cs.LG stat.ML

    Fixing the Loose Brake: Exponential-Tailed Stopping Time in Best Arm Identification

    Authors: Kapilan Balagopalan, Tuan Ngo Nguyen, Yao Zhao, Kwang-Sung Jun

    Abstract: The best arm identification problem requires identifying the best alternative (i.e., arm) in active experimentation using the smallest number of experiments (i.e., arm pulls), which is crucial for cost-efficient and timely decision-making processes. In the fixed confidence setting, an algorithm must stop data-dependently and return the estimated best arm with a correctness guarantee. Since this st… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  23. arXiv:2411.00405  [pdf, other

    stat.ML cs.LG

    HAVER: Instance-Dependent Error Bounds for Maximum Mean Estimation and Applications to Q-Learning and Monte Carlo Tree Search

    Authors: Tuan Ngo Nguyen, Jay Barrett, Kwang-Sung Jun

    Abstract: We study the problem of estimating the \emph{value} of the largest mean among K distributions via samples from them (rather than estimating \emph{which} distribution has the largest mean), which arises from various machine learning tasks including Q-learning and Monte Carlo Tree Search (MCTS). While there have been a few proposed algorithms, their performance analyses have been limited to their bi… ▽ More

    Submitted 28 April, 2025; v1 submitted 1 November, 2024; originally announced November 2024.

    Comments: In Proceedings of the Artificial Intelligence and Statistics (AISTATS) 2025

  24. arXiv:2410.23402  [pdf, other

    cs.SE

    VisualCoder: Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning

    Authors: Cuong Chi Le, Hoang-Chau Truong-Vinh, Huy Nhat Phan, Dung Duy Le, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Predicting program behavior and reasoning about code execution remain significant challenges in software engineering, particularly for large language models (LLMs) designed for code analysis. While these models excel at understanding static syntax, they often struggle with dynamic reasoning tasks. We introduce VisualCoder, a simple yet effective approach that enhances code reasoning by integrating… ▽ More

    Submitted 9 February, 2025; v1 submitted 30 October, 2024; originally announced October 2024.

    Comments: NAACL 2025

  25. arXiv:2410.14997  [pdf, other

    cs.SD cs.AI eess.AS

    Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS

    Authors: Tuan Nam Nguyen, Seymanur Akti, Ngoc Quan Pham, Alexander Waibel

    Abstract: Previous approaches on accent conversion (AC) mainly aimed at making non-native speech sound more native while maintaining the original content and speaker identity. However, non-native speakers sometimes have pronunciation issues, which can make it difficult for listeners to understand them. Hence, we developed a new AC approach that not only focuses on accent conversion but also improves pronunc… ▽ More

    Submitted 4 March, 2025; v1 submitted 19 October, 2024; originally announced October 2024.

    Comments: accepted at ICASSP 2025

  26. arXiv:2410.03734  [pdf, other

    cs.SD cs.CL eess.AS

    Accent conversion using discrete units with parallel data synthesized from controllable accented TTS

    Authors: Tuan Nam Nguyen, Ngoc Quan Pham, Alexander Waibel

    Abstract: The goal of accent conversion (AC) is to convert speech accents while preserving content and speaker identity. Previous methods either required reference utterances during inference, did not preserve speaker identity well, or used one-to-one systems that could only be trained for each non-native accent. This paper presents a promising AC model that can convert many accents into native to overcome… ▽ More

    Submitted 30 September, 2024; originally announced October 2024.

    Comments: Accepted at Syndata4genAI

  27. arXiv:2409.20033  [pdf

    cs.MA econ.GN

    Fuel tax loss in a world of electric mobility: A window of opportunity for congestion pricing

    Authors: Thi Ngoc Nguyen, Felix Muesgens

    Abstract: The continued transition towards electric mobility will decrease energy tax revenues worldwide, which has substantial implications for government funds. At the same time, demand for transportation is ever increasing, which in turn increases congestion problems. Combining both challenges, this paper assesses the effectiveness of congestion pricing as a sustainable revenue stream to offset fuel tax… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: A part of this work has been presented in the International Conference on Operations Research OR2024

  28. arXiv:2409.16299  [pdf, other

    cs.SE cs.AI

    HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

    Authors: Huy Nhat Phan, Tien N. Nguyen, Phong X. Nguyen, Nghi D. Q. Bui

    Abstract: Large Language Models (LLMs) have revolutionized software engineering (SE), showcasing remarkable proficiency in various coding tasks. Despite recent advancements that have enabled the creation of autonomous software agents utilizing LLMs for end-to-end development tasks, these systems are typically designed for specific SE functions. We introduce HyperAgent, an innovative generalist multi-agent s… ▽ More

    Submitted 5 November, 2024; v1 submitted 9 September, 2024; originally announced September 2024.

    Comments: 49 pages

  29. arXiv:2409.11635  [pdf, other

    cs.CV

    PainDiffusion: Learning to Express Pain

    Authors: Quang Tien Dam, Tri Tung Nguyen Nguyen, Yuki Endo, Dinh Tuan Tran, Joo-Ho Lee

    Abstract: Accurate pain expression synthesis is essential for improving clinical training and human-robot interaction. Current Robotic Patient Simulators (RPSs) lack realistic pain facial expressions, limiting their effectiveness in medical training. In this work, we introduce PainDiffusion, a generative model that synthesizes naturalistic facial pain expressions. Unlike traditional heuristic or autoregress… ▽ More

    Submitted 4 March, 2025; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: 8 pages, 9 figures

  30. arXiv:2409.06854  [pdf, other

    math.NA math.OC

    Bi-level regularization via iterative mesh refinement for aeroacoustics

    Authors: Christian Aarset, Tram Thi Ngoc Nguyen

    Abstract: In this work, we illustrate the connection between adaptive mesh refinement for finite element discretized PDEs and the recently developed \emph{bi-level regularization algorithm}. By adaptive mesh refinement according to data noise, regularization effect and convergence are immediate consequences. We moreover demonstrate its numerical advantages to the classical Landweber algorithm in term of tim… ▽ More

    Submitted 31 October, 2024; v1 submitted 10 September, 2024; originally announced September 2024.

    MSC Class: 65M32; 65J22; 35R30

  31. arXiv:2409.03834  [pdf, ps, other

    math.NA math.AP math.OC

    Sequential bi-level regularized inversion with application to hidden reaction law discovery

    Authors: Tram Thi Ngoc Nguyen

    Abstract: In this article, we develop and present a novel regularization scheme for ill-posed inverse problems governed by nonlinear \blue{time-dependent} partial differential equations (PDEs). In our recent work, we introduced a bi-level regularization framework. This study significantly improves upon the bi-level algorithm by sequentially initializing the lower-level problem, yielding accelerated converge… ▽ More

    Submitted 29 May, 2025; v1 submitted 5 September, 2024; originally announced September 2024.

    MSC Class: 65M32; 65J22; 35R30

    Journal ref: Inverse Problems, 2025

  32. arXiv:2408.02816  [pdf, other

    cs.SE

    CodeFlow: Program Behavior Prediction with Dynamic Dependencies Learning

    Authors: Cuong Chi Le, Hoang Nhat Phan, Huy Nhat Phan, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Predicting program behavior without execution is a critical task in software engineering. Existing models often fall short in capturing the dynamic dependencies among program elements. To address this, we present CodeFlow, a novel machine learning-based approach that predicts code coverage and detects runtime errors by learning both static and dynamic dependencies within the code. By using control… ▽ More

    Submitted 9 February, 2025; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: FORGE 2025

  33. Segment-Based Test Case Prioritization: A Multi-objective Approach

    Authors: Hieu Huynh, Nhu Pham, Tien N. Nguyen, Vu Nguyen

    Abstract: Regression testing of software is a crucial but time-consuming task, especially in the context of user interface (UI) testing where multiple microservices must be validated simultaneously. Test case prioritization (TCP) is a cost-efficient solution to address this by scheduling test cases in an execution order that maximizes an objective function, generally aimed at increasing the fault detection… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: ISSTA 2024

  34. arXiv:2407.09281  [pdf, other

    cs.AI

    Predicting and Understanding Human Action Decisions: Insights from Large Language Models and Cognitive Instance-Based Learning

    Authors: Thuy Ngoc Nguyen, Kasturi Jamale, Cleotilde Gonzalez

    Abstract: Large Language Models (LLMs) have demonstrated their capabilities across various tasks, from language translation to complex reasoning. Understanding and predicting human behavior and biases are crucial for artificial intelligence (AI) assisted systems to provide useful assistance, yet it remains an open question whether these models can achieve this. This paper addresses this gap by leveraging th… ▽ More

    Submitted 5 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  35. arXiv:2407.07472  [pdf, other

    cs.SE cs.AI

    Rectifier: Code Translation with Corrector via LLMs

    Authors: Xin Yin, Chao Ni, Tien N. Nguyen, Shaohua Wang, Xiaohu Yang

    Abstract: Software migration is garnering increasing attention with the evolution of software and society. Early studies mainly relied on handcrafted translation rules to translate between two languages, the translation process is error-prone and time-consuming. In recent years, researchers have begun to explore the use of pre-trained large language models (LLMs) in code translation. However, code translati… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2308.03109, arXiv:2302.03908 by other authors

  36. arXiv:2407.00609  [pdf, other

    cs.CV cs.LG

    ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding

    Authors: Quang P. M. Pham, Khoi T. N. Nguyen, Lan C. Ngo, Truong Do, Truong Son Hy

    Abstract: Scene graphs have been proven to be useful for various scene understanding tasks due to their compact and explicit nature. However, existing approaches often neglect the importance of maintaining the symmetry-preserving property when generating scene graphs from 3D point clouds. This oversight can diminish the accuracy and robustness of the resulting scene graphs, especially when handling noisy, m… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  37. arXiv:2406.06863  [pdf, other

    cs.CR cs.AI cs.HC

    Ollabench: Evaluating LLMs' Reasoning for Human-centric Interdependent Cybersecurity

    Authors: Tam n. Nguyen

    Abstract: Large Language Models (LLMs) have the potential to enhance Agent-Based Modeling by better representing complex interdependent cybersecurity systems, improving cybersecurity threat modeling and risk management. However, evaluating LLMs in this context is crucial for legal compliance and effective application development. Existing LLM evaluation frameworks often overlook the human factor and cogniti… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: 12 pages, 7 figures, 2 tables The final conference/journal version may have significantly more content updates

    ACM Class: I.2.0; J.4

  38. arXiv:2405.03427  [pdf, other

    cs.LG

    Geometry-aware framework for deep energy method: an application to structural mechanics with hyperelastic materials

    Authors: Thi Nguyen Khoa Nguyen, Thibault Dairay, Raphaƫl Meunier, Christophe Millet, Mathilde Mougeot

    Abstract: Physics-Informed Neural Networks (PINNs) have gained considerable interest in diverse engineering domains thanks to their capacity to integrate physical laws into deep learning models. Recently, geometry-aware PINN-based approaches that employ the strong form of underlying physical system equations have been developed with the aim of integrating geometric information into PINNs. Despite ongoing re… ▽ More

    Submitted 6 May, 2024; originally announced May 2024.

    Comments: 28 pages, 26 figures, 4 tables

  39. arXiv:2403.07763  [pdf, other

    cs.NI cs.ET

    Emerging Technologies for 6G Non-Terrestrial-Networks: From Academia to Industrial Applications

    Authors: Cong T. Nguyen, Yuris Mulya Saputra, Nguyen Van Huynh, Tan N. Nguyen, Dinh Thai Hoang, Diep N Nguyen, Van-Quan Pham, Miroslav Voznak, Symeon Chatzinotas, Dinh-Hieu Tran

    Abstract: Terrestrial networks form the fundamental infrastructure of modern communication systems, serving more than 4 billion users globally. However, terrestrial networks are facing a wide range of challenges, from coverage and reliability to interference and congestion. As the demands of the 6G era are expected to be much higher, it is crucial to address these challenges to ensure a robust and efficient… ▽ More

    Submitted 3 July, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

    Comments: 35 pages

  40. arXiv:2403.06095  [pdf, other

    cs.SE cs.AI

    RepoHyper: Search-Expand-Refine on Semantic Graphs for Repository-Level Code Completion

    Authors: Huy N. Phan, Hoang N. Phan, Tien N. Nguyen, Nghi D. Q. Bui

    Abstract: Code Large Language Models (CodeLLMs) have demonstrated impressive proficiency in code completion tasks. However, they often fall short of fully understanding the extensive context of a project repository, such as the intricacies of relevant files and class hierarchies, which can result in less precise completions. To overcome these limitations, we present \tool, a multifaceted framework designed… ▽ More

    Submitted 14 August, 2024; v1 submitted 10 March, 2024; originally announced March 2024.

  41. arXiv:2403.00488  [pdf, other

    astro-ph.SR math.NA math.OC

    Inferring solar differential rotation and viscosity via passive imaging with inertial waves

    Authors: Tram Thi Ngoc Nguyen, Thorsten Hohage, Damien Fournier, Laurent Gizon

    Abstract: The recent discovery of inertial waves on the surface of the Sun offers new possibilities to learn about the solar interior. These waves are long-lived with a period on the order of the Sun rotation period ($\sim$27 days) and are sensitive to parameters deep inside the Sun. They are excited by turbulent convection, leading to a passive imaging problem. In this work, we present the forward and inve… ▽ More

    Submitted 22 March, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: proceedings paper

    MSC Class: 65M32; 65J22; 35R30

  42. arXiv:2402.06695  [pdf, other

    cs.AI cs.LG eess.SY

    Integrating LLMs for Explainable Fault Diagnosis in Complex Systems

    Authors: Akshay J. Dave, Tat Nghia Nguyen, Richard B. Vilim

    Abstract: This paper introduces an integrated system designed to enhance the explainability of fault diagnostics in complex systems, such as nuclear power plants, where operator understanding is critical for informed decision-making. By combining a physics-based diagnostic tool with a Large Language Model, we offer a novel solution that not only identifies faults but also provides clear, understandable expl… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

    Comments: 4 pages

  43. Data-Driven Evidence-Based Syntactic Sugar Design

    Authors: David OBrien, Robert Dyer, Tien N. Nguyen, Hridesh Rajan

    Abstract: Programming languages are essential tools for developers, and their evolution plays a crucial role in supporting the activities of developers. One instance of programming language evolution is the introduction of syntactic sugars, which are additional syntax elements that provide alternative, more readable code constructs. However, the process of designing and evolving a programming language has t… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: 12 pages, 12 figures, to be published in ICSE'24

  44. arXiv:2401.00293  [pdf, other

    math.FA math.OC

    Representation formulas for maximal monotone operators of type (D) in Banach spaces whose dual spaces are strictly convex

    Authors: Nguyen B. Tran, Tran N. Nguyen, Huynh M. Hien

    Abstract: This work deals with a maximal monotone operator $A$ of type (D) in a Banach space whose dual space is strictly convex. We establish some representations for the value $Ax$ at a given point $x$ via its values at nearby points of $x$. We show that the faces of $Ax$ are contained in the set of all weak$^*$ convergent limits of bounded nets of the operator at nearby points of $x$, then we obtain a re… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: Comments are welcome!

    MSC Class: 47H05; 47H04; 47N10

  45. arXiv:2311.00216  [pdf, other

    physics.atom-ph

    FPGA-based residual amplitude modulation suppression and control for compact atomic clocks

    Authors: Tin Nghia Nguyen, Thomas R. Schibli

    Abstract: We designed an FPGA fabric to provide phase modulation techniques to lock lasers to optical frequency references. The method incorporates an active residual-amplitude-modulation (RAM) suppression scheme that relies on complex modulation. All the required servos to construct an optical atomic clock are incorporated onto the same low-cost, commercial FPGA chip. We demonstrate a reliable, long-term R… ▽ More

    Submitted 31 October, 2023; originally announced November 2023.

  46. arXiv:2310.07984  [pdf

    cs.AI cs.CE

    Large Language Models for Scientific Synthesis, Inference and Explanation

    Authors: Yizhen Zheng, Huan Yee Koh, Jiaxin Ju, Anh T. N. Nguyen, Lauren T. May, Geoffrey I. Webb, Shirui Pan

    Abstract: Large language models are a form of artificial intelligence systems whose primary knowledge consists of the statistical patterns, semantic relationships, and syntactical structures of language1. Despite their limited forms of "knowledge", these systems are adept at numerous complex tasks including creative writing, storytelling, translation, question-answering, summarization, and computer code gen… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

    Comments: Supplementary Information: https://drive.google.com/file/d/1KrpUpzuFTeMx6a6zl18lqdo8vV-UUa1Z/view?usp=sharing Github Repo: https://github.com/zyzisastudyreallyhardguy/LLM4SD

  47. Bi-level iterative regularization for inverse problems in nonlinear PDEs

    Authors: Tram Thi Ngoc Nguyen

    Abstract: We investigate the ill-posed inverse problem of recovering unknown spatially dependent parameters in nonlinear evolution PDEs. We propose a bi-level Landweber scheme, where the upper-level parameter reconstruction embeds a lower-level state approximation. This can be seen as combining the classical reduced setting and the newer all-at-once setting, allowing us to, respectively, utilize well-posedn… ▽ More

    Submitted 5 February, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

    MSC Class: 65M32; 65J22; 35R30

    Journal ref: Inverse Problems, Volume 40, Number 4, 2024

  48. arXiv:2308.09219  [pdf, other

    cs.AI cs.MA

    Learning in Cooperative Multiagent Systems Using Cognitive and Machine Models

    Authors: Thuy Ngoc Nguyen, Duy Nhat Phan, Cleotilde Gonzalez

    Abstract: Developing effective Multi-Agent Systems (MAS) is critical for many applications requiring collaboration and coordination with humans. Despite the rapid advance of Multi-Agent Deep Reinforcement Learning (MADRL) in cooperative MAS, one major challenge is the simultaneous learning and interaction of independent agents in dynamic environments in the presence of stochastic rewards. State-of-the-art M… ▽ More

    Submitted 17 August, 2023; originally announced August 2023.

    Comments: 22 pages, 5 figures, 2 tables

  49. arXiv:2307.11939  [pdf, other

    cs.LG

    Batch Clipping and Adaptive Layerwise Clipping for Differential Private Stochastic Gradient Descent

    Authors: Toan N. Nguyen, Phuong Ha Nguyen, Lam M. Nguyen, Marten Van Dijk

    Abstract: Each round in Differential Private Stochastic Gradient Descent (DPSGD) transmits a sum of clipped gradients obfuscated with Gaussian noise to a central server which uses this to update a global model which often represents a deep neural network. Since the clipped gradients are computed separately, which we call Individual Clipping (IC), deep neural networks like resnet-18 cannot use Batch Normaliz… ▽ More

    Submitted 21 July, 2023; originally announced July 2023.

    Comments: 20 pages, 18 Figures

  50. arXiv:2307.08171  [pdf, other

    cs.AI cs.HC

    Credit Assignment: Challenges and Opportunities in Developing Human-like AI Agents

    Authors: Thuy Ngoc Nguyen, Chase McDonald, Cleotilde Gonzalez

    Abstract: Temporal credit assignment is crucial for learning and skill development in natural and artificial intelligence. While computational methods like the TD approach in reinforcement learning have been proposed, it's unclear if they accurately represent how humans handle feedback delays. Cognitive models intend to represent the mental steps by which humans solve problems and perform a number of tasks,… ▽ More

    Submitted 16 July, 2023; originally announced July 2023.

    Comments: 11 figures; 3 tables