Showing 1–50 of 150 results for author: Truong, T

  1. arXiv:2507.06563  [pdf, ps, other]

    cs.IR cs.CL

    DS@GT at CheckThat! 2025: Exploring Retrieval and Reranking Pipelines for Scientific Claim Source Retrieval on Social Media Discourse

    Authors: Jeanette Schofield, Shuyu Tian, Hoang Thanh Thanh Truong, Maximilian Heil

    Abstract: Social media users often make scientific claims without citing where these claims come from, generating a need to verify these claims. This paper details work done by the DS@GT team for CLEF 2025 CheckThat! Lab Task 4b Scientific Claim Source Retrieval which seeks to find relevant scientific papers based on implicit references in tweets. Our team explored 6 different data augmentation techniques,… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

  2. arXiv:2507.06205  [pdf, ps, other]

    cs.CL

    DS@GT at CheckThat! 2025: Ensemble Methods for Detection of Scientific Discourse on Social Media

    Authors: Ayush Parikh, Hoang Thanh Thanh Truong, Jeanette Schofield, Maximilian Heil

    Abstract: In this paper, we, as the DS@GT team for CLEF 2025 CheckThat! Task 4a Scientific Web Discourse Detection, present the methods we explored for this task. For this multiclass classification task, we determined if a tweet contained a scientific claim, a reference to a scientific study or publication, and/or mentions of scientific entities, such as a university or a scientist. We present 3 modeling ap… ▽ More

    Submitted 8 July, 2025; originally announced July 2025.

  3. arXiv:2506.21887  [pdf, ps, other]

    cs.AI cs.LG

    Interactive Multi-Objective Probabilistic Preference Learning with Soft and Hard Bounds

    Authors: Edward Chen, Sang T. Truong, Natalie Dullerud, Sanmi Koyejo, Carlos Guestrin

    Abstract: High-stakes decision-making involves navigating multiple competing objectives with expensive evaluations. For instance, in brachytherapy, clinicians must balance maximizing tumor coverage (e.g., an aspirational target or soft bound of >95% coverage) against strict organ dose limits (e.g., a non-negotiable hard bound of <601 cGy to the bladder), with each plan evaluation being resource-intensive. S… ▽ More

    Submitted 26 June, 2025; originally announced June 2025.

  4. arXiv:2506.08306  [pdf, ps, other]

    cs.AI astro-ph.IM

    AstroCompress: A benchmark dataset for multi-purpose compression of astronomical data

    Authors: Tuan Truong, Rithwik Sudharsan, Yibo Yang, Peter Xiangyuan Ma, Ruihan Yang, Stephan Mandt, Joshua S. Bloom

    Abstract: The site conditions that make astronomical observatories in space and on the ground so desirable -- cold and dark -- demand a physical remoteness that leads to limited data transmission capabilities. Such transmission limitations directly bottleneck the amount of data acquired and in an era of costly modern observatories, any improvements in lossless data compression has the potential scale to bil… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: ICLR 2025 conference paper. See reviews at https://openreview.net/forum?id=kQCHCkNk7s

  5. arXiv:2506.07247  [pdf, ps, other]

    cs.LG

    Promoting Ensemble Diversity with Interactive Bayesian Distributional Robustness for Fine-tuning Foundation Models

    Authors: Ngoc-Quan Pham, Tuan Truong, Quyen Tran, Tan Nguyen, Dinh Phung, Trung Le

    Abstract: We introduce Interactive Bayesian Distributional Robustness (IBDR), a novel Bayesian inference framework that allows modeling the interactions between particles, thereby enhancing ensemble quality through increased particle diversity. IBDR is grounded in a generalized theoretical framework that connects the distributional population loss with the approximate posterior, motivating a practical dual… ▽ More

    Submitted 8 June, 2025; originally announced June 2025.

    Comments: ICML 2025 (Poster)

  6. arXiv:2506.02314  [pdf, ps, other]

    cs.AI cs.CL

    ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code

    Authors: Tianyu Hua, Harper Hua, Violet Xiang, Benjamin Klieger, Sang T. Truong, Weixin Liang, Fan-Yun Sun, Nick Haber

    Abstract: Large language models (LLMs) have shown promise in transforming machine learning research, yet their capability to faithfully implement novel ideas from recent research papers-ideas unseen during pretraining-remains unclear. We introduce ResearchCodeBench, a benchmark of 212 coding challenges that evaluates LLMs' ability to translate cutting-edge ML contributions from top 2024-2025 research papers… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

  7. arXiv:2505.24649  [pdf, ps, other]

    cs.CV cs.AI

    BIMA: Bijective Maximum Likelihood Learning Approach to Hallucination Prediction and Mitigation in Large Vision-Language Models

    Authors: Huu-Thien Tran, Thanh-Dat Truong, Khoa Luu

    Abstract: Large vision-language models have become widely adopted to advance in various domains. However, developing a trustworthy system with minimal interpretable characteristics of large-scale models presents a significant challenge. One of the most prevalent terms associated with the fallacy functions caused by these systems is hallucination, where the language model generates a response that does not c… ▽ More

    Submitted 30 May, 2025; originally announced May 2025.

    Comments: CVPRW 2025, 8 pages, 4 figures

  8. arXiv:2505.12000  [pdf, ps, other]

    cs.CV

    IQBench: How "Smart'' Are Vision-Language Models? A Study with Human IQ Tests

    Authors: Tan-Hanh Pham, Phu-Vinh Nguyen, Dang The Hung, Bui Trong Duong, Vu Nguyen Thanh, Chris Ngo, Tri Quang Truong, Truong-Son Hy

    Abstract: Although large Vision-Language Models (VLMs) have demonstrated remarkable performance in a wide range of multimodal tasks, their true reasoning capabilities on human IQ tests remain underexplored. To advance research on the fluid intelligence of VLMs, we introduce **IQBench**, a new benchmark designed to evaluate VLMs on standardized visual IQ tests. We focus on evaluating the reasoning capabiliti… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: IQ Test for Multimodal Models

  9. arXiv:2504.17346  [pdf]

    cs.NE cs.AI

    Dual-Individual Genetic Algorithm: A Dual-Individual Approach for Efficient Training of Multi-Layer Neural Networks

    Authors: Tran Thuy Nga Truong, Jooyong Kim

    Abstract: This paper introduces an enhanced Genetic Algorithm technique, which optimizes neural networks for binary image classification tasks, such as cat vs. non-cat classification. The proposed method employs only two individuals for crossover, represented by two parameter sets: Leader and Follower. The Leader focuses on exploitation, representing the primary optimal solution, while the Follower promotes… ▽ More

    Submitted 10 June, 2025; v1 submitted 24 April, 2025; originally announced April 2025.

  10. arXiv:2504.17311  [pdf, other]

    cs.CL cs.AI

    FLUKE: A Linguistically-Driven and Task-Agnostic Framework for Robustness Evaluation

    Authors: Yulia Otmakhova, Hung Thinh Truong, Rahmad Mahendra, Zenan Zhai, Rongxin Zhu, Daniel Beck, Jey Han Lau

    Abstract: We present FLUKE (Framework for LingUistically-driven and tasK-agnostic robustness Evaluation), a task-agnostic framework for assessing model robustness through systematic minimal variations of test data. FLUKE introduces controlled variations across linguistic levels - from orthography to dialect and style varieties - and leverages large language models (LLMs) with human validation to generate mo… ▽ More

    Submitted 24 April, 2025; originally announced April 2025.

  11. arXiv:2504.12902  [pdf, other]

    cs.SI cs.CY

    The Rise of Bluesky

    Authors: Ozgur Can Seckin, Filipi Nascimento Silva, Bao Tran Truong, Sangyeon Kim, Fan Huang, Nick Liu, Alessandro Flammini, Filippo Menczer

    Abstract: This study investigates the rapid growth and evolving network structure of Bluesky from August 2023 to February 2025. Through multiple waves of user migrations, the platform has reached a stable, persistently active user base. The growth process has given rise to a dense follower network with clustering and hub features that favor viral information diffusion. These developments highlight engagemen… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 4 pages, 1 figure

  12. arXiv:2503.16841  [pdf, other]

    cs.LG cs.HC q-bio.BM

    Preferential Multi-Objective Bayesian Optimization for Drug Discovery

    Authors: Tai Dang, Long-Hung Pham, Sang T. Truong, Ari Glenn, Wendy Nguyen, Edward A. Pham, Jeffrey S. Glenn, Sanmi Koyejo, Thang Luong

    Abstract: Despite decades of advancements in automated ligand screening, large-scale drug discovery remains resource-intensive and requires post-processing hit selection, a step where chemists manually select a few promising molecules based on their chemical intuition. This creates a major bottleneck in the virtual screening process for drug discovery, demanding experts to repeatedly balance complex trade-o… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

  13. arXiv:2503.12828  [pdf, other]

    cs.CE cs.CV

    AUTV: Creating Underwater Video Datasets with Pixel-wise Annotations

    Authors: Quang Trung Truong, Wong Yuk Kwan, Duc Thanh Nguyen, Binh-Son Hua, Sai-Kit Yeung

    Abstract: Underwater video analysis, hampered by the dynamic marine environment and camera motion, remains a challenging task in computer vision. Existing training-free video generation techniques, learning motion dynamics on the frame-by-frame basis, often produce poor results with noticeable motion interruptions and misalignments. To address these issues, we propose AUTV, a framework for synthesizing marin… ▽ More

    Submitted 17 March, 2025; originally announced March 2025.

    Comments: under review

  14. arXiv:2503.11801  [pdf, ps, other]

    cs.GR cs.LG cs.RO

    Diffuse-CLoC: Guided Diffusion for Physics-based Character Look-ahead Control

    Authors: Xiaoyu Huang, Takara Truong, Yunbo Zhang, Fangzhou Yu, Jean Pierre Sleiman, Jessica Hodgins, Koushil Sreenath, Farbod Farshidian

    Abstract: We present Diffuse-CLoC, a guided diffusion framework for physics-based look-ahead control that enables intuitive, steerable, and physically realistic motion generation. While existing kinematics motion generation with diffusion models offer intuitive steering capabilities with inference-time conditioning, they often fail to produce physically viable motions. In contrast, recent diffusion-based co… ▽ More

    Submitted 1 July, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  15. arXiv:2503.05112  [pdf, other]

    cs.RO

    THE-SEAN: A Heart Rate Variation-Inspired Temporally High-Order Event-Based Visual Odometry with Self-Supervised Spiking Event Accumulation Networks

    Authors: Chaoran Xiong, Litao Wei, Kehui Ma, Zhen Sun, Yan Xiang, Zihan Nan, Trieu-Kien Truong, Ling Pei

    Abstract: Event-based visual odometry has recently gained attention for its high accuracy and real-time performance in fast-motion systems. Unlike traditional synchronous estimators that rely on constant-frequency (zero-order) triggers, event-based visual odometry can actively accumulate information to generate temporally high-order estimation triggers. However, existing methods primarily focus on adaptive… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

  16. arXiv:2503.04242  [pdf, other]

    cs.LG

    Incorporating Surrogate Gradient Norm to Improve Offline Optimization Techniques

    Authors: Manh Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong, Trong Nghia Hoang

    Abstract: Offline optimization has recently emerged as an increasingly popular approach to mitigate the prohibitively expensive cost of online experimentation. The key idea is to learn a surrogate of the black-box function that underlines the target experiment using a static (offline) dataset of its previous input-output queries. Such an approach is, however, fraught with an out-of-distribution issue where… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Journal ref: The Thirty-eighth Annual Conference on Neural Information Processing Systems, 2024

  17. arXiv:2503.04181  [pdf, other]

    cs.LG

    Boosting Offline Optimizers with Surrogate Sensitivity

    Authors: Manh Cuong Dao, Phi Le Nguyen, Thao Nguyen Truong, Trong Nghia Hoang

    Abstract: Offline optimization is an important task in numerous material engineering domains where online experimentation to collect data is too expensive and needs to be replaced by an in silico maximization of a surrogate of the black-box function. Although such a surrogate can be learned from offline data, its prediction might not be reliable outside the offline data regime, which happens when the surrog… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Journal ref: Proceedings of the 41st International Conference on Machine Learning, PMLR 235:10072-10090, 2024

  18. arXiv:2502.20663  [pdf, other]

    cs.CL

    Prediction of Item Difficulty for Reading Comprehension Items by Creation of Annotated Item Repository

    Authors: Radhika Kapoor, Sang T. Truong, Nick Haber, Maria Araceli Ruiz-Primo, Benjamin W. Domingue

    Abstract: Prediction of item difficulty based on its text content is of substantial interest. In this paper, we focus on the related problem of recovering IRT-based difficulty when the data originally reported item p-value (percent correct responses). We model this item difficulty using a repository of reading passages and student data from US standardized tests from New York and Texas for grades 3-8 spanni… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  19. arXiv:2502.19047  [pdf, other]

    cs.CV

    A Dual-Purpose Framework for Backdoor Defense and Backdoor Amplification in Diffusion Models

    Authors: Vu Tuan Truong, Long Bao Le

    Abstract: Diffusion models have emerged as state-of-the-art generative frameworks, excelling in producing high-quality multi-modal samples. However, recent studies have revealed their vulnerability to backdoor attacks, where backdoored models generate specific, undesirable outputs called backdoor target (e.g., harmful images) when a pre-defined trigger is embedded to their inputs. In this paper, we propose… ▽ More

    Submitted 2 March, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

  20. arXiv:2502.09906  [pdf, other]

    cs.CV

    Insect-Foundation: A Foundation Model and Large Multimodal Dataset for Vision-Language Insect Understanding

    Authors: Thanh-Dat Truong, Hoang-Quan Nguyen, Xuan-Bac Nguyen, Ashley Dowling, Xin Li, Khoa Luu

    Abstract: Multimodal conversational generative AI has shown impressive capabilities in various vision and language understanding through learning massive text-image data. However, current conversational models still lack knowledge about visual insects since they are often trained on the general knowledge of vision-language data. Meanwhile, understanding insects is a fundamental problem in precision agricult… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  21. arXiv:2502.08841  [pdf, other]

    cs.SI cs.CY

    Delayed takedown of illegal content on social media makes moderation ineffective

    Authors: Bao Tran Truong, Sangyeon Kim, Gianluca Nogara, Enrico Verdolotti, Erfan Samieyan Sahneh, Florian Saurwein, Natascha Just, Luca Luceri, Silvia Giordano, Filippo Menczer

    Abstract: Social media platforms face legal and regulatory demands to swiftly remove illegal content, sometimes under strict takedown deadlines. However, the effects of moderation speed and the impact of takedown deadlines remain underexplored. This study models the relationship between the timeliness of illegal content removal and its prevalence, reach, and exposure on social media. By simulating illegal c… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

  22. arXiv:2502.03044  [pdf, ps, other]

    cs.LG

    RepLoRA: Reparameterizing Low-Rank Adaptation via the Perspective of Mixture of Experts

    Authors: Tuan Truong, Chau Nguyen, Huy Nguyen, Minh Le, Trung Le, Nhat Ho

    Abstract: Low-rank Adaptation (LoRA) has emerged as a powerful method for fine-tuning large-scale foundation models. Despite its popularity, the theoretical understanding of LoRA has remained limited. This paper presents a theoretical analysis of LoRA by examining its connection to the Mixture of Experts models. Under this framework, we show that simple reparameterizations of the LoRA matrices can notably a… ▽ More

    Submitted 8 June, 2025; v1 submitted 5 February, 2025; originally announced February 2025.

    Comments: ICML 2025 (Poster)

  23. arXiv:2501.12666  [pdf, other]

    cs.LG cs.CV

    Explicit Eigenvalue Regularization Improves Sharpness-Aware Minimization

    Authors: Haocheng Luo, Tuan Truong, Tung Pham, Mehrtash Harandi, Dinh Phung, Trung Le

    Abstract: Sharpness-Aware Minimization (SAM) has attracted significant attention for its effectiveness in improving generalization across various tasks. However, its underlying principles remain poorly understood. In this work, we analyze SAM's training dynamics using the maximum eigenvalue of the Hessian as a measure of sharpness, and propose a third-order stochastic differential equation (SDE), which reve… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

  24. arXiv:2501.09552  [pdf, ps, other]

    cs.CV

    Exploring AI-based System Design for Pixel-level Protected Health Information Detection in Medical Images

    Authors: Tuan Truong, Ivo M. Baltruschat, Mark Klemens, Grit Werner, Matthias Lenga

    Abstract: De-identification of medical images is a critical step to ensure privacy during data sharing in research and clinical settings. The initial step in this process involves detecting Protected Health Information (PHI), which can be found in image metadata or imprinted within image pixels. Despite the importance of such systems, there has been limited evaluation of existing AI-based solutions, creatin… ▽ More

    Submitted 24 June, 2025; v1 submitted 16 January, 2025; originally announced January 2025.

    Comments: In progress

  25. arXiv:2412.09439  [pdf, other]

    cs.CV

    Towards Robust and Fair Vision Learning in Open-World Environments

    Authors: Thanh-Dat Truong

    Abstract: The dissertation presents four key contributions toward fairness and robustness in vision learning. First, to address the problem of large-scale data requirements, the dissertation presents a novel Fairness Domain Adaptation approach derived from two major novel research findings of Bijective Maximum Likelihood and Fairness Adaptation Learning. Second, to enable the capability of open-world modeli… ▽ More

    Submitted 12 December, 2024; originally announced December 2024.

    Comments: PhD Dissertation

  26. arXiv:2410.21932  [pdf, other]

    eess.IV cs.CV

    CT to PET Translation: A Large-scale Dataset and Domain-Knowledge-Guided Diffusion Approach

    Authors: Dac Thai Nguyen, Trung Thanh Nguyen, Huu Tien Nguyen, Thanh Trung Nguyen, Huy Hieu Pham, Thanh Hung Nguyen, Thao Nguyen Truong, Phi Le Nguyen

    Abstract: Positron Emission Tomography (PET) and Computed Tomography (CT) are essential for diagnosing, staging, and monitoring various diseases, particularly cancer. Despite their importance, the use of PET/CT systems is limited by the necessity for radioactive materials, the scarcity of PET scanners, and the high cost associated with PET imaging. In contrast, CT scanners are more widely available and sign… ▽ More

    Submitted 29 October, 2024; originally announced October 2024.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025

  27. arXiv:2410.09803  [pdf, other]

    cs.RO

    Socially Aware Motion Planning for Service Robots Using LiDAR and RGB-D Camera

    Authors: Duc Phu Nguyen, Thanh Long Nguyen, Minh Dang Tu, Cong Hoang Quach, Xuan Tung Truong, Manh Duong Phung

    Abstract: Service robots that work alongside humans in a shared environment need a navigation system that takes into account not only physical safety but also social norms for mutual cooperation. In this paper, we introduce a motion planning system that includes human states such as positions and velocities and their personal space for social-aware navigation. The system first extracts human positions from… ▽ More

    Submitted 13 October, 2024; originally announced October 2024.

    Comments: In Proceedings of 2024, the 7th International Conference on Control, Robotics and Informatics (ICCRI 2024)

  28. arXiv:2410.08229  [pdf, other]

    cs.CV cs.NE eess.IV

    Improvement of Spiking Neural Network with Bit Planes and Color Models

    Authors: Nhan T. Luu, Duong T. Luu, Nam N. Pham, Thang C. Truong

    Abstract: Spiking neural network (SNN) has emerged as a promising paradigm in computational neuroscience and artificial intelligence, offering advantages such as low energy consumption and small memory footprint. However, their practical adoption is constrained by several challenges, prominently among them being performance optimization. In this study, we present a novel approach to enhance the performance… ▽ More

    Submitted 8 November, 2024; v1 submitted 28 September, 2024; originally announced October 2024.

  29. arXiv:2410.04327  [pdf, other]

    cs.LG

    Leveraging Hierarchical Taxonomies in Prompt-based Continual Learning

    Authors: Quyen Tran, Hoang Phan, Minh Le, Tuan Truong, Dinh Phung, Linh Ngo, Thien Nguyen, Nhat Ho, Trung Le

    Abstract: Humans perceive the world as a series of sequential events, which can be hierarchically organized with different levels of abstraction based on conceptual knowledge. Drawing inspiration from human learning behaviors, this work proposes a novel approach to mitigate catastrophic forgetting in Prompt-based Continual Learning models by exploiting the relationships between continuously emerging class d… ▽ More

    Submitted 8 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

  30. arXiv:2410.04196  [pdf, ps, other]

    cs.LG stat.ML

    Improving Generalization with Flat Hilbert Bayesian Inference

    Authors: Tuan Truong, Quyen Tran, Quan Pham-Ngoc, Nhat Ho, Dinh Phung, Trung Le

    Abstract: We introduce Flat Hilbert Bayesian Inference (FHBI), an algorithm designed to enhance generalization in Bayesian inference. Our approach involves an iterative two-step procedure with an adversarial functional perturbation step and a functional descent step within a reproducing kernel Hilbert space. This methodology is supported by a theoretical analysis that extends previous findings on generaliza… ▽ More

    Submitted 8 June, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted (ICML 2025)

  31. arXiv:2409.14435  [pdf, other]

    cs.RO

    Adaptive Compensation for Robotic Joint Failures Using Partially Observable Reinforcement Learning

    Authors: Tan-Hanh Pham, Godwyll Aikins, Tri Truong, Kim-Doang Nguyen

    Abstract: Robotic manipulators are widely used in various industries for complex and repetitive tasks. However, they remain vulnerable to unexpected hardware failures. In this study, we address the challenge of enabling a robotic manipulator to complete tasks despite joint malfunctions. Specifically, we develop a reinforcement learning (RL) framework to adaptively compensate for a non-functional joint durin… ▽ More

    Submitted 22 September, 2024; originally announced September 2024.

    Comments: 15 pages

  32. arXiv:2409.13945  [pdf, other]

    cs.AI

    PureDiffusion: Using Backdoor to Counter Backdoor in Generative Diffusion Models

    Authors: Vu Tuan Truong, Long Bao Le

    Abstract: Diffusion models (DMs) are advanced deep learning models that achieved state-of-the-art capability on a wide range of generative tasks. However, recent studies have shown their vulnerability regarding backdoor attacks, in which backdoored DMs consistently generate a designated result (e.g., a harmful image) called backdoor target when the models' input contains a backdoor trigger. Although various… ▽ More

    Submitted 20 September, 2024; originally announced September 2024.

  33. Physics-Guided Reinforcement Learning System for Realistic Vehicle Active Suspension Control

    Authors: Anh N. Nhu, Ngoc-Anh Le, Shihang Li, Thang D. V. Truong

    Abstract: The suspension system is a crucial part of the automotive chassis, improving vehicle ride comfort and isolating passengers from rough road excitation. Unlike passive suspension, which has constant spring and damping coefficients, active suspension incorporates electronic actuators into the system to dynamically control stiffness and damping variables. However, effectively controlling the suspensio… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works

    Journal ref: 2023 International Conference on Machine Learning and Applications (ICMLA), pp. 422-429

  34. Five Pitfalls When Assessing Synthetic Medical Images with Reference Metrics

    Authors: Melanie Dohmen, Tuan Truong, Ivo M. Baltruschat, Matthias Lenga

    Abstract: Reference metrics have been developed to objectively and quantitatively compare two images. Especially for evaluating the quality of reconstructed or compressed images, these metrics have shown very useful. Extensive tests of such metrics on benchmarks of artificially distorted natural images have revealed which metric best correlate with human perception of quality. Direct transfer of these metri… ▽ More

    Submitted 24 October, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures, presented at Deep Generative Models workshop @ MICCAI 2024

    Journal ref: In: Mukhopadhyay, A., Oksuz, I., Engelhardt, S., Mehrof, D., Yuan, Y. (eds) Deep Generative Models. DGM4MICCAI 2024. Lecture Notes in Computer Science, vol 15224. Springer, Cham

  35. arXiv:2408.03400  [pdf, other]

    cs.CR cs.AI cs.LG

    Attacks and Defenses for Generative Diffusion Models: A Comprehensive Survey

    Authors: Vu Tuan Truong, Luan Ba Dang, Long Bao Le

    Abstract: Diffusion models (DMs) have achieved state-of-the-art performance on various generative tasks such as image synthesis, text-to-image, and text-guided image-to-image generation. However, the more powerful the DMs, the more harmful they potentially are. Recent studies have shown that DMs are prone to a wide range of attacks, including adversarial attacks, membership inference, backdoor injection, an… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

  36. arXiv:2408.01452  [pdf, other]

    cs.CY cs.AI cs.LG

    Building a Domain-specific Guardrail Model in Production

    Authors: Mohammad Niknazar, Paul V Haley, Latha Ramanan, Sang T. Truong, Yedendra Shrinivasan, Ayan Kumar Bhowmick, Prasenjit Dey, Ashish Jagmohan, Hema Maheshwari, Shom Ponoth, Robert Smith, Aditya Vempaty, Nick Haber, Sanmi Koyejo, Sharad Sundararajan

    Abstract: Generative AI holds the promise of enabling a range of sought-after capabilities and revolutionizing workflows in various consumer and enterprise verticals. However, putting a model in production involves much more than just generating an output. It involves ensuring the model is reliable, safe, performant and also adheres to the policy of operation in a particular domain. Guardrails as a necessit… ▽ More

    Submitted 24 July, 2024; originally announced August 2024.

  37. arXiv:2407.04631  [pdf, other]

    cond-mat.mtrl-sci cs.LG

    An autoencoder for compressing angle-resolved photoemission spectroscopy data

    Authors: Steinn Ymir Agustsson, Mohammad Ahsanul Haque, Thi Tam Truong, Marco Bianchi, Nikita Klyuchnikov, Davide Mottin, Panagiotis Karras, Philip Hofmann

    Abstract: Angle-resolved photoemission spectroscopy (ARPES) is a powerful experimental technique to determine the electronic structure of solids. Advances in light sources for ARPES experiments are currently leading to a vast increase of data acquisition rates and data quantity. On the other hand, access time to the most advanced ARPES instruments remains strictly limited, calling for fast, effective, and o… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

    Journal ref: Machine Learning: Science and Technology 6, 015019 (2025)

  38. arXiv:2407.01734  [pdf, other]

    quant-ph cs.AI

    Universal Quantum Tomography With Deep Neural Networks

    Authors: Nhan T. Luu, Thang C. Truong, Duong T. Luu

    Abstract: Quantum state tomography is a crucial technique for characterizing the state of a quantum system, which is essential for many applications in quantum technologies. In recent years, there has been growing interest in leveraging neural networks to enhance the efficiency and accuracy of quantum state tomography. Still, many of them did not include mixed quantum state, since pure states are arguably l… ▽ More

    Submitted 8 September, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: 10 pages, 5 figures, 17 illustration, 1 table

  39. arXiv:2406.18602  [pdf]

    stat.AP cs.LG stat.CO

    Multi-level Phenotypic Models of Cardiovascular Disease and Obstructive Sleep Apnea Comorbidities: A Longitudinal Wisconsin Sleep Cohort Study

    Authors: Duy Nguyen, Ca Hoang, Phat K. Huynh, Tien Truong, Dang Nguyen, Abhay Sharma, Trung Q. Le

    Abstract: Cardiovascular diseases (CVDs) are notably prevalent among patients with obstructive sleep apnea (OSA), posing unique challenges in predicting CVD progression due to the intricate interactions of comorbidities. Traditional models typically lack the necessary dynamic and longitudinal scope to accurately forecast CVD trajectories in OSA patients. This study introduces a novel multi-level phenotypic… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 30 pages, 5 figure, 5 tables

  40. arXiv:2406.07107  [pdf, other]

    cs.LG

    Agnostic Sharpness-Aware Minimization

    Authors: Van-Anh Nguyen, Quyen Tran, Tuan Truong, Thanh-Toan Do, Dinh Phung, Trung Le

    Abstract: Sharpness-aware minimization (SAM) has been instrumental in improving deep neural network training by minimizing both the training loss and the sharpness of the loss landscape, leading the model into flatter minima that are associated with better generalization properties. In another aspect, Model-Agnostic Meta-Learning (MAML) is a framework designed to improve the adaptability of models. MAML opt… ▽ More

    Submitted 2 October, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Under review

  41. arXiv:2406.01432  [pdf, other]

    cs.CV

    ED-SAM: An Efficient Diffusion Sampling Approach to Domain Generalization in Vision-Language Foundation Models

    Authors: Thanh-Dat Truong, Xin Li, Bhiksha Raj, Jackson Cothren, Khoa Luu

    Abstract: The Vision-Language Foundation Model has recently shown outstanding performance in various perception learning tasks. The outstanding performance of the vision-language model mainly relies on large-scale pre-training datasets and different data augmentation techniques. However, the domain generalization problem of the vision-language foundation model needs to be addressed. This problem has limited… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  42. arXiv:2406.01429  [pdf, other]

    cs.CV

    EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding

    Authors: Thanh-Dat Truong, Utsav Prabhu, Dongyi Wang, Bhiksha Raj, Susan Gauch, Jeyamkondan Subbiah, Khoa Luu

    Abstract: Unsupervised Domain Adaptation has been an efficient approach to transferring the semantic segmentation model across data distributions. Meanwhile, the recent Open-vocabulary Semantic Scene understanding based on large-scale vision language models is effective in open-set settings because it can learn diverse concepts and categories. However, these prior methods fail to generalize across different…

    Submitted 11 October, 2024; v1 submitted 3 June, 2024; originally announced June 2024.

    Comments: Accepted to NeurIPS'24

  43. PDP: Physics-Based Character Animation via Diffusion Policy

    Authors: Takara E. Truong, Michael Piseno, Zhaoming Xie, C. Karen Liu

    Abstract: Generating diverse and realistic human motion that can physically interact with an environment remains a challenging research area in character animation. Meanwhile, diffusion-based methods, as proposed by the robotics community, have demonstrated the ability to capture highly diverse and multi-modal skills. However, naively training a diffusion policy often results in unstable motions for high-fr…

    Submitted 4 December, 2024; v1 submitted 2 June, 2024; originally announced June 2024.

    Journal ref: In SIGGRAPH Asia 2024 Conference Papers (Article No. 86, 10 pages)

  44. arXiv:2405.09334  [pdf, other]

    cs.CV cs.AI cs.IR

    Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study

    Authors: Farnaz Khun Jush, Steffen Vogler, Tuan Truong, Matthias Lenga

    Abstract: While content-based image retrieval (CBIR) has been extensively studied in natural image retrieval, its application to medical images presents ongoing challenges, primarily due to the 3D nature of medical images. Recent studies have shown the potential use of pre-trained vision embeddings for CBIR in the context of radiology image retrieval. However, a benchmark for the retrieval of 3D volumetric…

    Submitted 4 July, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: 34 pages, 12 Figures, 22 Tables

  45. Similarity and Quality Metrics for MR Image-To-Image Translation

    Authors: Melanie Dohmen, Mark A. Klemens, Ivo M. Baltruschat, Tuan Truong, Matthias Lenga

    Abstract: Image-to-image translation can create large impact in medical imaging, as images can be synthetically transformed to other modalities, sequence types, higher resolutions or lower noise levels. To ensure patient safety, these methods should be validated by human readers, which requires a considerable amount of time and costs. Quantitative metrics can effectively complement such studies and provide…

    Submitted 12 February, 2025; v1 submitted 14 May, 2024; originally announced May 2024.

    Comments: 44 pages (main: 22 pages, 3 figures, supplement: 22 pages, 15 figures)

    Journal ref: Sci Rep 15, 3853 (2025)

  46. arXiv:2405.01337  [pdf, other]

    cs.CV

    Multi-view Action Recognition via Directed Gromov-Wasserstein Discrepancy

    Authors: Hoang-Quan Nguyen, Thanh-Dat Truong, Khoa Luu

    Abstract: Action recognition has become one of the popular research topics in computer vision. There are various methods based on Convolutional Networks and self-attention mechanisms as Transformers to solve both spatial and temporal dimensions problems of action recognition tasks that achieve competitive performances. However, these methods lack a guarantee of the correctness of the action subject that the…

    Submitted 2 May, 2024; originally announced May 2024.

  47. arXiv:2405.00681  [pdf, other]

    eess.SP cs.IT cs.NI eess.SY

    Delay and Overhead Efficient Transmission Scheduling for Federated Learning in UAV Swarms

    Authors: Duc N. M. Hoang, Vu Tuan Truong, Hung Duy Le, Long Bao Le

    Abstract: This paper studies the wireless scheduling design to coordinate the transmissions of (local) model parameters of federated learning (FL) for a swarm of unmanned aerial vehicles (UAVs). The overall goal of the proposed design is to realize the FL training and aggregation processes with a central aggregator exploiting the sensory data collected by the UAVs but it considers the multi-hop wireless net…

    Submitted 22 February, 2024; originally announced May 2024.

    Comments: accepted to WCNC'24

  48. arXiv:2404.02421  [pdf, other]

    cs.CL

    Revisiting subword tokenization: A case study on affixal negation in large language models

    Authors: Thinh Hung Truong, Yulia Otmakhova, Karin Verspoor, Trevor Cohn, Timothy Baldwin

    Abstract: In this work, we measure the impact of affixal negation on modern English large language models (LLMs). In affixal negation, the negated meaning is expressed through a negative morpheme, which is potentially challenging for LLMs as their tokenizers are often not morphologically plausible. We conduct extensive experiments using LLMs with different subword tokenization methods, which lead to several…

    Submitted 4 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: NAACL 2024

  49. arXiv:2403.02715  [pdf, other]

    cs.CL cs.AI

    Crossing Linguistic Horizons: Finetuning and Comprehensive Evaluation of Vietnamese Large Language Models

    Authors: Sang T. Truong, Duc Q. Nguyen, Toan Nguyen, Dong D. Le, Nhi N. Truong, Tho Quan, Sanmi Koyejo

    Abstract: Recent advancements in large language models (LLMs) have underscored their importance in the evolution of artificial intelligence. However, despite extensive pretraining on multilingual datasets, available open-sourced LLMs exhibit limited effectiveness in processing Vietnamese. The challenge is exacerbated by the absence of systematic benchmark datasets and metrics tailored for Vietnamese LLM eva…

    Submitted 26 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 51 pages

    MSC Class: 68T50

  50. arXiv:2401.06692  [pdf, other]

    cs.CL cs.AI cs.LG

    An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models

    Authors: Gantavya Bhatt, Yifang Chen, Arnav M. Das, Jifan Zhang, Sang T. Truong, Stephen Mussmann, Yinglun Zhu, Jeffrey Bilmes, Simon S. Du, Kevin Jamieson, Jordan T. Ash, Robert D. Nowak

    Abstract: Supervised finetuning (SFT) on instruction datasets has played a crucial role in achieving the remarkable zero-shot generalization capabilities observed in modern large language models (LLMs). However, the annotation efforts required to produce high quality responses for instructions are becoming prohibitively expensive, especially as the number of tasks spanned by instruction datasets continues t…

    Submitted 7 July, 2024; v1 submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted to Findings of the Association for Computational Linguistics: ACL 2024