Skip to main content

Showing 1–50 of 68 results for author: Luu, A T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.16073  [pdf, other

    cs.CL

    Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

    Authors: Zhiyuan Hu, Shiyun Xiong, Yifan Zhang, See-Kiong Ng, Anh Tuan Luu, Bo An, Shuicheng Yan, Bryan Hooi

    Abstract: Recent advancements in visual language models (VLMs) have notably enhanced their capabilities in handling complex Graphical User Interface (GUI) interaction tasks. Despite these improvements, current frameworks often struggle to generate correct actions in challenging GUI environments. State-of-the-art commercial VLMs are black-boxes, and fine-tuning open-source VLMs for GUI tasks requires signifi… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  2. arXiv:2504.13054  [pdf, other

    cs.CL cs.AI

    Aspect-Based Summarization with Self-Aspect Retrieval Enhanced Generation

    Authors: Yichao Feng, Shuai Zhao, Yueqiu Li, Luwei Xiao, Xiaobao Wu, Anh Tuan Luu

    Abstract: Aspect-based summarization aims to generate summaries tailored to specific aspects, addressing the resource constraints and limited generalizability of traditional summarization approaches. Recently, large language models have shown promise in this task without the need for training. However, they rely excessively on prompt engineering and face token limits and hallucination challenges, especially… ▽ More

    Submitted 17 April, 2025; originally announced April 2025.

  3. arXiv:2503.01295  [pdf, other

    cs.SE

    CodeArena: A Collective Evaluation Platform for LLM Code Generation

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Xiaobao Wu, Dong Huang, Terry Yue Zhuo, Qian Liu, See-Kiong Ng

    Abstract: Large Language Models (LLMs) have reshaped code generation by synergizing their exceptional comprehension of natural language and programming syntax, thereby substantially boosting developer productivity. These advancements have prompted numerous efforts to quantitatively evaluate their coding capabilities. However, persistent challenges, such as benchmark leakage, data dissipation, and limited sy… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

  4. arXiv:2502.20238  [pdf, other

    cs.CL

    FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving

    Authors: Guizhen Chen, Weiwen Xu, Hao Zhang, Hou Pong Chan, Chaoqun Liu, Lidong Bing, Deli Zhao, Anh Tuan Luu, Yu Rong

    Abstract: Many challenging reasoning tasks require not just rapid, intuitive responses, but a more deliberate, multi-step approach. Recent progress in large language models (LLMs) highlights an important shift from the "System 1" way of quick reactions to the "System 2" style of reflection-and-correction problem solving. However, current benchmarks heavily rely on the final-answer accuracy, leaving much of… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

  5. arXiv:2502.14356  [pdf, other

    cs.CL

    Full-Step-DPO: Self-Supervised Preference Optimization with Step-wise Rewards for Mathematical Reasoning

    Authors: Huimin Xu, Xin Mao, Feng-Lin Li, Xiaobao Wu, Wang Chen, Wei Zhang, Anh Tuan Luu

    Abstract: Direct Preference Optimization (DPO) often struggles with long-chain mathematical reasoning. Existing approaches, such as Step-DPO, typically improve this by focusing on the first erroneous step in the reasoning chain. However, they overlook all other steps and rely heavily on humans or GPT-4 to identify erroneous steps. To address these issues, we propose Full-Step-DPO, a novel DPO framework tail… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

  6. arXiv:2502.12591  [pdf, other

    cs.CV cs.CL

    CutPaste&Find: Efficient Multimodal Hallucination Detector with Visual-aid Knowledge Base

    Authors: Cong-Duy Nguyen, Xiaobao Wu, Duc Anh Vu, Shuai Zhao, Thong Nguyen, Anh Tuan Luu

    Abstract: Large Vision-Language Models (LVLMs) have demonstrated impressive multimodal reasoning capabilities, but they remain susceptible to hallucination, particularly object hallucination where non-existent objects or incorrect attributes are fabricated in generated descriptions. Existing detection methods achieve strong performance but rely heavily on expensive API calls and iterative LVLM-based validat… ▽ More

    Submitted 18 February, 2025; originally announced February 2025.

  7. arXiv:2502.06298  [pdf, other

    cs.CL cs.AI

    SeaExam and SeaBench: Benchmarking LLMs with Local Multilingual Questions in Southeast Asia

    Authors: Chaoqun Liu, Wenxuan Zhang, Jiahao Ying, Mahani Aljunied, Anh Tuan Luu, Lidong Bing

    Abstract: This study introduces two novel benchmarks, SeaExam and SeaBench, designed to evaluate the capabilities of Large Language Models (LLMs) in Southeast Asian (SEA) application scenarios. Unlike existing multilingual datasets primarily derived from English translations, these benchmarks are constructed based on real-world scenarios from SEA regions. SeaExam draws from regional educational exams to for… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted to Findings of NAACL 2025

  8. arXiv:2501.14166  [pdf, other

    cs.CV cs.AI

    Enhancing Multimodal Entity Linking with Jaccard Distance-based Conditional Contrastive Learning and Contextual Visual Augmentation

    Authors: Cong-Duy Nguyen, Xiaobao Wu, Thong Nguyen, Shuai Zhao, Khoi Le, Viet-Anh Nguyen, Feng Yichao, Anh Tuan Luu

    Abstract: Previous research on multimodal entity linking (MEL) has primarily employed contrastive learning as the primary objective. However, using the rest of the batch as negative samples without careful consideration, these studies risk leveraging easy features and potentially overlook essential details that make entities unique. In this work, we propose JD-CCL (Jaccard Distance-based Conditional Contras… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  9. arXiv:2412.13670  [pdf, other

    cs.CL cs.LG

    AntiLeak-Bench: Preventing Data Contamination by Automatically Constructing Benchmarks with Updated Real-World Knowledge

    Authors: Xiaobao Wu, Liangming Pan, Yuxi Xie, Ruiwen Zhou, Shuai Zhao, Yubo Ma, Mingzhe Du, Rui Mao, Anh Tuan Luu, William Yang Wang

    Abstract: Data contamination hinders fair LLM evaluation by introducing test data into newer models' training sets. Existing studies solve this challenge by updating benchmarks with newly collected data. However, they fail to guarantee contamination-free evaluation as the newly collected data may contain pre-existing knowledge, and their benchmark updates rely on intensive human labor. To address these issu… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

  10. arXiv:2412.07160  [pdf, other

    cs.CV

    Motion-aware Contrastive Learning for Temporal Panoptic Scene Graph Generation

    Authors: Thong Thanh Nguyen, Xiaobao Wu, Yi Bin, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: To equip artificial intelligence with a comprehensive understanding towards a temporal world, video and 4D panoptic scene graph generation abstracts visual data into nodes to represent entities and edges to capture temporal relations. Existing methods encode entity masks tracked across temporal dimensions (mask tubes), then predict their relations with temporal pooling operation, which does not fu… ▽ More

    Submitted 18 December, 2024; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: Accepted at AAAI 2025

  11. arXiv:2412.07157  [pdf, other

    cs.CV

    Multi-Scale Contrastive Learning for Video Temporal Grounding

    Authors: Thong Thanh Nguyen, Yi Bin, Xiaobao Wu, Zhiyuan Hu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Temporal grounding, which localizes video moments related to a natural language query, is a core problem of vision-language learning and video understanding. To encode video moments of varying lengths, recent methods employ a multi-level structure known as a feature pyramid. In this structure, lower levels concentrate on short-range video moments, while higher levels address long-range moments. Be… ▽ More

    Submitted 18 December, 2024; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: Accepted at AAAI 2025

  12. arXiv:2411.18126  [pdf, other

    cs.CL

    Curriculum Demonstration Selection for In-Context Learning

    Authors: Duc Anh Vu, Nguyen Tran Cong Duy, Xiaobao Wu, Hoang Minh Nhat, Du Mingzhe, Nguyen Thanh Thong, Anh Tuan Luu

    Abstract: Large Language Models (LLMs) have shown strong in-context learning (ICL) abilities with a few demonstrations. However, one critical challenge is how to select demonstrations to elicit the full potential of LLMs. In this paper, we propose Curriculum Demonstration Selection (CDS), a novel demonstration selection method for ICL. Instead of merely using similarity, CDS additionally partitions samples… ▽ More

    Submitted 15 December, 2024; v1 submitted 27 November, 2024; originally announced November 2024.

    Comments: Accepted at the 40th ACM/SIGAPP Symposium On Applied Computing (SAC 2025), Main Conference

  13. arXiv:2411.00492  [pdf, other

    cs.CL

    Multi-expert Prompting Improves Reliability, Safety, and Usefulness of Large Language Models

    Authors: Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen

    Abstract: We present Multi-expert Prompting, a novel enhancement of ExpertPrompting (Xu et al., 2023), designed to improve the large language model (LLM) generation. Specifically, it guides an LLM to fulfill an input instruction by simulating multiple experts, aggregating their responses, and selecting the best among individual and aggregated responses. This process is performed in a single chain of thought… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: EMNLP 2024 Main Conference

  14. arXiv:2410.15737  [pdf, other

    cs.CL cs.AI cs.IR

    Who's Who: Large Language Models Meet Knowledge Conflicts in Practice

    Authors: Quang Hieu Pham, Hoang Ngo, Anh Tuan Luu, Dat Quoc Nguyen

    Abstract: Retrieval-augmented generation (RAG) methods are viable solutions for addressing the static memory limits of pre-trained language models. Nevertheless, encountering conflicting sources of information within the retrieval context is an inevitable practical challenge. In such situations, the language models are recommended to transparently inform users about the conflicts rather than autonomously de… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP 2024 Findings

  15. arXiv:2410.15050  [pdf, other

    cs.CL

    Are LLMs Good Zero-Shot Fallacy Classifiers?

    Authors: Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu

    Abstract: Fallacies are defective arguments with faulty reasoning. Detecting and classifying them is a crucial NLP task to prevent misinformation, manipulative claims, and biased decisions. However, existing fallacy classifiers are limited by the requirement for sufficient labeled data for training, which hinders their out-of-distribution (OOD) generalization abilities. In this paper, we focus on leveraging… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

    Comments: Accepted to EMNLP2024 main conference

  16. arXiv:2410.04834  [pdf, other

    cs.CL

    As Simple as Fine-tuning: LLM Alignment via Bidirectional Negative Feedback Loss

    Authors: Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Wang Chen, Anh Tuan Luu

    Abstract: Direct Preference Optimization (DPO) has emerged as a more computationally efficient alternative to Reinforcement Learning from Human Feedback (RLHF) with Proximal Policy Optimization (PPO), eliminating the need for reward models and online sampling. Despite these benefits, DPO and its variants remain sensitive to hyper-parameters and prone to instability, particularly on mathematical datasets. We… ▽ More

    Submitted 25 October, 2024; v1 submitted 7 October, 2024; originally announced October 2024.

    Comments: 20 pages, 9 figures

  17. arXiv:2409.16682  [pdf, other

    cs.CL

    SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA

    Authors: Siyue Zhang, Anh Tuan Luu, Chen Zhao

    Abstract: Text-to-SQL parsing and end-to-end question answering (E2E TQA) are two main approaches for Table-based Question Answering task. Despite success on multiple benchmarks, they have yet to be compared and their synergy remains unexplored. In this paper, we identify different strengths and weaknesses through evaluating state-of-the-art models on benchmark datasets: Text-to-SQL demonstrates superiority… ▽ More

    Submitted 29 September, 2024; v1 submitted 25 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024

  18. arXiv:2409.12425  [pdf, other

    cs.CL cs.LG

    Zero-to-Strong Generalization: Eliciting Strong Capabilities of Large Language Models Iteratively without Gold Labels

    Authors: Chaoqun Liu, Qin Chao, Wenxuan Zhang, Xiaobao Wu, Boyang Li, Anh Tuan Luu, Lidong Bing

    Abstract: Large Language Models (LLMs) have demonstrated remarkable performance through supervised fine-tuning or in-context learning using gold labels. However, this paradigm is limited by the availability of gold labels, while in certain scenarios, LLMs may need to perform tasks that are too complex for humans to provide such labels. To tackle this challenge, this study explores whether solely utilizing u… ▽ More

    Submitted 18 September, 2024; originally announced September 2024.

    Comments: 15 pages

  19. SoVAR: Building Generalizable Scenarios from Accident Reports for Autonomous Driving Testing

    Authors: An Guo, Yuan Zhou, Haoxiang Tian, Chunrong Fang, Yunjian Sun, Weisong Sun, Xinyu Gao, Anh Tuan Luu, Yang Liu, Zhenyu Chen

    Abstract: Autonomous driving systems (ADSs) have undergone remarkable development and are increasingly employed in safety-critical applications. However, recently reported data on fatal accidents involving ADSs suggests that the desired level of safety has not yet been fully achieved. Consequently, there is a growing need for more comprehensive and targeted testing approaches to ensure safe driving. Scenari… ▽ More

    Submitted 12 September, 2024; originally announced September 2024.

    Journal ref: 39th IEEE/ACM International Conference on Automated Software Engineering (ASE '24), October 27-November 1, 2024, Sacramento, CA, USA

  20. arXiv:2409.00509  [pdf, other

    cs.CL

    LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

    Authors: Zhiyuan Hu, Yuliang Liu, Jinman Zhao, Suyuchen Wang, Yan Wang, Wei Shen, Qing Gu, Anh Tuan Luu, See-Kiong Ng, Zhiwei Jiang, Bryan Hooi

    Abstract: Large language models (LLMs) face significant challenges in handling long-context tasks because of their limited effective context window size during pretraining, which restricts their ability to generalize over extended sequences. Meanwhile, extending the context window in LLMs through post-pretraining is highly resource-intensive. To address this, we introduce LongRecipe, an efficient training s… ▽ More

    Submitted 4 September, 2024; v1 submitted 31 August, 2024; originally announced September 2024.

    Comments: Work in Progress

  21. arXiv:2407.10998  [pdf, other

    cs.CL cs.LG

    Discrete Diffusion Language Model for Efficient Text Summarization

    Authors: Do Huu Dat, Do Duc Anh, Anh Tuan Luu, Wray Buntine

    Abstract: While diffusion models excel at conditional generating high-quality images, prior works in discrete diffusion models were not evaluated on conditional long-text generation. In this work, we address the limitations of prior discrete diffusion models for conditional long-text generation, particularly in long sequence-to-sequence tasks such as abstractive summarization. Despite fast decoding speeds c… ▽ More

    Submitted 10 March, 2025; v1 submitted 25 June, 2024; originally announced July 2024.

  22. arXiv:2405.19723  [pdf, other

    cs.CV cs.AI

    Encoding and Controlling Global Semantics for Long-form Video Question Answering

    Authors: Thong Thanh Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Seeking answers effectively for long videos is essential to build video question answering (videoQA) systems. Previous methods adaptively select frames and regions from long videos to save computations. However, this fails to reason over the whole sequence of video, leading to sub-optimal performance. To address this problem, we introduce a state space layer (SSL) into multi-modal Transformer to e… ▽ More

    Submitted 5 October, 2024; v1 submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to the main EMNLP 2024 conference

  23. arXiv:2405.17978  [pdf, other

    cs.CL cs.AI

    FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model

    Authors: Xiaobao Wu, Thong Nguyen, Delvin Ce Zhang, William Yang Wang, Anh Tuan Luu

    Abstract: Topic models have been evolving rapidly over the years, from conventional to recent neural models. However, existing topic models generally struggle with either effectiveness, efficiency, or stability, highly impeding their practical applications. In this paper, we propose FASTopic, a fast, adaptive, stable, and transferable topic model. FASTopic follows a new paradigm: Dual Semantic-relation Reco… ▽ More

    Submitted 26 October, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to NeurIPS 2024. Code is available at https://github.com/BobXWu/Fastopic

  24. arXiv:2405.17957  [pdf, other

    cs.CL cs.AI

    Modeling Dynamic Topics in Chain-Free Fashion by Evolution-Tracking Contrastive Learning and Unassociated Word Exclusion

    Authors: Xiaobao Wu, Xinshuai Dong, Liangming Pan, Thong Nguyen, Anh Tuan Luu

    Abstract: Dynamic topic models track the evolution of topics in sequential documents, which have derived various applications like trend analysis and opinion mining. However, existing models suffer from repetitive topic and unassociated topic issues, failing to reveal the evolution and hindering further applications. To address these issues, we break the tradition of simply chaining topics in existing work… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted to ACL 2024 Findings

  25. arXiv:2403.17486  [pdf, other

    cs.CL

    KDMCSE: Knowledge Distillation Multimodal Sentence Embeddings with Adaptive Angular margin Contrastive Learning

    Authors: Cong-Duy Nguyen, Thong Nguyen, Xiaobao Wu, Anh Tuan Luu

    Abstract: Previous work on multimodal sentence embedding has proposed multimodal contrastive learning and achieved promising results. However, by taking the rest of the batch as negative samples without reviewing when forming contrastive pairs, those studies encountered many suspicious and noisy negative examples, significantly affecting the methods' overall performance. In this work, we propose KDMCSE (Kno… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2024

  26. arXiv:2403.10258  [pdf, other

    cs.CL

    Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

    Authors: Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

    Abstract: Large language models (LLMs) have demonstrated multilingual capabilities, yet they are mostly English-centric due to the imbalanced training corpora. While prior works have leveraged this bias to enhance multilingual performance through translation, they have been largely limited to natural language processing (NLP) tasks. In this work, we extend the evaluation to real-world user queries and non-E… ▽ More

    Submitted 21 April, 2025; v1 submitted 15 March, 2024; originally announced March 2024.

    Comments: Accepted to NAACL 2025

  27. arXiv:2403.02990  [pdf, other

    cs.CL cs.AI

    Data Augmentation using Large Language Models: Data Perspectives, Learning Paradigms and Challenges

    Authors: Bosheng Ding, Chengwei Qin, Ruochen Zhao, Tianze Luo, Xinze Li, Guizhen Chen, Wenhan Xia, Junjie Hu, Anh Tuan Luu, Shafiq Joty

    Abstract: In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has emerged as a pivotal technique for enhancing model performance by diversifying training examples without the need for additional data collection. This survey explores the transformative impact of LLMs on DA, particularly addressing the unique challenges and opportunities they present in the context of natural… ▽ More

    Submitted 2 July, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

  28. arXiv:2402.18909  [pdf, other

    cs.CL cs.AI

    AKEW: Assessing Knowledge Editing in the Wild

    Authors: Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu

    Abstract: Knowledge editing injects knowledge updates into language models to keep them correct and up-to-date. However, its current evaluations deviate significantly from practice: their knowledge updates solely consist of structured facts derived from meticulously crafted datasets, instead of practical sources -- unstructured texts like news articles, and they often overlook practical real-world knowledge… ▽ More

    Submitted 10 October, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: Accepted to EMNLP 2024 main conference

  29. arXiv:2402.16030  [pdf, other

    cs.CL cs.AI

    Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration

    Authors: Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Anh Tuan Luu

    Abstract: While Reinforcement Learning from Human Feedback (RLHF) significantly enhances the generation quality of Large Language Models (LLMs), recent studies have raised concerns regarding the complexity and instability associated with the Proximal Policy Optimization (PPO) algorithm, proposing a series of order-based calibration methods as viable alternatives. This paper delves further into current order… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 19 pages, Under review

  30. arXiv:2402.07844  [pdf, other

    cs.SE cs.CL

    Mercury: A Code Efficiency Benchmark for Code Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, Qian Liu, See-Kiong Ng

    Abstract: Amidst the recent strides in evaluating Large Language Models for Code (Code LLMs), existing benchmarks have mainly focused on the functional correctness of generated code, neglecting the importance of their computational efficiency. To fill the gap, we present Mercury, the first code efficiency benchmark for Code LLMs. It comprises 1,889 Python tasks, each accompanied by adequate solutions that s… ▽ More

    Submitted 11 June, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

  31. arXiv:2402.07577  [pdf, other

    cs.CL

    Topic Modeling as Multi-Objective Contrastive Optimization

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Cong-Duy T Nguyen, See-Kiong Ng, Anh Tuan Luu

    Abstract: Recent representation learning approaches enhance neural topic models by optimizing the weighted linear combination of the evidence lower bound (ELBO) of the log-likelihood and the contrastive learning objective that contrasts pairs of input documents. However, document-level contrastive learning might capture low-level mutual information, such as word ratio, which disturbs topic modeling. Moreove… ▽ More

    Submitted 9 March, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ICLR 2024 (poster)

  32. arXiv:2402.03271  [pdf, other

    cs.CL cs.AI cs.LG

    Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models

    Authors: Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi

    Abstract: In the face of uncertainty, the ability to *seek information* is of fundamental importance. In many practical applications, such as medical diagnosis and troubleshooting, the information needed to solve the task is not initially given and has to be actively sought by asking follow-up questions (for example, a doctor asking a patient for more details about their symptoms). In this work, we introduc… ▽ More

    Submitted 13 November, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024

  33. A Survey on Neural Topic Models: Methods, Applications, and Challenges

    Authors: Xiaobao Wu, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades to discover latent topics and infer topic proportions of documents in an unsupervised fashion. They have been widely used in various applications like text analysis and context recommendation. Recently, the rise of neural networks has facilitated the emergence of a new research field -- Neural Topic Models (NTMs). Different from conventional topic model… ▽ More

    Submitted 24 June, 2024; v1 submitted 27 January, 2024; originally announced January 2024.

    Comments: Accepted to Artificial Intelligence Review. See https://doi.org/10.1007/s10462-023-10661-7 and a paper list at https://github.com/BobXWu/Paper-Neural-Topic-Models

  34. arXiv:2401.14113  [pdf, other

    cs.CL

    On the Affinity, Rationality, and Diversity of Hierarchical Topic Modeling

    Authors: Xiaobao Wu, Fengjun Pan, Thong Nguyen, Yichao Feng, Chaoqun Liu, Cong-Duy Nguyen, Anh Tuan Luu

    Abstract: Hierarchical topic modeling aims to discover latent topics from a corpus and organize them into a hierarchy to understand documents with desirable semantic granularity. However, existing work struggles with producing topic hierarchies of low affinity, rationality, and diversity, which hampers document understanding. To overcome these challenges, we in this paper propose Transport Plan and Context-… ▽ More

    Submitted 31 January, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: Accepted to AAAI2024 conference. Our code is available at https://github.com/bobxwu/TraCo

  35. LAMPAT: Low-Rank Adaption for Multilingual Paraphrasing Using Adversarial Training

    Authors: Khoi M. Le, Trinh Pham, Tho Quan, Anh Tuan Luu

    Abstract: Paraphrases are texts that convey the same meaning while using different words or sentence structures. It can be used as an automatic data augmentation tool for many Natural Language Processing tasks, especially when dealing with low-resource languages, where data shortage is a significant problem. To generate a paraphrase in multilingual settings, previous studies have leveraged the knowledge fro… ▽ More

    Submitted 23 June, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: First two authors contribute equally. Accepted at AAAI 2024

  36. arXiv:2312.11109  [pdf, other

    cs.LG

    Graph Transformers for Large Graphs

    Authors: Vijay Prakash Dwivedi, Yozen Liu, Anh Tuan Luu, Xavier Bresson, Neil Shah, Tong Zhao

    Abstract: Transformers have recently emerged as powerful neural networks for graph learning, showcasing state-of-the-art performance on several graph property prediction tasks. However, these results have been limited to small-scale graphs, where the computational feasibility of the global attention mechanism is possible. The next goal is to scale up these architectures to handle very large graphs on the sc… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  37. arXiv:2312.01661  [pdf, other

    cs.CL cs.AI

    ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions

    Authors: Phuoc Pham Van Long, Duc Anh Vu, Nhat M. Hoang, Xuan Long Do, Anh Tuan Luu

    Abstract: Mathematical questioning is crucial for assessing students problem-solving skills. Since manually creating such questions requires substantial effort, automatic methods have been explored. Existing state-of-the-art models rely on fine-tuning strategies and struggle to generate questions that heavily involve multiple steps of logical and arithmetic reasoning. Meanwhile, large language models(LLMs)… ▽ More

    Submitted 27 February, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted at the 39th ACM/SIGAPP Symposium On Applied Computing (SAC 2024), Main Conference

  38. arXiv:2311.03970  [pdf, other

    cs.CV

    Bias and Diversity in Synthetic-based Face Recognition

    Authors: Marco Huber, Anh Thi Luu, Fadi Boutros, Arjan Kuijper, Naser Damer

    Abstract: Synthetic data is emerging as a substitute for authentic data to solve ethical and legal challenges in handling authentic face data. The current models can create real-looking face images of people who do not exist. However, it is a known and sensitive problem that face recognition systems are susceptible to bias, i.e. performance differences between different demographic and non-demographics attr… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted for presentation at WACV2024

  39. arXiv:2310.14248  [pdf, other

    cs.CL

    From Static to Dynamic: A Continual Learning Framework for Large Language Models

    Authors: Mingzhe Du, Anh Tuan Luu, Bin Ji, See-kiong Ng

    Abstract: The vast number of parameters in large language models (LLMs) endows them with remarkable capabilities, allowing them to excel in a variety of natural language processing tasks. However, this complexity also presents challenges, making LLMs difficult to train and inhibiting their ability to continuously assimilate new knowledge, which may lead to inaccuracies in their outputs. To mitigate these is… ▽ More

    Submitted 22 October, 2023; originally announced October 2023.

  40. arXiv:2309.08949  [pdf, other

    cs.CL

    Enhancing Large Language Model Induced Task-Oriented Dialogue Systems Through Look-Forward Motivated Goals

    Authors: Zhiyuan Hu, Yue Feng, Yang Deng, Zekun Li, See-Kiong Ng, Anh Tuan Luu, Bryan Hooi

    Abstract: Recently, the development of large language models (LLMs) has been significantly enhanced the question answering and dialogue generation, and makes them become increasingly popular in current practical scenarios. While unlike the general dialogue system which emphasizes the semantic performance, the task-oriented dialogue (ToD) systems aim to achieve the dialogue goal efficiently and successfully… ▽ More

    Submitted 16 September, 2023; originally announced September 2023.

    Comments: 7 Pages

  41. arXiv:2309.06908  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Towards the TopMost: A Topic Modeling System Toolkit

    Authors: Xiaobao Wu, Fengjun Pan, Anh Tuan Luu

    Abstract: Topic models have a rich history with various applications and have recently been reinvigorated by neural topic modeling. However, these numerous topic models adopt totally distinct datasets, implementations, and evaluations. This impedes quick utilization and fair comparisons, and thereby hinders their research progress and applications. To tackle this challenge, we in this paper propose a Topic… ▽ More

    Submitted 14 June, 2024; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: Accepted to ACL 2024 System Demonstrations Track

  42. arXiv:2309.01219  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

    Authors: Yue Zhang, Yafu Li, Leyang Cui, Deng Cai, Lemao Liu, Tingchen Fu, Xinting Huang, Enbo Zhao, Yu Zhang, Yulong Chen, Longyue Wang, Anh Tuan Luu, Wei Bi, Freda Shi, Shuming Shi

    Abstract: While large language models (LLMs) have demonstrated remarkable capabilities across a range of downstream tasks, a significant concern revolves around their propensity to exhibit hallucinations: LLMs occasionally generate content that diverges from the user input, contradicts previously generated context, or misaligns with established world knowledge. This phenomenon poses a substantial challenge… ▽ More

    Submitted 24 September, 2023; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: work in progress; 32 pages

  43. Unlocking the Potential of User Feedback: Leveraging Large Language Model as User Simulator to Enhance Dialogue System

    Authors: Zhiyuan Hu, Yue Feng, Anh Tuan Luu, Bryan Hooi, Aldo Lipani

    Abstract: Dialogue systems and large language models (LLMs) have gained considerable attention. However, the direct utilization of LLMs as task-oriented dialogue (TOD) models has been found to underperform compared to smaller task-specific models. Nonetheless, it is crucial to acknowledge the significant potential of LLMs and explore improved approaches for leveraging their impressive abilities. Motivated b… ▽ More

    Submitted 19 October, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted by CIKM 2023

  44. arXiv:2306.08456  [pdf, other

    cs.CL

    PoetryDiffusion: Towards Joint Semantic and Metrical Manipulation in Poetry Generation

    Authors: Zhiyuan Hu, Chumin Liu, Yue Feng, Anh Tuan Luu, Bryan Hooi

    Abstract: Controllable text generation is a challenging and meaningful field in natural language generation (NLG). Especially, poetry generation is a typical one with well-defined and strict conditions for text generation which is an ideal playground for the assessment of current methodologies. While prior works succeeded in controlling either semantic or metrical aspects of poetry generation, simultaneousl… ▽ More

    Submitted 19 December, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

    Comments: Accepted by AAAI2024

  45. arXiv:2306.04217  [pdf, other

    cs.CL

    Effective Neural Topic Modeling with Embedding Clustering Regularization

    Authors: Xiaobao Wu, Xinshuai Dong, Thong Nguyen, Anh Tuan Luu

    Abstract: Topic models have been prevalent for decades with various applications. However, existing topic models commonly suffer from the notorious topic collapsing: discovered topics semantically collapse towards each other, leading to highly repetitive topics, insufficient topic discovery, and damaged model interpretability. In this paper, we propose a new neural topic model, Embedding Clustering Regulari… ▽ More

    Submitted 7 June, 2023; originally announced June 2023.

    Comments: Accepted to ICML 2023 conference

  46. arXiv:2305.15872  [pdf, other

    cs.CL cs.AI

    Jointprop: Joint Semi-supervised Learning for Entity and Relation Extraction with Heterogeneous Graph-based Propagation

    Authors: Yandan Zheng, Anran Hao, Anh Tuan Luu

    Abstract: Semi-supervised learning has been an important approach to address challenges in extracting entities and relations from limited data. However, current semi-supervised works handle the two tasks (i.e., Named Entity Recognition and Relation Extraction) separately and ignore the cross-correlation of entity and relation instances as well as the existence of similar instances across unlabeled data. To… ▽ More

    Submitted 25 May, 2023; originally announced May 2023.

  47. arXiv:2305.12744  [pdf, other

    cs.CL cs.AI

    Fact-Checking Complex Claims with Program-Guided Reasoning

    Authors: Liangming Pan, Xiaobao Wu, Xinyuan Lu, Anh Tuan Luu, William Yang Wang, Min-Yen Kan, Preslav Nakov

    Abstract: Fact-checking real-world claims often requires collecting multiple pieces of evidence and applying complex multi-step reasoning. In this paper, we present Program-Guided Fact-Checking (ProgramFC), a novel fact-checking model that decomposes complex claims into simpler sub-tasks that can be solved using a shared library of specialized functions. We first leverage the in-context learning ability of… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: ACL 2023 (main conference, long paper)

  48. arXiv:2305.12678  [pdf, other

    cs.CL

    Gradient-Boosted Decision Tree for Listwise Context Model in Multimodal Review Helpfulness Prediction

    Authors: Thong Nguyen, Xiaobao Wu, Xinshuai Dong, Anh Tuan Luu, Cong-Duy Nguyen, Zhen Hai, Lidong Bing

    Abstract: Multimodal Review Helpfulness Prediction (MRHP) aims to rank product reviews based on predicted helpfulness scores and has been widely applied in e-commerce via presenting customers with useful reviews. Previous studies commonly employ fully-connected neural networks (FCNNs) as the final score predictor and pairwise loss as the training objective. However, FCNNs have been shown to perform ineffici… ▽ More

    Submitted 25 May, 2023; v1 submitted 21 May, 2023; originally announced May 2023.

    Comments: Published in ACL 2023 (Findings)

  49. arXiv:2305.11442  [pdf, other

    cs.CL cs.AI cs.LG

    Zero-Shot Text Classification via Self-Supervised Tuning

    Authors: Chaoqun Liu, Wenxuan Zhang, Guizhen Chen, Xiaobao Wu, Anh Tuan Luu, Chip Hong Chang, Lidong Bing

    Abstract: Existing solutions to zero-shot text classification either conduct prompting with pre-trained language models, which is sensitive to the choices of templates, or rely on large-scale annotated data of relevant tasks for meta-tuning. In this work, we propose a new paradigm based on self-supervised learning to solve zero-shot text classification tasks by tuning the language models with unlabeled data… ▽ More

    Submitted 25 May, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Accepted to the Findings of ACL 2023

  50. arXiv:2304.13409  [pdf, other

    cs.CV

    Efficient Explainable Face Verification based on Similarity Score Argument Backpropagation

    Authors: Marco Huber, Anh Thi Luu, Philipp Terhörst, Naser Damer

    Abstract: Explainable Face Recognition is gaining growing attention as the use of the technology is gaining ground in security-critical applications. Understanding why two faces images are matched or not matched by a given face recognition system is important to operators, users, anddevelopers to increase trust, accountability, develop better systems, and highlight unfair behavior. In this work, we propose… ▽ More

    Submitted 7 November, 2023; v1 submitted 26 April, 2023; originally announced April 2023.

    Comments: Accepted at WACV 2024