Skip to main content

Showing 1–50 of 141 results for author: Dinh, T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.05584  [pdf, ps, other

    cs.LG

    TabFlex: Scaling Tabular Learning to Millions with Linear Attention

    Authors: Yuchen Zeng, Tuan Dinh, Wonjun Kang, Andreas C Mueller

    Abstract: Leveraging the in-context learning (ICL) capability of Large Language Models (LLMs) for tabular classification has gained significant attention for its training-free adaptability across diverse datasets. Recent advancements, like TabPFN, excel in small-scale tabular datasets but struggle to scale for large and complex datasets. Our work enhances the efficiency and scalability of TabPFN for larger… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 30 pages, ICML 2025

  2. arXiv:2506.03785  [pdf, ps, other

    cs.CL cs.AI

    Knockout LLM Assessment: Using Large Language Models for Evaluations through Iterative Pairwise Comparisons

    Authors: Isik Baran Sandan, Tu Anh Dinh, Jan Niehues

    Abstract: Large Language Models (LLMs) have shown to be effective evaluators across various domains such as machine translations or the scientific domain. Current LLM-as-a-Judge approaches rely mostly on individual assessments or a single round of pairwise assessments, preventing the judge LLM from developing a global ranking perspective. To address this, we present Knockout Assessment, an LLM-asa Judge met… ▽ More

    Submitted 5 June, 2025; v1 submitted 4 June, 2025; originally announced June 2025.

    Comments: Accepted to GEM @ ACL 2025

    ACM Class: I.2.7

  3. arXiv:2506.01856  [pdf

    cs.CR

    Synchronic Web Digital Identity: Speculations on the Art of the Possible

    Authors: Thien-Nam Dinh, Justin Li, Mitch Negus, Ken Goss

    Abstract: As search, social media, and artificial intelligence continue to reshape collective knowledge, the preservation of trust on the public infosphere has become a defining challenge of our time. Given the breadth and versatility of adversarial threats, the best--and perhaps only--defense is an equally broad and versatile infrastructure for digital identity. This document discusses the opportunities… ▽ More

    Submitted 2 June, 2025; originally announced June 2025.

    Report number: SAND2025-06684R

  4. arXiv:2505.19679  [pdf, ps, other

    cs.CL cs.AI

    KIT's Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization

    Authors: Zhaolin Li, Yining Liu, Danni Liu, Tuan Nam Nguyen, Enes Yavuz Ugan, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

    Abstract: This paper presents KIT's submissions to the IWSLT 2025 low-resource track. We develop both cascaded systems, consisting of Automatic Speech Recognition (ASR) and Machine Translation (MT) models, and end-to-end (E2E) Speech Translation (ST) systems for three language pairs: Bemba, North Levantine Arabic, and Tunisian Arabic into English. Building upon pre-trained models, we fine-tune our systems w… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

  5. arXiv:2505.12524  [pdf, ps, other

    cs.DB cs.LG

    HAKES: Scalable Vector Database for Embedding Search Service

    Authors: Guoyu Hu, Shaofeng Cai, Tien Tuan Anh Dinh, Zhongle Xie, Cong Yue, Gang Chen, Beng Chin Ooi

    Abstract: Modern deep learning models capture the semantics of complex data by transforming them into high-dimensional embedding vectors. Emerging applications, such as retrieval-augmented generation, use approximate nearest neighbor (ANN) search in the embedding vector space to find similar data. Existing vector databases provide indexes for efficient ANN searches, with graph-based indexes being the most p… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  6. arXiv:2504.15252  [pdf, other

    cs.AI cs.CV cs.LG

    SuoiAI: Building a Dataset for Aquatic Invertebrates in Vietnam

    Authors: Tue Vo, Lakshay Sharma, Tuan Dinh, Khuong Dinh, Trang Nguyen, Trung Phan, Minh Do, Duong Vu

    Abstract: Understanding and monitoring aquatic biodiversity is critical for ecological health and conservation efforts. This paper proposes SuoiAI, an end-to-end pipeline for building a dataset of aquatic invertebrates in Vietnam and employing machine learning (ML) techniques for species classification. We outline the methods for data collection, annotation, and model training, focusing on reducing annotati… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: Published as a workshop paper at "Tackling Climate Change with Machine Learning", ICLR 2025

  7. arXiv:2504.02194  [pdf, other

    cs.DB cs.CR

    FairDAG: Consensus Fairness over Concurrent Causal Design

    Authors: Dakai Kang, Junchao Chen, Tien Tuan Anh Dinh, Mohammad Sadoghi

    Abstract: The rise of cryptocurrencies like Bitcoin and Ethereum has driven interest in blockchain technology, with Ethereum's smart contracts enabling the growth of decentralized finance (DeFi). However, research has shown that adversaries exploit transaction ordering to extract profits through attacks like front-running, sandwich attacks, and liquidation manipulation. This issue affects both permissionles… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

    Comments: 17 pages, 15 figures

  8. arXiv:2503.10036  [pdf, other

    cs.DB

    CCaaLF: Concurrency Control as a Learnable Function

    Authors: Hexiang Pan, Shaofeng Cai, Tien Tuan Anh Dinh, Yuncheng Wu, Yeow Meng Chee, Gang Chen, Beng Chin Ooi

    Abstract: Concurrency control (CC) algorithms are important in modern transactional databases, as they enable high performance by executing transactions concurrently while ensuring correctness. However, state-of-the-art CC algorithms struggle to perform well across diverse workloads, and most do not consider workload drifts. In this paper, we propose CCaaLF (Concurrency Control as a Learnable Function), a… ▽ More

    Submitted 25 March, 2025; v1 submitted 13 March, 2025; originally announced March 2025.

    MSC Class: 68P15 ACM Class: H.2.4

  9. arXiv:2503.08303  [pdf, other

    quant-ph cs.LG

    Energy Scale Degradation in Sparse Quantum Solvers: A Barrier to Quantum Utility

    Authors: Thang N. Dinh, Cao P. Cong

    Abstract: Quantum computing offers a promising route for tackling hard optimization problems by encoding them as Ising models. However, sparse qubit connectivity requires the use of minor-embedding, mapping logical qubits onto chains of physical qubits, which necessitates stronger intra-chain coupling to maintain consistency. This elevated coupling strength forces a rescaling of the Hamiltonian due to hardw… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  10. arXiv:2503.02293  [pdf, other

    cs.IT eess.SP

    Sparse Orthogonal Matching Pursuit-based Parameter Estimation for Integrated Sensing and Communications

    Authors: Ngoc-Son Duong, Khac-Hoang Ngo, Thai-Mai Dinh, Van-Linh Nguyen

    Abstract: Accurate parameter estimation such as angle of arrival (AOA) is essential to enhance the performance of integrated sensing and communication (ISAC) in mmWave multiple-input multiple-output (MIMO) systems. This work presents a sensing-aided communication channel estimation mechanism, where the sensing channel shares the same AOA with the uplink communication channel. First, we propose a novel ortho… ▽ More

    Submitted 4 March, 2025; originally announced March 2025.

    Comments: IEEE INFOCOM Workshop 2025

  11. arXiv:2502.11115  [pdf, ps, other

    cs.CL

    Are Generative Models Underconfident? Better Quality Estimation with Boosted Model Probability

    Authors: Tu Anh Dinh, Jan Niehues

    Abstract: Quality Estimation (QE) is estimating quality of the model output during inference when the ground truth is not available. Deriving output quality from the models' output probability is the most trivial and low-effort way. However, we show that the output probability of text-generation models can appear underconfident. At each output step, there can be multiple correct options, making the probabil… ▽ More

    Submitted 29 May, 2025; v1 submitted 16 February, 2025; originally announced February 2025.

    ACM Class: I.2.7

  12. arXiv:2412.18760  [pdf, other

    cs.AI

    Data clustering: an essential technique in data science

    Authors: Tai Dinh, Wong Hauchi, Daniil Lisik, Michal Koren, Dat Tran, Philip S. Yu, Joaquín Torres-Sospedra

    Abstract: This paper explores the critical role of data clustering in data science, emphasizing its methodologies, tools, and diverse applications. Traditional techniques, such as partitional and hierarchical clustering, are analyzed alongside advanced approaches such as data stream, density-based, graph-based, and model-based clustering for handling complex structured datasets. The paper highlights key pri… ▽ More

    Submitted 30 January, 2025; v1 submitted 24 December, 2024; originally announced December 2024.

  13. arXiv:2412.18571  [pdf, other

    quant-ph cs.DM cs.LG

    Scalable Quantum-Inspired Optimization through Dynamic Qubit Compression

    Authors: Co Tran, Quoc-Bao Tran, Hy Truong Son, Thang N Dinh

    Abstract: Hard combinatorial optimization problems, often mapped to Ising models, promise potential solutions with quantum advantage but are constrained by limited qubit counts in near-term devices. We present an innovative quantum-inspired framework that dynamically compresses large Ising models to fit available quantum hardware of different sizes. Thus, we aim to bridge the gap between large-scale optimiz… ▽ More

    Submitted 24 December, 2024; originally announced December 2024.

    Comments: Accepted to AAAI'25

  14. arXiv:2412.11640  [pdf, other

    cs.CR cs.DC

    SeSeMI: Secure Serverless Model Inference on Sensitive Data

    Authors: Guoyu Hu, Yuncheng Wu, Gang Chen, Tien Tuan Anh Dinh, Beng Chin Ooi

    Abstract: Model inference systems are essential for implementing end-to-end data analytics pipelines that deliver the benefits of machine learning models to users. Existing cloud-based model inference systems are costly, not easy to scale, and must be trusted in handling the models and user request data. Serverless computing presents a new opportunity, as it provides elasticity and fine-grained pricing. Our… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  15. arXiv:2412.09829   

    cs.CY

    Speech-based Multimodel Pipeline for Vietnamese Services Quality Assessment

    Authors: Quang-Anh N. D., Minh-Duc Pham, Thai Kim Dinh

    Abstract: In the evolving landscape of customer service within the digital economy, traditional methods of service quality assessment have shown significant limitations, this research proposes a novel deep-learning approach to service quality assessment, focusing on the Vietnamese service sector. By leveraging a multi-modal pipeline that transcends traditional evaluation methods, the research addresses the… ▽ More

    Submitted 18 December, 2024; v1 submitted 12 December, 2024; originally announced December 2024.

    Comments: I am writing to request the withdrawal of my preprint due to the discovery of significant inaccuracies in the results. These errors could mislead future research and applications, which compromises the integrity of my work. I believe withdrawing the paper is essential to uphold scientific standards and prevent the dissemination of misleading information. Thank you for your understanding

  16. arXiv:2412.08683  [pdf, other

    cs.SD cs.CV eess.AS

    Emotional Vietnamese Speech-Based Depression Diagnosis Using Dynamic Attention Mechanism

    Authors: Quang-Anh N. D., Manh-Hung Ha, Thai Kim Dinh, Minh-Duc Pham, Ninh Nguyen Van

    Abstract: Major depressive disorder is a prevalent and serious mental health condition that negatively impacts your emotions, thoughts, actions, and overall perception of the world. It is complicated to determine whether a person is depressed due to the symptoms of depression not apparent. However, their voice can be one of the factor from which we can acknowledge signs of depression. People who are depress… ▽ More

    Submitted 11 December, 2024; originally announced December 2024.

    Comments: 9 Page, 5 Figures

  17. arXiv:2410.06408  [pdf, other

    cs.LG

    Automating Data Science Pipelines with Tensor Completion

    Authors: Shaan Pakala, Bryce Graw, Dawon Ahn, Tam Dinh, Mehnaz Tabassum Mahin, Vassilis Tsotras, Jia Chen, Evangelos E. Papalexakis

    Abstract: Hyperparameter optimization is an essential component in many data science pipelines and typically entails exhaustive time and resource-consuming computations in order to explore the combinatorial search space. Similar to this problem, other key operations in data science pipelines exhibit the exact same properties. Important examples are: neural architecture search, where the goal is to identify… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  18. arXiv:2408.17244  [pdf, other

    cs.LG

    Categorical data clustering: 25 years beyond K-modes

    Authors: Tai Dinh, Wong Hauchi, Philippe Fournier-Viger, Daniil Lisik, Minh-Quyet Ha, Hieu-Chi Dam, Van-Nam Huynh

    Abstract: The clustering of categorical data is a common and important task in computer science, offering profound implications across a spectrum of applications. Unlike purely numerical data, categorical data often lack inherent ordering as in nominal data, or have varying levels of order as in ordinal data, thus requiring specialized methodologies for efficient organization and analysis. This review provi… ▽ More

    Submitted 24 January, 2025; v1 submitted 30 August, 2024; originally announced August 2024.

    Comments: Accepted at Expert Systems With Applications

  19. arXiv:2406.10421  [pdf, other

    cs.CL

    SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading

    Authors: Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Fabian Ternava, Jianfeng Gao, Tobias Röddiger, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues

    Abstract: With the rapid development of Large Language Models (LLMs), it is crucial to have benchmarks which can evaluate the ability of LLMs on different domains. One common use of LLMs is performing tasks on scientific topics, such as writing algorithms, querying databases or giving mathematical proofs. Inspired by the way university students are evaluated on such tasks, in this paper, we propose SciEx -… ▽ More

    Submitted 2 October, 2024; v1 submitted 14 June, 2024; originally announced June 2024.

    Comments: Accepted to EMNLP 2024 Main Conference

    ACM Class: I.2.7

  20. arXiv:2405.07180  [pdf, other

    cs.IT

    Repairing Reed-Solomon Codes with Side Information

    Authors: Thi Xinh Dinh, Ba Thong Le, Son Hoang Dau, Serdar Boztas, Stanislav Kruglik, Han Mao Kiah, Emanuele Viterbo, Tuvi Etzion, Yeow Meng Chee

    Abstract: We generalize the problem of recovering a lost/erased symbol in a Reed-Solomon code to the scenario in which some side information about the lost symbol is known. The side information is represented as a set $S$ of linearly independent combinations of the sub-symbols of the lost symbol. When $S = \varnothing$, this reduces to the standard problem of repairing a single codeword symbol. When $S$ is… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    MSC Class: 94B05; 94B60 ACM Class: E.4

  21. arXiv:2404.18031  [pdf, other

    cs.CL

    Quality Estimation with $k$-nearest Neighbors and Automatic Evaluation for Model-specific Quality Estimation

    Authors: Tu Anh Dinh, Tobias Palzer, Jan Niehues

    Abstract: Providing quality scores along with Machine Translation (MT) output, so-called reference-free Quality Estimation (QE), is crucial to inform users about the reliability of the translation. We propose a model-specific, unsupervised QE approach, termed $k$NN-QE, that extracts information from the MT model's training data using $k$-nearest neighbors. Measuring the performance of model-specific QE is n… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: Accepted to EAMT 2024

    ACM Class: I.2.7

  22. arXiv:2403.15509  [pdf, other

    cs.CR cs.AI cs.LG

    Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection

    Authors: Phai Vu Dinh, Quang Uy Nguyen, Thai Hoang Dinh, Diep N. Nguyen, Bao Son Pham, Eryk Dutkiewicz

    Abstract: Representation learning (RL) methods for cyberattack detection face the diversity and sophistication of attack data, leading to the issue of mixed representations of different classes, particularly as the number of classes increases. To address this, the paper proposes a novel deep learning architecture/model called the Twin Auto-Encoder (TAE). TAE first maps the input data into latent space and t… ▽ More

    Submitted 28 April, 2025; v1 submitted 21 March, 2024; originally announced March 2024.

  23. arXiv:2401.10447  [pdf, other

    cs.CL cs.AI cs.LG cs.NE cs.SD eess.AS

    Investigating Training Strategies and Model Robustness of Low-Rank Adaptation for Language Modeling in Speech Recognition

    Authors: Yu Yu, Chao-Han Huck Yang, Tuan Dinh, Sungho Ryu, Jari Kolehmainen, Roger Ren, Denis Filimonov, Prashanth G. Shivakumar, Ankur Gandhe, Ariya Rastow, Jia Xu, Ivan Bulyko, Andreas Stolcke

    Abstract: The use of low-rank adaptation (LoRA) with frozen pretrained language models (PLMs) has become increasing popular as a mainstream, resource-efficient modeling approach for memory-constrained hardware. In this study, we first explore how to enhance model performance by introducing various LoRA training strategies, achieving relative word error rate reductions of 3.50\% on the public Librispeech dat… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  24. arXiv:2311.07289  [pdf, other

    cs.LG

    A probabilistic forecast methodology for volatile electricity prices in the Australian National Electricity Market

    Authors: Cameron Cornell, Nam Trong Dinh, S. Ali Pourmousavi

    Abstract: The South Australia region of the Australian National Electricity Market (NEM) displays some of the highest levels of price volatility observed in modern electricity markets. This paper outlines an approach to probabilistic forecasting under these extreme conditions, including spike filtration and several post-processing steps. We propose using quantile regression as an ensemble tool for probabili… ▽ More

    Submitted 12 December, 2023; v1 submitted 13 November, 2023; originally announced November 2023.

    Comments: This manuscript has been accepted for publication in International Journal of Forecasting

  25. arXiv:2310.02494  [pdf, other

    eess.SY cs.IT

    On the Financial Consequences of Simplified Battery Sizing Models without Considering Operational Details

    Authors: Nam Trong Dinh, Sahand Karimi-Arpanahi, S. Ali Pourmousavi, Mingyu Guo, Julian Lemos-Vinasco, Jon A. R. Liisberg

    Abstract: Optimal battery sizing studies tend to overly simplify the practical aspects of battery operation within the battery sizing framework. Such assumptions may lead to a suboptimal battery capacity, resulting in significant financial losses for a battery project that could last more than a decade. In this paper, we compare the most common existing sizing methods in the literature with a battery sizing… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: This manuscript has been submitted to PSCC 2024 for possible publication

  26. arXiv:2309.15223  [pdf, other

    cs.CL cs.AI cs.LG cs.NE cs.SD eess.AS

    Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

    Authors: Yu Yu, Chao-Han Huck Yang, Jari Kolehmainen, Prashanth G. Shivakumar, Yile Gu, Sungho Ryu, Roger Ren, Qi Luo, Aditya Gourav, I-Fan Chen, Yi-Chieh Liu, Tuan Dinh, Ankur Gandhe, Denis Filimonov, Shalini Ghosh, Andreas Stolcke, Ariya Rastow, Ivan Bulyko

    Abstract: We propose a neural language modeling system based on low-rank adaptation (LoRA) for speech recognition output rescoring. Although pretrained language models (LMs) like BERT have shown superior performance in second-pass rescoring, the high computational cost of scaling up the pretraining stage and adapting the pretrained models to specific domains limit their practical use in rescoring. Here we p… ▽ More

    Submitted 10 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

    Comments: Accepted to IEEE ASRU 2023. Internal Review Approved. Revised 2nd version with Andreas and Huck. The first version is in Sep 29th. 8 pages

    Journal ref: Proc. IEEE ASRU Workshop, Dec. 2023

  27. arXiv:2309.09012  [pdf, other

    eess.SY cs.IT

    Modelling Irrational Behaviour of Residential End Users using Non-Stationary Gaussian Processes

    Authors: Nam Trong Dinh, Sahand Karimi-Arpanahi, Rui Yuan, S. Ali Pourmousavi, Mingyu Guo, Jon A. R. Liisberg, Julian Lemos-Vinasco

    Abstract: Demand response (DR) plays a critical role in ensuring efficient electricity consumption and optimal use of network assets. Yet, existing DR models often overlook a crucial element, the irrational behaviour of electricity end users. In this work, we propose a price-responsive model that incorporates key aspects of end-user irrationality, specifically loss aversion, time inconsistency, and bounded… ▽ More

    Submitted 26 March, 2024; v1 submitted 16 September, 2023; originally announced September 2023.

    Comments: This manuscript has been accepted for publication in IEEE Transactions on Smart Grid

  28. arXiv:2308.03415  [pdf, other

    cs.CL cs.AI

    End-to-End Evaluation for Low-Latency Simultaneous Speech Translation

    Authors: Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Yavuz Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel

    Abstract: The challenge of low-latency speech translation has recently draw significant interest in the research community as shown by several publications and shared tasks. Therefore, it is essential to evaluate these different approaches in realistic scenarios. However, currently only specific aspects of the systems are evaluated and often it is not possible to compare different approaches. In this work… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: Demo paper at EMNLP 2023

  29. arXiv:2306.15860  [pdf, other

    cs.NI

    Federated Deep Reinforcement Learning-based Bitrate Adaptation for Dynamic Adaptive Streaming over HTTP

    Authors: Phuong L. Vo, Nghia T. Nguyen, Long Luu, Canh T. Dinh, Nguyen H. Tran, Tuan-Anh Le

    Abstract: In video streaming over HTTP, the bitrate adaptation selects the quality of video chunks depending on the current network condition. Some previous works have applied deep reinforcement learning (DRL) algorithms to determine the chunk's bitrate from the observed states to maximize the quality-of-experience (QoE). However, to build an intelligent model that can predict in various environments, such… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 13 pages, 1 column

  30. arXiv:2306.09754  [pdf, other

    cs.CR

    CroCoDai: A Stablecoin for Cross-Chain Commerce

    Authors: Daniël Reijsbergen, Bretislav Hajek, Tien Tuan Anh Dinh, Jussi Keppo, Henry F. Korth, Anwitaman Datta

    Abstract: Decentralized Finance (DeFi), in which digital assets are exchanged without trusted intermediaries, has grown rapidly in value in recent years. The global DeFi ecosystem is fragmented into multiple blockchains, fueling the demand for cross-chain commerce. Existing approaches for cross-chain transactions, e.g., bridges and cross-chain deals, achieve atomicity by locking assets in escrow. However, l… ▽ More

    Submitted 14 October, 2024; v1 submitted 16 June, 2023; originally announced June 2023.

    Comments: Accepted for publication in ACM Distributed Ledger Technologies: Research and Practice

  31. arXiv:2306.09735  [pdf, other

    cs.CR

    PIEChain -- A Practical Blockchain Interoperability Framework

    Authors: Daniël Reijsbergen, Aung Maw, Jingchi Zhang, Tien Tuan Anh Dinh, Anwitaman Datta

    Abstract: A plethora of different blockchain platforms have emerged in recent years, but many of them operate in silos. As such, there is a need for reliable cross-chain communication to enable blockchain interoperability. Blockchain interoperability is challenging because transactions can typically not be reverted - as such, if one transaction is committed then the protocol must ensure that all related tra… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  32. arXiv:2306.05320  [pdf, other

    cs.CL cs.SD

    KIT's Multilingual Speech Translation System for IWSLT 2023

    Authors: Danni Liu, Thai Binh Nguyen, Sai Koneru, Enes Yavuz Ugan, Ngoc-Quan Pham, Tuan-Nam Nguyen, Tu Anh Dinh, Carlos Mullov, Alexander Waibel, Jan Niehues

    Abstract: Many existing speech translation benchmarks focus on native-English speech in high-quality recording conditions, which often do not match the conditions in real-life use-cases. In this paper, we describe our speech translation system for the multilingual track of IWSLT 2023, which evaluates translation quality on scientific conference talks. The test condition features accented input speech and te… ▽ More

    Submitted 12 July, 2023; v1 submitted 8 June, 2023; originally announced June 2023.

    Comments: IWSLT 2023

  33. arXiv:2306.03438  [pdf, other

    cs.LG cs.AI cs.CL cs.SE

    Large Language Models of Code Fail at Completing Code with Potential Bugs

    Authors: Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis

    Abstract: Large language models of code (Code-LLMs) have recently brought tremendous advances to code completion, a fundamental feature of programming assistance and code intelligence. However, most existing works ignore the possible presence of bugs in the code context for generation, which are inevitable in software development. Therefore, we introduce and study the buggy-code completion problem, inspired… ▽ More

    Submitted 30 November, 2023; v1 submitted 6 June, 2023; originally announced June 2023.

    Comments: 27 pages, accepted to NeurIPS 2023

  34. arXiv:2305.07457  [pdf, other

    cs.CL

    Perturbation-based QE: An Explainable, Unsupervised Word-level Quality Estimation Method for Blackbox Machine Translation

    Authors: Tu Anh Dinh, Jan Niehues

    Abstract: Quality Estimation (QE) is the task of predicting the quality of Machine Translation (MT) system output, without using any gold-standard translation references. State-of-the-art QE models are supervised: they require human-labeled quality of some MT system output on some datasets for training, making them domain-dependent and MT-system-dependent. There has been research on unsupervised QE, which r… ▽ More

    Submitted 13 July, 2023; v1 submitted 12 May, 2023; originally announced May 2023.

    Comments: Accepted to MT Summit 2023

    ACM Class: I.2.7

  35. arXiv:2305.06600  [pdf, other

    cs.IT cs.DM

    Designing Compact Repair Groups for Reed-Solomon Codes

    Authors: Thi Xinh Dinh, Serdar Boztas, Son Hoang Dau, Emanuele Viterbo

    Abstract: Motivated by the application of Reed-Solomon codes to recently emerging decentralized storage systems such as Storj and Filebase/Sia, we study the problem of designing compact repair groups for recovering multiple failures in a decentralized manner. Here, compactness means that the corresponding trace repair schemes of these groups of helpers can be generated from a single or a few seed repair sch… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

  36. Development of a Vision System to Enhance the Reliability of the Pick-and-Place Robot for Autonomous Testing of Camera Module used in Smartphones

    Authors: Hoang-Anh Phan, Duy Nam Bui, Tuan Nguyen Dinh, Bao-Anh Hoang, An Nguyen Ngoc, Dong Tran Huu Quoc, Ha Tran Thi Thuy, Tung Thanh Bui, Van Nguyen Thi Thanh

    Abstract: Pick-and-place robots are commonly used in modern industrial manufacturing. For complex devices/parts like camera modules used in smartphones, which contain optical parts, electrical components and interfacing connectors, the placement operation may not absolutely accurate, which may cause damage in the device under test during the mechanical movement to make good contact for electrical functions… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Published to 2021 International Conference on Engineering and Emerging Technologies (ICEET 2021). 6 pages

  37. arXiv:2304.13671  [pdf, other

    math.OC cs.AI

    Multiobjective Logistics Optimization for Automated ATM Cash Replenishment Process

    Authors: Bui Tien Thanh, Dinh Van Tuan, Tuan Anh Chi, Nguyen Van Dai, Nguyen Tai Quang Dinh, Nguyen Thu Thuy, Nguyen Thi Xuan Hoa

    Abstract: In the digital transformation era, integrating digital technology into every aspect of banking operations improves process automation, cost efficiency, and service level improvement. Although logistics for ATM cash is a crucial task that impacts operating costs and consumer satisfaction, there has been little effort to enhance it. Specifically, in Vietnam, with a market of more than 20,000 ATMs na… ▽ More

    Submitted 22 July, 2023; v1 submitted 23 April, 2023; originally announced April 2023.

  38. arXiv:2304.03433  [pdf, other

    cs.IT eess.SP

    Multi-User Cooperation for Covert Communication Under Quasi-Static Fading

    Authors: Jinyoung Lee, Duc Trung Dinh, Hyeonsik Yeom, Si-Hyeon Lee, Jeongseok Ha

    Abstract: This work studies a covert communication scheme for an uplink multi-user scenario in which some users are opportunistically selected to help a covert user. In particular, the selected users emit interfering signals via an orthogonal resource dedicated to the covert user together with signals for their own communications using orthogonal resources allocated to the selected users, which helps the co… ▽ More

    Submitted 10 April, 2023; v1 submitted 6 April, 2023; originally announced April 2023.

    Comments: 13 pages, 8 figures, This work has been submitted to the IEEE for possible publication

  39. arXiv:2302.11426  [pdf, other

    cs.DB cs.IR

    Mining compact high utility sequential patterns

    Authors: Tai Dinh, Philippe Fournier-Viger, Huynh Van Hong

    Abstract: High utility sequential pattern mining (HUSPM) aims to mine all patterns that yield a high utility (profit) in a sequence dataset. HUSPM is useful for several applications such as market basket analysis, marketing, and website clickstream analysis. In these applications, users may also consider high utility patterns frequently appearing in the dataset to obtain more fruitful information. However,… ▽ More

    Submitted 22 February, 2023; originally announced February 2023.

    Comments: Nippon (Japan) Applied Informatics Society Journal

  40. arXiv:2302.05512  [pdf

    cs.CR cs.DL

    Composable Ledgers for Distributed Synchronic Web Archiving

    Authors: Thien-Nam Dinh, Nicholas Pattengale

    Abstract: The Synchronic Web is a highly scalable notary infrastructure that provides tamper-evident data provenance for historical web data. In this document, we describe the applicability of this infrastructure for web archiving across three envisioned stages of adoption. We codify the core mechanism enabling the value proposition: a procedure for splitting and merging cryptographic information fluidly ac… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  41. arXiv:2301.10733  [pdf

    cs.CR

    The Synchronic Web

    Authors: Thien-Nam Dinh, Nicholas Pattengale, Steven Elliott

    Abstract: The Synchronic Web is a distributed network for securing data provenance on the World Wide Web. By enabling clients around the world to freely commit digital information into a single shared view of history, it provides a foundational basis of truth on which to build decentralized and scalable trust across the Internet. Its core cryptographical capability allows mutually distrusting parties to cre… ▽ More

    Submitted 10 June, 2024; v1 submitted 25 January, 2023; originally announced January 2023.

  42. arXiv:2212.10723  [pdf, other

    cs.AI

    Predict+Optimize Problem in Renewable Energy Scheduling

    Authors: Christoph Bergmeir, Frits de Nijs, Evgenii Genov, Abishek Sriramulu, Mahdi Abolghasemi, Richard Bean, John Betts, Quang Bui, Nam Trong Dinh, Nils Einecke, Rasul Esmaeilbeigi, Scott Ferraro, Priya Galketiya, Robert Glasgow, Rakshitha Godahewa, Yanfei Kang, Steffen Limmer, Luis Magdalena, Pablo Montero-Manso, Daniel Peralta, Yogesh Pipada Sunil Kumar, Alejandro Rosales-Pérez, Julian Ruddick, Akylas Stratigakos, Peter Stuckey , et al. (3 additional authors not shown)

    Abstract: Predict+Optimize frameworks integrate forecasting and optimization to address real-world challenges such as renewable energy scheduling, where variability and uncertainty are critical factors. This paper benchmarks solutions from the IEEE-CIS Technical Challenge on Predict+Optimize for Renewable Energy Scheduling, focusing on forecasting renewable production and demand and optimizing energy cost.… ▽ More

    Submitted 14 April, 2025; v1 submitted 20 December, 2022; originally announced December 2022.

  43. arXiv:2212.00981  [pdf, other

    cs.CV cs.AI

    QC-StyleGAN -- Quality Controllable Image Generation and Manipulation

    Authors: Dat Viet Thanh Nguyen, Phong Tran The, Tan M. Dinh, Cuong Pham, Anh Tuan Tran

    Abstract: The introduction of high-quality image generation models, particularly the StyleGAN family, provides a powerful tool to synthesize and manipulate images. However, existing models are built upon high-quality (HQ) data as desired outputs, making them unfit for in-the-wild low-quality (LQ) images, which are common inputs for manipulation. In this work, we bridge this gap by proposing a novel GAN stru… ▽ More

    Submitted 7 December, 2022; v1 submitted 2 December, 2022; originally announced December 2022.

    Comments: Accepted to NeurIPS 2022; The code is available at https://github.com/VinAIResearch/QC-StyleGAN

  44. arXiv:2211.11001  [pdf

    cs.CV

    F2SD: A dataset for end-to-end group detection algorithms

    Authors: Giang Hoang, Tuan Nguyen Dinh, Tung Cao Hoang, Son Le Duy, Keisuke Hihara, Yumeka Utada, Akihiko Torii, Naoki Izumi, Long Tran Quoc

    Abstract: The lack of large-scale datasets has been impeding the advance of deep learning approaches to the problem of F-formation detection. Moreover, most research works on this problem rely on input sensor signals of object location and orientation rather than image signals. To address this, we develop a new, large-scale dataset of simulated images for F-formation detection, called F-formation Simulation… ▽ More

    Submitted 20 November, 2022; originally announced November 2022.

    Comments: Accepted at ICMV 2022

  45. arXiv:2210.12990  [pdf, other

    cs.LG math.OC

    Optimal activity and battery scheduling algorithm using load and solar generation forecasts

    Authors: Yogesh Pipada Sunil Kumar, Rui Yuan, Nam Trong Dinh, S. Ali Pourmousavi

    Abstract: Energy usage optimal scheduling has attracted great attention in the power system community, where various methodologies have been proposed. However, in real-world applications, the optimal scheduling problems require reliable energy forecasting, which is scarcely discussed as a joint solution to the scheduling problem. The 5\textsuperscript{th} IEEE Computational Intelligence Society (IEEE-CIS) c… ▽ More

    Submitted 24 October, 2022; originally announced October 2022.

    Comments: 6 pages, 4 figures, 3 tables. Accepted for IEEE proceedings as a conference paper for AUPEC 2022

  46. arXiv:2210.11702  [pdf, other

    cs.CR

    TAP: Transparent and Privacy-Preserving Data Services

    Authors: Daniel Reijsbergen, Aung Maw, Zheng Yang, Tien Tuan Anh Dinh, Jianying Zhou

    Abstract: Users today expect more security from services that handle their data. In addition to traditional data privacy and integrity requirements, they expect transparency, i.e., that the service's processing of the data is verifiable by users and trusted auditors. Our goal is to build a multi-user system that provides data privacy, integrity, and transparency for a large number of operations, while achie… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

    Comments: Accepted for USENIX Security 2023

  47. arXiv:2210.06042  [pdf, ps, other

    cs.IT

    Efficient Hamiltonian Reduction for Quantum Annealing on SatCom Beam Placement Problem

    Authors: Thinh Q. Dinh, Son Hoang Dau, Eva Lagunas, Symeon Chatzinotas

    Abstract: Beam Placement (BP) is a well-known problem in Low-Earth Orbit (LEO) satellite communication (SatCom) systems, which can be modelled as an NP-hard clique cover problem. Recently, quantum computing has emerged as a novel technology which revolutionizes how to solve challenging optimization problems by formulating Quadratic Unconstrained Binary Optimization (QUBO), then preparing Hamiltonians as inp… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

  48. arXiv:2208.04613  [pdf

    eess.IV cs.CV cs.LG

    Res-Dense Net for 3D Covid Chest CT-scan classification

    Authors: Quoc-Huy Trinh, Minh-Van Nguyen, Thien-Phuc Nguyen Dinh

    Abstract: One of the most contentious areas of research in Medical Image Preprocessing is 3D CT-scan. With the rapid spread of COVID-19, the function of CT-scan in properly and swiftly diagnosing the disease has become critical. It has a positive impact on infection prevention. There are many tasks to diagnose the illness through CT-scan images, include COVID-19. In this paper, we propose a method that usin… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: text overlap with arXiv:2106.07524 by other authors

  49. arXiv:2208.04609  [pdf, other

    cs.LG cs.SI

    E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes

    Authors: Tu Anh Dinh, Jeroen den Boef, Joran Cornelisse, Paul Groth

    Abstract: Node classification utilizing text-based node attributes has many real-world applications, ranging from prediction of paper topics in academic citation graphs to classification of user characteristics in social media networks. State-of-the-art node classification frameworks, such as GIANT, use a two-stage pipeline: first embedding the text attributes of graph nodes then feeding the resulting embed… ▽ More

    Submitted 26 September, 2023; v1 submitted 9 August, 2022; originally announced August 2022.

    Comments: Accepted to MLoG - IEEE International Conference on Data Mining Workshops ICDMW 2023

  50. arXiv:2207.00944  [pdf, other

    cs.DB

    GlassDB: An Efficient Verifiable Ledger Database System Through Transparency

    Authors: Cong Yue, Tien Tuan Anh Dinh, Zhongle Xie, Meihui Zhang, Gang Chen, Beng Chin Ooi, Xiaokui Xiao

    Abstract: Verifiable ledger databases protect data history against malicious tampering. Existing systems, such as blockchains and certificate transparency, are based on transparency logs -- a simple abstraction allowing users to verify that a log maintained by an untrusted server is append-only. They expose a simple key-value interface. Building a practical database from transparency logs, on the other hand… ▽ More

    Submitted 19 February, 2023; v1 submitted 2 July, 2022; originally announced July 2022.