Skip to main content

Showing 1–50 of 52 results for author: Le, T D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.10501  [pdf, ps, other

    cs.LG cs.AI

    From Noise to Precision: A Diffusion-Driven Approach to Zero-Inflated Precipitation Prediction

    Authors: Wentao Gao, Jiuyong Li, Lin Liu, Thuc Duy Le, Xiongren Chen, Xiaojing Du, Jixue Liu, Yanchang Zhao, Yun Chen

    Abstract: Zero-inflated data pose significant challenges in precipitation forecasting due to the predominance of zeros with sparse non-zero events. To address this, we propose the Zero Inflation Diffusion Framework (ZIDF), which integrates Gaussian perturbation for smoothing zero-inflated distributions, Transformer-based prediction for capturing temporal patterns, and diffusion-based denoising to restore th… ▽ More

    Submitted 1 September, 2025; originally announced September 2025.

    Comments: ECAI 2025 Accepted

  2. arXiv:2509.01731  [pdf, ps, other

    cs.CR

    Are Enterprises Ready for Quantum-Safe Cybersecurity?

    Authors: Tran Duc Le, Phuc Hao Do, Truong Duy Dinh, Van Dai Pham

    Abstract: Quantum computing threatens to undermine classical cryptography by breaking widely deployed encryption and signature schemes. This paper examines enterprise readiness for quantum-safe cybersecurity through three perspectives: (i) the technologist view, assessing the maturity of post-quantum cryptography (PQC) and quantum key distribution (QKD); (ii) the enterprise (CISO/CIO) view, analyzing organi… ▽ More

    Submitted 1 September, 2025; originally announced September 2025.

    Comments: Are Enterprises Ready for Quantum-Safe Cybersecurity?

  3. arXiv:2509.00130  [pdf, ps, other

    cs.NI

    VOTA: Parallelizing 6G-RAN Experimentation with Virtualized Over-The-Air Workloads

    Authors: Chang Liu, T. D. Khoa Le, Rahul Saini, Kishor C. Joshi, George Exarchakos

    Abstract: Testbed sharing, a practice in which different researchers concurrently develop independent use cases on top of the same testbed, is ubiquitous in wireless experimental research. Its key drawback is experimental inconvenience: one must delay experiments or tolerate compute and RF interference that harms experimental fidelity. In this paper, we propose \textbf{VOTA}, an open-source, software-only t… ▽ More

    Submitted 29 August, 2025; originally announced September 2025.

  4. arXiv:2508.12074  [pdf, ps, other

    quant-ph cs.CC

    Raising the Bar: An Asymptotic Comparison of Classical and Quantum Shortest Path Algorithms

    Authors: Phuc Hao Do, Tran Duc Le

    Abstract: The Single-Source Shortest Path (SSSP) problem is a cornerstone of computer science with vast applications, for which Dijkstra's algorithm has long been the classical baseline. While various quantum algorithms have been proposed, their performance has typically been benchmarked against this decades-old approach. This landscape was recently reshaped by the introduction of a new classical algorithm… ▽ More

    Submitted 16 August, 2025; originally announced August 2025.

    Comments: 04 figures and 02 tables

  5. arXiv:2508.04801  [pdf, ps, other

    cs.CV

    ACM Multimedia Grand Challenge on ENT Endoscopy Analysis

    Authors: Trong-Thuan Nguyen, Viet-Tham Huynh, Thao Thi Phuong Dao, Ha Nguyen Thi, Tien To Vu Thuy, Uyen Hanh Tran, Tam V. Nguyen, Thanh Dinh Le, Minh-Triet Tran

    Abstract: Automated analysis of endoscopic imagery is a critical yet underdeveloped component of ENT (ear, nose, and throat) care, hindered by variability in devices and operators, subtle and localized findings, and fine-grained distinctions such as laterality and vocal-fold state. In addition to classification, clinicians require reliable retrieval of similar cases, both visually and through concise textua… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

  6. arXiv:2508.04288  [pdf, ps, other

    quant-ph cs.AI eess.SY

    Challenges in Applying Variational Quantum Algorithms to Dynamic Satellite Network Routing

    Authors: Phuc Hao Do, Tran Duc Le

    Abstract: Applying near-term variational quantum algorithms to the problem of dynamic satellite network routing represents a promising direction for quantum computing. In this work, we provide a critical evaluation of two major approaches: static quantum optimizers such as the Variational Quantum Eigensolver (VQE) and the Quantum Approximate Optimization Algorithm (QAOA) for offline route computation, and Q… ▽ More

    Submitted 6 August, 2025; originally announced August 2025.

    Comments: 17 pages and 3 figures

  7. arXiv:2508.03466  [pdf, ps, other

    astro-ph.EP astro-ph.IM cs.NE

    A Genetic Algorithm Framework for Optimizing Three-Impulse Orbital Transfers with Poliastro Simulation

    Authors: Phuc Hao Do, Tran Duc Le

    Abstract: Orbital maneuver planning is a critical aspect of mission design, aimed at minimizing propellant consumption, which is directly correlated with the total velocity change ($ΔV$). While analytical solutions like the Hohmann and Bi-elliptic transfers offer optimal strategies for specific cases, they lack the flexibility for more general optimization problems. This paper presents a computational frame… ▽ More

    Submitted 5 August, 2025; originally announced August 2025.

    Comments: 12 pages, 3 figures, and 2 tables

  8. arXiv:2505.11880  [pdf, ps, other

    cs.AR cs.CR

    AES-RV: Hardware-Efficient RISC-V Accelerator with Low-Latency AES Instruction Extension for IoT Security

    Authors: Van Tinh Nguyen, Phuc Hung Pham, Vu Trung Duong Le, Hoai Luan Pham, Tuan Hai Vu, Thi Diem Tran

    Abstract: The Advanced Encryption Standard (AES) is a widely adopted cryptographic algorithm essential for securing embedded systems and IoT platforms. However, existing AES hardware accelerators often face limitations in performance, energy efficiency, and flexibility. This paper presents AES-RV, a hardware-efficient RISC-V accelerator featuring low-latency AES instruction extensions optimized for real-tim… ▽ More

    Submitted 17 May, 2025; originally announced May 2025.

    Comments: 6 pages, 5 figures. Submitted to IEICE Electronics Express

    ACM Class: C.3; B.6.3; E.3

  9. arXiv:2505.08101  [pdf, ps, other

    cs.CV cs.LG

    Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing

    Authors: Luu Tung Hai, Thinh D. Le, Zhicheng Ding, Qing Tian, Truong-Son Hy

    Abstract: Point cloud processing has gained significant attention due to its critical role in applications such as autonomous driving and 3D object recognition. However, deploying high-performance models like Point Transformer V3 in resource-constrained environments remains challenging due to their high computational and memory demands. This work introduces a novel distillation framework that leverages topo… ▽ More

    Submitted 12 May, 2025; originally announced May 2025.

  10. arXiv:2505.01984  [pdf, other

    cs.CV

    Lifelong Whole Slide Image Analysis: Online Vision-Language Adaptation and Past-to-Present Gradient Distillation

    Authors: Doanh C. Bui, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Duy Tran, Khang Nguyen, Yasuhiko Nakashima

    Abstract: Whole Slide Images (WSIs) play a crucial role in accurate cancer diagnosis and prognosis, as they provide tissue details at the cellular level. However, the rapid growth of computational tasks involving WSIs poses significant challenges. Given that WSIs are gigapixels in size, they present difficulties in terms of storage, processing, and model training. Therefore, it is essential to develop lifel… ▽ More

    Submitted 4 May, 2025; originally announced May 2025.

  11. arXiv:2504.15627  [pdf, ps, other

    cs.CV

    ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?

    Authors: Doanh C. Bui, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Duy Tran, Yasuhiko Nakashima

    Abstract: Lifelong learning for whole slide images (WSIs) poses the challenge of training a unified model to perform multiple WSI-related tasks, such as cancer subtyping and tumor classification, in a distributed, continual fashion. This is a practical and applicable problem in clinics and hospitals, as WSIs are large, require storage, processing, and transfer time. Training new models whenever new tasks ar… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: 10 pages, 3 figures, 1 table, conference submission

  12. arXiv:2501.04890  [pdf, other

    cs.SE

    Evaluating Developer-written Unit Test Case Reduction for Java -- A Replication Study

    Authors: Tuan D Le, Brandon Wilber, Arpit Christi

    Abstract: Abstract: Failing test case reduction can promote efficient debugging because a developer may not need to observe components that are not relevant to inducing failure. Failing test case reduction can also improve the efficiency of fault localization. These considerations have prompted researchers to study the reduction process, the reduction output, and the removed entities. Christi et al. studied… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: 5 pages and 4 figures

    MSC Class: 68N30 ACM Class: D.2.5

  13. arXiv:2412.04641  [pdf, other

    cs.LG cs.AI stat.ML

    Disentangled Representation Learning for Causal Inference with Instruments

    Authors: Debo Cheng, Jiuyong Li, Lin Liu, Ziqi Xu, Weijia Zhang, Jixue Liu, Thuc Duy Le

    Abstract: Latent confounders are a fundamental challenge for inferring causal effects from observational data. The instrumental variable (IV) approach is a practical way to address this challenge. Existing IV based estimators need a known IV or other strong assumptions, such as the existence of two or more IVs in the system, which limits the application of the IV approach. In this paper, we consider a relax… ▽ More

    Submitted 5 December, 2024; originally announced December 2024.

    Comments: 14 pages, 13 figures and 5 tables. Accepted by TNNLS

  14. arXiv:2411.17774  [pdf, other

    cs.LG cs.AI

    Leaning Time-Varying Instruments for Identifying Causal Effects in Time-Series Data

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Thuc duy Le, Xudong Guo, Shichao Zhang

    Abstract: Querying causal effects from time-series data is important across various fields, including healthcare, economics, climate science, and epidemiology. However, this task becomes complex in the existence of time-varying latent confounders, which affect both treatment and outcome variables over time and can introduce bias in causal effect estimation. Traditional instrumental variable (IV) methods are… ▽ More

    Submitted 26 November, 2024; originally announced November 2024.

    Comments: 14 pages

  15. arXiv:2411.04471  [pdf, other

    quant-ph cs.AR

    FQsun: A Configurable Wave Function-Based Quantum Emulator for Power-Efficient Quantum Simulations

    Authors: Tuan Hai Vu, Vu Trung Duong Le, Hoai Luan Pham, Quoc Chuong Nguyen, Yasuhiko Nakashima

    Abstract: Quantum computers are promising powerful computers for solving complex problems, but access to real quantum hardware remains limited due to high costs. Although the software simulators on CPUs/GPUs such as Qiskit, ProjectQ, and Qsun offer flexibility and support for many qubits, they struggle with high power consumption and limited processing speed, especially as qubit counts scale. Accordingly, q… ▽ More

    Submitted 18 March, 2025; v1 submitted 7 November, 2024; originally announced November 2024.

    Comments: 15 pages, 11 figures, 7 tables, submitted to the IEEE Access

  16. arXiv:2410.15648  [pdf, other

    cs.LG stat.ME

    Linking Model Intervention to Causal Interpretation in Model Explanation

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Kui Yu, Thuc Duy Le, Jixue Liu

    Abstract: Intervention intuition is often used in model explanation where the intervention effect of a feature on the outcome is quantified by the difference of a model prediction when the feature value is changed from the current value to the baseline value. Such a model intervention effect of a feature is inherently association. In this paper, we will study the conditions when an intuitive model intervent… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  17. arXiv:2410.11146  [pdf, other

    cs.AR

    Theoretical Analysis of the Efficient-Memory Matrix Storage Method for Quantum Emulation Accelerators with Gate Fusion on FPGAs

    Authors: Tran Xuan Hieu Le, Hoai Luan Pham, Tuan Hai Vu, Vu Trung Duong Le, Nakashima Yasuhiko

    Abstract: Quantum emulators play an important role in the development and testing of quantum algorithms, especially given the limitations of the current FTQC era. Developing high-speed, memory-optimized quantum emulators is a growing research trend, with gate fusion being a promising technique. However, existing gate fusion implementations often struggle to efficiently support large-scale quantum systems wi… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  18. arXiv:2409.19871  [pdf, other

    cs.LG cs.AI

    TSI: A Multi-View Representation Learning Approach for Time Series Forecasting

    Authors: Wentao Gao, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Thuc Duy Le, Debo Cheng, Yanchang Zhao, Yun Chen

    Abstract: As the growing demand for long sequence time-series forecasting in real-world applications, such as electricity consumption planning, the significance of time series forecasting becomes increasingly crucial across various domains. This is highlighted by recent advancements in representation learning within the field. This study introduces a novel multi-view approach for time series forecasting tha… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: AJCAI Oral Accepted

  19. arXiv:2409.05924  [pdf, other

    cs.SD eess.AS

    Continuous Learning of Transformer-based Audio Deepfake Detection

    Authors: Tuan Duy Nguyen Le, Kah Kuan Teh, Huy Dat Tran

    Abstract: This paper proposes a novel framework for audio deepfake detection with two main objectives: i) attaining the highest possible accuracy on available fake data, and ii) effectively performing continuous learning on new fake data in a few-shot learning manner. Specifically, we conduct a large audio deepfake collection using various deep audio generation methods. The data is further enhanced with add… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

    Comments: Submitted to INTERSPEECH 2024

  20. arXiv:2408.12063  [pdf, ps, other

    stat.ML cs.AI cs.LG physics.ao-ph

    Deconfounding Multi-Cause Latent Confounders: A Factor-Model Approach to Climate Model Bias Correction

    Authors: Wentao Gao, Jiuyong Li, Debo Cheng, Lin Liu, Jixue Liu, Thuc Duy Le, Xiaojing Du, Xiongren Chen, Yanchang Zhao, Yun Chen

    Abstract: Global Climate Models (GCMs) are crucial for predicting future climate changes by simulating the Earth systems. However, the GCM Outputs exhibit systematic biases due to model uncertainties, parameterization simplifications, and inadequate representation of complex climate phenomena. Traditional bias correction methods, which rely on historical observation data and statistical techniques, often ne… ▽ More

    Submitted 6 June, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: IJCAI 2025 Accepted

  21. arXiv:2407.17790  [pdf, other

    cs.LG cs.AR

    Exploring the Limitations of Kolmogorov-Arnold Networks in Classification: Insights to Software Training and Hardware Implementation

    Authors: Van Duy Tran, Tran Xuan Hieu Le, Thi Diem Tran, Hoai Luan Pham, Vu Trung Duong Le, Tuan Hai Vu, Van Tinh Nguyen, Yasuhiko Nakashima

    Abstract: Kolmogorov-Arnold Networks (KANs), a novel type of neural network, have recently gained popularity and attention due to the ability to substitute multi-layer perceptions (MLPs) in artificial intelligence (AI) with higher accuracy and interoperability. However, KAN assessment is still limited and cannot provide an in-depth analysis of a specific domain. Furthermore, no study has been conducted on t… ▽ More

    Submitted 25 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 3 figures, 2 tables

  22. arXiv:2404.00006  [pdf, ps, other

    cs.CC

    A Critique of Chen's "The 2-MAXSAT Problem Can Be Solved in Polynomial Time"

    Authors: Tran Duy Anh Le, Michael P. Reidy, Eliot J. Smith

    Abstract: In this paper, we examine Yangjun Chen's technical report titled ``The 2-MAXSAT Problem Can Be Solved in Polynomial Time'' [Che23], which revises and expands upon their conference paper of the same name [Che22]. Chen's paper purports to build a polynomial-time algorithm for the ${\rm NP}$-complete problem 2-MAXSAT by converting a 2-CNF formula into a graph that is then searched. We show through mu… ▽ More

    Submitted 21 February, 2024; originally announced April 2024.

  23. arXiv:2403.08947  [pdf, other

    eess.IV cs.CV

    Robust COVID-19 Detection in CT Images with CLIP

    Authors: Li Lin, Yamini Sri Krubha, Zhenhuan Yang, Cheng Ren, Thuc Duy Le, Irene Amerini, Xin Wang, Shu Hu

    Abstract: In the realm of medical imaging, particularly for COVID-19 detection, deep learning models face substantial challenges such as the necessity for extensive computational resources, the paucity of well-annotated datasets, and a significant amount of unlabeled data. In this work, we introduce the first lightweight detector designed to overcome these obstacles, leveraging a frozen CLIP image encoder a… ▽ More

    Submitted 8 September, 2024; v1 submitted 13 March, 2024; originally announced March 2024.

  24. arXiv:2312.10202  [pdf, other

    cs.CL

    Low-resource classification of mobility functioning information in clinical sentences using large language models

    Authors: Tuan Dung Le, Thanh Duong, Thanh Thieu

    Abstract: Objective: Function is increasingly recognized as an important indicator of whole-person health. This study evaluates the ability of publicly available large language models (LLMs) to accurately identify the presence of functioning information from clinical notes. We explore various strategies to improve the performance on this task. Materials and Methods: We collect a balanced binary classificati… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  25. arXiv:2312.07175  [pdf, other

    cs.LG cs.AI stat.ME

    Instrumental Variable Estimation for Causal Inference in Longitudinal Data with Time-Dependent Latent Confounders

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Wentao Gao, Thuc Duy Le

    Abstract: Causal inference from longitudinal observational data is a challenging problem due to the difficulty in correctly identifying the time-dependent confounders, especially in the presence of latent time-dependent confounders. Instrumental variable (IV) is a powerful tool for addressing the latent confounders issue, but the traditional IV technique cannot deal with latent time-dependent confounders in… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: 13 pages, 7 figures and 3 tables

  26. arXiv:2312.04395  [pdf, ps, other

    cs.CC

    On Czerwinski's "${\rm P} \neq {\rm NP}$ relative to a ${\rm P}$-complete oracle"

    Authors: Michael C. Chavrimootoo, Tran Duy Anh Le, Michael P. Reidy, Eliot J. Smith

    Abstract: In this paper, we take a closer look at Czerwinski's "${\rm P}\neq{\rm NP}$ relative to a ${\rm P}$-complete oracle" [Cze23]. There are (uncountably) infinitely-many relativized worlds where ${\rm P}$ and ${\rm NP}$ differ, and it is well-known that for any ${\rm P}$-complete problem $A$, ${\rm P}^A \neq {\rm NP}^A \iff {\rm P}\neq {\rm NP}$. The paper defines two sets ${\rm D}_{\rm P}$ and… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

  27. arXiv:2310.01865  [pdf, other

    cs.LG cs.AI

    Conditional Instrumental Variable Regression with Representation Learning for Causal Inference

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Thuc Duy Le

    Abstract: This paper studies the challenging problem of estimating causal effects from observational data, in the presence of unobserved confounders. The two-stage least square (TSLS) method and its variants with a standard instrumental variable (IV) are commonly used to eliminate confounding bias, including the bias caused by unobserved confounders, but they rely on the linearity assumption. Besides, the s… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 17pages, 3 figures and 6 tables

  28. arXiv:2307.01844  [pdf, other

    cs.CV

    Advancing Wound Filling Extraction on 3D Faces: Auto-Segmentation and Wound Face Regeneration Approach

    Authors: Duong Q. Nguyen, Thinh D. Le, Phuong D. Nguyen, Nga T. K. Le, H. Nguyen-Xuan

    Abstract: Facial wound segmentation plays a crucial role in preoperative planning and optimizing patient outcomes in various medical applications. In this paper, we propose an efficient approach for automating 3D facial wound segmentation using a two-stream graph convolutional network. Our method leverages the Cir3D-FaIR dataset and addresses the challenge of data imbalance through extensive experimentation… ▽ More

    Submitted 12 July, 2023; v1 submitted 4 July, 2023; originally announced July 2023.

  29. arXiv:2306.12453  [pdf, other

    cs.LG cs.AI stat.ME

    Learning Conditional Instrumental Variable Representation for Causal Effect Estimation

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Thuc Duy Le, Jixue Liu

    Abstract: One of the fundamental challenges in causal inference is to estimate the causal effect of a treatment on its outcome of interest from observational data. However, causal effect estimation often suffers from the impacts of confounding bias caused by unmeasured confounders that affect both the treatment and the outcome. The instrumental variable (IV) approach is a powerful way to eliminate the confo… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

    Comments: Debo Cheng and Ziqi Xu contributed equally. 20 pages, 5 tables, and 3 figures. Accepted at ECML-PKDD2023

  30. arXiv:2304.04566  [pdf, other

    cs.LG cs.AI stat.ME

    Linking a predictive model to causal effect estimation

    Authors: Jiuyong Li, Lin Liu, Ziqi Xu, Ha Xuan Tran, Thuc Duy Le, Jixue Liu

    Abstract: A predictive model makes outcome predictions based on some given features, i.e., it estimates the conditional probability of the outcome given a feature vector. In general, a predictive model cannot estimate the causal effect of a feature on the outcome, i.e., how the outcome will change if the feature is changed while keeping the values of other features unchanged. This is because causal effect e… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 16

  31. arXiv:2304.04060  [pdf, other

    cs.CV

    Application of Self-Supervised Learning to MICA Model for Reconstructing Imperfect 3D Facial Structures

    Authors: Phuong D. Nguyen, Thinh D. Le, Duong Q. Nguyen, Binh Nguyen, H. Nguyen-Xuan

    Abstract: In this study, we emphasize the integration of a pre-trained MICA model with an imperfect face dataset, employing a self-supervised learning approach. We present an innovative method for regenerating flawed facial structures, yielding 3D printable outputs that effectively support physicians in their patient treatment process. Our results highlight the model's capacity for concealing scars and achi… ▽ More

    Submitted 8 April, 2023; originally announced April 2023.

  32. arXiv:2303.14381  [pdf, other

    cs.CV

    3D Facial Imperfection Regeneration: Deep learning approach and 3D printing prototypes

    Authors: Phuong D. Nguyen, Thinh D. Le, Duong Q. Nguyen, Thanh Q. Nguyen, Li-Wei Chou, H. Nguyen-Xuan

    Abstract: This study explores the potential of a fully convolutional mesh autoencoder model for regenerating 3D nature faces with the presence of imperfect areas. We utilize deep learning approaches in graph processing and analysis to investigate the capabilities model in recreating a filling part for facial scars. Our approach in dataset creation is able to generate a facial scar rationally in a virtual sp… ▽ More

    Submitted 25 March, 2023; originally announced March 2023.

  33. arXiv:2211.16246  [pdf, other

    cs.LG cs.AI stat.ME

    Causal Inference with Conditional Instruments using Deep Generative Models

    Authors: Debo Cheng, Ziqi Xu, Jiuyong Li, Lin Liu, Jixue Liu, Thuc Duy Le

    Abstract: The instrumental variable (IV) approach is a widely used way to estimate the causal effects of a treatment on an outcome of interest from observational data with latent confounders. A standard IV is expected to be related to the treatment variable and independent of all other variables in the system. However, it is challenging to search for a standard IV from data directly due to the strict condit… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: 10 pages, 4 figures and 3 tables. Accepted by AAAI2023

  34. arXiv:2208.09590  [pdf, other

    cs.AI cs.LG

    Data-Driven Causal Effect Estimation Based on Graphical Causal Modelling: A Survey

    Authors: Debo Cheng, Jiuyong Li, Lin Liu, Jixue Liu, Thuc Duy Le

    Abstract: In many fields of scientific research and real-world applications, unbiased estimation of causal effects from non-experimental data is crucial for understanding the mechanism underlying the data and for decision-making on effective responses or interventions. A great deal of research has been conducted to address this challenging problem from different angles. For estimating causal effect in obser… ▽ More

    Submitted 3 December, 2023; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 35 pages, 10 figures and 2 table, Accepted by ACM Computing Surveys

  35. arXiv:2206.11529  [pdf, other

    cs.LG cs.AI stat.ME

    Explanatory causal effects for model agnostic explanations

    Authors: Jiuyong Li, Ha Xuan Tran, Thuc Duy Le, Lin Liu, Kui Yu, Jixue Liu

    Abstract: This paper studies the problem of estimating the contributions of features to the prediction of a specific instance by a machine learning model and the overall contribution of a feature to the model. The causal effect of a feature (variable) on the predicted outcome reflects the contribution of the feature to a prediction very well. A challenge is that most existing causal effects cannot be estima… ▽ More

    Submitted 23 June, 2022; originally announced June 2022.

    Comments: 17

  36. arXiv:2201.03810  [pdf, other

    cs.AI

    Ancestral Instrument Method for Causal Inference without Complete Knowledge

    Authors: Debo Cheng, Jiuyong Li, Lin Liu, Jiji Zhang, Thuc duy Le, Jixue Liu

    Abstract: Unobserved confounding is the main obstacle to causal effect estimation from observational data. Instrumental variables (IVs) are widely used for causal effect estimation when there exist latent confounders. With the standard IV method, when a given IV is valid, unbiased estimation can be obtained, but the validity requirement on a standard IV is strict and untestable. Conditional IVs have been pr… ▽ More

    Submitted 8 December, 2023; v1 submitted 11 January, 2022; originally announced January 2022.

    Comments: 11 pages, 5 figures and 2 tables

  37. arXiv:2011.06716  [pdf, other

    cs.LG cs.AI

    Dependency-based Anomaly Detection: a General Framework and Comprehensive Evaluation

    Authors: Sha Lu, Lin Liu, Kui Yu, Thuc Duy Le, Jixue Liu, Jiuyong Li

    Abstract: Anomaly detection is crucial for understanding unusual behaviors in data, as anomalies offer valuable insights. This paper introduces Dependency-based Anomaly Detection (DepAD), a general framework that utilizes variable dependencies to uncover meaningful anomalies with better interpretability. DepAD reframes unsupervised anomaly detection as supervised feature selection and prediction tasks, whic… ▽ More

    Submitted 17 April, 2024; v1 submitted 12 November, 2020; originally announced November 2020.

  38. arXiv:2008.08272  [pdf, other

    cs.PL cs.LG

    Compiling ONNX Neural Network Models Using MLIR

    Authors: Tian Jin, Gheorghe-Teodor Bercea, Tung D. Le, Tong Chen, Gong Su, Haruki Imai, Yasushi Negishi, Anh Leu, Kevin O'Brien, Kiyokuni Kawachiya, Alexandre E. Eichenberger

    Abstract: Deep neural network models are becoming increasingly popular and have been used in various tasks such as computer vision, speech recognition, and natural language processing. Machine learning models are commonly trained in a resource-rich environment and then deployed in a distinct environment such as high availability machines or edge devices. To assist the portability of models, the open-source… ▽ More

    Submitted 30 September, 2020; v1 submitted 19 August, 2020; originally announced August 2020.

    Comments: 8 pages

  39. arXiv:2007.00887  [pdf, other

    q-bio.GN cs.CE

    Computational methods for cancer driver discovery: A survey

    Authors: Vu Viet Hoang Pham, Lin Liu, Cameron Bracken, Gregory Goodall, Jiuyong Li, Thuc Duy Le

    Abstract: Motivation: Uncovering the genomic causes of cancer, known as cancer driver genes, is a fundamental task in biomedical research. Cancer driver genes drive the development and progression of cancer, thus identifying cancer driver genes and their regulatory mechanism is crucial to the design of cancer treatment and intervention. Many computational methods, which take the advantages of computer scien… ▽ More

    Submitted 2 July, 2020; originally announced July 2020.

    Comments: 13 pages, 6 figures

  40. A general framework for causal classification

    Authors: Jiuyong Li, Weijia Zhang, Lin Liu, Kui Yu, Thuc Duy Le, Jixue Liu

    Abstract: In many applications, there is a need to predict the effect of an intervention on different individuals from data. For example, which customers are persuadable by a product promotion? which patients should be treated with a certain type of treatment? These are typical causal questions involving the effect or the change in outcomes made by an intervention. The questions cannot be answered with trad… ▽ More

    Submitted 14 March, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: International Journal of Data Science and Analytics (2021). arXiv admin note: text overlap with arXiv:1604.07212 by other authors

  41. arXiv:2001.10269  [pdf, other

    cs.AI cs.LG stat.ML

    Causal query in observational data with hidden variables

    Authors: Debo Cheng, Jiuyong Li, Lin Liu, Jixue Liu, Kui Yu, Thuc Duy Le

    Abstract: This paper discusses the problem of causal query in observational data with hidden variables, with the aim of seeking the change of an outcome when "manipulating" a variable while given a set of plausible confounding variables which affect the manipulated variable and the outcome. Such an "experiment on data" to estimate the causal effect of the manipulated variable is useful for validating an exp… ▽ More

    Submitted 24 November, 2020; v1 submitted 28 January, 2020; originally announced January 2020.

    Comments: 8 pages and 7 figures. The paper has been accepted by ECAI2020. We have updated the proof of the Theorem 1 and removed Theorem 2 from the conference version

  42. arXiv:1906.06080  [pdf, other

    stat.ME cs.LG stat.ML

    Identify treatment effect patterns for personalised decisions

    Authors: Jiuyong Li, Lin Liu, Shisheng Zhang, Saisai Ma, Thuc Duy Le, Jixue Liu

    Abstract: In personalised decision making, evidence is required to determine whether an action (treatment) is suitable for an individual. Such evidence can be obtained by modelling treatment effect heterogeneity in subgroups. The existing interpretable modelling methods take a top-down approach to search for subgroups with heterogeneous treatment effects and they may miss the most specific and relevant cont… ▽ More

    Submitted 23 June, 2022; v1 submitted 14 June, 2019; originally announced June 2019.

    Comments: 17

    Journal ref: Applied Intelligence 2022

  43. arXiv:1812.07816  [pdf

    cs.LG cs.CV cs.PF stat.ML

    Fast and Accurate 3D Medical Image Segmentation with Data-swapping Method

    Authors: Haruki Imai, Samuel Matzek, Tung D. Le, Yasushi Negishi, Kiyokuni Kawachiya

    Abstract: Deep neural network models used for medical image segmentation are large because they are trained with high-resolution three-dimensional (3D) images. Graphics processing units (GPUs) are widely used to accelerate the trainings. However, the memory on a GPU is not large enough to train the models. A popular approach to tackling this problem is patch-based method, which divides a large image into sm… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: 13 pages

    ACM Class: C.4; I.2.6; I.2.10; I.4.6; I.4.9; J.4

  44. arXiv:1811.02994  [pdf, other

    cs.CY cs.LG

    An exploration of algorithmic discrimination in data and classification

    Authors: Jixue Liu, Jiuyong Li, Feiyue Ye, Lin Liu, Thuc Duy Le, Ping Xiong

    Abstract: Algorithmic discrimination is an important aspect when data is used for predictive purposes. This paper analyzes the relationships between discrimination and classification, data set partitioning, and decision models, as well as correlation. The paper uses real world data sets to demonstrate the existence of discrimination and the independence between the discrimination of data sets and the discri… ▽ More

    Submitted 6 November, 2018; originally announced November 2018.

    Comments: arXiv admin note: text overlap with arXiv:1811.01480

  45. arXiv:1811.01480  [pdf, other

    cs.AI cs.LG

    FairMod - Making Predictive Models Discrimination Aware

    Authors: Jixue Liu, Jiuyong Li, Lin Liu, Thuc Duy Le, Feiyue Ye, Gefei Li

    Abstract: Predictive models such as decision trees and neural networks may produce discrimination in their predictions. This paper proposes a method to post-process the predictions of a predictive model to make the processed predictions non-discriminatory. The method considers multiple protected variables together. Multiple protected variables make the problem more challenging than a simple protected variab… ▽ More

    Submitted 4 November, 2018; originally announced November 2018.

  46. arXiv:1808.06316  [pdf, other

    cs.AI stat.AP

    Discovering Context Specific Causal Relationships

    Authors: Saisai Ma, Jiuyong Li, Lin Liu, Thuc Duy Le

    Abstract: With the increasing need of personalised decision making, such as personalised medicine and online recommendations, a growing attention has been paid to the discovery of the context and heterogeneity of causal relationships. Most existing methods, however, assume a known cause (e.g. a new drug) and focus on identifying from data the contexts of heterogeneous effects of the cause (e.g. patient grou… ▽ More

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: This paper has been accepted by Intelligent Data Analysis

    Journal ref: Intelligent Data Analysis 23(4), 2019

  47. arXiv:1807.02037  [pdf, other

    cs.LG cs.AI stat.ML

    TFLMS: Large Model Support in TensorFlow by Graph Rewriting

    Authors: Tung D. Le, Haruki Imai, Yasushi Negishi, Kiyokuni Kawachiya

    Abstract: While accelerators such as GPUs have limited memory, deep neural networks are becoming larger and will not fit with the memory limitation of accelerators for training. We propose an approach to tackle this problem by rewriting the computational graph of a neural network, in which swap-out and swap-in operations are inserted to temporarily store intermediate results on CPU memory. In particular, we… ▽ More

    Submitted 2 October, 2019; v1 submitted 5 July, 2018; originally announced July 2018.

    Comments: A new version of TFLMS was published at ISMM 2019 (https://dl.acm.org/citation.cfm?id=3329984)

  48. arXiv:1510.03042  [pdf, other

    cs.AI stat.ML

    ParallelPC: an R package for efficient constraint based causal exploration

    Authors: Thuc Duy Le, Tao Hoang, Jiuyong Li, Lin Liu, Shu Hu

    Abstract: Discovering causal relationships from data is the ultimate goal of many research areas. Constraint based causal exploration algorithms, such as PC, FCI, RFCI, PC-simple, IDA and Joint-IDA have achieved significant progress and have many applications. A common problem with these methods is the high computational complexity, which hinders their applications in real world high dimensional datasets, e… ▽ More

    Submitted 11 October, 2015; originally announced October 2015.

  49. arXiv:1508.07092  [pdf, ps, other

    cs.AI

    Mining Combined Causes in Large Data Sets

    Authors: Saisai Ma, Jiuyong Li, Lin Liu, Thuc Duy Le

    Abstract: In recent years, many methods have been developed for detecting causal relationships in observational data. Some of them have the potential to tackle large data sets. However, these methods fail to discover a combined cause, i.e. a multi-factor cause consisting of two or more component variables which individually are not causes. A straightforward approach to uncovering a combined cause is to incl… ▽ More

    Submitted 15 October, 2015; v1 submitted 28 August, 2015; originally announced August 2015.

    Comments: This paper has been accepted by Knowledge-Based Systems

  50. From Observational Studies to Causal Rule Mining

    Authors: Jiuyong Li, Thuc Duy Le, Lin Liu, Jixue Liu, Zhou Jin, Bingyu Sun, Saisai Ma

    Abstract: Randomised controlled trials (RCTs) are the most effective approach to causal discovery, but in many circumstances it is impossible to conduct RCTs. Therefore observational studies based on passively observed data are widely accepted as an alternative to RCTs. However, in observational studies, prior knowledge is required to generate the hypotheses about the cause-effect relationships to be tested… ▽ More

    Submitted 16 August, 2015; originally announced August 2015.

    Comments: This paper has been accepted by ACM TIST journal and will be available soon

    Journal ref: ACM Trans. Intell. Syst. Technol. 7, 2, Article 14 (November 2015), 27 pages