Skip to main content

Showing 1–50 of 105 results for author: Pham, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.08562  [pdf, ps, other

    cs.CV

    Hierarchical Neural Collapse Detection Transformer for Class Incremental Object Detection

    Authors: Duc Thanh Pham, Hong Dang Nguyen, Nhat Minh Nguyen Quoc, Linh Ngo Van, Sang Dinh Viet, Duc Anh Nguyen

    Abstract: Recently, object detection models have witnessed notable performance improvements, particularly with transformer-based models. However, new objects frequently appear in the real world, requiring detection models to continually learn without suffering from catastrophic forgetting. Although Incremental Object Detection (IOD) has emerged to address this challenge, these existing models are still not… ▽ More

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2506.06363  [pdf

    physics.chem-ph cond-mat.mtrl-sci cs.AI cs.LG physics.comp-ph

    ChemGraph: An Agentic Framework for Computational Chemistry Workflows

    Authors: Thang D. Pham, Aditya Tanikanti, Murat Keçeli

    Abstract: Atomistic simulations are essential tools in chemistry and materials science, accelerating the discovery of novel catalysts, energy storage materials, and pharmaceuticals. However, running these simulations remains challenging due to the wide range of computational methods, diverse software ecosystems, and the need for expert knowledge and manual effort for the setup, execution, and validation sta… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

  3. Leveraging Novel Ensemble Learning Techniques and Landsat Multispectral Data for Estimating Olive Yields in Tunisia

    Authors: Mohamed Kefi, Tien Dat Pham, Thin Nguyen, Mark G. Tjoelker, Viola Devasirvatham, Kenichi Kashiwagi

    Abstract: Olive production is an important tree crop in Mediterranean climates. However, olive yield varies significantly due to climate change. Accurately estimating yield using remote sensing and machine learning remains a complex challenge. In this study, we developed a streamlined pipeline for olive yield estimation in the Kairouan and Sousse governorates of Tunisia. We extracted features from multispec… ▽ More

    Submitted 25 May, 2025; originally announced June 2025.

  4. arXiv:2505.18365  [pdf, ps, other

    eess.IV cs.CV

    Brightness-Invariant Tracking Estimation in Tagged MRI

    Authors: Zhangxing Bian, Shuwen Wei, Xiao Liang, Yuan-Chiao Lu, Samuel W. Remedios, Fangxu Xing, Jonghye Woo, Dzung L. Pham, Aaron Carass, Philip V. Bayly, Jiachen Zhuo, Ahmed Alshareef, Jerry L. Prince

    Abstract: Magnetic resonance (MR) tagging is an imaging technique for noninvasively tracking tissue motion in vivo by creating a visible pattern of magnetization saturation (tags) that deforms with the tissue. Due to longitudinal relaxation and progression to steady-state, the tags and tissue brightnesses change over time, which makes tracking with optical flow methods error-prone. Although Fourier methods… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: Accepted by IPMI 2025

  5. arXiv:2505.18128  [pdf, ps, other

    cs.CL

    Frankentext: Stitching random text fragments into long-form narratives

    Authors: Chau Minh Pham, Jenna Russell, Dzung Pham, Mohit Iyyer

    Abstract: We introduce Frankentexts, a new type of long-form narratives produced by LLMs under the extreme constraint that most tokens (e.g., 90%) must be copied verbatim from human writings. This task presents a challenging test of controllable generation, requiring models to satisfy a writing prompt, integrate disparate text fragments, and still produce a coherent narrative. To generate Frankentexts, we i… ▽ More

    Submitted 28 May, 2025; v1 submitted 23 May, 2025; originally announced May 2025.

  6. arXiv:2505.14549  [pdf, ps, other

    cs.CR cs.AI

    Can Large Language Models Really Recognize Your Name?

    Authors: Dzung Pham, Peter Kairouz, Niloofar Mireshghallah, Eugene Bagdasarian, Chau Minh Pham, Amir Houmansadr

    Abstract: Large language models (LLMs) are increasingly being used to protect sensitive user data. However, current LLM-based privacy solutions assume that these models can reliably detect personally identifiable information (PII), particularly named entities. In this paper, we challenge that assumption by revealing systematic failures in LLM-based privacy tasks. Specifically, we show that modern LLMs regul… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

  7. arXiv:2505.13784  [pdf, other

    cs.CV

    Transfer Learning from Visual Speech Recognition to Mouthing Recognition in German Sign Language

    Authors: Dinh Nam Pham, Eleftherios Avramidis

    Abstract: Sign Language Recognition (SLR) systems primarily focus on manual gestures, but non-manual features such as mouth movements, specifically mouthing, provide valuable linguistic information. This work directly classifies mouthing instances to their corresponding words in the spoken language while exploring the potential of transfer learning from Visual Speech Recognition (VSR) to mouthing recognitio… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: Accepted at 19th IEEE International Conference on Automatic Face and Gesture Recognition 2025

  8. Development and evaluation of a deep learning algorithm for German word recognition from lip movements

    Authors: Dinh Nam Pham, Torsten Rahne

    Abstract: When reading lips, many people benefit from additional visual information from the lip movements of the speaker, which is, however, very error prone. Algorithms for lip reading with artificial intelligence based on artificial neural networks significantly improve word recognition but are not available for the German language. A total of 1806 video clips with only one German-speaking person each we… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

    Comments: English version of journal article in HNO 2022

    Journal ref: HNO 70, 456-465 (2022)

  9. arXiv:2503.14240  [pdf, other

    cs.LG

    Persistent Homology-induced Graph Ensembles for Time Series Regressions

    Authors: Viet The Nguyen, Duy Anh Pham, An Thai Le, Jans Peter, Gunther Gust

    Abstract: The effectiveness of Spatio-temporal Graph Neural Networks (STGNNs) in time-series applications is often limited by their dependence on fixed, hand-crafted input graph structures. Motivated by insights from the Topological Data Analysis (TDA) paradigm, of which real-world data exhibits multi-scale patterns, we construct several graphs using Persistent Homology Filtration -- a mathematical framewor… ▽ More

    Submitted 19 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  10. arXiv:2503.11787  [pdf, ps, other

    cs.CV eess.IV

    ECLARE: Efficient cross-planar learning for anisotropic resolution enhancement

    Authors: Samuel W. Remedios, Shuwen Wei, Shuo Han, Jinwei Zhang, Aaron Carass, Kurt G. Schilling, Dzung L. Pham, Jerry L. Prince, Blake E. Dewey

    Abstract: In clinical imaging, magnetic resonance (MR) image volumes are often acquired as stacks of 2D slices with decreased scan times, improved signal-to-noise ratio, and image contrasts unique to 2D MR pulse sequences. While this is sufficient for clinical evaluation, automated algorithms designed for 3D analysis perform poorly on multi-slice 2D MR volumes, especially those with thick slices and gaps be… ▽ More

    Submitted 21 May, 2025; v1 submitted 14 March, 2025; originally announced March 2025.

  11. arXiv:2503.05725  [pdf

    cs.CY cs.AI

    A new framework for prognostics in decentralized industries: Enhancing fairness, security, and transparency through Blockchain and Federated Learning

    Authors: T. Q. D. Pham, K. D. Tran, Khanh T. P. Nguyen, X. V. Tran, L. Köehl, K. P. Tran

    Abstract: As global industries transition towards Industry 5.0 predictive maintenance PM remains crucial for cost effective operations resilience and minimizing downtime in increasingly smart manufacturing environments In this chapter we explore how the integration of Federated Learning FL and blockchain BC technologies enhances the prediction of machinerys Remaining Useful Life RUL within decentralized and… ▽ More

    Submitted 8 April, 2025; v1 submitted 17 February, 2025; originally announced March 2025.

  12. arXiv:2502.07944  [pdf, other

    cs.AI

    SHACL-SKOS Based Knowledge Representation of Material Safety Data Sheet (SDS) for the Pharmaceutical Industry

    Authors: Brian Lu, Dennis Pham, Ti-Chiun Chang, Michael Lovette, Terri Bui, Stephen Ma

    Abstract: We report the development of a knowledge representation and reasoning (KRR) system built on hybrid SHACL-SKOS ontologies for globally harmonized system (GHS) material Safety Data Sheets (SDS) to enhance chemical safety communication and regulatory compliance. SDS are comprehensive documents containing safety and handling information for chemical substances. Thus, they are an essential part of work… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: 8 pages, 10 figures, IEEE ICSC

    ACM Class: I.2.4

  13. arXiv:2502.06773  [pdf, other

    cs.AI cs.CL cs.LG

    On the Emergence of Thinking in LLMs I: Searching for the Right Intuition

    Authors: Guanghao Ye, Khiem Duc Pham, Xinzhi Zhang, Sivakanth Gopi, Baolin Peng, Beibin Li, Janardhan Kulkarni, Huseyin A. Inan

    Abstract: Recent AI advancements, such as OpenAI's new models, are transforming LLMs into LRMs (Large Reasoning Models) that perform reasoning during inference, taking extra time and compute for higher-quality outputs. We aim to uncover the algorithmic framework for training LRMs. Methods like self-consistency, PRM, and AlphaZero suggest reasoning as guided search. We ask: what is the simplest, most scalabl… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Abstract shortened for arXiv

  14. arXiv:2502.04835  [pdf, other

    cs.SE

    How Do Developers Use Code Suggestions in Pull Request Reviews?

    Authors: Abir Bouraffa, Yen Dieu Pham, Walid Maalej

    Abstract: GitHub introduced the suggestion feature to enable reviewers to explicitly suggest code modifications in pull requests. These suggestions make the reviewers' feedback more actionable for the submitters and represent a valuable knowledge for newcomers. Still, little is known about how code review suggestions are used by developers, what impact they have on pull requests, and how they are influenced… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: Accepted for publication in proceedings of the 18th International Conference on Cooperative and Human Aspects of Software Engineering (CHASE 2025)

  15. arXiv:2501.14249  [pdf, other

    cs.LG cs.AI cs.CL

    Humanity's Last Exam

    Authors: Long Phan, Alice Gatti, Ziwen Han, Nathaniel Li, Josephina Hu, Hugh Zhang, Chen Bo Calvin Zhang, Mohamed Shaaban, John Ling, Sean Shi, Michael Choi, Anish Agrawal, Arnav Chopra, Adam Khoja, Ryan Kim, Richard Ren, Jason Hausenloy, Oliver Zhang, Mantas Mazeika, Dmitry Dodonov, Tung Nguyen, Jaeho Lee, Daron Anderson, Mikhail Doroshenko, Alun Cennyth Stokes , et al. (1084 additional authors not shown)

    Abstract: Benchmarks are important tools for tracking the rapid advancements in large language model (LLM) capabilities. However, benchmarks are not keeping pace in difficulty: LLMs now achieve over 90\% accuracy on popular benchmarks like MMLU, limiting informed measurement of state-of-the-art LLM capabilities. In response, we introduce Humanity's Last Exam (HLE), a multi-modal benchmark at the frontier of… ▽ More

    Submitted 19 April, 2025; v1 submitted 24 January, 2025; originally announced January 2025.

    Comments: 29 pages, 6 figures

  16. arXiv:2501.14000  [pdf, other

    cs.LG cs.AI

    Local Control Networks (LCNs): Optimizing Flexibility in Neural Network Data Pattern Capture

    Authors: Hy Nguyen, Duy Khoa Pham, Srikanth Thudumu, Hung Du, Rajesh Vasa, Kon Mouzakis

    Abstract: The widespread use of Multi-layer perceptrons (MLPs) often relies on a fixed activation function (e.g., ReLU, Sigmoid, Tanh) for all nodes within the hidden layers. While effective in many scenarios, this uniformity may limit the networks ability to capture complex data patterns. We argue that employing the same activation function at every node is suboptimal and propose leveraging different activ… ▽ More

    Submitted 25 April, 2025; v1 submitted 23 January, 2025; originally announced January 2025.

  17. arXiv:2501.08335  [pdf, ps, other

    cs.CL cs.AI

    MERaLiON-TextLLM: Cross-Lingual Understanding of Large Language Models in Chinese, Indonesian, Malay, and Singlish

    Authors: Xin Huang, Tarun Kumar Vangani, Minh Duc Pham, Xunlong Zou, Bin Wang, Zhengyuan Liu, Ai Ti Aw

    Abstract: Multilingual large language models (MLLMs) have shown impressive capabilities across a variety of languages. However, efficacy can differ greatly between different language families, especially for those with limited linguistic resources. This report presents MERaLiON-TextLLM, a series of open-source language models specifically tailored to improve understanding and generation in Chinese, Indonesi… ▽ More

    Submitted 21 January, 2025; v1 submitted 21 December, 2024; originally announced January 2025.

  18. arXiv:2501.07102  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR

    Authors: The Chuong Chu, Vu Tuan Dat Pham, Kien Dao, Hoang Nguyen, Quoc Hung Truong

    Abstract: Intra-sentential code-switching (CS) refers to the alternation between languages that happens within a single utterance and is a significant challenge for Automatic Speech Recognition (ASR) systems. For example, when a Vietnamese speaker uses foreign proper names or specialized terms within their speech. ASR systems often struggle to accurately transcribe intra-sentential CS due to their training… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: Accepted at ICASSP 2025

  19. arXiv:2411.18229  [pdf, other

    cs.CV

    SharpDepth: Sharpening Metric Depth Predictions Using Diffusion Distillation

    Authors: Duc-Hai Pham, Tung Do, Phong Nguyen, Binh-Son Hua, Khoi Nguyen, Rang Nguyen

    Abstract: We propose SharpDepth, a novel approach to monocular metric depth estimation that combines the metric accuracy of discriminative depth estimation methods (e.g., Metric3D, UniDepth) with the fine-grained boundary sharpness typically achieved by generative methods (e.g., Marigold, Lotus). Traditional discriminative models trained on real-world data with sparse ground-truth depth can accurately predi… ▽ More

    Submitted 27 November, 2024; originally announced November 2024.

    Comments: Uncompressed version can be found in https://drive.google.com/file/d/1MG4-d_xDERVBCRfLDolNLnMLLuqd7qRz

  20. arXiv:2410.04213  [pdf, ps, other

    cs.LG

    Equivariant Polynomial Functional Networks

    Authors: Thieu N. Vo, Viet-Hoang Tran, Tho Tran Huu, An Nguyen The, Thanh Tran, Minh-Khoi Nguyen-Nhat, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key design principle of NFNs is their adherence to the permutation and scaling symmetries inherent in the connectionist structure of the input neural networks. Recent NF… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  21. arXiv:2410.04209  [pdf, other

    cs.LG

    Equivariant Neural Functional Networks for Transformers

    Authors: Viet-Hoang Tran, Thieu N. Vo, An Nguyen The, Tho Tran Huu, Minh-Khoi Nguyen-Nhat, Thanh Tran, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and have proven valuable for tasks such as learnable optimizers, implicit data representations, and weight editing. While NFN have been extensively developed for ML… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted in ICLR 2025

  22. arXiv:2410.03292  [pdf, other

    cs.LG

    Demystifying the Token Dynamics of Deep Selective State Space Models

    Authors: Thieu N Vo, Tung D. Pham, Xin T. Tong, Tan Minh Nguyen

    Abstract: Selective state space models (SSM), such as Mamba, have gained prominence for their effectiveness in modeling sequential data. Despite their outstanding empirical performance, a comprehensive theoretical understanding of deep selective SSM remains elusive, hindering their further development and adoption for applications that need high fidelity. In this paper, we investigate the dynamical properti… ▽ More

    Submitted 7 March, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

    Comments: Accepted at ICLR 2025 (spotlight)

  23. arXiv:2409.19749  [pdf, other

    cs.CL

    NeuroMax: Enhancing Neural Topic Modeling via Maximizing Mutual Information and Group Topic Regularization

    Authors: Duy-Tung Pham, Thien Trang Nguyen Vu, Tung Nguyen, Linh Ngo Van, Duc Anh Nguyen, Thien Huu Nguyen

    Abstract: Recent advances in neural topic models have concentrated on two primary directions: the integration of the inference network (encoder) with a pre-trained language model (PLM) and the modeling of the relationship between words and topics in the generative model (decoder). However, the use of large PLMs significantly increases inference costs, making them less practical for situations requiring low… ▽ More

    Submitted 29 September, 2024; originally announced September 2024.

    Comments: Findings of EMNLP 2024

  24. arXiv:2408.13808  [pdf, ps, other

    cs.CL

    Towards Reliable Medical Question Answering: Techniques and Challenges in Mitigating Hallucinations in Language Models

    Authors: Duy Khoa Pham, Bao Quoc Vo

    Abstract: The rapid advancement of large language models (LLMs) has significantly impacted various domains, including healthcare and biomedicine. However, the phenomenon of hallucination, where LLMs generate outputs that deviate from factual accuracy or context, poses a critical challenge, especially in high-stakes domains. This paper conducts a scoping study of existing techniques for mitigating hallucinat… ▽ More

    Submitted 25 August, 2024; originally announced August 2024.

    Comments: 9 pages

  25. arXiv:2408.12480  [pdf, other

    cs.LG cs.CL

    Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese

    Authors: Khang T. Doan, Bao G. Huynh, Dung T. Hoang, Thuc D. Pham, Nhat H. Pham, Quan T. M. Nguyen, Bang Q. Vo, Suong N. Hoang

    Abstract: In this report, we introduce Vintern-1B, a reliable 1-billion-parameters multimodal large language model (MLLM) for Vietnamese language tasks. By integrating the Qwen2-0.5B-Instruct language model with the InternViT-300M-448px visual model, Vintern-1B is optimized for a range of applications, including optical character recognition (OCR), document extraction, and general question-answering in Viet… ▽ More

    Submitted 23 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

  26. arXiv:2408.11559  [pdf, other

    cs.CV

    Semi-supervised 3D Semantic Scene Completion with 2D Vision Foundation Model Guidance

    Authors: Duc-Hai Pham, Duc-Dung Nguyen, Anh Pham, Tuan Ho, Phong Nguyen, Khoi Nguyen, Rang Nguyen

    Abstract: Accurate prediction of 3D semantic occupancy from 2D visual images is vital in enabling autonomous agents to comprehend their surroundings for planning and navigation. State-of-the-art methods typically employ fully supervised approaches, necessitating a huge labeled dataset acquired through expensive LiDAR sensors and meticulous voxel-wise labeling by human annotators. The resource-intensive natu… ▽ More

    Submitted 9 January, 2025; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted at AAAI2025. Project Page: https://vinairesearch.github.io/SemiSSC

  27. arXiv:2407.08792  [pdf, ps, other

    cs.CR

    ProxyGPT: Enabling User Anonymity in LLM Chatbots via (Un)Trustworthy Volunteer Proxies

    Authors: Dzung Pham, Jade Sheffey, Chau Minh Pham, Amir Houmansadr

    Abstract: Popular large language model (LLM) chatbots such as ChatGPT and Claude require users to create an account with an email or a phone number before allowing full access to their services. This practice ties users' personally identifiable information (PII) to their sensitive conversational data, thus posing significant privacy risks. Unfortunately, existing private LLM solutions based on cryptography… ▽ More

    Submitted 11 June, 2025; v1 submitted 11 July, 2024; originally announced July 2024.

  28. arXiv:2406.07680  [pdf, other

    cs.CV

    Watching Swarm Dynamics from Above: A Framework for Advanced Object Tracking in Drone Videos

    Authors: Duc Pham, Matthew Hansen, Félicie Dhellemmes, Jens Krause, Pia Bideau

    Abstract: Easily accessible sensors, like drones with diverse onboard sensors, have greatly expanded studying animal behavior in natural environments. Yet, analyzing vast, unlabeled video data, often spanning hours, remains a challenge for machine learning, especially in computer vision. Existing approaches often analyze only a few frames. Our focus is on long-term animal behavior analysis. To address this… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPRW: Workshop paper appearing in CV4Animals

  29. arXiv:2404.12076  [pdf, other

    cs.AI cs.NE

    Evolutionary Multi-Objective Optimisation for Fairness-Aware Self Adjusting Memory Classifiers in Data Streams

    Authors: Pivithuru Thejan Amarasinghe, Diem Pham, Binh Tran, Su Nguyen, Yuan Sun, Damminda Alahakoon

    Abstract: This paper introduces a novel approach, evolutionary multi-objective optimisation for fairness-aware self-adjusting memory classifiers, designed to enhance fairness in machine learning algorithms applied to data stream classification. With the growing concern over discrimination in algorithmic decision-making, particularly in dynamic data stream environments, there is a need for methods that ensur… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: This paper has been accepted by GECCO 2024

  30. arXiv:2403.15975  [pdf, other

    cs.NI

    Prioritized Multi-Tenant Traffic Engineering for Dynamic QoS Provisioning in Autonomous SDN-OpenFlow Edge Networks

    Authors: Mohammad Sajid Shahriar, Faisal Ahmed, Genshe Chen, Khanh D. Pham, Suresh Subramaniam, Motoharu Matsuura, Hiroshi Hasegawa, Shih-Chun Lin

    Abstract: This letter indicates the critical need for prioritized multi-tenant quality-of-service (QoS) management by emerging mobile edge systems, particularly for high-throughput beyond fifth-generation networks. Existing traffic engineering tools utilize complex functions baked into closed, proprietary infrastructures, largely limiting design flexibility, scalability, and adaptiveness. Hence, this study… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  31. arXiv:2401.17571  [pdf, other

    eess.IV cs.CV

    Is Registering Raw Tagged-MR Enough for Strain Estimation in the Era of Deep Learning?

    Authors: Zhangxing Bian, Ahmed Alshareef, Shuwen Wei, Junyu Chen, Yuli Wang, Jonghye Woo, Dzung L. Pham, Jiachen Zhuo, Aaron Carass, Jerry L. Prince

    Abstract: Magnetic Resonance Imaging with tagging (tMRI) has long been utilized for quantifying tissue motion and strain during deformation. However, a phenomenon known as tag fading, a gradual decrease in tag visibility over time, often complicates post-processing. The first contribution of this study is to model tag fading by considering the interplay between $T_1$ relaxation and the repeated application… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted to SPIE Medical Imaging 2024 (oral)

  32. Improving Graph Convolutional Networks with Transformer Layer in social-based items recommendation

    Authors: Thi Linh Hoang, Tuan Dung Pham, Viet Cuong Ta

    Abstract: In this work, we have proposed an approach for improving the GCN for predicting ratings in social networks. Our model is expanded from the standard model with several layers of transformer architecture. The main focus of the paper is on the encoder architecture for node embedding in the network. Using the embedding layer from the graph-based convolution layer, the attention mechanism could rearran… ▽ More

    Submitted 12 January, 2024; originally announced January 2024.

  33. arXiv:2312.15751  [pdf, other

    cs.CL

    Solving Label Variation in Scientific Information Extraction via Multi-Task Learning

    Authors: Dong Pham, Xanh Ho, Quang-Thuy Ha, Akiko Aizawa

    Abstract: Scientific Information Extraction (ScientificIE) is a critical task that involves the identification of scientific entities and their relationships. The complexity of this task is compounded by the necessity for domain-specific knowledge and the limited availability of annotated data. Two of the most popular datasets for ScientificIE are SemEval-2018 Task-7 and SciERC. They have overlapping sample… ▽ More

    Submitted 25 December, 2023; originally announced December 2023.

    Comments: 14 pages, 7 figures, PACLIC 37

  34. arXiv:2312.01460  [pdf, other

    eess.IV cs.CV

    Towards an accurate and generalizable multiple sclerosis lesion segmentation model using self-ensembled lesion fusion

    Authors: Jinwei Zhang, Lianrui Zuo, Blake E. Dewey, Samuel W. Remedios, Dzung L. Pham, Aaron Carass, Jerry L. Prince

    Abstract: Automatic multiple sclerosis (MS) lesion segmentation using multi-contrast magnetic resonance (MR) images provides improved efficiency and reproducibility compared to manual delineation. Current state-of-the-art automatic MS lesion segmentation methods utilize modified U-Net-like architectures. However, in the literature, dedicated architecture modifications were always required to maximize their… ▽ More

    Submitted 3 December, 2023; originally announced December 2023.

  35. arXiv:2311.11001  [pdf, other

    cs.CL

    Gendec: A Machine Learning-based Framework for Gender Detection from Japanese Names

    Authors: Duong Tien Pham, Luan Thanh Nguyen

    Abstract: Every human has their own name, a fundamental aspect of their identity and cultural heritage. The name often conveys a wealth of information, including details about an individual's background, ethnicity, and, especially, their gender. By detecting gender through the analysis of names, researchers can unlock valuable insights into linguistic patterns and cultural norms, which can be applied to pra… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

    Comments: This paper is accepted for presentation at ISDA'23

  36. arXiv:2310.19163  [pdf, other

    cs.CR cs.LG

    RAIFLE: Reconstruction Attacks on Interaction-based Federated Learning with Adversarial Data Manipulation

    Authors: Dzung Pham, Shreyas Kulkarni, Amir Houmansadr

    Abstract: Federated learning has emerged as a promising privacy-preserving solution for machine learning domains that rely on user interactions, particularly recommender systems and online learning to rank. While there has been substantial research on the privacy of traditional federated learning, little attention has been paid to the privacy properties of these interaction-based settings. In this work, we… ▽ More

    Submitted 1 March, 2025; v1 submitted 29 October, 2023; originally announced October 2023.

    Comments: Published in NDSS 2025

  37. arXiv:2310.14434  [pdf, other

    cs.CR

    Enhancing Accuracy-Privacy Trade-off in Differentially Private Split Learning

    Authors: Ngoc Duy Pham, Khoa Tran Phan, Naveen Chilamkurti

    Abstract: Split learning (SL) aims to protect user data privacy by distributing deep models between client-server and keeping private data locally. Only processed or `smashed' data can be transmitted from the clients to the server during the SL process. However, recently proposed model inversion attacks can recover the original data from the smashed data. In order to enhance privacy protection against such… ▽ More

    Submitted 15 October, 2024; v1 submitted 22 October, 2023; originally announced October 2023.

  38. arXiv:2305.18705  [pdf, other

    cs.DS

    Algorithmic Foundations of Inexact Computing

    Authors: John Augustine, Dror Fried, Krishna V. Palem, Duc-Hung Pham, Anshumali Shrivastava

    Abstract: Inexact computing also referred to as approximate computing is a style of designing algorithms and computing systems wherein the accuracy of correctness of algorithms executing on them is deliberately traded for significant resource savings. Significant progress has been reported in this regard both in terms of hardware as well as software or custom algorithms that exploited this approach resultin… ▽ More

    Submitted 29 May, 2023; originally announced May 2023.

  39. arXiv:2304.03610  [pdf, other

    cs.CV cs.AI

    Look how they have grown: Non-destructive Leaf Detection and Size Estimation of Tomato Plants for 3D Growth Monitoring

    Authors: Yuning Xing, Dexter Pham, Henry Williams, David Smith, Ho Seok Ahn, JongYoon Lim, Bruce A. MacDonald, Mahla Nejati

    Abstract: Smart farming is a growing field as technology advances. Plant characteristics are crucial indicators for monitoring plant growth. Research has been done to estimate characteristics like leaf area index, leaf disease, and plant height. However, few methods have been applied to non-destructive measurements of leaf size. In this paper, an automated non-destructive imaged-based measuring system is pr… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 10 Pages, 10 Figures

    Journal ref: Proceedings of the Australasian conference on robotics and automation (ACRA 2022)

  40. Tailoring Requirements Engineering for Responsible AI

    Authors: Walid Maalej, Yen Dieu Pham, Larissa Chazette

    Abstract: Requirements Engineering (RE) is the discipline for identifying, analyzing, as well as ensuring the implementation and delivery of user, technical, and societal requirements. Recently reported issues concerning the acceptance of Artificial Intelligence (AI) solutions after deployment, e.g. in the medical, automotive, or scientific domains, stress the importance of RE for designing and delivering R… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: To appear in IEEE Computer, Special Issue on Software Engineering for Responsible AI

  41. arXiv:2302.09184  [pdf

    cond-mat.mtrl-sci cs.LG

    Rapid Design of Top-Performing Metal-Organic Frameworks with Qualitative Representations of Building Blocks

    Authors: Yigitcan Comlek, Thang Duc Pham, Randall Snurr, Wei Chen

    Abstract: Data-driven materials design often encounters challenges where systems require or possess qualitative (categorical) information. Metal-organic frameworks (MOFs) are an example of such material systems. The representation of MOFs through different building blocks makes it a challenge for designers to incorporate qualitative information into design optimization. Furthermore, the large number of pote… ▽ More

    Submitted 17 February, 2023; originally announced February 2023.

    Comments: 35 pages total. First 29 pages belong to the main manuscript and the remaining 6 six are for the supplementary information, 13 figures total. 9 figures are on the main manuscript and 4 figures are in the supplementary information. 1 table in the supplementary information

  42. arXiv:2212.01761  [pdf

    physics.soc-ph cs.CV

    A PM2.5 concentration prediction framework with vehicle tracking system: From cause to effect

    Authors: Chuong D. Le, Hoang V. Pham, Duy A. Pham, An D. Le, Hien B. Vo

    Abstract: Air pollution is an emerging problem that needs to be solved especially in developed and developing countries. In Vietnam, air pollution is also a concerning issue in big cities such as Hanoi and Ho Chi Minh cities where air pollution comes mostly from vehicles such as cars and motorbikes. In order to tackle the problem, the paper focuses on developing a solution that can estimate the emitted PM2.… ▽ More

    Submitted 4 December, 2022; originally announced December 2022.

  43. arXiv:2212.00250  [pdf, other

    cs.CR cs.DC

    Split Learning without Local Weight Sharing to Enhance Client-side Data Privacy

    Authors: Ngoc Duy Pham, Tran Khoa Phan, Alsharif Abuadbba, Yansong Gao, Doan Nguyen, Naveen Chilamkurti

    Abstract: Split learning (SL) aims to protect user data privacy by distributing deep models between client-server and keeping private data locally. In SL training with multiple clients, the local model weights are shared among the clients for local model update. This paper first reveals data privacy leakage exacerbated from local weight sharing among the clients in SL through model inversion attacks. Then,… ▽ More

    Submitted 21 July, 2024; v1 submitted 30 November, 2022; originally announced December 2022.

  44. arXiv:2211.16366  [pdf, other

    cs.IR cs.LG

    Reusable Self-Attention-based Recommender System for Fashion

    Authors: Marjan Celikik, Jacek Wasilewski, Sahar Mbarek, Pablo Celayes, Pierre Gagliardi, Duy Pham, Nour Karessli, Ana Peleteiro Ramallo

    Abstract: A large number of empirical studies on applying self-attention models in the domain of recommender systems are based on offline evaluation and metrics computed on standardized datasets, without insights on how these models perform in real life scenarios. Moreover, many of them do not consider information such as item and customer metadata, although deep-learning recommenders live up to their full… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: FashionXRecSys'22: Workshop on Recommender Systems in Fashion, September 23, 2022, Seattle, WA. Parts published in RecSys 2022 (industry track)

    Journal ref: FashionXRecSys'22: Workshop on Recommender Systems in Fashion, September 23, 2022, Seattle, WA. Parts published in RecSys 2022 (industry track)

  45. arXiv:2211.16353  [pdf, other

    cs.IR cs.LG

    Outfit Generation and Recommendation -- An Experimental Study

    Authors: Marjan Celikik, Matthias Kirmse, Timo Denk, Pierre Gagliardi, Sahar Mbarek, Duy Pham, Ana Peleteiro Ramallo

    Abstract: Over the past years, fashion-related challenges have gained a lot of attention in the research community. Outfit generation and recommendation, i.e., the composition of a set of items of different types (e.g., tops, bottom, shoes, accessories) that go well together, are among the most challenging ones. That is because items have to be both compatible amongst each other and also personalized to mat… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: fashionXrecsys '20: Workshop on Recommender Systems in Fashion, 14th ACM Conference on Recommender Systems, September 22--26, 2020, Virtual Event, Brazil

    Journal ref: fashionXrecsys '20: Workshop on Recommender Systems in Fashion, 14th ACM Conference on Recommender Systems, September 22--26, 2020, Virtual Event, Brazil

  46. arXiv:2211.14493  [pdf, other

    cs.LG cs.AI

    Multi-fidelity Gaussian Process for Biomanufacturing Process Modeling with Small Data

    Authors: Yuan Sun, Winton Nathan-Roberts, Tien Dung Pham, Ellen Otte, Uwe Aickelin

    Abstract: In biomanufacturing, developing an accurate model to simulate the complex dynamics of bioprocesses is an important yet challenging task. This is partially due to the uncertainty associated with bioprocesses, high data acquisition cost, and lack of data availability to learn complex relations in bioprocesses. To deal with these challenges, we propose to use a statistical machine learning approach,… ▽ More

    Submitted 26 November, 2022; originally announced November 2022.

  47. arXiv:2211.09559  [pdf, other

    eess.IV cs.CV

    Interpretable HER2 scoring by evaluating clinical Guidelines through a weakly supervised, constrained Deep Learning Approach

    Authors: Manh Dan Pham, Cyprien Tilmant, Stéphanie Petit, Isabelle Salmon, Saima Ben Hadj, Rutger H. J. Fick

    Abstract: The evaluation of the Human Epidermal growth factor Receptor-2 (HER2) expression is an important prognostic biomarker for breast cancer treatment selection. However, HER2 scoring has notoriously high interobserver variability due to stain variations between centers and the need to estimate visually the staining intensity in specific percentages of tumor area. In this paper, focusing on the interpr… ▽ More

    Submitted 17 November, 2022; originally announced November 2022.

    Comments: Submitted to Elsevier

  48. arXiv:2209.02611  [pdf, other

    eess.IV cs.CV

    Deep filter bank regression for super-resolution of anisotropic MR brain images

    Authors: Samuel W. Remedios, Shuo Han, Yuan Xue, Aaron Carass, Trac D. Tran, Dzung L. Pham, Jerry L. Prince

    Abstract: In 2D multi-slice magnetic resonance (MR) acquisition, the through-plane signals are typically of lower resolution than the in-plane signals. While contemporary super-resolution (SR) methods aim to recover the underlying high-resolution volume, the estimated high-frequency information is implicit via end-to-end data-driven training rather than being explicitly stated and sought. To address this, w… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  49. Using Chatbots to Teach Languages

    Authors: Yu Li, Chun-Yen Chen, Dian Yu, Sam Davidson, Ryan Hou, Xun Yuan, Yinghua Tan, Derek Pham, Zhou Yu

    Abstract: This paper reports on progress towards building an online language learning tool to provide learners with conversational experience by using dialog systems as conversation practice partners. Our system can adapt to users' language proficiency on the fly. We also provide automatic grammar error feedback to help users learn from their mistakes. According to our first adopters, our system is entertai… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: Accepted to Learning @ Scale 2022

  50. arXiv:2207.11821  [pdf, other

    cs.NI

    Maximizing Entanglement Routing Rate in Quantum Networks: Approximation Algorithms

    Authors: Tu N. Nguyen, Dung H. P. Nguyen, Dang H. Pham, Bing-Hong Liu, Hoa N. Nguyen

    Abstract: There will be a fast-paced shift from conventional network systems to novel quantum networks that are supported by the quantum entanglement and teleportation, key technologies of the quantum era, to enable secured data transmissions in the next-generation of the Internet. Despite this prospect, migration to quantum networks cannot be done at once, especially on the aspect of quantum routing. In th… ▽ More

    Submitted 18 July, 2022; originally announced July 2022.

    Comments: 12 pages