Showing 1–17 of 17 results for author: Vuong, L

Searching in archive cs.
  1. arXiv:2505.04192  [pdf, other]

    cs.CV cs.AI cs.CL

    VideoPath-LLaVA: Pathology Diagnostic Reasoning Through Video Instruction Tuning

    Authors: Trinh T. L. Vuong, Jin Tae Kwak

    Abstract: We present VideoPath-LLaVA, the first large multimodal model (LMM) in computational pathology that integrates three distinct image scenarios, single patch images, automatically keyframe-extracted clips, and manually segmented video pathology images, to mimic the natural diagnostic process of pathologists. By generating detailed histological descriptions and culminating in a definitive sign-out dia…

    Submitted 7 May, 2025; originally announced May 2025.

  2. arXiv:2502.10716  [pdf, other]

    cs.LG stat.ML

    Why Domain Generalization Fail? A View of Necessity and Sufficiency

    Authors: Long-Tung Vuong, Vy Vo, Hien Dang, Van-Anh Nguyen, Thanh-Toan Do, Mehrtash Harandi, Trung Le, Dinh Phung

    Abstract: Despite a strong theoretical foundation, empirical experiments reveal that existing domain generalization (DG) algorithms often fail to consistently outperform the ERM baseline. We argue that this issue arises because most DG studies focus on establishing theoretical guarantees for generalization under unrealistic assumptions, such as the availability of sufficient, diverse (or even infinite) doma…

    Submitted 15 February, 2025; originally announced February 2025.

  3. arXiv:2501.18950  [pdf, other]

    cs.LG cs.AI cs.CV

    Fantastic Targets for Concept Erasure in Diffusion Models and Where To Find Them

    Authors: Anh Bui, Trang Vu, Long Vuong, Trung Le, Paul Montague, Tamas Abraham, Junae Kim, Dinh Phung

    Abstract: Concept erasure has emerged as a promising technique for mitigating the risk of harmful content generation in diffusion models by selectively unlearning undesirable concepts. The common principle of previous works to remove a specific concept is to map it to a fixed generic concept, such as a neutral concept or just an empty text prompt. In this paper, we demonstrate that this fixed-target strateg…

    Submitted 23 May, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

    Journal ref: International Conference on Learning Representations 2025

  4. arXiv:2410.15618  [pdf, other]

    cs.LG cs.CV

    Erasing Undesirable Concepts in Diffusion Models with Adversarial Preservation

    Authors: Anh Bui, Long Vuong, Khanh Doan, Trung Le, Paul Montague, Tamas Abraham, Dinh Phung

    Abstract: Diffusion models excel at generating visually striking content from text but can inadvertently produce undesirable or harmful content when trained on unfiltered internet data. A practical solution is to selectively remove target concepts from the model, but this may impact the remaining concepts. Prior approaches have tried to balance this by introducing a loss term to preserve neutral content o…

    Submitted 23 May, 2025; v1 submitted 20 October, 2024; originally announced October 2024.

    Comments: Erasing Concepts, Generative Unlearning, NeurIPS 2024. arXiv admin note: text overlap with arXiv:2403.12326

  5. arXiv:2408.04221  [pdf, other]

    cs.CV cs.AI cs.LG cs.NE

    Connective Viewpoints of Signal-to-Noise Diffusion Models

    Authors: Khanh Doan, Long Tung Vuong, Tuan Nguyen, Anh Tuan Bui, Quyen Tran, Thanh-Toan Do, Dinh Phung, Trung Le

    Abstract: Diffusion models (DM) have become fundamental components of generative models, excelling across various domains such as image creation, audio generation, and complex data interpolation. Signal-to-Noise diffusion models constitute a diverse family covering most state-of-the-art diffusion models. While there have been several attempts to study Signal-to-Noise (S2N) diffusion models from various pers…

    Submitted 8 August, 2024; originally announced August 2024.

  6. arXiv:2407.13216  [pdf, other]

    cs.CV

    QuIIL at T3 challenge: Towards Automation in Life-Saving Intervention Procedures from First-Person View

    Authors: Trinh T. L. Vuong, Doanh C. Bui, Jin Tae Kwak

    Abstract: In this paper, we present our solutions for a spectrum of automation tasks in life-saving intervention procedures within the Trauma THOMPSON (T3) Challenge, encompassing action recognition, action anticipation, and Visual Question Answering (VQA). For action recognition and anticipation, we propose a pre-processing strategy that samples and stitches multiple inputs into a single image and then inc…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: MICCAI-Thompson Challenge 2023

  7. arXiv:2407.07360  [pdf, other]

    cs.CV cs.LG

    Towards a text-based quantitative and explainable histopathology image analysis

    Authors: Anh Tien Nguyen, Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: Recently, vision-language pre-trained models have emerged in computational pathology. Previous works generally focused on the alignment of image-text pairs via the contrastive pre-training paradigm. Such pre-trained models have been applied to pathology image classification in zero-shot learning or transfer learning fashion. Herein, we hypothesize that the pre-trained vision-language models can be…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: MICCAI 2024 - Early acceptance (Top 11%)

  8. arXiv:2407.07340  [pdf, other]

    cs.CV

    FALFormer: Feature-aware Landmarks self-attention for Whole-slide Image Classification

    Authors: Doanh C. Bui, Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: Slide-level classification for whole-slide images (WSIs) has been widely recognized as a crucial problem in digital and computational pathology. Current approaches commonly consider WSIs as a bag of cropped patches and process them via multiple instance learning due to the large number of patches, which cannot fully explore the relationship among patches; in other words, the global information can…

    Submitted 11 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: 10 pages, 2 figures

  9. arXiv:2308.16561  [pdf, other]

    eess.IV cs.CV

    MoMA: Momentum Contrastive Learning with Multi-head Attention-based Knowledge Distillation for Histopathology Image Analysis

    Authors: Trinh Thi Le Vuong, Jin Tae Kwak

    Abstract: There is no doubt that advanced artificial intelligence models and high quality data are the keys to success in developing computational pathology tools. Although the overall volume of pathology data keeps increasing, a lack of quality data is a common issue when it comes to a specific task due to several reasons including privacy and ethical issues with patient data. In this work, we propose to e…

    Submitted 11 December, 2024; v1 submitted 31 August, 2023; originally announced August 2023.

  10. arXiv:2303.06274  [pdf]

    cs.CV cs.LG

    CoNIC Challenge: Pushing the Frontiers of Nuclear Detection, Segmentation, Classification and Counting

    Authors: Simon Graham, Quoc Dang Vu, Mostafa Jahanifar, Martin Weigert, Uwe Schmidt, Wenhua Zhang, Jun Zhang, Sen Yang, Jinxi Xiang, Xiyue Wang, Josef Lorenz Rumberger, Elias Baumann, Peter Hirsch, Lihao Liu, Chenyang Hong, Angelica I. Aviles-Rivero, Ayushi Jain, Heeyoung Ahn, Yiyu Hong, Hussam Azzuni, Min Xu, Mohammad Yaqub, Marie-Claire Blache, Benoît Piégu, Bertrand Vernay, et al. (64 additional authors not shown)

    Abstract: Nuclear detection, segmentation and morphometric profiling are essential in helping us further understand the relationship between histology and patient outcome. To drive innovation in this area, we set up a community-wide challenge using the largest available dataset of its kind to assess nuclear segmentation and cellular composition. Our challenge, named CoNIC, stimulated the development of repro…

    Submitted 14 March, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

  11. arXiv:2210.07646  [pdf, other]

    cs.CV cs.LG

    Vision Transformer Visualization: What Neurons Tell and How Neurons Behave?

    Authors: Van-Anh Nguyen, Khanh Pham Dinh, Long Tung Vuong, Thanh-Toan Do, Quan Hung Tran, Dinh Phung, Trung Le

    Abstract: Recently, vision transformers (ViT) have been applied successfully to various tasks in computer vision. However, important questions such as why they work or how they behave remain largely unanswered. In this paper, we propose an effective visualization technique to assist us in exposing the information carried in neurons and feature embeddings across the ViT's layers. Our approach departs fro…

    Submitted 17 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: The first two authors contributed equally to this work. Our code is available at https://github.com/byM1902/ViT_visualization

  12. arXiv:2209.09002  [pdf, other]

    cs.CV

    MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation

    Authors: Chuanxia Zheng, Long Tung Vuong, Jianfei Cai, Dinh Phung

    Abstract: Although two-stage Vector Quantized (VQ) generative models allow for synthesizing high-fidelity and high-resolution images, their quantization operator encodes similar patches within an image into the same index, resulting in a repeated artifact for similar adjacent regions using existing decoder architectures. To address this issue, we propose to incorporate the spatially conditional normalizatio…

    Submitted 19 September, 2022; originally announced September 2022.

  13. arXiv:2208.11052  [pdf, other]

    cs.CV

    IMPaSh: A Novel Domain-shift Resistant Representation for Colorectal Cancer Tissue Classification

    Authors: Trinh Thi Le Vuong, Quoc Dang Vu, Mostafa Jahanifar, Simon Graham, Jin Tae Kwak, Nasir Rajpoot

    Abstract: The appearance of histopathology images depends on tissue type, staining and digitization procedure. These vary from source to source and are the potential causes for domain-shift problems. Owing to this problem, despite the great success of deep learning models in computational pathology, a model trained on a specific domain may still perform sub-optimally when we apply it to another domain. To…

    Submitted 23 August, 2022; originally announced August 2022.

    Comments: Accepted in ECCV2022 MCV Workshop

  14. arXiv:2111.13822  [pdf, other]

    cs.LG cs.AI stat.ML

    On Learning Domain-Invariant Representations for Transfer Learning with Multiple Sources

    Authors: Trung Phung, Trung Le, Long Vuong, Toan Tran, Anh Tran, Hung Bui, Dinh Phung

    Abstract: Domain adaptation (DA) benefits from the rigorous theoretical works that study its insightful characteristics and various aspects, e.g., learning domain-invariant representations and its trade-off. However, this does not seem to be the case for the multiple-source DA and domain generalization (DG) settings, which are remarkably more complicated and sophisticated due to the involvement of multiple source domain…

    Submitted 27 November, 2021; originally announced November 2021.

    Comments: NeurIPS 2021

    Journal ref: Proceedings of Advances in Neural Information Processing Systems (2021) 27720-27733

  15. arXiv:2005.07682  [pdf, other]

    eess.IV cs.CV physics.optics

    Small-brain neural networks rapidly solve inverse problems with vortex Fourier encoders

    Authors: Baurzhan Muminov, Luat T. Vuong

    Abstract: We introduce a vortex phase transform with a lenslet-array to accompany shallow, dense, "small-brain" neural networks for high-speed and low-light imaging. Our single-shot ptychographic approach exploits the coherent diffraction, compact representation, and edge enhancement of Fourier-transformed spiral-phase gradients. With vortex spatial encoding, a small brain is trained to deconvolve images a…

    Submitted 15 May, 2020; originally announced May 2020.

  16. arXiv:1708.03850  [pdf, other]

    cs.SI physics.soc-ph

    Structure in scientific networks: towards predictions of research dynamism

    Authors: Benjamin W. Stewart, Andy Rivas, Luat T. Vuong

    Abstract: Certain areas of scientific research flourish while others lose advocates and attention. We are interested in whether structural patterns within citation networks correspond to the growth or decline of the research areas to which those networks belong. We focus on three topic areas within optical physics as a set of cases; those areas have developed along different trajectories: one continues to e…

    Submitted 13 August, 2017; originally announced August 2017.

  17. arXiv:1312.3692  [pdf]

    cs.NI

    Designing a brown planthoppers surveillance network based on wireless sensor network approach

    Authors: Hoai Bao Lam, Tai Tan Phan, Long Huynh Vuong, Hiep Xuan Huynh, Bernard Pottier

    Abstract: This paper proposes a new approach for monitoring brown planthopper (BPH) swarms using a surveillance network at the provincial scale. The topology of this network is identified with a wireless sensor network (WSN), where each node is a real light trap and each edge describes the influence between two nodes, allowing gathering BPH information. Different communication ranges are evaluated to choose a su…

    Submitted 12 December, 2013; originally announced December 2013.