Showing 1–1 of 1 results for author: Vu, L Y

  1. arXiv:2503.03285  [pdf, other]

    cs.CV cs.LG

    Enhancing Vietnamese VQA through Curriculum Learning on Raw and Augmented Text Representations

    Authors: Khoi Anh Nguyen, Linh Yen Vu, Thang Dinh Duong, Thuan Nguyen Duong, Huy Thanh Nguyen, Vinh Quang Dinh

    Abstract: Visual Question Answering (VQA) is a multimodal task requiring reasoning across textual and visual inputs, which becomes particularly challenging in low-resource languages like Vietnamese due to linguistic variability and the lack of high-quality datasets. Traditional methods often rely heavily on extensive annotated datasets, computationally expensive pipelines, and large pre-trained models, spec…

    Submitted 6 March, 2025; v1 submitted 5 March, 2025; originally announced March 2025.

    Comments: 10 pages, 3 figures, AAAI-25 Workshop on Document Understanding and Intelligence