Skip to main content

Showing 1–3 of 3 results for author: The, A N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.08285  [pdf, other

    cs.CL cs.LG

    Adaptive Prompting for Continual Relation Extraction: A Within-Task Variance Perspective

    Authors: Minh Le, Tien Ngoc Luu, An Nguyen The, Thanh-Thien Le, Trang Nguyen, Tung Thanh Nguyen, Linh Ngo Van, Thien Huu Nguyen

    Abstract: To address catastrophic forgetting in Continual Relation Extraction (CRE), many current approaches rely on memory buffers to rehearse previously learned knowledge while acquiring new tasks. Recently, prompt-based methods have emerged as potent alternatives to rehearsal-based strategies, demonstrating strong empirical performance. However, upon analyzing existing prompt-based approaches for CRE, we… ▽ More

    Submitted 18 January, 2025; v1 submitted 11 December, 2024; originally announced December 2024.

    Comments: Oral presentation at AAAI 2025

  2. arXiv:2410.04213  [pdf, ps, other

    cs.LG

    Equivariant Polynomial Functional Networks

    Authors: Thieu N. Vo, Viet-Hoang Tran, Tho Tran Huu, An Nguyen The, Thanh Tran, Minh-Khoi Nguyen-Nhat, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key design principle of NFNs is their adherence to the permutation and scaling symmetries inherent in the connectionist structure of the input neural networks. Recent NF… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  3. arXiv:2410.04209  [pdf, other

    cs.LG

    Equivariant Neural Functional Networks for Transformers

    Authors: Viet-Hoang Tran, Thieu N. Vo, An Nguyen The, Tho Tran Huu, Minh-Khoi Nguyen-Nhat, Thanh Tran, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and have proven valuable for tasks such as learnable optimizers, implicit data representations, and weight editing. While NFN have been extensively developed for ML… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted in ICLR 2025