Skip to main content

Showing 1–4 of 4 results for author: Huu, T T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2502.20525  [pdf, other

    cs.LG cs.AI

    Revisiting Kernel Attention with Correlated Gaussian Process Representation

    Authors: Long Minh Bui, Tho Tran Huu, Duy Dinh, Tan Minh Nguyen, Trong Nghia Hoang

    Abstract: Transformers have increasingly become the de facto method to model sequential data with state-of-the-art performance. Due to its widespread use, being able to estimate and calibrate its modeling uncertainty is important to understand and design robust transformer models. To achieve this, previous works have used Gaussian processes (GPs) to perform uncertainty calibration for the attention units of… ▽ More

    Submitted 27 February, 2025; originally announced February 2025.

    Comments: 21 pages, 4 figures

    Journal ref: The 40th Conference on Uncertainty in Artificial Intelligence, 2024

  2. arXiv:2410.04692  [pdf, other

    cs.LG stat.ML

    A Clifford Algebraic Approach to E(n)-Equivariant High-order Graph Neural Networks

    Authors: Viet-Hoang Tran, Thieu N. Vo, Tho Tran Huu, Tan Minh Nguyen

    Abstract: Designing neural network architectures that can handle data symmetry is crucial. This is especially important for geometric graphs whose properties are equivariance under Euclidean transformations. Current equivariant graph neural networks (EGNNs), particularly those using message passing, have a limitation in expressive power. Recent high-order graph neural networks can overcome this limitation,… ▽ More

    Submitted 13 March, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

  3. arXiv:2410.04213  [pdf, ps, other

    cs.LG

    Equivariant Polynomial Functional Networks

    Authors: Thieu N. Vo, Viet-Hoang Tran, Tho Tran Huu, An Nguyen The, Thanh Tran, Minh-Khoi Nguyen-Nhat, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key design principle of NFNs is their adherence to the permutation and scaling symmetries inherent in the connectionist structure of the input neural networks. Recent NF… ▽ More

    Submitted 5 October, 2024; originally announced October 2024.

  4. arXiv:2410.04209  [pdf, other

    cs.LG

    Equivariant Neural Functional Networks for Transformers

    Authors: Viet-Hoang Tran, Thieu N. Vo, An Nguyen The, Tho Tran Huu, Minh-Khoi Nguyen-Nhat, Thanh Tran, Duy-Tung Pham, Tan Minh Nguyen

    Abstract: This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and have proven valuable for tasks such as learnable optimizers, implicit data representations, and weight editing. While NFN have been extensively developed for ML… ▽ More

    Submitted 7 March, 2025; v1 submitted 5 October, 2024; originally announced October 2024.

    Comments: Accepted in ICLR 2025