Showing 1–2 of 2 results for author: Saragih, D

  1. arXiv:2503.19371  [pdf, other]

    cs.LG cs.AI

    Flow to Learn: Flow Matching on Neural Network Parameters

    Authors: Daniel Saragih, Deyu Cao, Tejas Balaji, Ashwin Santhosh

    Abstract: Foundational language models show a remarkable ability to learn new concepts during inference via context data. However, similar work for images lags behind. To address this challenge, we introduce FLoWN, a flow matching model that learns to generate neural network parameters for different tasks. Our approach models the flow on latent space, while conditioning the process on context data. Experimen…

    Submitted 19 April, 2025; v1 submitted 25 March, 2025; originally announced March 2025.

    Comments: Accepted at the ICLR Workshop on Neural Network Weights as a New Data Modality 2025

  2. arXiv:2404.15784  [pdf, other]

    cs.LG

    An Empirical Study of Aegis

    Authors: Daniel Saragih, Paridhi Goel, Tejas Balaji, Alyssa Li

    Abstract: Bit-flipping attacks are one class of attacks on neural networks, and numerous defense mechanisms have been invented to mitigate their potency. Due to the importance of ensuring the robustness of these defense mechanisms, we perform an empirical study on the Aegis framework. We evaluate the baseline mechanisms of Aegis on low-entropy data (MNIST), and we evaluate a pre-trained model with the mechanisms fine-t…

    Submitted 24 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures, 3 tables