
Showing 1–2 of 2 results for author: Panchal, U

  1. arXiv:2410.09687

    cs.LG cs.AI cs.CL

    MoIN: Mixture of Introvert Experts to Upcycle an LLM

    Authors: Ajinkya Tejankar, KL Navaneet, Ujjawal Panchal, Kossar Pourahmadi, Hamed Pirsiavash

    Abstract: The goal of this paper is to improve (upcycle) an existing large language model without the prohibitive requirements of continued pre-training of the full model. The idea is to split the pre-training data into semantically relevant groups and train an expert on each subset. An expert takes the form of a lightweight adapter added on top of a frozen base model. During inference, an incoming quer…

    Submitted 12 October, 2024; originally announced October 2024.

  2. arXiv:2208.06882

    cs.CV

    CoShNet: A Hybrid Complex Valued Neural Network using Shearlets

    Authors: Manny Ko, Ujjawal K. Panchal, Héctor Andrade-Loarca, Andres Mendez-Vazquez

    Abstract: In a hybrid neural network, the expensive convolutional layers are replaced by a non-trainable fixed transform with a great reduction in parameters. In previous works, good results were obtained by replacing the convolutions with wavelets. However, wavelet-based hybrid networks inherit the wavelets' lack of vanishing moments along curves and their axis bias. We propose to use Shearlets with its robust…

    Submitted 29 October, 2022; v1 submitted 14 August, 2022; originally announced August 2022.

    Comments: 16 pages, 11 figures