Skip to main content

Showing 1–1 of 1 results for author: Deb, U

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.08019  [pdf, other

    cs.LG cs.AI

    AdaKD: Dynamic Knowledge Distillation of ASR models using Adaptive Loss Weighting

    Authors: Shreyan Ganguly, Roshan Nayak, Rakshith Rao, Ujan Deb, Prathosh AP

    Abstract: Knowledge distillation, a widely used model compression technique, works on the basis of transferring knowledge from a cumbersome teacher model to a lightweight student model. The technique involves jointly optimizing the task specific and knowledge distillation losses with a weight assigned to them. Despite these weights playing a crucial role in the performance of the distillation process, curre… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.