Skip to main content

Showing 1–1 of 1 results for author: Chiu, J W

Searching in archive eess. Search in all archives.
.
  1. arXiv:2411.05945  [pdf, other

    cs.CL cs.AI cs.LG cs.MA eess.AS

    NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

    Authors: Yen-Ting Lin, Chao-Han Huck Yang, Zhehuai Chen, Piotr Zelasko, Xuesong Yang, Zih-Ching Chen, Krishna C Puvvada, Szu-Wei Fu, Ke Hu, Jun Wei Chiu, Jagadeesh Balam, Boris Ginsburg, Yu-Chiang Frank Wang

    Abstract: Construction of a general-purpose post-recognition error corrector poses a crucial question: how can we most effectively train a model on a large mixture of domain datasets? The answer would lie in learning dataset-specific features and digesting their knowledge in a single model. Previous methods achieve this by having separate correction language models, resulting in a significant increase in pa… ▽ More

    Submitted 8 November, 2024; originally announced November 2024.

    Comments: NeKo work has been done in June 2024. NeKo LMs will be open source on https://huggingface.co/nvidia under the MIT license