Skip to main content

Showing 1–1 of 1 results for author: Raza, M O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.12168  [pdf, other

    cs.CL

    FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

    Authors: KaShun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza

    Abstract: Large language models (LLMs) have become increasingly prevalent in our daily lives, leading to an expectation for LLMs to be trustworthy -- - both accurate and well-calibrated (the prediction confidence should align with its ground truth correctness likelihood). Nowadays, fine-tuning has become the most popular method for adapting a model to practical usage by significantly increasing accuracy on… ▽ More

    Submitted 2 October, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: EMNLP 2024