Skip to main content

Showing 1–1 of 1 results for author: Donisch, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.03130  [pdf, other

    cs.CL

    Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations

    Authors: Leo Donisch, Sigurd Schacht, Carsten Lanquillon

    Abstract: Large language models are ubiquitous in natural language processing because they can adapt to new tasks without retraining. However, their sheer scale and complexity present unique challenges and opportunities, prompting researchers and practitioners to explore novel model training, optimization, and deployment methods. This literature review focuses on various techniques for reducing resource req… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.