
Showing 1–4 of 4 results for author: Gentile, N

Searching in archive cs.
  1. arXiv:2506.07295  [pdf, ps, other]

    cs.CL

    Exploring the Impact of Temperature on Large Language Models: Hot or Cold?

    Authors: Lujun Li, Lama Sleem, Niccolo' Gentile, Geoffrey Nichil, Radu State

    Abstract: The sampling temperature, a critical hyperparameter in large language models (LLMs), modifies the logits before the softmax layer, thereby reshaping the distribution of output tokens. Recent studies have challenged the Stochastic Parrots analogy by demonstrating that LLMs are capable of understanding semantics rather than merely memorizing data and that randomness, modulated by sampling temperatur…

    Submitted 8 June, 2025; originally announced June 2025.

  2. arXiv:2505.16078  [pdf, ps, other]

    cs.CL

    Small Language Models in the Real World: Insights from Industrial Text Classification

    Authors: Lujun Li, Lama Sleem, Niccolo' Gentile, Geoffrey Nichil, Radu State

    Abstract: With the emergence of ChatGPT, Transformer models have significantly advanced text classification and related tasks. Decoder-only models such as Llama exhibit strong performance and flexibility, yet they suffer from inefficiency on inference due to token-by-token generation, and their effectiveness in text classification tasks heavily depends on prompt quality. Moreover, their substantial GPU reso…

    Submitted 23 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted as a conference paper in the Industry Track of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL)

  3. arXiv:2503.24102  [pdf, ps, other]

    cs.CL

    Is LLM the Silver Bullet to Low-Resource Languages Machine Translation?

    Authors: Yewei Song, Lujun Li, Cedric Lothritz, Saad Ezzini, Lama Sleem, Niccolo Gentile, Radu State, Tegawendé F. Bissyandé, Jacques Klein

    Abstract: Low-Resource Languages (LRLs) present significant challenges in natural language processing due to their limited linguistic resources and underrepresentation in standard datasets. While recent advances in Large Language Models (LLMs) and Neural Machine Translation have substantially improved translation capabilities for high-resource languages, performance disparities persist for LRLs, particularl…

    Submitted 5 June, 2025; v1 submitted 31 March, 2025; originally announced March 2025.

  4. arXiv:2009.01905  [pdf, other]

    cs.CE

    Frequency-Dependent Material Motion Benchmarks for Radiative Transfer

    Authors: Ryan G. McClarren, N. A. Gentile

    Abstract: We present a general solution for the radiation intensity in front of a purely absorbing slab moving toward an observer at constant speed and with a constant temperature. The solution is obtained by integrating the lab-frame radiation transport equation through the slab to the observer. We present comparisons between our benchmark and results from the Kull simulation code for an aluminum slab movi…

    Submitted 3 August, 2020; originally announced September 2020.

    Comments: Submitted to ANS M&C 2021 - The International Conference on Mathematics and Computational Methods Applied to Nuclear Science and Engineering

    Report number: LLNL-ABS-813186