Showing 1–2 of 2 results for author: Belardi, C K

  1. arXiv:2505.15962  [pdf, ps, other]

    cs.CL cs.AI cs.LG

    Pre-training Large Memory Language Models with Internal and External Knowledge

    Authors: Linxi Zhao, Sofian Zalouk, Christian K. Belardi, Justin Lovelace, Jin Peng Zhou, Kilian Q. Weinberger, Yoav Artzi, Jennifer J. Sun

    Abstract: Neural language models are black-boxes -- both linguistic patterns and factual knowledge are distributed across billions of opaque parameters. This entangled encoding makes it difficult to reliably inspect, verify, or update specific facts. We propose a new class of language models, Large Memory Language Models (LMLM) with a pre-training recipe that stores factual knowledge in both internal weight…

    Submitted 2 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Code, models, and data available at https://github.com/kilian-group/LMLM

  2. arXiv:2407.06172  [pdf, other]

    cs.AI cs.CL

    On Speeding Up Language Model Evaluation

    Authors: Jin Peng Zhou, Christian K. Belardi, Ruihan Wu, Travis Zhang, Carla P. Gomes, Wen Sun, Kilian Q. Weinberger

    Abstract: Developing prompt-based methods with Large Language Models (LLMs) requires making numerous decisions, which give rise to a combinatorial search problem over hyper-parameters. This exhaustive evaluation can be time-consuming and costly. In this paper, we propose an $\textit{adaptive}$ approach to explore this space. We are exploiting the fact that often only few samples are needed to identify clear…

    Submitted 26 February, 2025; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: ICLR 2025