Showing 1–2 of 2 results for author: Zalouk, S

  1. arXiv:2505.15962

    cs.CL cs.AI cs.LG

    Pre-training Large Memory Language Models with Internal and External Knowledge

    Authors: Linxi Zhao, Sofian Zalouk, Christian K. Belardi, Justin Lovelace, Jin Peng Zhou, Kilian Q. Weinberger, Yoav Artzi, Jennifer J. Sun

    Abstract: Neural language models are black-boxes -- both linguistic patterns and factual knowledge are distributed across billions of opaque parameters. This entangled encoding makes it difficult to reliably inspect, verify, or update specific facts. We propose a new class of language models, Large Memory Language Models (LMLM) with a pre-training recipe that stores factual knowledge in both internal weight…

    Submitted 2 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: Code, models, and data available at https://github.com/kilian-group/LMLM

  2. arXiv:2310.20211

    cs.LG stat.ML

    Calibration by Distribution Matching: Trainable Kernel Calibration Metrics

    Authors: Charles Marx, Sofian Zalouk, Stefano Ermon

    Abstract: Calibration ensures that probabilistic forecasts meaningfully capture uncertainty by requiring that predicted probabilities align with empirical frequencies. However, many existing calibration methods are specialized for post-hoc recalibration, which can worsen the sharpness of forecasts. Drawing on the insight that calibration can be viewed as a distribution matching task, we introduce kernel-bas…

    Submitted 31 October, 2023; originally announced October 2023.