Showing 1–12 of 12 results for author: Shaul, N

  1. arXiv:2506.06215  [pdf, ps, other]

    cs.LG cs.CL

    Corrector Sampling in Language Models

    Authors: Itai Gat, Neta Shaul, Uriel Singer, Yaron Lipman

    Abstract: Autoregressive language models accumulate errors due to their fixed, irrevocable left-to-right token generation. To address this, we propose a new sampling method called Resample-Previous-Tokens (RPT). RPT mitigates error accumulation by iteratively revisiting and potentially replacing tokens in a window of previously generated text. This method can be integrated into existing autoregressive model…

    Submitted 6 June, 2025; originally announced June 2025.

  2. arXiv:2412.06264  [pdf, other]

    cs.LG

    Flow Matching Guide and Code

    Authors: Yaron Lipman, Marton Havasi, Peter Holderrieth, Neta Shaul, Matt Le, Brian Karrer, Ricky T. Q. Chen, David Lopez-Paz, Heli Ben-Hamu, Itai Gat

    Abstract: Flow Matching (FM) is a recent framework for generative modeling that has achieved state-of-the-art performance across various domains, including image, video, audio, speech, and biological structures. This guide offers a comprehensive and self-contained review of FM, covering its mathematical foundations, design choices, and extensions. By also providing a PyTorch package featuring relevant examp…

    Submitted 9 December, 2024; originally announced December 2024.

  3. arXiv:2412.03487  [pdf, other]

    cs.LG cs.AI

    Flow Matching with General Discrete Paths: A Kinetic-Optimal Perspective

    Authors: Neta Shaul, Itai Gat, Marton Havasi, Daniel Severo, Anuroop Sriram, Peter Holderrieth, Brian Karrer, Yaron Lipman, Ricky T. Q. Chen

    Abstract: The design space of discrete-space diffusion or flow generative models is significantly less well-understood than that of their continuous-space counterparts, with many works focusing only on a simple masked construction. In this work, we aim to take a holistic approach to the construction of discrete generative models based on continuous-time Markov chains, and for the first time, allow the use of arbit…

    Submitted 4 December, 2024; originally announced December 2024.

  4. arXiv:2410.20587  [pdf, other]

    cs.LG cs.AI

    Generator Matching: Generative modeling with arbitrary Markov processes

    Authors: Peter Holderrieth, Marton Havasi, Jason Yim, Neta Shaul, Itai Gat, Tommi Jaakkola, Brian Karrer, Ricky T. Q. Chen, Yaron Lipman

    Abstract: We introduce Generator Matching, a modality-agnostic framework for generative modeling using arbitrary Markov processes. Generators characterize the infinitesimal evolution of a Markov process, which we leverage for generative modeling in a similar vein to flow matching: we construct conditional generators which generate single data points, then learn to approximate the marginal generator which ge…

    Submitted 26 February, 2025; v1 submitted 27 October, 2024; originally announced October 2024.

  5. arXiv:2407.15595  [pdf, other]

    cs.LG cs.AI

    Discrete Flow Matching

    Authors: Itai Gat, Tal Remez, Neta Shaul, Felix Kreuk, Ricky T. Q. Chen, Gabriel Synnaeve, Yossi Adi, Yaron Lipman

    Abstract: Despite Flow Matching and diffusion models having emerged as powerful generative paradigms for continuous variables such as images and videos, their application to high-dimensional discrete data, such as language, is still limited. In this work, we present Discrete Flow Matching, a novel discrete flow paradigm designed specifically for generating discrete data. Discrete Flow Matching offers severa…

    Submitted 5 November, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

  6. arXiv:2403.01329  [pdf, other]

    cs.LG cs.AI cs.CV

    Bespoke Non-Stationary Solvers for Fast Sampling of Diffusion and Flow Models

    Authors: Neta Shaul, Uriel Singer, Ricky T. Q. Chen, Matthew Le, Ali Thabet, Albert Pumarola, Yaron Lipman

    Abstract: This paper introduces Bespoke Non-Stationary (BNS) Solvers, a solver distillation approach to improve sample efficiency of Diffusion and Flow models. BNS solvers are based on a family of non-stationary solvers that provably subsumes existing numerical ODE solvers and consequently demonstrate considerable improvement in sample approximation (PSNR) over these baselines. Compared to model distillatio…

    Submitted 2 March, 2024; originally announced March 2024.

  7. arXiv:2311.13443  [pdf, other]

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Guided Flows for Generative Modeling and Decision Making

    Authors: Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

    Abstract: Classifier-free guidance is a key component for enhancing the performance of conditional generative models across diverse tasks. While it has previously demonstrated remarkable improvements in sample quality, it has been employed exclusively for diffusion models. In this paper, we integrate classifier-free guidance into Flow Matching (FM) models, an alternative simulation-free approach t…

    Submitted 7 December, 2023; v1 submitted 22 November, 2023; originally announced November 2023.

  8. arXiv:2310.19075  [pdf, other]

    cs.LG cs.AI cs.CV

    Bespoke Solvers for Generative Flow Models

    Authors: Neta Shaul, Juan Perez, Ricky T. Q. Chen, Ali Thabet, Albert Pumarola, Yaron Lipman

    Abstract: Diffusion or flow-based models are powerful generative paradigms that are notoriously hard to sample as samples are defined as solutions to high-dimensional Ordinary or Stochastic Differential Equations (ODEs/SDEs) which require a large Number of Function Evaluations (NFE) to approximate well. Existing methods to alleviate the costly sampling process include model distillation and designing dedica…

    Submitted 29 October, 2023; originally announced October 2023.

  9. arXiv:2306.06626  [pdf, other]

    cs.LG stat.ML

    On Kinetic Optimal Probability Paths for Generative Models

    Authors: Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matt Le, Yaron Lipman

    Abstract: Recent successful generative models are trained by fitting a neural network to an a-priori defined tractable probability density path taking noise to training examples. In this paper we investigate the space of Gaussian probability paths, which includes diffusion paths as an instance, and look for an optimal member in some useful sense. In particular, minimizing the Kinetic Energy (KE) of a path i…

    Submitted 11 June, 2023; originally announced June 2023.

  10. Improved Simulation of the Mass Charging for ASTROD I

    Authors: Gang Bao, Wei-Tou Ni, D. N. A. Shaul, H. M. Araujo, Lei Liu, T. J. Sumner

    Abstract: The electrostatic charging of the test mass in the ASTROD I (Astrodynamical Space Test of Relativity using Optical Devices I) mission can affect the quality of the science data as a result of spurious Coulomb and Lorentz forces. To estimate the size of the resultant disturbances, credible predictions of charging rates and the charging noise are required. Using the GEANT4 software toolkit, we present…

    Submitted 13 July, 2007; originally announced July 2007.

    Comments: 20 pages, 14 figures, submitted to International Journal of Modern Physics D

    Journal ref: INTERNATIONAL JOURNAL OF MODERN PHYSICS D, Vol.17, No.7, 965 (July 2008)

  11. arXiv:0704.3493  [pdf]

    astro-ph

    Simulation of ASTROD I test mass charging due to solar energetic particles

    Authors: Lei Liu, Gang Bao, Wei-Tou Ni, D N A Shaul

    Abstract: As ASTROD I travels through space, its test mass will accrue charge due to galactic cosmic-rays and solar energetic particles incident on the spacecraft. This test mass charge will result in Coulomb forces between the test mass and the surrounding electrodes. In earlier work using the GEANT4 toolkit, we predicted a net charging rate of nearly 9.0 +e/s from cosmic-ray protons between 0.1 and 1000…

    Submitted 26 April, 2007; originally announced April 2007.

    Comments: 10 pages, 8 figures, COSPAR2006 H0.1-1, submitted

  12. arXiv:0704.3303  [pdf]

    astro-ph

    ASTROD I Charging Simulation and Disturbances

    Authors: Gang Bao, D N A Shaul, H M Araujo, Wei-Tou Ni, T J Sumner, Lei Liu

    Abstract: ASTROD I is planned as a single spacecraft mission. It will use interferometric and pulse ranging techniques between the spacecraft and ground stations, to make high precision measurements of the parameters that describe the solar system, and to test relativistic gravity with improved accuracy. At the heart of the spacecraft is a test mass, which the spacecraft will follow using a drag-free cont…

    Submitted 24 April, 2007; originally announced April 2007.

    Comments: 17 pages, 11 figures, submitted to General Relativity and Gravitation