Showing 1–50 of 109 results for author: Elad, M

  1. arXiv:2504.01689 [pdf, other]

    cs.CV

    InvFussion: Bridging Supervised and Zero-shot Diffusion for Inverse Problems

    Authors: Noam Elata, Hyungjin Chung, Jong Chul Ye, Tomer Michaeli, Michael Elad

    Abstract: Diffusion Models have demonstrated remarkable capabilities in handling inverse problems, offering high-quality posterior-sampling-based solutions. Despite significant advances, a fundamental trade-off persists, regarding the way the conditioned synthesis is employed: Training-based methods achieve high quality results, while zero-shot approaches trade this with flexibility. This work introduces a…

    Submitted 2 April, 2025; originally announced April 2025.

  2. arXiv:2504.00873 [pdf, other]

    hep-ex

    Input to the ESPPU: The LUXE Experiment

    Authors: H. Abramowicz, M. Almanza Soto, M. Altarelli, R. Aßmann, A. Athanassiadis, G. Avoni, T. Behnke, M. Benettoni, Y. Benhammou, J. Bhatt, T. Blackburn, C. Blanch, S. Bonaldo, S. Boogert, O. Borysov, M. Borysova, V. Boudry, D. Breton, R. Brinkmann, M. Bruschi, F. Burkart, K. Büßer, N. Cavanagh, F. Dal Corso, W. Decking, et al. (108 additional authors not shown)

    Abstract: This document presents an overview of LUXE (Laser Und XFEL Experiment), an experiment that will combine the high-quality and high-energy electron beam of the European XFEL with a high-intensity laser, to explore the uncharted terrain of strong-field quantum electrodynamics. The scientific case, facility, and detector setup are presented together with an overview of the foreseen timeline and expect…

    Submitted 1 April, 2025; originally announced April 2025.

  3. arXiv:2502.01189 [pdf, other]

    eess.IV cs.AI cs.CV cs.IT eess.SP

    Compressed Image Generation with Denoising Diffusion Codebook Models

    Authors: Guy Ohayon, Hila Manor, Tomer Michaeli, Michael Elad

    Abstract: We present a novel generative approach based on Denoising Diffusion Models (DDMs), which produces high-quality image samples along with their losslessly compressed bit-stream representations. This is obtained by replacing the standard Gaussian noise sampling in the reverse diffusion with a selection of noise samples from pre-defined codebooks of fixed iid Gaussian vectors. Surprisingly, we find th…

    Submitted 10 February, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: Code and demo are available at https://ddcm-2025.github.io/

  4. arXiv:2502.00180 [pdf, other]

    cs.LG stat.ML

    Spectral Analysis of Diffusion Models with Application to Schedule Design

    Authors: Roi Benita, Michael Elad, Joseph Keshet

    Abstract: Diffusion models (DMs) have emerged as powerful tools for modeling complex data distributions and generating realistic new samples. Over the years, advanced architectures and sampling methods have been developed to make these models practically usable. However, certain synthesis process decisions still rely on heuristics without a solid theoretical foundation. In our work, we offer a novel analysi…

    Submitted 31 May, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

  5. arXiv:2501.12102 [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Proxies for Distortion and Consistency with Applications for Real-World Image Restoration

    Authors: Sean Man, Guy Ohayon, Ron Raphaeli, Michael Elad

    Abstract: Real-world image restoration deals with the recovery of images suffering from an unknown degradation. This task is typically addressed while being given only degraded images, without their corresponding ground-truth versions. In this hard setting, designing and evaluating restoration algorithms becomes highly challenging. This paper offers a suite of tools that can serve both the design and assess…

    Submitted 21 January, 2025; originally announced January 2025.

    Comments: Project page in https://man-sean.github.io/elad-website/

  6. arXiv:2501.11746 [pdf, other]

    cs.CV cs.AI cs.LG

    SILO: Solving Inverse Problems with Latent Operators

    Authors: Ron Raphaeli, Sean Man, Michael Elad

    Abstract: Consistent improvement of image priors over the years has led to the development of better inverse problem solvers. Diffusion models are the newcomers to this arena, posing the strongest known prior to date. Recently, such models operating in a latent space have become increasingly predominant due to their efficiency. In recent works, these models have been applied to solve inverse problems. Worki…

    Submitted 20 January, 2025; originally announced January 2025.

    Comments: Project page in https://ronraphaeli.github.io/SILO-website/

  7. arXiv:2501.07431 [pdf, other]

    physics.ins-det hep-ex

    Novel Silicon and GaAs Sensors for Compact Sampling Calorimeters

    Authors: H. Abramowicz, M. Almanza Soto, Y. Benhammou, W. Daniluk, M. Elad, M. Firlej, T. Fiutowski, V. Ghenescu, G. Grzelak, D. Horn, S. Huang, M. Idzik, A. Irles, J. Kotula, A. Levy, I. Levy, W. Lohmann, J. Morón, A. T. Neagu, D. Pietruch, P. M. Potlog, K. Świentek, A. F. Żarnecki, K. Zembaczyński

    Abstract: Two samples of silicon pad sensors and two samples of GaAs sensors are studied in an electron beam with 5 GeV energy from the DESY-II test-beam facility. The sizes of the silicon and GaAs sensors are about 9$\times$9 cm$^2$ and 5$\times$8 cm$^2$, respectively. The thickness is 500 micrometer for both the silicon and GaAs sensors. The pad size is about 5$\times$5 mm$^2$. The sensors are foreseen to…

    Submitted 13 January, 2025; originally announced January 2025.

    Comments: 22 pages, 24 figures, submitted to The European Physical Journal C

  8. arXiv:2410.00418 [pdf, other]

    eess.IV cs.AI cs.CV eess.SP

    Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration

    Authors: Guy Ohayon, Tomer Michaeli, Michael Elad

    Abstract: Photo-realistic image restoration algorithms are typically evaluated by distortion measures (e.g., PSNR, SSIM) and by perceptual quality measures (e.g., FID, NIQE), where the desire is to attain the lowest possible distortion without compromising on perceptual quality. To achieve this goal, current methods commonly attempt to sample from the posterior distribution, or to optimize a weighted sum of…

    Submitted 4 February, 2025; v1 submitted 1 October, 2024; originally announced October 2024.

    Comments: Accepted to ICLR 2025. Code and demo are available at https://pmrf-ml.github.io/

  9. arXiv:2408.17046 [pdf, other]

    cs.CV

    Text-to-Image Generation Via Energy-Based CLIP

    Authors: Roy Ganz, Michael Elad

    Abstract: Joint Energy Models (JEMs), while drawing significant research attention, have not been successfully scaled to real-world, high-resolution datasets. We present EB-CLIP, a novel approach extending JEMs to the multimodal vision-language domain using CLIP, integrating both generative and discriminative objectives. For the generative objective, we introduce an image-text joint-energy function based on…

    Submitted 30 August, 2024; originally announced August 2024.

  10. arXiv:2407.15153 [pdf, other]

    cs.CV

    Anchored Diffusion for Video Face Reenactment

    Authors: Idan Kligvasser, Regev Cohen, George Leifman, Ehud Rivlin, Michael Elad

    Abstract: Video generation has drawn significant interest recently, pushing the development of large-scale models capable of producing realistic videos with coherent motion. Due to memory constraints, these models typically generate short video segments that are then combined into long videos. The merging process poses a significant challenge, as it requires ensuring smooth transitions and overall consisten…

    Submitted 21 July, 2024; originally announced July 2024.

  11. arXiv:2407.09896 [pdf, other]

    cs.CV eess.IV

    PSC: Posterior Sampling-Based Compression

    Authors: Noam Elata, Tomer Michaeli, Michael Elad

    Abstract: Diffusion models have transformed the landscape of image generation and now show remarkable potential for image compression. Most of the recent diffusion-based compression methods require training and are tailored for a specific bit-rate. In this work, we propose Posterior Sampling-based Compression (PSC) - a zero-shot compression method that leverages a pre-trained diffusion model as its sole neu…

    Submitted 5 February, 2025; v1 submitted 13 July, 2024; originally announced July 2024.

  12. arXiv:2407.08256 [pdf, other]

    eess.IV cs.CV cs.LG

    Adaptive Compressed Sensing with Diffusion-Based Posterior Sampling

    Authors: Noam Elata, Tomer Michaeli, Michael Elad

    Abstract: Compressed Sensing (CS) facilitates rapid image acquisition by selecting a small subset of measurements sufficient for high-fidelity reconstruction. Adaptive CS seeks to further enhance this process by dynamically choosing future measurements based on information gleaned from data that is already acquired. However, many existing frameworks are often tailored to specific tasks and require intricate…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Published in European Conference on Computer Vision (ECCV) 2024

  13. arXiv:2405.16260 [pdf, other]

    cs.CV cs.LG

    Enhancing Consistency-Based Image Generation via Adversarialy-Trained Classification and Energy-Based Discrimination

    Authors: Shelly Golan, Roy Ganz, Michael Elad

    Abstract: The recently introduced Consistency models pose an efficient alternative to diffusion algorithms, enabling rapid and good quality image synthesis. These methods overcome the slowness of diffusion models by directly mapping noise to data, while maintaining a (relatively) simpler training. Consistency models enable a fast one- or few-step generation, but they typically fall somewhat short in sample…

    Submitted 28 November, 2024; v1 submitted 25 May, 2024; originally announced May 2024.

  14. arXiv:2405.13805 [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Perceptual Fairness in Image Restoration

    Authors: Guy Ohayon, Michael Elad, Tomer Michaeli

    Abstract: Fairness in image restoration tasks is the desire to treat different sub-groups of images equally well. Existing definitions of fairness in image restoration are highly restrictive. They consider a reconstruction to be a correct outcome for a group (e.g., women) only if it falls within the group's set of ground truth images (e.g., natural images of women); otherwise, it is considered entirely inco…

    Submitted 12 October, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  15. arXiv:2405.11566 [pdf, other]

    cs.LG

    Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models

    Authors: Omer Belhasin, Idan Kligvasser, George Leifman, Regev Cohen, Erin Rainaldi, Li-Fang Cheng, Nishant Verma, Paul Varghese, Ehud Rivlin, Michael Elad

    Abstract: Analyzing the cardiovascular system condition via Electrocardiography (ECG) is a common and highly effective approach, and it has been practiced and perfected over many decades. ECG sensing is non-invasive and relatively easy to acquire, and yet it is still cumbersome for holter monitoring tests that may span over hours and even days. A possible alternative in this context is Photoplethysmography…

    Submitted 20 April, 2025; v1 submitted 19 May, 2024; originally announced May 2024.

  16. arXiv:2402.00857 [pdf, other]

    cs.LG stat.ML

    Early Time Classification with Accumulated Accuracy Gap Control

    Authors: Liran Ringel, Regev Cohen, Daniel Freedman, Michael Elad, Yaniv Romano

    Abstract: Early time classification algorithms aim to label a stream of features without processing the full input stream, while maintaining accuracy comparable to that achieved by applying the classifier to the entire input. In this paper, we introduce a statistical framework that can be applied to any sequential classifier, formulating a calibrated stopping rule. This data-driven rule attains finite-sampl…

    Submitted 1 February, 2024; originally announced February 2024.

  17. arXiv:2311.09253 [pdf, other]

    eess.IV cs.CV cs.LG eess.SP

    The Perception-Robustness Tradeoff in Deterministic Image Restoration

    Authors: Guy Ohayon, Tomer Michaeli, Michael Elad

    Abstract: We study the behavior of deterministic methods for solving inverse problems in imaging. These methods are commonly designed to achieve two goals: (1) attaining high perceptual quality, and (2) generating reconstructions that are consistent with the measurements. We provide a rigorous proof that the better a predictor satisfies these two requirements, the larger its Lipschitz constant must be, rega…

    Submitted 8 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

  18. arXiv:2310.01381 [pdf, other]

    cs.SD cs.CL eess.AS

    DiffAR: Denoising Diffusion Autoregressive Model for Raw Speech Waveform Generation

    Authors: Roi Benita, Michael Elad, Joseph Keshet

    Abstract: Diffusion models have recently been shown to be relevant for high-quality speech generation. Most work has been focused on generating spectrograms, and as such, they further require a subsequent model to convert the spectrogram to a waveform (i.e., a vocoder). This work proposes a diffusion probabilistic end-to-end model for generating a raw speech waveform. The proposed model is autoregressive, g…

    Submitted 10 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

  19. arXiv:2308.00515 [pdf, other]

    hep-ex physics.ins-det

    Technical Design Report for the LUXE Experiment

    Authors: H. Abramowicz, M. Almanza Soto, M. Altarelli, R. Aßmann, A. Athanassiadis, G. Avoni, T. Behnke, M. Benettoni, Y. Benhammou, J. Bhatt, T. Blackburn, C. Blanch, S. Bonaldo, S. Boogert, O. Borysov, M. Borysova, V. Boudry, D. Breton, R. Brinkmann, M. Bruschi, F. Burkart, K. Büßer, N. Cavanagh, F. Dal Corso, W. Decking, et al. (109 additional authors not shown)

    Abstract: This Technical Design Report presents a detailed description of all aspects of the LUXE (Laser Und XFEL Experiment), an experiment that will combine the high-quality and high-energy electron beam of the European XFEL with a high-intensity laser, to explore the uncharted terrain of strong-field quantum electrodynamics characterised by both high energy and high intensity, reaching the Schwinger fiel…

    Submitted 2 August, 2023; v1 submitted 1 August, 2023; originally announced August 2023.

  20. arXiv:2306.16805 [pdf, other]

    cs.CV cs.CL cs.LG

    CLIPAG: Towards Generator-Free Text-to-Image Generation

    Authors: Roy Ganz, Michael Elad

    Abstract: Perceptually Aligned Gradients (PAG) refer to an intriguing property observed in robust image classification models, wherein their input gradients align with human perception and pose semantic meanings. While this phenomenon has gained significant research attention, it was solely studied in the context of unimodal vision-only architectures. In this work, we extend the study of PAG to Vision-Langu…

    Submitted 1 September, 2023; v1 submitted 29 June, 2023; originally announced June 2023.

  21. arXiv:2306.02342 [pdf, other]

    cs.AI

    Deep Optimal Transport: A Practical Algorithm for Photo-realistic Image Restoration

    Authors: Theo Adrai, Guy Ohayon, Tomer Michaeli, Michael Elad

    Abstract: We propose an image restoration algorithm that can control the perceptual quality and/or the mean square error (MSE) of any pre-trained model, trading one over the other at test time. Our algorithm is few-shot: Given about a dozen images restored by the model, it can significantly improve the perceptual quality and/or the MSE of the model for newly restored images without further training. Our app…

    Submitted 12 August, 2024; v1 submitted 4 June, 2023; originally announced June 2023.

  22. arXiv:2305.19066 [pdf, other]

    cs.CV

    Nested Diffusion Processes for Anytime Image Generation

    Authors: Noam Elata, Bahjat Kawar, Tomer Michaeli, Michael Elad

    Abstract: Diffusion models are the current state-of-the-art in image generation, synthesizing high-quality images by breaking down the generation process into many fine-grained denoising steps. Despite their good performance, diffusion models are computationally expensive, requiring many neural function evaluations (NFEs). In this work, we propose an anytime diffusion-based method that can generate viable i…

    Submitted 30 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  23. arXiv:2305.13128 [pdf, other]

    eess.IV cs.CV cs.LG

    GSURE-Based Diffusion Model Training with Corrupted Data

    Authors: Bahjat Kawar, Noam Elata, Tomer Michaeli, Michael Elad

    Abstract: Diffusion models have demonstrated impressive results in both data generation and downstream tasks such as inverse problems, text-based editing, classification, and more. However, training such models usually requires large amounts of clean signals which are often difficult or impossible to obtain. In this work, we propose a novel training technique for generative diffusion models based only on co…

    Submitted 13 June, 2024; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: Code: https://github.com/bahjat-kawar/gsure-diffusion

  24. arXiv:2305.10124 [pdf, other]

    cs.CV

    Principal Uncertainty Quantification with Spatial Correlation for Image Restoration Problems

    Authors: Omer Belhasin, Yaniv Romano, Daniel Freedman, Ehud Rivlin, Michael Elad

    Abstract: Uncertainty quantification for inverse problems in imaging has drawn much attention lately. Existing approaches towards this task define uncertainty regions based on probable values per pixel, while ignoring spatial correlations within the image, resulting in an exaggerated volume of uncertainty. In this paper, we propose PUQ (Principal Uncertainty Quantification) -- a novel definition and corresp…

    Submitted 20 January, 2024; v1 submitted 17 May, 2023; originally announced May 2023.

  25. Semi-supervised Quality Evaluation of Colonoscopy Procedures

    Authors: Idan Kligvasser, George Leifman, Roman Goldenberg, Ehud Rivlin, Michael Elad

    Abstract: Colonoscopy is the standard of care technique for detecting and removing polyps for the prevention of colorectal cancer. Nevertheless, gastroenterologists (GI) routinely miss approximately 25% of polyps during colonoscopies. These misses are highly operator dependent, influenced by the physician skills, experience, vigilance, and fatigue. Standard quality metrics, such as Withdrawal Time or Cecal…

    Submitted 17 May, 2023; originally announced May 2023.

    Journal ref: 2023 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)

  26. Colonoscopy Coverage Revisited: Identifying Scanning Gaps in Real-Time

    Authors: G. Leifman, I. Kligvasser, R. Goldenberg, M. Elad, E. Rivlin

    Abstract: Colonoscopy is the most widely used medical technique for preventing Colorectal Cancer, by detecting and removing polyps before they become malignant. Recent studies show that around one quarter of the existing polyps are routinely missed. While some of these do appear in the endoscopist's field of view, others are missed due to a partial coverage of the colon. The task of detecting and marking un…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: 10 pages, 5 figures

    Journal ref: MICCAI workshop on Cancer Prevention Through Early Detection, LNCS, Vol. 14295, 2023

  27. arXiv:2303.15409 [pdf, other]

    cs.CV

    Class-Conditioned Transformation for Enhanced Robust Image Classification

    Authors: Tsachi Blau, Roy Ganz, Chaim Baskin, Michael Elad, Alex M. Bronstein

    Abstract: Robust classification methods predominantly concentrate on algorithms that address a specific threat model, resulting in ineffective defenses against other threat models. Real-world applications are exposed to this vulnerability, as malicious attackers might exploit alternative threat models. In this work, we propose a novel test-time threat model agnostic algorithm that enhances Adversarial-Train…

    Submitted 4 November, 2024; v1 submitted 27 March, 2023; originally announced March 2023.

  28. arXiv:2302.04064 [pdf, other]

    cs.CV

    Weakly-supervised Representation Learning for Video Alignment and Analysis

    Authors: Guy Bar-Shalom, George Leifman, Michael Elad, Ehud Rivlin

    Abstract: Many tasks in video analysis and understanding boil down to the need for frame-based feature learning, aiming to encapsulate the relevant visual content so as to enable simpler and easier subsequent processing. While supervised strategies for this learning task can be envisioned, self and weakly-supervised alternatives are preferred due to the difficulties in getting labeled data. This paper intro…

    Submitted 8 February, 2023; originally announced February 2023.

  29. arXiv:2301.03362 [pdf, other]

    eess.IV cs.CV

    Image Denoising: The Deep Learning Revolution and Beyond -- A Survey Paper --

    Authors: Michael Elad, Bahjat Kawar, Gregory Vaksman

    Abstract: Image denoising (removal of additive white Gaussian noise from an image) is one of the oldest and most studied problems in image processing. An extensive work over several decades has led to thousands of papers on this subject, and to many well-performing algorithms for this task. Indeed, 10 years ago, these achievements have led some researchers to suspect that "Denoising is Dead", in the sense t…

    Submitted 9 January, 2023; originally announced January 2023.

  30. arXiv:2212.03235 [pdf, other]

    cs.CV cs.AI

    Complex-valued Retrievals From Noisy Images Using Diffusion Models

    Authors: Nadav Torem, Roi Ronen, Yoav Y. Schechner, Michael Elad

    Abstract: In diverse microscopy modalities, sensors measure only real-valued intensities. Additionally, the sensor readouts are affected by Poissonian-distributed photon noise. Traditional restoration algorithms typically aim to minimize the mean squared error (MSE) between the original and recovered images. This often leads to blurry outcomes with poor perceptual quality. Recently, deep diffusion models (D…

    Submitted 28 July, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: 11 pages, 7 figures

    MSC Class: I2; I4 ACM Class: I.2; I.4

  31. arXiv:2211.15211 [pdf, other]

    cs.CV cs.LG

    What's Behind the Mask: Estimating Uncertainty in Image-to-Image Problems

    Authors: Gilad Kutiel, Regev Cohen, Michael Elad, Daniel Freedman

    Abstract: Estimating uncertainty in image-to-image networks is an important task, particularly as such networks are being increasingly deployed in the biological and medical imaging realms. In this paper, we introduce a new approach to this problem based on masking. Given an existing image-to-image network, our approach computes a mask such that the distance between the masked reconstructed image and the ma…

    Submitted 28 November, 2022; originally announced November 2022.

  32. arXiv:2211.11827 [pdf, other]

    eess.IV cs.CV

    High-Perceptual Quality JPEG Decoding via Posterior Sampling

    Authors: Sean Man, Guy Ohayon, Theo Adrai, Michael Elad

    Abstract: JPEG is arguably the most popular image coding format, achieving high compression ratios via lossy quantization that may create visual artifacts degradation. Numerous attempts to remove these artifacts were conceived over the years, and common to most of these is the use of deterministic post-processing algorithms that optimize some distortion measure (e.g., PSNR, SSIM). In this paper we propose a…

    Submitted 30 August, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Presented in NTIRE workshop as part of CVPR 2023

  33. arXiv:2211.09919 [pdf, other]

    cs.CV

    Patch-Craft Self-Supervised Training for Correlated Image Denoising

    Authors: Gregory Vaksman, Michael Elad

    Abstract: Supervised neural networks are known to achieve excellent results in various image restoration tasks. However, such training requires datasets composed of pairs of corrupted images and their corresponding ground truth targets. Unfortunately, such data is not available in many applications. For the task of image denoising in which the noise statistics is unknown, several self-supervised training me…

    Submitted 17 November, 2022; originally announced November 2022.

  34. arXiv:2211.08944 [pdf, other]

    eess.IV cs.CV cs.LG

    Reasons for the Superiority of Stochastic Estimators over Deterministic Ones: Robustness, Consistency and Perceptual Quality

    Authors: Guy Ohayon, Theo Adrai, Michael Elad, Tomer Michaeli

    Abstract: Stochastic restoration algorithms allow to explore the space of solutions that correspond to the degraded input. In this paper we reveal additional fundamental advantages of stochastic methods over deterministic ones, which further motivate their use. First, we prove that any restoration algorithm that attains perfect perceptual quality and whose outputs are consistent with the input must be a pos…

    Submitted 26 July, 2023; v1 submitted 16 November, 2022; originally announced November 2022.

    Journal ref: Proceedings of the 40th International Conference on Machine Learning, PMLR 202:26474-26494, 2023

  35. arXiv:2209.11888 [pdf, other]

    eess.IV cs.CV

    JPEG Artifact Correction using Denoising Diffusion Restoration Models

    Authors: Bahjat Kawar, Jiaming Song, Stefano Ermon, Michael Elad

    Abstract: Diffusion models can be used as learned priors for solving various inverse problems. However, most existing approaches are restricted to linear inverse problems, limiting their applicability to more general cases. In this paper, we build upon Denoising Diffusion Restoration Models (DDRM) and propose a method for solving some non-linear inverse problems. We leverage the pseudo-inverse operator used…

    Submitted 23 November, 2022; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: Presented at NeurIPS 2022 Workshop on Score-Based Methods. Code: https://github.com/bahjat-kawar/ddrm-jpeg

  36. arXiv:2208.08664 [pdf, other]

    cs.CV cs.LG

    Enhancing Diffusion-Based Image Synthesis with Robust Classifier Guidance

    Authors: Bahjat Kawar, Roy Ganz, Michael Elad

    Abstract: Denoising diffusion probabilistic models (DDPMs) are a recent family of generative models that achieve state-of-the-art results. In order to obtain class-conditional generation, it was suggested to guide the diffusion process by gradients from a time-dependent classifier. While the idea is theoretically sound, deep learning-based classifiers are infamously susceptible to gradient-based adversarial…

    Submitted 15 March, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to TMLR

  37. arXiv:2207.11378 [pdf, other]

    cs.CV

    Do Perceptually Aligned Gradients Imply Adversarial Robustness?

    Authors: Roy Ganz, Bahjat Kawar, Michael Elad

    Abstract: Adversarially robust classifiers possess a trait that non-robust models do not -- Perceptually Aligned Gradients (PAG). Their gradients with respect to the input align well with human perception. Several works have identified PAG as a byproduct of robust training, but none have considered it as a standalone phenomenon nor studied its own implications. In this work, we focus on this trait and test…

    Submitted 9 August, 2023; v1 submitted 22 July, 2022; originally announced July 2022.

  38. arXiv:2207.08089 [pdf, other]

    cs.CV cs.AI

    Threat Model-Agnostic Adversarial Defense using Diffusion Models

    Authors: Tsachi Blau, Roy Ganz, Bahjat Kawar, Alex Bronstein, Michael Elad

    Abstract: Deep Neural Networks (DNNs) are highly sensitive to imperceptible malicious perturbations, known as adversarial attacks. Following the discovery of this vulnerability in real-world imaging and vision applications, the associated safety concerns have attracted vast research attention, and many defense techniques have been developed. Most of these defense methods rely on adversarial training (AT) --…

    Submitted 17 July, 2022; originally announced July 2022.

  39. arXiv:2201.11793 [pdf, other]

    eess.IV cs.CV cs.LG

    Denoising Diffusion Restoration Models

    Authors: Bahjat Kawar, Michael Elad, Stefano Ermon, Jiaming Song

    Abstract: Many interesting tasks in image restoration can be cast as linear inverse problems. A recent family of approaches for solving these problems uses stochastic algorithms that sample from the posterior distribution of natural images given the measurements. However, efficient solutions often require problem-specific supervised training to model the posterior, whereas unsupervised methods that are not…

    Submitted 12 October, 2022; v1 submitted 27 January, 2022; originally announced January 2022.

    Comments: Project page: https://ddrm-ml.github.io/

  40. arXiv:2108.03702 [pdf, other]

    cs.CV

    BIGRoC: Boosting Image Generation via a Robust Classifier

    Authors: Roy Ganz, Michael Elad

    Abstract: The interest of the machine learning community in image synthesis has grown significantly in recent years, with the introduction of a wide range of deep generative models and means for training them. In this work, we propose a general model-agnostic technique for improving the image quality and the distribution fidelity of generated images obtained by any generative model. Our method, termed BIGRo…

    Submitted 1 February, 2023; v1 submitted 8 August, 2021; originally announced August 2021.

    Journal ref: Transactions on Machine Learning Research, 2023

  41. arXiv:2105.14951 [pdf, other]

    eess.IV cs.CV

    SNIPS: Solving Noisy Inverse Problems Stochastically

    Authors: Bahjat Kawar, Gregory Vaksman, Michael Elad

    Abstract: In this work we introduce a novel stochastic algorithm dubbed SNIPS, which draws samples from the posterior distribution of any linear inverse problem, where the observation is assumed to be contaminated by additive white Gaussian noise. Our solution incorporates ideas from Langevin dynamics and Newton's method, and exploits a pre-trained minimum mean squared error (MMSE) Gaussian denoiser. The pr…

    Submitted 10 November, 2021; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Thirty-Fifth Conference on Neural Information Processing Systems (NeurIPS), 2021

  42. arXiv:2104.00464 [pdf, other]

    cs.CV eess.IV

    Improved Image Generation via Sparse Modeling

    Authors: Roy Ganz, Michael Elad

    Abstract: The interest of the deep learning community in image synthesis has grown massively in recent years. Nowadays, deep generative methods, and especially Generative Adversarial Networks (GANs), are leading to state-of-the-art performance, capable of synthesizing images that appear realistic. While the efforts for improving the quality of the generated images are extensive, most attempts still consider…

    Submitted 13 May, 2022; v1 submitted 1 April, 2021; originally announced April 2021.

  43. arXiv:2103.13767 [pdf, other]

    cs.CV

    Patch Craft: Video Denoising by Deep Modeling and Patch Matching

    Authors: Gregory Vaksman, Michael Elad, Peyman Milanfar

    Abstract: The non-local self-similarity property of natural images has been exploited extensively for solving various image processing problems. When it comes to video sequences, harnessing this force is even more beneficial due to the temporal redundancy. In the context of image and video denoising, many classically-oriented algorithms employ self-similarity, splitting the data into overlapping patches, ga…

    Submitted 30 October, 2021; v1 submitted 25 March, 2021; originally announced March 2021.

  44. High Perceptual Quality Image Denoising with a Posterior Sampling CGAN

    Authors: Guy Ohayon, Theo Adrai, Gregory Vaksman, Michael Elad, Peyman Milanfar

    Abstract: The vast work in Deep Learning (DL) has led to a leap in image denoising research. Most DL solutions for this task have chosen to put their efforts on the denoiser's architecture while maximizing distortion performance. However, distortion driven solutions lead to blurry results with sub-optimal perceptual quality, especially in immoderate noise levels. In this paper we propose a different perspec…

    Submitted 11 October, 2021; v1 submitted 6 March, 2021; originally announced March 2021.

    Journal ref: 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada, 2021, pp. 1805-1813

  45. arXiv:2101.09552  [pdf, other]

    eess.IV cs.CV

    Stochastic Image Denoising by Sampling from the Posterior Distribution

    Authors: Bahjat Kawar, Gregory Vaksman, Michael Elad

    Abstract: Image denoising is a well-known and well studied problem, commonly targeting a minimization of the mean squared error (MSE) between the outcome and the original image. Unfortunately, especially for severe noise levels, such Minimum MSE (MMSE) solutions may lead to blurry output images. In this work we propose a novel stochastic denoising approach that produces viable and high perceptual quality re…

    Submitted 31 August, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

  46. arXiv:2010.07069  [pdf, other]

    cs.LG eess.SP stat.ML

    Learned Greedy Method (LGM): A Novel Neural Architecture for Sparse Coding and Beyond

    Authors: Rajaei Khatib, Dror Simon, Michael Elad

    Abstract: The fields of signal and image processing have been deeply influenced by the introduction of deep neural networks. These are successfully deployed in a wide range of real-world applications, obtaining state of the art results and surpassing well-known and well-established classical methods. Despite their impressive success, the architectures used in many of these neural networks come with no clear…

    Submitted 20 October, 2020; v1 submitted 14 October, 2020; originally announced October 2020.

  47. arXiv:2008.00605  [pdf, other]

    eess.IV cs.CV

    The Rate-Distortion-Accuracy Tradeoff: JPEG Case Study

    Authors: Xiyang Luo, Hossein Talebi, Feng Yang, Michael Elad, Peyman Milanfar

    Abstract: Handling digital images is almost always accompanied by a lossy compression in order to facilitate efficient transmission and storage. This introduces an unavoidable tension between the allocated bit-budget (rate) and the faithfulness of the resulting image to the original one (distortion). An additional complicating consideration is the effect of the compression on recognition performance by give…

    Submitted 2 August, 2020; originally announced August 2020.

    ACM Class: I.4.2; I.5.1

  48. arXiv:2008.00226  [pdf, other]

    eess.IV cs.CV

    Regularization by Denoising via Fixed-Point Projection (RED-PRO)

    Authors: Regev Cohen, Michael Elad, Peyman Milanfar

    Abstract: Inverse problems in image processing are typically cast as optimization tasks, consisting of data-fidelity and stabilizing regularization terms. A recent regularization strategy of great interest utilizes the power of denoising engines. Two such methods are the Plug-and-Play Prior (PnP) and Regularization by Denoising (RED). While both have shown state-of-the-art results in various recovery tasks,…

    Submitted 28 October, 2020; v1 submitted 1 August, 2020; originally announced August 2020.

    Comments: 33 pages, 6 figures, 7 tables

  49. arXiv:2006.15555  [pdf, other]

    cs.LG cs.CV stat.ML

    When and How Can Deep Generative Models be Inverted?

    Authors: Aviad Aberdam, Dror Simon, Michael Elad

    Abstract: Deep generative models (e.g. GANs and VAEs) have been developed quite extensively in recent years. Lately, there has been an increased interest in the inversion of such a model, i.e. given a (possibly corrupted) signal, we wish to recover the latent vector that generated it. Building upon sparse representation theory, we define conditions that are applicable to any inversion algorithm (gradient de…

    Submitted 28 June, 2020; originally announced June 2020.

  50. Better Compression with Deep Pre-Editing

    Authors: Hossein Talebi, Damien Kelly, Xiyang Luo, Ignacio Garcia Dorado, Feng Yang, Peyman Milanfar, Michael Elad

    Abstract: Could we compress images via standard codecs while avoiding visible artifacts? The answer is obvious -- this is doable as long as the bit budget is generous enough. What if the allocated bit-rate for compression is insufficient? Then unfortunately, artifacts are a fact of life. Many attempts were made over the years to fight this phenomenon, with various degrees of success. In this work we aim to…

    Submitted 23 July, 2021; v1 submitted 31 January, 2020; originally announced February 2020.