Showing 1–2 of 2 results for author: Lamerigts, K

  1. arXiv:2410.19324  [pdf, other]

    cs.CV cs.LG stat.ML

    Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion

    Authors: Emiel Hoogeboom, Thomas Mensink, Jonathan Heek, Kay Lamerigts, Ruiqi Gao, Tim Salimans

    Abstract: Latent diffusion models have become the popular choice for scaling up diffusion models for high resolution image synthesis. Compared to pixel-space models that are trained end-to-end, latent models are perceived to be more efficient and to produce higher image quality at high resolution. Here we challenge these notions, and show that pixel-space models can be very competitive to latent models both…

    Submitted 22 March, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: Accepted to CVPR 2025

  2. arXiv:2410.02637  [pdf, other]

    cs.AI cs.CV

    Plots Unlock Time-Series Understanding in Multimodal Models

    Authors: Mayank Daswani, Mathias M. J. Bellaiche, Marc Wilson, Desislav Ivanov, Mikhail Papkov, Eva Schnider, Jing Tang, Kay Lamerigts, Gabriela Botea, Michael A. Sanchez, Yojan Patel, Shruthi Prabhakara, Shravya Shetty, Umesh Telang

    Abstract: While multimodal foundation models can now natively work with data beyond text, they remain underutilized in analyzing the considerable amounts of multi-dimensional time-series data in fields like healthcare, finance, and social sciences, representing a missed opportunity for richer, data-driven insights. This paper proposes a simple but effective method that leverages the existing vision encoders…

    Submitted 28 November, 2024; v1 submitted 3 October, 2024; originally announced October 2024.

    Comments: 57 pages