Skip to main content

Showing 1–19 of 19 results for author: Moser, B B

.
  1. arXiv:2505.17799  [pdf, ps, other

    cs.LG cs.CV

    A Coreset Selection of Coreset Selection Literature: Introduction and Recent Advances

    Authors: Brian B. Moser, Arundhati S. Shanbhag, Stanislav Frolov, Federico Raue, Joachim Folz, Andreas Dengel

    Abstract: Coreset selection targets the challenge of finding a small, representative subset of a large dataset that preserves essential patterns for effective machine learning. Although several surveys have examined data reduction strategies before, most focus narrowly on either classical geometry-based methods or active learning techniques. In contrast, this survey presents a more comprehensive view by uni… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

  2. arXiv:2502.12691  [pdf, other

    cs.CV

    Spherical Dense Text-to-Image Synthesis

    Authors: Timon Winter, Stanislav Frolov, Brian Bernhard Moser, Andreas Dengel

    Abstract: Recent advancements in text-to-image (T2I) have improved synthesis results, but challenges remain in layout control and generating omnidirectional panoramic images. Dense T2I (DT2I) and spherical T2I (ST2I) models address these issues, but so far no unified approach exists. Trivial approaches, like prompting a DT2I model to generate panoramas can not generate proper spherical distortions and seaml… ▽ More

    Submitted 10 March, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: Link to project page https://sdt2i.github.io/

  3. arXiv:2502.03656  [pdf, other

    cs.CV cs.AI cs.LG

    A Study in Dataset Distillation for Image Super-Resolution

    Authors: Tobias Dietz, Brian B. Moser, Tobias Nauen, Federico Raue, Stanislav Frolov, Andreas Dengel

    Abstract: Dataset distillation is the concept of condensing large datasets into smaller but highly representative synthetic samples. While previous research has primarily focused on image classification, its application to image Super-Resolution (SR) remains underexplored. This exploratory work studies multiple dataset distillation techniques applied to SR, including pixel- and latent-space approaches under… ▽ More

    Submitted 5 February, 2025; originally announced February 2025.

  4. arXiv:2501.06720  [pdf, other

    cs.CV cs.AI

    Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution

    Authors: Ashitha Mudraje, Brian B. Moser, Stanislav Frolov, Andreas Dengel

    Abstract: Satellite imagery is a cornerstone for numerous Remote Sensing (RS) applications; however, limited spatial resolution frequently hinders the precision of such systems, especially in multi-label scene classification tasks as it requires a higher level of detail and feature differentiation. In this study, we explore the efficacy of image Super-Resolution (SR) as a pre-processing step to enhance the… ▽ More

    Submitted 12 January, 2025; originally announced January 2025.

  5. arXiv:2411.15580  [pdf, other

    cs.CV

    TKG-DM: Training-free Chroma Key Content Generation Diffusion Model

    Authors: Ryugo Morita, Stanislav Frolov, Brian Bernhard Moser, Takahiro Shirakawa, Ko Watanabe, Andreas Dengel, Jinjia Zhou

    Abstract: Diffusion models have enabled the generation of high-quality images with a strong focus on realism and textual fidelity. Yet, large-scale text-to-image models, such as Stable Diffusion, struggle to generate images where foreground objects are placed over a chroma key background, limiting their ability to separate foreground and background elements without fine-tuning. To address this limitation, w… ▽ More

    Submitted 8 March, 2025; v1 submitted 23 November, 2024; originally announced November 2024.

    Comments: Accepted to CVPR2025

  6. arXiv:2411.12115  [pdf, other

    cs.CV cs.AI cs.LG

    Distill the Best, Ignore the Rest: Improving Dataset Distillation with Loss-Value-Based Pruning

    Authors: Brian B. Moser, Federico Raue, Tobias C. Nauen, Stanislav Frolov, Andreas Dengel

    Abstract: Dataset distillation has gained significant interest in recent years, yet existing approaches typically distill from the entire dataset, potentially including non-beneficial samples. We introduce a novel "Prune First, Distill After" framework that systematically prunes datasets via loss-based sampling prior to distillation. By leveraging pruning before classical distillation techniques and generat… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  7. arXiv:2411.12073  [pdf, other

    cs.CV cs.AI cs.LG

    Just Leaf It: Accelerating Diffusion Classifiers with Hierarchical Class Pruning

    Authors: Arundhati S. Shanbhag, Brian B. Moser, Tobias C. Nauen, Stanislav Frolov, Federico Raue, Andreas Dengel

    Abstract: Diffusion models, celebrated for their generative capabilities, have recently demonstrated surprising effectiveness in image classification tasks by using Bayes' theorem. Yet, current diffusion classifiers must evaluate every label candidate for each input, creating high computational costs that impede their use in large-scale applications. To address this limitation, we propose a Hierarchical Dif… ▽ More

    Submitted 7 March, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

  8. arXiv:2411.12072  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Zoomed In, Diffused Out: Towards Local Degradation-Aware Multi-Diffusion for Extreme Image Super-Resolution

    Authors: Brian B. Moser, Stanislav Frolov, Tobias C. Nauen, Federico Raue, Andreas Dengel

    Abstract: Large-scale, pre-trained Text-to-Image (T2I) diffusion models have gained significant popularity in image generation tasks and have shown unexpected potential in image Super-Resolution (SR). However, most existing T2I diffusion models are trained with a resolution limit of 512x512, making scaling beyond this resolution an unresolved but necessary challenge for image SR. In this work, we introduce… ▽ More

    Submitted 18 November, 2024; originally announced November 2024.

  9. arXiv:2411.10231  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    A Low-Resolution Image is Worth 1x1 Words: Enabling Fine Image Super-Resolution with Transformers and TaylorShift

    Authors: Sanath Budakegowdanadoddi Nagaraju, Brian Bernhard Moser, Tobias Christian Nauen, Stanislav Frolov, Federico Raue, Andreas Dengel

    Abstract: Transformer-based Super-Resolution (SR) models have recently advanced image reconstruction quality, yet challenges remain due to computational complexity and an over-reliance on large patch sizes, which constrain fine-grained detail enhancement. In this work, we propose TaylorIR to address these limitations by utilizing a patch size of 1x1, enabling pixel-level processing in any transformer-based… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  10. arXiv:2408.10397  [pdf, other

    cs.CV cs.AI cs.MM

    Webcam-based Pupil Diameter Prediction Benefits from Upscaling

    Authors: Vijul Shah, Brian B. Moser, Ko Watanabe, Andreas Dengel

    Abstract: Capturing pupil diameter is essential for assessing psychological and physiological states such as stress levels and cognitive load. However, the low resolution of images in eye datasets often hampers precise measurement. This study evaluates the impact of various upscaling methods, ranging from bicubic interpolation to advanced super-resolution, on pupil diameter predictions. We compare several p… ▽ More

    Submitted 22 December, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

    Journal ref: Proceedings of the 17th International Conference on Agents and Artificial Intelligence (ICAART 2025), Porto, Portugal, February 23-25, 2025

  11. arXiv:2407.15507  [pdf, other

    cs.CV

    SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time

    Authors: Stanislav Frolov, Brian B. Moser, Andreas Dengel

    Abstract: Generating high-resolution images with generative models has recently been made widely accessible by leveraging diffusion models pre-trained on large-scale datasets. Various techniques, such as MultiDiffusion and SyncDiffusion, have further pushed image generation beyond training resolutions, i.e., from square images to panorama, by merging multiple overlapping diffusion paths or employing gradien… ▽ More

    Submitted 7 January, 2025; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: Project page: https://spotdiffusion.github.io/

  12. arXiv:2407.11204  [pdf, other

    cs.CV cs.AI cs.CY cs.HC cs.LG

    PupilSense: A Novel Application for Webcam-Based Pupil Diameter Estimation

    Authors: Vijul Shah, Ko Watanabe, Brian B. Moser, Andreas Dengel

    Abstract: Measuring pupil diameter is vital for gaining insights into physiological and psychological states - traditionally captured by expensive, specialized equipment like Tobii eye-trackers and Pupillabs glasses. This paper presents a novel application that enables pupil diameter estimation using standard webcams, making the process accessible in everyday environments without specialized equipment. Our… ▽ More

    Submitted 28 March, 2025; v1 submitted 15 July, 2024; originally announced July 2024.

  13. arXiv:2404.17670  [pdf, other

    eess.IV cs.AI cs.CV cs.ET cs.LG

    Federated Learning for Blind Image Super-Resolution

    Authors: Brian B. Moser, Ahmed Anwar, Federico Raue, Stanislav Frolov, Andreas Dengel

    Abstract: Traditional blind image SR methods need to model real-world degradations precisely. Consequently, current research struggles with this dilemma by assuming idealized degradations, which leads to limited applicability to actual user data. Moreover, the ideal scenario - training models on data from the targeted user base - presents significant privacy concerns. To address both challenges, we propose… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  14. arXiv:2404.07564  [pdf, other

    cs.CV

    ObjBlur: A Curriculum Learning Approach With Progressive Object-Level Blurring for Improved Layout-to-Image Generation

    Authors: Stanislav Frolov, Brian B. Moser, Sebastian Palacio, Andreas Dengel

    Abstract: We present ObjBlur, a novel curriculum learning approach to improve layout-to-image generation models, where the task is to produce realistic images from layouts composed of boxes and labels. Our method is based on progressive object-level blurring, which effectively stabilizes training and enhances the quality of generated images. This curriculum learning strategy systematically applies varying d… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  15. arXiv:2403.17083  [pdf, other

    eess.IV cs.AI cs.CV cs.GR cs.LG

    A Study in Dataset Pruning for Image Super-Resolution

    Authors: Brian B. Moser, Federico Raue, Andreas Dengel

    Abstract: In image Super-Resolution (SR), relying on large datasets for training is a double-edged sword. While offering rich training material, they also demand substantial computational and storage resources. In this work, we analyze dataset pruning to solve these challenges. We introduce a novel approach that reduces a dataset to a core-set of training samples, selected based on their loss values as dete… ▽ More

    Submitted 8 June, 2024; v1 submitted 25 March, 2024; originally announced March 2024.

  16. arXiv:2403.03881  [pdf, other

    cs.CV cs.AI cs.LG

    Latent Dataset Distillation with Diffusion Models

    Authors: Brian B. Moser, Federico Raue, Sebastian Palacio, Stanislav Frolov, Andreas Dengel

    Abstract: Machine learning traditionally relies on increasingly larger datasets. Yet, such datasets pose major storage challenges and usually contain non-influential samples, which could be ignored during training without negatively impacting the training quality. In response, the idea of distilling a dataset into a condensed set of synthetic samples, i.e., a distilled dataset, emerged. One key aspect is th… ▽ More

    Submitted 11 July, 2024; v1 submitted 6 March, 2024; originally announced March 2024.

  17. arXiv:2401.00736  [pdf, other

    cs.CV cs.AI cs.LG cs.MM

    Diffusion Models, Image Super-Resolution And Everything: A Survey

    Authors: Brian B. Moser, Arundhati S. Shanbhag, Federico Raue, Stanislav Frolov, Sebastian Palacio, Andreas Dengel

    Abstract: Diffusion Models (DMs) have disrupted the image Super-Resolution (SR) field and further closed the gap between image quality and human perceptual preferences. They are easy to train and can produce very high-quality samples that exceed the realism of those produced by previous generative methods. Despite their promising results, they also come with new challenges that need further research: high c… ▽ More

    Submitted 23 June, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

  18. Dynamic Attention-Guided Diffusion for Image Super-Resolution

    Authors: Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

    Abstract: Diffusion models in image Super-Resolution (SR) treat all image regions uniformly, which risks compromising the overall image quality by potentially introducing artifacts during denoising of less-complex regions. To address this, we propose ``You Only Diffuse Areas'' (YODA), a dynamic attention-guided diffusion process for image SR. YODA selectively focuses on spatial regions defined by attention… ▽ More

    Submitted 22 November, 2024; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: Brian B. Moser and Stanislav Frolov contributed equally

  19. arXiv:2307.04593  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    DWA: Differential Wavelet Amplifier for Image Super-Resolution

    Authors: Brian B. Moser, Stanislav Frolov, Federico Raue, Sebastian Palacio, Andreas Dengel

    Abstract: This work introduces Differential Wavelet Amplifier (DWA), a drop-in module for wavelet-based image Super-Resolution (SR). DWA invigorates an approach recently receiving less attention, namely Discrete Wavelet Transformation (DWT). DWT enables an efficient image representation for SR and reduces the spatial area of its input by a factor of 4, the overall model size, and computation cost, framing i… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.