Skip to main content

Showing 1–8 of 8 results for author: Kotovenko, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.13204  [pdf, other

    cs.GR

    EDGS: Eliminating Densification for Efficient Convergence of 3DGS

    Authors: Dmytro Kotovenko, Olga Grebenkova, Björn Ommer

    Abstract: 3D Gaussian Splatting reconstructs scenes by starting from a sparse Structure-from-Motion initialization and iteratively refining under-reconstructed regions. This process is inherently slow, as it requires multiple densification steps where Gaussians are repeatedly split and adjusted, following a lengthy optimization path. Moreover, this incremental approach often leads to suboptimal renderings,… ▽ More

    Submitted 15 April, 2025; originally announced April 2025.

  2. arXiv:2412.11917  [pdf, other

    cs.CV

    Does VLM Classification Benefit from LLM Description Semantics?

    Authors: Pingchuan Ma, Lennart Rietdorf, Dmytro Kotovenko, Vincent Tao Hu, Björn Ommer

    Abstract: Accurately describing images with text is a foundation of explainable AI. Vision-Language Models (VLMs) like CLIP have recently addressed this by aligning images and texts in a shared embedding space, expressing semantic similarities between vision and language embeddings. VLM classification can be improved with descriptions generated by Large Language Models (LLMs). However, it is difficult to de… ▽ More

    Submitted 19 December, 2024; v1 submitted 16 December, 2024; originally announced December 2024.

    Comments: AAAI-25 (extended version), Code: https://github.com/CompVis/DisCLIP

  3. arXiv:2409.17917  [pdf, other

    cs.CV

    WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians

    Authors: Dmytro Kotovenko, Olga Grebenkova, Nikolaos Sarafianos, Avinash Paliwal, Pingchuan Ma, Omid Poursaeed, Sreyas Mohan, Yuchen Fan, Yilei Li, Rakesh Ranjan, Björn Ommer

    Abstract: While style transfer techniques have been well-developed for 2D image stylization, the extension of these methods to 3D scenes remains relatively unexplored. Existing approaches demonstrate proficiency in transferring colors and textures but often struggle with replicating the geometry of the scenes. In our work, we leverage an explicit Gaussian Splatting (GS) representation and directly match the… ▽ More

    Submitted 26 September, 2024; originally announced September 2024.

  4. CoherentGS: Sparse Novel View Synthesis with Coherent 3D Gaussians

    Authors: Avinash Paliwal, Wei Ye, Jinhui Xiong, Dmytro Kotovenko, Rakesh Ranjan, Vikas Chandra, Nima Khademi Kalantari

    Abstract: The field of 3D reconstruction from images has rapidly evolved in the past few years, first with the introduction of Neural Radiance Field (NeRF) and more recently with 3D Gaussian Splatting (3DGS). The latter provides a significant edge over NeRF in terms of the training and inference speed, as well as the reconstruction quality. Although 3DGS works well for dense input images, the unstructured p… ▽ More

    Submitted 7 December, 2024; v1 submitted 28 March, 2024; originally announced March 2024.

    Comments: ECCV2024, Project page: https://people.engr.tamu.edu/nimak/Papers/CoherentGS, Code: https://github.com/avinashpaliwal/CoherentGS

  5. arXiv:2403.13788  [pdf, other

    cs.CV

    DepthFM: Fast Monocular Depth Estimation with Flow Matching

    Authors: Ming Gui, Johannes Schusterbauer, Ulrich Prestel, Pingchuan Ma, Dmytro Kotovenko, Olga Grebenkova, Stefan Andreas Baumann, Vincent Tao Hu, Björn Ommer

    Abstract: Current discriminative depth estimation methods often produce blurry artifacts, while generative approaches suffer from slow sampling due to curvatures in the noise-to-depth transport. Our method addresses these challenges by framing depth estimation as a direct transport between image and depth distributions. We are the first to explore flow matching in this field, and we demonstrate that its int… ▽ More

    Submitted 19 December, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: AAAI 2025, Project Page: https://github.com/CompVis/depth-fm

  6. arXiv:2103.17185  [pdf, other

    cs.CV cs.AI cs.GR

    Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes

    Authors: Dmytro Kotovenko, Matthias Wright, Arthur Heimbrecht, Björn Ommer

    Abstract: There have been many successful implementations of neural style transfer in recent years. In most of these works, the stylization process is confined to the pixel domain. However, we argue that this representation is unnatural because paintings usually consist of brushstrokes rather than pixels. We propose a method to stylize images by optimizing parameterized brushstrokes instead of pixels and fu… ▽ More

    Submitted 31 March, 2021; originally announced March 2021.

    Comments: Accepted at CVPR2021

  7. arXiv:2003.08407  [pdf, other

    cs.CV

    A Content Transformation Block For Image Style Transfer

    Authors: Dmytro Kotovenko, Artsiom Sanakoyeu, Pingchuan Ma, Sabine Lang, Björn Ommer

    Abstract: Style transfer has recently received a lot of attention, since it allows to study fundamental challenges in image understanding and synthesis. Recent work has significantly improved the representation of color and texture and computational speed and image resolution. The explicit transformation of image content has, however, been mostly neglected: while artistic style affects formal characteristic… ▽ More

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: Accepted to CVPR 2019

  8. arXiv:1807.10201  [pdf, other

    cs.CV

    A Style-Aware Content Loss for Real-time HD Style Transfer

    Authors: Artsiom Sanakoyeu, Dmytro Kotovenko, Sabine Lang, Björn Ommer

    Abstract: Recently, style transfer has received a lot of attention. While much of this research has aimed at speeding up processing, the approaches are still lacking from a principled, art historical standpoint: a style is more than just a single image or an artist, but previous work is limited to only a single instance of a style or shows no benefit from more images. Moreover, previous work has relied on a… ▽ More

    Submitted 28 July, 2018; v1 submitted 26 July, 2018; originally announced July 2018.

    Comments: Accepted at ECCV18 (Oral)