Skip to main content

Showing 1–22 of 22 results for author: Ackermann, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2509.20503  [pdf, ps, other

    cs.LG

    Myosotis: structured computation for attention like layer

    Authors: Evgenii Egorov, Hanno Ackermann, Markus Nagel, Hong Cai

    Abstract: Attention layers apply a sequence-to-sequence mapping whose parameters depend on the pairwise interactions of the input elements. However, without any structural assumptions, memory and compute scale quadratically with the sequence length. The two main ways to mitigate this are to introduce sparsity by ignoring a sufficient amount of pairwise interactions or to introduce recurrent dependence along… ▽ More

    Submitted 24 September, 2025; originally announced September 2025.

  2. arXiv:2506.03290  [pdf, other

    cs.CV

    Learning Optical Flow Field via Neural Ordinary Differential Equation

    Authors: Leyla Mirvakhabova, Hong Cai, Jisoo Jeong, Hanno Ackermann, Farhad Zanjani, Fatih Porikli

    Abstract: Recent works on optical flow estimation use neural networks to predict the flow field that maps positions of one image to positions of the other. These networks consist of a feature extractor, a correlation volume, and finally several refinement steps. These refinement steps mimic the iterative refinements performed by classical optimization algorithms and are usually implemented by neural layers… ▽ More

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: CVPRW 2025

  3. arXiv:2412.01931  [pdf, other

    cs.CV

    Planar Gaussian Splatting

    Authors: Farhad G. Zanjani, Hong Cai, Hanno Ackermann, Leila Mirvakhabova, Fatih Porikli

    Abstract: This paper presents Planar Gaussian Splatting (PGS), a novel neural rendering approach to learn the 3D geometry and parse the 3D planes of a scene, directly from multiple RGB images. The PGS leverages Gaussian primitives to model the scene and employ a hierarchical Gaussian mixture approach to group them. Similar Gaussians are progressively merged probabilistically in the tree-structured Gaussian… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2025

  4. arXiv:2211.02667  [pdf, other

    cs.LG stat.ML

    Deconfounding Imitation Learning with Variational Inference

    Authors: Risto Vuorio, Pim de Haan, Johann Brehmer, Hanno Ackermann, Daniel Dijkman, Taco Cohen

    Abstract: Standard imitation learning can fail when the expert demonstrators have different sensory inputs than the imitating agent. This is because partial observability gives rise to hidden confounders in the causal graph. In previous work, to work around the confounding problem, policies have been trained using query access to the expert's policy or inverse reinforcement learning (IRL). However, both app… ▽ More

    Submitted 25 August, 2024; v1 submitted 4 November, 2022; originally announced November 2022.

  5. arXiv:2107.12309  [pdf, other

    cs.CV

    Spatial-Temporal Transformer for Dynamic Scene Graph Generation

    Authors: Yuren Cong, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn, Michael Ying Yang

    Abstract: Dynamic scene graph generation aims at generating a scene graph of the given video. Compared to the task of scene graph generation from images, it is more challenging because of the dynamic relationships between objects and the temporal dependencies between frames allowing for a richer semantic interpretation. In this paper, we propose Spatial-temporal Transformer (STTran), a neural network that c… ▽ More

    Submitted 8 August, 2021; v1 submitted 26 July, 2021; originally announced July 2021.

    Comments: accepted by ICCV 2021

  6. arXiv:2105.02047  [pdf, other

    cs.CV

    Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images

    Authors: Florian Kluger, Hanno Ackermann, Eric Brachmann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: Humans perceive and construct the surrounding world as an arrangement of simple parametric models. In particular, man-made environments commonly consist of volumetric primitives such as cuboids or cylinders. Inferring these primitives is an important step to attain high-level, abstract scene descriptions. Previous approaches directly estimate shape parameters from a 2D or 3D input, and are only ab… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: CVPR 2021

  7. Scheduling a Proportionate Flow Shop of Batching Machines

    Authors: Christoph Hertrich, Christian Weiß, Heiner Ackermann, Sandy Heydrich, Sven O. Krumke

    Abstract: In this paper we study a proportionate flow shop of batching machines with release dates and a fixed number $m \geq 2$ of machines. The scheduling problem has so far barely received any attention in the literature, but recently its importance has increased significantly, due to applications in the industrial scaling of modern bio-medicine production processes. We show that for any fixed number of… ▽ More

    Submitted 26 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Version 2: replace initial preprint with authors' accepted manuscript

    Journal ref: Journal of Scheduling 23, 575-593 (2020)

  8. Online Algorithms to Schedule a Proportionate Flexible Flow Shop of Batching Machines

    Authors: Christoph Hertrich, Christian Weiß, Heiner Ackermann, Sandy Heydrich, Sven O. Krumke

    Abstract: This paper is the first to consider online algorithms to schedule a proportionate flexible flow shop of batching machines (PFFB). The scheduling model is motivated by manufacturing processes of individualized medicaments, which are used in modern medicine to treat some serious illnesses. We provide two different online algorithms, proving also lower bounds for the offline problem to compute their… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Authors' accepted manuscript

    Journal ref: Journal of Scheduling 25, 643-657 (2022)

  9. arXiv:2001.04735  [pdf, other

    cs.CV

    NODIS: Neural Ordinary Differential Scene Understanding

    Authors: Cong Yuren, Hanno Ackermann, Wentong Liao, Michael Ying Yang, Bodo Rosenhahn

    Abstract: Semantic image understanding is a challenging topic in computer vision. It requires to detect all objects in an image, but also to identify all the relations between them. Detected objects, their labels and the discovered relations can be used to construct a scene graph which provides an abstract semantic interpretation of an image. In previous works, relations were identified by solving an assign… ▽ More

    Submitted 18 July, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

  10. arXiv:2001.02643  [pdf, other

    cs.CV

    CONSAC: Robust Multi-Model Fitting by Conditional Sample Consensus

    Authors: Florian Kluger, Eric Brachmann, Hanno Ackermann, Carsten Rother, Michael Ying Yang, Bodo Rosenhahn

    Abstract: We present a robust estimator for fitting multiple parametric models of the same form to noisy measurements. Applications include finding multiple vanishing points in man-made scenes, fitting planes to architectural imagery, or estimating multiple rigid motions within the same sequence. In contrast to previous works, which resorted to hand-crafted search strategies for multiple model detection, we… ▽ More

    Submitted 25 March, 2020; v1 submitted 8 January, 2020; originally announced January 2020.

    Comments: CVPR 2020

  11. arXiv:1908.08989  [pdf, other

    cs.CV cs.LG stat.ML

    Learning Disentangled Representations via Independent Subspaces

    Authors: Maren Awiszus, Hanno Ackermann, Bodo Rosenhahn

    Abstract: Image generating neural networks are mostly viewed as black boxes, where any change in the input can have a number of globally effective changes on the output. In this work, we propose a method for learning disentangled representations to allow for localized image manipulations. We use face images as our example of choice. Depending on the image region, identity and other facial attributes can be… ▽ More

    Submitted 26 August, 2019; originally announced August 2019.

    Comments: Accepted at ICCV 2019 Workshop on Robust Subspace Learning and Applications in Computer Vision

  12. arXiv:1907.10014  [pdf, other

    cs.CV

    Temporally Consistent Horizon Lines

    Authors: Florian Kluger, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: The horizon line is an important geometric feature for many image processing and scene understanding tasks in computer vision. For instance, in navigation of autonomous vehicles or driver assistance, it can be used to improve 3D reconstruction as well as for semantic interpretation of dynamic environments. While both algorithms and datasets exist for single images, the problem of horizon line esti… ▽ More

    Submitted 9 January, 2020; v1 submitted 23 July, 2019; originally announced July 2019.

  13. arXiv:1904.13271  [pdf, other

    cs.CV

    Non-Rigid Structure-From-Motion by Rank-One Basis Shapes

    Authors: Sami S. Brandt, Hanno Ackermann

    Abstract: In this paper, we show that the affine, non-rigid structure-from-motion problem can be solved by rank-one, thus degenerate, basis shapes. It is a natural reformulation of the classic low-rank method by Bregler et al., where it was assumed that the deformable 3D structure is generated by a linear combination of rigid basis shapes. The non-rigid shape will be decomposed into the mean shape and the d… ▽ More

    Submitted 30 April, 2019; originally announced April 2019.

  14. arXiv:1811.09132  [pdf, other

    cs.CV

    Uncalibrated Non-Rigid Factorisation by Independent Subspace Analysis

    Authors: Sami Sebastian Brandt, Hanno Ackermann, Stella Grasshof

    Abstract: We propose a general, prior-free approach for the uncalibrated non-rigid structure-from-motion problem for modelling and analysis of non-rigid objects such as human faces. The word general refers to an approach that recovers the non-rigid affine structure and motion from 2D point correspondences by assuming that (1) the non-rigid shapes are generated by a linear combination of rigid 3D basis shape… ▽ More

    Submitted 22 November, 2018; originally announced November 2018.

  15. arXiv:1709.05910  [pdf, other

    cs.CV cs.RO

    Object Recognition from very few Training Examples for Enhancing Bicycle Maps

    Authors: Christoph Reinders, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: In recent years, data-driven methods have shown great success for extracting information about the infrastructure in urban areas. These algorithms are usually trained on large datasets consisting of thousands or millions of labeled training examples. While large datasets have been published regarding cars, for cyclists very few labeled data is available although appearance, point of view, and posi… ▽ More

    Submitted 28 May, 2018; v1 submitted 18 September, 2017; originally announced September 2017.

    Comments: Submitted to IV 2018. This research was supported by German Research Foundation DFG within Priority Research Programme 1894 "Volunteered Geographic Information: Interpretation, Visualization and Social Computing"

  16. arXiv:1707.02427  [pdf, other

    cs.CV

    Deep Learning for Vanishing Point Detection Using an Inverse Gnomonic Projection

    Authors: Florian Kluger, Hanno Ackermann, Michael Ying Yang, Bodo Rosenhahn

    Abstract: We present a novel approach for vanishing point detection from uncalibrated monocular images. In contrast to state-of-the-art, we make no a priori assumptions about the observed scene. Our method is based on a convolutional neural network (CNN) which does not use natural images, but a Gaussian sphere representation arising from an inverse gnomonic projection of lines detected in an image. This all… ▽ More

    Submitted 16 November, 2017; v1 submitted 8 July, 2017; originally announced July 2017.

    Comments: Accepted for publication at German Conference on Pattern Recognition (GCPR) 2017. This research was supported by German Research Foundation DFG within Priority Research Programme 1894 "Volunteered Geographic Information: Interpretation, Visualisation and Social Computing"

  17. arXiv:1702.00186  [pdf, other

    cs.CV

    A Kinematic Chain Space for Monocular Motion Capture

    Authors: Bastian Wandt, Hanno Ackermann, Bodo Rosenhahn

    Abstract: This paper deals with motion capture of kinematic chains (e.g. human skeletons) from monocular image sequences taken by uncalibrated cameras. We present a method based on projecting an observation into a kinematic chain space (KCS). An optimization of the nuclear norm is proposed that implicitly enforces structural properties of the kinematic chain. Unlike other approaches our method does not requ… ▽ More

    Submitted 1 February, 2017; originally announced February 2017.

  18. Who With Whom And How?: Extracting Large Social Networks Using Search Engines

    Authors: Stefan Siersdorfer, Philipp Kemkes, Hanno Ackermann, Sergej Zerr

    Abstract: Social network analysis is leveraged in a variety of applications such as identifying influential entities, detecting communities with special interests, and determining the flow of information and innovations. However, existing approaches for extracting social networks from unstructured Web content do not scale well and are only feasible for small graphs. In this paper, we introduce novel methodo… ▽ More

    Submitted 28 January, 2017; originally announced January 2017.

    Journal ref: CIKM 2015 Proceedings of the 24th ACM International on Conference on Information and Knowledge Management Pages 1491-1500

  19. arXiv:1701.06944  [pdf, other

    cs.CV

    Motion Segmentation via Global and Local Sparse Subspace Optimization

    Authors: Michael Ying Yang, Hanno Ackermann, Weiyao Lin, Sitong Feng, Bodo Rosenhahn

    Abstract: In this paper, we propose a new framework for segmenting feature-based moving objects under affine subspace model. Since the feature trajectories in practice are high-dimensional and contain a lot of noise, we firstly apply the sparse PCA to represent the original trajectories with a low-dimensional global subspace, which consists of the orthogonal sparse principal vectors. Subsequently, the local… ▽ More

    Submitted 24 January, 2017; originally announced January 2017.

    Comments: 11 pages

  20. On Support Relations and Semantic Scene Graphs

    Authors: Michael Ying Yang, Wentong Liao, Hanno Ackermann, Bodo Rosenhahn

    Abstract: Scene understanding is a popular and challenging topic in both computer vision and photogrammetry. Scene graph provides rich information for such scene understanding. This paper presents a novel approach to infer such relations and then to construct the scene graph. Support relations are estimated by considering important, previously ignored information: the physical stability and the prior suppor… ▽ More

    Submitted 16 November, 2017; v1 submitted 19 September, 2016; originally announced September 2016.

    Comments: Accepted in ISPRS Journal of Photogrammetry and Remote Sensing

  21. arXiv:0808.2081  [pdf, ps, other

    cs.GT

    Concurrent Imitation Dynamics in Congestion Games

    Authors: Heiner Ackermann, Petra Berenbrink, Simon Fischer, Martin Hoefer

    Abstract: Imitating successful behavior is a natural and frequently applied approach to trust in when facing scenarios for which we have little or no experience upon which we can base our decision. In this paper, we consider such behavior in atomic congestion games. We propose to study concurrent imitation dynamics that emerge when each player samples another player and possibly imitates this agents' stra… ▽ More

    Submitted 3 October, 2008; v1 submitted 14 August, 2008; originally announced August 2008.

    Comments: 28 pages, 1 figure

    ACM Class: F.2.2; G.2; G.3

  22. arXiv:0805.1130  [pdf, ps, other

    cs.GT

    On the Convergence Time of the Best Response Dynamics in Player-specific Congestion Games

    Authors: Heiner Ackermann, Heiko Roeglin

    Abstract: We study the convergence time of the best response dynamics in player-specific singleton congestion games. It is well known that this dynamics can cycle, although from every state a short sequence of best responses to a Nash equilibrium exists. Thus, the random best response dynamics, which selects the next player to play a best response uniformly at random, terminates in a Nash equilibrium with… ▽ More

    Submitted 8 May, 2008; originally announced May 2008.