Skip to main content

Showing 1–22 of 22 results for author: Seiler, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.13709  [pdf, other

    eess.IV cs.CV

    A Study on the Effect of Color Spaces in Learned Image Compression

    Authors: Srivatsa Prativadibhayankaram, Mahadev Prasad Panda, Jürgen Seiler, Thomas Richter, Heiko Sparenberg, Siegfried Fößel, André Kaup

    Abstract: In this work, we present a comparison between color spaces namely YUV, LAB, RGB and their effect on learned image compression. For this we use the structure and color based learned image codec (SLIC) from our prior work, which consists of two branches - one for the luminance component (Y or L) and another for chrominance components (UV or AB). However, for the RGB variant we input all 3 channels i… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: Accepter pre-print version for ICIP 2024

  2. arXiv:2306.17636  [pdf, other

    cs.CV cs.AI cs.LG

    Achieving RGB-D level Segmentation Performance from a Single ToF Camera

    Authors: Pranav Sharma, Jigyasa Singh Katrolia, Jason Rambach, Bruno Mirbach, Didier Stricker, Juergen Seiler

    Abstract: Depth is a very important modality in computer vision, typically used as complementary information to RGB, provided by RGB-D cameras. In this work, we show that it is possible to obtain the same level of accuracy as RGB-D cameras on a semantic segmentation task using infrared (IR) and depth images from a single Time-of-Flight (ToF) camera. In order to fuse the IR and depth modalities of the ToF ca… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

  3. arXiv:2304.12412  [pdf, other

    cs.CV cs.AI

    End-to-End Lidar-Camera Self-Calibration for Autonomous Vehicles

    Authors: Arya Rachman, Jürgen Seiler, André Kaup

    Abstract: Autonomous vehicles are equipped with a multi-modal sensor setup to enable the car to drive safely. The initial calibration of such perception sensors is a highly matured topic and is routinely done in an automated factory environment. However, an intriguing question arises on how to maintain the calibration quality throughout the vehicle's operating duration. Another challenge is to calibrate mul… ▽ More

    Submitted 27 April, 2023; v1 submitted 24 April, 2023; originally announced April 2023.

    Comments: Accepted for The 35th IEEE Intelligent Vehicles Symposium (IV 2023)

  4. arXiv:2211.16995  [pdf, ps, other

    eess.IV cs.CV

    A hybrid motion estimation technique for fisheye video sequences based on equisolid re-projection

    Authors: Andrea Eichenseer, Michel Bätz, Jürgen Seiler, André Kaup

    Abstract: Capturing large fields of view with only one camera is an important aspect in surveillance and automotive applications, but the wide-angle fisheye imagery thus obtained exhibits very special characteristics that may not be very well suited for typical image and video processing methods such as motion estimation. This paper introduces a motion estimation method that adapts to the typical radial cha… ▽ More

    Submitted 30 November, 2022; originally announced November 2022.

    Journal ref: IEEE International Conference on Image Processing (ICIP), 2015, pp. 3565-3569

  5. arXiv:2210.07737  [pdf, other

    cs.IT eess.IV

    On Benefits and Challenges of Conditional Interframe Video Coding in Light of Information Theory

    Authors: Fabian Brand, Jürgen Seiler, André Kaup

    Abstract: The rise of variational autoencoders for image and video compression has opened the door to many elaborate coding techniques. One example here is the possibility of conditional interframe coding. Here, instead of transmitting the residual between the original frame and the predicted frame (often obtained by motion compensation), the current frame is transmitted under the condition of knowing the p… ▽ More

    Submitted 13 December, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

    Comments: 5 pages, 4 figures, accepted to be presented at PCS 2022. arXiv admin note: text overlap with arXiv:2112.08011 Update Note: Fixed notation in Eq. 10, no changes otherwise

  6. 3D Rendering Framework for Data Augmentation in Optical Character Recognition

    Authors: Andreas Spruck, Maximiliane Hawesch, Anatol Maier, Christian Riess, Jürgen Seiler, André Kaup

    Abstract: In this paper, we propose a data augmentation framework for Optical Character Recognition (OCR). The proposed framework is able to synthesize new viewing angles and illumination scenarios, effectively enriching any available OCR dataset. Its modular structure allows to be modified to match individual user requirements. The framework enables to comfortably scale the enlargement factor of the availa… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: IEEE International Symposium on Signals, Circuits and Systems (ISSCS), 1-4, July 2021

  7. arXiv:2209.14448  [pdf, other

    cs.CV eess.IV

    Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition

    Authors: Andreas Spruck, Maximilane Gruber, Anatol Maier, Denise Moussa, Jürgen Seiler, Christian Riess, André Kaup

    Abstract: An insufficient number of training samples is a common problem in neural network applications. While data augmentation methods require at least a minimum number of samples, we propose a novel, rendering-based pipeline for synthesizing annotated data sets. Our method does not modify existing samples but synthesizes entirely new samples. The proposed rendering-based pipeline is capable of generating… ▽ More

    Submitted 28 September, 2022; originally announced September 2022.

    Comments: submitted to IEEE Transactions on Intelligent Transportation Systems

  8. Forensic License Plate Recognition with Compression-Informed Transformers

    Authors: Denise Moussa, Anatol Maier, Andreas Spruck, Jürgen Seiler, Christian Riess

    Abstract: Forensic license plate recognition (FLPR) remains an open challenge in legal contexts such as criminal investigations, where unreadable license plates (LPs) need to be deciphered from highly compressed and/or low resolution footage, e.g., from surveillance cameras. In this work, we propose a side-informed Transformer architecture that embeds knowledge on the input compression level to improve reco… ▽ More

    Submitted 3 May, 2024; v1 submitted 29 July, 2022; originally announced July 2022.

    Comments: Published at ICIP 2022, Code: https://faui1-gitlab.cs.fau.de/denise.moussa/forensic-license-plate-transformer/

    Journal ref: In IEEE International Conference on Image Processing (ICIP), pp. 406-410. IEEE, 2022

  9. arXiv:2207.01210  [pdf, ps, other

    eess.IV cs.MM

    Reusing the H.264/AVC deblocking filter for efficient spatio-temporal prediction in video coding

    Authors: Jürgen Seiler, André Kaup

    Abstract: The prediction step is a very important part of hybrid video codecs for effectively compressing video sequences. While existing video codecs predict either in temporal or in spatial direction only, the compression efficiency can be increased by a combined spatio-temporal prediction. In this paper we propose an algorithm for reusing the H.264/AVC deblocking filter for spatio-temporal prediction. Re… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Journal ref: IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2011, pp. 1049-1052

  10. arXiv:2207.00231  [pdf, ps, other

    eess.IV cs.MM

    Motion Compensated Frequency Selective Extrapolation for Error Concealment in Video Coding

    Authors: Jürgen Seiler, André Kaup

    Abstract: Although wireless and IP-based access to video content gives a new degree of freedom to the viewers, the risk of severe block losses caused by transmission errors is always present. The purpose of this paper is to present a new method for concealing block losses in erroneously received video sequences. For this, a motion compensated data set is generated around the lost block. Based on this aligne… ▽ More

    Submitted 1 July, 2022; originally announced July 2022.

    Journal ref: 16th European Signal Processing Conference, 2008

  11. Scalable Kernel-Based Minimum Mean Square Error Estimator for Accelerated Image Error Concealment

    Authors: Ján Koloda, Jürgen Seiler, Antonio M. Peinado, André Kaup

    Abstract: Error concealment is of great importance for block-based video systems, such as DVB or video streaming services. In this paper, we propose a novel scalable spatial error concealment algorithm that aims at obtaining high quality reconstructions with reduced computational burden. The proposed technique exploits the excellent reconstructing abilities of the kernel-based minimum mean square error K-MM… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    ACM Class: I.4.3; I.4.5

    Journal ref: IEEE Transactions on Broadcasting, vol. 63, no. 1, pp. 59-70, March 2017

  12. Denoising-based image reconstruction from pixels located at non-integer positions

    Authors: Ján Koloda, Jürgen Seiler, André Kaup

    Abstract: Digital images are commonly represented as regular 2D arrays, so pixels are organized in form of a matrix addressed by integers. However, there are many image processing operations, such as rotation or motion compensation, that produce pixels at non-integer positions. Typically, image reconstruction techniques cannot handle samples at non-integer positions. In this paper, we propose to use triangu… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

    Comments: arXiv admin note: text overlap with arXiv:2205.10138

    ACM Class: I.4.3; I.4.5

    Journal ref: 2015 IEEE International Conference on Image Processing (ICIP), 2015, pp. 4565-4569

  13. Reliability-based Mesh-to-Grid Image Reconstruction

    Authors: Ján Koloda, Jürgen Seiler, André Kaup

    Abstract: This paper presents a novel method for the reconstruction of images from samples located at non-integer positions, called mesh. This is a common scenario for many image processing applications, such as super-resolution, warping or virtual view generation in multi-camera systems. The proposed method relies on a set of initial estimates that are later refined by a new reliability-based content-adapt… ▽ More

    Submitted 20 May, 2022; originally announced May 2022.

    ACM Class: I.4.3; I.4.5

    Journal ref: 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP), 2016, pp. 1-5

  14. Frequency selective extrapolation with residual filtering for image error concealment

    Authors: Ján Koloda, Jürgen Seiler, André Kaup, Victoria Sánchez, Antonio M. Peinado

    Abstract: The purpose of signal extrapolation is to estimate unknown signal parts from known samples. This task is especially important for error concealment in image and video communication. For obtaining a high quality reconstruction, assumptions have to be made about the underlying signal in order to solve this underdetermined problem. Among existent reconstruction algorithms, frequency selective extrapo… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    ACM Class: I.4.3; I.4.5

    Journal ref: 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2014, pp. 1976-1980

  15. arXiv:2204.04068  [pdf, other

    eess.AS cs.SD

    Declipping of Speech Signals Using Frequency Selective Extrapolation

    Authors: Markus Jonscher, Jürgen Seiler, André Kaup

    Abstract: The reconstruction of clipped speech signals is an important task in audio signal processing to achieve an enhanced audio quality for further processing. In this paper, Frequency Selective Extrapolation (FSE), which is commonly used for error concealment or the reconstruction of incomplete image data, is adapted to be able to restore audio signals which are distorted from clipping. For this, FSE g… ▽ More

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: 4 pages, 5 figures, 2 tables, Speech Communication 11. ITG Symposium

  16. Novel Consistency Check For Fast Recursive Reconstruction Of Non-Regularly Sampled Video Data

    Authors: Simon Grosche, Jürgen Seiler, André Kaup

    Abstract: Quarter sampling is a novel sensor design that allows for an acquisition of higher resolution images without increasing the number of pixels. When being used for video data, one out of four pixels is measured in each frame. Effectively, this leads to a non-regular spatio-temporal sub-sampling. Compared to purely spatial or temporal sub-sampling, this allows for an increased reconstruction quality,… ▽ More

    Submitted 17 March, 2022; originally announced March 2022.

    Comments: 5 pages, 5 figures, 3 tables, IEEE International Conference on Image Processing (ICIP)

  17. arXiv:2112.08011  [pdf, other

    eess.IV cs.IT

    Generalized Difference Coder: A Novel Conditional Autoencoder Structure for Video Compression

    Authors: Fabian Brand, Jürgen Seiler, André Kaup

    Abstract: Motion compensated inter prediction is a common component of all video coders. The concept was established in traditional hybrid coding and successfully transferred to learning-based video compression. To compress the residual signal after prediction, usually the difference of the two signals is compressed using a standard autoencoder. However, information theory tells us that a general conditiona… ▽ More

    Submitted 15 December, 2021; originally announced December 2021.

  18. arXiv:2111.09013  [pdf, other

    eess.IV cs.CV

    Image Super-Resolution Using T-Tetromino Pixels

    Authors: Simon Grosche, Andy Regensky, Jürgen Seiler, André Kaup

    Abstract: For modern high-resolution imaging sensors, pixel binning is performed in low-lighting conditions and in case high frame rates are required. To recover the original spatial resolution, single-image super-resolution techniques can be applied for upscaling. To achieve a higher image quality after upscaling, we propose a novel binning concept using tetromino-shaped pixels. It is embedded into the fie… ▽ More

    Submitted 22 March, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, 9 figures, 4 tables. This work has been submitted to the IEEE for possible publication

  19. arXiv:1701.06852  [pdf, other

    cs.IT cs.AI stat.ML

    Incorporating Prior Information in Compressive Online Robust Principal Component Analysis

    Authors: Huynh Van Luong, Nikos Deligiannis, Jurgen Seiler, Soren Forchhammer, Andre Kaup

    Abstract: We consider an online version of the robust Principle Component Analysis (PCA), which arises naturally in time-varying source separations such as video foreground-background separation. This paper proposes a compressive online robust PCA with prior information for recursively separating a sequences of frames into sparse and low-rank components from a small set of measurements. In contrast to conve… ▽ More

    Submitted 27 May, 2017; v1 submitted 24 January, 2017; originally announced January 2017.

  20. arXiv:1605.06776  [pdf, other

    cs.CV math.OC

    Sparse Signal Reconstruction with Multiple Side Information using Adaptive Weights for Multiview Sources

    Authors: Huynh Van Luong, Jürgen Seiler, André Kaup, Søren Forchhammer

    Abstract: This work considers reconstructing a target signal in a context of distributed sparse sources. We propose an efficient reconstruction algorithm with the aid of other given sources as multiple side information (SI). The proposed algorithm takes advantage of compressive sensing (CS) with SI and adaptive weights by solving a proposed weighted $n$-$\ell_{1}$ minimization. The proposed algorithm comput… ▽ More

    Submitted 22 May, 2016; originally announced May 2016.

    Comments: Submitted to the IEEE International Conference on Image Processing 2016

  21. arXiv:1605.03234  [pdf, other

    cs.IT cs.CV math.OC

    Measurement Bounds for Sparse Signal Reconstruction with Multiple Side Information

    Authors: Huynh Van Luong, Jurgen Seiler, Andre Kaup, Soren Forchhammer, Nikos Deligiannis

    Abstract: In the context of compressed sensing (CS), this paper considers the problem of reconstructing sparse signals with the aid of other given correlated sources as multiple side information. To address this problem, we theoretically study a generic \textcolor{black}{weighted $n$-$\ell_{1}$ minimization} framework and propose a reconstruction algorithm that leverages multiple side information signals (R… ▽ More

    Submitted 18 January, 2017; v1 submitted 10 May, 2016; originally announced May 2016.

    Comments: submitted to a journal

  22. arXiv:1502.07449  [pdf

    cs.ET cs.AR cs.CV

    Concept for a CMOS Image Sensor Suited for Analog Image Pre-Processing

    Authors: Lan Shi, Christopher Soell, Andreas Baenisch, Robert Weigel, Jürgen Seiler, Thomas Ussmueller

    Abstract: A concept for a novel CMOS image sensor suited for analog image pre-processing is presented in this paper. As an example, an image restoration algorithm for reducing image noise is applied as image pre-processing in the analog domain. To supply low-latency data input for analog image preprocessing, the proposed concept for a CMOS image sensor offers a new sensor signal acquisition method in 2D. In… ▽ More

    Submitted 26 February, 2015; originally announced February 2015.

    Comments: Presented at DATE Friday Workshop on Heterogeneous Architectures and Design Methods for Embedded Image Systems (HIS 2015) (arXiv:1502.07241)

    Report number: DATEHIS/2015/04