Skip to main content

Showing 1–29 of 29 results for author: Peer, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2504.07392  [pdf, other

    cs.CV

    ID-Booth: Identity-consistent Face Generation with Diffusion Models

    Authors: Darian Tomašević, Fadi Boutros, Chenhao Lin, Naser Damer, Vitomir Štruc, Peter Peer

    Abstract: Recent advances in generative modeling have enabled the generation of high-quality synthetic data that is applicable in a variety of domains, including face recognition. Here, state-of-the-art generative models typically rely on conditioning and fine-tuning of powerful pretrained diffusion models to facilitate the synthesis of realistic images of a desired identity. Yet, these models often do not… ▽ More

    Submitted 4 May, 2025; v1 submitted 9 April, 2025; originally announced April 2025.

    Comments: IEEE International Conference on Automatic Face and Gesture Recognition (FG) 2025, 14 pages

  2. arXiv:2504.05504  [pdf, other

    cs.CV

    SelfMAD: Enhancing Generalization and Robustness in Morphing Attack Detection via Self-Supervised Learning

    Authors: Marija Ivanovska, Leon Todorov, Naser Damer, Deepak Kumar Jain, Peter Peer, Vitomir Štruc

    Abstract: With the continuous advancement of generative models, face morphing attacks have become a significant challenge for existing face verification systems due to their potential use in identity fraud and other malicious activities. Contemporary Morphing Attack Detection (MAD) approaches frequently rely on supervised, discriminative models trained on examples of bona fide and morphed images. These mode… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

    Comments: Accepted at IEEE International Conference on Automatic Face and Gesture Recognition (FG 2025)

  3. arXiv:2502.07734  [pdf, other

    cs.CV cs.AI

    EdgeEar: Efficient and Accurate Ear Recognition for Edge Devices

    Authors: Camile Lendering, Bernardo Perrone Ribeiro, Žiga Emeršič, Peter Peer

    Abstract: Ear recognition is a contactless and unobtrusive biometric technique with applications across various domains. However, deploying high-performing ear recognition models on resource-constrained devices is challenging, limiting their applicability and widespread adoption. This paper introduces EdgeEar, a lightweight model based on a proposed hybrid CNN-transformer architecture to solve this problem.… ▽ More

    Submitted 11 February, 2025; originally announced February 2025.

    Comments: Submitted to IEEE FG 2025

  4. arXiv:2501.03800  [pdf, other

    cs.CV cs.CR

    MADation: Face Morphing Attack Detection with Foundation Models

    Authors: Eduarda Caldeira, Guray Ozgur, Tahar Chettaoui, Marija Ivanovska, Peter Peer, Fadi Boutros, Vitomir Struc, Naser Damer

    Abstract: Despite the considerable performance improvements of face recognition algorithms in recent years, the same scientific advances responsible for this progress can also be used to create efficient ways to attack them, posing a threat to their secure deployment. Morphing attack detection (MAD) systems aim to detect a specific type of threat, morphing attacks, at an early stage, preventing them from be… ▽ More

    Submitted 27 January, 2025; v1 submitted 7 January, 2025; originally announced January 2025.

    Comments: Accepted at WACV 2025 workshops

  5. arXiv:2407.03901  [pdf, other

    cs.CV cs.LG

    DiCTI: Diffusion-based Clothing Designer via Text-guided Input

    Authors: Ajda Lampe, Julija Stopar, Deepak Kumar Jain, Shinichiro Omachi, Peter Peer, Vitomir Štruc

    Abstract: Recent developments in deep generative models have opened up a wide range of opportunities for image synthesis, leading to significant changes in various creative fields, including the fashion industry. While numerous methods have been proposed to benefit buyers, particularly in virtual try-on applications, there has been relatively less focus on facilitating fast prototyping for designers and cus… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted to FG 2024

  6. arXiv:2404.09555  [pdf, other

    cs.CV

    AI-KD: Towards Alignment Invariant Face Image Quality Assessment Using Knowledge Distillation

    Authors: Žiga Babnik, Fadi Boutros, Naser Damer, Peter Peer, Vitomir Štruc

    Abstract: Face Image Quality Assessment (FIQA) techniques have seen steady improvements over recent years, but their performance still deteriorates if the input face samples are not properly aligned. This alignment sensitivity comes from the fact that most FIQA techniques are trained or designed using a specific face alignment procedure. If the alignment technique changes, the performance of most existing F… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: IEEE International Workshop on Biometrics and Forensics (IWBF) 2024, pp. 6

  7. arXiv:2306.05985  [pdf, ps, other

    cs.CV

    Beyond Detection: Visual Realism Assessment of Deepfakes

    Authors: Luka Dragar, Peter Peer, Vitomir Štruc, Borut Batagelj

    Abstract: In the era of rapid digitalization and artificial intelligence advancements, the development of DeepFake technology has posed significant security and privacy concerns. This paper presents an effective measure to assess the visual realism of DeepFake videos. We utilize an ensemble of two Convolutional Neural Network (CNN) models: Eva and ConvNext. These models have been trained on the DeepFake Gam… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  8. arXiv:2305.05768  [pdf, other

    cs.CV cs.LG

    DifFIQA: Face Image Quality Assessment Using Denoising Diffusion Probabilistic Models

    Authors: Žiga Babnik, Peter Peer, Vitomir Štruc

    Abstract: Modern face recognition (FR) models excel in constrained scenarios, but often suffer from decreased performance when deployed in unconstrained (real-world) environments due to uncertainties surrounding the quality of the captured facial data. Face image quality assessment (FIQA) techniques aim to mitigate these performance degradations by providing FR models with sample-quality predictions that ca… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  9. Body Segmentation Using Multi-task Learning

    Authors: Julijan Jug, Ajda Lampe, Vitomir Štruc, Peter Peer

    Abstract: Body segmentation is an important step in many computer vision problems involving human images and one of the key components that affects the performance of all downstream tasks. Several prior works have approached this problem using a multi-task model that exploits correlations between different tasks to improve segmentation performance. Based on the success of such solutions, we present in this… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

  10. C-VTON: Context-Driven Image-Based Virtual Try-On Network

    Authors: Benjamin Fele, Ajda Lampe, Peter Peer, Vitomir Štruc

    Abstract: Image-based virtual try-on techniques have shown great promise for enhancing the user-experience and improving customer satisfaction on fashion-oriented e-commerce platforms. However, existing techniques are currently still limited in the quality of the try-on results they are able to produce from input images of diverse characteristics. In this work, we propose a Context-Driven Virtual Try-On Net… ▽ More

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: Accepted to WACV 2022

  11. arXiv:2212.02127  [pdf, other

    cs.CV

    FaceQAN: Face Image Quality Assessment Through Adversarial Noise Exploration

    Authors: Žiga Babnik, Peter Peer, Vitomir Štruc

    Abstract: Recent state-of-the-art face recognition (FR) approaches have achieved impressive performance, yet unconstrained face recognition still represents an open problem. Face image quality assessment (FIQA) approaches aim to estimate the quality of the input samples that can help provide information on the confidence of the recognition decision and eventually lead to improved results in challenging scen… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: The content of this paper was published in ICPR 2022

  12. arXiv:2211.11296  [pdf, other

    cs.CV

    SeeABLE: Soft Discrepancies and Bounded Contrastive Learning for Exposing Deepfakes

    Authors: Nicolas Larue, Ngoc-Son Vu, Vitomir Struc, Peter Peer, Vassilis Christophides

    Abstract: Modern deepfake detectors have achieved encouraging results, when training and test images are drawn from the same data collection. However, when these detectors are applied to images produced with unknown deepfake-generation techniques, considerable performance degradations are commonly observed. In this paper, we propose a novel deepfake detector, called SeeABLE, that formalizes the detection pr… ▽ More

    Submitted 1 October, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted at ICCV 2023

    Journal ref: 2023, Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), 21011-21021

  13. arXiv:2211.08864  [pdf, other

    cs.CV

    PrivacyProber: Assessment and Detection of Soft-Biometric Privacy-Enhancing Techniques

    Authors: Peter Rot, Peter Peer, Vitomir Štruc

    Abstract: Soft-biometric privacy-enhancing techniques represent machine learning methods that aim to: (i) mitigate privacy concerns associated with face recognition technology by suppressing selected soft-biometric attributes in facial images (e.g., gender, age, ethnicity) and (ii) make unsolicited extraction of sensitive personal information infeasible. Because such techniques are increasingly used in real… ▽ More

    Submitted 22 November, 2022; v1 submitted 16 November, 2022; originally announced November 2022.

  14. arXiv:2210.14145  [pdf, other

    cs.CV eess.IV

    GlassesGAN: Eyewear Personalization using Synthetic Appearance Discovery and Targeted Subspace Modeling

    Authors: Richard Plesh, Peter Peer, Vitomir Štruc

    Abstract: We present GlassesGAN, a novel image editing framework for custom design of glasses, that sets a new standard in terms of image quality, edit realism, and continuous multi-style edit capability. To facilitate the editing process with GlassesGAN, we propose a Targeted Subspace Modelling (TSM) procedure that, based on a novel mechanism for (synthetic) appearance discovery in the latent space of a pr… ▽ More

    Submitted 18 November, 2022; v1 submitted 24 October, 2022; originally announced October 2022.

    Comments: 18 pages, 18 figures, 3 tables

  15. arXiv:2209.07619  [pdf, other

    cs.CV

    Hierarchical Superquadric Decomposition with Implicit Space Separation

    Authors: Jaka Šircelj, Peter Peer, Franc Solina, Vitomir Štruc

    Abstract: We introduce a new method to reconstruct 3D objects using a set of volumetric primitives, i.e., superquadrics. The method hierarchically decomposes a target 3D object into pairs of superquadrics recovering finer and finer details. While such hierarchical methods have been studied before, we introduce a new way of splitting the object space using only properties of the predicted superquadrics. The… ▽ More

    Submitted 15 September, 2022; originally announced September 2022.

  16. arXiv:2208.07337  [pdf, other

    cs.CV

    SYN-MAD 2022: Competition on Face Morphing Attack Detection Based on Privacy-aware Synthetic Training Data

    Authors: Marco Huber, Fadi Boutros, Anh Thi Luu, Kiran Raja, Raghavendra Ramachandra, Naser Damer, Pedro C. Neto, Tiago Gonçalves, Ana F. Sequeira, Jaime S. Cardoso, João Tremoço, Miguel Lourenço, Sergio Serra, Eduardo Cermeño, Marija Ivanovska, Borut Batagelj, Andrej Kronovšek, Peter Peer, Vitomir Štruc

    Abstract: This paper presents a summary of the Competition on Face Morphing Attack Detection Based on Privacy-aware Synthetic Training Data (SYN-MAD) held at the 2022 International Joint Conference on Biometrics (IJCB 2022). The competition attracted a total of 12 participating teams, both from academia and industry and present in 11 different countries. In the end, seven valid submissions were submitted by… ▽ More

    Submitted 15 August, 2022; originally announced August 2022.

    Comments: Accepted at International Joint Conference on Biometrics (IJCB) 2022

  17. arXiv:2207.00899  [pdf, other

    cs.CV

    Face Morphing Attack Detection Using Privacy-Aware Training Data

    Authors: Marija Ivanovska, Andrej Kronovšek, Peter Peer, Vitomir Štruc, Borut Batagelj

    Abstract: Images of morphed faces pose a serious threat to face recognition--based security systems, as they can be used to illegally verify the identity of multiple people with a single morphed image. Modern detection algorithms learn to identify such morphing attacks using authentic images of real individuals. This approach raises various privacy concerns and limits the amount of publicly available traini… ▽ More

    Submitted 2 July, 2022; originally announced July 2022.

  18. arXiv:2205.01536  [pdf, other

    cs.CV cs.LG

    BiOcularGAN: Bimodal Synthesis and Annotation of Ocular Images

    Authors: Darian Tomašević, Peter Peer, Vitomir Štruc

    Abstract: Current state-of-the-art segmentation techniques for ocular images are critically dependent on large-scale annotated datasets, which are labor-intensive to gather and often raise privacy concerns. In this paper, we present a novel framework, called BiOcularGAN, capable of generating synthetic large-scale datasets of photorealistic (visible light and near-infrared) ocular images, together with corr… ▽ More

    Submitted 8 December, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

    Comments: 13 pages, 14 figures

  19. arXiv:2001.10504  [pdf, other

    cs.CV

    Segmentation and Recovery of Superquadric Models using Convolutional Neural Networks

    Authors: Jaka Šircelj, Tim Oblak, Klemen Grm, Uroš Petković, Aleš Jaklič, Peter Peer, Vitomir Štruc, Franc Solina

    Abstract: In this paper we address the problem of representing 3D visual data with parameterized volumetric shape primitives. Specifically, we present a (two-stage) approach built around convolutional neural networks (CNNs) capable of segmenting complex depth scenes into the simpler geometric structures that can be represented with superquadric models. In the first stage, our approach uses a Mask RCNN model… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: 8 pages, in Computer Vision Winter Workshop, 2020

  20. arXiv:1904.10787  [pdf, other

    cs.CV

    Simultaneous regression and feature learning for facial landmarking

    Authors: Janez Križaj, Peter Peer, Vitomir Štruc, Simon Dobrišek

    Abstract: Face alignment (or facial landmarking) is an important task in many face-related applications, ranging from registration, tracking and animation to higher-level classification problems such as face, expression or attribute recognition. While several solutions have been presented in the literature for this task so far, reliably locating salient facial features across a wide range of posses still re… ▽ More

    Submitted 24 April, 2019; originally announced April 2019.

  21. Recovery of Superquadrics from Range Images using Deep Learning: A Preliminary Study

    Authors: Tim Oblak, Klemen Grm, Aleš Jaklič, Peter Peer, Vitomir Štruc, Franc Solina

    Abstract: It has been a longstanding goal in computer vision to describe the 3D physical space in terms of parameterized volumetric models that would allow autonomous machines to understand and interact with their surroundings. Such models are typically motivated by human visual perception and aim to represents all elements of the physical word ranging from individual objects to complex scenes using a small… ▽ More

    Submitted 28 July, 2020; v1 submitted 13 April, 2019; originally announced April 2019.

    Journal ref: In 2019 International Work Conference on Bioinspired Intelligence (IWOBI), pp. 45-52. IEEE, 2019

  22. arXiv:1903.04143  [pdf, other

    cs.CV

    The Unconstrained Ear Recognition Challenge 2019 - ArXiv Version With Appendix

    Authors: Žiga Emeršič, Aruna Kumar S. V., B. S. Harish, Weronika Gutfeter, Jalil Nourmohammadi Khiarak, Andrzej Pacut, Earnest Hansley, Mauricio Pamplona Segundo, Sudeep Sarkar, Hyeonjung Park, Gi Pyo Nam, Ig-Jae Kim, Sagar G. Sangodkar, Ümit Kaçar, Murvet Kirci, Li Yuan, Jishou Yuan, Haonan Zhao, Fei Lu, Junying Mao, Xiaoshuang Zhang, Dogucan Yaman, Fevziye Irem Eyiokur, Kadir Bulut Özler, Hazım Kemal Ekenel , et al. (6 additional authors not shown)

    Abstract: This paper presents a summary of the 2019 Unconstrained Ear Recognition Challenge (UERC), the second in a series of group benchmarking efforts centered around the problem of person recognition from ear images captured in uncontrolled settings. The goal of the challenge is to assess the performance of existing ear recognition techniques on a challenging large-scale ear dataset and to analyze perfor… ▽ More

    Submitted 14 March, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: The content of this paper was published in ICB, 2019. This ArXiv version is from before the peer review

  23. Influence of segmentation on deep iris recognition performance

    Authors: Juš Lozej, Dejan Štepec, Vitomir Štruc, Peter Peer

    Abstract: Despite the rise of deep learning in numerous areas of computer vision and image processing, iris recognition has not benefited considerably from these trends so far. Most of the existing research on deep iris recognition is focused on new models for generating discriminative and robust iris representations and relies on methodologies akin to traditional iris recognition pipelines. Hence, the prop… ▽ More

    Submitted 8 May, 2020; v1 submitted 29 January, 2019; originally announced January 2019.

    Comments: 6 pages, 3 figures, 3 tables, submitted to IWBF 2019

  24. Training Convolutional Neural Networks with Limited Training Data for Ear Recognition in the Wild

    Authors: Žiga Emeršič, Dejan Štepec, Vitomir Štruc, Peter Peer

    Abstract: Identity recognition from ear images is an active field of research within the biometric community. The ability to capture ear images from a distance and in a covert manner makes ear recognition technology an appealing choice for surveillance and security applications as well as related application domains. In contrast to other biometric modalities, where large datasets captured in uncontrolled se… ▽ More

    Submitted 1 February, 2019; v1 submitted 27 November, 2017; originally announced November 2017.

  25. arXiv:1708.06997  [pdf, other

    cs.CV

    The Unconstrained Ear Recognition Challenge

    Authors: Žiga Emeršič, Dejan Štepec, Vitomir Štruc, Peter Peer, Anjith George, Adil Ahmad, Elshibani Omar, Terrance E. Boult, Reza Safdari, Yuxiang Zhou, Stefanos Zafeiriou, Dogucan Yaman, Fevziye I. Eyiokur, Hazim K. Ekenel

    Abstract: In this paper we present the results of the Unconstrained Ear Recognition Challenge (UERC), a group benchmarking effort centered around the problem of person recognition from ear images captured in uncontrolled conditions. The goal of the challenge was to assess the performance of existing ear recognition techniques on a challenging large-scale dataset and identify open problems that need to be ad… ▽ More

    Submitted 1 February, 2019; v1 submitted 23 August, 2017; originally announced August 2017.

    Comments: International Joint Conference on Biometrics 2017

  26. Face Deidentification with Generative Deep Neural Networks

    Authors: Blaž Meden, Refik Can Mallı, Sebastjan Fabijan, Hazım Kemal Ekenel, Vitomir Štruc, Peter Peer

    Abstract: Face deidentification is an active topic amongst privacy and security researchers. Early deidentification methods relying on image blurring or pixelization were replaced in recent years with techniques based on formal anonymity models that provide privacy guaranties and at the same time aim at retaining certain characteristics of the data even after deidentification. The latter aspect is particula… ▽ More

    Submitted 28 July, 2017; originally announced July 2017.

    Comments: IET Signal Processing Special Issue on Deidentification 2017

  27. arXiv:1702.00307  [pdf, other

    cs.CV

    Pixel-wise Ear Detection with Convolutional Encoder-Decoder Networks

    Authors: Žiga Emeršič, Luka Lan Gabriel, Vitomir Štruc, Peter Peer

    Abstract: Object detection and segmentation represents the basis for many tasks in computer and machine vision. In biometric recognition systems the detection of the region-of-interest (ROI) is one of the most crucial steps in the overall processing pipeline, significantly impacting the performance of the entire recognition system. Existing approaches to ear detection, for example, are commonly susceptible… ▽ More

    Submitted 1 February, 2019; v1 submitted 1 February, 2017; originally announced February 2017.

    Comments: 12 pages

  28. arXiv:1611.06203  [pdf, other

    cs.CV

    Ear Recognition: More Than a Survey

    Authors: Žiga Emeršič, Vitomir Štruc, Peter Peer

    Abstract: Automatic identity recognition from ear images represents an active field of research within the biometric community. The ability to capture ear images from a distance and in a covert manner makes the technology an appealing choice for surveillance and security applications as well as other application domains. Significant contributions have been made in the field over recent years, but open resea… ▽ More

    Submitted 1 February, 2019; v1 submitted 18 November, 2016; originally announced November 2016.

    Comments: 17 pages, paper accepted to Neurocomputing

  29. arXiv:1608.07454  [pdf, other

    cs.CV

    Fine Hand Segmentation using Convolutional Neural Networks

    Authors: Tadej Vodopivec, Vincent Lepetit, Peter Peer

    Abstract: We propose a method for extracting very accurate masks of hands in egocentric views. Our method is based on a novel Deep Learning architecture: In contrast with current Deep Learning methods, we do not use upscaling layers applied to a low-dimensional representation of the input image. Instead, we extract features with convolutional layers and map them directly to a segmentation mask with a fully… ▽ More

    Submitted 26 August, 2016; originally announced August 2016.