Showing 1–50 of 58 results for author: Ekenel

Searching in archive cs.

  1. Facial Attribute Based Text Guided Face Anonymization

    Authors: Mustafa İzzet Muştu, Hazım Kemal Ekenel

    Abstract: The increasing prevalence of computer vision applications necessitates handling vast amounts of visual data, often containing personal information. While this technology offers significant benefits, it should not compromise privacy. Data privacy regulations emphasize the need for individual consent for processing personal data, hindering researchers' ability to collect high-quality datasets contai… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: 6 pages, 5 figures, published in the Proceedings of the Joint visuAAL-GoodBrother Conference on Trustworthy Video- and Audio-Based Assistive Technologies

    ACM Class: I.4.9; I.2.10; I.4.8

  2. arXiv:2505.20985  [pdf, ps, other]

    cs.CV

    Assessing the Use of Face Swapping Methods as Face Anonymizers in Videos

    Authors: Mustafa İzzet Muştu, Hazım Kemal Ekenel

    Abstract: The increasing demand for large-scale visual data, coupled with strict privacy regulations, has driven research into anonymization methods that hide personal identities without seriously degrading data quality. In this paper, we explore the potential of face swapping methods to preserve privacy in video data. Through extensive evaluations focusing on temporal consistency, anonymity strength, and v… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Accepted to the 2025 25th International Conference on Digital Signal Processing (DSP 2025)

  3. arXiv:2501.19184  [pdf, other]

    cs.CV

    A Survey on Class-Agnostic Counting: Advancements from Reference-Based to Open-World Text-Guided Approaches

    Authors: Luca Ciampi, Ali Azmoudeh, Elif Ecem Akbaba, Erdi Sarıtaş, Ziya Ata Yazıcı, Hazım Kemal Ekenel, Giuseppe Amato, Fabrizio Falchi

    Abstract: Visual object counting has recently shifted towards class-agnostic counting (CAC), which addresses the challenge of counting objects across arbitrary categories -- a crucial capability for flexible and generalizable counting systems. Unlike humans, who effortlessly identify and count objects from diverse categories without prior knowledge, most existing counting methods are restricted to enumerati… ▽ More

    Submitted 28 April, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

  4. arXiv:2412.11779  [pdf, other]

    cs.CV

    Impact of Face Alignment on Face Image Quality

    Authors: Eren Onaran, Erdi Sarıtaş, Hazım Kemal Ekenel

    Abstract: Face alignment is a crucial step in preparing face images for feature extraction in facial analysis tasks. For applications such as face recognition, facial expression recognition, and facial attribute classification, alignment is widely utilized during both training and inference to standardize the positions of key landmarks in the face. It is well known that the application and method of face al… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: Accepted at EAI ROSENET 2024 - 8th EAI International Conference on Robotic Sensor Networks

  5. 50 questions on Active Assisted Living technologies. Global edition

    Authors: Francisco Florez-Revuelta, Alin Ake-Kob, Pau Climent-Perez, Paulo Coelho, Liane Colonna, Laila Dahabiyeh, Carina Dantas, Esra Dogru-Huzmeli, Hazim Kemal Ekenel, Aleksandar Jevremovic, Nina Hosseini-Kivanani, Aysegul Ilgaz, Mladjan Jovanovic, Andrzej Klimczuk, Maksymilian M. Kuźmicz, Petre Lameski, Ferlanda Luna, Natália Machado, Tamara Mujirishvili, Zada Pajalic, Galidiya Petrova, Nathalie G. S. Puaschitz, Maria Jose Santofimia, Agusti Solanas, Wilhelmina van Staalduinen, et al. (1 additional author not shown)

    Abstract: This booklet on Active Assisted Living (AAL) technologies has been created as part of the GoodBrother COST Action, which has run from 2020 to 2024. COST Actions are European research programs that promote collaboration across borders, uniting researchers, professionals, and institutions to address key societal challenges. GoodBrother focused on ethical and privacy concerns surrounding video and au… ▽ More

    Submitted 22 October, 2024; originally announced October 2024.

  6. arXiv:2410.08713  [pdf, other]

    cs.CV

    Impact of Surface Reflections in Maritime Obstacle Detection

    Authors: Samed Yalçın, Hazım Kemal Ekenel

    Abstract: Maritime obstacle detection aims to detect possible obstacles for autonomous driving of unmanned surface vehicles. In the context of maritime obstacle detection, the water surface can act like a mirror on certain circumstances, causing reflections on imagery. Previous works have indicated surface reflections as a source of false positives for object detectors in maritime obstacle detection tasks.… ▽ More

    Submitted 11 October, 2024; originally announced October 2024.

    Comments: Accepted at RROW2024 Workshop @ British Machine Vision Conference (BMVC) 2024

    ACM Class: I.5.4

  7. arXiv:2406.02153  [pdf, other]

    cs.CV

    Analyzing the Feature Extractor Networks for Face Image Synthesis

    Authors: Erdi Sarıtaş, Hazım Kemal Ekenel

    Abstract: Advancements like Generative Adversarial Networks have attracted the attention of researchers toward face image synthesis to generate ever more realistic images. Thereby, the need for the evaluation criteria to assess the realism of the generated images has become apparent. While FID utilized with InceptionV3 is one of the primary choices for benchmarking, concerns about InceptionV3's limitations… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 1st SD-FGA Workshop 2024

  8. arXiv:2406.02142  [pdf, other]

    cs.CV

    Analyzing the Effect of Combined Degradations on Face Recognition

    Authors: Erdi Sarıtaş, Hazım Kemal Ekenel

    Abstract: A face recognition model is typically trained on large datasets of images that may be collected from controlled environments. This results in performance discrepancies when applied to real-world scenarios due to the domain gap between clean and in-the-wild images. Therefore, some researchers have investigated the robustness of these models by analyzing synthetic degradations. Yet, existing studies… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: Accepted at 18th International Conference on Automatic Face and Gesture Recognition (FG) on 2nd PrivAAL Workshop 2024

  9. arXiv:2405.04327  [pdf, other]

    cs.CV

    Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation

    Authors: Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Seymanur Aktı, Hazım Kemal Ekenel, Alexander Waibel

    Abstract: In the task of talking face generation, the objective is to generate a face video with lips synchronized to the corresponding audio while preserving visual details and identity information. Current methods face the challenge of learning accurate lip synchronization while avoiding detrimental effects on visual quality, as well as robustly evaluating such synchronization. To tackle these problems, w… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: CVPR2024 NTIRE Workshop

  10. GLIMS: Attention-Guided Lightweight Multi-Scale Hybrid Network for Volumetric Semantic Segmentation

    Authors: Ziya Ata Yazıcı, İlkay Öksüz, Hazım Kemal Ekenel

    Abstract: Convolutional Neural Networks (CNNs) have become widely adopted for medical image segmentation tasks, demonstrating promising performance. However, the inherent inductive biases in convolutional architectures limit their ability to model long-range dependencies and spatial correlations. While recent transformer-based architectures address these limitations by leveraging self-attention mechanisms t… ▽ More

    Submitted 27 April, 2024; originally announced April 2024.

    Comments: The article was accepted for publication in the Image and Vision Computing journal

  11. arXiv:2403.09942  [pdf, other]

    eess.IV cs.CV cs.LG

    Attention-Enhanced Hybrid Feature Aggregation Network for 3D Brain Tumor Segmentation

    Authors: Ziya Ata Yazıcı, İlkay Öksüz, Hazım Kemal Ekenel

    Abstract: Glioblastoma is a highly aggressive and malignant brain tumor type that requires early diagnosis and prompt intervention. Due to its heterogeneity in appearance, developing automated detection approaches is challenging. To address this challenge, Artificial Intelligence (AI)-driven approaches in healthcare have generated interest in efficiently diagnosing and evaluating brain tumors. The Brain Tum… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: Accepted at 9th BrainLes Workshop (BraTS 2023 Challenge) @ International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI) 2023

  12. arXiv:2402.00700  [pdf, other]

    cs.CV

    In-Bed Pose Estimation: A Review

    Authors: Ziya Ata Yazıcı, Sara Colantonio, Hazım Kemal Ekenel

    Abstract: Human pose estimation, the process of identifying joint positions in a person's body from images or videos, represents a widely utilized technology across diverse fields, including healthcare. One such healthcare application involves in-bed pose estimation, where the body pose of an individual lying under a blanket is analyzed. This task, for instance, can be used to monitor a person's sleep behav… ▽ More

    Submitted 1 February, 2024; originally announced February 2024.

    Comments: Accepted at HCCS24 Workshop @ International Conference on Pervasive Computing and Communications (PerCom 2024)

  13. arXiv:2307.09368  [pdf, other]

    cs.CV

    Audio-driven Talking Face Generation with Stabilized Synchronization Loss

    Authors: Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Hazim Kemal Ekenel, Alexander Waibel

    Abstract: Talking face generation aims to create realistic videos with accurate lip synchronization and high visual quality, using given audio and reference video while preserving identity and visual characteristics. In this paper, we start by identifying several issues with existing synchronization learning methods. These involve unstable training, lip synchronization, and visual quality issues caused by l… ▽ More

    Submitted 18 July, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Comments: Accepted by ECCV 2024

  14. arXiv:2211.15225  [pdf, other]

    cs.CV

    Meet-in-the-middle: Multi-scale upsampling and matching for cross-resolution face recognition

    Authors: Klemen Grm, Berk Kemal Özata, Vitomir Štruc, Hazım Kemal Ekenel

    Abstract: In this paper, we aim to address the large domain gap between high-resolution face images, e.g., from professional portrait photography, and low-quality surveillance images, e.g., from security cameras. Establishing an identity match between disparate sources like this is a classical surveillance face identification scenario, which continues to be a challenging problem for modern face recognition… ▽ More

    Submitted 29 November, 2022; v1 submitted 28 November, 2022; originally announced November 2022.

  15. arXiv:2211.03705  [pdf, other]

    cs.CV eess.IV

    A Survey on Computer Vision based Human Analysis in the COVID-19 Era

    Authors: Fevziye Irem Eyiokur, Alperen Kantarcı, Mustafa Ekrem Erakın, Naser Damer, Ferda Ofli, Muhammad Imran, Janez Križaj, Albert Ali Salah, Alexander Waibel, Vitomir Štruc, Hazım Kemal Ekenel

    Abstract: The emergence of COVID-19 has had a global and profound impact, not only on society as a whole, but also on the lives of individuals. Various prevention measures were introduced around the world to limit the transmission of the disease, including face masks, mandates for social distancing and regular disinfection in public spaces, and the use of screening applications. These developments also trig… ▽ More

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: Submitted to Image and Vision Computing, 44 pages, 7 figures

  16. arXiv:2211.01207  [pdf, other]

    cs.CV cs.LG eess.IV

    Bias-Aware Face Mask Detection Dataset

    Authors: Alperen Kantarcı, Ferda Ofli, Muhammad Imran, Hazım Kemal Ekenel

    Abstract: In December 2019, a novel coronavirus (COVID-19) spread so quickly around the world that many countries had to set mandatory face mask rules in public areas to reduce the transmission of the virus. To monitor public adherence, researchers aimed to rapidly develop efficient systems that can detect faces with masks automatically. However, the lack of representative and novel datasets proved to be th… ▽ More

    Submitted 10 January, 2023; v1 submitted 2 November, 2022; originally announced November 2022.

    Comments: 7 pages, 3 figures

    Journal ref: Multimedia Tools and Applications 2024

  17. arXiv:2208.02760  [pdf, other]

    cs.CV cs.LG

    OCFR 2022: Competition on Occluded Face Recognition From Synthetically Generated Structure-Aware Occlusions

    Authors: Pedro C. Neto, Fadi Boutros, Joao Ribeiro Pinto, Naser Damer, Ana F. Sequeira, Jaime S. Cardoso, Messaoud Bengherabi, Abderaouf Bousnat, Sana Boucheta, Nesrine Hebbadj, Mustafa Ekrem Erakın, Uğur Demir, Hazım Kemal Ekenel, Pedro Beber de Queiroz Vidal, David Menotti

    Abstract: This work summarizes the IJCB Occluded Face Recognition Competition 2022 (IJCB-OCFR-2022) embraced by the 2022 International Joint Conference on Biometrics (IJCB 2022). OCFR-2022 attracted a total of 3 participating teams, from academia. Eventually, six valid submissions were submitted and then evaluated by the organizers. The competition was held to address the challenge of face recognition in th… ▽ More

    Submitted 15 August, 2022; v1 submitted 4 August, 2022; originally announced August 2022.

    Comments: Accepted at International Joint Conference on Biometrics 2022

  18. arXiv:2207.01487  [pdf]

    cs.CY cs.AI cs.HC cs.SD eess.AS

    State of the Art of Audio- and Video-Based Solutions for AAL

    Authors: Slavisa Aleksic, Michael Atanasov, Jean Calleja Agius, Kenneth Camilleri, Anto Cartolovni, Pau Climent-Perez, Sara Colantonio, Stefania Cristina, Vladimir Despotovic, Hazim Kemal Ekenel, Ekrem Erakin, Francisco Florez-Revuelta, Danila Germanese, Nicole Grech, Steinunn Gróa Sigurðardóttir, Murat Emirzeoglu, Ivo Iliev, Mladjan Jovanovic, Martin Kampel, William Kearns, Andrzej Klimczuk, Lambros Lambrinos, Jennifer Lumetzberger, Wiktor Mucha, Sophie Noiret, et al. (14 additional authors not shown)

    Abstract: The report illustrates the state of the art of the most successful AAL applications and functions based on audio and video data, namely (i) lifelogging and self-monitoring, (ii) remote monitoring of vital signs, (iii) emotional state recognition, (iv) food intake monitoring, activity and behaviour recognition, (v) activity and personal assistance, (vi) gesture recognition, (vii) fall detection and… ▽ More

    Submitted 5 July, 2022; v1 submitted 26 June, 2022; originally announced July 2022.

    ACM Class: I.2

  19. arXiv:2206.04523  [pdf, other]

    cs.CL cs.CV cs.SD eess.AS eess.IV

    Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos

    Authors: Alexander Waibel, Moritz Behr, Fevziye Irem Eyiokur, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarcı, Stefan Constantin, Hazım Kemal Ekenel

    Abstract: In this paper, we propose a neural end-to-end system for voice preserving, lip-synchronous translation of videos. The system is designed to combine multiple component models and produces a video of the original speaker speaking in the target language that is lip-synchronous with the target speech, yet maintains emphases in speech, voice characteristics, face video of the original speaker. The pipe… ▽ More

    Submitted 9 June, 2022; originally announced June 2022.

  20. VIDI: A Video Dataset of Incidents

    Authors: Duygu Sesver, Alp Eren Gençoğlu, Çağrı Emre Yıldız, Zehra Günindi, Faeze Habibi, Ziya Ata Yazıcı, Hazım Kemal Ekenel

    Abstract: Automatic detection of natural disasters and incidents has become more important as a tool for fast response. There have been many studies to detect incidents using still images and text. However, the number of approaches that exploit temporal information is rather limited. One of the main reasons for this is that a diverse video dataset with various incident types does not exist. To address this… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

    Journal ref: 2022 IEEE 14th Image, Video, and Multidimensional Signal Processing Workshop (IVMSP)

  21. arXiv:2204.10648  [pdf, other]

    cs.CV

    Exposure Correction Model to Enhance Image Quality

    Authors: Fevziye Irem Eyiokur, Dogucan Yaman, Hazım Kemal Ekenel, Alexander Waibel

    Abstract: Exposure errors in an image cause a degradation in the contrast and low visibility in the content. In this paper, we address this problem and propose an end-to-end exposure correction model in order to handle both under- and overexposure errors with a single model. Our model contains an image encoder, consecutive residual blocks, and image decoder to synthesize the corrected image. We utilize perc… ▽ More

    Submitted 22 April, 2022; originally announced April 2022.

    Comments: Accepted for CVPR2022 NTIRE Workshop

  22. arXiv:2204.09432  [pdf, other]

    cs.CV

    A Mobile Food Recognition System for Dietary Assessment

    Authors: Şeymanur Aktı, Marwa Qaraqe, Hazım Kemal Ekenel

    Abstract: Food recognition is an important task for a variety of applications, including managing health conditions and assisting visually impaired people. Several food recognition studies have focused on generic types of food or specific cuisines, however, food recognition with respect to Middle Eastern cuisines has remained unexplored. Therefore, in this paper we focus on developing a mobile friendly, Mid… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: Accepted at GoodBrotherVI4IAAL Workshop @ICIAP2021

  23. arXiv:2111.08370  [pdf, other]

    cs.CV

    Fight Detection from Still Images in the Wild

    Authors: Şeymanur Aktı, Ferda Ofli, Muhammad Imran, Hazım Kemal Ekenel

    Abstract: Detecting fights from still images shared on social media is an important task required to limit the distribution of violent scenes in order to prevent their negative effects. For this reason, in this study, we address the problem of fight detection from still images collected from the web and social media. We explore how well one can detect fights from just a single still image. We also propose a… ▽ More

    Submitted 17 November, 2021; v1 submitted 16 November, 2021; originally announced November 2021.

    Comments: Accepted for publication at Winter Conference on Applications of Computer Vision Workshops (WACV-W 2022), Workshop on Real-World Surveillance: Applications and Challenges

  24. arXiv:2109.03672  [pdf, other]

    cs.CV

    On Recognizing Occluded Faces in the Wild

    Authors: Mustafa Ekrem Erakın, Uğur Demir, Hazım Kemal Ekenel

    Abstract: Facial appearance variations due to occlusion has been one of the main challenges for face recognition systems. To facilitate further research in this area, it is necessary and important to have occluded face datasets collected from real-world, as synthetically generated occluded faces cannot represent the nature of the problem. In this paper, we present the Real World Occluded Faces (ROF) dataset… ▽ More

    Submitted 11 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to 20th International Conference of the Biometrics Special Interest Group (BIOSIG 2021) as Poster paper

  25. arXiv:2109.03484  [pdf, other]

    cs.CV cs.CR cs.LG

    Shuffled Patch-Wise Supervision for Presentation Attack Detection

    Authors: Alperen Kantarcı, Hasan Dertli, Hazım Kemal Ekenel

    Abstract: Face anti-spoofing is essential to prevent false facial verification by using a photo, video, mask, or a different substitute for an authorized person's face. Most of the state-of-the-art presentation attack detection (PAD) systems suffer from overfitting, where they achieve near-perfect scores on a single dataset but fail on a different dataset with more realistic data. This problem drives resear… ▽ More

    Submitted 9 September, 2021; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: Accepted to 20th International Conference of the Biometrics Special Interest Group (BIOSIG 2021) as Oral paper

  26. arXiv:2106.15288  [pdf, other]

    cs.CV

    MFR 2021: Masked Face Recognition Competition

    Authors: Fadi Boutros, Naser Damer, Jan Niklas Kolf, Kiran Raja, Florian Kirchbuchner, Raghavendra Ramachandra, Arjan Kuijper, Pengcheng Fang, Chao Zhang, Fei Wang, David Montero, Naiara Aginako, Basilio Sierra, Marcos Nieto, Mustafa Ekrem Erakin, Ugur Demir, Hazim Kemal Ekenel, Asaki Kataoka, Kohei Ichikawa, Shizuma Kubo, Jie Zhang, Mingjie He, Dan Han, Shiguang Shan, et al. (10 additional authors not shown)

    Abstract: This paper presents a summary of the Masked Face Recognition Competitions (MFR) held within the 2021 International Joint Conference on Biometrics (IJCB 2021). The competition attracted a total of 10 participating teams with valid submissions. The affiliations of these teams are diverse and associated with academia and industry in nine different countries. These teams successfully submitted 18 vali… ▽ More

    Submitted 29 June, 2021; originally announced June 2021.

    Comments: Accepted at International Joint Conference on Biometrics (IJCB 2021)

  27. arXiv:2106.03210  [pdf, other]

    cs.CV

    Alpha Matte Generation from Single Input for Portrait Matting

    Authors: Dogucan Yaman, Hazım Kemal Ekenel, Alexander Waibel

    Abstract: In the portrait matting, the goal is to predict an alpha matte that identifies the effect of each pixel on the foreground subject. Traditional approaches and most of the existing works utilized an additional input, e.g., trimap, background image, to predict alpha matte. However, (1) providing additional input is not always practical, and (2) models are too sensitive to these additional inputs. To… ▽ More

    Submitted 25 April, 2022; v1 submitted 6 June, 2021; originally announced June 2021.

    Comments: Accepted for CVPR 2022 NTIRE Workshop

  28. Unconstrained Face-Mask & Face-Hand Datasets: Building a Computer Vision System to Help Prevent the Transmission of COVID-19

    Authors: Fevziye Irem Eyiokur, Hazım Kemal Ekenel, Alexander Waibel

    Abstract: Health organizations advise social distancing, wearing face mask, and avoiding touching face to prevent the spread of coronavirus. Based on these protective measures, we developed a computer vision system to help prevent the transmission of COVID-19. Specifically, the developed system performs face mask detection, face-hand interaction detection, and measures social distance. To train and evaluate… ▽ More

    Submitted 8 December, 2021; v1 submitted 15 March, 2021; originally announced March 2021.

    Comments: 9 pages, 4 figures

    Journal ref: SIViP (2022)

  29. MOCCA: Multi-Layer One-Class ClassificAtion for Anomaly Detection

    Authors: Fabio Valerio Massoli, Fabrizio Falchi, Alperen Kantarci, Şeymanur Akti, Hazim Kemal Ekenel, Giuseppe Amato

    Abstract: Anomalies are ubiquitous in all scientific fields and can express an unexpected event due to incomplete knowledge about the data distribution or an unknown process that suddenly comes into play and distorts observations. Due to such events' rarity, to train deep learning models on the Anomaly Detection (AD) task, scientists only rely on "normal" data, i.e., non-anomalous samples. Thus, letting the… ▽ More

    Submitted 27 November, 2021; v1 submitted 9 December, 2020; originally announced December 2020.

    Comments: The paper has been accepted for publication in the IEEE Transactions on Neural Networks and Learning Systems, Special Issue on Deep Learning for Anomaly Detection

    MSC Class: 68-XX

    ACM Class: I.5

    Journal ref: IEEE TNNLS (2021)

  30. arXiv:2007.04383  [pdf, other]

    cs.CV eess.IV

    Words as Art Materials: Generating Paintings with Sequential GANs

    Authors: Azmi Can Özgen, Hazım Kemal Ekenel

    Abstract: Converting text descriptions into images using Generative Adversarial Networks has become a popular research area. Visually appealing images have been generated successfully in recent years. Inspired by these studies, we investigated the generation of artistic images on a large variance dataset. This dataset includes images with variations, for example, in shape, color, and content. These variatio… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  31. arXiv:2007.03053  [pdf, other]

    eess.IV cs.CV

    Benefiting from Bicubically Down-Sampled Images for Learning Real-World Image Super-Resolution

    Authors: Mohammad Saeed Rad, Thomas Yu, Claudiu Musat, Hazim Kemal Ekenel, Behzad Bozorgtabar, Jean-Philippe Thiran

    Abstract: Super-resolution (SR) has traditionally been based on pairs of high-resolution images (HR) and their low-resolution (LR) counterparts obtained artificially with bicubic downsampling. However, in real-world SR, there is a large variety of realistic image degradations and analytically modeling these realistic degradations can prove quite difficult. In this work, we propose to handle real-world SR by… ▽ More

    Submitted 5 November, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: WACV 2021

  32. arXiv:2006.01943  [pdf, other]

    cs.CV

    Ear2Face: Deep Biometric Modality Mapping

    Authors: Dogucan Yaman, Fevziye Irem Eyiokur, Hazım Kemal Ekenel

    Abstract: In this paper, we explore the correlation between different visual biometric modalities. For this purpose, we present an end-to-end deep neural network model that learns a mapping between the biometric modalities. Namely, our goal is to generate a frontal face image of a subject given his/her ear image as the input. We formulated the problem as a paired image-to-image translation task and collecte… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

    Comments: 13 pages, 4 figures

  33. arXiv:2004.12104  [pdf, other]

    cs.CV

    Offline Signature Verification on Real-World Documents

    Authors: Deniz Engin, Alperen Kantarcı, Seçil Arslan, Hazım Kemal Ekenel

    Abstract: Research on offline signature verification has explored a large variety of methods on multiple signature datasets, which are collected under controlled conditions. However, these datasets may not fully reflect the characteristics of the signatures in some practical use cases. Real-world signatures extracted from the formal documents may contain different types of occlusions, for example, stamps, c… ▽ More

    Submitted 25 April, 2020; originally announced April 2020.

    Comments: CVPR 2020 Biometrics Workshop

  34. arXiv:2002.04355  [pdf, other]

    cs.CV cs.LG eess.IV

    Vision-based Fight Detection from Surveillance Cameras

    Authors: Şeymanur Aktı, Gözde Ayşe Tataroğlu, Hazım Kemal Ekenel

    Abstract: Vision-based action recognition is one of the most challenging research topics of computer vision and pattern recognition. A specific application of it, namely, detecting fights from surveillance cameras in public areas, prisons, etc., is desired to quickly get under control these violent incidents. This paper addresses this research problem and explores LSTM-based approaches to solve it. Moreover… ▽ More

    Submitted 11 February, 2020; originally announced February 2020.

    Comments: 6 pages, 5 figures, 4 tables, International Conference on Image Processing Theory, Tools and Applications, IPTA 2019

  35. arXiv:2002.04219  [pdf, other]

    cs.CV cs.LG eess.IV

    Thermal to Visible Face Recognition Using Deep Autoencoders

    Authors: Alperen Kantarcı, Hazım Kemal Ekenel

    Abstract: Visible face recognition systems achieve nearly perfect recognition accuracies using deep learning. However, in lack of light, these systems perform poorly. A way to deal with this problem is thermal to visible cross-domain face matching. This is a desired technology because of its usefulness in night time surveillance. Nevertheless, due to differences between two domains, it is a very challenging… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

    Comments: 5 pages, 3 figures, 2019 International Conference of the Biometrics Special Interest Group (BIOSIG)

  36. arXiv:1908.07222  [pdf, other]

    cs.CV

    SROBB: Targeted Perceptual Loss for Single Image Super-Resolution

    Authors: Mohammad Saeed Rad, Behzad Bozorgtabar, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: By benefiting from perceptual losses, recent studies have improved significantly the performance of the super-resolution task, where a high-resolution image is resolved from its low-resolution counterpart. Although such objective functions generate near-photorealistic results, their capability is limited, since they estimate the reconstruction error for an entire image in the same way, without con… ▽ More

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: ICCV 2019

  37. arXiv:1907.12488  [pdf, other]

    cs.CV

    Benefiting from Multitask Learning to Improve Single Image Super-Resolution

    Authors: Mohammad Saeed Rad, Behzad Bozorgtabar, Claudiu Musat, Urs-Viktor Marti, Max Basler, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Despite significant progress toward super resolving more realistic images by deeper convolutional neural networks (CNNs), reconstructing fine and natural textures still remains a challenging problem. Recent works on single image super resolution (SISR) are mostly based on optimizing pixel and content wise similarity between recovered and high-resolution (HR) images and do not benefit from recogniz… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: accepted at Neurocomputing (Special Issue on Deep Learning for Image Super-Resolution), 2019

  38. arXiv:1907.10104  [pdf, other]

    cs.CV

    Exploring Factors for Improving Low Resolution Face Recognition

    Authors: Omid Abdollahi Aghdam, Behzad Bozorgtabar, Hazım Kemal Ekenel, Jean-Philippe Thiran

    Abstract: State-of-the-art deep face recognition approaches report near perfect performance on popular benchmarks, e.g., Labeled Faces in the Wild. However, their performance deteriorates significantly when they are applied on low quality images, such as those acquired by surveillance cameras. A further challenge for low resolution face recognition for surveillance applications is the matching of recorded l… ▽ More

    Submitted 25 July, 2019; v1 submitted 23 July, 2019; originally announced July 2019.

    Comments: CVPR Workshop on Biometrics 2019

  39. arXiv:1907.10081  [pdf, other]

    cs.CV

    Multimodal Age and Gender Classification Using Ear and Profile Face Images

    Authors: Dogucan Yaman, Fevziye Irem Eyiokur, Hazım Kemal Ekenel

    Abstract: In this paper, we present multimodal deep neural network frameworks for age and gender classification, which take input a profile face image as well as an ear image. Our main objective is to enhance the accuracy of soft biometric trait extraction from profile face images by additionally utilizing a promising biometric modality: ear appearance. For this purpose, we provided end-to-end multimodal de… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

    Comments: 8 pages, 4 figures, accepted for CVPR 2019 - Workshop on Biometrics

  40. arXiv:1905.13538  [pdf, other]

    cs.IR cs.CV cs.LG stat.ML

    FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents

    Authors: Guillaume Jaume, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: We present a new dataset for form understanding in noisy scanned documents (FUNSD) that aims at extracting and structuring the textual content of forms. The dataset comprises 199 real, fully annotated, scanned forms. The documents are noisy and vary widely in appearance, making form understanding (FoUn) a challenging task. The proposed dataset can be used for various tasks, including text detectio…

    Submitted 29 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

    Comments: ICDAR'19 OST workshop

  41. arXiv:1905.08090  [pdf, other]

    cs.CV

    Using Photorealistic Face Synthesis and Domain Adaptation to Improve Facial Expression Analysis

    Authors: Behzad Bozorgtabar, Mohammad Saeed Rad, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Cross-domain synthesis of realistic faces to learn deep models has attracted increasing attention for facial expression analysis, as it helps to improve expression recognition accuracy despite having a small number of real training images. However, learning from synthetic face images can be problematic due to the distribution discrepancy between low-quality synthetic images and rea…

    Submitted 17 May, 2019; originally announced May 2019.

    Comments: 8 pages, 8 figures, 5 tables, accepted by FG 2019. arXiv admin note: substantial text overlap with arXiv:1905.00286

  42. arXiv:1905.00286  [pdf, other]

    cs.CV

    Learn to synthesize and synthesize to learn

    Authors: Behzad Bozorgtabar, Mohammad Saeed Rad, Hazım Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Attribute guided face image synthesis aims to manipulate attributes on a face image. Most existing methods for image-to-image translation can either perform a fixed translation between any two image domains using a single attribute or require training data with the attributes of interest for each subject. Therefore, these methods could only train one specific model for each pair of image domains,…

    Submitted 1 May, 2019; originally announced May 2019.

    Comments: Accepted to Computer Vision and Image Understanding (CVIU)

  43. arXiv:1903.04143  [pdf, other]

    cs.CV

    The Unconstrained Ear Recognition Challenge 2019 - ArXiv Version With Appendix

    Authors: Žiga Emeršič, Aruna Kumar S. V., B. S. Harish, Weronika Gutfeter, Jalil Nourmohammadi Khiarak, Andrzej Pacut, Earnest Hansley, Mauricio Pamplona Segundo, Sudeep Sarkar, Hyeonjung Park, Gi Pyo Nam, Ig-Jae Kim, Sagar G. Sangodkar, Ümit Kaçar, Murvet Kirci, Li Yuan, Jishou Yuan, Haonan Zhao, Fei Lu, Junying Mao, Xiaoshuang Zhang, Dogucan Yaman, Fevziye Irem Eyiokur, Kadir Bulut Özler, Hazım Kemal Ekenel, et al. (6 additional authors not shown)

    Abstract: This paper presents a summary of the 2019 Unconstrained Ear Recognition Challenge (UERC), the second in a series of group benchmarking efforts centered around the problem of person recognition from ear images captured in uncontrolled settings. The goal of the challenge is to assess the performance of existing ear recognition techniques on a challenging large-scale ear dataset and to analyze perfor…

    Submitted 14 March, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

    Comments: The content of this paper was published in ICB, 2019. This ArXiv version is from before the peer review

  44. arXiv:1811.03830  [pdf, other]

    cs.CV cs.AI

    Image-Level Attentional Context Modeling Using Nested-Graph Neural Networks

    Authors: Guillaume Jaume, Behzad Bozorgtabar, Hazim Kemal Ekenel, Jean-Philippe Thiran, Maria Gabrani

    Abstract: We introduce a new scene graph generation method called image-level attentional context modeling (ILAC). Our model includes an attentional graph network that effectively propagates contextual information across the graph using image-level features. Whereas previous works use an object-centric context, we build an image-level context agent to encode the scene properties. The proposed method compris…

    Submitted 12 November, 2018; v1 submitted 9 November, 2018; originally announced November 2018.

    Comments: NIPS 2018, Relational Representation Learning Workshop

  45. arXiv:1806.05742  [pdf, other]

    cs.CV

    Age and Gender Classification From Ear Images

    Authors: Dogucan Yaman, Fevziye Irem Eyiokur, Nurdan Sezgin, Hazım Kemal Ekenel

    Abstract: In this paper, we present a detailed analysis of extracting soft biometric traits, age and gender, from ear images. Although there have been a few previous works on gender classification using ear images, to the best of our knowledge, this study is the first work on age classification from ear images. In the study, we have utilized both geometric features and appearance-based features for ear repre…

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: 7 pages, 3 figures, accepted for IAPR/IEEE International Workshop on Biometrics and Forensics (IWBF) 2018

  46. arXiv:1805.05308  [pdf, other]

    cs.CV

    Cycle-Dehaze: Enhanced CycleGAN for Single Image Dehazing

    Authors: Deniz Engin, Anıl Genç, Hazım Kemal Ekenel

    Abstract: In this paper, we present an end-to-end network, called Cycle-Dehaze, for the single image dehazing problem, which does not require pairs of hazy and corresponding ground truth images for training. That is, we train the network by feeding clean and hazy images in an unpaired manner. Moreover, the proposed approach does not rely on estimation of the atmospheric scattering model parameters. Our method e…

    Submitted 14 May, 2018; originally announced May 2018.

    Comments: Accepted at CVPRW: NTIRE 2018

  47. Domain Adaptation for Ear Recognition Using Deep Convolutional Neural Networks

    Authors: Fevziye Irem Eyiokur, Dogucan Yaman, Hazım Kemal Ekenel

    Abstract: In this paper, we have extensively investigated the unconstrained ear recognition problem. We have first shown the importance of domain adaptation when deep convolutional neural network models are used for ear recognition. To enable domain adaptation, we have collected a new ear dataset using the Multi-PIE face dataset, which we named the Multi-PIE ear dataset. To improve the performance further,…

    Submitted 21 March, 2018; originally announced March 2018.

    Comments: 12 pages, 7 figures, IET Biometrics

  48. A Computer Vision System to Localize and Classify Wastes on the Streets

    Authors: Mohammad Saeed Rad, Andreas von Kaenel, Andre Droux, Francois Tieche, Nabil Ouerhani, Hazim Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Littering quantification is an important step for improving cleanliness of cities. When human interpretation is too cumbersome or in some cases impossible, an objective index of cleanliness could reduce the littering by awareness actions. In this paper, we present a fully automated computer vision application for littering quantification based on images taken from the streets and sidewalks. We hav…

    Submitted 31 October, 2017; originally announced October 2017.

    Journal ref: Liu M., Chen H., Vincze M. (eds) Computer Vision Systems. pp 195-204. ICVS 2017. Lecture Notes in Computer Science, vol 10528. Springer, Cham

  49. arXiv:1710.07168  [pdf, other]

    cs.CV

    Combining Multiple Views for Visual Speech Recognition

    Authors: Marina Zimmermann, Mostafa Mehdipour Ghazi, Hazım Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Visual speech recognition is a challenging research problem with a particular practical application of aiding audio speech recognition in noisy scenarios. Multiple camera setups can be beneficial for the visual speech recognition systems in terms of improved performance and robustness. In this paper, we explore this aspect and provide a comprehensive study on combining multiple views for visual sp…

    Submitted 28 June, 2018; v1 submitted 19 October, 2017; originally announced October 2017.

    Journal ref: Proceedings of the 14th International Conference on Auditory-Visual Speech Processing (AVSP2017)

  50. Visual Speech Recognition Using PCA Networks and LSTMs in a Tandem GMM-HMM System

    Authors: Marina Zimmermann, Mostafa Mehdipour Ghazi, Hazım Kemal Ekenel, Jean-Philippe Thiran

    Abstract: Automatic visual speech recognition is an interesting problem in pattern recognition, especially when audio data is noisy or not readily available. It is also a very challenging task, mainly because of the lower amount of information in the visual articulations compared to the audible utterance. In this work, principal component analysis is applied to the image patches - extracted from the video dat…

    Submitted 19 October, 2017; originally announced October 2017.

    Journal ref: ACCV 2016 Workshops. ACCV 2016. Lecture Notes in Computer Science, vol 10117. Springer, Cham