Skip to main content

Showing 1–20 of 20 results for author: Dahyot, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.13421  [pdf, other

    cs.CV

    Performance of Gaussian Mixture Model Classifiers on Embedded Feature Spaces

    Authors: Jeremy Chopin, Rozenn Dahyot

    Abstract: Data embeddings with CLIP and ImageBind provide powerful features for the analysis of multimedia and/or multimodal data. We assess their performance here for classification using a Gaussian Mixture models (GMMs) based layer as an alternative to the standard Softmax layer. GMMs based classifiers have recently been shown to have interesting performances as part of deep learning pipelines trained end… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: 8 pages

  2. arXiv:2305.08232  [pdf, other

    cs.CV

    Combining geolocation and height estimation of objects from street level imagery

    Authors: Matej Ulicny, Vladimir A. Krylov, Julie Connelly, Rozenn Dahyot

    Abstract: We propose a pipeline for combined multi-class object geolocation and height estimation from street level RGB imagery, which is considered as a single available input data modality. Our solution is formulated via Markov Random Field optimization with deterministic output. The proposed technique uses image metadata along with coordinates of objects detected in the image plane as found by a custom-t… ▽ More

    Submitted 14 May, 2023; originally announced May 2023.

  3. Model-based inexact graph matching on top of CNNs for semantic scene understanding

    Authors: Jérémy Chopin, Jean-Baptiste Fasquel, Harold Mouchère, Rozenn Dahyot, Isabelle Bloch

    Abstract: Deep learning based pipelines for semantic segmentation often ignore structural information available on annotated images used for training. We propose a novel post-processing module enforcing structural knowledge about the objects of interest to improve segmentation results provided by deep learning. This module corresponds to a "many-to-one-or-none" inexact graph matching approach, and is formul… ▽ More

    Submitted 1 August, 2023; v1 submitted 18 January, 2023; originally announced January 2023.

    Comments: 27 pages, 9 figures, 11 tables

    MSC Class: I.4.5

  4. arXiv:2210.12746  [pdf, other

    cs.LG cs.CV stat.ML

    Principal Component Classification

    Authors: Rozenn Dahyot

    Abstract: We propose to directly compute classification estimates by learning features encoded with their class scores using PCA. Our resulting model has a encoder-decoder structure suitable for supervised learning, it is computationally efficient and performs well for classification on several datasets.

    Submitted 26 October, 2022; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 5 pages; 5 figures; 1 table

  5. arXiv:2111.04739  [pdf, other

    eess.IV cs.CV

    DR-VNet: Retinal Vessel Segmentation via Dense Residual UNet

    Authors: Ali Karaali, Rozenn Dahyot, Donal J. Sexton

    Abstract: Accurate retinal vessel segmentation is an important task for many computer-aided diagnosis systems. Yet, it is still a challenging problem due to the complex vessel structures of an eye. Numerous vessel segmentation methods have been proposed recently, however more research is needed to deal with poor segmentation of thin and tiny vessels. To address this, we propose a new deep learning pipeline… ▽ More

    Submitted 22 March, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

    Comments: Accepted to ICPRAI 2022 - 3rd International Conference on Pattern Recognition and Artificial Intelligence

  6. arXiv:2108.06306  [pdf, other

    cs.CV

    3D point cloud segmentation using GIS

    Authors: Chao-Jung Liu, Vladimir Krylov, Rozenn Dahyot

    Abstract: In this paper we propose an approach to perform semantic segmentation of 3D point cloud data by importing the geographic information from a 2D GIS layer (OpenStreetMap). The proposed automatic procedure identifies meaningful units such as buildings and adjusts their locations to achieve best fit between the GIS polygonal perimeters and the point cloud. Our processing pipeline is presented and illu… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: 8 pages

    Journal ref: IMVIP 2018

  7. arXiv:2108.06302  [pdf, other

    cs.LG

    Context Aware Object Geotagging

    Authors: Chao-Jung Liu, Matej Ulicny, Michael Manzke, Rozenn Dahyot

    Abstract: Localization of street objects from images has gained a lot of attention in recent years. We propose an approach to improve asset geolocation from street view imagery by enhancing the quality of the metadata associated with the images using Structure from Motion. The predicted object geolocation is further refined by imposing contextual geographic information extracted from OpenStreetMap. Our pipe… ▽ More

    Submitted 13 August, 2021; originally announced August 2021.

    Comments: 8 pages

    Journal ref: IMVIP 2021

  8. arXiv:2102.09297  [pdf, other

    cs.CV

    Sliced $\mathcal{L}_2$ Distance for Colour Grading

    Authors: Hana Alghamdi, Rozenn Dahyot

    Abstract: We propose a new method with $\mathcal{L}_2$ distance that maps one $N$-dimensional distribution to another, taking into account available information about correspondences. We solve the high-dimensional problem in 1D space using an iterative projection approach. To show the potentials of this mapping, we apply it to colour transfer between two images that exhibit overlapped scenes. Experiments sh… ▽ More

    Submitted 18 February, 2021; originally announced February 2021.

    Comments: 5 pages, 9 figures

  9. arXiv:2010.12110  [pdf, other

    cs.LG cs.CV

    Tensor Reordering for CNN Compression

    Authors: Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot

    Abstract: We show how parameter redundancy in Convolutional Neural Network (CNN) filters can be effectively reduced by pruning in spectral domain. Specifically, the representation extracted via Discrete Cosine Transform (DCT) is more conducive for pruning than the original space. By relying on a combination of weight tensor reshaping and reordering we achieve high levels of layer compression with just minor… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

  10. arXiv:2006.09208  [pdf, other

    cs.CV cs.LG cs.MM

    Iterative Nadaraya-Watson Distribution Transfer for Colour Grading

    Authors: Hana Alghamdi, Rozenn Dahyot

    Abstract: We propose a new method with Nadaraya-Watson that maps one N-dimensional distribution to another taking into account available information about correspondences. We extend the 2D/3D problem to higher dimensions by encoding overlapping neighborhoods of data points and solve the high dimensional problem in 1D space using an iterative projection approach. To show potentials of this mapping, we apply… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

    Comments: 6 pages, 6 figures, 4 tables. arXiv admin note: substantial text overlap with arXiv:2005.09015

  11. arXiv:2005.09015  [pdf, other

    cs.CV cs.LG

    Patch based Colour Transfer using SIFT Flow

    Authors: Hana Alghamdi, Rozenn Dahyot

    Abstract: We propose a new colour transfer method with Optimal Transport (OT) to transfer the colour of a sourceimage to match the colour of a target image of the same scene that may exhibit large motion changes betweenimages. By definition OT does not take into account any available information about correspondences whencomputing the optimal solution. To tackle this problem we propose to encode overlapping… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 8 pages, 7 figures, 4 tables

  12. arXiv:2001.06570  [pdf, other

    cs.CV cs.LG

    Harmonic Convolutional Networks based on Discrete Cosine Transform

    Authors: Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot

    Abstract: Convolutional neural networks (CNNs) learn filters in order to capture local correlation patterns in feature space. We propose to learn these filters as combinations of preset spectral filters defined by the Discrete Cosine Transform (DCT). Our proposed DCT-based harmonic blocks replace conventional convolutional layers to produce partially or fully harmonic versions of new or existing CNN archite… ▽ More

    Submitted 9 April, 2022; v1 submitted 17 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:1812.03205

  13. arXiv:2001.02976  [pdf, other

    cs.LG cs.NE

    Performance-Oriented Neural Architecture Search

    Authors: Andrew Anderson, Jing Su, Rozenn Dahyot, David Gregg

    Abstract: Hardware-Software Co-Design is a highly successful strategy for improving performance of domain-specific computing systems. We argue for the application of the same methodology to deep learning; specifically, we propose to extend neural architecture search with information about the hardware to ensure that the model designs produced are highly efficient in addition to the typical criteria around a… ▽ More

    Submitted 9 January, 2020; originally announced January 2020.

    Comments: The 2019 International Conference on High Performance Computing & Simulation

  14. arXiv:1905.12678  [pdf, other

    cs.CV cs.LG stat.ML

    Entropic Regularisation of Robust Optimal Transport

    Authors: Rozenn Dahyot, Hana Alghamdi, Mairead Grogan

    Abstract: Grogan et al [11,12] have recently proposed a solution to colour transfer by minimising the Euclidean distance L2 between two probability density functions capturing the colour distributions of two images (palette and target). It was shown to be very competitive to alternative solutions based on Optimal Transport for colour transfer. We show that in fact Grogan et al's formulation can also be unde… ▽ More

    Submitted 29 May, 2019; originally announced May 2019.

    Comments: 8 pages

    Journal ref: Proceeding of Irish Machine Vision and Image Processing conference IMVIP 2019

  15. arXiv:1905.00135  [pdf, other

    cs.CV cs.LG

    Harmonic Networks with Limited Training Samples

    Authors: Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot

    Abstract: Convolutional neural networks (CNNs) are very popular nowadays for image processing. CNNs allow one to learn optimal filters in a (mostly) supervised machine learning context. However this typically requires abundant labelled training data to estimate the filter parameters. Alternative strategies have been deployed for reducing the number of parameters and / or filters to be learned and thus decre… ▽ More

    Submitted 30 April, 2019; originally announced May 2019.

    Journal ref: European Signal Processing Conference (EUSIPCO) 2019

  16. arXiv:1901.03298  [pdf, other

    cs.IR

    Automatic detection of passable roads after floods in remote sensed and social media data

    Authors: Kashif Ahmad, Konstantin Pogorelov, Michael Riegler, Olga Ostroukhova, Paal Halvorsen, Nicola Conci, Rozenn Dahyot

    Abstract: This paper addresses the problem of floods classification and floods aftermath detection utilizing both social media and satellite imagery. Automatic detection of disasters such as floods is still a very challenging task. The focus lies on identifying passable routes or roads during floods. Two novel solutions are presented, which were developed for two corresponding tasks at the MediaEval 2018 be… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

  17. arXiv:1812.03205  [pdf, other

    cs.CV cs.LG

    Harmonic Networks: Integrating Spectral Information into CNNs

    Authors: Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot

    Abstract: Convolutional neural networks (CNNs) learn filters in order to capture local correlation patterns in feature space. In contrast, in this paper we propose harmonic blocks that produce features by learning optimal combinations of spectral filters defined by the Discrete Cosine Transform. The harmonic blocks are used to replace conventional convolutional layers to construct partial or fully harmonic… ▽ More

    Submitted 7 December, 2018; originally announced December 2018.

  18. Automatic Discovery and Geotagging of Objects from Street View Imagery

    Authors: Vladimir A. Krylov, Eamonn Kenny, Rozenn Dahyot

    Abstract: Many applications such as autonomous navigation, urban planning and asset monitoring, rely on the availability of accurate information about objects and their geolocations. In this paper we propose to automatically detect and compute the GPS coordinates of recurring stationary objects of interest using street view imagery. Our processing pipeline relies on two fully convolutional neural networks:… ▽ More

    Submitted 1 December, 2017; v1 submitted 28 August, 2017; originally announced August 2017.

    Comments: Video demo at https://youtu.be/X0tM_iSRJMw

  19. Shape Registration with Directional Data

    Authors: Mairéad Grogan, Rozenn Dahyot

    Abstract: We propose several cost functions for registration of shapes encoded with Euclidean and/or non-Euclidean information (unit vectors). Our framework is assessed for estimation of both rigid and non-rigid transformations between the target and model shapes corresponding to 2D contours and 3D surfaces. The experimental results obtained confirm that using the combination of a point's position and unit… ▽ More

    Submitted 29 August, 2017; v1 submitted 25 August, 2017; originally announced August 2017.

    Comments: v2: Updated v1 by adding supplementary material

    ACM Class: I.2.10; I.5.1

    Journal ref: Pattern Recognition 79 (2018) 452-466

  20. arXiv:1705.06091  [pdf, other

    cs.CV

    Robust Registration of Gaussian Mixtures for Colour Transfer

    Authors: Mairéad Grogan, Rozenn Dahyot

    Abstract: We present a flexible approach to colour transfer inspired by techniques recently proposed for shape registration. Colour distributions of the palette and target images are modelled with Gaussian Mixture Models (GMMs) that are robustly registered to infer a non linear parametric transfer function. We show experimentally that our approach compares well to current techniques both quantitatively and… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    ACM Class: I.4; I.4.3; I.5.1; I.5.4