Search | arXiv e-print repository

Fast, accurate, and predictive method for atom detection in site-resolved images of microtrap arrays

Authors: Marc Cheneau, Romaric Journet, Matthieu Boffety, François Goudail, Caroline Kulcsár, Pauline Trouvé-Peloux

Abstract: We introduce a new method, rooted in estimation theory, to detect the individual atoms in site-resolved images of microtrap arrays, such as optical lattices or optical tweezers arrays. Using simulated images, we demonstrate a ten-fold reduction of the detection error rate compared to the popular method based on Wiener deconvolution, under a wide range of experimental conditions. The runtime is ful… ▽ More We introduce a new method, rooted in estimation theory, to detect the individual atoms in site-resolved images of microtrap arrays, such as optical lattices or optical tweezers arrays. Using simulated images, we demonstrate a ten-fold reduction of the detection error rate compared to the popular method based on Wiener deconvolution, under a wide range of experimental conditions. The runtime is fully compatible with real-time applications, even for a very large arrays. Finally, we propose a rigorous definition for the signal-to-noise ratio of an image, and show that it can be used as a predictor for the detection error rate, which opens new prospect for the design of future experiments. △ Less

Submitted 25 February, 2025; v1 submitted 12 February, 2025; originally announced February 2025.

arXiv:2107.14498 [pdf, other]

Pix2Point: Learning Outdoor 3D Using Sparse Point Clouds and Optimal Transport

Authors: Rémy Leroy, Pauline Trouvé-Peloux, Frédéric Champagnat, Bertrand Le Saux, Marcela Carvalho

Abstract: Good quality reconstruction and comprehension of a scene rely on 3D estimation methods. The 3D information was usually obtained from images by stereo-photogrammetry, but deep learning has recently provided us with excellent results for monocular depth estimation. Building up a sufficiently large and rich training dataset to achieve these results requires onerous processing. In this paper, we addre… ▽ More Good quality reconstruction and comprehension of a scene rely on 3D estimation methods. The 3D information was usually obtained from images by stereo-photogrammetry, but deep learning has recently provided us with excellent results for monocular depth estimation. Building up a sufficiently large and rich training dataset to achieve these results requires onerous processing. In this paper, we address the problem of learning outdoor 3D point cloud from monocular data using a sparse ground-truth dataset. We propose Pix2Point, a deep learning-based approach for monocular 3D point cloud prediction, able to deal with complete and challenging outdoor scenes. Our method relies on a 2D-3D hybrid neural network architecture, and a supervised end-to-end minimisation of an optimal transport divergence between point clouds. We show that, when trained on sparse point clouds, our simple promising approach achieves a better coverage of 3D outdoor scenes than efficient monocular depth methods. △ Less

Submitted 30 July, 2021; originally announced July 2021.

Comments: 5 pages, 2 figures, to be published in 2021 International Conference on Machine Vision Applications

ACM Class: I.4.5; I.2.10

arXiv:1911.07543 [pdf, other]

doi 10.1109/LGRS.2019.2947783

Multi-Task Learning of Height and Semantics from Aerial Images

Authors: Marcela Carvalho, Bertrand Le Saux, Pauline Trouvé-Peloux, Frédéric Champagnat, Andrés Almansa

Abstract: Aerial or satellite imagery is a great source for land surface analysis, which might yield land use maps or elevation models. In this investigation, we present a neural network framework for learning semantics and local height together. We show how this joint multi-task learning benefits to each task on the large dataset of the 2018 Data Fusion Contest. Moreover, our framework also yields an uncer… ▽ More Aerial or satellite imagery is a great source for land surface analysis, which might yield land use maps or elevation models. In this investigation, we present a neural network framework for learning semantics and local height together. We show how this joint multi-task learning benefits to each task on the large dataset of the 2018 Data Fusion Contest. Moreover, our framework also yields an uncertainty map which allows assessing the prediction of the model. Code is available at https://github.com/marcelampc/mtl_aerial_images . △ Less

Submitted 18 November, 2019; originally announced November 2019.

Comments: Published IEEE Geoscience and Remote Sensing Letters. Code https://github.com/marcelampc/mtl_aerial_images

arXiv:1911.01082 [pdf, other]

Technical Report: Co-learning of geometry and semantics for online 3D mapping

Authors: Marcela Carvalho, Maxime Ferrera, Alexandre Boulch, Julien Moras, Bertrand Le Saux, Pauline Trouvé-Peloux

Abstract: This paper is a technical report about our submission for the ECCV 2018 3DRMS Workshop Challenge on Semantic 3D Reconstruction \cite{Tylecek2018rms}. In this paper, we address 3D semantic reconstruction for autonomous navigation using co-learning of depth map and semantic segmentation. The core of our pipeline is a deep multi-task neural network which tightly refines depth and also produces accur… ▽ More This paper is a technical report about our submission for the ECCV 2018 3DRMS Workshop Challenge on Semantic 3D Reconstruction \cite{Tylecek2018rms}. In this paper, we address 3D semantic reconstruction for autonomous navigation using co-learning of depth map and semantic segmentation. The core of our pipeline is a deep multi-task neural network which tightly refines depth and also produces accurate semantic segmentation maps. Its inputs are an image and a raw depth map produced from a pair of images by standard stereo vision. The resulting semantic 3D point clouds are then merged in order to create a consistent 3D mesh, in turn used to produce dense semantic 3D reconstruction maps. The performances of each step of the proposed method are evaluated on the dataset and multiple tasks of the 3DRMS Challenge, and repeatedly surpass state-of-the-art approaches. △ Less

Submitted 4 November, 2019; originally announced November 2019.

arXiv:1910.14532 [pdf, other]

doi 10.1177/0278364919883346

AQUALOC: An Underwater Dataset for Visual-Inertial-Pressure Localization

Authors: Maxime Ferrera, Vincent Creuze, Julien Moras, Pauline Trouvé-Peloux

Abstract: We present a new dataset, dedicated to the development of simultaneous localization and mapping methods for underwater vehicles navigating close to the seabed. The data sequences composing this dataset are recorded in three different environments: a harbor at a depth of a few meters, a first archaeological site at a depth of 270 meters and a second site at a depth of 380 meters. The data acquisiti… ▽ More We present a new dataset, dedicated to the development of simultaneous localization and mapping methods for underwater vehicles navigating close to the seabed. The data sequences composing this dataset are recorded in three different environments: a harbor at a depth of a few meters, a first archaeological site at a depth of 270 meters and a second site at a depth of 380 meters. The data acquisition is performed using Remotely Operated Vehicles equipped with a monocular monochromatic camera, a low-cost inertial measurement unit, a pressure sensor and a computing unit, all embedded in a single enclosure. The sensors' measurements are recorded synchronously on the computing unit and seventeen sequences have been created from all the acquired data. These sequences are made available in the form of ROS bags and as raw data. For each sequence, a trajectory has also been computed offline using a Structure-from-Motion library in order to allow the comparison with real-time localization methods. With the release of this dataset, we wish to provide data difficult to acquire and to encourage the development of vision-based localization methods dedicated to the underwater environment. The dataset can be downloaded from: http://www.lirmm.fr/aqualoc/ △ Less

Submitted 31 October, 2019; originally announced October 2019.

Comments: The International Journal of Robotics Research, SAGE Publications, 2019

arXiv:1809.07076 [pdf, other]

The Aqualoc Dataset: Towards Real-Time Underwater Localization from a Visual-Inertial-Pressure Acquisition System

Authors: Maxime Ferrera, Julien Moras, Pauline Trouvé-Peloux, Vincent Creuze, Denis Dégez

Abstract: This paper presents a new underwater dataset acquired from a visual-inertial-pressure acquisition system and meant to be used to benchmark visual odometry, visual SLAM and multi-sensors SLAM solutions. The dataset is publicly available and contains ground-truth trajectories for evaluation. This paper presents a new underwater dataset acquired from a visual-inertial-pressure acquisition system and meant to be used to benchmark visual odometry, visual SLAM and multi-sensors SLAM solutions. The dataset is publicly available and contains ground-truth trajectories for evaluation. △ Less

Submitted 19 September, 2018; originally announced September 2018.

Journal ref: IROS Workshop - New Horizons for Underwater Intervention Missions: from Current Technologies to Future Applications, Oct 2018, Madrid, Spain

arXiv:1809.01567 [pdf, other]

Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks?

Authors: Marcela Carvalho, Bertrand Le Saux, Pauline Trouvé-Peloux, Andrés Almansa, Frédéric Champagnat

Abstract: Depth estimation is of critical interest for scene understanding and accurate 3D reconstruction. Most recent approaches in depth estimation with deep learning exploit geometrical structures of standard sharp images to predict corresponding depth maps. However, cameras can also produce images with defocus blur depending on the depth of the objects and camera settings. Hence, these features may repr… ▽ More Depth estimation is of critical interest for scene understanding and accurate 3D reconstruction. Most recent approaches in depth estimation with deep learning exploit geometrical structures of standard sharp images to predict corresponding depth maps. However, cameras can also produce images with defocus blur depending on the depth of the objects and camera settings. Hence, these features may represent an important hint for learning to predict depth. In this paper, we propose a full system for single-image depth prediction in the wild using depth-from-defocus and neural networks. We carry out thorough experiments to test deep convolutional networks on real and simulated defocused images using a realistic model of blur variation with respect to depth. We also investigate the influence of blur on depth prediction observing model uncertainty with a Bayesian neural network approach. From these studies, we show that out-of-focus blur greatly improves the depth-prediction network performances. Furthermore, we transfer the ability learned on a synthetic, indoor dataset to real, indoor and outdoor images. For this purpose, we present a new dataset containing real all-focus and defocused images from a Digital Single-Lens Reflex (DSLR) camera, paired with ground truth depth maps obtained with an active 3D sensor for indoor scenes. The proposed approach is successfully validated on both this new dataset and standard ones as NYUv2 or Depth-in-the-Wild. Code and new datasets are available at https://github.com/marcelampc/d3net_depth_estimation △ Less

Submitted 6 September, 2018; v1 submitted 5 September, 2018; originally announced September 2018.

Comments: 3DRW Workshop ECCV 2018. Code: https://github.com/marcelampc/d3net_depth_estimation

arXiv:1806.05842 [pdf, other]

Real-time Monocular Visual Odometry for Turbid and Dynamic Underwater Environments

Authors: Maxime Ferrera, Julien Moras, Pauline Trouvé-Peloux, Vincent Creuze

Abstract: In the context of robotic underwater operations, the visual degradations induced by the medium properties make difficult the exclusive use of cameras for localization purpose. Hence, most localization methods are based on expensive navigational sensors associated with acoustic positioning. On the other hand, visual odometry and visual SLAM have been exhaustively studied for aerial or terrestrial a… ▽ More In the context of robotic underwater operations, the visual degradations induced by the medium properties make difficult the exclusive use of cameras for localization purpose. Hence, most localization methods are based on expensive navigational sensors associated with acoustic positioning. On the other hand, visual odometry and visual SLAM have been exhaustively studied for aerial or terrestrial applications, but state-of-the-art algorithms fail underwater. In this paper we tackle the problem of using a simple low-cost camera for underwater localization and propose a new monocular visual odometry method dedicated to the underwater environment. We evaluate different tracking methods and show that optical flow based tracking is more suited to underwater images than classical approaches based on descriptors. We also propose a keyframe-based visual odometry approach highly relying on nonlinear optimization. The proposed algorithm has been assessed on both simulated and real underwater datasets and outperforms state-of-the-art visual SLAM methods under many of the most challenging conditions. The main application of this work is the localization of Remotely Operated Vehicles (ROVs) used for underwater archaeological missions but the developed system can be used in any other applications as long as visual information is available. △ Less

Submitted 27 February, 2020; v1 submitted 15 June, 2018; originally announced June 2018.

Journal ref: Sensors, MDPI, 2019

Showing 1–8 of 8 results for author: Trouvé-Peloux, P