Showing 1–2 of 2 results for author: Cuenat, S

Search v0.5.6 released 2020-02-24

arXiv:2203.07772 [pdf, other]

eess.IV cs.CV physics.optics

doi 10.1364/OE.458948

Fast Autofocusing using Tiny Transformer Networks for Digital Holographic Microscopy

Authors: Stéphane Cuenat, Louis Andréoli, Antoine N. André, Patrick Sandoz, Guillaume J. Laurent, Raphaël Couturier, Maxime Jacquot

Abstract: The numerical wavefront backpropagation principle of digital holography confers unique extended focus capabilities, without mechanical displacements along the z-axis. However, determining the correct focusing distance is a non-trivial and time-consuming issue. A deep learning (DL) solution is proposed that casts autofocusing as a regression problem and is tested on both experimental and simulated holograms. Single-wavelength digital holograms were recorded by a Digital Holographic Microscope (DHM) with a 10$\mathrm{x}$ microscope objective from a patterned target moving in 3D over an axial range of 92 $\mu$m. Tiny DL models are proposed and compared: a tiny Vision Transformer (TViT), a tiny VGG16 (TVGG) and a tiny Swin-Transformer (TSwinT). The proposed tiny networks are compared with their original versions (ViT/B16, VGG16 and Swin-Transformer Tiny) and with the main neural networks used in digital holography, such as LeNet and AlexNet. The experiments show that the predicted focusing distance $Z_R^{\mathrm{Pred}}$ is inferred with an accuracy of 1.2 $\mu$m on average, compared with the DHM depth of field of 15 $\mu$m. Numerical simulations show that all tiny models give $Z_R^{\mathrm{Pred}}$ with an error below 0.3 $\mu$m. Such a prospect would significantly improve the current capabilities of computer vision position sensing in applications such as 3D microscopy for life sciences or micro-robotics. Moreover, all models reach an inference time on CPU below 25 ms per inference. In terms of occlusions, TViT, based on its Transformer architecture, is the most robust.

Submitted 20 May, 2022; v1 submitted 15 March, 2022; originally announced March 2022.

arXiv:2108.09147 [pdf, other]

cs.CV eess.IV

Convolutional Neural Network (CNN) vs Vision Transformer (ViT) for Digital Holography

Authors: Stéphane Cuenat, Raphaël Couturier

Abstract: In Digital Holography (DH), it is crucial to extract the object distance from a hologram in order to reconstruct its amplitude and phase. This step is called auto-focusing, and it is conventionally solved by first reconstructing a stack of images and then evaluating the sharpness of each reconstructed image using a focus metric such as entropy or variance. The distance corresponding to the sharpest image is considered the focal position. This approach, while effective, is computationally demanding and time-consuming. In this paper, the distance is determined by Deep Learning (DL). Two DL architectures are compared: Convolutional Neural Network (CNN) and Vision Transformer (ViT). ViT and CNN are used to treat auto-focusing as a classification problem. Compared to a first attempt [11] in which the distance between two consecutive classes was 100 $\mu$m, our proposal allows us to drastically reduce this distance to 1 $\mu$m. Moreover, ViT reaches similar accuracy and is more robust than CNN.

Submitted 27 January, 2022; v1 submitted 20 August, 2021; originally announced August 2021.

Comments: 6 pages, 11 figures, ICCCR 2022 Conference
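
The conventional auto-focusing procedure summarized in the abstract of arXiv:2108.09147 (reconstruct a stack of images, score each with a focus metric, keep the sharpest) can be sketched as follows. This is a minimal illustration and not the authors' code. It assumes the complex field at the sensor is known, an amplitude-type object for which amplitude variance peaks at focus, and angular-spectrum propagation; the function names `angular_spectrum` and `autofocus` and all parameter values are hypothetical.

```python
import numpy as np


def angular_spectrum(field, distance, wavelength, pixel_size):
    """Propagate a complex field by `distance` metres (negative = backpropagation)."""
    ny, nx = field.shape
    fx = np.fft.fftfreq(nx, d=pixel_size)
    fy = np.fft.fftfreq(ny, d=pixel_size)
    fxx, fyy = np.meshgrid(fx, fy)
    # Squared axial spatial frequency; non-positive values are evanescent waves.
    arg = 1.0 / wavelength**2 - fxx**2 - fyy**2
    kz = 2.0 * np.pi * np.sqrt(np.maximum(arg, 0.0))
    transfer = np.where(arg > 0, np.exp(1j * kz * distance), 0)
    return np.fft.ifft2(np.fft.fft2(field) * transfer)


def autofocus(hologram_field, candidates, wavelength, pixel_size):
    """Return the candidate distance whose reconstruction has maximal amplitude variance."""
    scores = []
    for z in candidates:
        # Reconstruct one image of the stack by backpropagating to distance z.
        image = angular_spectrum(hologram_field, -z, wavelength, pixel_size)
        # Focus metric: variance of the reconstructed amplitude.
        scores.append(float(np.var(np.abs(image))))
    best = candidates[int(np.argmax(scores))]
    return best, scores


# Synthetic example (assumed values): a point object recorded 100 um from focus.
wavelength = 0.5e-6
pixel_size = 1.0e-6
obj = np.zeros((64, 64), dtype=complex)
obj[32, 32] = 1.0
hologram_field = angular_spectrum(obj, 100e-6, wavelength, pixel_size)
candidates = np.linspace(50e-6, 150e-6, 11)
best_distance, scores = autofocus(hologram_field, candidates, wavelength, pixel_size)
```

The cost grows linearly with the number of candidate distances, since each one requires a full reconstruction; this is the computational burden that the abstract says the learned approach is meant to avoid.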
