-
MEDeA: Multi-view Efficient Depth Adjustment
Authors:
Mikhail Artemyev,
Anna Vorontsova,
Anna Sokolova,
Alexander Limonov
Abstract:
The majority of modern single-view depth estimation methods predict relative depth and thus cannot be directly applied in many real-world scenarios, despite impressive performance in the benchmarks. Moreover, single-view approaches cannot guarantee consistency across a sequence of frames. Consistency is typically addressed with test-time optimization of discrepancy across views; however, it takes…
▽ More
The majority of modern single-view depth estimation methods predict relative depth and thus cannot be directly applied in many real-world scenarios, despite impressive performance in the benchmarks. Moreover, single-view approaches cannot guarantee consistency across a sequence of frames. Consistency is typically addressed with test-time optimization of discrepancy across views; however, it takes hours to process a single scene. In this paper, we present MEDeA, an efficient multi-view test-time depth adjustment method, that is an order of magnitude faster than existing test-time approaches. Given RGB frames with camera parameters, MEDeA predicts initial depth maps, adjusts them by optimizing local scaling coefficients, and outputs temporally-consistent depth maps. Contrary to test-time methods requiring normals, optical flow, or semantics estimation, MEDeA produces high-quality predictions with a depth estimation network solely. Our method sets a new state-of-the-art on TUM RGB-D, 7Scenes, and ScanNet benchmarks and successfully handles smartphone-captured data from ARKitScenes dataset.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data
Authors:
Nikolay Patakin,
Mikhail Romanov,
Anna Vorontsova,
Mikhail Artemyev,
Anton Konushin
Abstract:
Nowadays, robotics, AR, and 3D modeling applications attract considerable attention to single-view depth estimation (SVDE) as it allows estimating scene geometry from a single RGB image. Recent works have demonstrated that the accuracy of an SVDE method hugely depends on the diversity and volume of the training data. However, RGB-D datasets obtained via depth capturing or 3D reconstruction are typ…
▽ More
Nowadays, robotics, AR, and 3D modeling applications attract considerable attention to single-view depth estimation (SVDE) as it allows estimating scene geometry from a single RGB image. Recent works have demonstrated that the accuracy of an SVDE method hugely depends on the diversity and volume of the training data. However, RGB-D datasets obtained via depth capturing or 3D reconstruction are typically small, synthetic datasets are not photorealistic enough, and all these datasets lack diversity. The large-scale and diverse data can be sourced from stereo images or stereo videos from the web. Typically being uncalibrated, stereo data provides disparities up to unknown shift (geometrically incomplete data), so stereo-trained SVDE methods cannot recover 3D geometry. It was recently shown that the distorted point clouds obtained with a stereo-trained SVDE method can be corrected with additional point cloud modules (PCM) separately trained on the geometrically complete data. On the contrary, we propose GP$^{2}$, General-Purpose and Geometry-Preserving training scheme, and show that conventional SVDE models can learn correct shifts themselves without any post-processing, benefiting from using stereo data even in the geometry-preserving setting. Through experiments on different dataset mixtures, we prove that GP$^{2}$-trained models outperform methods relying on PCM in both accuracy and speed, and report the state-of-the-art results in the general-purpose geometry-preserving SVDE. Moreover, we show that SVDE models can learn to predict geometrically correct depth even when geometrically complete data comprises the minor part of the training set.
△ Less
Submitted 5 June, 2023;
originally announced June 2023.
-
p-State Luminescence in CdSe Nanoplatelets: The Role of Lateral Confinement and an LO Phonon Bottleneck
Authors:
Alexander W. Achtstein,
Riccardo Scott,
Sebastian Kickhöfel,
Stefan T. Jagsch,
Sotirios Christodoulou,
Anatol V. Prudnikau,
Artsiom Antanovich,
Mikhail Artemyev,
Iwan Moreels,
Andrei Schliwa,
Ulrike Woggon
Abstract:
We report excited state emission from p-states at excitation fluences well below ground state saturation in CdSe nanoplatelets. Size dependent exciton ground state-excited state energies and dynamics are determined by three independent methods, time-resolved photoluminescence (PL), time-integrated PL and Hartree renormalized k$\cdot$p calculations -- all in very good agreement. The ground state-ex…
▽ More
We report excited state emission from p-states at excitation fluences well below ground state saturation in CdSe nanoplatelets. Size dependent exciton ground state-excited state energies and dynamics are determined by three independent methods, time-resolved photoluminescence (PL), time-integrated PL and Hartree renormalized k$\cdot$p calculations -- all in very good agreement. The ground state-excited state energy spacing strongly increases with the lateral platelet quantization. Our results suggest that the PL decay of CdSe platelets is governed by an LO-phonon bottleneck, related to the reported low exciton phonon coupling in CdSe platelets and only observable due to the very large oscillator strength and energy spacing of both states.
△ Less
Submitted 20 July, 2015;
originally announced July 2015.