Skip to main content

Showing 1–7 of 7 results for author: Krainin, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.06006  [pdf, other

    cs.CV

    CamCtrl3D: Single-Image Scene Exploration with Precise 3D Camera Control

    Authors: Stefan Popov, Amit Raj, Michael Krainin, Yuanzhen Li, William T. Freeman, Michael Rubinstein

    Abstract: We propose a method for generating fly-through videos of a scene, from a single image and a given camera trajectory. We build upon an image-to-video latent diffusion model. We condition its UNet denoiser on the camera trajectory, using four techniques. (1) We condition the UNet's temporal blocks on raw camera extrinsics, similar to MotionCtrl. (2) We use images containing camera rays and direction… ▽ More

    Submitted 31 January, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: To be published in 3DV 2025. Project page at https://camctrl3d.github.io

  2. arXiv:2401.01461  [pdf, other

    cs.CV

    Efficient Hybrid Zoom using Camera Fusion on Mobile Phones

    Authors: Xiaotong Wu, Wei-Sheng Lai, YiChang Shih, Charles Herrmann, Michael Krainin, Deqing Sun, Chia-Kai Liang

    Abstract: DSLR cameras can achieve multiple zoom levels via shifting lens distances or swapping lens types. However, these techniques are not possible on smartphone devices due to space constraints. Most smartphone manufacturers adopt a hybrid zoom system: commonly a Wide (W) camera at a low zoom level and a Telephoto (T) camera at a high zoom level. To simulate zoom levels between W and T, these systems cr… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

    Comments: Accepted to SIGGRAPH Asia 2023 (ACM TOG). Project website: https://www.wslai.net/publications/fusion_zoom

  3. arXiv:2207.11617  [pdf, other

    cs.CV cs.GR

    Face Deblurring using Dual Camera Fusion on Mobile Phones

    Authors: Wei-Sheng Lai, YiChang Shih, Lun-Cheng Chu, Xiaotong Wu, Sung-Fang Tsai, Michael Krainin, Deqing Sun, Chia-Kai Liang

    Abstract: Motion blur of fast-moving subjects is a longstanding problem in photography and very common on mobile phones due to limited light collection efficiency, particularly in low-light conditions. While we have witnessed great progress in image deblurring in recent years, most methods require significant computational power and have limitations in processing high-resolution photos with severe local mot… ▽ More

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: Accepted to SIGGRAPH 2022 (ACM TOG). Project websit: https://www.wslai.net/publications/fusion_deblur/

  4. arXiv:2109.01068  [pdf, other

    cs.CV cs.GR

    SLIDE: Single Image 3D Photography with Soft Layering and Depth-aware Inpainting

    Authors: Varun Jampani, Huiwen Chang, Kyle Sargent, Abhishek Kar, Richard Tucker, Michael Krainin, Dominik Kaeser, William T. Freeman, David Salesin, Brian Curless, Ce Liu

    Abstract: Single image 3D photography enables viewers to view a still image from novel viewpoints. Recent approaches combine monocular depth networks with inpainting networks to achieve compelling results. A drawback of these techniques is the use of hard depth layering, making them unable to model intricate appearance details such as thin hair-like structures. We present SLIDE, a modular and unified system… ▽ More

    Submitted 2 September, 2021; originally announced September 2021.

    Comments: ICCV 2021 (Oral); Project page: https://varunjampani.github.io/slide ; Video: https://www.youtube.com/watch?v=RQio7q-ueY8

  5. arXiv:2104.14544  [pdf, other

    cs.CV

    AutoFlow: Learning a Better Training Set for Optical Flow

    Authors: Deqing Sun, Daniel Vlasic, Charles Herrmann, Varun Jampani, Michael Krainin, Huiwen Chang, Ramin Zabih, William T. Freeman, Ce Liu

    Abstract: Synthetic datasets play a critical role in pre-training CNN models for optical flow, but they are painstaking to generate and hard to adapt to new applications. To automate the process, we present AutoFlow, a simple and effective method to render training data for optical flow that optimizes the performance of a model on a target dataset. AutoFlow takes a layered approach to render synthetic data,… ▽ More

    Submitted 29 April, 2021; originally announced April 2021.

    Comments: CVPR 2021

  6. arXiv:2011.11784  [pdf, other

    cs.CV

    Robust image stitching with multiple registrations

    Authors: Charles Herrmann, Chen Wang, Richard Strong Bowen, Emil Keyder, Michael Krainin, Ce Liu, Ramin Zabih

    Abstract: Panorama creation is one of the most widely deployed techniques in computer vision. In addition to industry applications such as Google Street View, it is also used by millions of consumers in smartphones and other cameras. Traditionally, the problem is decomposed into three phases: registration, which picks a single transformation of each source image to align it to the other inputs, seam finding… ▽ More

    Submitted 23 November, 2020; originally announced November 2020.

    Comments: ECCV 2018

  7. Handheld Multi-Frame Super-Resolution

    Authors: Bartlomiej Wronski, Ignacio Garcia-Dorado, Manfred Ernst, Damien Kelly, Michael Krainin, Chia-Kai Liang, Marc Levoy, Peyman Milanfar

    Abstract: Compared to DSLR cameras, smartphone cameras have smaller sensors, which limits their spatial resolution; smaller apertures, which limits their light gathering ability; and smaller pixels, which reduces their signal-to noise ratio. The use of color filter arrays (CFAs) requires demosaicing, which further degrades resolution. In this paper, we supplant the use of traditional demosaicing in single-f… ▽ More

    Submitted 16 February, 2021; v1 submitted 8 May, 2019; originally announced May 2019.

    Comments: 24 pages, accepted to Siggraph 2019 Technical Papers program