Skip to main content

Showing 1–9 of 9 results for author: Matzen, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2212.03239  [pdf, other

    cs.CV

    Perspective Fields for Single Image Camera Calibration

    Authors: Linyi Jin, Jianming Zhang, Yannick Hold-Geoffroy, Oliver Wang, Kevin Matzen, Matthew Sticha, David F. Fouhey

    Abstract: Geometric camera calibration is often required for applications that understand the perspective of the image. We propose perspective fields as a representation that models the local perspective properties of an image. Perspective Fields contain per-pixel information about the camera view, parameterized as an up vector and a latitude value. This representation has a number of advantages as it makes… ▽ More

    Submitted 16 March, 2023; v1 submitted 6 December, 2022; originally announced December 2022.

    Comments: CVPR 2023 Camera Ready. Project Page https://jinlinyi.github.io/PerspectiveFields/

  2. arXiv:2008.12298  [pdf, other

    cs.CV cs.GR

    One Shot 3D Photography

    Authors: Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, Peizhao Zhang, Zijian He, Peter Vajda, Ayush Saraf, Michael Cohen

    Abstract: 3D photography is a new medium that allows viewers to more fully experience a captured moment. In this work, we refer to a 3D photo as one that displays parallax induced by moving the viewpoint (as opposed to a stereo pair with a fixed viewpoint). 3D photos are static in time, like traditional photos, but are displayed with interactive parallax on mobile or desktop screens, as well as on Virtual R… ▽ More

    Submitted 1 September, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Project page: https://facebookresearch.github.io/one_shot_3d_photography/ Code: https://github.com/facebookresearch/one_shot_3d_photography

    Journal ref: ACM Transactions on Graphics (Proceedings of SIGGRAPH 2020), Volume 39, Number 4, 2020

  3. arXiv:2004.15021  [pdf, other

    cs.CV

    Consistent Video Depth Estimation

    Authors: Xuan Luo, Jia-Bin Huang, Richard Szeliski, Kevin Matzen, Johannes Kopf

    Abstract: We present an algorithm for reconstructing dense, geometrically consistent depth for all pixels in a monocular video. We leverage a conventional structure-from-motion reconstruction to establish geometric constraints on pixels in the video. Unlike the ad-hoc priors in classical reconstruction, we use a learning-based prior, i.e., a convolutional neural network trained for single-image depth estima… ▽ More

    Submitted 26 August, 2020; v1 submitted 30 April, 2020; originally announced April 2020.

    Comments: SIGGRAPH 2020. Video: https://www.youtube.com/watch?v=5Tia2oblJAg Project page: https://roxanneluo.github.io/Consistent-Video-Depth-Estimation/ Code: https://github.com/facebookresearch/consistent_depth

  4. arXiv:1908.11412  [pdf, other

    cs.CV

    GeoStyle: Discovering Fashion Trends and Events

    Authors: Utkarsh Mall, Kevin Matzen, Bharath Hariharan, Noah Snavely, Kavita Bala

    Abstract: Understanding fashion styles and trends is of great potential interest to retailers and consumers alike. The photos people upload to social media are a historical and public data source of how people dress across the world and at different times. While we now have tools to automatically recognize the clothing and style attributes of what people are wearing in these photographs, we lack the ability… ▽ More

    Submitted 29 August, 2019; originally announced August 2019.

    Comments: Accepted in ICCV 2019

  5. arXiv:1905.06326  [pdf, other

    cs.CV cs.GR

    Synthetic Defocus and Look-Ahead Autofocus for Casual Videography

    Authors: Xuaner Zhang, Kevin Matzen, Vivien Nguyen, Dillon Yao, You Zhang, Ren Ng

    Abstract: In cinema, large camera lenses create beautiful shallow depth of field (DOF), but make focusing difficult and expensive. Accurate cinema focus usually relies on a script and a person to control focus in realtime. Casual videographers often crave cinematic focus, but fail to achieve it. We either sacrifice shallow DOF, as in smartphone videos; or we struggle to deliver accurate focus, as in videos… ▽ More

    Submitted 21 May, 2019; v1 submitted 15 May, 2019; originally announced May 2019.

    Comments: (V2 author name corrected) SIGGRAPH 2019; project website: https://ceciliavision.github.io/vid-auto-focus/

  6. arXiv:1804.00650  [pdf, other

    cs.CV

    DeepMVS: Learning Multi-view Stereopsis

    Authors: Po-Han Huang, Kevin Matzen, Johannes Kopf, Narendra Ahuja, Jia-Bin Huang

    Abstract: We present DeepMVS, a deep convolutional neural network (ConvNet) for multi-view stereo reconstruction. Taking an arbitrary number of posed images as input, we first produce a set of plane-sweep volumes and use the proposed DeepMVS network to predict high-quality disparity maps. The key contributions that enable these results are (1) supervised pretraining on a photorealistic synthetic dataset, (2… ▽ More

    Submitted 2 April, 2018; originally announced April 2018.

    Comments: CVPR 2018. Project page: https://phuang17.github.io/DeepMVS/ Code: https://github.com/phuang17/DeepMVS

  7. arXiv:1712.05790  [pdf, other

    cs.CV cs.LG stat.ML

    Deep Burst Denoising

    Authors: Clément Godard, Kevin Matzen, Matt Uyttendaele

    Abstract: Noise is an inherent issue of low-light image capture, one which is exacerbated on mobile devices due to their narrow apertures and small sensors. One strategy for mitigating noise in a low-light situation is to increase the shutter time of the camera, thus allowing each photosite to integrate more light and decrease noise variance. However, there are two downsides of long exposures: (a) bright re… ▽ More

    Submitted 15 December, 2017; originally announced December 2017.

  8. arXiv:1706.01869  [pdf, other

    cs.CV

    StreetStyle: Exploring world-wide clothing styles from millions of photos

    Authors: Kevin Matzen, Kavita Bala, Noah Snavely

    Abstract: Each day billions of photographs are uploaded to photo-sharing services and social media platforms. These images are packed with information about how people live around the world. In this paper we exploit this rich trove of data to understand fashion and style trends worldwide. We present a framework for visual discovery at scale, analyzing clothing and fashion across millions of images of people… ▽ More

    Submitted 6 June, 2017; originally announced June 2017.

  9. arXiv:1605.02196  [pdf, other

    eess.SY cs.CV cs.LG cs.RO

    All Weather Perception: Joint Data Association, Tracking, and Classification for Autonomous Ground Vehicles

    Authors: Peter Radecki, Mark Campbell, Kevin Matzen

    Abstract: A novel probabilistic perception algorithm is presented as a real-time joint solution to data association, object tracking, and object classification for an autonomous ground vehicle in all-weather conditions. The presented algorithm extends a Rao-Blackwellized Particle Filter originally built with a particle filter for data association and a Kalman filter for multi-object tracking (Miller et al.… ▽ More

    Submitted 7 May, 2016; originally announced May 2016.

    Comments: 35 pages, 21 figures, 14 tables