Skip to main content

Showing 1–4 of 4 results for author: Felsen, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:1908.04781  [pdf, other

    cs.CV

    Predicting 3D Human Dynamics from Video

    Authors: Jason Y. Zhang, Panna Felsen, Angjoo Kanazawa, Jitendra Malik

    Abstract: Given a video of a person in action, we can easily guess the 3D future motion of the person. In this work, we present perhaps the first approach for predicting a future 3D mesh model sequence of a person from past video input. We do this for periodic motions such as walking and also actions like bowling and squatting seen in sports or workout videos. While there has been a surge of future predicti… ▽ More

    Submitted 20 August, 2019; v1 submitted 13 August, 2019; originally announced August 2019.

    Comments: To Appear in ICCV 2019. (v2: Updated "3D Pose from Video" in Related Work.)

  2. arXiv:1812.01601  [pdf, other

    cs.CV

    Learning 3D Human Dynamics from Video

    Authors: Angjoo Kanazawa, Jason Y. Zhang, Panna Felsen, Jitendra Malik

    Abstract: From an image of a person in action, we can easily guess the 3D motion of the person in the immediate past and future. This is because we have a mental model of 3D human dynamics that we have acquired from observing visual sequences of humans in motion. We present a framework that can similarly learn a representation of 3D dynamics of humans from video via a simple but effective temporal encoding… ▽ More

    Submitted 16 September, 2019; v1 submitted 4 December, 2018; originally announced December 2018.

    Comments: To appear in CVPR 2019. Changelog: v3. +an experiment to compare improvement from pseudo-gt data on single view vs temporal context model. v2. camready ver: Minor update in model training where the gaussian shape prior is used, updated results (similar results, same trends), added more ablation study in the appendix. v1. +evaluation protocol subsection in appendix, updated results due to bug fix

  3. arXiv:1508.00271  [pdf, other

    cs.CV

    Recurrent Network Models for Human Dynamics

    Authors: Katerina Fragkiadaki, Sergey Levine, Panna Felsen, Jitendra Malik

    Abstract: We propose the Encoder-Recurrent-Decoder (ERD) model for recognition and prediction of human body pose in videos and motion capture. The ERD model is a recurrent neural network that incorporates nonlinear encoder and decoder networks before and after recurrent layers. We test instantiations of ERD architectures in the tasks of motion capture (mocap) generation, body pose labeling and body pose for… ▽ More

    Submitted 28 September, 2015; v1 submitted 2 August, 2015; originally announced August 2015.

    Comments: International Conference on Computer Vision 2015

  4. arXiv:1412.6504  [pdf, other

    cs.CV

    Learning to Segment Moving Objects in Videos

    Authors: Katerina Fragkiadaki, Pablo Arbelaez, Panna Felsen, Jitendra Malik

    Abstract: We segment moving objects in videos by ranking spatio-temporal segment proposals according to "moving objectness": how likely they are to contain a moving object. In each video frame, we compute segment proposals using multiple figure-ground segmentations on per frame motion boundaries. We rank them with a Moving Objectness Detector trained on image and motion fields to detect moving objects and d… ▽ More

    Submitted 7 May, 2015; v1 submitted 19 December, 2014; originally announced December 2014.