Skip to main content

Showing 1–14 of 14 results for author: Bregler, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.11697  [pdf, other

    cs.CY

    AMMeBa: A Large-Scale Survey and Dataset of Media-Based Misinformation In-The-Wild

    Authors: Nicholas Dufour, Arkanath Pathak, Pouya Samangouei, Nikki Hariri, Shashi Deshetti, Andrew Dudfield, Christopher Guess, Pablo Hernández Escayola, Bobby Tran, Mevan Babakar, Christoph Bregler

    Abstract: The prevalence and harms of online misinformation is a perennial concern for internet platforms, institutions and society at large. Over time, information shared online has become more media-heavy and misinformation has readily adapted to these new modalities. The rise of generative AI-based tools, which provide widely-accessible methods for synthesizing realistic audio, images, video and human-li… ▽ More

    Submitted 21 May, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

    Comments: Grammar, spelling corrections. Minor rewording and clarification of one sentence. 24 pages, 31 figures

  2. Training-Free Neural Matte Extraction for Visual Effects

    Authors: Sharif Elcott, J. P. Lewis, Nori Kanazawa, Christoph Bregler

    Abstract: Alpha matting is widely used in video conferencing as well as in movies, television, and social media sites. Deep learning approaches to the matte extraction problem are well suited to video conferencing due to the consistent subject matter (front-facing humans), however training-based approaches are somewhat pointless for entertainment videos where varied subjects (spaceships, monsters, etc.) may… ▽ More

    Submitted 29 June, 2023; originally announced June 2023.

    ACM Class: I.4.6

    Journal ref: SIGGRAPH Asia 2022 Technical Communications

  3. arXiv:2207.14534  [pdf, other

    cs.MM

    ACM Multimedia Grand Challenge on Detecting Cheapfakes

    Authors: Shivangi Aneja, Cise Midoglu, Duc-Tien Dang-Nguyen, Sohail Ahmed Khan, Michael Riegler, Pål Halvorsen, Chris Bregler, Balu Adsumilli

    Abstract: Cheapfake is a recently coined term that encompasses non-AI (``cheap'') manipulations of multimedia content. Cheapfakes are known to be more prevalent than deepfakes. Cheapfake media can be created using editing software for image/video manipulations, or even without using any software, by simply altering the context of an image/video by sharing the media alongside misleading claims. This alterati… ▽ More

    Submitted 29 July, 2022; originally announced July 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2107.05297

  4. arXiv:2107.05297  [pdf, other

    cs.MM

    MMSys'21 Grand Challenge on Detecting Cheapfakes

    Authors: Shivangi Aneja, Cise Midoglu, Duc-Tien Dang-Nguyen, Michael Alexander Riegler, Paal Halvorsen, Matthias Niessner, Balu Adsumilli, Chris Bregler

    Abstract: Cheapfake is a recently coined term that encompasses non-AI ("cheap") manipulations of multimedia content. Cheapfakes are known to be more prevalent than deepfakes. Cheapfake media can be created using editing software for image/video manipulations, or even without using any software, by simply altering the context of an image/video by sharing the media alongside misleading claims. This alteration… ▽ More

    Submitted 12 July, 2021; originally announced July 2021.

  5. arXiv:2106.04185  [pdf, other

    cs.CV

    LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces from Video using Pose and Lighting Normalization

    Authors: Avisek Lahiri, Vivek Kwatra, Christian Frueh, John Lewis, Chris Bregler

    Abstract: In this paper, we present a video-based learning framework for animating personalized 3D talking faces from audio. We introduce two training-time data normalizations that significantly improve data sample efficiency. First, we isolate and represent faces in a normalized space that decouples 3D geometry, head pose, and texture. This decomposes the prediction problem into regressions over the 3D fac… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted to IEEE CVPR 2021. Brief demo video available at: https://www.youtube.com/watch?v=L1StbX9OznY

  6. arXiv:2101.06278  [pdf, other

    cs.CV cs.AI

    COSMOS: Catching Out-of-Context Misinformation with Self-Supervised Learning

    Authors: Shivangi Aneja, Chris Bregler, Matthias Nießner

    Abstract: Despite the recent attention to DeepFakes, one of the most prevalent ways to mislead audiences on social media is the use of unaltered images in a new but false context. To address these challenges and support fact-checkers, we propose a new method that automatically detects out-of-context image and text pairs. Our key insight is to leverage the grounding of image with text to distinguish out-of-c… ▽ More

    Submitted 21 April, 2021; v1 submitted 15 January, 2021; originally announced January 2021.

    Comments: Video : https://youtu.be/riI3Cl2xy10

  7. arXiv:2007.15506  [pdf, other

    cs.CV

    SimPose: Effectively Learning DensePose and Surface Normals of People from Simulated Data

    Authors: Tyler Zhu, Per Karlsson, Christoph Bregler

    Abstract: With a proliferation of generic domain-adaptation approaches, we report a simple yet effective technique for learning difficult per-pixel 2.5D and 3D regression representations of articulated people. We obtained strong sim-to-real domain generalization for the 2.5D DensePose estimation task and the 3D human surface normal estimation task. On the multi-person DensePose MSCOCO benchmark, our approac… ▽ More

    Submitted 30 July, 2020; originally announced July 2020.

    Comments: To appear in the Proceedings of ECCV 2020

  8. arXiv:1901.10024  [pdf, other

    cs.LG cs.GR stat.ML

    Cross-Domain Image Manipulation by Demonstration

    Authors: Ben Usman, Nick Dufour, Kate Saenko, Chris Bregler

    Abstract: In this work we propose a model that can manipulate individual visual attributes of objects in a real scene using examples of how respective attribute manipulations affect the output of a simulation. As an example, we train our model to manipulate the expression of a human face using nonphotorealistic 3D renders of a face with varied expression. Our model manages to preserve all other visual attri… ▽ More

    Submitted 3 April, 2019; v1 submitted 28 January, 2019; originally announced January 2019.

  9. arXiv:1701.01779  [pdf, other

    cs.CV

    Towards Accurate Multi-person Pose Estimation in the Wild

    Authors: George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

    Abstract: We propose a method for multi-person detection and 2-D pose estimation that achieves state-of-art results on the challenging COCO keypoints task. It is a simple, yet powerful, top-down approach consisting of two stages. In the first stage, we predict the location and scale of boxes which are likely to contain people; for this we use the Faster RCNN detector. In the second stage, we estimate the… ▽ More

    Submitted 14 April, 2017; v1 submitted 6 January, 2017; originally announced January 2017.

    Comments: Paper describing an improved version of the G-RMI entry to the 2016 COCO keypoints challenge (http://image-net.org/challenges/ilsvrc+coco2016). Camera ready version to appear in the Proceedings of CVPR 2017

  10. arXiv:1411.4280  [pdf, other

    cs.CV

    Efficient Object Localization Using Convolutional Networks

    Authors: Jonathan Tompson, Ross Goroshin, Arjun Jain, Yann LeCun, Christopher Bregler

    Abstract: Recent state-of-the-art performance on human-body pose estimation has been achieved with Deep Convolutional Networks (ConvNets). Traditional ConvNet architectures include pooling and sub-sampling layers which reduce computational requirements, introduce invariance and prevent over-training. These benefits of pooling come at the cost of reduced localization accuracy. We introduce a novel architectu… ▽ More

    Submitted 9 June, 2015; v1 submitted 16 November, 2014; originally announced November 2014.

    Comments: 8 pages with 1 page of citations

  11. arXiv:1409.7963  [pdf, other

    cs.CV cs.LG cs.NE

    MoDeep: A Deep Learning Framework Using Motion Features for Human Pose Estimation

    Authors: Arjun Jain, Jonathan Tompson, Yann LeCun, Christoph Bregler

    Abstract: In this work, we propose a novel and efficient method for articulated human pose estimation in videos using a convolutional network architecture, which incorporates both color and motion features. We propose a new human body pose dataset, FLIC-motion, that extends the FLIC dataset with additional motion features. We apply our architecture to this dataset and report significantly better performance… ▽ More

    Submitted 28 September, 2014; originally announced September 2014.

  12. arXiv:1406.2984  [pdf, other

    cs.CV

    Joint Training of a Convolutional Network and a Graphical Model for Human Pose Estimation

    Authors: Jonathan Tompson, Arjun Jain, Yann LeCun, Christoph Bregler

    Abstract: This paper proposes a new hybrid architecture that consists of a deep Convolutional Network and a Markov Random Field. We show how this architecture is successfully applied to the challenging problem of articulated human pose estimation in monocular images. The architecture can exploit structural domain constraints such as geometric relationships between body joint locations. We show that joint tr… ▽ More

    Submitted 17 September, 2014; v1 submitted 11 June, 2014; originally announced June 2014.

  13. arXiv:1312.7302  [pdf, other

    cs.CV cs.LG cs.NE

    Learning Human Pose Estimation Features with Convolutional Networks

    Authors: Arjun Jain, Jonathan Tompson, Mykhaylo Andriluka, Graham W. Taylor, Christoph Bregler

    Abstract: This paper introduces a new architecture for human pose estimation using a multi- layer convolutional network architecture and a modified learning technique that learns low-level features and higher-level weak spatial models. Unconstrained human pose estimation is one of the hardest problems in computer vision, and our new architecture and learning schema shows significant improvement over the cur… ▽ More

    Submitted 23 April, 2014; v1 submitted 27 December, 2013; originally announced December 2013.

    Report number: NYU-TR-2013-CS0999

  14. arXiv:1204.3596  [pdf, other

    cs.SI cs.HC

    Markerless Motion Capture in the Crowd

    Authors: Ian Spiro, Thomas Huston, Christoph Bregler

    Abstract: This work uses crowdsourcing to obtain motion capture data from video recordings. The data is obtained by information workers who click repeatedly to indicate body configurations in the frames of a video, resulting in a model of 2D structure over time. We discuss techniques to optimize the tracking task and strategies for maximizing accuracy and efficiency. We show visualizations of a variety of m… ▽ More

    Submitted 16 April, 2012; originally announced April 2012.

    Comments: Presented at Collective Intelligence conference, 2012 (arXiv:1204.2991)

    Report number: CollectiveIntelligence/2012/51