Skip to main content

Showing 1–10 of 10 results for author: Fosco, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.04733  [pdf, other

    cs.HC

    Artifact magnification on deepfake videos increases human detection and subjective confidence

    Authors: Emilie Josephs, Camilo Fosco, Aude Oliva

    Abstract: The development of technologies for easily and automatically falsifying video has raised practical questions about people's ability to detect false information online. How vulnerable are people to deepfake videos? What technologies can be applied to boost their performance? Human susceptibility to deepfake videos is typically measured in laboratory settings, which do not reflect the challenges of… ▽ More

    Submitted 10 April, 2023; originally announced April 2023.

    Comments: 8 pages, 4 figures

  2. arXiv:2212.06516  [pdf, other

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2022 Predicting Video Memorability Task

    Authors: Lorin Sweeney, Mihai Gabriel Constantin, Claire-Hélène Demarty, Camilo Fosco, Alba G. Seco de Herrera, Sebastian Halder, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Mushfika Sultana

    Abstract: This paper describes the 5th edition of the Predicting Video Memorability Task as part of MediaEval2022. This year we have reorganised and simplified the task in order to lubricate a greater depth of inquiry. Similar to last year, two datasets are provided in order to facilitate generalisation, however, this year we have replaced the TRECVid2019 Video-to-Text dataset with the VideoMem dataset in o… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 6 pages. In: MediaEval Multimedia Benchmark Workshop Working Notes, 2022

  3. arXiv:2212.03955  [pdf, other

    cs.CV cs.AI

    Experiences from the MediaEval Predicting Media Memorability Task

    Authors: Alba García Deco de Herrera, Mihai Gabriel Constantin, Chaire-Hélène Demarty, Camilo Fosco, Sebastian Halder, Graham Healy, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Mushfika Sultana, Lorin Sweeney

    Abstract: The Predicting Media Memorability task in the MediaEval evaluation campaign has been running annually since 2018 and several different tasks and data sets have been used in this time. This has allowed us to compare the performance of many memorability prediction techniques on the same data and in a reproducible way and to refine and improve on those techniques. The resources created to compute med… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

    Comments: 7 pages, 2 figures, 1 table. Presented at the NeurIPS 2022 Workshop on Memory in Artificial and Real Intelligence (MemARI), 2 December 2022, New Orleans, USA

  4. arXiv:2206.00535  [pdf, other

    cs.CV cs.HC cs.SI

    Deepfake Caricatures: Amplifying attention to artifacts increases deepfake detection by humans and machines

    Authors: Camilo Fosco, Emilie Josephs, Alex Andonian, Allen Lee, Xi Wang, Aude Oliva

    Abstract: Deepfakes pose a serious threat to digital well-being by fueling misinformation. As deepfakes get harder to recognize with the naked eye, human users become increasingly reliant on deepfake detection models to decide if a video is real or fake. Currently, models yield a prediction for a video's authenticity, but do not integrate a method for alerting a human user. We introduce a framework for ampl… ▽ More

    Submitted 10 April, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Comments: 9 pages, 5 figures, 4 tables

  5. arXiv:2112.05982  [pdf, ps, other

    cs.CV cs.AI cs.MM

    Overview of The MediaEval 2021 Predicting Media Memorability Task

    Authors: Rukiye Savran Kiziltepe, Mihai Gabriel Constantin, Claire-Helene Demarty, Graham Healy, Camilo Fosco, Alba Garcia Seco de Herrera, Sebastian Halder, Bogdan Ionescu, Ana Matran-Fernandez, Alan F. Smeaton, Lorin Sweeney

    Abstract: This paper describes the MediaEval 2021 Predicting Media Memorability}task, which is in its 4th edition this year, as the prediction of short-term and long-term video memorability remains a challenging task. In 2021, two datasets of videos are used: first, a subset of the TRECVid 2019 Video-to-Text dataset; second, the Memento10K dataset in order to provide opportunities to explore cross-dataset g… ▽ More

    Submitted 11 December, 2021; originally announced December 2021.

    Comments: 3 pages, to appear in Proceedings of MediaEval 2021, December 13-15 2021, Online

  6. arXiv:2102.07887  [pdf, other

    cs.CV

    VA-RED$^2$: Video Adaptive Redundancy Reduction

    Authors: Bowen Pan, Rameswar Panda, Camilo Fosco, Chung-Ching Lin, Alex Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris

    Abstract: Performing inference on deep learning models for videos remains a challenge due to the large amount of computational resources required to achieve robust recognition. An inherent property of real-world videos is the high correlation of information across frames which can translate into redundancy in either temporal or spatial feature maps of the models, or both. The type of redundant features depe… ▽ More

    Submitted 4 October, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: Accepted in ICLR 2021

  7. arXiv:2009.02568  [pdf, other

    cs.CV

    Multimodal Memorability: Modeling Effects of Semantics and Decay on Video Memorability

    Authors: Anelise Newman, Camilo Fosco, Vincent Casser, Allen Lee, Barry McNamara, Aude Oliva

    Abstract: A key capability of an intelligent system is deciding when events from past experience must be remembered and when they can be forgotten. Towards this goal, we develop a predictive model of human visual event memory and how those memories decay over time. We introduce Memento10k, a new, dynamic video memorability dataset containing human annotations at different viewing delays. Based on our findin… ▽ More

    Submitted 5 September, 2020; originally announced September 2020.

    Comments: European Conference on Computer Vision

  8. arXiv:2008.05596  [pdf, other

    cs.CV

    We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos

    Authors: Alex Andonian, Camilo Fosco, Mathew Monfort, Allen Lee, Rogerio Feris, Carl Vondrick, Aude Oliva

    Abstract: Identifying common patterns among events is a key ability in human and machine perception, as it underlies intelligent decision making. We propose an approach for learning semantic relational set abstractions on videos, inspired by human learning. We combine visual features with natural language supervision to generate high-level representations of similarities across a set of videos. This allows… ▽ More

    Submitted 12 August, 2020; originally announced August 2020.

    Comments: European Conference on Computer Vision (ECCV) 2020, accepted

  9. arXiv:2008.02912  [pdf, other

    cs.CV cs.GR cs.HC eess.IV

    Predicting Visual Importance Across Graphic Design Types

    Authors: Camilo Fosco, Vincent Casser, Amish Kumar Bedi, Peter O'Donovan, Aaron Hertzmann, Zoya Bylinskii

    Abstract: This paper introduces a Unified Model of Saliency and Importance (UMSI), which learns to predict visual importance in input graphic designs, and saliency in natural images, along with a new dataset and applications. Previous methods for predicting saliency or visual importance are trained individually on specialized datasets, making them limited in application and leading to poor generalization on… ▽ More

    Submitted 6 August, 2020; originally announced August 2020.

    Journal ref: Proceedings of UIST 2020

  10. arXiv:2001.04461  [pdf, other

    cs.HC

    TurkEyes: A Web-Based Toolbox for Crowdsourcing Attention Data

    Authors: Anelise Newman, Barry McNamara, Camilo Fosco, Yun Bin Zhang, Pat Sukhum, Matthew Tancik, Nam Wook Kim, Zoya Bylinskii

    Abstract: Eye movements provide insight into what parts of an image a viewer finds most salient, interesting, or relevant to the task at hand. Unfortunately, eye tracking data, a commonly-used proxy for attention, is cumbersome to collect. Here we explore an alternative: a comprehensive web-based toolbox for crowdsourcing visual attention. We draw from four main classes of attention-capturing methodologies… ▽ More

    Submitted 13 January, 2020; originally announced January 2020.

    Comments: To appear in CHI 2020. Code available at http://turkeyes.mit.edu/