Skip to main content

Showing 1–23 of 23 results for author: Vasconcelos, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.06023  [pdf, ps, other

    cs.CV

    Restereo: Diffusion stereo video generation and restoration

    Authors: Xingchang Huang, Ashish Kumar Singh, Florian Dubost, Cristina Nader Vasconcelos, Sakar Khattar, Liang Shi, Christian Theobalt, Cengiz Oztireli, Gurprit Singh

    Abstract: Stereo video generation has been gaining increasing attention with recent advancements in video diffusion models. However, most existing methods focus on generating 3D stereoscopic videos from monocular 2D videos. These approaches typically assume that the input monocular video is of high quality, making the task primarily about inpainting occluded regions in the warped video while preserving diso… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: 12 pages, 5 figures

  2. arXiv:2502.08652  [pdf

    cs.CY cs.AI

    LegalScore: Development of a Benchmark for Evaluating AI Models in Legal Career Exams in Brazil

    Authors: Roberto Caparroz, Marcelo Roitman, Beatriz G. Chow, Caroline Giusti, Larissa Torhacs, Pedro A. Sola, João H. M. Diogo, Luiza Balby, Carolina D. L. Vasconcelos, Leonardo R. Caparroz, Albano P. Franco

    Abstract: This research introduces LegalScore, a specialized index for assessing how generative artificial intelligence models perform in a selected range of career exams that require a legal background in Brazil. The index evaluates fourteen different types of artificial intelligence models' performance, from proprietary to open-source models, in answering objective questions applied to these exams. The re… ▽ More

    Submitted 17 January, 2025; originally announced February 2025.

    Comments: Main article 25 pages, Appendices from page 26

  3. arXiv:2501.09833  [pdf, other

    cs.CV

    EraseBench: Understanding The Ripple Effects of Concept Erasure Techniques

    Authors: Ibtihel Amara, Ahmed Imtiaz Humayun, Ivana Kajic, Zarana Parekh, Natalie Harris, Sarah Young, Chirag Nagpal, Najoung Kim, Junfeng He, Cristina Nader Vasconcelos, Deepak Ramachandran, Goolnoosh Farnadi, Katherine Heller, Mohammad Havaei, Negar Rostamzadeh

    Abstract: Concept erasure techniques have recently gained significant attention for their potential to remove unwanted concepts from text-to-image models. While these methods often demonstrate success in controlled scenarios, their robustness in real-world applications and readiness for deployment remain uncertain. In this work, we identify a critical gap in evaluating sanitized models, particularly in term… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Comments: 11 pages main; 9 pages supplemental material

  4. arXiv:2408.08307  [pdf, other

    cs.LG cs.CV

    What Secrets Do Your Manifolds Hold? Understanding the Local Geometry of Generative Models

    Authors: Ahmed Imtiaz Humayun, Ibtihel Amara, Cristina Vasconcelos, Deepak Ramachandran, Candice Schumann, Junfeng He, Katherine Heller, Golnoosh Farnadi, Negar Rostamzadeh, Mohammad Havaei

    Abstract: Deep Generative Models are frequently used to learn continuous representations of complex data distributions using a finite number of samples. For any generative model, including pre-trained foundation models with Diffusion or Transformer architectures, generation performance can significantly vary across the learned data manifold. In this paper we study the local geometry of the learned manifold… ▽ More

    Submitted 6 February, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted for publication at ICLR 2025

  5. arXiv:2408.07009  [pdf, other

    cs.CV

    Imagen 3

    Authors: Imagen-Team-Google, :, Jason Baldridge, Jakob Bauer, Mukul Bhutani, Nicole Brichtova, Andrew Bunner, Lluis Castrejon, Kelvin Chan, Yichang Chen, Sander Dieleman, Yuqing Du, Zach Eaton-Rosen, Hongliang Fei, Nando de Freitas, Yilin Gao, Evgeny Gladchenko, Sergio Gómez Colmenarejo, Mandy Guo, Alex Haig, Will Hawkins, Hexiang Hu, Huilian Huang, Tobenna Peter Igwe, Christos Kaplanis , et al. (237 additional authors not shown)

    Abstract: We introduce Imagen 3, a latent diffusion model that generates high quality images from text prompts. We describe our quality and responsibility evaluations. Imagen 3 is preferred over other state-of-the-art (SOTA) models at the time of evaluation. In addition, we discuss issues around safety and representation, as well as methods we used to minimize the potential harm of our models.

    Submitted 21 December, 2024; v1 submitted 13 August, 2024; originally announced August 2024.

  6. arXiv:2406.18554  [pdf, other

    cs.CV cs.LG

    Planted: a dataset for planted forest identification from multi-satellite time series

    Authors: Luis Miguel Pazos-Outón, Cristina Nader Vasconcelos, Anton Raichuk, Anurag Arnab, Dan Morris, Maxim Neumann

    Abstract: Protecting and restoring forest ecosystems is critical for biodiversity conservation and carbon sequestration. Forest monitoring on a global scale is essential for prioritizing and assessing conservation efforts. Satellite-based remote sensing is the only viable solution for providing global coverage, but to date, large-scale forest monitoring is limited to single modalities and single time points… ▽ More

    Submitted 24 May, 2024; originally announced June 2024.

  7. arXiv:2405.16759  [pdf, other

    cs.CV cs.LG

    Greedy Growing Enables High-Resolution Pixel-Based Diffusion Models

    Authors: Cristina N. Vasconcelos, Abdullah Rashwan, Austin Waters, Trevor Walker, Keyang Xu, Jimmy Yan, Rui Qian, Shixin Luo, Zarana Parekh, Andrew Bunner, Hongliang Fei, Roopal Garg, Mandy Guo, Ivana Kajic, Yeqing Li, Henna Nandwani, Jordi Pont-Tuset, Yasumasa Onoe, Sarah Rosston, Su Wang, Wenlei Zhou, Kevin Swersky, David J. Fleet, Jason M. Baldridge, Oliver Wang

    Abstract: We address the long-standing problem of how to learn effective pixel-based image diffusion models at scale, introducing a remarkably simple greedy growing method for stable training of large-scale, high-resolution models. without the needs for cascaded super-resolution components. The key insight stems from careful pre-training of core components, namely, those responsible for text-to-image alignm… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  8. arXiv:2402.04930  [pdf, other

    cs.CV cs.GR cs.LG

    Blue noise for diffusion models

    Authors: Xingchang Huang, Corentin Salaün, Cristina Vasconcelos, Christian Theobalt, Cengiz Öztireli, Gurprit Singh

    Abstract: Most of the existing diffusion models use Gaussian noise for training and sampling across all time steps, which may not optimally account for the frequency contents reconstructed by the denoising network. Despite the diverse applications of correlated noise in computer graphics, its potential for improving the training process has been underexplored. In this paper, we introduce a novel and general… ▽ More

    Submitted 2 May, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: SIGGRAPH 2024 Conference Proceedings; Project page: https://xchhuang.github.io/bndm

  9. arXiv:2302.05442  [pdf, other

    cs.CV cs.AI cs.LG

    Scaling Vision Transformers to 22 Billion Parameters

    Authors: Mostafa Dehghani, Josip Djolonga, Basil Mustafa, Piotr Padlewski, Jonathan Heek, Justin Gilmer, Andreas Steiner, Mathilde Caron, Robert Geirhos, Ibrahim Alabdulmohsin, Rodolphe Jenatton, Lucas Beyer, Michael Tschannen, Anurag Arnab, Xiao Wang, Carlos Riquelme, Matthias Minderer, Joan Puigcerver, Utku Evci, Manoj Kumar, Sjoerd van Steenkiste, Gamaleldin F. Elsayed, Aravindh Mahendran, Fisher Yu, Avital Oliver , et al. (17 additional authors not shown)

    Abstract: The scaling of Transformers has driven breakthrough capabilities for language models. At present, the largest large language models (LLMs) contain upwards of 100B parameters. Vision Transformers (ViT) have introduced the same architecture to image and video modelling, but these have not yet been successfully scaled to nearly the same degree; the largest dense ViT contains 4B parameters (Chen et al… ▽ More

    Submitted 10 February, 2023; originally announced February 2023.

  10. arXiv:2210.06965  [pdf, other

    cs.LG cs.CV

    CUF: Continuous Upsampling Filters

    Authors: Cristina Vasconcelos, Cengiz Oztireli, Mark Matthews, Milad Hashemi, Kevin Swersky, Andrea Tagliasacchi

    Abstract: Neural fields have rapidly been adopted for representing 3D signals, but their application to more classical 2D image-processing has been relatively limited. In this paper, we consider one of the most important operations in image processing: upsampling. In deep learning, learnable upsampling layers have extensively been used for single image super-resolution. We propose to parameterize upsampling… ▽ More

    Submitted 20 October, 2022; v1 submitted 13 October, 2022; originally announced October 2022.

  11. arXiv:2209.13792  [pdf, other

    cs.CV

    A Machine Learning Approach for DeepFake Detection

    Authors: Gustavo Cunha Lacerda, Raimundo Claudio da Silva Vasconcelos

    Abstract: With the spread of DeepFake techniques, this technology has become quite accessible and good enough that there is concern about its malicious use. Faced with this problem, detecting forged faces is of utmost importance to ensure security and avoid socio-political problems, both on a global and private scale. This paper presents a solution for the detection of DeepFakes using convolution neural net… ▽ More

    Submitted 27 September, 2022; originally announced September 2022.

    Comments: 4 pages, accepted for presentation at the SIBGRAPI 2022

    ACM Class: I.4.7; I.5.0

  12. arXiv:2204.00484  [pdf, other

    cs.CV cs.LG

    Proper Reuse of Image Classification Features Improves Object Detection

    Authors: Cristina Vasconcelos, Vighnesh Birodkar, Vincent Dumoulin

    Abstract: A common practice in transfer learning is to initialize the downstream model weights by pre-training on a data-abundant upstream task. In object detection specifically, the feature backbone is typically initialized with Imagenet classifier weights and fine-tuned on the object detection task. Recent works show this is not strictly necessary under longer training regimes and provide recipes for trai… ▽ More

    Submitted 27 June, 2022; v1 submitted 1 April, 2022; originally announced April 2022.

    Journal ref: CVPR 2022

  13. arXiv:2110.12108  [pdf, other

    cs.LG cs.AI cs.CV

    ConformalLayers: A non-linear sequential neural network with associative layers

    Authors: Eduardo Vera Sousa, Leandro A. F. Fernandes, Cristina Nader Vasconcelos

    Abstract: Convolutional Neural Networks (CNNs) have been widely applied. But as the CNNs grow, the number of arithmetic operations and memory footprint also increase. Furthermore, typical non-linear activation functions do not allow associativity of the operations encoded by consecutive layers, preventing the simplification of intermediate steps by combining them. We present a new activation function that a… ▽ More

    Submitted 9 November, 2021; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Best Paper on Pattern Recognition and Related Field at SIBGRAPI 2021 -- 34th Conference on Graphics, Patterns and Images

  14. arXiv:2108.07903  [pdf, other

    cs.CV cs.GR

    Spatially and color consistent environment lighting estimation using deep neural networks for mixed reality

    Authors: Bruno Augusto Dorta Marques, Esteban Walter Gonzalez Clua, Anselmo Antunes Montenegro, Cristina Nader Vasconcelos

    Abstract: The representation of consistent mixed reality (XR) environments requires adequate real and virtual illumination composition in real-time. Estimating the lighting of a real scenario is still a challenge. Due to the ill-posed nature of the problem, classical inverse-rendering techniques tackle the problem for simple lighting setups. However, those assumptions do not satisfy the current state-of-art… ▽ More

    Submitted 17 August, 2021; originally announced August 2021.

  15. arXiv:2108.03489  [pdf, other

    cs.CV cs.LG

    Impact of Aliasing on Generalization in Deep Convolutional Networks

    Authors: Cristina Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Rob Romijnders, Nicolas Le Roux, Ross Goroshin

    Abstract: We investigate the impact of aliasing on generalization in Deep Convolutional Networks and show that data augmentation schemes alone are unable to prevent it due to structural limitations in widely used architectures. Drawing insights from frequency analysis theory, we take a closer look at ResNet and EfficientNet architectures and review the trade-off between aliasing and information loss in each… ▽ More

    Submitted 7 August, 2021; originally announced August 2021.

    Comments: Accepted to ICCV 2021. arXiv admin note: text overlap with arXiv:2011.10675

  16. arXiv:2102.08868  [pdf, other

    cs.LG cs.CV stat.ML

    Bridging the Gap Between Adversarial Robustness and Optimization Bias

    Authors: Fartash Faghri, Sven Gowal, Cristina Vasconcelos, David J. Fleet, Fabian Pedregosa, Nicolas Le Roux

    Abstract: We demonstrate that the choice of optimizer, neural network architecture, and regularizer significantly affect the adversarial robustness of linear neural networks, providing guarantees without the need for adversarial training. To this end, we revisit a known result linking maximally robust classifiers and minimum norm solutions, and combine it with recent results on the implicit bias of optimize… ▽ More

    Submitted 7 June, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Comments: New CIFAR-10 experiments and Fourier attack variations

  17. arXiv:2011.10675  [pdf, other

    cs.CV

    An Effective Anti-Aliasing Approach for Residual Networks

    Authors: Cristina Vasconcelos, Hugo Larochelle, Vincent Dumoulin, Nicolas Le Roux, Ross Goroshin

    Abstract: Image pre-processing in the frequency domain has traditionally played a vital role in computer vision and was even part of the standard pipeline in the early days of deep learning. However, with the advent of large datasets, many practitioners concluded that this was unnecessary due to the belief that these priors can be learned from the data itself. Frequency aliasing is a phenomenon that may occ… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  18. arXiv:1908.10945  [pdf, other

    cs.CV

    A Multiple Source Hourglass Deep Network for Multi-Focus Image Fusion

    Authors: Fidel Alejandro Guerrero Peña, Pedro Diamel Marrero Fernández, Tsang Ing Ren, Germano Crispim Vasconcelos, Alexandre Cunha

    Abstract: Multi-Focus Image Fusion seeks to improve the quality of an acquired burst of images with different focus planes. For solving the task, an activity level measurement and a fusion rule are typically established to select and fuse the most relevant information from the sources. However, the design of this kind of method by hand is really hard and sometimes restricted to solution spaces where the opt… ▽ More

    Submitted 28 August, 2019; originally announced August 2019.

  19. Data Augmentation for Skin Lesion Analysis

    Authors: Fábio Perez, Cristina Vasconcelos, Sandra Avila, Eduardo Valle

    Abstract: Deep learning models show remarkable results in automated skin lesion analysis. However, these models demand considerable amounts of data, while the availability of annotated skin lesion images is often limited. Data augmentation can expand the training dataset by transforming input images. In this work, we investigate the impact of 13 data augmentation scenarios for melanoma classification traine… ▽ More

    Submitted 5 September, 2018; originally announced September 2018.

    Comments: 8 pages, 3 figures, to be presented on ISIC Skin Image Analysis Workshop

  20. arXiv:1702.07025  [pdf, other

    cs.CV

    Convolutional Neural Network Committees for Melanoma Classification with Classical And Expert Knowledge Based Image Transforms Data Augmentation

    Authors: Cristina Nader Vasconcelos, Bárbara Nader Vasconcelos

    Abstract: Skin cancer is a major public health problem, as is the most common type of cancer and represents more than half of cancer diagnoses worldwide. Early detection influences the outcome of the disease and motivates our work. We investigate the composition of CNN committees and data augmentation for the the ISBI 2017 Melanoma Classification Challenge (named Skin Lesion Analysis towards Melanoma Detect… ▽ More

    Submitted 15 March, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

  21. arXiv:1611.06292  [pdf, other

    cs.HC

    Minimizing cyber sickness in head mounted display systems: design guidelines and applications

    Authors: Thiago M. Porcino, Esteban W. Clua, Cristina N. Vasconcelos, Daniela Trevisan, Luis Valente

    Abstract: We are experiencing an upcoming trend of using head mounted display systems in games and serious games, which is likely to become an established practice in the near future. While these systems provide highly immersive experiences, many users have been reporting discomfort symptoms, such as nausea, sickness, and headaches, among others. When using VR for health applications, this is more critical,… ▽ More

    Submitted 18 November, 2016; originally announced November 2016.

    Comments: 11 pages, 3 figures, 3 tables

  22. arXiv:1604.06245  [pdf, other

    cs.PL

    A Revision of the Mool Language

    Authors: Cláudio Vasconcelos, António Ravara

    Abstract: We present here in a thorough analysis of the Mool language, covering not only its implementation but also the formalisation (syntax, operational semantics, and type system). The objective is to detect glitches in both the implementation and in the formal definitions, proposing as well new features and added expressiveness. To test our proposals we implemented the revision developed in the Racket… ▽ More

    Submitted 22 September, 2016; v1 submitted 21 April, 2016; originally announced April 2016.

    Comments: 34 pages, 15 figures, 11 listings

  23. arXiv:1603.08949  [pdf, other

    cs.PL

    The While language

    Authors: Cláudio Vasconcelos, António Ravara

    Abstract: This article presents a formalisation of a simple imperative programming language. The objective is to study and develop "hands-on" a formal specifcation of a programming language, namely its syntax, operational semantics and type system. To have an executable version of the language, we implemented in Racket its operational semantics and type system.

    Submitted 12 April, 2016; v1 submitted 29 March, 2016; originally announced March 2016.

    Comments: 15 pages, 21 figures