Showing 1–3 of 3 results for author: Latortue, D

  1. arXiv:2404.18849

    cs.CV

    MiPa: Mixed Patch Infrared-Visible Modality Agnostic Object Detection

    Authors: Heitor R. Medeiros, David Latortue, Eric Granger, Marco Pedersoli

    Abstract: In real-world scenarios, using multiple modalities like visible (RGB) and infrared (IR) can greatly improve the performance of a predictive task such as object detection (OD). Multimodal learning is a common way to leverage these modalities, where multiple modality-specific encoders and a fusion module are used to improve performance. In this paper, we tackle a different way to employ RGB and IR m…

    Submitted 2 August, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2404.01492

    cs.CV cs.AI

    Modality Translation for Object Detection Adaptation Without Forgetting Prior Knowledge

    Authors: Heitor Rapela Medeiros, Masih Aminbeidokhti, Fidel Guerrero Pena, David Latortue, Eric Granger, Marco Pedersoli

    Abstract: A common practice in deep learning involves training large neural networks on massive datasets to achieve high accuracy across various domains and tasks. While this approach works well in many application areas, it often fails drastically when processing data from a new modality with a significant distribution shift from the data used to pre-train the model. This paper focuses on adapting a large…

    Submitted 31 July, 2024; v1 submitted 1 April, 2024; originally announced April 2024.

    Comments: ECCV 2024: European Conference on Computer Vision, Milan Italy

  3. arXiv:2311.11974

    cs.CV cs.AI cs.LG

    Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting

    Authors: David Latortue, Moetez Kdayem, Fidel A Guerrero Peña, Eric Granger, Marco Pedersoli

    Abstract: Object detection models are commonly used for people counting (and localization) in many applications but require a dataset with costly bounding box annotations for training. Given the importance of privacy in people counting, these models rely more and more on infrared images, making the task even harder. In this paper, we explore how weaker levels of supervision can affect the performance of dee…

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024