Skip to main content

Showing 1–15 of 15 results for author: Mesejo, P

Searching in archive cs. Search in all archives.
.
  1. Don't Forget your Inverse DDIM for Image Editing

    Authors: Guillermo Gomez-Trenado, Pablo Mesejo, Oscar Cordón, Stéphane Lathuilière

    Abstract: The field of text-to-image generation has undergone significant advancements with the introduction of diffusion models. Nevertheless, the challenge of editing real images persists, as most methods are either computationally intensive or produce poor reconstructions. This paper introduces SAGE (Self-Attention Guidance for image Editing) - a novel technique leveraging pre-trained diffusion models fo… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    Comments: 12 pages, 12 figures, code available at https://guillermogotre.github.io/sage/

    ACM Class: I.2.10; I.5.0

  2. arXiv:2501.08068  [pdf, ps, other

    cs.AI

    A Roadmap to Guide the Integration of LLMs in Hierarchical Planning

    Authors: Israel Puerta-Merino, Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares

    Abstract: Recent advances in Large Language Models (LLMs) are fostering their integration into several reasoning-related fields, including Automated Planning (AP). However, their integration into Hierarchical Planning (HP), a subfield of AP that leverages hierarchical knowledge to enhance planning performance, remains largely unexplored. In this preliminary work, we propose a roadmap to address this gap and… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 5 pages, 0 figures, to be published in the AAAI Workshop on Planning in the Era of LLMs ( https://llmforplanning.github.io )

  3. arXiv:2310.02167  [pdf, ps, other

    cs.AI

    Towards a Unified Framework for Sequential Decision Making

    Authors: Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares

    Abstract: In recent years, the integration of Automated Planning (AP) and Reinforcement Learning (RL) has seen a surge of interest. To perform this integration, a general framework for Sequential Decision Making (SDM) would prove immensely useful, as it would help us understand how AP and RL fit together. In this preliminary work, we attempt to provide such a framework, suitable for any method ranging from… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 10 pages, 0 figures

    MSC Class: I.2.8

    Journal ref: Carlos Núñez Molina, Pablo Mesejo, & Juan Fernández-Olivares. (2023). Towards a Unified Framework for Sequential Decision Making. In ICAPS PRL Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning

  4. On Using Admissible Bounds for Learning Forward Search Heuristics

    Authors: Carlos Núñez-Molina, Masataro Asai, Pablo Mesejo, Juan Fernández-Olivares

    Abstract: In recent years, there has been growing interest in utilizing modern machine learning techniques to learn heuristic functions for forward search algorithms. Despite this, there has been little theoretical understanding of what they should learn, how to train them, and why we do so. This lack of understanding has resulted in the adoption of diverse training targets (suboptimal vs optimal costs vs a… ▽ More

    Submitted 7 May, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 19 pages, 2 figures

    MSC Class: I.2.8

    Journal ref: Carlos Núñez Molina, Masataro Asai, Pablo Mesejo, & Juan Fernández-Olivares. (2024). On using admissible bounds for learning forward search heuristics. In IJCAI, pages 6761-6769

  5. A Review of Symbolic, Subsymbolic and Hybrid Methods for Sequential Decision Making

    Authors: Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares

    Abstract: In the field of Sequential Decision Making (SDM), two paradigms have historically vied for supremacy: Automated Planning (AP) and Reinforcement Learning (RL). In the spirit of reconciliation, this article reviews AP, RL and hybrid methods (e.g., novel learn to plan techniques) for solving Sequential Decision Processes (SDPs), focusing on their knowledge representation: symbolic, subsymbolic, or a… ▽ More

    Submitted 5 July, 2024; v1 submitted 20 April, 2023; originally announced April 2023.

    Comments: 35 pages, 16 figures

    ACM Class: A.2; I.2.4; I.2.6; I.2.8

    Journal ref: Carlos Núñez Molina, Pablo Mesejo, & Juan Fernández-Olivares. (2024). A review of symbolic, subsymbolic and hybrid methods for sequential decision making. ACM Computing Surveys, 56(11), Article 272, 1-36

  6. arXiv:2302.09899  [pdf, other

    cs.CV

    A Survey on Semi-Supervised Semantic Segmentation

    Authors: Adrian Peláez-Vegas, Pablo Mesejo, Julián Luengo

    Abstract: Semantic segmentation is one of the most challenging tasks in computer vision. However, in many applications, a frequent obstacle is the lack of labeled images, due to the high cost of pixel-level labeling. In this scenario, it makes sense to approach the problem from a semi-supervised point of view, where both labeled and unlabeled images are exploited. In recent years this line of research has g… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  7. NeSIG: A Neuro-Symbolic Method for Learning to Generate Planning Problems

    Authors: Carlos Núñez-Molina, Pablo Mesejo, Juan Fernández-Olivares

    Abstract: In the field of Automated Planning there is often the need for a set of planning problems from a particular domain, e.g., to be used as training data for Machine Learning or as benchmarks in planning competitions. In most cases, these problems are created either by hand or by a domain-specific generator, putting a burden on the human designers. In this paper we propose NeSIG, to the best of our kn… ▽ More

    Submitted 16 July, 2024; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: 15 pages, 9 figures

    ACM Class: I.2.6; I.2.8

    Journal ref: Carlos Núñez Molina, Pablo Mesejo, & Juan Fernández-Olivares. (2024). NeSIG: A neuro-symbolic method for learning to generate planning problems. In ECAI, volume 392, pages 4084-4091

  8. arXiv:2209.06399  [pdf, other

    cs.NE cs.AI cs.CV cs.LG

    A Survey on Evolutionary Computation for Computer Vision and Image Analysis: Past, Present, and Future Trends

    Authors: Ying Bi, Bing Xue, Pablo Mesejo, Stefano Cagnoni, Mengjie Zhang

    Abstract: Computer vision (CV) is a big and important field in artificial intelligence covering a wide range of applications. Image analysis is a major task in CV aiming to extract, analyse and understand the visual content of images. However, image-related tasks are very challenging due to many factors, e.g., high variations across images, high dimensionality, domain expertise requirement, and image distor… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: Conditionally accepted by IEEE Transactions on Evolutionary Computation

    Journal ref: IEEE Transactions on Evolutionary Computationm, 2022, https://ieeexplore.ieee.org/document/9943992/

  9. arXiv:2207.11025  [pdf, other

    cs.CV cs.LG

    Custom Structure Preservation in Face Aging

    Authors: Guillermo Gomez-Trenado, Stéphane Lathuilière, Pablo Mesejo, Óscar Cordón

    Abstract: In this work, we propose a novel architecture for face age editing that can produce structural modifications while maintaining relevant details present in the original image. We disentangle the style and content of the input image and propose a new decoder network that adopts a style-based strategy to combine the style and content representations of the input image while conditioning the output on… ▽ More

    Submitted 22 July, 2022; originally announced July 2022.

    Comments: 36 pages, 21 figures

  10. arXiv:2009.11204  [pdf, other

    cs.CV

    Learning Visual Voice Activity Detection with an Automatically Annotated Dataset

    Authors: Sylvain Guy, Stéphane Lathuilière, Pablo Mesejo, Radu Horaud

    Abstract: Visual voice activity detection (V-VAD) uses visual features to predict whether a person is speaking or not. V-VAD is useful whenever audio VAD (A-VAD) is inefficient either because the acoustic signal is difficult to analyze or because it is simply missing. We propose two deep architectures for V-VAD, one based on facial landmarks and one based on optical flow. Moreover, available datasets, used… ▽ More

    Submitted 16 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: International Conference on Pattern Recognition, Milan, Italy, January 2021

  11. arXiv:1902.10953  [pdf, other

    cs.CV

    Extended Gaze Following: Detecting Objects in Videos Beyond the Camera Field of View

    Authors: Benoit Massé, Stéphane Lathuilière, Pablo Mesejo, Radu Horaud

    Abstract: In this paper we address the problems of detecting objects of interest in a video and of estimating their locations, solely from the gaze directions of people present in the video. Objects can be indistinctly located inside or outside the camera field of view. We refer to this problem as extended gaze following. The contributions of the paper are the followings. First, we propose a novel spatial r… ▽ More

    Submitted 28 February, 2019; originally announced February 2019.

    Comments: FG 2019

  12. arXiv:1810.05193  [pdf, other

    stat.ML cs.LG

    Understanding Priors in Bayesian Neural Networks at the Unit Level

    Authors: Mariia Vladimirova, Jakob Verbeek, Pablo Mesejo, Julyan Arbel

    Abstract: We investigate deep Bayesian neural networks with Gaussian weight priors and a class of ReLU-like nonlinearities. Bayesian neural networks with Gaussian priors are well known to induce an L2, "weight decay", regularization. Our results characterize a more intricate regularization effect at the level of the unit activations. Our main result establishes that the induced prior distribution on the uni… ▽ More

    Submitted 10 May, 2019; v1 submitted 11 October, 2018; originally announced October 2018.

    Comments: 10 pages, 5 figures, ICML'19 conference

  13. arXiv:1808.09211  [pdf, other

    cs.CV

    DeepGUM: Learning Deep Robust Regression with a Gaussian-Uniform Mixture Model

    Authors: Stéphane Lathuilière, Pablo Mesejo, Xavier Alameda-Pineda, Radu Horaud

    Abstract: In this paper, we address the problem of how to robustly train a ConvNet for regression, or deep robust regression. Traditionally, deep regression employs the L2 loss function, known to be sensitive to outliers, i.e. samples that either lie at an abnormal distance away from the majority of the training samples, or that correspond to wrongly annotated targets. This means that, during back-propagati… ▽ More

    Submitted 28 August, 2018; originally announced August 2018.

    Comments: accepted at ECCV 2018

  14. A Comprehensive Analysis of Deep Regression

    Authors: Stéphane Lathuilière, Pablo Mesejo, Xavier Alameda-Pineda, Radu Horaud

    Abstract: Deep learning revolutionized data science, and recently its popularity has grown exponentially, as did the amount of papers employing deep networks. Vision tasks, such as human pose estimation, did not escape from this trend. There is a large number of deep models, where small changes in the network architecture, or in the data pre-processing, together with the stochastic nature of the optimizatio… ▽ More

    Submitted 24 September, 2020; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: Published in IEEE TPAMI

    Journal ref: IEEE TPAMI Volume: 42 , Issue: 9 , Sept. 1 2020

  15. Neural Network Based Reinforcement Learning for Audio-Visual Gaze Control in Human-Robot Interaction

    Authors: Stéphane Lathuilière, Benoit Massé, Pablo Mesejo, Radu Horaud

    Abstract: This paper introduces a novel neural network-based reinforcement learning approach for robot gaze control. Our approach enables a robot to learn and to adapt its gaze control strategy for human-robot interaction neither with the use of external sensors nor with human supervision. The robot learns to focus its attention onto groups of people from its own audio-visual experiences, independently of t… ▽ More

    Submitted 23 April, 2018; v1 submitted 18 November, 2017; originally announced November 2017.

    Comments: Paper submitted to Pattern Recognition Letters

    Journal ref: Pattern Recognition Letters, vol. 118, 2019, 61-71