Skip to main content

Showing 1–21 of 21 results for author: Olmos, P M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2506.04700  [pdf, ps, other

    cs.LG cs.AI math.PR stat.ML

    Explicit Density Approximation for Neural Implicit Samplers Using a Bernstein-Based Convex Divergence

    Authors: José Manuel de Frutos, Manuel A. Vázquez, Pablo M. Olmos, Joaquín Míguez

    Abstract: Rank-based statistical metrics, such as the invariant statistical loss (ISL), have recently emerged as robust and practically effective tools for training implicit generative models. In this work, we introduce dual-ISL, a novel likelihood-free objective for training implicit generative models that interchanges the roles of the target and model distributions in the ISL framework, yielding a convex… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2502.08253  [pdf, other

    stat.ML cs.LG

    Multi-View Oriented GPLVM: Expressiveness and Efficiency

    Authors: Zi Yang, Ying Li, Zhidi Lin, Michael Minyi Zhang, Pablo M. Olmos

    Abstract: The multi-view Gaussian process latent variable model (MV-GPLVM) aims to learn a unified representation from multi-view data but is hindered by challenges such as limited kernel expressiveness and low computational efficiency. To overcome these issues, we first introduce a new duality between the spectral density and the kernel function. By modeling the spectral density with a bivariate Gaussian m… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 8 pages

  3. arXiv:2501.11730  [pdf, other

    cs.LG cs.AI stat.ML

    Transformer Vibration Forecasting for Advancing Rail Safety and Maintenance 4.0

    Authors: Darío C. Larese, Almudena Bravo Cerrada, Gabriel Dambrosio Tomei, Alejandro Guerrero-López, Pablo M. Olmos, María Jesús Gómez García

    Abstract: Maintaining railway axles is critical to preventing severe accidents and financial losses. The railway industry is increasingly interested in advanced condition monitoring techniques to enhance safety and efficiency, moving beyond traditional periodic inspections toward Maintenance 4.0. This study introduces a robust Deep Autoregressive solution that integrates seamlessly with existing systems t… ▽ More

    Submitted 20 January, 2025; originally announced January 2025.

  4. arXiv:2405.01196  [pdf, other

    cs.LG stat.ML

    Decoupling Feature Extraction and Classification Layers for Calibrated Neural Networks

    Authors: Mikkel Jordahn, Pablo M. Olmos

    Abstract: Deep Neural Networks (DNN) have shown great promise in many classification applications, yet are widely known to have poorly calibrated predictions when they are over-parametrized. Improving DNN calibration without comprising on model accuracy is of extreme importance and interest in safety critical applications such as in the health-care sector. In this work, we show that decoupling the training… ▽ More

    Submitted 6 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: Proceedings of the 41 st International Conference on Machine Learning (ICML) 2024

  5. arXiv:2402.16435  [pdf, other

    cs.LG cs.AI math.ST stat.ML

    Training Implicit Generative Models via an Invariant Statistical Loss

    Authors: José Manuel de Frutos, Pablo M. Olmos, Manuel A. Vázquez, Joaquín Míguez

    Abstract: Implicit generative models have the capability to learn arbitrary complex data distributions. On the downside, training requires telling apart real data from artificially-generated ones using adversarial discriminators, leading to unstable training and mode-dropping issues. As reported by Zahee et al. (2017), even in the one-dimensional (1D) case, training a generative adversarial network (GAN) is… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  6. arXiv:2401.11618  [pdf, other

    cs.LG cs.AI cs.CR stat.ML

    Efficient local linearity regularization to overcome catastrophic overfitting

    Authors: Elias Abad Rocamora, Fanghui Liu, Grigorios G. Chrysos, Pablo M. Olmos, Volkan Cevher

    Abstract: Catastrophic overfitting (CO) in single-step adversarial training (AT) results in abrupt drops in the adversarial test accuracy (even down to 0%). For models trained with multi-step AT, it has been observed that the loss function behaves locally linearly with respect to the input, this is however lost in single-step AT. To address CO in single-step AT, several methods have been proposed to enforce… ▽ More

    Submitted 28 February, 2024; v1 submitted 21 January, 2024; originally announced January 2024.

    Comments: Accepted in ICLR 2024

  7. arXiv:2310.11940  [pdf, other

    stat.ML cs.LG

    Interpretable Spectral Variational AutoEncoder (ISVAE) for time series clustering

    Authors: Óscar Jiménez Rama, Fernando Moreno-Pino, David Ramírez, Pablo M. Olmos

    Abstract: The best encoding is the one that is interpretable in nature. In this work, we introduce a novel model that incorporates an interpretable bottleneck-termed the Filter Bank (FB)-at the outset of a Variational Autoencoder (VAE). This arrangement compels the VAE to attend on the most informative segments of the input signal, fostering the learning of a novel encoding ${f_0}$ which boasts enhanced int… ▽ More

    Submitted 18 October, 2023; originally announced October 2023.

  8. arXiv:2201.06968  [pdf, ps, other

    cs.MS cs.LG stat.ML

    PyHHMM: A Python Library for Heterogeneous Hidden Markov Models

    Authors: Fernando Moreno-Pino, Emese Sükei, Pablo M. Olmos, Antonio Artés-Rodríguez

    Abstract: We introduce PyHHMM, an object-oriented open-source Python implementation of Heterogeneous-Hidden Markov Models (HHMMs). In addition to HMM's basic core functionalities, such as different initialization algorithms and classical observations models, i.e., continuous and multinoulli, PyHHMM distinctively emphasizes features not supported in similar available frameworks: a heterogeneous observation m… ▽ More

    Submitted 12 January, 2022; originally announced January 2022.

  9. arXiv:2201.05040  [pdf, other

    stat.ML cs.LG

    Multi-task longitudinal forecasting with missing values on Alzheimer's Disease

    Authors: Carlos Sevilla-Salcedo, Vandad Imani, Pablo M. Olmos, Vanessa Gómez-Verdejo, Jussi Tohka

    Abstract: Machine learning techniques typically applied to dementia forecasting lack in their capabilities to jointly learn several tasks, handle time dependent heterogeneous data and missing values. In this paper, we propose a framework using the recently presented SSHIBA model for jointly learning different tasks on longitudinal data with missing values. The method uses Bayesian variational inference to i… ▽ More

    Submitted 13 January, 2022; originally announced January 2022.

  10. arXiv:2107.05984  [pdf, ps, other

    stat.ML cs.LG

    Deep Autoregressive Models with Spectral Attention

    Authors: Fernando Moreno-Pino, Pablo M. Olmos, Antonio Artés-Rodríguez

    Abstract: Time series forecasting is an important problem across many domains, playing a crucial role in multiple real-world applications. In this paper, we propose a forecasting architecture that combines deep autoregressive models with a Spectral Attention (SA) module, which merges global and local frequency domain information in the model's embedded space. By characterizing in the spectral domain the emb… ▽ More

    Submitted 26 December, 2021; v1 submitted 13 July, 2021; originally announced July 2021.

    Comments: Errors in Eq. 2 and Eq. 3 corrected

  11. Medical data wrangling with sequential variational autoencoders

    Authors: Daniel Barrejón, Pablo M. Olmos, Antonio Artés-Rodríguez

    Abstract: Medical data sets are usually corrupted by noise and missing data. These missing patterns are commonly assumed to be completely random, but in medical scenarios, the reality is that these patterns occur in bursts due to sensors that are off for some time or data collected in a misaligned uneven fashion, among other causes. This paper proposes to model medical data records with heterogeneous data t… ▽ More

    Submitted 8 November, 2021; v1 submitted 12 March, 2021; originally announced March 2021.

    Comments: Accepted for publication in IEEE Journal of Biomedical and Health Informatics (JBHI)

  12. arXiv:2012.08234  [pdf, other

    cs.LG stat.ML

    Unsupervised Learning of Global Factors in Deep Generative Models

    Authors: Ignacio Peis, Pablo M. Olmos, Antonio Artés-Rodríguez

    Abstract: We present a novel deep generative model based on non i.i.d. variational autoencoders that captures global dependencies among observations in a fully unsupervised fashion. In contrast to the recent semi-supervised alternatives for global modeling in deep generative models, our approach combines a mixture model in the local or data-dependent space and a global Gaussian latent variable, which lead u… ▽ More

    Submitted 16 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  13. arXiv:2006.02734  [pdf, ps, other

    cs.LG stat.ML

    Robust Sampling in Deep Learning

    Authors: Aurora Cobo Aguilera, Antonio Artés-Rodríguez, Fernando Pérez-Cruz, Pablo Martínez Olmos

    Abstract: Deep learning requires regularization mechanisms to reduce overfitting and improve generalization. We address this problem by a new regularization method based on distributional robust optimization. The key idea is to modify the contribution from each sample for tightening the empirical risk bound. During the stochastic training, the selection of samples is done according to their accuracy in such… ▽ More

    Submitted 5 June, 2020; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: 8 pages, 3 figures

  14. arXiv:2006.00968  [pdf, other

    stat.ML cs.LG

    Bayesian Sparse Factor Analysis with Kernelized Observations

    Authors: Carlos Sevilla-Salcedo, Alejandro Guerrero-López, Pablo M. Olmos, Vanessa Gómez-Verdejo

    Abstract: Multi-view problems can be faced with latent variable models since they are able to find low-dimensional projections that fairly capture the correlations among the multiple views that characterise each datum. On the other hand, high-dimensionality and non-linear issues are traditionally handled by kernel methods, inducing a (non)-linear function between the latent projection and the data itself. H… ▽ More

    Submitted 27 January, 2021; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: Article submitted to Artificial Intelligence Journal

  15. arXiv:2001.08975  [pdf, other

    stat.ML cs.LG

    Sparse Semi-supervised Heterogeneous Interbattery Bayesian Analysis

    Authors: Carlos Sevilla-Salcedo, Vanessa Gómez-Verdejo, Pablo M. Olmos

    Abstract: The Bayesian approach to feature extraction, known as factor analysis (FA), has been widely studied in machine learning to obtain a latent representation of the data. An adequate selection of the probabilities and priors of these bayesian models allows the model to better adapt to the data nature (i.e. heterogeneity, sparsity), obtaining a more representative latent space. The objective of this… ▽ More

    Submitted 24 January, 2020; originally announced January 2020.

  16. Deep Sequential Models for Suicidal Ideation from Multiple Source Data

    Authors: Ignacio Peis, Pablo M. Olmos, Constanza Vera-Varela, María Luisa Barrigón, Philippe Courtet, Enrique Baca-García, Antonio Artés-Rodríguez

    Abstract: This article presents a novel method for predicting suicidal ideation from Electronic Health Records (EHR) and Ecological Momentary Assessment (EMA) data using deep sequential models. Both EHR longitudinal data and EMA question forms are defined by asynchronous, variable length, randomly-sampled data sequences. In our method, we model each of them with a Recurrent Neural Network (RNN), and both se… ▽ More

    Submitted 6 November, 2019; originally announced November 2019.

    Comments: Accepted for publication in IEEE Journal of Biomedical and Health Informatics (JBHI)

    Journal ref: Journal of Biomedical and Health Informatics, vol.23, no. 6, 2019

  17. arXiv:1911.01425  [pdf, other

    stat.ML cs.CV cs.LG

    Improved BiGAN training with marginal likelihood equalization

    Authors: Pablo Sánchez-Martín, Pablo M. Olmos, Fernando Perez-Cruz

    Abstract: We propose a novel training procedure for improving the performance of generative adversarial networks (GANs), especially to bidirectional GANs. First, we enforce that the empirical distribution of the inverse inference network matches the prior distribution, which favors the generator network reproducibility on the seen samples. Second, we have found that the marginal log-likelihood of the sample… ▽ More

    Submitted 23 May, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

  18. arXiv:1910.06569  [pdf, other

    cs.LG eess.SP stat.ML

    Probabilistic Time of Arrival Localization

    Authors: Fernando Perez-Cruz, Pablo M. Olmos, Michael Minyi Zhang, Howard Huang

    Abstract: In this paper, we take a new approach for time of arrival geo-localization. We show that the main sources of error in metropolitan areas are due to environmental imperfections that bias our solutions, and that we can rely on a probabilistic model to learn and compensate for them. The resulting localization error is validated using measurements from a live LTE cellular network to be less than 10 me… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

    Comments: IEEE Signal Processing Letters, 2019

  19. arXiv:1901.09557  [pdf, other

    cs.LG stat.ML

    Out-of-Sample Testing for GANs

    Authors: Pablo Sánchez-Martín, Pablo M. Olmos, Fernando Pérez-Cruz

    Abstract: We propose a new method to evaluate GANs, namely EvalGAN. EvalGAN relies on a test set to directly measure the reconstruction quality in the original sample space (no auxiliary networks are necessary), and it also computes the (log)likelihood for the reconstructed samples in the test set. Further, EvalGAN is agnostic to the GAN algorithm and the dataset. We decided to test it on three state-of-the… ▽ More

    Submitted 28 January, 2019; originally announced January 2019.

  20. arXiv:1807.03653  [pdf, other

    cs.LG cs.AI stat.ML

    Handling Incomplete Heterogeneous Data using VAEs

    Authors: Alfredo Nazabal, Pablo M. Olmos, Zoubin Ghahramani, Isabel Valera

    Abstract: Variational autoencoders (VAEs), as well as other generative models, have been shown to be efficient and accurate for capturing the latent structure of vast amounts of complex high-dimensional data. However, existing VAEs can still not directly handle data that are heterogenous (mixed continuous and discrete) or incomplete (with missing data at random), which is indeed common in real-world applica… ▽ More

    Submitted 22 May, 2020; v1 submitted 10 July, 2018; originally announced July 2018.

  21. Boosting Handwriting Text Recognition in Small Databases with Transfer Learning

    Authors: José Carlos Aradillas, Juan José Murillo-Fuentes, Pablo M. Olmos

    Abstract: In this paper we deal with the offline handwriting text recognition (HTR) problem with reduced training datasets. Recent HTR solutions based on artificial neural networks exhibit remarkable solutions in referenced databases. These deep learning neural networks are composed of both convolutional (CNN) and long short-term memory recurrent units (LSTM). In addition, connectionist temporal classificat… ▽ More

    Submitted 4 April, 2018; originally announced April 2018.

    Comments: ICFHR 2018 Conference