Showing 1–50 of 97 results for author: Suykens, J

  1. arXiv:2506.08143  [pdf, ps, other]

    cs.LG

    Accelerating Spectral Clustering under Fairness Constraints

    Authors: Francesco Tonin, Alex Lambert, Johan A. K. Suykens, Volkan Cevher

    Abstract: Fairness of decision-making algorithms is an increasingly important issue. In this paper, we focus on spectral clustering with group fairness constraints, where every demographic group is represented in each cluster proportionally as in the general population. We present a new efficient method for fair spectral clustering (Fair SC) by casting the Fair SC problem within the difference of convex fun…

    Submitted 9 June, 2025; originally announced June 2025.

    Comments: ICML 2025

  2. Generative Kernel Spectral Clustering

    Authors: David Winant, Sonny Achten, Johan A. K. Suykens

    Abstract: Modern clustering approaches often trade interpretability for performance, particularly in deep learning-based methods. We present Generative Kernel Spectral Clustering (GenKSC), a novel model combining kernel spectral clustering with generative modeling to produce both well-defined clusters and interpretable representations. By augmenting weighted variance maximization with reconstruction and clu…

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: Accepted for publication at ESANN 2025

    Journal ref: ESANN 2025 proceedings, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning. Bruges (Belgium) and online event, 23-25 April 2025

  3. arXiv:2406.08748  [pdf, other]

    cs.LG cs.AI stat.ML

    Learning in Feature Spaces via Coupled Covariances: Asymmetric Kernel SVD and Nyström method

    Authors: Qinghua Tao, Francesco Tonin, Alex Lambert, Yingyi Chen, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: In contrast with Mercer kernel-based approaches as used e.g., in Kernel Principal Component Analysis (KPCA), it was previously shown that Singular Value Decomposition (SVD) inherently relates to asymmetric kernels and Asymmetric Kernel Singular Value Decomposition (KSVD) has been proposed. However, the existing formulation to KSVD cannot work with infinite-dimensional feature mappings, the variati…

    Submitted 8 March, 2025; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 19 pages, 9 tables, 6 figures

    Journal ref: the 41st International Conference on Machine Learning (ICML), 2024

  4. arXiv:2406.01435  [pdf, other]

    cs.LG stat.ML

    Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning

    Authors: Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: Ridgeless regression has garnered attention among researchers, particularly in light of the "Benign Overfitting" phenomenon, where models interpolating noisy samples demonstrate robust generalization. However, kernel ridgeless regression does not always perform well due to the lack of flexibility. This paper enhances kernel ridgeless regression with Locally-Adaptive-Bandwidths (LAB) RBF kernels,…

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.05236

  5. arXiv:2405.17050  [pdf, other]

    cs.LG

    HeNCler: Node Clustering in Heterophilous Graphs through Learned Asymmetric Similarity

    Authors: Sonny Achten, Francesco Tonin, Volkan Cevher, Johan A. K. Suykens

    Abstract: Clustering nodes in heterophilous graphs presents unique challenges due to the asymmetric relationships often overlooked by traditional methods, which moreover assume that good clustering corresponds to high intra-cluster and low inter-cluster connectivity. To address these issues, we introduce HeNCler - a novel approach for Heterophilous Node Clustering. Our method begins by defining a weighted k…

    Submitted 27 May, 2024; originally announced May 2024.

  6. arXiv:2405.14472  [pdf, other]

    eess.SP cs.LG

    SolNet: Open-source deep learning models for photovoltaic power forecasting across the globe

    Authors: Joris Depoortere, Johan Driesen, Johan Suykens, Hussain Syed Kazmi

    Abstract: Deep learning models have gained increasing prominence in recent years in the field of solar photovoltaic (PV) forecasting. One drawback of these models is that they require a lot of high-quality data to perform well. This is often infeasible in practice, due to poor measurement infrastructure in legacy systems and the rapid build-up of new solar systems across the world. This paper proposes SolN…

    Submitted 30 May, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: 24 pages, 5 figures

  7. arXiv:2404.16468  [pdf, other]

    cs.LG cs.AI eess.SY

    A Dual Perspective of Reinforcement Learning for Imposing Policy Constraints

    Authors: Bram De Cooman, Johan Suykens

    Abstract: Model-free reinforcement learning methods lack an inherent mechanism to impose behavioural constraints on the trained policies. Although certain extensions exist, they remain limited to specific types of constraints, such as value constraints with additional reward signals or visitation density constraints. In this work we unify these existing techniques and bridge the gap with classical optimizat…

    Submitted 25 April, 2025; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: Accepted for publication in IEEE Transactions on Artificial Intelligence

    ACM Class: I.2.8

  8. arXiv:2402.08493  [pdf, other]

    cs.LG stat.ML

    Sparsity via Sparse Group $k$-max Regularization

    Authors: Qinghua Tao, Xiangming Xi, Jun Xu, Johan A. K. Suykens

    Abstract: For the linear inverse problem with sparsity constraints, the $l_0$ regularized problem is NP-hard, and existing approaches either utilize greedy algorithms to find almost-optimal solutions or approximate the $l_0$ regularization with its convex counterparts. In this paper, we propose a novel and concise regularization, namely the sparse group $k$-max regularization, which can not only simultan…

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: 7 pages, accepted to American Control Conference 2024

  9. arXiv:2402.01476  [pdf, other]

    cs.LG cs.AI cs.CV stat.ML

    Self-Attention through Kernel-Eigen Pair Sparse Variational Gaussian Processes

    Authors: Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A. K. Suykens

    Abstract: While the great capability of Transformers significantly boosts prediction accuracy, it could also yield overconfident predictions and require calibrated uncertainty estimation, which can be commonly tackled by Gaussian processes (GPs). Existing works apply GPs with symmetric kernels under variational inference to the attention kernel; however, omitting the fact that attention kernels are in essen…

    Submitted 28 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: We propose Kernel-Eigen Pair Sparse Variational Gaussian Processes (KEP-SVGP) for building uncertainty-aware self-attention where the asymmetry of attention kernel is tackled by KSVD and a reduced time complexity is acquired

  10. arXiv:2401.13624  [pdf, other]

    stat.ML cs.LG

    Can overfitted deep neural networks in adversarial training generalize? -- An approximation viewpoint

    Authors: Zhongjie Shi, Fanghui Liu, Yuan Cao, Johan A. K. Suykens

    Abstract: Adversarial training is a widely used method to improve the robustness of deep neural networks (DNNs) over adversarial perturbations. However, it is empirically observed that adversarial training on over-parameterized networks often suffers from robust overfitting: it can achieve almost zero adversarial training error while the robust generalization performance is not promising. In th…

    Submitted 24 January, 2024; originally announced January 2024.

  11. arXiv:2401.02890  [pdf, other]

    stat.ML cs.LG

    Nonlinear functional regression by functional deep neural network with kernel embedding

    Authors: Zhongjie Shi, Jun Fan, Linhao Song, Ding-Xuan Zhou, Johan A. K. Suykens

    Abstract: Recently, deep learning has been widely applied in functional data analysis (FDA) with notable empirical success. However, the infinite dimensionality of functional data necessitates an effective dimension reduction approach for functional learning tasks, particularly in nonlinear functional regression. In this paper, we introduce a functional deep neural network with an adaptive and discretizatio…

    Submitted 12 May, 2025; v1 submitted 5 January, 2024; originally announced January 2024.

  12. arXiv:2310.13381  [pdf, other]

    cs.LG

    Accelerated sparse Kernel Spectral Clustering for large scale data clustering problems

    Authors: Mihaly Novak, Rocco Langone, Carlos Alzate, Johan Suykens

    Abstract: An improved version of the sparse multiway kernel spectral clustering (KSC) is presented in this brief. The original algorithm is derived from weighted kernel principal component analysis (KPCA) formulated within the primal-dual least-squares support vector machine (LS-SVM) framework. Sparsity is then achieved by the combination of the incomplete Cholesky decomposition (ICD) based low rank approxi…

    Submitted 20 October, 2023; originally announced October 2023.

  13. arXiv:2310.05236  [pdf, other]

    cs.LG

    Enhancing Kernel Flexibility via Learning Asymmetric Locally-Adaptive Kernels

    Authors: Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: The lack of sufficient flexibility is the key bottleneck of kernel-based learning that relies on manually designed, pre-given, and non-trainable kernels. To enhance kernel flexibility, this paper introduces the concept of Locally-Adaptive-Bandwidths (LAB) as trainable parameters to enhance the Radial Basis Function (RBF) kernel, giving rise to the LAB RBF kernel. The parameters in LAB RBF kernels…

    Submitted 8 October, 2023; originally announced October 2023.

  14. arXiv:2308.16056  [pdf, other]

    cs.LG

    Low-Rank Multitask Learning based on Tensorized SVMs and LSSVMs

    Authors: Jiani Liu, Qinghua Tao, Ce Zhu, Yipeng Liu, Xiaolin Huang, Johan A. K. Suykens

    Abstract: Multitask learning (MTL) leverages task-relatedness to enhance performance. With the emergence of multimodal data, tasks can now be referenced by multiple indices. In this paper, we employ high-order tensors, with each mode corresponding to a task index, to naturally represent tasks referenced by multiple indices and preserve their structural relations. Based on this representation, we propose a g…

    Submitted 30 August, 2023; originally announced August 2023.

  15. arXiv:2307.10078  [pdf, other]

    cs.LG stat.ML

    A Dual Formulation for Probabilistic Principal Component Analysis

    Authors: Henri De Plaen, Johan A. K. Suykens

    Abstract: In this paper, we characterize Probabilistic Principal Component Analysis in Hilbert spaces and demonstrate how the optimal solution admits a representation in dual space. This allows us to develop a generative framework for kernel methods. Furthermore, we show how it englobes Kernel Principal Component Analysis and illustrate its working on a toy and a real dataset.

    Submitted 19 July, 2023; originally announced July 2023.

    Comments: ICML 2023 Workshop on Duality for Modern Machine Learning (DP4ML). 14 pages (8 main + 5 appendix), 4 figures and 4 tables

  16. arXiv:2307.02402  [pdf, other]

    cs.CV cs.LG

    Unbalanced Optimal Transport: A Unified Framework for Object Detection

    Authors: Henri De Plaen, Pierre-François De Plaen, Johan A. K. Suykens, Marc Proesmans, Tinne Tuytelaars, Luc Van Gool

    Abstract: During training, supervised object detection tries to correctly match the predicted bounding boxes and associated classification scores to the ground truth. This is essential to determine which predictions are to be pushed towards which solutions, or to be discarded. Popular matching strategies include matching to the closest ground truth box (mostly used in combination with anchors), or matching…

    Submitted 5 July, 2023; originally announced July 2023.

    Comments: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2023)

  17. arXiv:2306.16431  [pdf, other]

    cs.LG cs.AI

    Increasing Performance And Sample Efficiency With Model-agnostic Interactive Feature Attributions

    Authors: Joran Michiels, Maarten De Vos, Johan Suykens

    Abstract: Model-agnostic feature attributions can provide local insights in complex ML models. If the explanation is correct, a domain expert can validate and trust the model's decision. However, if it contradicts the expert's knowledge, related work only corrects irrelevant features to improve the model. To allow for unlimited interaction, in this paper we provide model-agnostic implementations for two pop…

    Submitted 28 June, 2023; originally announced June 2023.

  18. arXiv:2306.10880  [pdf, other]

    cs.LG cs.AI

    Explaining the Model and Feature Dependencies by Decomposition of the Shapley Value

    Authors: Joran Michiels, Maarten De Vos, Johan Suykens

    Abstract: Shapley values have become one of the go-to methods to explain complex models to end-users. They provide a model agnostic post-hoc explanation with foundations in game theory: what is the worth of a player (in machine learning, a feature value) in the objective function (the output of the complex machine learning model). One downside is that they always require outputs of the model when some featu…

    Submitted 19 June, 2023; originally announced June 2023.

  19. arXiv:2306.07040  [pdf, other]

    cs.LG

    Nonlinear SVD with Asymmetric Kernels: feature learning and asymmetric Nyström method

    Authors: Qinghua Tao, Francesco Tonin, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: Asymmetric data naturally exist in real life, such as directed graphs. Different from the common kernel methods requiring Mercer kernels, this paper tackles the asymmetric kernel-based learning problem. We describe a nonlinear extension of the matrix Singular Value Decomposition through asymmetric kernels, namely KSVD. First, we construct two nonlinear feature mappings w.r.t. rows and columns of t…

    Submitted 12 June, 2023; originally announced June 2023.

  20. arXiv:2306.07015  [pdf, other]

    cs.LG

    Combining Primal and Dual Representations in Deep Restricted Kernel Machines Classifiers

    Authors: Francesco Tonin, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: In the context of deep learning with kernel machines, the deep Restricted Kernel Machine (DRKM) framework allows multiple levels of kernel PCA (KPCA) and Least-Squares Support Vector Machines (LSSVM) to be combined into a deep architecture using visible and hidden units. We propose a new method for DRKM classification coupling the objectives of KPCA and classification levels, with the hidden featu…

    Submitted 29 August, 2023; v1 submitted 12 June, 2023; originally announced June 2023.

  21. arXiv:2306.05815  [pdf, other]

    cs.LG stat.ML

    Extending Kernel PCA through Dualization: Sparsity, Robustness and Fast Algorithms

    Authors: Francesco Tonin, Alex Lambert, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: The goal of this paper is to revisit Kernel Principal Component Analysis (KPCA) through dualization of a difference of convex functions. This allows to naturally extend KPCA to multiple objective functions and leads to efficient gradient-based algorithms avoiding the expensive SVD of the Gram matrix. Particularly, we consider objective functions that can be written as Moreau envelopes, demonstrati…

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: 15 pages, ICML 2023

  22. arXiv:2305.19798  [pdf, other]

    cs.LG cs.AI cs.CV

    Primal-Attention: Self-attention through Asymmetric Kernel SVD in Primal Representation

    Authors: Yingyi Chen, Qinghua Tao, Francesco Tonin, Johan A. K. Suykens

    Abstract: Recently, a new line of works has emerged to understand and improve self-attention in Transformers by treating it as a kernel machine. However, existing works apply the methods for symmetric kernels to the asymmetric self-attention, resulting in a nontrivial gap between the analytical understanding and numerical implementation. In this paper, we provide a new perspective to represent and optimize…

    Submitted 5 December, 2023; v1 submitted 31 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023. We provide a primal-dual representation for the asymmetric self-attention in transformer that allows to avoid explicit computation of the kernel matrix

  23. arXiv:2305.17251  [pdf, other]

    cs.LG

    Duality in Multi-View Restricted Kernel Machines

    Authors: Sonny Achten, Arun Pandey, Hannes De Meulemeester, Bart De Moor, Johan A. K. Suykens

    Abstract: We propose a unifying setting that combines existing restricted kernel machine methods into a single primal-dual multi-view framework for kernel principal component analysis in both supervised and unsupervised settings. We derive the primal and dual representations of the framework and relate different training and inference algorithms from a theoretical perspective. We show how to achieve full eq…

    Submitted 6 July, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: ICML 2023 Workshop on Duality for Modern Machine Learning, Honolulu, Hawaii, USA

  24. arXiv:2304.06485  [pdf, ps, other]

    eess.SP cs.AI cs.LG

    CoRe-Sleep: A Multimodal Fusion Framework for Time Series Robust to Imperfect Modalities

    Authors: Konstantinos Kontras, Christos Chatzichristos, Huy Phan, Johan Suykens, Maarten De Vos

    Abstract: Sleep abnormalities can have severe health consequences. Automated sleep staging, i.e. labelling the sequence of sleep stages from the patient's physiological recordings, could simplify the diagnostic process. Previous work on automated sleep staging has achieved great results, mainly relying on the EEG signal. However, often multiple sources of information are available beyond EEG. This can be pa…

    Submitted 27 March, 2023; originally announced April 2023.

    Comments: 10 pages, 4 figures, 2 tables, journal

  25. Tensorized LSSVMs for Multitask Regression

    Authors: Jiani Liu, Qinghua Tao, Ce Zhu, Yipeng Liu, Johan A. K. Suykens

    Abstract: Multitask learning (MTL) can utilize the relatedness between multiple tasks for performance improvement. The advent of multimodal data allows tasks to be referenced by multiple indices. High-order tensors are capable of providing efficient representations for such tasks, while preserving structural task-relations. In this paper, a new MTL method is proposed by leveraging low-rank tensor analysis a…

    Submitted 4 March, 2023; originally announced March 2023.

    Journal ref: ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodes Island, Greece, 2023, pp. 1-5

  26. arXiv:2302.11220  [pdf, other]

    cs.LG

    Deep Kernel Principal Component Analysis for Multi-level Feature Learning

    Authors: Francesco Tonin, Qinghua Tao, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: Principal Component Analysis (PCA) and its nonlinear extension Kernel PCA (KPCA) are widely used across science and industry for data analysis and dimensionality reduction. Modern deep learning tools have achieved great empirical success, but a framework for deep principal component analysis is still lacking. Here we develop a deep kernel PCA methodology (DKPCA) to extract multiple levels of the m…

    Submitted 22 February, 2023; originally announced February 2023.

  27. Unsupervised Neighborhood Propagation Kernel Layers for Semi-supervised Node Classification

    Authors: Sonny Achten, Francesco Tonin, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: We present a deep Graph Convolutional Kernel Machine (GCKM) for semi-supervised node classification in graphs. The method is built of two main types of blocks: (i) We introduce unsupervised kernel machine layers propagating the node features in a one-hop neighborhood, using implicit node feature mappings. (ii) We specify a semi-supervised classification kernel machine through the lens of the Fench…

    Submitted 15 December, 2023; v1 submitted 31 January, 2023; originally announced January 2023.

    Comments: Accepted for publication in AAAI 2024

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, 38(10), 10766-10774, 2024

  28. arXiv:2301.09811  [pdf, other]

    cs.LG

    Multi-view Kernel PCA for Time series Forecasting

    Authors: Arun Pandey, Hannes De Meulemeester, Bart De Moor, Johan A. K. Suykens

    Abstract: In this paper, we propose a kernel principal component analysis model for multi-variate time series forecasting, where the training and prediction schemes are derived from the multi-view formulation of Restricted Kernel Machines. The training problem is simply an eigenvalue decomposition of the summation of two kernel matrices corresponding to the views of the input and output data. When a linear…

    Submitted 23 January, 2023; originally announced January 2023.

  29. arXiv:2207.11971  [pdf, other]

    cs.CV

    Jigsaw-ViT: Learning Jigsaw Puzzles in Vision Transformer

    Authors: Yingyi Chen, Xi Shen, Yahui Liu, Qinghua Tao, Johan A. K. Suykens

    Abstract: The success of Vision Transformer (ViT) in various computer vision tasks has promoted the ever-increasing prevalence of this convolution-free network. The fact that ViT works on image patches makes it potentially relevant to the problem of jigsaw puzzle solving, which is a classical self-supervised task aiming at reordering shuffled sequential image patches back to their natural form. Despite its…

    Submitted 5 January, 2023; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted to Pattern Recognition Letters 2022. Project page: https://yingyichen-cyy.github.io/Jigsaw-ViT/

  30. arXiv:2207.11559  [pdf, other]

    cs.LG

    Tensor-based Multi-view Spectral Clustering via Shared Latent Space

    Authors: Qinghua Tao, Francesco Tonin, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: Multi-view Spectral Clustering (MvSC) attracts increasing attention due to diverse data sources. However, most existing works are prohibited in out-of-sample predictions and overlook model interpretability and exploration of clustering results. In this paper, a new method for MvSC is proposed via a shared latent space from the Restricted Kernel Machine framework. Through the lens of conjugate feat…

    Submitted 23 July, 2022; originally announced July 2022.

    Comments: 14 pages, 12 figures, 5 tables

  31. arXiv:2206.13140  [pdf, other]

    cs.LG stat.ML

    Compressing Features for Learning with Noisy Labels

    Authors: Yingyi Chen, Shell Xu Hu, Xi Shen, Chunrong Ai, Johan A. K. Suykens

    Abstract: Supervised learning can be viewed as distilling relevant information from input data into feature representations. This process becomes difficult when supervision is noisy as the distilled information might not be relevant. In fact, recent research shows that networks can easily overfit all labels including those that are corrupted, and hence can hardly generalize to clean datasets. In this paper,…

    Submitted 27 June, 2022; originally announced June 2022.

    Comments: Accepted to TNNLS 2022. Project page: https://yingyichen-cyy.github.io/CompressFeatNoisyLabels/

  32. Piecewise Linear Neural Networks and Deep Learning

    Authors: Qinghua Tao, Li Li, Xiaolin Huang, Xiangming Xi, Shuning Wang, Johan A. K. Suykens

    Abstract: As a powerful modelling method, PieceWise Linear Neural Networks (PWLNNs) have proven successful in various fields, most recently in deep learning. To apply PWLNN methods, both the representation and the learning have long been studied. In 1977, the canonical representation pioneered the works of shallow PWLNNs learned by incremental designs, but the applications to large-scale data were prohibite…

    Submitted 18 June, 2022; originally announced June 2022.

    Comments: 23 pages, 6 figures

  33. arXiv:2202.01397  [pdf, other]

    cs.LG

    Learning with Asymmetric Kernels: Least Squares and Feature Interpretation

    Authors: Mingzhen He, Fan He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: Asymmetric kernels naturally exist in real life, e.g., for conditional probability and directed graphs. However, most of the existing kernel-based learning methods require kernels to be symmetric, which prevents the use of asymmetric kernels. This paper addresses the asymmetric kernel-based learning in the framework of the least squares support vector machine named AsK-LS, resulting in the first c…

    Submitted 2 February, 2022; originally announced February 2022.

  34. arXiv:2110.13501  [pdf, other]

    cs.LG eess.SY

    Tensor Network Kalman Filtering for Large-Scale LS-SVMs

    Authors: Maximilian Lucassen, Johan A. K. Suykens, Kim Batselier

    Abstract: Least squares support vector machines are a commonly used supervised learning method for nonlinear regression and classification. They can be implemented in either their primal or dual form. The latter requires solving a linear system, which can be advantageous as an explicit mapping of the data to a possibly infinite-dimensional feature space is avoided. However, for large-scale applications, cur…

    Submitted 26 October, 2021; originally announced October 2021.

  35. arXiv:2110.06910  [pdf, other]

    stat.ML cs.LG

    On the Double Descent of Random Features Models Trained with SGD

    Authors: Fanghui Liu, Johan A. K. Suykens, Volkan Cevher

    Abstract: We study generalization properties of random features (RF) regression in high dimensions optimized by stochastic gradient descent (SGD) in under-/over-parameterized regime. In this work, we derive precise non-asymptotic error bounds of RF regression under both constant and polynomial-decay step-size SGD setting, and observe the double descent phenomenon both theoretically and empirically. Our anal…

    Submitted 16 October, 2022; v1 submitted 13 October, 2021; originally announced October 2021.

    Comments: Accepted by NeurIPS22

  36. arXiv:2105.13949  [pdf, other]

    cs.LG stat.ML

    Latent Space Exploration Using Generative Kernel PCA

    Authors: David Winant, Joachim Schreurs, Johan A. K. Suykens

    Abstract: Kernel PCA is a powerful feature extractor which recently has seen a reformulation in the context of Restricted Kernel Machines (RKMs). These RKMs allow for a representation of kernel PCA in terms of hidden and visible units similar to Restricted Boltzmann Machines. This connection has led to insights on how to use kernel PCA in a generative procedure, called generative kernel PCA. In this paper,…

    Submitted 28 May, 2021; originally announced May 2021.

  37. arXiv:2105.13942  [pdf, other]

    cs.LG stat.ML

    Towards Deterministic Diverse Subset Sampling

    Authors: Joachim Schreurs, Michaël Fanuel, Johan A. K. Suykens

    Abstract: Determinantal point processes (DPPs) are well known models for diverse subset selection problems, including recommendation tasks, document summarization and image search. In this paper, we discuss a greedy deterministic adaptation of k-DPP. Deterministic algorithms are interesting for many applications, as they provide interpretability to the user by having no failure probability and always return…

    Submitted 28 May, 2021; originally announced May 2021.

  38. arXiv:2104.13766  [pdf, other]

    cs.CV

    Boosting Co-teaching with Compression Regularization for Label Noise

    Authors: Yingyi Chen, Xi Shen, Shell Xu Hu, Johan A. K. Suykens

    Abstract: In this paper, we study the problem of learning image classification models in the presence of label noise. We revisit a simple compression regularization named Nested Dropout. We find that Nested Dropout, though originally proposed to perform fast information retrieval and adaptive data compression, can properly regularize a neural network to combat label noise. Moreover, owing to its simplicity,…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR Workshop 2021. Project page: https://github.com/yingyichen-cyy/Nested-Co-teaching

  39. arXiv:2104.02373  [pdf, other]

    cs.LG stat.ML

    Leverage Score Sampling for Complete Mode Coverage in Generative Adversarial Networks

    Authors: Joachim Schreurs, Hannes De Meulemeester, Michaël Fanuel, Bart De Moor, Johan A. K. Suykens

    Abstract: Commonly, machine learning models minimize an empirical expectation. As a result, the trained models typically perform well for the majority of the data but the performance may deteriorate in less dense regions of the dataset. This issue also arises in generative modeling. A generative model may overlook underrepresented modes that are less frequent in the empirical data distribution. This problem…

    Submitted 21 July, 2021; v1 submitted 6 April, 2021; originally announced April 2021.

  40. Neural Network Training as an Optimal Control Problem: An Augmented Lagrangian Approach

    Authors: Brecht Evens, Puya Latafat, Andreas Themelis, Johan Suykens, Panagiotis Patrinos

    Abstract: Training of neural networks amounts to nonconvex optimization problems that are typically solved by using backpropagation and (variants of) stochastic gradient descent. In this work we propose an alternative approach by viewing the training task as a nonlinear optimal control problem. Under this lens, backpropagation amounts to the sequential approach (single shooting) to optimal control, where th…

    Submitted 6 May, 2021; v1 submitted 26 March, 2021; originally announced March 2021.

    Comments: 8 pages; typos corrected

    MSC Class: 49L20; 49M15; 49M37; 68T07; 90C06; 90C26; 90C30

    Journal ref: 60th IEEE Conference on Decision and Control (CDC 2021)

  41. arXiv:2102.08443  [pdf, other]

    cs.LG

    Unsupervised Energy-based Out-of-distribution Detection using Stiefel-Restricted Kernel Machine

    Authors: Francesco Tonin, Arun Pandey, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: Detecting out-of-distribution (OOD) samples is an essential requirement for the deployment of machine learning systems in the real world. Until now, research on energy-based OOD detectors has focused on the softmax confidence score from a pre-trained neural network classifier with access to class labels. In contrast, we propose an unsupervised energy-based OOD detector leveraging the Stiefel-Restr…

    Submitted 16 February, 2021; originally announced February 2021.

  42. arXiv:2011.12659  [pdf, other]

    cs.LG stat.ML

    Unsupervised learning of disentangled representations in deep restricted kernel machines with orthogonality constraints

    Authors: Francesco Tonin, Panagiotis Patrinos, Johan A. K. Suykens

    Abstract: We introduce Constr-DRKM, a deep kernel method for the unsupervised learning of disentangled data representations. We propose augmenting the original deep restricted kernel machine formulation for kernel PCA by orthogonality constraints on the latent variables to promote disentanglement and to make it possible to carry out optimization without first defining a stabilized objective. After illustrat…

    Submitted 25 November, 2020; originally announced November 2020.

  43. arXiv:2011.06964  [pdf, ps, other]

    cs.LG stat.ML

    Determinantal Point Processes Implicitly Regularize Semi-parametric Regression Problems

    Authors: Michaël Fanuel, Joachim Schreurs, Johan A. K. Suykens

    Abstract: Semi-parametric regression models are used in several applications which require comprehensibility without sacrificing accuracy. Typical examples are spline interpolation in geophysics, or non-linear time series problems, where the system includes a linear and non-linear component. We discuss here the use of a finite Determinantal Point Process (DPP) for approximating semi-parametric models. Recen…

    Submitted 9 March, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Comments: 26 pages. Extended results. Typos corrected

  44. arXiv:2011.01668  [pdf, other]

    cs.LG stat.ML

    Towards a Unified Quadrature Framework for Large-Scale Kernel Machines

    Authors: Fanghui Liu, Xiaolin Huang, Yudong Chen, Johan A. K. Suykens

    Abstract: In this paper, we develop a quadrature framework for large-scale kernel machines via a numerical integration representation. Considering that the integration domain and measure of typical kernels, e.g., Gaussian kernels, arc-cosine kernels, are fully symmetric, we leverage deterministic fully symmetric interpolatory rules to efficiently compute quadrature nodes and associated weights for kernel ap…

    Submitted 10 June, 2021; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: 17 pages, 9 figures

  45. arXiv:2010.02681  [pdf, other]

    stat.ML cs.LG

    Kernel regression in high dimensions: Refined analysis beyond double descent

    Authors: Fanghui Liu, Zhenyu Liao, Johan A. K. Suykens

    Abstract: In this paper, we provide a precise characterization of generalization properties of high dimensional kernel ridge regression across the under- and over-parameterized regimes, depending on whether the number of training data n exceeds the feature dimension d. By establishing a bias-variance decomposition of the expected excess risk, we show that, while the bias is (almost) independent of d and mon…

    Submitted 23 February, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: Accepted at AISTATS 2021

  46. arXiv:2008.02046  [pdf, other]

    stat.ML cs.LG stat.CO

    Outlier detection in non-elliptical data by kernel MRCD

    Authors: Joachim Schreurs, Iwein Vranckx, Mia Hubert, Johan A. K. Suykens, Peter J. Rousseeuw

    Abstract: The minimum regularized covariance determinant method (MRCD) is a robust estimator for multivariate location and scatter, which detects outliers by fitting a robust covariance matrix to the data. Its regularization ensures that the covariance matrix is well-conditioned in any dimension. The MRCD assumes that the non-outlying observations are roughly elliptically distributed, but many datasets are…

    Submitted 29 March, 2021; v1 submitted 5 August, 2020; originally announced August 2020.

    Journal ref: Statistics and Computing, 2021, Volume 31, article 66

  47. arXiv:2006.14331  [pdf, other]

    cs.LG stat.ML

    A Theoretical Framework for Target Propagation

    Authors: Alexander Meulemans, Francesco S. Carzaniga, Johan A. K. Suykens, João Sacramento, Benjamin F. Grewe

    Abstract: The success of deep learning, a brain-inspired form of AI, has sparked interest in understanding how the brain could similarly learn across multiple layers of neurons. However, the majority of biologically-plausible learning algorithms have not yet reached the performance of backpropagation (BP), nor are they built on strong theoretical foundations. Here, we analyze target propagation (TP), a popu…

    Submitted 16 December, 2020; v1 submitted 25 June, 2020; originally announced June 2020.

    Comments: 13 pages and 4 figures in main manuscript; 41 pages and 8 figures in supplementary material

    MSC Class: 68T07

  48. arXiv:2006.13701  [pdf, ps, other]

    cs.LG stat.ML

    Ensemble Kernel Methods, Implicit Regularization and Determinantal Point Processes

    Authors: Joachim Schreurs, Michaël Fanuel, Johan A. K. Suykens

    Abstract: By using the framework of Determinantal Point Processes (DPPs), some theoretical results concerning the interplay between diversity and regularization can be obtained. In this paper we show that sampling subsets with kDPPs results in implicit regularization in the context of ridgeless Kernel Regression. Furthermore, we leverage the common setup of state-of-the-art DPP algorithms to sample multiple…

    Submitted 7 July, 2020; v1 submitted 24 June, 2020; originally announced June 2020.

  49. arXiv:2006.09096  [pdf, other]

    cs.LG eess.IV stat.ML

    The Bures Metric for Generative Adversarial Networks

    Authors: Hannes De Meulemeester, Joachim Schreurs, Michaël Fanuel, Bart De Moor, Johan A. K. Suykens

    Abstract: Generative Adversarial Networks (GANs) are performant generative methods yielding high-quality samples. However, under certain circumstances, the training of GANs can lead to mode collapse or mode dropping, i.e. the generative models not being able to sample from the entire probability distribution. To address this problem, we use the last layer of the discriminator as a feature map to study the d…

    Submitted 27 April, 2021; v1 submitted 16 June, 2020; originally announced June 2020.

    Comments: Additional empirical results

  50. arXiv:2006.07046  [pdf, other]

    cs.LG stat.ML

    Disentangled Representation Learning and Generation with Manifold Optimization

    Authors: Arun Pandey, Michael Fanuel, Joachim Schreurs, Johan A. K. Suykens

    Abstract: Disentanglement is a useful property in representation learning which increases the interpretability of generative models such as Variational autoencoders (VAE), Generative Adversarial Models, and their many variants. Typically in such models, an increase in disentanglement performance is traded-off with generation quality. In the context of latent space models, this work presents a representation…

    Submitted 30 May, 2022; v1 submitted 12 June, 2020; originally announced June 2020.