Showing 1–33 of 33 results for author: Nicolau, M

  1. arXiv:2505.21364  [pdf, ps, other]

    cs.LG cs.AI

    Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders

    Authors: James Oldfield, Shawn Im, Yixuan Li, Mihalis A. Nicolaou, Ioannis Patras, Grigorios G Chrysos

    Abstract: Multilayer perceptrons (MLPs) are an integral part of large language models, yet their dense representations render them difficult to understand, edit, and steer. Recent methods learn interpretable approximations via neuron-level sparsity, yet fail to faithfully reconstruct the original mapping, significantly increasing the model's next-token cross-entropy loss. In this paper, we advocate for moving t…

    Submitted 27 May, 2025; originally announced May 2025.

  2. arXiv:2503.20605  [pdf, other]

    physics.flu-dyn physics.comp-ph

    Immersed boundary - lattice Boltzmann method for wetting problems

    Authors: Elisa Bellantoni, Fabio Guglietta, Francesca Pelusi, Mathieu Desbrun, Kiwon Um, Mihalis Nicolaou, Nikos Savva, Mauro Sbragaglia

    Abstract: We develop a mesoscale computational model to describe the interaction of a droplet with a solid. The model is based on the hybrid combination of the immersed boundary and the lattice Boltzmann computational schemes: the former is used to model the non-ideal sharp interface of the droplet coupled with the inner and outer fluids, simulated with the lattice Boltzmann scheme. We further introduce an…

    Submitted 27 March, 2025; v1 submitted 26 March, 2025; originally announced March 2025.

    Comments: 14 pages, 9 figures

  3. arXiv:2408.16845  [pdf, other]

    cs.CV cs.LG

    Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis

    Authors: Theodoros Kouzelis, Manos Plitsis, Mihalis A. Nicolaou, Yannis Panagakis

    Abstract: Recent advances in Diffusion Models (DMs) have led to significant progress in visual synthesis and editing tasks, establishing them as a strong competitor to Generative Adversarial Networks (GANs). However, the latent space of DMs is not as well understood as that of GANs. Recent research has focused on unsupervised semantic discovery in the latent space of DMs by leveraging the bottleneck layer o…

    Submitted 2 September, 2024; v1 submitted 29 August, 2024; originally announced August 2024.

    Comments: Accepted at BMVC2024

  4. arXiv:2405.18045  [pdf, other]

    cs.LG cs.CV

    Bridging Mini-Batch and Asymptotic Analysis in Contrastive Learning: From InfoNCE to Kernel-Based Losses

    Authors: Panagiotis Koromilas, Giorgos Bouritsas, Theodoros Giannakopoulos, Mihalis Nicolaou, Yannis Panagakis

    Abstract: What do different contrastive learning (CL) losses actually optimize for? Although multiple CL methods have demonstrated remarkable representation learning capabilities, the differences in their inner workings remain largely opaque. In this work, we analyse several CL families and prove that, under certain conditions, they admit the same minimisers when optimizing either their batch-level objectiv…

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Accepted at ICML 2024. Code available at: https://github.com/pakoromilas/DHEL-KCL.git

  5. arXiv:2402.12550  [pdf, other]

    cs.CV cs.LG

    Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization

    Authors: James Oldfield, Markos Georgopoulos, Grigorios G. Chrysos, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Jiankang Deng, Ioannis Patras

    Abstract: The Mixture of Experts (MoE) paradigm provides a powerful way to decompose dense layers into smaller, modular computations often more amenable to human interpretation, debugging, and editability. However, a major challenge lies in the computational cost of scaling the number of experts high enough to achieve fine-grained specialization. In this paper, we propose the Multilinear Mixture of Experts…

    Submitted 16 October, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: Accepted at NeurIPS 2024. Github: https://github.com/james-oldfield/muMoE. Project page: https://james-oldfield.github.io/muMoE

  6. arXiv:2309.14883  [pdf, other]

    cs.CV cs.LG

    Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs

    Authors: Georgia Kourmouli, Nikos Kostagiolas, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: We present a locality-aware method for interpreting the latent space of wavelet-based Generative Adversarial Networks (GANs), that can well capture the large spatial and spectral variability that is characteristic to satellite imagery. By focusing on preserving locality, the proposed method is able to decompose the weight-space of pre-trained GANs and recover interpretable directions that correspo…

    Submitted 26 September, 2023; originally announced September 2023.

  7. arXiv:2305.14053  [pdf, other]

    cs.CV cs.LG

    Parts of Speech-Grounded Subspaces in Vision-Language Models

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Latent image representations arising from vision-language models have proved immensely useful for a variety of downstream tasks. However, their utility is limited by their entanglement with respect to different visual attributes. For instance, recent work has shown that CLIP image representations are often biased toward specific visual properties (such as objects or actions) in an unpredictable ma…

    Submitted 12 November, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: Accepted at NeurIPS 2023

  8. arXiv:2208.02089  [pdf, other]

    cs.CV cs.LG eess.IV

    Unsupervised Discovery of Semantic Concepts in Satellite Imagery with Style-based Wavelet-driven Generative Models

    Authors: Nikos Kostagiolas, Mihalis A. Nicolaou, Yannis Panagakis

    Abstract: In recent years, considerable advancements have been made in the area of Generative Adversarial Networks (GANs), particularly with the advent of style-based architectures that address many key shortcomings - both in terms of modeling capabilities and network interpretability. Despite these improvements, the adoption of such approaches in the domain of satellite imagery is not straightforward. Typi…

    Submitted 3 August, 2022; originally announced August 2022.

    Comments: 11 pages, 5 figures, accepted at SETN 2022

  9. arXiv:2206.00048  [pdf, other]

    cs.CV cs.LG

    PandA: Unsupervised Learning of Parts and Appearances in the Feature Maps of GANs

    Authors: James Oldfield, Christos Tzelepis, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: Recent advances in the understanding of Generative Adversarial Networks (GANs) have led to remarkable progress in visual editing and synthesis tasks, capitalizing on the rich semantics that are embedded in the latent spaces of pre-trained GANs. However, existing methods are often tailored to specific GAN architectures and are limited to either discovering global semantic directions that do not fac…

    Submitted 6 February, 2023; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: Accepted at ICLR 2023. Code available at: https://github.com/james-oldfield/PandA

  10. arXiv:2111.11736  [pdf, other]

    cs.CV

    Tensor Component Analysis for Interpreting the Latent Space of GANs

    Authors: James Oldfield, Markos Georgopoulos, Yannis Panagakis, Mihalis A. Nicolaou, Ioannis Patras

    Abstract: This paper addresses the problem of finding interpretable directions in the latent space of pre-trained Generative Adversarial Networks (GANs) to facilitate controllable image synthesis. Such interpretable directions correspond to transformations that can affect both the style and geometry of the synthetic images. However, existing approaches that utilise linear techniques to find these transforma…

    Submitted 23 November, 2021; originally announced November 2021.

    Comments: BMVC 2021

  11. arXiv:2108.04240  [pdf, other]

    q-bio.QM cs.LG

    Classification of Influenza Hemagglutinin Protein Sequences using Convolutional Neural Networks

    Authors: Charalambos Chrysostomou, Floris Alexandrou, Mihalis A. Nicolaou, Huseyin Seker

    Abstract: The Influenza virus can be considered as one of the most severe viruses that can infect multiple species with often fatal consequences to the hosts. The Hemagglutinin (HA) gene of the virus can be a target for antiviral drug development realised through accurate identification of its sub-types and possibly the targeted hosts. This paper focuses on accurately predicting if an Influenza type A virus…

    Submitted 9 August, 2021; originally announced August 2021.

  12. Tensor Methods in Computer Vision and Deep Learning

    Authors: Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos, James Oldfield, Mihalis A. Nicolaou, Anima Anandkumar, Stefanos Zafeiriou

    Abstract: Tensors, or multidimensional arrays, are data structures that can naturally represent visual data of multiple dimensions. Inherently able to efficiently capture structured, latent semantic spaces and high-order interactions, tensors have a long history of applications in a wide span of computer vision problems. With the advent of the deep learning paradigm shift in computer vision, tensors have be…

    Submitted 7 July, 2021; originally announced July 2021.

    Comments: Proceedings of the IEEE (2021)

  13. arXiv:2009.05908  [pdf, other]

    cs.LG stat.ML

    Understanding Boolean Function Learnability on Deep Neural Networks: PAC Learning Meets Neurosymbolic Models

    Authors: Marcio Nicolau, Anderson R. Tavares, Zhiwei Zhang, Pedro Avelar, João M. Flach, Luis C. Lamb, Moshe Y. Vardi

    Abstract: Computational learning theory states that many classes of boolean formulas are learnable in polynomial time. This paper addresses the understudied subject of how, in practice, such formulas can be learned by deep neural networks. Specifically, we analyze boolean formulas associated with model-sampling benchmarks, combinatorial optimization problems, and random 3-CNFs with varying degrees of constr…

    Submitted 17 November, 2022; v1 submitted 12 September, 2020; originally announced September 2020.

  14. arXiv:2006.03985  [pdf, other]

    cs.CV

    Enhancing Facial Data Diversity with Style-based Face Aging

    Authors: Markos Georgopoulos, James Oldfield, Mihalis A. Nicolaou, Yannis Panagakis, Maja Pantic

    Abstract: A significant limiting factor in training fair classifiers relates to the presence of dataset bias. In particular, face datasets are typically biased in terms of attributes such as gender, age, and race. If not mitigated, bias leads to algorithms that exhibit unfair behaviour towards such groups. In this work, we address the problem of increasing the diversity of face datasets with respect to age.…

    Submitted 6 June, 2020; originally announced June 2020.

    Comments: IEEE CVPR 2020 Workshop on Fair, Data Efficient and Trusted Computer Vision

  15. arXiv:2001.11811  [pdf, ps, other]

    cs.CC

    The Lagrangian remainder of Taylor's series, distinguishes $\mathcal{O}(f(x))$ time complexities to polynomials or not

    Authors: Nikolaos P. Bakas, Elias Kosmatopoulos, Mihalis Nicolaou, Savvas A. Chatzichristofis

    Abstract: The purpose of this letter is to investigate the time complexity consequences of the truncated Taylor series, known as Taylor Polynomials \cite{bakas2019taylor,Katsoprinakis2011,Nestoridis2011}. In particular, it is demonstrated that the examination of the $\mathbf{P=NP}$ equality is associated with the determination of whether the $n^{th}$ derivative of a particular solution is bounded or not. A…

    Submitted 27 May, 2020; v1 submitted 30 January, 2020; originally announced January 2020.

  16. arXiv:1909.13563  [pdf, other]

    cs.LG stat.ML

    A Gradient Free Neural Network Framework Based on Universal Approximation Theorem

    Authors: Nikolaos P. Bakas, Andreas Langousis, Mihalis Nicolaou, Savvas A. Chatzichristofis

    Abstract: We present a numerical scheme for computation of Artificial Neural Networks (ANN) weights, which stems from the Universal Approximation Theorem, avoiding laborious iterations. The proposed algorithm adheres to the underlying theory, is highly fast, and results in remarkably low errors when applied for regression and classification of complex data-sets, such as the Griewank function of multiple var…

    Submitted 18 August, 2020; v1 submitted 30 September, 2019; originally announced September 2019.

  17. arXiv:1905.13694  [pdf, other]

    cs.CV

    Multimodal Joint Emotion and Game Context Recognition in League of Legends Livestreams

    Authors: Charles Ringer, James Alfred Walker, Mihalis A. Nicolaou

    Abstract: Video game streaming provides the viewer with a rich set of audio-visual data, conveying information both with regards to the game itself, through game footage and audio, as well as the streamer's emotional state and behaviour via webcam footage and audio. Analysing player behaviour and discovering correlations with game context is crucial for modelling and understanding important aspects of lives…

    Submitted 31 May, 2019; originally announced May 2019.

    Comments: 8 Pages, IEEE Conference on Games 2019

  18. arXiv:1905.00307  [pdf, other]

    cs.CV

    3DFaceGAN: Adversarial Nets for 3D Face Representation, Generation, and Translation

    Authors: Stylianos Moschoglou, Stylianos Ploumpis, Mihalis Nicolaou, Athanasios Papaioannou, Stefanos Zafeiriou

    Abstract: Over the past few years, Generative Adversarial Networks (GANs) have garnered increased interest among researchers in Computer Vision, with applications including, but not limited to, image generation, translation, imputation, and super-resolution. Nevertheless, no GAN-based method has been proposed in the literature that can successfully represent, generate or translate 3D facial shapes (meshes).…

    Submitted 9 May, 2019; v1 submitted 1 May, 2019; originally announced May 2019.

    Comments: 15 pages, 12 figures. Submitted to International Journal of Computer Vision (IJCV), special issue: Generative Adversarial Networks for Computer Vision

  19. arXiv:1904.04772  [pdf, other]

    cs.CV cs.LG

    Adversarial Learning of Disentangled and Generalizable Representations for Visual Attributes

    Authors: James Oldfield, Yannis Panagakis, Mihalis A. Nicolaou

    Abstract: Recently, a multitude of methods for image-to-image translation have demonstrated impressive results on problems such as multi-domain or multi-attribute transfer. The vast majority of such works leverages the strengths of adversarial learning and deep convolutional autoencoders to achieve realistic results by well-capturing the target data distribution. Nevertheless, the most prominent representat…

    Submitted 30 January, 2021; v1 submitted 9 April, 2019; originally announced April 2019.

  20. arXiv:1810.07244  [pdf, ps, other]

    math.DG

    On the automorphism group of foliations with geometric transverse structure

    Authors: Laurent Meersseman, Marcel Nicolau, Javier Ribon

    Abstract: Motivated by questions of deformations/moduli in foliation theory, we investigate the structure of some groups of diffeomorphisms preserving a foliation. We give an example of a $C^\infty$ foliation whose diffeomorphism group is not a Lie group in any reasonable sense. On the positive side, we prove that the automorphism group of a transversely holomorphic foliation or a riemannian foliation is a…

    Submitted 24 March, 2022; v1 submitted 16 October, 2018; originally announced October 2018.

    Comments: 22 pages. Accepted for publication in Mathematische Zeitschrift

  21. arXiv:1807.09715  [pdf, other]

    cs.CV

    Deep Unsupervised Multi-View Detection of Video Game Stream Highlights

    Authors: Charles Ringer, Mihalis A. Nicolaou

    Abstract: We consider the problem of automatic highlight-detection in video game streams. Currently, the vast majority of highlight-detection systems for games are triggered by the occurrence of hard-coded game events (e.g., score change, end-game), while most advanced tools and techniques are based on detection of highlights via visual analysis of game footage. We argue that in the context of game streamin…

    Submitted 25 July, 2018; originally announced July 2018.

    Comments: Foundation of Digital Games 2018, 6 pages

  22. arXiv:1804.10938  [pdf, other]

    cs.CV cs.AI cs.HC eess.IV stat.ML

    Deep Affect Prediction in-the-wild: Aff-Wild Database and Challenge, Deep Architectures, and Beyond

    Authors: Dimitrios Kollias, Panagiotis Tzirakis, Mihalis A. Nicolaou, Athanasios Papaioannou, Guoying Zhao, Björn Schuller, Irene Kotsia, Stefanos Zafeiriou

    Abstract: Automatic understanding of human affect using visual signals is of great importance in everyday human-machine interactions. Appraising human emotional states, behaviors and reactions displayed in real-world settings, can be accomplished using latent continuous dimensions (e.g., the circumplex model of affect). Valence (i.e., how positive or negative is an emotion) & arousal (i.e., power of the act…

    Submitted 1 February, 2019; v1 submitted 29 April, 2018; originally announced April 2018.

  23. arXiv:1802.00030  [pdf, other]

    cs.LG stat.ML

    Fusarium Damaged Kernels Detection Using Transfer Learning on Deep Neural Network Architecture

    Authors: Márcio Nicolau, Márcia Barrocas Moreira Pimentel, Casiane Salete Tibola, José Mauricio Cunha Fernandes, Willingthon Pavan

    Abstract: The present work shows the application of transfer learning for a pre-trained deep neural network (DNN), using a small image dataset ($\approx$ 12,000) on a single workstation with enabled NVIDIA GPU card that takes up to 1 hour to complete the training task and achieve an overall average accuracy of $94.7\%$. The DNN presents a $20\%$ score of misclassification for an external test dataset. The a…

    Submitted 31 January, 2018; originally announced February 2018.

  24. Multi-Attribute Robust Component Analysis for Facial UV Maps

    Authors: Stylianos Moschoglou, Evangelos Ververas, Yannis Panagakis, Mihalis Nicolaou, Stefanos Zafeiriou

    Abstract: Recently, due to the collection of large scale 3D face models, as well as the advent of deep learning, significant progress has been made in the field of 3D face alignment "in-the-wild". That is, many methods have been proposed that establish sparse or dense 3D correspondences between a 2D facial image and a 3D face model. The utilization of 3D face alignment introduces new challenges and resear…

    Submitted 15 December, 2017; originally announced December 2017.

  25. End-to-End Multimodal Emotion Recognition using Deep Neural Networks

    Authors: Panagiotis Tzirakis, George Trigeorgis, Mihalis A. Nicolaou, Björn Schuller, Stefanos Zafeiriou

    Abstract: Automatic affect recognition is a challenging task due to the various modalities emotions can be expressed with. Applications can be found in many domains including multimedia retrieval and human computer interaction. In recent years, deep neural networks have been used with great success in determining emotional states. Inspired by this success, we propose an emotion recognition system using audi…

    Submitted 27 April, 2017; originally announced April 2017.

  26. arXiv:1704.03522  [pdf]

    cs.NE

    Improving Fitness Functions in Genetic Programming for Classification on Unbalanced Credit Card Datasets

    Authors: Van Loi Cao, Nhien-An Le-Khac, Miguel Nicolau, Michael ONeill, James McDermott

    Abstract: Credit card fraud detection based on machine learning has recently attracted considerable interest from the research community. One of the most important tasks in this area is the ability of classifiers to handle the imbalance in credit card data. In this scenario, classifiers tend to yield poor accuracy on the fraud class (minority class) despite realizing high overall accuracy. This is due to th…

    Submitted 11 April, 2017; originally announced April 2017.

  27. Deformations and Moduli of Structures on Manifolds: General Existence Theorem and Application to the Sasakian Case

    Authors: Laurent Meersseman, Marcel Nicolau

    Abstract: In this paper, we prove an existence theorem of a local moduli space for geometric structures in a very general setting. Then to show the interest of this result, we apply it to the case of sasakian and Sasaki-Einstein structures.

    Submitted 16 October, 2015; v1 submitted 27 March, 2015; originally announced March 2015.

    Comments: Final version to be published in Annali Sc. Norm. Sup Pisa

  28. arXiv:1503.04627  [pdf, ps, other]

    math.DS math.AG math.CV

    Foliations and webs inducing Galois coverings

    Authors: Andrés Beltrán, Maycol Falla Luza, David Marín, Marcel Nicolau

    Abstract: We introduce the notion of Galois holomorphic foliation on the complex projective space as that of foliations whose Gauss map is a Galois covering when restricted to an appropriate Zariski open subset. First, we establish general criteria assuring that a rational map between projective manifolds of the same dimension defines a Galois covering. Then, these criteria are used to give a geometric char…

    Submitted 16 March, 2015; originally announced March 2015.

    MSC Class: 14E05; 14E20; 37F75; 53A60; 32S65

  29. arXiv:1303.3240  [pdf, other]

    cs.LG cs.CV stat.ML

    A Unified Framework for Probabilistic Component Analysis

    Authors: Mihalis A. Nicolaou, Stefanos Zafeiriou, Maja Pantic

    Abstract: We present a unifying framework which reduces the construction of probabilistic component analysis techniques to a mere selection of the latent neighbourhood, thus providing an elegant and principled framework for creating novel component analysis models as well as constructing probabilistic equivalents of deterministic component analysis methods. Under our framework, we unify many very popular an…

    Submitted 14 November, 2014; v1 submitted 13 March, 2013; originally announced March 2013.

  30. arXiv:1005.2815  [pdf, ps, other]

    cs.AI

    Evolving Genes to Balance a Pole

    Authors: Miguel Nicolau, Marc Schoenauer, W. Banzhaf

    Abstract: We discuss how to use a Genetic Regulatory Network as an evolutionary representation to solve a typical GP reinforcement problem, the pole balancing. The network is a modified version of an Artificial Regulatory Network proposed a few years ago, and the task could be solved only by finding a proper way of connecting inputs and outputs to the network. We show that the representation is able to gene…

    Submitted 17 May, 2010; originally announced May 2010.

    Journal ref: European Conference on Genetic Programming, Istanbul, Turkey (2010)

  31. arXiv:0909.4690  [pdf, ps, other]

    math.AG math.DG

    Deformations of Kahler manifolds with non vanishing holomorphic vector fields

    Authors: Jaume Amoros, Monica Manjarin, Marcel Nicolau

    Abstract: In this article we study compact Kähler manifolds $X$ admitting non-singular holomorphic vector fields with the aim of extending to this setting the classical birational classification of projective varieties with tangent vector fields. We prove that any such Kähler manifold $X$ admits an arbitrarily small deformation of a particular type which is a suspension over a torus; that is, a quotient o…

    Submitted 19 July, 2010; v1 submitted 25 September, 2009; originally announced September 2009.

    Comments: 38 pages, change in the order of the sections, to appear in JEMS

    MSC Class: 32G07; 32M; 32J27; 37F75; 14M17; 14L27

  32. arXiv:0902.4301  [pdf, ps, other]

    math.CV

    Deformations Feuilletees Des Varietes De Hopf

    Authors: Laurent Meersseman, Marcel Nicolau, Alberto Verjovsky

    Abstract: In this article, we focus on a very special class of foliations with complex leaves whose diffeomorphism type is fixed. They have a unique compact leaf and the noncompact leaves all accumulate onto it. We show that the complex structure along the non-compact leaves is fixed by the complex structure of the compact leaf. Reciprocally, we prove that the complex structure along a non-compact leaf de…

    Submitted 25 February, 2009; originally announced February 2009.

  33. arXiv:math/0610915  [pdf, ps, other]

    math.DG

    Complex and CR-structures on compact Lie groups associated to Abelian actions

    Authors: J. -J. Loeb, M. Manjarin, M. Nicolau

    Abstract: It was shown by Samelson and Wang that each compact Lie group K of even dimension admits left-invariant complex structures. When K has odd dimension it admits a left-invariant CR-structure of maximal dimension. This has been proved recently by Charbonnel and Khalgui who have also given a complete algebraic description of these structures. In this article we present an alternative and more geomet…

    Submitted 30 October, 2006; originally announced October 2006.

    MSC Class: 32C10;32C16