Skip to main content

Showing 1–6 of 6 results for author: Sidén, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.14060  [pdf, other

    cs.LG cs.AI cs.CV

    On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods

    Authors: Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten

    Abstract: A prominent self-supervised learning paradigm is to model the representations as clusters, or more generally as a mixture model. Learning to map the data samples to compact representations and fitting the mixture model simultaneously leads to the representation collapse problem. Regularizing the distribution of data points over the clusters is the prevalent strategy to avoid this issue. While this… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

    Comments: First version of the paper appeared in OpenReview on 22 Sep 2023. Accepted to BMVC 2024

  2. arXiv:2405.10939  [pdf, other

    cs.LG cs.AI cs.CV

    DINO as a von Mises-Fisher mixture model

    Authors: Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten

    Abstract: Self-distillation methods using Siamese networks are popular for self-supervised pre-training. DINO is one such method based on a cross-entropy loss between $K$-dimensional probability vectors, obtained by applying a softmax function to the dot product between representations and learnt prototypes. Given the fact that the learned representations are $L^2$-normalized, we show that DINO and its deri… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: Accepted to ICLR 2023

  3. arXiv:2302.08415  [pdf, other

    stat.ML cs.LG cs.SI

    Temporal Graph Neural Networks for Irregular Data

    Authors: Joel Oskarsson, Per Sidén, Fredrik Lindsten

    Abstract: This paper proposes a temporal graph neural network model for forecasting of graph-structured irregularly observed time series. Our TGNN4I model is designed to handle both irregular time steps and partial observations of the graph. This is achieved by introducing a time-continuous latent state in each node, following a linear Ordinary Differential Equation (ODE) defined by the output of a Gated Re… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 17 pages, 4 figures. Accepted to AISTATS 2023. Code available at https://github.com/joeloskarsson/tgnn4i

  4. arXiv:2206.05032  [pdf, other

    stat.ML cs.LG cs.SI stat.CO

    Scalable Deep Gaussian Markov Random Fields for General Graphs

    Authors: Joel Oskarsson, Per Sidén, Fredrik Lindsten

    Abstract: Machine learning methods on graphs have proven useful in many applications due to their ability to handle generally structured data. The framework of Gaussian Markov Random Fields (GMRFs) provides a principled way to define Gaussian models on graphs by utilizing their sparsity structure. We propose a flexible GMRF model for general graphs built on the multi-layer structure of Deep GMRFs, originall… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 22 pages, 10 figures. Accepted at ICML 2022. Code available at https://github.com/joeloskarsson/graph-dgmrf

  5. arXiv:2002.07467  [pdf, other

    stat.ML cs.LG stat.CO stat.ME

    Deep Gaussian Markov Random Fields

    Authors: Per Sidén, Fredrik Lindsten

    Abstract: Gaussian Markov random fields (GMRFs) are probabilistic graphical models widely used in spatial statistics and related fields to model dependencies over spatial structures. We establish a formal connection between GMRFs and convolutional neural networks (CNNs). Common GMRFs are special cases of a generative model where the inverse mapping from data to latent variables is given by a 1-layer linear… ▽ More

    Submitted 10 August, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  6. arXiv:1903.10443  [pdf, other

    cs.RO stat.AP

    Real-Time Robotic Search using Hierarchical Spatial Point Processes

    Authors: Olov Andersson, Per Sidén, Johan Dahlin, Patrick Doherty, Mattias Villani

    Abstract: Aerial robots hold great potential for aiding Search and Rescue (SAR) efforts over large areas. Traditional approaches typically searches an area exhaustively, thereby ignoring that the density of victims varies based on predictable factors, such as the terrain, population density and the type of disaster. We present a probabilistic model to automate SAR planning, with explicit minimization of the… ▽ More

    Submitted 25 March, 2019; originally announced March 2019.