Skip to main content

Showing 1–12 of 12 results for author: Martinkus, K

.
  1. arXiv:2410.05980  [pdf, other

    cs.LG

    Generalizing to any diverse distribution: uniformity, gentle finetuning and rebalancing

    Authors: Andreas Loukas, Karolis Martinkus, Ed Wagstaff, Kyunghyun Cho

    Abstract: As training datasets grow larger, we aspire to develop models that generalize well to any diverse test distribution, even if the latter deviates significantly from the training data. Various approaches like domain adaptation, domain generalization, and robust optimization attempt to address the out-of-distribution challenge by posing assumptions about the relation between training and test distrib… ▽ More

    Submitted 8 October, 2024; originally announced October 2024.

  2. arXiv:2312.11529  [pdf, other

    cs.SI cs.LG

    Efficient and Scalable Graph Generation through Iterative Local Expansion

    Authors: Andreas Bergmeister, Karolis Martinkus, Nathanaël Perraudin, Roger Wattenhofer

    Abstract: In the realm of generative models for graphs, extensive research has been conducted. However, most existing methods struggle with large graphs due to the complexity of representing the entire joint distribution across all node pairs and capturing both global and local graph structures simultaneously. To overcome these issues, we introduce a method that generates a graph by progressively expanding… ▽ More

    Submitted 14 May, 2024; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: Published as a conference paper at ICLR 2024

  3. arXiv:2308.05027  [pdf, other

    q-bio.BM cs.LG stat.ML

    AbDiffuser: Full-Atom Generation of in vitro Functioning Antibodies

    Authors: Karolis Martinkus, Jan Ludwiczak, Kyunghyun Cho, Wei-Ching Liang, Julien Lafrance-Vanasse, Isidro Hotzel, Arvind Rajpal, Yan Wu, Richard Bonneau, Vladimir Gligorijevic, Andreas Loukas

    Abstract: We introduce AbDiffuser, an equivariant and physics-informed diffusion model for the joint generation of antibody 3D structures and sequences. AbDiffuser is built on top of a new representation of protein structure, relies on a novel architecture for aligned proteins, and utilizes strong diffusion priors to improve the denoising process. Our approach improves protein diffusion by taking advantage… ▽ More

    Submitted 6 March, 2024; v1 submitted 28 July, 2023; originally announced August 2023.

    Comments: NeurIPS 2023

  4. arXiv:2304.12895  [pdf, ps, other

    cs.LG cs.NE

    Discovering Graph Generation Algorithms

    Authors: Mihai Babiac, Karolis Martinkus, Roger Wattenhofer

    Abstract: We provide a novel approach to construct generative models for graphs. Instead of using the traditional probabilistic models or deep generative models, we propose to instead find an algorithm that generates the data. We achieve this using evolutionary search and a powerful fitness function, implemented by a randomly initialized graph neural network. This brings certain advantages over current deep… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

  5. arXiv:2211.13219  [pdf, other

    cs.GR cs.AI cs.LG cs.NE cs.RO

    Automating Rigid Origami Design

    Authors: Jeremia Geiger, Karolis Martinkus, Oliver Richter, Roger Wattenhofer

    Abstract: Rigid origami has shown potential in large diversity of practical applications. However, current rigid origami crease pattern design mostly relies on known tessellations. This strongly limits the diversity and novelty of patterns that can be created. In this work, we build upon the recently developed principle of three units method to formulate rigid origami design as a discrete optimization probl… ▽ More

    Submitted 28 April, 2023; v1 submitted 20 November, 2022; originally announced November 2022.

    Comments: IJCAI 2023 AI, Arts & Creativity Special Track

  6. arXiv:2210.01549  [pdf, other

    cs.LG cs.SI stat.ML

    Diffusion Models for Graphs Benefit From Discrete State Spaces

    Authors: Kilian Konstantin Haefeli, Karolis Martinkus, Nathanaël Perraudin, Roger Wattenhofer

    Abstract: Denoising diffusion probabilistic models and score-matching models have proven to be very powerful for generative tasks. While these approaches have also been applied to the generation of discrete graphs, they have, so far, relied on continuous Gaussian perturbations. Instead, in this work, we suggest using discrete noise for the forward Markov process. This ensures that in every intermediate step… ▽ More

    Submitted 15 August, 2023; v1 submitted 4 October, 2022; originally announced October 2022.

    Comments: Presented at the First Learning on Graphs Conference (LoG 2022) and the NeurIPS 2022 New Frontiers in Graph Learning Workshop (NeurIPS GLFrontiers 2022)

  7. arXiv:2206.11010  [pdf, other

    cs.LG cs.AI stat.ML

    Agent-based Graph Neural Networks

    Authors: Karolis Martinkus, Pál András Papp, Benedikt Schesch, Roger Wattenhofer

    Abstract: We present a novel graph neural network we call AgentNet, which is designed specifically for graph-level tasks. AgentNet is inspired by sublinear algorithms, featuring a computational complexity that is independent of the graph size. The architecture of AgentNet differs fundamentally from the architectures of traditional graph neural networks. In AgentNet, some trained \textit{neural agents} intel… ▽ More

    Submitted 27 February, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

    Comments: 32 pages, 6 figures, ICLR 2023

  8. arXiv:2205.13234  [pdf, other

    cs.LG

    DT+GNN: A Fully Explainable Graph Neural Network using Decision Trees

    Authors: Peter Müller, Lukas Faber, Karolis Martinkus, Roger Wattenhofer

    Abstract: We propose the fully explainable Decision Tree Graph Neural Network (DT+GNN) architecture. In contrast to existing black-box GNNs and post-hoc explanation methods, the reasoning of DT+GNN can be inspected at every step. To achieve this, we first construct a differentiable GNN layer, which uses a categorical state space for nodes and messages. This allows us to convert the trained MLPs in the GNN i… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  9. arXiv:2204.01613  [pdf, other

    cs.LG cs.AI stat.ML

    SPECTRE: Spectral Conditioning Helps to Overcome the Expressivity Limits of One-shot Graph Generators

    Authors: Karolis Martinkus, Andreas Loukas, Nathanaël Perraudin, Roger Wattenhofer

    Abstract: We approach the graph generation problem from a spectral perspective by first generating the dominant parts of the graph Laplacian spectrum and then building a graph matching these eigenvalues and eigenvectors. Spectral conditioning allows for direct modeling of the global and local graph structure and helps to overcome the expressivity and mode collapse issues of one-shot graph generators. Our no… ▽ More

    Submitted 18 June, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: The 39th International Conference on Machine Learning (ICML 2022), 21 pages, 10 figures

  10. arXiv:2111.06283  [pdf, other

    cs.LG

    DropGNN: Random Dropouts Increase the Expressiveness of Graph Neural Networks

    Authors: Pál András Papp, Karolis Martinkus, Lukas Faber, Roger Wattenhofer

    Abstract: This paper studies Dropout Graph Neural Networks (DropGNNs), a new approach that aims to overcome the limitations of standard GNN frameworks. In DropGNNs, we execute multiple runs of a GNN on the input graph, with some of the nodes randomly and independently dropped in each of these runs. Then, we combine the results of these runs to obtain the final result. We prove that DropGNNs can distinguish… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

    Comments: Published in the 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

    MSC Class: 68T07

  11. arXiv:2109.13304  [pdf, other

    cs.CL

    On Isotropy Calibration of Transformers

    Authors: Yue Ding, Karolis Martinkus, Damian Pascual, Simon Clematide, Roger Wattenhofer

    Abstract: Different studies of the embedding space of transformer models suggest that the distribution of contextual representations is highly anisotropic - the embeddings are distributed in a narrow cone. Meanwhile, static word representations (e.g., Word2Vec or GloVe) have been shown to benefit from isotropic spaces. Therefore, previous work has developed methods to calibrate the embedding space of transf… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

  12. arXiv:2010.06948  [pdf, other

    cs.LG astro-ph.IM physics.comp-ph stat.ML

    Scalable Graph Networks for Particle Simulations

    Authors: Karolis Martinkus, Aurelien Lucchi, Nathanaël Perraudin

    Abstract: Learning system dynamics directly from observations is a promising direction in machine learning due to its potential to significantly enhance our ability to understand physical systems. However, the dynamics of many real-world systems are challenging to learn due to the presence of nonlinear potentials and a number of interactions that scales quadratically with the number of particles $N$, as in… ▽ More

    Submitted 20 March, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: 19 pages, 20 figures, AAAI 2021