Showing 1–23 of 23 results for author: Nickel, M

  1. arXiv:2411.13653  [pdf, other]

    cs.AI stat.ML

    No Free Delivery Service: Epistemic limits of passive data collection in complex social systems

    Authors: Maximilian Nickel

    Abstract: Rapid model validation via the train-test paradigm has been a key driver for the breathtaking progress in machine learning and AI. However, modern AI systems often depend on a combination of tasks and data collection practices that violate all assumptions ensuring test validity. Yet, without rigorous model validation we cannot ensure the intended outcomes of deployed AI systems, including positive…

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: To appear in NeurIPS'24

    MSC Class: 62A01 ACM Class: G.3; I.2.0

  2. arXiv:2406.01611  [pdf, other]

    cs.IR cs.LG stat.ML

    System-2 Recommenders: Disentangling Utility and Engagement in Recommendation Systems via Temporal Point-Processes

    Authors: Arpit Agarwal, Nicolas Usunier, Alessandro Lazaric, Maximilian Nickel

    Abstract: Recommender systems are an important part of the modern human experience whose influence ranges from the food we eat to the news we read. Yet, there is still debate as to what extent recommendation platforms are aligned with the user goals. A core issue fueling this debate is the challenge of inferring a user utility based on engagement signals such as likes, shares, watch time etc., which are the…

    Submitted 29 May, 2024; originally announced June 2024.

    Comments: Accepted at FAccT'24

  3. arXiv:2310.02233  [pdf, other]

    stat.ML cs.LG math.OC

    Generalized Schrödinger Bridge Matching

    Authors: Guan-Horng Liu, Yaron Lipman, Maximilian Nickel, Brian Karrer, Evangelos A. Theodorou, Ricky T. Q. Chen

    Abstract: Modern distribution matching algorithms for training diffusion or flow models directly prescribe the time evolution of the marginal distributions between two boundary distributions. In this work, we consider a generalized distribution matching setup, where these marginals are only implicitly described as a solution to some task-specific objective function. The problem setup, known as the Generaliz…

    Submitted 18 April, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Camera Ready

  4. arXiv:2309.09924  [pdf, other]

    cs.LG eess.SP stat.ML

    DYMAG: Rethinking Message Passing Using Dynamical-systems-based Waveforms

    Authors: Dhananjay Bhaskar, Xingzhi Sun, Yanlei Zhang, Charles Xu, Arman Afrasiyabi, Siddharth Viswanath, Oluwadamilola Fasina, Maximilian Nickel, Guy Wolf, Michael Perlmutter, Smita Krishnaswamy

    Abstract: We present DYMAG, a graph neural network based on a novel form of message aggregation. Standard message-passing neural networks, which often aggregate local neighbors via mean-aggregation, can be regarded as convolving with a simple rectangular waveform which is non-zero only on 1-hop neighbors of every vertex. Here, we go beyond such local averaging. We will convolve the node features with more s…

    Submitted 26 May, 2025; v1 submitted 18 September, 2023; originally announced September 2023.

  5. arXiv:2306.06626  [pdf, other]

    cs.LG stat.ML

    On Kinetic Optimal Probability Paths for Generative Models

    Authors: Neta Shaul, Ricky T. Q. Chen, Maximilian Nickel, Matt Le, Yaron Lipman

    Abstract: Recent successful generative models are trained by fitting a neural network to an a-priori defined tractable probability density path taking noise to training examples. In this paper we investigate the space of Gaussian probability paths, which includes diffusion paths as an instance, and look for an optimal member in some useful sense. In particular, minimizing the Kinetic Energy (KE) of a path i…

    Submitted 11 June, 2023; originally announced June 2023.

  6. arXiv:2212.13659  [pdf, other]

    cs.LG stat.ML

    Latent Discretization for Continuous-time Sequence Compression

    Authors: Ricky T. Q. Chen, Matthew Le, Matthew Muckley, Maximilian Nickel, Karen Ullrich

    Abstract: Neural compression offers a domain-agnostic approach to creating codecs for lossy or lossless compression via deep generative models. For sequence compression, however, most deep sequence models have costs that scale with the sequence length rather than the sequence complexity. In this work, we instead treat data sequences as observations from an underlying continuous-time process and learn how to…

    Submitted 27 December, 2022; originally announced December 2022.

  7. arXiv:2210.02747  [pdf, other]

    cs.LG cs.AI stat.ML

    Flow Matching for Generative Modeling

    Authors: Yaron Lipman, Ricky T. Q. Chen, Heli Ben-Hamu, Maximilian Nickel, Matt Le

    Abstract: We introduce a new paradigm for generative modeling built on Continuous Normalizing Flows (CNFs), allowing us to train CNFs at unprecedented scale. Specifically, we present the notion of Flow Matching (FM), a simulation-free approach for training CNFs based on regressing vector fields of fixed conditional probability paths. Flow Matching is compatible with a general family of Gaussian probability…

    Submitted 8 February, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

  8. arXiv:2207.04711  [pdf, other]

    stat.ML cs.LG

    Matching Normalizing Flows and Probability Paths on Manifolds

    Authors: Heli Ben-Hamu, Samuel Cohen, Joey Bose, Brandon Amos, Aditya Grover, Maximilian Nickel, Ricky T. Q. Chen, Yaron Lipman

    Abstract: Continuous Normalizing Flows (CNFs) are a class of generative models that transform a prior distribution to a model distribution by solving an ordinary differential equation (ODE). We propose to train CNFs on manifolds by minimizing probability path divergence (PPD), a novel family of divergences between the probability density path generated by the CNF and a target probability density path. PPD i…

    Submitted 11 July, 2022; originally announced July 2022.

    Comments: ICML 2022

  9. arXiv:2203.06832  [pdf, other]

    cs.LG stat.ML

    Semi-Discrete Normalizing Flows through Differentiable Tessellation

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: Mapping between discrete and continuous distributions is a difficult task and many have had to resort to heuristic approaches. We propose a tessellation-based approach that directly learns quantization boundaries in a continuous space, complete with exact likelihood evaluations. This is done through constructing normalizing flows on convex polytopes parameterized using a simple homeomorphism wit…

    Submitted 11 December, 2022; v1 submitted 13 March, 2022; originally announced March 2022.

    Journal ref: NeurIPS 2022

  10. arXiv:2108.08052  [pdf, other]

    stat.ML cs.AI cs.LG

    Moser Flow: Divergence-based Generative Modeling on Manifolds

    Authors: Noam Rozen, Aditya Grover, Maximilian Nickel, Yaron Lipman

    Abstract: We are interested in learning generative models for complex geometries described via manifolds, such as spheres, tori, and other implicit surfaces. Current extensions of existing (Euclidean) generative models are restricted to specific geometries and typically suffer from high computational costs. We introduce Moser Flow (MF), a new class of generative models within the family of continuous normal…

    Submitted 2 November, 2021; v1 submitted 18 August, 2021; originally announced August 2021.

  11. arXiv:2011.03902  [pdf, other]

    cs.LG stat.ML

    Learning Neural Event Functions for Ordinary Differential Equations

    Authors: Ricky T. Q. Chen, Brandon Amos, Maximilian Nickel

    Abstract: The existing Neural ODE formulation relies on an explicit knowledge of the termination time. We extend Neural ODEs to implicitly defined termination criteria modeled by neural event functions, which can be chained together and differentiated through. Neural Event ODEs are capable of modeling discrete and instantaneous changes in a continuous-time system, without prior knowledge of when these chang…

    Submitted 27 October, 2021; v1 submitted 7 November, 2020; originally announced November 2020.

    Journal ref: ICLR 2021

  12. arXiv:2006.10605  [pdf, other]

    stat.ML cs.LG

    Riemannian Continuous Normalizing Flows

    Authors: Emile Mathieu, Maximilian Nickel

    Abstract: Normalizing flows have shown great promise for modelling flexible probability distributions in a computationally tractable way. However, whilst data is often naturally described on Riemannian manifolds such as spheres, tori, and hyperbolic spaces, most normalizing flows implicitly assume a flat geometry, making them either misspecified or ill-suited in these situations. To overcome this problem,…

    Submitted 9 December, 2020; v1 submitted 18 June, 2020; originally announced June 2020.

    Comments: camera ready NeurIPS 2020

  13. arXiv:2005.13930  [pdf, other]

    cs.LG cs.CL stat.ML

    Variational Autoencoder with Embedded Student-$t$ Mixture Model for Authorship Attribution

    Authors: Benedikt Boenninghoff, Steffen Zeiler, Robert M. Nickel, Dorothea Kolossa

    Abstract: Traditional computational authorship attribution describes a classification task in a closed-set scenario. Given a finite set of candidate authors and corresponding labeled texts, the objective is to determine which of the authors has written another set of anonymous or disputed texts. In this work, we propose a probabilistic autoencoding framework to deal with this supervised classification task.…

    Submitted 28 May, 2020; originally announced May 2020.

    Comments: Preprint

  14. arXiv:2002.12501  [pdf, other]

    cs.LG cs.SI stat.ML

    Learning Multivariate Hawkes Processes at Scale

    Authors: Maximilian Nickel, Matthew Le

    Abstract: Multivariate Hawkes Processes (MHPs) are an important class of temporal point processes that have enabled key advances in understanding and predicting social information systems. However, due to their complex modeling of temporal dependencies, MHPs have proven to be notoriously difficult to scale, which has limited their applications to relatively small domains. In this work, we propose a novel mod…

    Submitted 27 February, 2020; originally announced February 2020.

  15. arXiv:1910.12892  [pdf, other]

    cs.LG stat.ML

    Hyperbolic Graph Neural Networks

    Authors: Qi Liu, Maximilian Nickel, Douwe Kiela

    Abstract: Learning from graph-structured data is an important task in machine learning and artificial intelligence, for which Graph Neural Networks (GNNs) have shown great promise. Motivated by recent advances in geometric representation learning, we propose a novel GNN architecture for learning representations on Riemannian manifolds with differentiable exponential and logarithmic maps. We develop a scalab…

    Submitted 28 October, 2019; originally announced October 2019.

    Comments: Published at NeurIPS 2019

  16. arXiv:1908.07844  [pdf, ps, other]

    cs.CL cs.LG stat.ML

    Similarity Learning for Authorship Verification in Social Media

    Authors: Benedikt Boenninghoff, Robert M. Nickel, Steffen Zeiler, Dorothea Kolossa

    Abstract: Authorship verification tries to answer the question of whether two documents with unknown authors were written by the same author or not. A range of successful technical approaches has been proposed for this task, many of which are based on traditional linguistic features such as n-grams. These algorithms achieve good results for certain types of written documents like books and novels. Forensic authorsh…

    Submitted 20 August, 2019; originally announced August 2019.

    Comments: 5 pages, 3 figures, 1 table, presented at ICASSP 2019 in Brighton, UK

  17. arXiv:1806.03417  [pdf, other]

    cs.AI cs.LG stat.ML

    Learning Continuous Hierarchies in the Lorentz Model of Hyperbolic Geometry

    Authors: Maximilian Nickel, Douwe Kiela

    Abstract: We are concerned with the discovery of hierarchical relationships from large-scale unstructured similarity scores. For this purpose, we study different models of hyperbolic space and find that learning embeddings in the Lorentz model is substantially more efficient than in the Poincaré-ball model. We show that the proposed approach allows us to learn high-quality embeddings of large taxonomies whi…

    Submitted 8 July, 2018; v1 submitted 9 June, 2018; originally announced June 2018.

    Comments: Accepted at ICML'18

    ACM Class: I.2.0

  18. arXiv:1710.10881  [pdf, ps, other]

    stat.ML cs.LG

    Fast Linear Model for Knowledge Graph Embeddings

    Authors: Armand Joulin, Edouard Grave, Piotr Bojanowski, Maximilian Nickel, Tomas Mikolov

    Abstract: This paper shows that a simple baseline based on a Bag-of-Words (BoW) representation learns surprisingly good knowledge graph embeddings. By casting knowledge base completion and question answering as supervised classification problems, we observe that modeling co-occurrences of entities and relations leads to state-of-the-art performance with a training time of a few minutes using the open sourced…

    Submitted 30 October, 2017; originally announced October 2017.

    Comments: Submitted to AKBC 2017

  19. arXiv:1707.01475  [pdf, other]

    cs.LG stat.ML

    Complex and Holographic Embeddings of Knowledge Graphs: A Comparison

    Authors: Théo Trouillon, Maximilian Nickel

    Abstract: Embeddings of knowledge graphs have received significant attention due to their excellent performance for tasks like link prediction and entity resolution. In this short paper, we are providing a comparison of two state-of-the-art knowledge graph embeddings for which their equivalence has recently been established, i.e., ComplEx and HolE [Nickel, Rosasco, and Poggio, 2016; Trouillon et al., 2016;…

    Submitted 23 July, 2017; v1 submitted 5 July, 2017; originally announced July 2017.

  20. arXiv:1705.08039  [pdf, other]

    cs.AI cs.LG stat.ML

    Poincaré Embeddings for Learning Hierarchical Representations

    Authors: Maximilian Nickel, Douwe Kiela

    Abstract: Representation learning has become an invaluable approach for learning from symbolic data such as text and graphs. However, while complex symbolic datasets often exhibit a latent hierarchical structure, state-of-the-art methods typically learn embeddings in Euclidean vector spaces, which do not account for this property. For this purpose, we introduce a new approach for learning hierarchical repre…

    Submitted 26 May, 2017; v1 submitted 22 May, 2017; originally announced May 2017.

  21. arXiv:1510.04935  [pdf, other]

    cs.AI cs.LG stat.ML

    Holographic Embeddings of Knowledge Graphs

    Authors: Maximilian Nickel, Lorenzo Rosasco, Tomaso Poggio

    Abstract: Learning embeddings of entities and relations is an efficient and versatile method to perform machine learning on relational data such as knowledge graphs. In this work, we propose holographic embeddings (HolE) to learn compositional vector space representations of entire knowledge graphs. The proposed method is related to holographic models of associative memory in that it employs circular correl…

    Submitted 7 December, 2015; v1 submitted 16 October, 2015; originally announced October 2015.

    Comments: To appear in AAAI-16

    ACM Class: I.2.6; I.2.4

  22. A Review of Relational Machine Learning for Knowledge Graphs

    Authors: Maximilian Nickel, Kevin Murphy, Volker Tresp, Evgeniy Gabrilovich

    Abstract: Relational machine learning studies methods for the statistical analysis of relational, or graph-structured, data. In this paper, we provide a review of how such statistical models can be "trained" on large knowledge graphs, and then used to predict new facts about the world (which is equivalent to predicting new edges in the graph). In particular, we discuss two fundamentally different kinds of s…

    Submitted 28 September, 2015; v1 submitted 2 March, 2015; originally announced March 2015.

    Comments: To appear in Proceedings of the IEEE

  23. arXiv:1306.2084  [pdf, other]

    stat.ML cs.LG

    Logistic Tensor Factorization for Multi-Relational Data

    Authors: Maximilian Nickel, Volker Tresp

    Abstract: Tensor factorizations have become increasingly popular approaches for various learning tasks on structured data. In this work, we extend the RESCAL tensor factorization, which has shown state-of-the-art results for multi-relational learning, to account for the binary nature of adjacency tensors. We study the improvements that can be gained via this approach on various benchmark datasets and show t…

    Submitted 9 June, 2013; originally announced June 2013.

    Comments: Accepted at ICML 2013 Workshop "Structured Learning: Inferring Graphs from Structured and Unstructured Inputs" (SLG 2013)