
Showing 1–13 of 13 results for author: Laue, S

Searching in archive cs.
  1. arXiv:2405.10897  [pdf, other]

    math.OC cs.LG

    Efficient Line Search Method Based on Regression and Uncertainty Quantification

    Authors: Sören Laue, Tomislav Prusina

    Abstract: Unconstrained optimization problems are typically solved using iterative methods, which often depend on line search techniques to determine optimal step lengths in each iteration. This paper introduces a novel line search approach. Traditional line search methods, aimed at determining optimal step lengths, often discard valuable data from the search process and focus on refining step length interv…

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: To be featured in LION18 2024

  2. arXiv:2404.15848  [pdf, other]

    cs.CL cs.LG

    Detecting Conceptual Abstraction in LLMs

    Authors: Michaela Regneri, Alhassan Abdelhalim, Sören Laue

    Abstract: We present a novel approach to detecting noun abstraction within a large language model (LLM). Starting from a psychologically motivated set of noun pairs in taxonomic relationships, we instantiate surface patterns indicating hypernymy and analyze the attention matrices produced by BERT. We compare the results to two sets of counterfactuals and show that we can detect hypernymy in the abstraction…

    Submitted 25 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: Paper accepted at the LREC-COLING 2024 Conference (Paper ID: 1968) https://lrec-coling-2024.org/list-of-accepted-papers/

  3. arXiv:2301.01583  [pdf, other]

    cs.CV cs.AI cs.LG

    Why Capsule Neural Networks Do Not Scale: Challenging the Dynamic Parse-Tree Assumption

    Authors: Matthias Mitterreiter, Marcel Koch, Joachim Giesen, Sören Laue

    Abstract: Capsule neural networks replace simple, scalar-valued neurons with vector-valued capsules. They are motivated by the pattern recognition system in the human brain, where complex objects are decomposed into a hierarchy of simpler object parts. Such a hierarchy is referred to as a parse-tree. Conceptually, capsule neural networks have been defined to realize such parse-trees. The capsule neural netw…

    Submitted 4 January, 2023; originally announced January 2023.

    Comments: To appear in AAAI 2023

  4. arXiv:2210.10430  [pdf, ps, other]

    math.OC cs.LG cs.SC

    Convexity Certificates from Hessians

    Authors: Julien Klaus, Niklas Merk, Konstantin Wiedom, Sören Laue, Joachim Giesen

    Abstract: The Hessian of a differentiable convex function is positive semidefinite. Therefore, checking the Hessian of a given function is a natural approach to certify convexity. However, implementing this approach is not straightforward since it requires a representation of the Hessian that allows its analysis. Here, we implement this approach for a class of functions that is rich enough to support classi…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted for publication at NeurIPS 2022

  5. arXiv:2203.16340  [pdf, other]

    cs.LG math.OC stat.ML

    Optimization for Classical Machine Learning Problems on the GPU

    Authors: Sören Laue, Mark Blacher, Joachim Giesen

    Abstract: Constrained optimization problems arise frequently in classical machine learning. There exist frameworks addressing constrained optimization, for instance, CVXPY and GENO. However, in contrast to deep learning frameworks, GPU support is limited. Here, we extend the GENO framework to also solve constrained optimization problems on the GPU. The framework allows the user to specify constrained optimi…

    Submitted 30 March, 2022; originally announced March 2022.

    Comments: Appeared in AAAI 2022

  6. arXiv:2010.03313  [pdf, other]

    cs.LG cs.SC

    A Simple and Efficient Tensor Calculus for Machine Learning

    Authors: Sören Laue, Matthias Mitterreiter, Joachim Giesen

    Abstract: Computing derivatives of tensor expressions, also known as tensor calculus, is a fundamental task in machine learning. A key concern is the efficiency of evaluating the expressions and their derivatives that hinges on the representation of these expressions. Recently, an algorithm for computing higher order derivatives of tensor expressions like Jacobians or Hessians has been introduced that is a…

    Submitted 7 October, 2020; originally announced October 2020.

  7. arXiv:1905.13587  [pdf, other]

    cs.LG math.NA math.OC stat.ML

    GENO -- GENeric Optimization for Classical Machine Learning

    Authors: Sören Laue, Matthias Mitterreiter, Joachim Giesen

    Abstract: Although optimization is the longstanding algorithmic backbone of machine learning, new models still require the time-consuming implementation of new solvers. As a result, there are thousands of implementations of optimization algorithms for machine learning problems. A natural question is, if it is always necessary to implement a new solver, or if there is one algorithm that is sufficient for mos…

    Submitted 31 May, 2019; originally announced May 2019.

  8. arXiv:1904.02990  [pdf, other]

    cs.SC cs.LG

    On the Equivalence of Automatic and Symbolic Differentiation

    Authors: Soeren Laue

    Abstract: We show that reverse mode automatic differentiation and symbolic differentiation are equivalent in the sense that they both perform the same operations when computing derivatives. This is in stark contrast to the common claim that they are substantially different. The difference is often illustrated by claiming that symbolic differentiation suffers from "expression swell" whereas automatic differe…

    Submitted 5 December, 2022; v1 submitted 5 April, 2019; originally announced April 2019.

  9. arXiv:1610.02967  [pdf, other]

    math.OC cs.LG math.NA stat.ML

    Distributed Convex Optimization with Many Convex Constraints

    Authors: Joachim Giesen, Sören Laue

    Abstract: We address the problem of solving convex optimization problems with many convex constraints in a distributed setting. Our approach is based on an extension of the alternating direction method of multipliers (ADMM) that recently gained a lot of attention in the Big Data context. Although it has been invented decades ago, ADMM so far can be applied only to unconstrained problems and problems with li…

    Submitted 6 April, 2018; v1 submitted 7 October, 2016; originally announced October 2016.

  10. arXiv:1606.09481  [pdf, other]

    cs.DS

    Generating massive complex networks with hyperbolic geometry faster in practice

    Authors: Moritz von Looz, Mustafa Özdayi, Sören Laue, Henning Meyerhenke

    Abstract: Generative network models play an important role in algorithm development, scaling studies, network analysis, and realistic system benchmarks for graph data sets. The commonly used graph-based benchmark model R-MAT has some drawbacks concerning realism and the scaling behavior of network properties. A complex network model gaining considerable popularity builds random hyperbolic graphs, generated…

    Submitted 30 June, 2016; originally announced June 2016.

  11. arXiv:1206.4608  [pdf]

    cs.LG cs.DS math.NA stat.ML

    A Hybrid Algorithm for Convex Semidefinite Optimization

    Authors: Soeren Laue

    Abstract: We present a hybrid algorithm for optimizing a convex, smooth function over the cone of positive semidefinite matrices. Our algorithm converges to the global optimal solution and can be used to solve general large-scale semidefinite programs and hence can be readily applied to a variety of machine learning problems. We show experimental results on three machine learning problems (matrix completion…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  12. arXiv:0802.2861  [pdf, ps, other]

    cs.CG

    Geometric Set Cover and Hitting Sets for Polytopes in $R^3$

    Authors: Sören Laue

    Abstract: Suppose we are given a finite set of points $P$ in $\mathbb{R}^3$ and a collection of polytopes $\mathcal{T}$ that are all translates of the same polytope $T$. We consider two problems in this paper. The first is the set cover problem where we want to select a minimal number of polytopes from the collection $\mathcal{T}$ such that their union covers all input points $P$. The second problem that we consi…

    Submitted 20 February, 2008; originally announced February 2008.

    Journal ref: In Proceedings of the 25th Annual Symposium on the Theoretical Aspects of Computer Science - STACS 2008, Bordeaux, France (2008)

  13. arXiv:cs/0612121  [pdf, ps, other]

    cs.CG cs.AR cs.NI

    Power Assignment Problems in Wireless Communication

    Authors: Stefan Funke, Soeren Laue, Zvi Lotker, Rouven Naujoks

    Abstract: A fundamental class of problems in wireless communication is concerned with the assignment of suitable transmission powers to wireless devices/stations such that the resulting communication graph satisfies certain desired properties and the overall energy consumed is minimized. Many concrete communication tasks in a wireless network like broadcast, multicast, point-to-point routing, creation of…

    Submitted 22 December, 2006; originally announced December 2006.

    Comments: 13 pages