Skip to main content

Showing 1–19 of 19 results for author: Hertrich, C

.
  1. arXiv:2505.16716  [pdf, other

    cs.CC cs.DM cs.LG cs.NE math.CO

    The Computational Complexity of Counting Linear Regions in ReLU Neural Networks

    Authors: Moritz Stargalla, Christoph Hertrich, Daniel Reichman

    Abstract: An established measure of the expressive power of a given ReLU neural network is the number of linear regions into which it partitions the input space. There exist many different, non-equivalent definitions of what a linear region actually is. We systematically assess which papers use which definitions and discuss how they relate to each other. We then analyze the computational complexity of count… ▽ More

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 25 pages

  2. arXiv:2505.14338  [pdf, ps, other

    cs.LG cs.DM cs.NE math.CO

    Better Neural Network Expressivity: Subdividing the Simplex

    Authors: Egor Bakaev, Florestan Brunck, Christoph Hertrich, Jack Stade, Amir Yehudayoff

    Abstract: This work studies the expressivity of ReLU neural networks with a focus on their depth. A sequence of previous works showed that $\lceil \log_2(n+1) \rceil$ hidden layers are sufficient to compute all continuous piecewise linear (CPWL) functions on $\mathbb{R}^n$. Hertrich, Basu, Di Summa, and Skutella (NeurIPS'21) conjectured that this result is optimal in the sense that there are CPWL functions… ▽ More

    Submitted 20 May, 2025; originally announced May 2025.

    Comments: 11 pages, 1 figure

  3. arXiv:2505.06169  [pdf, ps, other

    cs.LG cs.DM cs.NE math.CO

    On the Depth of Monotone ReLU Neural Networks and ICNNs

    Authors: Egor Bakaev, Florestan Brunck, Christoph Hertrich, Daniel Reichman, Amir Yehudayoff

    Abstract: We study two models of ReLU neural networks: monotone networks (ReLU$^+$) and input convex neural networks (ICNN). Our focus is on expressivity, mostly in terms of depth, and we prove the following lower bounds. For the maximum function MAX$_n$ computing the maximum of $n$ real numbers, we show that ReLU$^+$ networks cannot compute MAX$_n$, or even approximate it. We prove a sharp $n$ lower bound… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: 27 pages, 17 figures

  4. arXiv:2502.09324  [pdf, other

    cs.LG cs.DM cs.NE math.CO

    Depth-Bounds for Neural Networks via the Braid Arrangement

    Authors: Moritz Grillo, Christoph Hertrich, Georg Loho

    Abstract: We contribute towards resolving the open question of how many hidden layers are required in ReLU networks for exactly representing all continuous and piecewise linear functions on $\mathbb{R}^d$. While the question has been resolved in special cases, the best known lower bound in general is still 2. We focus on neural networks that are compatible with certain polyhedral complexes, more precisely w… ▽ More

    Submitted 13 February, 2025; originally announced February 2025.

  5. arXiv:2411.03006  [pdf, ps, other

    math.CO cs.CC cs.DM cs.LG math.OC

    Neural Networks and (Virtual) Extended Formulations

    Authors: Christoph Hertrich, Georg Loho

    Abstract: Neural networks with piecewise linear activation functions, such as rectified linear units (ReLU) or maxout, are among the most fundamental models in modern machine learning. We make a step towards proving lower bounds on the size of such neural networks by linking their representative capabilities to the notion of the extension complexity $\mathrm{xc}(P)$ of a polytope $P$. This is a well-studied… ▽ More

    Submitted 11 February, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

  6. arXiv:2410.04907  [pdf, other

    math.CO cs.DM cs.LG cs.NE math.OC

    Decomposition Polyhedra of Piecewise Linear Functions

    Authors: Marie-Charlotte Brandenburg, Moritz Grillo, Christoph Hertrich

    Abstract: In this paper we contribute to the frequently studied question of how to decompose a continuous piecewise linear (CPWL) function into a difference of two convex CPWL functions. Every CPWL function has infinitely many such decompositions, but for applications in optimization and neural network theory, it is crucial to find decompositions with as few linear pieces as possible. This is a highly chall… ▽ More

    Submitted 7 October, 2024; originally announced October 2024.

  7. arXiv:2311.01959  [pdf, ps, other

    math.OC cs.DS

    A First Order Method for Linear Programming Parameterized by Circuit Imbalance

    Authors: Richard Cole, Christoph Hertrich, Yixin Tao, László A. Végh

    Abstract: Various first order approaches have been proposed in the literature to solve Linear Programming (LP) problems, recently leading to practically efficient solvers for large-scale LPs. From a theoretical perspective, linear convergence rates have been established for first order LP algorithms, despite the fact that the underlying formulations are not strongly convex. However, the convergence rate typ… ▽ More

    Submitted 27 March, 2024; v1 submitted 3 November, 2023; originally announced November 2023.

  8. arXiv:2305.11005  [pdf, other

    cs.GT cs.LG cs.NE stat.ML

    Mode Connectivity in Auction Design

    Authors: Christoph Hertrich, Yixin Tao, László A. Végh

    Abstract: Optimal auction design is a fundamental problem in algorithmic game theory. This problem is notoriously difficult already in very simple settings. Recent work in differentiable economics showed that neural networks can efficiently learn known optimal auction mechanisms and discover interesting new ones. In an attempt to theoretically justify their empirical success, we focus on one of the first su… ▽ More

    Submitted 17 July, 2024; v1 submitted 18 May, 2023; originally announced May 2023.

    Comments: Conference paper published at NeurIPS 2023

  9. arXiv:2303.17045  [pdf, ps, other

    cs.CC cs.DS cs.LG cs.NE stat.ML

    Training Neural Networks is NP-Hard in Fixed Dimension

    Authors: Vincent Froese, Christoph Hertrich

    Abstract: We study the parameterized complexity of training two-layer neural networks with respect to the dimension of the input data and the number of hidden neurons, considering ReLU and linear threshold activation functions. Albeit the computational complexity of these problems has been studied numerous times in recent years, several questions are still open. We answer questions by Arora et al. [ICLR '18… ▽ More

    Submitted 18 January, 2024; v1 submitted 29 March, 2023; originally announced March 2023.

    Comments: Paper accepted at NeurIPS 2023

  10. arXiv:2302.12553  [pdf, ps, other

    cs.LG cs.DM cs.NE math.CO stat.ML

    Lower Bounds on the Depth of Integral ReLU Neural Networks via Lattice Polytopes

    Authors: Christian Haase, Christoph Hertrich, Georg Loho

    Abstract: We prove that the set of functions representable by ReLU neural networks with integer weights strictly increases with the network depth while allowing arbitrary width. More precisely, we show that $\lceil\log_2(n)\rceil$ hidden layers are indeed necessary to compute the maximum of $n$ numbers, matching known upper bounds. Our results are based on the known duality between neural networks and Newto… ▽ More

    Submitted 24 February, 2023; originally announced February 2023.

    Comments: ICLR 2023 conference paper

  11. arXiv:2204.01368  [pdf, other

    cs.CC cs.LG cs.NE

    Training Fully Connected Neural Networks is $\exists\mathbb{R}$-Complete

    Authors: Daniel Bertschinger, Christoph Hertrich, Paul Jungeblut, Tillmann Miltzow, Simon Weber

    Abstract: We consider the problem of finding weights and biases for a two-layer fully connected neural network to fit a given set of data points as well as possible, also known as EmpiricalRiskMinimization. Our main result is that the associated decision problem is $\exists\mathbb{R}$-complete, that is, polynomial-time equivalent to determining whether a multivariate polynomial with integer coefficients has… ▽ More

    Submitted 22 March, 2024; v1 submitted 4 April, 2022; originally announced April 2022.

    Comments: 39 pages, 17 figures. Changes in version 2: Added algebraic universality result, improved interpretation of results Changes in version 3: Improved exposition by formalizing properties of gadgets

  12. arXiv:2105.14835  [pdf, ps, other

    cs.LG cs.DM cs.NE math.CO stat.ML

    Towards Lower Bounds on the Depth of ReLU Neural Networks

    Authors: Christoph Hertrich, Amitabh Basu, Marco Di Summa, Martin Skutella

    Abstract: We contribute to a better understanding of the class of functions that can be represented by a neural network with ReLU activations and a given architecture. Using techniques from mixed-integer optimization, polyhedral theory, and tropical geometry, we provide a mathematical counterbalance to the universal approximation theorems which suggest that a single hidden layer is sufficient for learning a… ▽ More

    Submitted 17 July, 2024; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: Authors' accepted manuscript for SIAM Journal on Discrete Mathematics. A preliminary conference version appeared at NeurIPS 2021

    Journal ref: SIAM Journal on Discrete Mathematics 2023 37:2, 997-1029

  13. arXiv:2105.08675  [pdf, ps, other

    cs.LG cs.CC cs.DS cs.NE stat.ML

    The Computational Complexity of ReLU Network Training Parameterized by Data Dimensionality

    Authors: Vincent Froese, Christoph Hertrich, Rolf Niedermeier

    Abstract: Understanding the computational complexity of training simple neural networks with rectified linear units (ReLUs) has recently been a subject of intensive research. Closing gaps and complementing results from the literature, we present several results on the parameterized complexity of training two-layer ReLU networks with respect to various loss functions. After a brief discussion of other parame… ▽ More

    Submitted 23 August, 2022; v1 submitted 18 May, 2021; originally announced May 2021.

    Journal ref: Journal of Artificial Intelligence Research 74 (2022): 1775-1790

  14. arXiv:2102.06635  [pdf, other

    cs.LG cs.CC cs.DS cs.NE stat.ML

    ReLU Neural Networks of Polynomial Size for Exact Maximum Flow Computation

    Authors: Christoph Hertrich, Leon Sering

    Abstract: This paper studies the expressive power of artificial neural networks with rectified linear units. In order to study them as a model of real-valued computation, we introduce the concept of Max-Affine Arithmetic Programs and show equivalence between them and neural networks concerning natural complexity measures. We then use this result to show that two fundamental combinatorial optimization proble… ▽ More

    Submitted 17 July, 2024; v1 submitted 12 February, 2021; originally announced February 2021.

    Comments: Authors' accepted manuscript for Mathematical Programming (2024). A short version appeared in the proceedings of IPCO 2023

  15. arXiv:2008.09692  [pdf, other

    math.CO cs.DM

    Coloring Drawings of Graphs

    Authors: Christoph Hertrich, Felix Schröder, Raphael Steiner

    Abstract: We consider cell colorings of drawings of graphs in the plane. Given a multi-graph $G$ together with a drawing $Γ(G)$ in the plane with only finitely many crossings, we define a cell $k$-coloring of $Γ(G)$ to be a coloring of the maximal connected regions of the drawing, the cells, with $k$ colors such that adjacent cells have different colors. By the $4$-color theorem, every drawing of a bridge… ▽ More

    Submitted 26 August, 2022; v1 submitted 21 August, 2020; originally announced August 2020.

    Comments: 35 pages, 23 figures

    MSC Class: 05C10 05C15 (Primary) 05C45 (Secondary) ACM Class: F.2.2; G.2.2

    Journal ref: The Electronic Journal of Combinatorics 29(1) (2022), #P1.17

  16. Scheduling a Proportionate Flow Shop of Batching Machines

    Authors: Christoph Hertrich, Christian Weiß, Heiner Ackermann, Sandy Heydrich, Sven O. Krumke

    Abstract: In this paper we study a proportionate flow shop of batching machines with release dates and a fixed number $m \geq 2$ of machines. The scheduling problem has so far barely received any attention in the literature, but recently its importance has increased significantly, due to applications in the industrial scaling of modern bio-medicine production processes. We show that for any fixed number of… ▽ More

    Submitted 26 November, 2020; v1 submitted 17 June, 2020; originally announced June 2020.

    Comments: Version 2: replace initial preprint with authors' accepted manuscript

    Journal ref: Journal of Scheduling 23, 575-593 (2020)

  17. arXiv:2005.14105  [pdf, other

    cs.LG cs.CC cs.DM cs.NE stat.ML

    Provably Good Solutions to the Knapsack Problem via Neural Networks of Bounded Size

    Authors: Christoph Hertrich, Martin Skutella

    Abstract: The development of a satisfying and rigorous mathematical understanding of the performance of neural networks is a major challenge in artificial intelligence. Against this background, we study the expressive power of neural networks through the example of the classical NP-hard Knapsack Problem. Our main contribution is a class of recurrent neural networks (RNNs) with rectified linear units that ar… ▽ More

    Submitted 11 July, 2024; v1 submitted 28 May, 2020; originally announced May 2020.

    Comments: Authors' accepted manuscript for the INFORMS Journal on Computing. A short version of this paper appeared in the proceedings of AAAI 2021

    Journal ref: INFORMS Journal on Computing 35(5):1079-1097 (2023)

  18. Online Algorithms to Schedule a Proportionate Flexible Flow Shop of Batching Machines

    Authors: Christoph Hertrich, Christian Weiß, Heiner Ackermann, Sandy Heydrich, Sven O. Krumke

    Abstract: This paper is the first to consider online algorithms to schedule a proportionate flexible flow shop of batching machines (PFFB). The scheduling model is motivated by manufacturing processes of individualized medicaments, which are used in modern medicine to treat some serious illnesses. We provide two different online algorithms, proving also lower bounds for the offline problem to compute their… ▽ More

    Submitted 17 July, 2024; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: Authors' accepted manuscript

    Journal ref: Journal of Scheduling 25, 643-657 (2022)

  19. Sweep Algorithms for the Capacitated Vehicle Routing Problem with Structured Time Window

    Authors: Christoph Hertrich, Philipp Hungerländer, Christian Truden

    Abstract: The capacitated Vehicle Routing Problem with structured Time Windows (cVRPsTW) is concerned with finding optimal tours for vehicles with given capacity constraints to deliver goods to customers within assigned time windows. In our problem variant these time windows have a special structure, namely they are non-overlapping and each time window holds several customers. This is a reasonable assumptio… ▽ More

    Submitted 9 January, 2019; originally announced January 2019.