
Showing 1–12 of 12 results for author: Palagi, L

  1. arXiv:2411.15795  [pdf, ps, other]

    cs.LG math.OC

    Beyond adaptive gradient: Fast-Controlled Minibatch Algorithm for large-scale optimization

    Authors: Corrado Coppola, Lorenzo Papa, Irene Amerini, Laura Palagi

    Abstract: Adaptive gradient methods have been increasingly adopted by deep learning community due to their fast convergence and reduced sensitivity to hyper-parameters. However, these methods come with limitations, such as increased memory requirements for elements like moving averages and a poorly understood convergence theory. To overcome these challenges, we introduce F-CMA, a Fast-Controlled Mini-batch…

    Submitted 16 December, 2024; v1 submitted 24 November, 2024; originally announced November 2024.

    Comments: There is an error in the literature review, in section 1. In particular, we noticed that there is a wrong citation, the [65], which has been erroneously associated with another author's claims

  2. arXiv:2405.02089  [pdf, other]

    math.OC

    Computational issues in Optimization for Deep networks

    Authors: Corrado Coppola, Lorenzo Papa, Marco Boresta, Irene Amerini, Laura Palagi

    Abstract: The paper aims to investigate relevant computational issues of deep neural network architectures with an eye to the interaction between the optimization algorithm and the classification performance. In particular, we aim to analyze the behaviour of state-of-the-art optimization algorithms in relationship to their hyperparameters setting in order to detect robustness with respect to the choice of a…

    Submitted 3 May, 2024; originally announced May 2024.

  3. arXiv:2404.10099  [pdf, other]

    math.OC cs.LG

    Feature selection in linear SVMs via a hard cardinality constraint: a scalable SDP decomposition approach

    Authors: Immanuel Bomze, Federico D'Onofrio, Laura Palagi, Bo Peng

    Abstract: In this paper, we study the embedded feature selection problem in linear Support Vector Machines (SVMs), in which a cardinality constraint is employed, leading to an interpretable classification model. The problem is NP-hard due to the presence of the cardinality constraint, even though the original linear SVM amounts to a problem solvable in polynomial time. To handle the hard problem, we first i…

    Submitted 19 December, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: Submitted to European Journal of Operational Research. arXiv admin note: text overlap with arXiv:1808.02435 by other authors

    MSC Class: 90C22; 90C11 ACM Class: I.5.1; I.2.0

  4. arXiv:2401.12625  [pdf, other]

    math.OC cs.DM

    Benders decomposition for congested partial set covering location with uncertain demand

    Authors: Alice Calamita, Ivana Ljubić, Laura Palagi

    Abstract: In this paper, we introduce a mixed integer quadratic formulation for the congested variant of the partial set covering location problem, which involves determining a subset of facility locations to open and efficiently allocating customers to these facilities to minimize the combined costs of facility opening and congestion while ensuring target coverage. To enhance the resilience of the solution…

    Submitted 23 January, 2024; originally announced January 2024.

  5. arXiv:2307.15775  [pdf, other]

    math.OC

    CMA Light: a novel Minibatch Algorithm for large-scale non convex finite sum optimization

    Authors: Corrado Coppola, Giampaolo Liuzzi, Laura Palagi

    Abstract: The supervised training of a deep neural network on a given dataset consists in the unconstrained minimization of the finite sum of continuously differentiable functions, commonly referred to as loss with respect to the samples. These functions depend on the network parameters and most of the times are non-convex. We develop CMA Light, a globally convergent mini-batch gradient method to tackle thi…

    Submitted 22 May, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

  6. arXiv:2303.04216  [pdf, other]

    math.OC

    A computational study of off-the-shelf MINLP solvers on a benchmark set of congested capacitated facility location problems

    Authors: Pasquale Avella, Alice Calamita, Laura Palagi

    Abstract: This paper analyzes the performance of five well-known off-the-shelf optimization solvers on a set of congested capacitated facility location problems formulated as mixed-integer conic programs (MICPs). We aim to compare the computational efficiency of the solvers and examine the solution strategies they adopt when solving instances with different sizes and complexity. The solvers we compare are G…

    Submitted 2 August, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

  7. Unboxing Tree Ensembles for interpretability: a hierarchical visualization tool and a multivariate optimal re-built tree

    Authors: Giulia Di Teodoro, Marta Monaci, Laura Palagi

    Abstract: The interpretability of models has become a crucial issue in Machine Learning because of algorithmic decisions' growing impact on real-world applications. Tree ensemble methods, such as Random Forests or XgBoost, are powerful learning tools for classification tasks. However, while combining multiple trees may provide higher prediction quality than a single one, it sacrifices the interpretability p…

    Submitted 18 January, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: 44 pages, 9 figures, 20 tables

  8. arXiv:2212.01848  [pdf, other]

    math.OC cs.LG

    Convergence of ease-controlled Random Reshuffling gradient Algorithms under Lipschitz smoothness

    Authors: Ruggiero Seccia, Corrado Coppola, Giampaolo Liuzzi, Laura Palagi

    Abstract: In this work, we consider minimizing the average of a very large number of smooth and possibly non-convex functions, and we focus on two widely used minibatch frameworks to tackle this optimization problem: Incremental Gradient (IG) and Random Reshuffling (RR). We define ease-controlled modifications of the IG/RR schemes, which require a light additional computational effort but can be proved to…

    Submitted 20 May, 2024; v1 submitted 4 December, 2022; originally announced December 2022.

    Comments: Add author, add references, correct typos, improve implementation

    MSC Class: 90.C.XX ACM Class: G.4.1

  9. Margin Optimal Classification Trees

    Authors: Federico D'Onofrio, Giorgio Grani, Marta Monaci, Laura Palagi

    Abstract: In recent years, there has been growing attention to interpretable machine learning models which can give explanatory insights on their behaviour. Thanks to their interpretability, decision trees have been intensively studied for classification tasks and, due to the remarkable advances in mixed integer programming (MIP), various approaches have been proposed to formulate the problem of training an…

    Submitted 8 October, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

  10. arXiv:2208.00202  [pdf, other]

    math.OC cs.AI

    Solving the vehicle routing problem with deep reinforcement learning

    Authors: Simone Foa, Corrado Coppola, Giorgio Grani, Laura Palagi

    Abstract: Recently, the applications of the methodologies of Reinforcement Learning (RL) to NP-Hard Combinatorial optimization problems have become a popular topic. This is essentially due to the nature of the traditional combinatorial algorithms, often based on a trial-and-error process. RL aims at automating this process. At this regard, this paper focuses on the application of RL for the Vehicle Routing…

    Submitted 30 July, 2022; originally announced August 2022.

    Comments: This version is really preliminary and the possibility of errors and typos is high

    MSC Class: 90-08

  11. Block Layer Decomposition schemes for training Deep Neural Networks

    Authors: Laura Palagi, Ruggiero Seccia

    Abstract: Deep Feedforward Neural Networks' (DFNNs) weights estimation relies on the solution of a very large nonconvex optimization problem that may have many local (no global) minimizers, saddle points and large plateaus. As a consequence, optimization algorithms can be attracted toward local minimizers which can lead to bad solutions or can slow down the optimization process. Furthermore, the time needed…

    Submitted 18 March, 2020; originally announced March 2020.

    Comments: 23 pages. J Glob Optim (2019)

  12. arXiv:1509.05289  [pdf, ps, other]

    math.OC

    A Class of Parallel Decomposition Algorithms for SVMs Training

    Authors: Andrea Manno, Laura Palagi, Simone Sagratella

    Abstract: The training of Support Vector Machines may be a very difficult task when dealing with very large datasets. The memory requirement and the time consumption of the SVMs algorithms grow rapidly with the increase of the data. To overcome these drawbacks, we propose a parallel decomposition algorithmic scheme for SVMs training for which we prove global convergence under suitable conditions. We outline…

    Submitted 3 November, 2015; v1 submitted 17 September, 2015; originally announced September 2015.