Skip to main content

Showing 1–16 of 16 results for author: Tran-Dinh, Q

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03180  [pdf, other

    math.OC cs.LG

    Shuffling Momentum Gradient Algorithm for Convex Optimization

    Authors: Trang H. Tran, Quoc Tran-Dinh, Lam M. Nguyen

    Abstract: The Stochastic Gradient Descent method (SGD) and its stochastic variants have become methods of choice for solving finite-sum optimization problems arising from machine learning and data science thanks to their ability to handle large-scale applications and big datasets. In the last decades, researchers have made substantial effort to study the theoretical performance of SGD and its shuffling vari… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

    Comments: Vietnam Journal of Mathematics (VJOM), Special issue dedicated to Dr. Tamás Terlaky on the occasion of his 70th birthday, 2024

  2. arXiv:2103.03452  [pdf, other

    stat.ML cs.DC cs.LG

    FedDR -- Randomized Douglas-Rachford Splitting Algorithms for Nonconvex Federated Composite Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We develop two new algorithms, called, FedDR and asyncFedDR, for solving a fundamental nonconvex composite optimization problem in federated learning. Our algorithms rely on a novel combination between a nonconvex Douglas-Rachford splitting method, randomized block-coordinate strategies, and asynchronous implementation. They can also handle convex regularizers. Unlike recent methods in the literat… ▽ More

    Submitted 28 October, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 39 pages, and 12 figures

    Report number: UNC-STOR-June 2021

    Journal ref: NeurIPs 2021

  3. arXiv:2011.11884  [pdf, other

    math.OC cs.LG stat.ML

    SMG: A Shuffling Gradient-Based Method with Momentum

    Authors: Trang H. Tran, Lam M. Nguyen, Quoc Tran-Dinh

    Abstract: We combine two advanced ideas widely used in optimization for machine learning: shuffling strategy and momentum technique to develop a novel shuffling gradient-based method with momentum, coined Shuffling Momentum Gradient (SMG), for non-convex finite-sum optimization problems. While our method is inspired by momentum techniques, its update is fundamentally different from existing momentum-based m… ▽ More

    Submitted 9 June, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: The 38th International Conference on Machine Learning (ICML 2021)

  4. arXiv:2011.10298  [pdf, other

    cs.LG math.OC

    Convergence Analysis of Homotopy-SGD for non-convex optimization

    Authors: Matilde Gargiani, Andrea Zanelli, Quoc Tran-Dinh, Moritz Diehl, Frank Hutter

    Abstract: First-order stochastic methods for solving large-scale non-convex optimization problems are widely used in many big-data applications, e.g. training deep neural networks as well as other complex and potentially non-convex machine learning models. Their inexpensive iterations generally come together with slow global convergence rate (mostly sublinear), leading to the necessity of carrying out a ver… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

    Comments: 21 pages, 14 figures, technical report

  5. arXiv:2010.14763  [pdf, other

    cs.LG math.OC stat.ML

    Hogwild! over Distributed Local Data Sets with Linearly Increasing Mini-Batch Sizes

    Authors: Marten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Quoc Tran-Dinh, Phuong Ha Nguyen

    Abstract: Hogwild! implements asynchronous Stochastic Gradient Descent (SGD) where multiple threads in parallel access a common repository containing training data, perform SGD iterations and update shared state that represents a jointly learned (global) model. We consider big data analysis where training data is distributed among local data sets in a heterogeneous way -- and we wish to move SGD computation… ▽ More

    Submitted 26 February, 2021; v1 submitted 26 October, 2020; originally announced October 2020.

    Comments: arXiv admin note: substantial text overlap with arXiv:2007.09208 AISTATS 2021

  6. arXiv:2007.09208  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Asynchronous Federated Learning with Reduced Number of Rounds and with Differential Privacy from Less Aggregated Gaussian Noise

    Authors: Marten van Dijk, Nhuong V. Nguyen, Toan N. Nguyen, Lam M. Nguyen, Quoc Tran-Dinh, Phuong Ha Nguyen

    Abstract: The feasibility of federated learning is highly constrained by the server-clients infrastructure in terms of network communication. Most newly launched smartphones and IoT devices are equipped with GPUs or sufficient computing hardware to run powerful AI models. However, in case of the original synchronous federated learning, client devices suffer waiting times and regular communication between cl… ▽ More

    Submitted 17 July, 2020; originally announced July 2020.

  7. arXiv:2003.00430  [pdf, other

    cs.LG math.OC

    A Hybrid Stochastic Policy Gradient Algorithm for Reinforcement Learning

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk, Quoc Tran-Dinh

    Abstract: We propose a novel hybrid stochastic policy gradient estimator by combining an unbiased policy gradient estimator, the REINFORCE estimator, with another biased one, an adapted SARAH estimator for policy optimization. The hybrid policy gradient estimator is shown to be biased, but has variance reduced property. Using this estimator, we develop a new Proximal Hybrid Stochastic Policy Gradient Algori… ▽ More

    Submitted 21 September, 2020; v1 submitted 1 March, 2020; originally announced March 2020.

    Comments: Accepted for publication at the 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

    Journal ref: Proceedings of the International Conference on Artificial Intelligence and Statistics, PMLR 108:374-385, 2020

  8. arXiv:2002.08246  [pdf, other

    math.OC cs.LG stat.ML

    A Unified Convergence Analysis for Shuffling-Type Gradient Methods

    Authors: Lam M. Nguyen, Quoc Tran-Dinh, Dzung T. Phan, Phuong Ha Nguyen, Marten van Dijk

    Abstract: In this paper, we propose a unified convergence analysis for a class of generic shuffling-type gradient methods for solving finite-sum optimization problems. Our analysis works with any sampling without replacement strategy and covers many known variants such as randomized reshuffling, deterministic or randomized single permutation, and cyclic and incremental gradient schemes. We focus on two diff… ▽ More

    Submitted 19 September, 2021; v1 submitted 19 February, 2020; originally announced February 2020.

    Comments: Journal of Machine Learning Research, 2021

  9. arXiv:2002.07003  [pdf, other

    math.OC cs.LG stat.ML

    A Newton Frank-Wolfe Method for Constrained Self-Concordant Minimization

    Authors: Deyi Liu, Volkan Cevher, Quoc Tran-Dinh

    Abstract: We demonstrate how to scalably solve a class of constrained self-concordant minimization problems using linear minimization oracles (LMO) over the constraint set. We prove that the number of LMO calls of our method is nearly the same as that of the Frank-Wolfe method in the L-smooth case. Specifically, our Newton Frank-Wolfe method uses $\mathcal{O}(ε^{-ν})$ LMO's, where $ε$ is the desired accurac… ▽ More

    Submitted 17 February, 2020; originally announced February 2020.

  10. arXiv:1907.03793  [pdf, other

    math.OC cs.LG stat.ML

    A Hybrid Stochastic Optimization Framework for Stochastic Composite Nonconvex Optimization

    Authors: Quoc Tran-Dinh, Nhan H. Pham, Dzung T. Phan, Lam M. Nguyen

    Abstract: We introduce a new approach to develop stochastic optimization algorithms for a class of stochastic composite and possibly nonconvex optimization problems. The main idea is to combine two stochastic estimators to create a new hybrid one. We first introduce our hybrid estimator and then investigate its fundamental properties to form a foundational theory for algorithmic development. Next, we apply… ▽ More

    Submitted 2 May, 2020; v1 submitted 8 July, 2019; originally announced July 2019.

    Comments: 49 pages, 2 tables, 9 figures

    Report number: UNC-STOR-2019.07.V1-03

  11. arXiv:1902.05679  [pdf, other

    math.OC cs.LG stat.ML

    ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization

    Authors: Nhan H. Pham, Lam M. Nguyen, Dzung T. Phan, Quoc Tran-Dinh

    Abstract: We propose a new stochastic first-order algorithmic framework to solve stochastic composite nonconvex optimization problems that covers both finite-sum and expectation settings. Our algorithms rely on the SARAH estimator introduced in (Nguyen et al, 2017) and consist of two steps: a proximal gradient and an averaging step making them different from existing nonconvex proximal-type algorithms. The… ▽ More

    Submitted 28 March, 2019; v1 submitted 14 February, 2019; originally announced February 2019.

    Comments: 45 pages, 8 figures, and 2 table

    Report number: STOR-UNC-Feb14.2019

  12. arXiv:1603.06313  [pdf, other

    cs.IT math.OC stat.ML

    Convex block-sparse linear regression with expanders -- provably

    Authors: Anastasios Kyrillidis, Bubacarr Bah, Rouzbeh Hasheminezhad, Quoc Tran-Dinh, Luca Baldassarre, Volkan Cevher

    Abstract: Sparse matrices are favorable objects in machine learning and optimization. When such matrices are used, in place of dense ones, the overall complexity requirements in optimization can be significantly reduced in practice, both in terms of space and run-time. Prompted by this observation, we study a convex optimization scheme for block-sparse recovery from linear measurements. To obtain linear ske… ▽ More

    Submitted 2 April, 2016; v1 submitted 20 March, 2016; originally announced March 2016.

    Comments: 12 pages, 6 figures, to appear at AISTATS

  13. arXiv:1603.01681  [pdf, other

    math.OC cs.IT stat.ML

    A single-phase, proximal path-following framework

    Authors: Quoc Tran-Dinh, Anastasios Kyrillidis, Volkan Cevher

    Abstract: We propose a new proximal, path-following framework for a class of constrained convex problems. We consider settings where the nonlinear---and possibly non-smooth---objective part is endowed with a proximity operator, and the constraint set is equipped with a self-concordant barrier. Our approach relies on the following two main ideas. First, we re-parameterize the optimality condition as an auxil… ▽ More

    Submitted 25 December, 2016; v1 submitted 5 March, 2016; originally announced March 2016.

    Comments: 26 pages, 2 figures, 4 tables (This is the first revision. The original one was uploaded on arxiv on March 5, 2016

    Report number: 90C06, 90C25, 90-08 MSC Class: 90C06; 90C25; 90-08

  14. arXiv:1507.05367  [pdf, other

    cs.IT math.OC stat.ML

    Structured Sparsity: Discrete and Convex approaches

    Authors: Anastasios Kyrillidis, Luca Baldassarre, Marwa El-Halabi, Quoc Tran-Dinh, Volkan Cevher

    Abstract: Compressive sensing (CS) exploits sparsity to recover sparse or compressible signals from dimensionality reducing, non-adaptive sensing mechanisms. Sparsity is also used to enhance interpretability in machine learning and statistics applications: While the ambient dimension is vast in modern data analysis problems, the relevant information therein typically resides in a much lower dimensional spac… ▽ More

    Submitted 19 July, 2015; originally announced July 2015.

    Comments: 30 pages, 18 figures

  15. arXiv:1405.3263  [pdf, other

    stat.ML cs.IT math.OC

    Scalable sparse covariance estimation via self-concordance

    Authors: Anastasios Kyrillidis, Rabeeh Karimi Mahabadi, Quoc Tran-Dinh, Volkan Cevher

    Abstract: We consider the class of convex minimization problems, composed of a self-concordant function, such as the $\log\det$ metric, a convex data fidelity term $h(\cdot)$ and, a regularizing -- possibly non-smooth -- function $g(\cdot)$. This type of problems have recently attracted a great deal of interest, mainly due to their omnipresence in top-notch applications. Under this \emph{locally} Lipschitz… ▽ More

    Submitted 13 May, 2014; originally announced May 2014.

    Comments: 7 pages, 1 figure, Accepted at AAAI-14

  16. arXiv:1308.2867  [pdf, other

    stat.ML cs.LG math.OC

    Composite Self-Concordant Minimization

    Authors: Quoc Tran-Dinh, Anastasios Kyrillidis, Volkan Cevher

    Abstract: We propose a variable metric framework for minimizing the sum of a self-concordant function and a possibly non-smooth convex function, endowed with an easily computable proximal operator. We theoretically establish the convergence of our framework without relying on the usual Lipschitz gradient assumption on the smooth part. An important highlight of our work is a new set of analytic step-size sel… ▽ More

    Submitted 14 April, 2014; v1 submitted 13 August, 2013; originally announced August 2013.

    Comments: 46 pages, 9 figures