Skip to main content

Showing 1–13 of 13 results for author: Varshney, P K

Searching in archive math. Search in all archives.
.
  1. arXiv:2203.04850  [pdf, other

    math.OC cs.DC cs.LG

    Federated Minimax Optimization: Improved Convergence Analyses and Algorithms

    Authors: Pranay Sharma, Rohan Panda, Gauri Joshi, Pramod K. Varshney

    Abstract: In this paper, we consider nonconvex minimax optimization, which is gaining prominence in many modern machine learning applications such as GANs. Large-scale edge-based collection of training data in these applications calls for communication-efficient distributed optimization algorithms, such as those used in federated learning, to process the data. In this paper, we analyze Local stochastic grad… ▽ More

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 52 pages, 4 figures

  2. arXiv:2106.10435  [pdf, other

    cs.LG math.OC stat.ML

    STEM: A Stochastic Two-Sided Momentum Algorithm Achieving Near-Optimal Sample and Communication Complexities for Federated Learning

    Authors: Prashant Khanduri, Pranay Sharma, Haibo Yang, Mingyi Hong, Jia Liu, Ketan Rajawat, Pramod K. Varshney

    Abstract: Federated Learning (FL) refers to the paradigm where multiple worker nodes (WNs) build a joint model by using local data. Despite extensive research, for a generic non-convex FL problem, it is not clear, how to choose the WNs' and the server's update directions, the minibatch sizes, and the local update frequency, so that the WNs use the minimum number of samples and communication rounds to achiev… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

  3. arXiv:2012.11518  [pdf, other

    stat.ML cs.LG math.OC

    Zeroth-Order Hybrid Gradient Descent: Towards A Principled Black-Box Optimization Framework

    Authors: Pranay Sharma, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Xue Lin, Pramod K. Varshney

    Abstract: In this work, we focus on the study of stochastic zeroth-order (ZO) optimization which does not require first-order gradient information and uses only function evaluations. The problem of ZO optimization has emerged in many recent machine learning applications, where the gradient of the objective function is either unavailable or difficult to compute. In such cases, we can approximate the full gra… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

    Comments: 27 pages, 3 figures

  4. arXiv:2005.00224  [pdf, ps, other

    math.OC cs.DC

    Distributed Stochastic Non-Convex Optimization: Momentum-Based Variance Reduction

    Authors: Prashant Khanduri, Pranay Sharma, Swatantra Kafle, Saikiran Bulusu, Ketan Rajawat, Pramod K. Varshney

    Abstract: In this work, we propose a distributed algorithm for stochastic non-convex optimization. We consider a worker-server architecture where a set of $K$ worker nodes (WNs) in collaboration with a server node (SN) jointly aim to minimize a global, potentially non-convex objective function. The objective function is assumed to be the sum of local objective functions available at each WN, with each node… ▽ More

    Submitted 1 May, 2020; originally announced May 2020.

  5. arXiv:2001.03166  [pdf, ps, other

    math.OC cs.DC eess.SY

    On Distributed Online Convex Optimization with Sublinear Dynamic Regret and Fit

    Authors: Pranay Sharma, Prashant Khanduri, Lixin Shen, Donald J. Bucci Jr., Pramod K. Varshney

    Abstract: In this work, we consider a distributed online convex optimization problem, with time-varying (potentially adversarial) constraints. A set of nodes, jointly aim to minimize a global objective function, which is the sum of local convex functions. The objective and constraint functions are revealed locally to the nodes, at each time, after taking an action. Naturally, the constraints cannot be insta… ▽ More

    Submitted 5 May, 2021; v1 submitted 9 January, 2020; originally announced January 2020.

    Comments: 22 pages

  6. arXiv:1912.06036  [pdf, ps, other

    math.OC cs.DC cs.LG cs.MA stat.ML

    Parallel Restarted SPIDER -- Communication Efficient Distributed Nonconvex Optimization with Optimal Computation Complexity

    Authors: Pranay Sharma, Swatantra Kafle, Prashant Khanduri, Saikiran Bulusu, Ketan Rajawat, Pramod K. Varshney

    Abstract: In this paper, we propose a distributed algorithm for stochastic smooth, non-convex optimization. We assume a worker-server architecture where $N$ nodes, each having $n$ (potentially infinite) number of samples, collaborate with the help of a central server to perform the optimization task. The global objective is to minimize the average of local cost functions available at individual nodes. The p… ▽ More

    Submitted 6 November, 2020; v1 submitted 12 December, 2019; originally announced December 2019.

  7. arXiv:1912.04531  [pdf, ps, other

    math.OC cs.DC cs.MA

    Byzantine Resilient Non-Convex SVRG with Distributed Batch Gradient Computations

    Authors: Prashant Khanduri, Saikiran Bulusu, Pranay Sharma, Pramod K. Varshney

    Abstract: In this work, we consider the distributed stochastic optimization problem of minimizing a non-convex function $f(x) = \mathbb{E}_{ξ\sim \mathcal{D}} f(x; ξ)$ in an adversarial setting, where the individual functions $f(x; ξ)$ can also be potentially non-convex. We assume that at most $α$-fraction of a total of $K$ nodes can be Byzantines. We propose a robust stochastic variance-reduced gradient (S… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: Optimization for Machine Learning, 2019

  8. arXiv:1410.5904  [pdf, ps, other

    cs.CR cs.DC math.OC stat.AP

    Distributed Detection in Tree Networks: Byzantines and Mitigation Techniques

    Authors: Bhavya Kailkhura, Swastik Brahma, Berkan Dulek, Yunghsiang S Han, Pramod K. Varshney

    Abstract: In this paper, the problem of distributed detection in tree networks in the presence of Byzantines is considered. Closed form expressions for optimal attacking strategies that minimize the miss detection error exponent at the fusion center (FC) are obtained. We also look at the problem from the network designer's (FC's) perspective. We study the problem of designing optimal distributed detection p… ▽ More

    Submitted 21 October, 2014; originally announced October 2014.

  9. arXiv:1311.2448  [pdf, ps, other

    math.NA cs.IT

    Recovery of Sparse Matrices via Matrix Sketching

    Authors: Thakshila Wimalajeewa, Yonina C. Eldar, Pramod K. Varshney

    Abstract: In this paper, we consider the problem of recovering an unknown sparse matrix X from the matrix sketch Y = AX B^T. The dimension of Y is less than that of X, and A and B are known matrices. This problem can be solved using standard compressive sensing (CS) theory after converting it to vector form using the Kronecker operation. In this case, the measurement matrix assumes a Kronecker product struc… ▽ More

    Submitted 11 November, 2013; originally announced November 2013.

  10. arXiv:1309.4513  [pdf, ps, other

    stat.AP cs.CR math.CO

    Distributed Detection in Tree Topologies with Byzantines

    Authors: Bhavya Kailkhura, Swastik Brahma, Yunghsiang S. Han, Pramod K. Varshney

    Abstract: In this paper, we consider the problem of distributed detection in tree topologies in the presence of Byzantines. The expression for minimum attacking power required by the Byzantines to blind the fusion center (FC) is obtained. More specifically, we show that when more than a certain fraction of individual node decisions are falsified, the decision fusion scheme becomes completely incapable. We o… ▽ More

    Submitted 17 September, 2013; originally announced September 2013.

  11. Sensor Selection Based on Generalized Information Gain for Target Tracking in Large Sensor Networks

    Authors: Xiaojing Shen, Pramod K. Varshney

    Abstract: In this paper, sensor selection problems for target tracking in large sensor networks with linear equality or inequality constraints are considered. First, we derive an equivalent Kalman filter for sensor selection, i.e., generalized information filter. Then, under a regularity condition, we prove that the multistage look-ahead policy that minimizes either the final or the average estimation error… ▽ More

    Submitted 6 February, 2013; originally announced February 2013.

    Comments: 38 pages, 14 figures, submitted to Journal

  12. arXiv:1211.6719  [pdf, ps, other

    cs.IT math.NA

    Cooperative Sparsity Pattern Recovery in Distributed Networks Via Distributed-OMP

    Authors: Thakshila Wimalajeewa, Pramod K. Varshney

    Abstract: In this paper, we consider the problem of collaboratively estimating the sparsity pattern of a sparse signal with multiple measurement data in distributed networks. We assume that each node makes Compressive Sensing (CS) based measurements via random projections regarding the same sparse signal. We propose a distributed greedy algorithm based on Orthogonal Matching Pursuit (OMP), in which the spar… ▽ More

    Submitted 28 November, 2012; originally announced November 2012.

  13. arXiv:0908.2954  [pdf, other

    stat.ME math.ST

    Approximation of Average Run Length of Moving Sum Algorithms Using Multivariate Probabilities

    Authors: Swarnendu Kar, Kishan G. Mehrotra, Pramod K. Varshney

    Abstract: Among the various procedures used to detect potential changes in a stochastic process the moving sum algorithms are very popular due to their intuitive appeal and good statistical performance. One of the important design parameters of a change detection algorithm is the expected interval between false positives, also known as the average run length (ARL). Computation of the ARL usually involves… ▽ More

    Submitted 20 August, 2009; originally announced August 2009.