Skip to main content

Showing 1–49 of 49 results for author: Shahrampour, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2412.05689  [pdf, other

    math.OC cs.LG

    Local Linear Convergence of Infeasible Optimization with Orthogonal Constraints

    Authors: Youbang Sun, Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: Many classical and modern machine learning algorithms require solving optimization tasks under orthogonality constraints. Solving these tasks with feasible methods requires a gradient descent update followed by a retraction operation on the Stiefel manifold, which can be computationally expensive. Recently, an infeasible retraction-free approach, termed the landing algorithm, was proposed as an ef… ▽ More

    Submitted 7 December, 2024; originally announced December 2024.

  2. arXiv:2406.01484  [pdf, other

    math.OC cs.LG eess.SY

    Online Optimization Perspective on First-Order and Zero-Order Decentralized Nonsmooth Nonconvex Stochastic Optimization

    Authors: Emre Sahinoglu, Shahin Shahrampour

    Abstract: We investigate the finite-time analysis of finding ($δ,ε$)-stationary points for nonsmooth nonconvex objectives in decentralized stochastic optimization. A set of agents aim at minimizing a global function using only their local information by interacting over a network. We present a novel algorithm, called Multi Epoch Decentralized Online Learning (ME-DOL), for which we establish the sample compl… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: To appear in ICML 2024

  3. arXiv:2405.11590  [pdf, other

    cs.LG math.OC

    Retraction-Free Decentralized Non-convex Optimization with Orthogonal Constraints

    Authors: Youbang Sun, Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: In this paper, we investigate decentralized non-convex optimization with orthogonal constraints. Conventional algorithms for this setting require either manifold retractions or other types of projection to ensure feasibility, both of which involve costly linear algebra operations (e.g., SVD or matrix inversion). On the other hand, infeasible methods are able to provide similar performance with hig… ▽ More

    Submitted 7 December, 2024; v1 submitted 19 May, 2024; originally announced May 2024.

  4. arXiv:2405.02769  [pdf, other

    cs.LG cs.MA math.OC

    Linear Convergence of Independent Natural Policy Gradient in Games with Entropy Regularization

    Authors: Youbang Sun, Tao Liu, P. R. Kumar, Shahin Shahrampour

    Abstract: This work focuses on the entropy-regularized independent natural policy gradient (NPG) algorithm in multi-agent reinforcement learning. In this work, agents are assumed to have access to an oracle with exact policy evaluation and seek to maximize their respective independent rewards. Each individual's reward is assumed to depend on the actions of all the agents in the multi-agent system, leading t… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

  5. arXiv:2403.08553  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Policy Optimization over Submanifolds for Linearly Constrained Online LQG

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Recent advancement in online optimization and control has provided novel tools to study online linear quadratic regulator (LQR) problems, where cost matrices are varying adversarially over time. However, the controller parameterization of existing works may not satisfy practical conditions like sparsity due to physical connections. In this work, we study online linear quadratic Gaussian problems w… ▽ More

    Submitted 13 March, 2024; originally announced March 2024.

  6. arXiv:2403.07207  [pdf, other

    stat.ML cs.LG

    Tracking Dynamic Gaussian Density with a Theoretically Optimal Sliding Window Approach

    Authors: Yinsong Wang, Yu Ding, Shahin Shahrampour

    Abstract: Dynamic density estimation is ubiquitous in many applications, including computer vision and signal processing. One popular method to tackle this problem is the "sliding window" kernel density estimator. There exist various implementations of this method that use heuristically defined weight sequences for the observed data. The weight sequence, however, is a key aspect of the estimator affecting t… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

  7. arXiv:2310.09727  [pdf, other

    cs.LG math.OC

    Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games

    Authors: Youbang Sun, Tao Liu, Ruida Zhou, P. R. Kumar, Shahin Shahrampour

    Abstract: This work studies an independent natural policy gradient (NPG) algorithm for the multi-agent reinforcement learning problem in Markov potential games. It is shown that, under mild technical assumptions and the introduction of the \textit{suboptimality gap}, the independent NPG method with an oracle providing exact policy evaluation asymptotically reaches an $ε$-Nash Equilibrium (NE) within… ▽ More

    Submitted 27 October, 2023; v1 submitted 15 October, 2023; originally announced October 2023.

    Comments: Will appear in NeurIPS 2023

  8. arXiv:2310.03206  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Distributed Online Control for LTI Systems with Adversarial Disturbances

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: This paper addresses the distributed online control problem over a network of linear time-invariant (LTI) systems (with possibly unknown dynamics) in the presence of adversarial perturbations. There exists a global network cost that is characterized by a time-varying convex function, which evolves in an adversarial manner and is sequentially and partially observed by local agents. The goal of each… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

  9. arXiv:2302.12320  [pdf, other

    math.OC cs.LG eess.SY

    Dynamic Regret Analysis of Safe Distributed Online Optimization for Convex and Non-convex Problems

    Authors: Ting-Jui Chang, Sapana Chaudhary, Dileep Kalathil, Shahin Shahrampour

    Abstract: This paper addresses safe distributed online optimization over an unknown set of linear safety constraints. A network of agents aims at jointly minimizing a global, time-varying function, which is only partially observable to each individual agent. Therefore, agents must engage in local communications to generate a safe sequence of actions competitive with the best minimizer sequence in hindsight,… ▽ More

    Submitted 23 February, 2023; originally announced February 2023.

  10. arXiv:2302.02224  [pdf, other

    cs.LG stat.ML

    TAP: The Attention Patch for Cross-Modal Knowledge Transfer from Unlabeled Modality

    Authors: Yinsong Wang, Shahin Shahrampour

    Abstract: This paper addresses a cross-modal learning framework, where the objective is to enhance the performance of supervised learning in the primary modality using an unlabeled, unpaired secondary modality. Taking a probabilistic approach for missing information estimation, we show that the extra information contained in the secondary modality can be estimated via Nadaraya-Watson (NW) kernel regression,… ▽ More

    Submitted 19 June, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: Accepted to TMLR

  11. arXiv:2209.12307  [pdf, other

    cs.LG eess.SY math.OC

    On the Stability Analysis of Open Federated Learning Systems

    Authors: Youbang Sun, Heshan Fernando, Tianyi Chen, Shahin Shahrampour

    Abstract: We consider the open federated learning (FL) systems, where clients may join and/or leave the system during the FL process. Given the variability of the number of present clients, convergence to a fixed model cannot be guaranteed in open systems. Instead, we resort to a new performance metric that we term the stability of open FL systems, which quantifies the magnitude of the learned model in open… ▽ More

    Submitted 12 March, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

  12. arXiv:2207.01062  [pdf, other

    cs.LG eess.SY math.OC stat.ML

    Distributed Online System Identification for LTI Systems Using Reverse Experience Replay

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Identification of linear time-invariant (LTI) systems plays an important role in control and reinforcement learning. Both asymptotic and finite-time offline system identification are well-studied in the literature. For online system identification, the idea of stochastic-gradient descent with reverse experience replay (SGD-RER) was recently proposed, where the data sequence is stored in several bu… ▽ More

    Submitted 15 September, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

  13. arXiv:2203.08317  [pdf, other

    stat.ML cs.LG eess.SP

    TAKDE: Temporal Adaptive Kernel Density Estimator for Real-Time Dynamic Density Estimation

    Authors: Yinsong Wang, Yu Ding, Shahin Shahrampour

    Abstract: Real-time density estimation is ubiquitous in many applications, including computer vision and signal processing. Kernel density estimation is arguably one of the most commonly used density estimation techniques, and the use of "sliding window" mechanism adapts kernel density estimators to dynamic processes. In this paper, we derive the asymptotic mean integrated squared error (AMISE) upper bound… ▽ More

    Submitted 8 November, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

  14. arXiv:2112.05888  [pdf, other

    stat.ML cs.LG

    A Sparse Expansion For Deep Gaussian Processes

    Authors: Liang Ding, Rui Tuo, Shahin Shahrampour

    Abstract: In this work, we use Deep Gaussian Processes (DGPs) as statistical surrogates for stochastic processes with complex distributions. Conventional inferential methods for DGP models can suffer from high computational complexity as they require large-scale operations with kernel matrices for training and inference. In this work, we propose an efficient scheme for accurate inference and efficient train… ▽ More

    Submitted 29 April, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

  15. arXiv:2105.14385  [pdf, other

    math.OC cs.LG eess.SY

    On Centralized and Distributed Mirror Descent: Convergence Analysis Using Quadratic Constraints

    Authors: Youbang Sun, Mahyar Fazlyab, Shahin Shahrampour

    Abstract: Mirror descent (MD) is a powerful first-order optimization technique that subsumes several optimization algorithms including gradient descent (GD). In this work, we develop a semi-definite programming (SDP) framework to analyze the convergence rate of MD in centralized and distributed settings under both strongly convex and non-strongly convex assumptions. We view MD with a dynamical system lens a… ▽ More

    Submitted 18 January, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

  16. arXiv:2105.07310  [pdf, other

    math.OC cs.LG eess.SY

    Regret Analysis of Distributed Online LQR Control for Unknown LTI Systems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Online optimization has recently opened avenues to study optimal control for time-varying cost functions that are unknown in advance. Inspired by this line of research, we study the distributed online linear quadratic regulator (LQR) problem for linear time-invariant (LTI) systems with unknown dynamics. Consider a multi-agent network where each agent is modeled as a LTI system. The network has a g… ▽ More

    Submitted 6 February, 2022; v1 submitted 15 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: substantial text overlap with arXiv:2009.13749

  17. arXiv:2102.07091  [pdf, other

    math.OC cs.LG eess.SY

    Decentralized Riemannian Gradient Descent on the Stiefel Manifold

    Authors: Shixiang Chen, Alfredo Garcia, Mingyi Hong, Shahin Shahrampour

    Abstract: We consider a distributed non-convex optimization where a network of agents aims at minimizing a global function over the Stiefel manifold. The global function is represented as a finite sum of smooth local functions, where each local function is associated with one agent and agents communicate with each other over an undirected connected graph. The problem is non-convex as local functions are pos… ▽ More

    Submitted 14 February, 2021; originally announced February 2021.

  18. arXiv:2101.09346  [pdf, ps, other

    math.OC cs.LG eess.SY

    On the Local Linear Rate of Consensus on the Stiefel Manifold

    Authors: Shixiang Chen, Alfredo Garcia, Mingyi Hong, Shahin Shahrampour

    Abstract: We study the convergence properties of Riemannian gradient method for solving the consensus problem (for an undirected connected graph) over the Stiefel manifold. The Stiefel manifold is a non-convex set and the standard notion of averaging in the Euclidean space does not work for this problem. We propose Distributed Riemannian Consensus on Stiefel Manifold (DRCS) and prove that it enjoys a local… ▽ More

    Submitted 22 January, 2021; originally announced January 2021.

  19. arXiv:2011.12233  [pdf, other

    math.OC cs.LG eess.SY

    Linear Convergence of Distributed Mirror Descent with Integral Feedback for Strongly Convex Problems

    Authors: Youbang Sun, Shahin Shahrampour

    Abstract: Distributed optimization often requires finding the minimum of a global objective function written as a sum of local functions. A group of agents work collectively to minimize the global function. We study a continuous-time decentralized mirror descent algorithm that uses purely local gradient information to converge to the global optimal solution. The algorithm enforces consensus among agents usi… ▽ More

    Submitted 24 November, 2020; originally announced November 2020.

    Comments: 12 pages, 1 figure

  20. arXiv:2009.13749  [pdf, other

    math.OC cs.LG eess.SY

    Distributed Online Linear Quadratic Control for Linear Time-invariant Systems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Classical linear quadratic (LQ) control centers around linear time-invariant (LTI) systems, where the control-state pairs introduce a quadratic cost with time-invariant parameters. Recent advancement in online optimization and control has provided novel tools to study LQ problems that are robust to time-varying cost parameters. Inspired by this line of research, we study the distributed online LQ… ▽ More

    Submitted 28 September, 2020; originally announced September 2020.

  21. arXiv:2009.06747  [pdf, other

    math.OC cs.LG stat.ML

    Distributed Mirror Descent with Integral Feedback: Asymptotic Convergence Analysis of Continuous-time Dynamics

    Authors: Youbang Sun, Shahin Shahrampour

    Abstract: This work addresses distributed optimization, where a network of agents wants to minimize a global strongly convex objective function. The global function can be written as a sum of local convex functions, each of which is associated with an agent. We propose a continuous-time distributed mirror descent algorithm that uses purely local information to converge to the global optimum. Unlike previous… ▽ More

    Submitted 14 September, 2020; originally announced September 2020.

  22. arXiv:2006.03912  [pdf, other

    cs.LG math.OC stat.ML

    Unconstrained Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth functions, , Zhang et al. establish the squared path-length of the minimizer sequence ($C^*_{2,T}$) as a lower bound on regret. They also show that online gradient desce… ▽ More

    Submitted 14 August, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

  23. arXiv:2006.03706  [pdf, ps, other

    cs.LG stat.ML

    Learning from Non-Random Data in Hilbert Spaces: An Optimal Recovery Perspective

    Authors: Simon Foucart, Chunyang Liao, Shahin Shahrampour, Yinsong Wang

    Abstract: The notion of generalization in classical Statistical Learning is often attached to the postulate that data points are independent and identically distributed (IID) random variables. While relevant in many applications, this postulate may not hold in general, encouraging the development of learning frameworks that are robust to non-IID data. In this work, we consider the regression problem from an… ▽ More

    Submitted 11 September, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Title modified; formatting changed; some reorganization and addition of Theorem 4

  24. arXiv:2006.03696  [pdf, other

    cs.LG stat.ML

    High-Dimensional Non-Parametric Density Estimation in Mixed Smooth Sobolev Spaces

    Authors: Liang Ding, Lu Zou, Wenjia Wang, Shahin Shahrampour, Rui Tuo

    Abstract: Density estimation plays a key role in many tasks in machine learning, statistical inference, and visualization. The main bottleneck in high-dimensional density estimation is the prohibitive computational cost and the slow convergence rate. In this paper, we propose novel estimators for high-dimensional non-parametric density estimation called the adaptive hyperbolic cross density estimators, whic… ▽ More

    Submitted 20 October, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

  25. arXiv:2004.13233  [pdf, other

    math.OC cs.LG stat.ML

    On Distributed Non-convex Optimization: Projected Subgradient Method For Weakly Convex Problems in Networks

    Authors: Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: The stochastic subgradient method is a widely-used algorithm for solving large-scale optimization problems arising in machine learning. Often these problems are neither smooth nor convex. Recently, Davis et al. [1-2] characterized the convergence of the stochastic subgradient method for the weakly convex case, which encompasses many important applications (e.g., robust phase retrieval, blind decon… ▽ More

    Submitted 23 February, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

  26. arXiv:2003.05783  [pdf, other

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a `base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav… ▽ More

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  27. arXiv:2002.12537  [pdf, other

    stat.ML cs.LG

    Generalized Sliced Distances for Probability Distributions

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Shahin Shahrampour

    Abstract: Probability metrics have become an indispensable part of modern statistics and machine learning, and they play a quintessential role in various applications, including statistical hypothesis testing and generative modeling. However, in a practical setting, the convergence behavior of the algorithms built upon these distances have not been well established, except for a few specific cases. In this… ▽ More

    Submitted 27 February, 2020; originally announced February 2020.

  28. arXiv:2002.04753  [pdf, other

    cs.LG stat.ML

    RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton method, where the Hessian is approximated via sub-samples of data. In RKHS, however, the dependence of the penalty function to kernel makes standard sub-sampling approac… ▽ More

    Submitted 6 June, 2022; v1 submitted 11 February, 2020; originally announced February 2020.

  29. arXiv:2002.04195  [pdf, other

    cs.LG stat.ML

    Generalization Guarantees for Sparse Kernel Approximation with Entropic Optimal Features

    Authors: Liang Ding, Rui Tuo, Shahin Shahrampour

    Abstract: Despite their success, kernel methods suffer from a massive computational cost in practice. In this paper, in lieu of commonly used kernel expansion with respect to $N$ inputs, we develop a novel optimal design maximizing the entropy among kernel features. This procedure results in a kernel expansion with respect to entropic optimal features (EOF), improving the data representation dramatically du… ▽ More

    Submitted 10 February, 2020; originally announced February 2020.

  30. arXiv:1910.13567  [pdf, other

    eess.SP cs.IT

    Cell Association via Boundary Detection: A Scalable Approach Based on Data-Driven Random Features

    Authors: Yinsong Wang, Hessam Mahdavifar, Kamran Entesari, Shahin Shahrampour

    Abstract: The problem of cell association is considered for cellular users present in the field. This has become a challenging problem with the deployment of 5G networks which will share the sub-6 GHz bands with the legacy 4G networks. Instead of taking a network-controlled approach, which may not be scalable with the number of users and may introduce extra delays into the system, we propose a scalable solu… ▽ More

    Submitted 29 October, 2019; originally announced October 2019.

    Comments: 6 pages

  31. arXiv:1910.05384  [pdf, other

    cs.LG stat.ML

    ORCCA: Optimal Randomized Canonical Correlation Analysis

    Authors: Yinsong Wang, Shahin Shahrampour

    Abstract: Random features approach has been widely used for kernel approximation in large-scale machine learning. A number of recent studies have explored data-dependent sampling of features, modifying the stochastic oracle from which random features are sampled. While proposed techniques in this realm improve the approximation, their suitability is often verified on a single learning task. In this paper, w… ▽ More

    Submitted 1 November, 2021; v1 submitted 11 October, 2019; originally announced October 2019.

  32. arXiv:1909.11820  [pdf, other

    cs.LG stat.ML

    A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models

    Authors: Masoud Badiei Khuzani, Liyue Shen, Shahin Shahrampour, Lei Xing

    Abstract: We propose a novel supervised learning method to optimize the kernel in the maximum mean discrepancy generative adversarial networks (MMD GANs), and the kernel support vector machines (SVMs). Specifically, we characterize a distributionally robust optimization problem to compute a good distribution for the random feature model of Rahimi and Recht. Due to the fact that the distributional optimizati… ▽ More

    Submitted 21 February, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: 51 pages, 4 figures. In this edition, new simulations for the kernel SVMs are included

  33. arXiv:1909.09736  [pdf, other

    eess.SY cs.LG

    Distributed Parameter Estimation in Randomized One-hidden-layer Neural Networks

    Authors: Yinsong Wang, Shahin Shahrampour

    Abstract: This paper addresses distributed parameter estimation in randomized one-hidden-layer neural networks. A group of agents sequentially receive measurements of an unknown parameter that is only partially observable to them. In this paper, we present a fully distributed estimation algorithm where agents exchange local estimates with their neighbors to collectively identify the true value of the parame… ▽ More

    Submitted 20 March, 2020; v1 submitted 20 September, 2019; originally announced September 2019.

    Comments: 6 Pages

  34. arXiv:1903.08329  [pdf, other

    cs.LG cs.AI stat.ML

    On Sampling Random Features From Empirical Leverage Scores: Implementation and Theoretical Guarantees

    Authors: Shahin Shahrampour, Soheil Kolouri

    Abstract: Random features provide a practical framework for large-scale kernel approximation and supervised learning. It has been shown that data-dependent sampling of random features using leverage scores can significantly reduce the number of features required to achieve optimal learning bounds. Leverage scores introduce an optimized distribution for features based on an infinite-dimensional integral oper… ▽ More

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 23 pages

  35. arXiv:1810.03817  [pdf, ps, other

    cs.LG stat.ML

    Learning Bounds for Greedy Approximation with Explicit Feature Maps from Multiple Kernels

    Authors: Shahin Shahrampour, Vahid Tarokh

    Abstract: Nonlinear kernels can be approximated using finite-dimensional feature maps for efficient risk minimization. Due to the inherent trade-off between the dimension of the (mapped) feature space and the approximation accuracy, the key problem is to identify promising (explicit) features leading to a satisfactory out-of-sample performance. In this work, we tackle this problem by efficiently choosing su… ▽ More

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: Proc. of 2018 Advances in Neural Information Processing Systems (NIPS 2018)

  36. arXiv:1712.07102  [pdf, other

    stat.ML cs.LG

    On Data-Dependent Random Features for Improved Generalization in Supervised Learning

    Authors: Shahin Shahrampour, Ahmad Beirami, Vahid Tarokh

    Abstract: The randomized-feature approach has been successfully employed in large-scale kernel approximation and supervised learning. The distribution from which the random features are drawn impacts the number of features required to efficiently perform a learning task. Recently, it has been shown that employing data-dependent randomization improves the performance in terms of the required number of random… ▽ More

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: 12 pages; (pages 1-8) to appear in Proc. of AAAI Conference on Artificial Intelligence (AAAI), 2018

  37. arXiv:1711.05323  [pdf, other

    stat.ML cs.LG

    On Optimal Generalizability in Parametric Learning

    Authors: Ahmad Beirami, Meisam Razaviyayn, Shahin Shahrampour, Vahid Tarokh

    Abstract: We consider the parametric learning problem, where the objective of the learner is determined by a parametric loss function. Employing empirical risk minimization with possibly regularization, the inferred parameter vector will be biased toward the training samples. Such bias is measured by the cross validation procedure in practice where the data set is partitioned into a training set used for tr… ▽ More

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Proc. of 2017 Advances in Neural Information Processing Systems (NIPS 2017)

  38. arXiv:1707.02649  [pdf, ps, other

    stat.ML cs.LG

    Nonlinear Sequential Accepts and Rejects for Identification of Top Arms in Stochastic Bandits

    Authors: Shahin Shahrampour, Vahid Tarokh

    Abstract: We address the M-best-arm identification problem in multi-armed bandits. A player has a limited budget to explore K arms (M<K), and once pulled, each arm yields a reward drawn (independently) from a fixed, unknown distribution. The goal is to find the top M arms in the sense of expected reward. We develop an algorithm which proceeds in rounds to deactivate arms iteratively. At each round, the budg… ▽ More

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: 7 pages

  39. arXiv:1702.06219  [pdf, other

    math.OC cs.MA stat.ML

    An Online Optimization Approach for Multi-Agent Tracking of Dynamic Parameters in the Presence of Adversarial Noise

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This paper addresses tracking of a moving target in a multi-agent network. The target follows a linear dynamics corrupted by an adversarial noise, i.e., the noise is not generated from a statistical distribution. The location of the target at each time induces a global time-varying loss function, and the global loss is a sum of local losses, each of which is associated to one agent. Agents noisy o… ▽ More

    Submitted 20 February, 2017; originally announced February 2017.

    Comments: 8 pages, To appear in American Control Conference 2017

  40. arXiv:1609.02845  [pdf, other

    math.OC cs.DC cs.LG stat.ML

    Distributed Online Optimization in Dynamic Environments Using Mirror Descent

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This work addresses decentralized online optimization in non-stationary environments. A network of agents aim to track the minimizer of a global time-varying convex function. The minimizer evolves according to a known dynamics corrupted by an unknown, unstructured noise. At each time, the global function can be cast as a sum of a finite number of local functions, each of which is assigned to one a… ▽ More

    Submitted 9 September, 2016; originally announced September 2016.

  41. On Sequential Elimination Algorithms for Best-Arm Identification in Multi-Armed Bandits

    Authors: Shahin Shahrampour, Mohammad Noshad, Vahid Tarokh

    Abstract: We consider the best-arm identification problem in multi-armed bandits, which focuses purely on exploration. A player is given a fixed budget to explore a finite set of arms, and the rewards of each arm are drawn independently from a fixed, unknown distribution. The player aims to identify the arm with the largest expected reward. We propose a general framework to unify sequential elimination algo… ▽ More

    Submitted 13 April, 2017; v1 submitted 8 September, 2016; originally announced September 2016.

  42. arXiv:1603.04954  [pdf, other

    cs.LG math.OC

    Online Optimization in Dynamic Environments: Improved Regret Rates for Strongly Convex Problems

    Authors: Aryan Mokhtari, Shahin Shahrampour, Ali Jadbabaie, Alejandro Ribeiro

    Abstract: In this paper, we address tracking of a time-varying parameter with unknown dynamics. We formalize the problem as an instance of online optimization in a dynamic setting. Using online gradient descent, we propose a method that sequentially predicts the value of the parameter and in turn suffers a loss. The objective is to minimize the accumulation of losses over the time horizon, a notion that is… ▽ More

    Submitted 16 March, 2016; originally announced March 2016.

  43. arXiv:1603.00576  [pdf, ps, other

    math.OC cs.LG cs.SI

    Distributed Estimation of Dynamic Parameters : Regret Analysis

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the estimation of a time- varying parameter in a network. A group of agents sequentially receive noisy signals about the parameter (or moving target), which does not follow any particular dynamics. The parameter is not observable to an individual agent, but it is globally identifiable for the whole network. Viewing the problem with an online optimization lens, we aim to provid… ▽ More

    Submitted 1 March, 2016; originally announced March 2016.

    Comments: 6 pages, To appear in American Control Conference 2016

  44. arXiv:1503.03517  [pdf, ps, other

    cs.LG math.OC stat.ML

    Switching to Learn

    Authors: Shahin Shahrampour, Mohammad Amin Rahimian, Ali Jadbabaie

    Abstract: A network of agents attempt to learn some unknown state of the world drawn by nature from a finite set. Agents observe private signals conditioned on the true state, and form beliefs about the unknown state accordingly. Each agent may face an identification problem in the sense that she cannot distinguish the truth in isolation. However, by communicating with each other, agents are able to benefit… ▽ More

    Submitted 11 March, 2015; originally announced March 2015.

    Comments: 6 pages, To appear in American Control Conference 2015

  45. arXiv:1501.06225  [pdf, ps, other

    cs.LG math.OC stat.ML

    Online Optimization : Competing with Dynamic Comparators

    Authors: Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

    Abstract: Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against complex benchmarks. In this paper, we address these two directions together. We present a fully adaptive method that… ▽ More

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: 23 pages, To appear in International Conference on Artificial Intelligence and Statistics (AISTATS) 2015

  46. arXiv:1409.8606  [pdf, other

    math.OC cs.LG cs.SI stat.ML

    Distributed Detection : Finite-time Analysis and Impact of Network Topology

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of distributed detection in multi-agent networks. Agents receive private signals about an unknown state of the world. The underlying state is globally identifiable, yet informative signals may be dispersed throughout the network. Using an optimization-based framework, we develop an iterative local strategy for updating individual beliefs. In contrast to the existin… ▽ More

    Submitted 30 September, 2014; originally announced September 2014.

    Comments: 29 pages, 5 figures

  47. arXiv:1310.0432  [pdf, ps, other

    math.OC cs.LG cs.SI stat.ML

    Online Learning of Dynamic Parameters in Social Networks

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of online learning in a dynamic setting. We consider a social network in which each individual observes a private signal about the underlying state of the world and communicates with her neighbors at each time period. Unlike many existing approaches, the underlying state is dynamic, and evolves according to a geometric random walk. We view the scenario as an optimi… ▽ More

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: 12 pages, To appear in Neural Information Processing Systems (NIPS) 2013

  48. arXiv:1309.2350  [pdf, ps, other

    cs.LG cs.SI math.OC stat.ML

    Exponentially Fast Parameter Estimation in Networks Using Distributed Dual Averaging

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: In this paper we present an optimization-based view of distributed parameter estimation and observational social learning in networks. Agents receive a sequence of random, independent and identically distributed (i.i.d.) signals, each of which individually may not be informative about the underlying true state, but the signals together are globally informative enough to make the true state identif… ▽ More

    Submitted 9 September, 2013; originally announced September 2013.

    Comments: 6 pages, To appear in Conference on Decision and Control 2013

  49. arXiv:1303.3250  [pdf, ps, other

    cs.SI math.OC physics.soc-ph

    Reconstruction of Directed Networks from Consensus Dynamics

    Authors: Shahin Shahrampour, Victor M. Preciado

    Abstract: This paper addresses the problem of identifying the topology of an unknown, weighted, directed network running a consensus dynamics. We propose a methodology to reconstruct the network topology from the dynamic response when the system is stimulated by a wide-sense stationary noise of unknown power spectral density. The method is based on a node-knockout, or grounding, procedure wherein the ground… ▽ More

    Submitted 15 March, 2013; v1 submitted 13 March, 2013; originally announced March 2013.

    Comments: 6 pages

    Journal ref: S. Shahrampour and V.M. Preciado,"Reconstruction of Directed Networks from Consensus Dynamics," in Proc. American Control Conference, 2013