Showing 1–30 of 30 results for author: Shahrampour, S

Searching in archive stat.
  1. arXiv:2403.07207  [pdf, other]

    stat.ML cs.LG

    Tracking Dynamic Gaussian Density with a Theoretically Optimal Sliding Window Approach

    Authors: Yinsong Wang, Yu Ding, Shahin Shahrampour

    Abstract: Dynamic density estimation is ubiquitous in many applications, including computer vision and signal processing. One popular method to tackle this problem is the "sliding window" kernel density estimator. There exist various implementations of this method that use heuristically defined weight sequences for the observed data. The weight sequence, however, is a key aspect of the estimator affecting t…

    Submitted 11 March, 2024; originally announced March 2024.

  2. arXiv:2302.02224  [pdf, other]

    cs.LG stat.ML

    TAP: The Attention Patch for Cross-Modal Knowledge Transfer from Unlabeled Modality

    Authors: Yinsong Wang, Shahin Shahrampour

    Abstract: This paper addresses a cross-modal learning framework, where the objective is to enhance the performance of supervised learning in the primary modality using an unlabeled, unpaired secondary modality. Taking a probabilistic approach for missing information estimation, we show that the extra information contained in the secondary modality can be estimated via Nadaraya-Watson (NW) kernel regression,…

    Submitted 19 June, 2024; v1 submitted 4 February, 2023; originally announced February 2023.

    Comments: Accepted to TMLR

  3. arXiv:2207.01062  [pdf, other]

    cs.LG eess.SY math.OC stat.ML

    Distributed Online System Identification for LTI Systems Using Reverse Experience Replay

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: Identification of linear time-invariant (LTI) systems plays an important role in control and reinforcement learning. Both asymptotic and finite-time offline system identification are well-studied in the literature. For online system identification, the idea of stochastic-gradient descent with reverse experience replay (SGD-RER) was recently proposed, where the data sequence is stored in several bu…

    Submitted 15 September, 2022; v1 submitted 3 July, 2022; originally announced July 2022.

  4. arXiv:2203.08317  [pdf, other]

    stat.ML cs.LG eess.SP

    TAKDE: Temporal Adaptive Kernel Density Estimator for Real-Time Dynamic Density Estimation

    Authors: Yinsong Wang, Yu Ding, Shahin Shahrampour

    Abstract: Real-time density estimation is ubiquitous in many applications, including computer vision and signal processing. Kernel density estimation is arguably one of the most commonly used density estimation techniques, and the use of "sliding window" mechanism adapts kernel density estimators to dynamic processes. In this paper, we derive the asymptotic mean integrated squared error (AMISE) upper bound…

    Submitted 8 November, 2023; v1 submitted 15 March, 2022; originally announced March 2022.

  5. arXiv:2112.05888  [pdf, other]

    stat.ML cs.LG

    A Sparse Expansion For Deep Gaussian Processes

    Authors: Liang Ding, Rui Tuo, Shahin Shahrampour

    Abstract: In this work, we use Deep Gaussian Processes (DGPs) as statistical surrogates for stochastic processes with complex distributions. Conventional inferential methods for DGP models can suffer from high computational complexity as they require large-scale operations with kernel matrices for training and inference. In this work, we propose an efficient scheme for accurate inference and efficient train…

    Submitted 29 April, 2023; v1 submitted 10 December, 2021; originally announced December 2021.

  6. arXiv:2009.06747  [pdf, other]

    math.OC cs.LG stat.ML

    Distributed Mirror Descent with Integral Feedback: Asymptotic Convergence Analysis of Continuous-time Dynamics

    Authors: Youbang Sun, Shahin Shahrampour

    Abstract: This work addresses distributed optimization, where a network of agents wants to minimize a global strongly convex objective function. The global function can be written as a sum of local convex functions, each of which is associated with an agent. We propose a continuous-time distributed mirror descent algorithm that uses purely local information to converge to the global optimum. Unlike previous…

    Submitted 14 September, 2020; originally announced September 2020.

  7. arXiv:2006.03912  [pdf, other]

    cs.LG math.OC stat.ML

    Unconstrained Online Optimization: Dynamic Regret Analysis of Strongly Convex and Smooth Problems

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: The regret bound of dynamic online learning algorithms is often expressed in terms of the variation in the function sequence ($V_T$) and/or the path-length of the minimizer sequence after $T$ rounds. For strongly convex and smooth functions, Zhang et al. establish the squared path-length of the minimizer sequence ($C^*_{2,T}$) as a lower bound on regret. They also show that online gradient desce…

    Submitted 14 August, 2020; v1 submitted 6 June, 2020; originally announced June 2020.

  8. arXiv:2006.03706  [pdf, ps, other]

    cs.LG stat.ML

    Learning from Non-Random Data in Hilbert Spaces: An Optimal Recovery Perspective

    Authors: Simon Foucart, Chunyang Liao, Shahin Shahrampour, Yinsong Wang

    Abstract: The notion of generalization in classical Statistical Learning is often attached to the postulate that data points are independent and identically distributed (IID) random variables. While relevant in many applications, this postulate may not hold in general, encouraging the development of learning frameworks that are robust to non-IID data. In this work, we consider the regression problem from an…

    Submitted 11 September, 2020; v1 submitted 5 June, 2020; originally announced June 2020.

    Comments: Title modified; formatting changed; some reorganization and addition of Theorem 4

  9. arXiv:2006.03696  [pdf, other]

    cs.LG stat.ML

    High-Dimensional Non-Parametric Density Estimation in Mixed Smooth Sobolev Spaces

    Authors: Liang Ding, Lu Zou, Wenjia Wang, Shahin Shahrampour, Rui Tuo

    Abstract: Density estimation plays a key role in many tasks in machine learning, statistical inference, and visualization. The main bottleneck in high-dimensional density estimation is the prohibitive computational cost and the slow convergence rate. In this paper, we propose novel estimators for high-dimensional non-parametric density estimation called the adaptive hyperbolic cross density estimators, whic…

    Submitted 20 October, 2021; v1 submitted 5 June, 2020; originally announced June 2020.

  10. arXiv:2004.13233  [pdf, other]

    math.OC cs.LG stat.ML

    On Distributed Non-convex Optimization: Projected Subgradient Method For Weakly Convex Problems in Networks

    Authors: Shixiang Chen, Alfredo Garcia, Shahin Shahrampour

    Abstract: The stochastic subgradient method is a widely-used algorithm for solving large-scale optimization problems arising in machine learning. Often these problems are neither smooth nor convex. Recently, Davis et al. [1-2] characterized the convergence of the stochastic subgradient method for the weakly convex case, which encompasses many important applications (e.g., robust phase retrieval, blind decon…

    Submitted 23 February, 2021; v1 submitted 27 April, 2020; originally announced April 2020.

  11. arXiv:2003.05783  [pdf, other]

    stat.ML cs.LG

    Statistical and Topological Properties of Sliced Probability Divergences

    Authors: Kimia Nadjahi, Alain Durmus, Lénaïc Chizat, Soheil Kolouri, Shahin Shahrampour, Umut Şimşekli

    Abstract: The idea of slicing divergences has been proven to be successful when comparing two probability measures in various machine learning applications including generative modeling, and consists in computing the expected value of a `base divergence' between one-dimensional random projections of the two measures. However, the topological, statistical, and computational consequences of this technique hav…

    Submitted 4 January, 2022; v1 submitted 12 March, 2020; originally announced March 2020.

    Comments: Published at NeurIPS 2020 (Spotlight)

  12. arXiv:2002.12537  [pdf, other]

    stat.ML cs.LG

    Generalized Sliced Distances for Probability Distributions

    Authors: Soheil Kolouri, Kimia Nadjahi, Umut Simsekli, Shahin Shahrampour

    Abstract: Probability metrics have become an indispensable part of modern statistics and machine learning, and they play a quintessential role in various applications, including statistical hypothesis testing and generative modeling. However, in a practical setting, the convergence behavior of the algorithms built upon these distances have not been well established, except for a few specific cases. In this…

    Submitted 27 February, 2020; originally announced February 2020.

  13. arXiv:2002.04753  [pdf, other]

    cs.LG stat.ML

    RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

    Authors: Ting-Jui Chang, Shahin Shahrampour

    Abstract: In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton method, where the Hessian is approximated via sub-samples of data. In RKHS, however, the dependence of the penalty function to kernel makes standard sub-sampling approac…

    Submitted 6 June, 2022; v1 submitted 11 February, 2020; originally announced February 2020.

  14. arXiv:2002.04195  [pdf, other]

    cs.LG stat.ML

    Generalization Guarantees for Sparse Kernel Approximation with Entropic Optimal Features

    Authors: Liang Ding, Rui Tuo, Shahin Shahrampour

    Abstract: Despite their success, kernel methods suffer from a massive computational cost in practice. In this paper, in lieu of commonly used kernel expansion with respect to $N$ inputs, we develop a novel optimal design maximizing the entropy among kernel features. This procedure results in a kernel expansion with respect to entropic optimal features (EOF), improving the data representation dramatically du…

    Submitted 10 February, 2020; originally announced February 2020.

  15. arXiv:1910.05384  [pdf, other]

    cs.LG stat.ML

    ORCCA: Optimal Randomized Canonical Correlation Analysis

    Authors: Yinsong Wang, Shahin Shahrampour

    Abstract: Random features approach has been widely used for kernel approximation in large-scale machine learning. A number of recent studies have explored data-dependent sampling of features, modifying the stochastic oracle from which random features are sampled. While proposed techniques in this realm improve the approximation, their suitability is often verified on a single learning task. In this paper, w…

    Submitted 1 November, 2021; v1 submitted 11 October, 2019; originally announced October 2019.

  16. arXiv:1909.11820  [pdf, other]

    cs.LG stat.ML

    A Mean-Field Theory for Kernel Alignment with Random Features in Generative and Discriminative Models

    Authors: Masoud Badiei Khuzani, Liyue Shen, Shahin Shahrampour, Lei Xing

    Abstract: We propose a novel supervised learning method to optimize the kernel in the maximum mean discrepancy generative adversarial networks (MMD GANs), and the kernel support vector machines (SVMs). Specifically, we characterize a distributionally robust optimization problem to compute a good distribution for the random feature model of Rahimi and Recht. Due to the fact that the distributional optimizati…

    Submitted 21 February, 2020; v1 submitted 25 September, 2019; originally announced September 2019.

    Comments: 51 pages, 4 figures. In this edition, new simulations for the kernel SVMs are included

  17. arXiv:1903.08329  [pdf, other]

    cs.LG cs.AI stat.ML

    On Sampling Random Features From Empirical Leverage Scores: Implementation and Theoretical Guarantees

    Authors: Shahin Shahrampour, Soheil Kolouri

    Abstract: Random features provide a practical framework for large-scale kernel approximation and supervised learning. It has been shown that data-dependent sampling of random features using leverage scores can significantly reduce the number of features required to achieve optimal learning bounds. Leverage scores introduce an optimized distribution for features based on an infinite-dimensional integral oper…

    Submitted 19 March, 2019; originally announced March 2019.

    Comments: 23 pages

  18. arXiv:1810.03817  [pdf, ps, other]

    cs.LG stat.ML

    Learning Bounds for Greedy Approximation with Explicit Feature Maps from Multiple Kernels

    Authors: Shahin Shahrampour, Vahid Tarokh

    Abstract: Nonlinear kernels can be approximated using finite-dimensional feature maps for efficient risk minimization. Due to the inherent trade-off between the dimension of the (mapped) feature space and the approximation accuracy, the key problem is to identify promising (explicit) features leading to a satisfactory out-of-sample performance. In this work, we tackle this problem by efficiently choosing su…

    Submitted 9 October, 2018; originally announced October 2018.

    Comments: Proc. of 2018 Advances in Neural Information Processing Systems (NIPS 2018)

  19. arXiv:1712.07102  [pdf, other]

    stat.ML cs.LG

    On Data-Dependent Random Features for Improved Generalization in Supervised Learning

    Authors: Shahin Shahrampour, Ahmad Beirami, Vahid Tarokh

    Abstract: The randomized-feature approach has been successfully employed in large-scale kernel approximation and supervised learning. The distribution from which the random features are drawn impacts the number of features required to efficiently perform a learning task. Recently, it has been shown that employing data-dependent randomization improves the performance in terms of the required number of random…

    Submitted 19 December, 2017; originally announced December 2017.

    Comments: 12 pages; (pages 1-8) to appear in Proc. of AAAI Conference on Artificial Intelligence (AAAI), 2018

  20. arXiv:1711.05323  [pdf, other]

    stat.ML cs.LG

    On Optimal Generalizability in Parametric Learning

    Authors: Ahmad Beirami, Meisam Razaviyayn, Shahin Shahrampour, Vahid Tarokh

    Abstract: We consider the parametric learning problem, where the objective of the learner is determined by a parametric loss function. Employing empirical risk minimization with possibly regularization, the inferred parameter vector will be biased toward the training samples. Such bias is measured by the cross validation procedure in practice where the data set is partitioned into a training set used for tr…

    Submitted 14 November, 2017; originally announced November 2017.

    Comments: Proc. of 2017 Advances in Neural Information Processing Systems (NIPS 2017)

  21. arXiv:1707.02649  [pdf, ps, other]

    stat.ML cs.LG

    Nonlinear Sequential Accepts and Rejects for Identification of Top Arms in Stochastic Bandits

    Authors: Shahin Shahrampour, Vahid Tarokh

    Abstract: We address the M-best-arm identification problem in multi-armed bandits. A player has a limited budget to explore K arms (M<K), and once pulled, each arm yields a reward drawn (independently) from a fixed, unknown distribution. The goal is to find the top M arms in the sense of expected reward. We develop an algorithm which proceeds in rounds to deactivate arms iteratively. At each round, the budg…

    Submitted 9 July, 2017; originally announced July 2017.

    Comments: 7 pages

  22. arXiv:1702.06219  [pdf, other]

    math.OC cs.MA stat.ML

    An Online Optimization Approach for Multi-Agent Tracking of Dynamic Parameters in the Presence of Adversarial Noise

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This paper addresses tracking of a moving target in a multi-agent network. The target follows a linear dynamics corrupted by an adversarial noise, i.e., the noise is not generated from a statistical distribution. The location of the target at each time induces a global time-varying loss function, and the global loss is a sum of local losses, each of which is associated to one agent. Agents noisy o…

    Submitted 20 February, 2017; originally announced February 2017.

    Comments: 8 pages, To appear in American Control Conference 2017

  23. arXiv:1609.02845  [pdf, other]

    math.OC cs.DC cs.LG stat.ML

    Distributed Online Optimization in Dynamic Environments Using Mirror Descent

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: This work addresses decentralized online optimization in non-stationary environments. A network of agents aim to track the minimizer of a global time-varying convex function. The minimizer evolves according to a known dynamics corrupted by an unknown, unstructured noise. At each time, the global function can be cast as a sum of a finite number of local functions, each of which is assigned to one a…

    Submitted 9 September, 2016; originally announced September 2016.

  24. On Sequential Elimination Algorithms for Best-Arm Identification in Multi-Armed Bandits

    Authors: Shahin Shahrampour, Mohammad Noshad, Vahid Tarokh

    Abstract: We consider the best-arm identification problem in multi-armed bandits, which focuses purely on exploration. A player is given a fixed budget to explore a finite set of arms, and the rewards of each arm are drawn independently from a fixed, unknown distribution. The player aims to identify the arm with the largest expected reward. We propose a general framework to unify sequential elimination algo…

    Submitted 13 April, 2017; v1 submitted 8 September, 2016; originally announced September 2016.

  25. arXiv:1509.04332  [pdf, ps, other]

    eess.SY math.OC stat.ML

    Learning without Recall by Random Walks on Directed Graphs

    Authors: Mohammad Amin Rahimian, Shahin Shahrampour, Ali Jadbabaie

    Abstract: We consider a network of agents that aim to learn some unknown state of the world using private observations and exchange of beliefs. At each time, agents observe private signals generated based on the true unknown state. Each agent might not be able to distinguish the true state based only on her private observations. This occurs when some other states are observationally equivalent to the true s…

    Submitted 14 September, 2015; originally announced September 2015.

    Comments: 6 pages, To Appear in Conference on Decision and Control 2015

  26. arXiv:1503.03517  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Switching to Learn

    Authors: Shahin Shahrampour, Mohammad Amin Rahimian, Ali Jadbabaie

    Abstract: A network of agents attempt to learn some unknown state of the world drawn by nature from a finite set. Agents observe private signals conditioned on the true state, and form beliefs about the unknown state accordingly. Each agent may face an identification problem in the sense that she cannot distinguish the truth in isolation. However, by communicating with each other, agents are able to benefit…

    Submitted 11 March, 2015; originally announced March 2015.

    Comments: 6 pages, To appear in American Control Conference 2015

  27. arXiv:1501.06225  [pdf, ps, other]

    cs.LG math.OC stat.ML

    Online Optimization : Competing with Dynamic Comparators

    Authors: Ali Jadbabaie, Alexander Rakhlin, Shahin Shahrampour, Karthik Sridharan

    Abstract: Recent literature on online learning has focused on developing adaptive algorithms that take advantage of a regularity of the sequence of observations, yet retain worst-case performance guarantees. A complementary direction is to develop prediction methods that perform well against complex benchmarks. In this paper, we address these two directions together. We present a fully adaptive method that…

    Submitted 25 January, 2015; originally announced January 2015.

    Comments: 23 pages, To appear in International Conference on Artificial Intelligence and Statistics (AISTATS) 2015

  28. arXiv:1409.8606  [pdf, other]

    math.OC cs.LG cs.SI stat.ML

    Distributed Detection : Finite-time Analysis and Impact of Network Topology

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of distributed detection in multi-agent networks. Agents receive private signals about an unknown state of the world. The underlying state is globally identifiable, yet informative signals may be dispersed throughout the network. Using an optimization-based framework, we develop an iterative local strategy for updating individual beliefs. In contrast to the existin…

    Submitted 30 September, 2014; originally announced September 2014.

    Comments: 29 pages, 5 figures

  29. arXiv:1310.0432  [pdf, ps, other]

    math.OC cs.LG cs.SI stat.ML

    Online Learning of Dynamic Parameters in Social Networks

    Authors: Shahin Shahrampour, Alexander Rakhlin, Ali Jadbabaie

    Abstract: This paper addresses the problem of online learning in a dynamic setting. We consider a social network in which each individual observes a private signal about the underlying state of the world and communicates with her neighbors at each time period. Unlike many existing approaches, the underlying state is dynamic, and evolves according to a geometric random walk. We view the scenario as an optimi…

    Submitted 1 October, 2013; originally announced October 2013.

    Comments: 12 pages, To appear in Neural Information Processing Systems (NIPS) 2013

  30. arXiv:1309.2350  [pdf, ps, other]

    cs.LG cs.SI math.OC stat.ML

    Exponentially Fast Parameter Estimation in Networks Using Distributed Dual Averaging

    Authors: Shahin Shahrampour, Ali Jadbabaie

    Abstract: In this paper we present an optimization-based view of distributed parameter estimation and observational social learning in networks. Agents receive a sequence of random, independent and identically distributed (i.i.d.) signals, each of which individually may not be informative about the underlying true state, but the signals together are globally informative enough to make the true state identif…

    Submitted 9 September, 2013; originally announced September 2013.

    Comments: 6 pages, To appear in Conference on Decision and Control 2013