Showing 1–38 of 38 results for author: Póczos, B

Searching in archive math.
  1. arXiv:2211.03970  [pdf, other]

    cs.LG math.OC

    On the Algorithmic Stability and Generalization of Adaptive Optimization Methods

    Authors: Han Nguyen, Hai Pham, Sashank J. Reddi, Barnabás Póczos

    Abstract: Despite their popularity in deep learning and machine learning in general, the theoretical properties of adaptive optimizers such as Adagrad, RMSProp, Adam or AdamW are not yet fully understood. In this paper, we develop a novel framework to study the stability and generalization of these optimization methods. Based on this framework, we show provable guarantees about such properties that depend h…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 21 pages including appendix

  2. arXiv:2004.08597  [pdf, other]

    math.ST cs.LG stat.ML

    Robust Density Estimation under Besov IPM Losses

    Authors: Ananya Uppal, Shashank Singh, Barnabas Poczos

    Abstract: We study minimax convergence rates of nonparametric density estimation in the Huber contamination model, in which a proportion of the data comes from an unknown outlier distribution. We provide the first results for this problem under a large family of losses, called Besov integral probability metrics (IPMs), that includes $\mathcal{L}^p$, Wasserstein, Kolmogorov-Smirnov, and other common distance…

    Submitted 6 September, 2021; v1 submitted 18 April, 2020; originally announced April 2020.

  3. arXiv:2002.08528  [pdf, other]

    cs.LG math.OC stat.ML

    Adaptive Sampling Distributed Stochastic Variance Reduced Gradient for Heterogeneous Distributed Datasets

    Authors: Ilqar Ramazanli, Han Nguyen, Hai Pham, Sashank J. Reddi, Barnabas Poczos

    Abstract: We study distributed optimization algorithms for minimizing the average of \emph{heterogeneous} functions distributed across several machines with a focus on communication efficiency. In such settings, naively using the classical stochastic gradient descent (SGD) or its variants (e.g., SVRG) with a uniform sampling of machines typically yields poor performance. It often leads to the dependence of…

    Submitted 17 November, 2020; v1 submitted 19 February, 2020; originally announced February 2020.

  4. arXiv:1902.03511  [pdf, other]

    math.ST cs.IT cs.LG stat.ML

    Nonparametric Density Estimation & Convergence Rates for GANs under Besov IPM Losses

    Authors: Ananya Uppal, Shashank Singh, Barnabás Póczos

    Abstract: We study the problem of estimating a nonparametric probability density under a large family of losses called Besov IPMs, which include, for example, $\mathcal{L}^p$ distances, total variation distance, and generalizations of both Wasserstein and Kolmogorov-Smirnov distances. For a wide variety of settings, we provide both lower and upper bounds, identifying precisely how the choice of loss functio…

    Submitted 13 January, 2020; v1 submitted 9 February, 2019; originally announced February 2019.

    Comments: Advances in Neural Information Processing Systems. 2019

  5. arXiv:1810.02054  [pdf, other]

    cs.LG math.OC stat.ML

    Gradient Descent Provably Optimizes Over-parameterized Neural Networks

    Authors: Simon S. Du, Xiyu Zhai, Barnabas Poczos, Aarti Singh

    Abstract: One of the mysteries in the success of neural networks is randomly initialized first order methods like gradient descent can achieve zero training loss even though the objective function is non-convex and non-smooth. This paper demystifies this surprising phenomenon for two-layer fully connected ReLU activated neural networks. For an $m$ hidden node shallow neural network with ReLU activation and…

    Submitted 4 February, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: ICLR 2019

  6. arXiv:1805.08836  [pdf, other]

    math.ST cs.IT stat.ML

    Nonparametric Density Estimation under Adversarial Losses

    Authors: Shashank Singh, Ananya Uppal, Boyue Li, Chun-Liang Li, Manzil Zaheer, Barnabás Póczos

    Abstract: We study minimax convergence rates of nonparametric density estimation under a large class of loss functions called "adversarial losses", which, besides classical $\mathcal{L}^p$ losses, includes maximum mean discrepancy (MMD), Wasserstein distance, and total variation distance. These losses are closely related to the losses encoded by discriminator networks in generative adversarial networks (GAN…

    Submitted 28 October, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

  7. arXiv:1803.11451  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Minimax Estimation of Quadratic Fourier Functionals

    Authors: Shashank Singh, Bharath K. Sriperumbudur, Barnabás Póczos

    Abstract: We study estimation of (semi-)inner products between two nonparametric probability distributions, given IID samples from each distribution. These products include relatively well-studied classical $\mathcal{L}^2$ and Sobolev inner products, as well as those induced by translation-invariant reproducing kernels, for which we believe our results are the first. We first propose estimators for these qu…

    Submitted 1 September, 2018; v1 submitted 30 March, 2018; originally announced March 2018.

  8. arXiv:1802.08855  [pdf, ps, other]

    math.ST cs.IT cs.LG stat.ML

    Minimax Distribution Estimation in Wasserstein Distance

    Authors: Shashank Singh, Barnabás Póczos

    Abstract: The Wasserstein metric is an important measure of distance between probability distributions, with applications in machine learning, statistics, probability theory, and data analysis. This paper provides upper and lower bounds on statistical minimax rates for the problem of estimating a probability distribution under Wasserstein loss, using only metric properties, such as covering and packing numb…

    Submitted 6 November, 2019; v1 submitted 24 February, 2018; originally announced February 2018.

  9. arXiv:1712.00779  [pdf, other]

    cs.LG cs.AI cs.CV math.OC stat.ML

    Gradient Descent Learns One-hidden-layer CNN: Don't be Afraid of Spurious Local Minima

    Authors: Simon S. Du, Jason D. Lee, Yuandong Tian, Barnabas Poczos, Aarti Singh

    Abstract: We consider the problem of learning a one-hidden-layer neural network with non-overlapping convolutional layer and ReLU activation, i.e., $f(\mathbf{Z}, \mathbf{w}, \mathbf{a}) = \sum_j a_jσ(\mathbf{w}^T\mathbf{Z}_j)$, in which both the convolutional weights $\mathbf{w}$ and the output weights $\mathbf{a}$ are parameters to be learned. When the labels are the outputs from a teacher network of the…

    Submitted 14 June, 2018; v1 submitted 3 December, 2017; originally announced December 2017.

    Comments: Accepted by ICML 2018

  10. arXiv:1708.08587  [pdf, other]

    math.ST cs.IT cs.LG stat.ML

    On the Reconstruction Risk of Convolutional Sparse Dictionary Learning

    Authors: Shashank Singh, Barnabás Póczos, Jian Ma

    Abstract: Sparse dictionary learning (SDL) has become a popular method for adaptively identifying parsimonious representations of a dataset, a fundamental problem in machine learning and signal processing. While most work on SDL assumes a training dataset of independent and identically distributed samples, a variant known as convolutional sparse dictionary learning (CSDL) relaxes this assumption, allowing m…

    Submitted 24 February, 2018; v1 submitted 29 August, 2017; originally announced August 2017.

  11. arXiv:1705.10412  [pdf, other]

    math.OC cs.LG stat.ML

    Gradient Descent Can Take Exponential Time to Escape Saddle Points

    Authors: Simon S. Du, Chi Jin, Jason D. Lee, Michael I. Jordan, Barnabas Poczos, Aarti Singh

    Abstract: Although gradient descent (GD) almost always escapes saddle points asymptotically [Lee et al., 2016], this paper shows that even with fairly natural random initialization schemes and non-pathological functions, GD can be significantly slowed down by saddle points, taking exponential time to escape. On the other hand, gradient descent with perturbations [Ge et al., 2015, Jin et al., 2017] is not sl…

    Submitted 5 November, 2017; v1 submitted 29 May, 2017; originally announced May 2017.

    Comments: Accepted by NIPS 2017

  12. arXiv:1702.07803  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Nonparanormal Information Estimation

    Authors: Shashank Singh, Barnabás Póczos

    Abstract: We study the problem of using i.i.d. samples from an unknown multivariate probability distribution $p$ to estimate the mutual information of $p$. This problem has recently received attention in two settings: (1) where $p$ is assumed to be Gaussian and (2) where $p$ is assumed only to lie in a large nonparametric smoothness class. Estimators proposed for the Gaussian case converge in high dimension…

    Submitted 24 February, 2017; originally announced February 2017.

  13. arXiv:1608.06879  [pdf, other]

    math.OC cs.LG stat.ML

    AIDE: Fast and Communication Efficient Distributed Optimization

    Authors: Sashank J. Reddi, Jakub Konečný, Peter Richtárik, Barnabás Póczos, Alex Smola

    Abstract: In this paper, we present two new communication-efficient methods for distributed minimization of an average of functions. The first algorithm is an inexact variant of the DANE algorithm that allows any local algorithm to return an approximate solution to a local subproblem. We show that such a strategy does not affect the theoretical guarantees of DANE significantly. In fact, our approach can be…

    Submitted 24 August, 2016; originally announced August 2016.

  14. arXiv:1607.08254  [pdf, other]

    math.OC cs.LG stat.ML

    Stochastic Frank-Wolfe Methods for Nonconvex Optimization

    Authors: Sashank J. Reddi, Suvrit Sra, Barnabas Poczos, Alex Smola

    Abstract: We study Frank-Wolfe methods for nonconvex stochastic and finite-sum optimization problems. Frank-Wolfe methods (in the convex case) have gained tremendous recent interest in machine learning and optimization communities due to their projection-free property and their ability to exploit structured constraints. However, our understanding of these algorithms in the nonconvex setting is fairly limite…

    Submitted 29 July, 2016; v1 submitted 27 July, 2016; originally announced July 2016.

  15. arXiv:1606.01554  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Finite-Sample Analysis of Fixed-k Nearest Neighbor Density Functional Estimators

    Authors: Shashank Singh, Barnabás Póczos

    Abstract: We provide finite-sample analysis of a general framework for using k-nearest neighbor statistics to estimate functionals of a nonparametric continuous probability density, including entropies and divergences. Rather than plugging a consistent density estimate (which requires $k \to \infty$ as the sample size $n \to \infty$) into the functional of interest, the estimators we consider fix k and perf…

    Submitted 5 June, 2016; originally announced June 2016.

    Comments: 16 pages, 0 figures

  16. arXiv:1605.06900  [pdf, other]

    math.OC cs.LG stat.ML

    Fast Stochastic Methods for Nonsmooth Nonconvex Optimization

    Authors: Sashank J. Reddi, Suvrit Sra, Barnabas Poczos, Alex Smola

    Abstract: We analyze stochastic algorithms for optimizing nonconvex, nonsmooth finite-sum problems, where the nonconvex part is smooth and the nonsmooth part is convex. Surprisingly, unlike the smooth case, our knowledge of this fundamental problem is very limited. For example, it is not known whether the proximal stochastic gradient method with constant minibatch converges to a stationary point. To tackle…

    Submitted 23 May, 2016; originally announced May 2016.

  17. arXiv:1605.05785  [pdf, other]

    math.ST cs.IT stat.ML

    Efficient Nonparametric Smoothness Estimation

    Authors: Shashank Singh, Simon S. Du, Barnabás Póczos

    Abstract: Sobolev quantities (norms, inner products, and distances) of probability density functions are important in the theory of nonparametric statistics, but have rarely been used in practice, partly due to a lack of practical estimators. They also include, as special cases, $L^2$ quantities which are used in many applications. We propose and analyze a family of estimators for Sobolev quantities of unkn…

    Submitted 21 July, 2016; v1 submitted 18 May, 2016; originally announced May 2016.

  18. arXiv:1603.08589  [pdf, other]

    cs.IT math.ST stat.ML

    Generalized Exponential Concentration Inequality for Rényi Divergence Estimation

    Authors: Shashank Singh, Barnabás Póczos

    Abstract: Estimating divergences in a consistent way is of great importance in many machine learning tasks. Although this is a fundamental problem in nonparametric statistics, to the best of our knowledge there has been no finite sample exponential inequality convergence bound derived for any divergence estimators. The main contribution of our work is to provide such a bound for an estimator of Rényi-$α$ di…

    Submitted 28 March, 2016; originally announced March 2016.

    Comments: In 31st International Conference on Machine Learning (ICML), 2014

  19. arXiv:1603.08578  [pdf, ps, other]

    math.ST cs.IT stat.ML

    Analysis of k-Nearest Neighbor Distances with Application to Entropy Estimation

    Authors: Shashank Singh, Barnabás Póczos

    Abstract: Estimating entropy and mutual information consistently is important for many machine learning applications. The Kozachenko-Leonenko (KL) estimator (Kozachenko & Leonenko, 1987) is a widely used nonparametric estimator for the entropy of multivariate continuous random variables, as well as the basis of the mutual information estimator of Kraskov et al. (2004), perhaps the most widely used estimator…

    Submitted 21 July, 2016; v1 submitted 28 March, 2016; originally announced March 2016.

  20. arXiv:1603.06160  [pdf, other]

    math.OC cs.LG cs.NE stat.ML

    Stochastic Variance Reduction for Nonconvex Optimization

    Authors: Sashank J. Reddi, Ahmed Hefny, Suvrit Sra, Barnabas Poczos, Alex Smola

    Abstract: We study nonconvex finite-sum problems and analyze stochastic variance reduced gradient (SVRG) methods for them. SVRG and related methods have recently surged into prominence for convex optimization given their edge over stochastic gradient descent (SGD); but their theoretical analysis almost exclusively assumes convexity. In contrast, we prove non-asymptotic rates of convergence (to stationary po…

    Submitted 4 April, 2016; v1 submitted 19 March, 2016; originally announced March 2016.

    Comments: Minor feedback changes

  21. arXiv:1603.06159  [pdf, other]

    math.OC cs.LG stat.ML

    Fast Incremental Method for Nonconvex Optimization

    Authors: Sashank J. Reddi, Suvrit Sra, Barnabas Poczos, Alex Smola

    Abstract: We analyze a fast incremental aggregated gradient method for optimizing nonconvex problems of the form $\min_x \sum_i f_i(x)$. Specifically, we analyze the SAGA algorithm within an Incremental First-order Oracle framework, and show that it converges to a stationary point provably faster than both gradient descent and stochastic gradient descent. We also discuss a Polyak's special class of nonconve…

    Submitted 19 March, 2016; originally announced March 2016.

  22. arXiv:1509.08535  [pdf, other]

    math.ST cs.AI cs.DM stat.ML

    Boolean Matrix Factorization and Noisy Completion via Message Passing

    Authors: Siamak Ravanbakhsh, Barnabas Poczos, Russell Greiner

    Abstract: Boolean matrix factorization and Boolean matrix completion from noisy observations are desirable unsupervised data-analysis methods due to their interpretability, but hard to perform due to their NP-hardness. We treat these problems as maximum a posteriori inference problems in a graphical model and present a message passing approach that scales linearly with the number of observations and factors…

    Submitted 4 February, 2016; v1 submitted 28 September, 2015; originally announced September 2015.

  23. arXiv:1508.00655  [pdf, other]

    math.ST cs.AI cs.IT cs.LG stat.ML

    Adaptivity and Computation-Statistics Tradeoffs for Kernel and Distance based High Dimensional Two Sample Testing

    Authors: Aaditya Ramdas, Sashank J. Reddi, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: Nonparametric two sample testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. We refer to the most common settings as mean difference alternatives (MDA), for testing differences only in first moments, and general difference alternatives (GDA), which is about testing for…

    Submitted 4 August, 2015; originally announced August 2015.

    Comments: 35 pages, 4 figures

  24. arXiv:1505.04215  [pdf, other]

    stat.ML cs.AI cs.LG math.ST

    An Analysis of Active Learning With Uniform Feature Noise

    Authors: Aaditya Ramdas, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: In active learning, the user sequentially chooses values for feature $X$ and an oracle returns the corresponding label $Y$. In this paper, we consider the effect of feature noise in active learning, which could arise either because $X$ itself is being measured, or it is corrupted in transmission to the oracle, or the oracle returns the label of a noisy version of the query point. In statistics, fe…

    Submitted 15 May, 2015; originally announced May 2015.

    Comments: 24 pages, 2 figures, published in the proceedings of the 17th International Conference on Artificial Intelligence and Statistics (AISTATS), 2014

  25. arXiv:1411.6314  [pdf, other]

    math.ST cs.AI cs.IT cs.LG stat.ML

    On the High-dimensional Power of Linear-time Kernel Two-Sample Testing under Mean-difference Alternatives

    Authors: Aaditya Ramdas, Sashank J. Reddi, Barnabas Poczos, Aarti Singh, Larry Wasserman

    Abstract: Nonparametric two sample testing deals with the question of consistently deciding if two distributions are different, given samples from both, without making any parametric assumptions about the form of the distributions. The current literature is split into two kinds of tests - those which are consistent without any assumptions about how the distributions may differ (\textit{general} alternatives…

    Submitted 23 November, 2014; originally announced November 2014.

    Comments: 25 pages, 5 figures

  26. arXiv:1411.2066  [pdf, ps, other]

    math.ST cs.LG math.FA stat.ML

    Learning Theory for Distribution Regression

    Authors: Zoltan Szabo, Bharath Sriperumbudur, Barnabas Poczos, Arthur Gretton

    Abstract: We focus on the distribution regression problem: regressing to vector-valued outputs from probability measures. Many important machine learning and statistical tasks fit into this framework, including multi-instance learning and point estimation problems without analytical solution (such as hyperparameter or entropy estimation). Despite the large number of available heuristics in the literature, t…

    Submitted 21 October, 2016; v1 submitted 7 November, 2014; originally announced November 2014.

    Comments: Final version appeared at JMLR, with supplement. Code: https://bitbucket.org/szzoli/ite/. arXiv admin note: text overlap with arXiv:1402.1754

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

    Journal ref: Journal of Machine Learning Research, 17(152):1-40, 2016

  27. arXiv:1406.2083  [pdf, other]

    stat.ML cs.IT cs.LG math.ST stat.ME

    On the Decreasing Power of Kernel and Distance based Nonparametric Hypothesis Tests in High Dimensions

    Authors: Sashank J. Reddi, Aaditya Ramdas, Barnabás Póczos, Aarti Singh, Larry Wasserman

    Abstract: This paper is about two related decision theoretic problems, nonparametric two-sample testing and independence testing. There is a belief that two recently proposed solutions, based on kernels and distances between pairs of points, behave well in high-dimensional settings. We identify different sources of misconception that give rise to the above belief. Specifically, we differentiate the hardness…

    Submitted 23 November, 2014; v1 submitted 9 June, 2014; originally announced June 2014.

    Comments: 19 pages, 9 figures, published in AAAI-15: The 29th AAAI Conference on Artificial Intelligence (with author order reversed from ArXiv)

  28. arXiv:1402.2966  [pdf, other]

    stat.ML math.ST

    Nonparametric Estimation of Renyi Divergence and Friends

    Authors: Akshay Krishnamurthy, Kirthevasan Kandasamy, Barnabas Poczos, Larry Wasserman

    Abstract: We consider nonparametric estimation of $L_2$, Renyi-$α$ and Tsallis-$α$ divergences between continuous distributions. Our approach is to construct estimators for particular integral functionals of two densities and translate them into divergence estimators. For the integral functionals, our estimators are based on corrections of a preliminary plug-in estimator. We show that these estimators achie…

    Submitted 12 May, 2014; v1 submitted 12 February, 2014; originally announced February 2014.

  29. arXiv:1402.1754  [pdf, ps, other]

    math.ST cs.LG math.FA stat.ML

    Two-stage Sampled Learning Theory on Distributions

    Authors: Zoltan Szabo, Arthur Gretton, Barnabas Poczos, Bharath Sriperumbudur

    Abstract: We focus on the distribution regression problem: regressing to a real-valued response from a probability distribution. Although there exist a large number of similarity measures between distributions, very little is known about their generalization performance in specific learning tasks. Learning problems formulated on distributions have an inherent two-stage sampled difficulty: in practice only s…

    Submitted 26 January, 2015; v1 submitted 7 February, 2014; originally announced February 2014.

    Comments: v6: accepted at AISTATS-2015 for oral presentation; final version; code: https://bitbucket.org/szzoli/ite/; extension to the misspecified and vector-valued case: https://arxiv.boxedpaper.com/abs/1411.2066

    MSC Class: 62G08; 46E22; 47B32 ACM Class: G.3; I.2.6

  30. arXiv:1311.2236  [pdf, other]

    stat.ML cs.LG math.ST

    Fast Distribution To Real Regression

    Authors: Junier B. Oliva, Willie Neiswanger, Barnabas Poczos, Jeff Schneider, Eric Xing

    Abstract: We study the problem of distribution to real-value regression, where one aims to regress a mapping $f$ that takes in a distribution input covariate $P\in \mathcal{I}$ (for a non-parametric family of distributions $\mathcal{I}$) and outputs a real-valued response $Y=f(P) + ε$. This setting was recently studied, and a "Kernel-Kernel" estimator was introduced and shown to have a polynomial rate of co…

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.

  31. arXiv:1311.2234  [pdf, other]

    stat.ML cs.LG math.ST

    FuSSO: Functional Shrinkage and Selection Operator

    Authors: Junier B. Oliva, Barnabas Poczos, Timothy Verstynen, Aarti Singh, Jeff Schneider, Fang-Cheng Yeh, Wen-Yih Tseng

    Abstract: We present the FuSSO, a functional analogue to the LASSO, that efficiently finds a sparse set of functional input covariates to regress a real-valued response against. The FuSSO does so in a semi-parametric fashion, making no parametric assumptions about the nature of input functional covariates and assuming a linear form to the mapping of functional covariates to the response. We provide a statis…

    Submitted 8 March, 2014; v1 submitted 9 November, 2013; originally announced November 2013.

  32. arXiv:1302.0082  [pdf, other]

    stat.ML cs.LG math.ST

    Distribution-Free Distribution Regression

    Authors: Barnabas Poczos, Alessandro Rinaldo, Aarti Singh, Larry Wasserman

    Abstract: `Distribution regression' refers to the situation where a response Y depends on a covariate P where P is a probability distribution. The model is Y=f(P) + mu where f is an unknown regression function and mu is a random error. Typically, we do not observe P directly, but rather, we observe a sample from P. In this paper we develop theory and methods for distribution-free versions of distribution re…

    Submitted 1 February, 2013; originally announced February 2013.

  33. arXiv:1206.4682  [pdf]

    cs.LG math.ST stat.ML

    Copula-based Kernel Dependency Measures

    Authors: Barnabas Poczos, Zoubin Ghahramani, Jeff Schneider

    Abstract: The paper presents a new copula based method for measuring dependence between random variables. Our approach extends the Maximum Mean Discrepancy to the copula of the joint distribution. We prove that this approach has several advantageous properties. Similarly to Shannon mutual information, the proposed dependence measure is invariant to any strictly increasing transformation of the marginal vari…

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  34. arXiv:1201.0341  [pdf, ps, other]

    math.OC cs.LG math.ST stat.ML

    Collaborative Filtering via Group-Structured Dictionary Learning

    Authors: Zoltan Szabo, Barnabas Poczos, Andras Lorincz

    Abstract: Structured sparse coding and the related structured dictionary learning problems are novel research areas in machine learning. In this paper we present a new application of structured dictionary learning for collaborative filtering based recommender systems. Our extensive numerical experiments demonstrate that the presented technique outperforms its state-of-the-art competitors and has several adv…

    Submitted 1 January, 2012; originally announced January 2012.

    Comments: A compressed version of the paper has been accepted for publication at the 10th International Conference on Latent Variable Analysis and Source Separation (LVA/ICA 2012)

    MSC Class: 65K10; 90C26; 49M37 (Primary) ACM Class: I.2.6; I.5.4

    Journal ref: International Conference on Latent Variable Analysis and Source Separation (LVA/ICA), vol. 7191 of LNCS, pp. 247-254, 2012

  35. arXiv:math/0703612  [pdf, other]

    math.ST

    Independent Process Analysis without A Priori Dimensional Information

    Authors: Barnabas Poczos, Zoltan Szabo, Melinda Kiszlinger, Andras Lorincz

    Abstract: Recently, several algorithms have been proposed for independent subspace analysis where hidden variables are i.i.d. processes. We show that these methods can be extended to certain AR, MA, ARMA and ARIMA tasks. Central to our paper is that we introduce a cascade of algorithms, which aims to solve these tasks without previous knowledge about the number and the dimensions of the hidden processes.…

    Submitted 20 March, 2007; originally announced March 2007.

    Comments: 9 pages, 2 figures

    Journal ref: ICA-2007, pages 252-259

  36. arXiv:math/0701210  [pdf, ps, other]

    math.ST stat.ML

    Undercomplete Blind Subspace Deconvolution

    Authors: Zoltan Szabo, Barnabas Poczos, Andras Lorincz

    Abstract: We introduce the blind subspace deconvolution (BSSD) problem, which is the extension of both the blind source deconvolution (BSD) and the independent subspace analysis (ISA) tasks. We examine the case of the undercomplete BSSD (uBSSD). Applying temporal concatenation we reduce this problem to ISA. The associated `high dimensional' ISA problem can be handled by a recent technique called joint f-d…

    Submitted 20 May, 2007; v1 submitted 7 January, 2007; originally announced January 2007.

    Comments: Final version, appeared in Journal of Machine Learning Research

    Journal ref: Zoltan Szabo, Barnabas Poczos, Andras Lorincz: Undercomplete Blind Subspace Deconvolution. Journal of Machine Learning Research 8(May):1063-1095, 2007

  37. arXiv:math/0608100  [pdf, ps, other]

    math.ST

    Separation Theorem for K-Independent Subspace Analysis with Sufficient Conditions

    Authors: Zoltan Szabo, Barnabas Poczos, Andras Lorincz

    Abstract: Here, a Separation Theorem about K-Independent Subspace Analysis (K real or complex), a generalization of K-Independent Component Analysis (KICA) is proven. According to the theorem, KISA estimation can be executed in two steps under certain conditions. In the first step, 1-dimensional KICA estimation is executed. In the second step, optimal permutation of the KICA elements is searched for. We p…

    Submitted 10 August, 2006; v1 submitted 3 August, 2006; originally announced August 2006.

    Comments: Reference [13]: corrected

  38. arXiv:math/0603535  [pdf, ps, other]

    math.ST

    Separation Theorem for Independent Subspace Analysis with Sufficient Conditions

    Authors: Zoltan Szabo, Barnabas Poczos, Andras Lorincz

    Abstract: Here, a separation theorem about Independent Subspace Analysis (ISA), a generalization of Independent Component Analysis (ICA) is proven. According to the theorem, ISA estimation can be executed in two steps under certain conditions. In the first step, 1-dimensional ICA estimation is executed. In the second step, optimal permutation of the ICA elements is searched for. We present sufficient cond…

    Submitted 23 March, 2006; v1 submitted 22 March, 2006; originally announced March 2006.

    Comments: 11 pages, 0 figures

    MSC Class: 62G05; 62M10