Showing 1–26 of 26 results for author: Rajaratnam, B

  1. arXiv:2408.11718  [pdf, other]

    stat.ME stat.ML

    Scalable and non-iterative graphical model estimation

    Authors: Kshitij Khare, Syed Rahman, Bala Rajaratnam, Jiayuan Zhou

    Abstract: Graphical models have found widespread applications in many areas of modern statistics and machine learning. Iterative Proportional Fitting (IPF) and its variants have become the default method for undirected graphical model estimation, and are thus ubiquitous in the field. As the IPF is an iterative approach, it is not always readily scalable to modern high-dimensional data regimes. In this paper…

    Submitted 21 August, 2024; originally announced August 2024.

  2. arXiv:1703.09163   

    stat.CO

    Scalable Bayesian shrinkage and uncertainty quantification in high-dimensional regression

    Authors: Bala Rajaratnam, Doug Sparks, Kshitij Khare, Liyuan Zhang

    Abstract: Bayesian shrinkage methods have generated a lot of recent interest as tools for high-dimensional regression and model selection. These methods naturally facilitate tractable uncertainty quantification and incorporation of prior information. A common feature of these models, including the Bayesian lasso, global-local shrinkage priors, and spike-and-slab priors is that the corresponding priors on th…

    Submitted 14 April, 2017; v1 submitted 27 March, 2017; originally announced March 2017.

    Comments: This paper is already available at arXiv with identifier arXiv:1509.03697, and was submitted again due to some confusion

  3. arXiv:1610.02436  [pdf, other]

    stat.ME

    A convex framework for high-dimensional sparse Cholesky based covariance estimation

    Authors: Kshitij Khare, Sang Oh, Syed Rahman, Bala Rajaratnam

    Abstract: Covariance estimation for high-dimensional datasets is a fundamental problem in modern day statistics with numerous applications. In these high dimensional datasets, the number of variables p is typically larger than the sample size n. A popular way of tackling this challenge is to induce sparsity in the covariance matrix, its inverse or a relevant transformation. In particular, methods inducing s…

    Submitted 7 October, 2016; originally announced October 2016.

  4. arXiv:1606.00033  [pdf, other]

    stat.ME

    Generalized Pseudolikelihood Methods for Inverse Covariance Estimation

    Authors: Alnur Ali, Kshitij Khare, Sang-Yun Oh, Bala Rajaratnam

    Abstract: We introduce PseudoNet, a new pseudolikelihood-based estimator of the inverse covariance matrix, that has a number of useful statistical and computational properties. We show, through detailed experiments with synthetic and also real-world finance as well as wind power data, that PseudoNet outperforms related methods in terms of estimation error and support recovery, making it well-suited for use…

    Submitted 14 October, 2016; v1 submitted 31 May, 2016; originally announced June 2016.

  5. arXiv:1509.03697  [pdf, other]

    stat.ME

    Scalable Bayesian shrinkage and uncertainty quantification for high-dimensional regression

    Authors: Bala Rajaratnam, Doug Sparks, Kshitij Khare, Liyuan Zhang

    Abstract: Bayesian shrinkage methods have generated a lot of recent interest as tools for high-dimensional regression and model selection. These methods naturally facilitate tractable uncertainty quantification and incorporation of prior information. This benefit has led to extensive use of the Bayesian shrinkage methods across diverse applications. A common feature of these models is that the corresponding…

    Submitted 19 April, 2017; v1 submitted 11 September, 2015; originally announced September 2015.

    MSC Class: 62J05

  6. arXiv:1508.00947  [pdf, other]

    math.ST math.PR stat.ME

    MCMC-Based Inference in the Era of Big Data: A Fundamental Analysis of the Convergence Complexity of High-Dimensional Chains

    Authors: Bala Rajaratnam, Doug Sparks

    Abstract: Markov chain Monte Carlo (MCMC) lies at the core of modern Bayesian methodology, much of which would be impossible without it. Thus, the convergence properties of MCMCs have received significant attention, and in particular, proving (geometric) ergodicity is of critical interest. Trust in the ability of MCMCs to sample from modern-day high-dimensional posteriors, however, has been limited by a wid…

    Submitted 26 August, 2015; v1 submitted 4 August, 2015; originally announced August 2015.

    MSC Class: 60J05; 65C40

  7. arXiv:1505.02475  [pdf, other]

    math.ST stat.ML

    Foundational principles for large scale inference: Illustrations through correlation mining

    Authors: Alfred O. Hero, Bala Rajaratnam

    Abstract: When can reliable inference be drawn in the "Big Data" context? This paper presents a framework for answering this fundamental question in the context of correlation mining, with implications for general large scale inference. In large scale data applications like genomics, connectomics, and eco-informatics the dataset is often variable-rich but sample-starved: a regime where the number $n$ of acq…

    Submitted 18 May, 2015; v1 submitted 10 May, 2015; originally announced May 2015.

  8. arXiv:1505.00703  [pdf, other]

    stat.ME

    Bayesian inference for Gaussian graphical models beyond decomposable graphs

    Authors: Kshitij Khare, Bala Rajaratnam, Abhishek Saha

    Abstract: Bayesian inference for graphical models has received much attention in the literature in recent years. It is well known that when the graph G is decomposable, Bayesian inference is significantly more tractable than in the general non-decomposable setting. Penalized likelihood inference on the other hand has made tremendous gains in the past few years in terms of scalability and tractability. Bayes…

    Submitted 4 May, 2015; originally announced May 2015.

    MSC Class: 62-09; 62F15

  9. Two-stage Sampling, Prediction and Adaptive Regression via Correlation Screening (SPARCS)

    Authors: Hamed Firouzi, Alfred Hero, Bala Rajaratnam

    Abstract: This paper proposes a general adaptive procedure for budget-limited predictor design in high dimensions called two-stage Sampling, Prediction and Adaptive Regression via Correlation Screening (SPARCS). SPARCS can be applied to high dimensional prediction problems in experimental science, medicine, finance, and engineering, as illustrated by the following. Suppose one wishes to run a sequence of ex…

    Submitted 1 October, 2016; v1 submitted 22 February, 2015; originally announced February 2015.

    Comments: To appear in IEEE Transactions on Information Theory. 40 Pages. arXiv admin note: text overlap with arXiv:1303.2378

  10. arXiv:1502.01073  [pdf, other]

    stat.ME

    Extracting Common Time Trends from Concurrent Time Series: Maximum Autocorrelation Factors with Application to Tree Ring Time Series Data

    Authors: Matz A. Haugen, Bala Rajaratnam, Paul Switzer

    Abstract: Concurrent time series commonly arise in various applications, including when monitoring the environment such as in air quality measurement networks, weather stations, oceanographic buoys, or in paleo form such as lake sediments, tree rings, ice cores, or coral isotopes, with each monitoring or sampling site providing one of the time series. The goal in such applications is to extract a common tim…

    Submitted 17 October, 2015; v1 submitted 3 February, 2015; originally announced February 2015.

    Comments: 38 pages, 12 figures

  11. arXiv:1502.00471  [pdf, other]

    stat.ME

    Towards a sparse, scalable, and stably positive definite (inverse) covariance estimator

    Authors: Sang-Yun Oh, Bala Rajaratnam, Joong-Ho Won

    Abstract: High dimensional covariance estimation and graphical models is a contemporary topic in statistics and machine learning having widespread applications. An important line of research in this regard is to shrink the extreme spectrum of the covariance matrix estimators. A separate line of research in the literature has considered sparse inverse covariance estimation which in turn gives rise to graphic…

    Submitted 26 June, 2016; v1 submitted 2 February, 2015; originally announced February 2015.

    Comments: 19 pages; 1 figure

  12. arXiv:1409.3768  [pdf, other]

    stat.CO cs.LG stat.ML

    Optimization Methods for Sparse Pseudo-Likelihood Graphical Model Selection

    Authors: Sang-Yun Oh, Onkar Dalal, Kshitij Khare, Bala Rajaratnam

    Abstract: Sparse high dimensional graphical model selection is a popular topic in contemporary machine learning. To this end, various useful approaches have been proposed in the context of $\ell_1$-penalized estimation in the Gaussian framework. Though many of these inverse covariance estimation approaches are demonstrably scalable and have leveraged recent advances in convex optimization, they still depend…

    Submitted 12 September, 2014; originally announced September 2014.

    Comments: NIPS accepted version

    Journal ref: Advances in Neural Information Processing Systems 27, 667-675 (2014)

  13. arXiv:1405.3034  [pdf, other]

    stat.CO stat.ML

    G-AMA: Sparse Gaussian graphical model estimation via alternating minimization

    Authors: Onkar Dalal, Bala Rajaratnam

    Abstract: Several methods have been recently proposed for estimating sparse Gaussian graphical models using $\ell_{1}$ regularization on the inverse covariance matrix. Despite recent advances, contemporary applications require methods that are even faster in order to handle ill-conditioned high dimensional modern day datasets. In this paper, we propose a new method, G-AMA, to solve the sparse inverse covari…

    Submitted 14 May, 2014; v1 submitted 13 May, 2014; originally announced May 2014.

    Comments: 21 pages, 3 figures

  14. arXiv:1404.5100  [pdf, ps, other]

    math.OC stat.CO

    Convergence of cyclic coordinatewise l1 minimization

    Authors: Kshitij Khare, Bala Rajaratnam

    Abstract: We consider the general problem of minimizing an objective function which is the sum of a convex function (not strictly convex) and absolute values of a subset of variables (or equivalently the l1-norm of the variables). This problem appears extensively in modern statistical applications associated with high-dimensional data or "big data", and corresponds to optimizing l1-regularized likelihoods…

    Submitted 29 January, 2015; v1 submitted 20 April, 2014; originally announced April 2014.

    MSC Class: 49N99; 62H99

  15. arXiv:1401.2480  [pdf, other]

    stat.ME

    Lasso Regression: Estimation and Shrinkage via Limit of Gibbs Sampling

    Authors: Bala Rajaratnam, Steven Roberts, Doug Sparks, Onkar Dalal

    Abstract: The application of the lasso is espoused in high-dimensional settings where only a small number of the regression coefficients are believed to be nonzero. Moreover, statistical properties of high-dimensional lasso estimators are often proved under the assumption that the correlation between the predictors is bounded. In this vein, coordinatewise methods, the most common means of computing the lass…

    Submitted 5 January, 2015; v1 submitted 10 January, 2014; originally announced January 2014.

  16. arXiv:1310.2641  [pdf, ps, other]

    math.PR stat.ML

    Duality in Graphical Models

    Authors: Dhafer Malouche, Bala Rajaratnam, Benjamin T. Rolfs

    Abstract: Graphical models have proven to be powerful tools for representing high-dimensional systems of random variables. One example of such a model is the undirected graph, in which lack of an edge represents conditional independence between two random variables given the rest. Another example is the bidirected graph, in which absence of edges encodes pairwise marginal independence. Both of these classes…

    Submitted 9 October, 2013; originally announced October 2013.

  17. Statistical paleoclimate reconstructions via Markov random fields

    Authors: Dominique Guillot, Bala Rajaratnam, Julien Emile-Geay

    Abstract: Understanding centennial scale climate variability requires data sets that are accurate, long, continuous and of broad spatial coverage. Since instrumental measurements are generally only available after 1850, temperature fields must be reconstructed using paleoclimate archives, known as proxies. Various climate field reconstructions (CFR) methods have been proposed to relate past temperature to s…

    Submitted 2 June, 2015; v1 submitted 25 September, 2013; originally announced September 2013.

    Comments: Published at http://dx.doi.org/10.1214/14-AOAS794 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS794

    Journal ref: Annals of Applied Statistics 2015, Vol. 9, No. 1, 324-352

  18. arXiv:1308.5736  [pdf, other]

    stat.ME

    A Methodology for Robust Multiproxy Paleoclimate Reconstructions and Modeling of Temperature Conditional Quantiles

    Authors: Lucas Janson, Bala Rajaratnam

    Abstract: Great strides have been made in the field of reconstructing past temperatures based on models relating temperature to temperature-sensitive paleoclimate proxies. One of the goals of such reconstructions is to assess if current climate is anomalous in a millennial context. These regression based approaches model the conditional mean of the temperature distribution as a function of paleoclimate prox…

    Submitted 26 August, 2013; originally announced August 2013.

    MSC Class: 62J05

    Journal ref: Journal of the American Statistical Association 109 (2014) 63-77

  19. arXiv:1307.5381  [pdf, other]

    stat.ME stat.CO stat.ML

    A convex pseudo-likelihood framework for high dimensional partial correlation estimation with convergence guarantees

    Authors: Kshitij Khare, Sang-Yun Oh, Bala Rajaratnam

    Abstract: Sparse high dimensional graphical model selection is a topic of much interest in modern day statistics. A popular approach is to apply l1-penalties to either (1) parametric likelihoods, or, (2) regularized regression/pseudo-likelihoods, with the latter having the distinct advantage that they do not explicitly assume Gaussianity. As none of the popular methods proposed for solving pseudo-likelihood…

    Submitted 14 August, 2014; v1 submitted 20 July, 2013; originally announced July 2013.

    Journal ref: Journal of the Royal Statistical Society: Series B (Statistical Methodology) 77, 803-825 (2014)

  20. arXiv:1303.2378  [pdf, other]

    stat.ML

    Predictive Correlation Screening: Application to Two-stage Predictor Design in High Dimension

    Authors: Hamed Firouzi, Bala Rajaratnam, Alfred Hero

    Abstract: We introduce a new approach to variable selection, called Predictive Correlation Screening, for predictor design. Predictive Correlation Screening (PCS) implements false positive control on the selected variables, is well suited to small sample sizes, and is scalable to high dimensions. We establish asymptotic bounds for Familywise Error Rate (FWER), and resultant mean square error of a linear pre…

    Submitted 10 April, 2013; v1 submitted 10 March, 2013; originally announced March 2013.

    Comments: 31 pages, 9 figures, Appearing in Proceedings of the 16th International Conference on Artificial Intelligence and Statistics (AISTATS)

  21. arXiv:1211.2532  [pdf, other]

    stat.CO cs.LG stat.ML

    Iterative Thresholding Algorithm for Sparse Inverse Covariance Estimation

    Authors: Dominique Guillot, Bala Rajaratnam, Benjamin T. Rolfs, Arian Maleki, Ian Wong

    Abstract: The L1-regularized maximum likelihood estimation problem has recently become a topic of great interest within the machine learning, statistics, and optimization communities as a method for producing sparse inverse covariance estimators. In this paper, a proximal gradient method (G-ISTA) for performing L1-regularized covariance matrix estimation is presented. Although numerous algorithms have been…

    Submitted 26 November, 2012; v1 submitted 12 November, 2012; originally announced November 2012.

    Comments: 25 pages, 1 figure, 4 tables. Conference paper

  22. arXiv:1202.4198  [pdf, ps, other]

    math.PR math.ST stat.AP

    Successive Standardization of Rectangular Arrays

    Authors: Richard A. Olshen, Bala Rajaratnam

    Abstract: In this note we illustrate and develop further, with mathematics and examples, the work on successive standardization (or normalization) that was studied earlier by the same authors in Olshen and Rajaratnam (2010) and Olshen and Rajaratnam (2011). Thus, we deal with successive iterations applied to rectangular arrays of numbers, where to avoid technical difficulties an array has at least three rows…

    Submitted 19 February, 2012; originally announced February 2012.

    MSC Class: 62H05; 60F15; 60G46

  23. arXiv:1111.2667  [pdf, ps, other]

    stat.ML stat.CO

    A note on the lack of symmetry in the graphical lasso

    Authors: Benjamin T. Rolfs, Bala Rajaratnam

    Abstract: The graphical lasso (glasso) is a widely-used fast algorithm for estimating sparse inverse covariance matrices. The glasso solves an L1 penalized maximum likelihood problem and is available as an R library on CRAN. The output from the glasso, a regularized covariance matrix estimate and a sparse inverse covariance matrix estimate, not only identifies a graphical model but can also serve as intermediate…

    Submitted 23 July, 2012; v1 submitted 11 November, 2011; originally announced November 2011.

    Comments: 9 pages, 2 figures

  24. arXiv:1109.4371  [pdf, other]

    math.ST stat.OT

    High dimensional Bayesian inference for Gaussian directed acyclic graph models

    Authors: Emanuel Ben-David, Tianxi Li, Helene Massam, Bala Rajaratnam

    Abstract: In this paper, we consider Gaussian models Markov with respect to an arbitrary DAG. We first construct a family of conjugate priors for the Cholesky parametrization of the covariance matrix of such models. This family has as many shape parameters as the DAG has vertices, and naturally extends the work of Geiger and Heckerman [8]. From these distributions, we derive prior distributions for the cova…

    Submitted 5 March, 2015; v1 submitted 20 September, 2011; originally announced September 2011.

    Comments: 55 pages, 8 figures, 12 tables

    MSC Class: 62-09; 62E10; 62J05

  25. Discussion of: A statistical analysis of multiple temperature proxies: Are reconstructions of surface temperatures over the last 1000 years reliable?

    Authors: Peter Craigmile, Bala Rajaratnam

    Abstract: Discussion of "A statistical analysis of multiple temperature proxies: Are reconstructions of surface temperatures over the last 1000 years reliable?" by B.B. McShane and A.J. Wyner [arXiv:1104.4002]

    Submitted 21 April, 2011; originally announced April 2011.

    Comments: Published at http://dx.doi.org/10.1214/10-AOAS398F in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS398F

    Journal ref: Annals of Applied Statistics 2011, Vol. 5, No. 1, 88-90

  26. arXiv:1102.1204  [pdf, ps, other]

    stat.ML

    Large Scale Correlation Screening

    Authors: Alfred O. Hero, Bala Rajaratnam

    Abstract: This paper treats the problem of screening for variables with high correlations in high dimensional data in which there can be many fewer samples than variables. We focus on threshold-based correlation screening methods for three related applications: screening for variables with large correlations within a single treatment (autocorrelation screening); screening for variables with large cross-corr…

    Submitted 26 June, 2011; v1 submitted 6 February, 2011; originally announced February 2011.

    Comments: 33 pages, 7 figures; Changes in version 2: There are no changes in the technical material in this revised version. The only changes are correcting typographical errors and referencing related work in the area. There is also material in the introduction where more context to the correlation screening problem is given (especially in terms of relationships to other testing methods)