Skip to main content

Showing 1–10 of 10 results for author: Homrighausen, D

.
  1. Compressed and Penalized Linear Regression

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: Modern applications require methods that are computationally feasible on large datasets but also preserve statistical efficiency. Frequently, these two concerns are seen as contradictory: approximation methods that enable computation are assumed to degrade statistical performance relative to exact methods. In applied mathematics, where much of the current theoretical work on approximation resides,… ▽ More

    Submitted 22 May, 2017; originally announced May 2017.

    Comments: 39 pages, 9 figures

    Journal ref: Journal of Computational and Graphical Statistics (2020), Vol 29, pp. 309--322

  2. A study on tuning parameter selection for the high-dimensional lasso

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: High-dimensional predictive models, those with more measurements than observations, require regularization to be well defined, perform well empirically, and possess theoretical guarantees. The amount of regularization, often determined by tuning parameters, is integral to achieving good performance. One can choose the tuning parameter in a variety of ways, such as through resampling methods or gen… ▽ More

    Submitted 12 July, 2019; v1 submitted 3 February, 2016; originally announced February 2016.

    Comments: 64 pages, 11 figures

    Journal ref: Journal of Statistical Computation and Simulation (2018), vol. 88, pp. 2865-2892

  3. On the Nyström and Column-Sampling Methods for the Approximate Principal Components Analysis of Large Data Sets

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: In this paper we analyze approximate methods for undertaking a principal components analysis (PCA) on large data sets. PCA is a classical dimension reduction method that involves the projection of the data onto the subspace spanned by the leading eigenvectors of the covariance matrix. This projection can be used either for exploratory purposes or as an input for further analysis, e.g. regression.… ▽ More

    Submitted 2 February, 2016; originally announced February 2016.

    Comments: 20 pages

    Journal ref: Journal of Computational and Graphical Statistics, 25(2), 2016

  4. arXiv:1308.0810  [pdf, other

    math.ST stat.ML

    Risk-consistency of cross-validation with lasso-type procedures

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: The lasso and related sparsity inducing algorithms have been the target of substantial theoretical and applied research. Correspondingly, many results are known about their behavior for a fixed or optimally chosen tuning parameter specified up to unknown constants. In practice, however, this oracle tuning parameter is inaccessible so one must use the data to select one. Common statistical practice… ▽ More

    Submitted 21 June, 2016; v1 submitted 4 August, 2013; originally announced August 2013.

    Comments: 25 pages, 3 figures

  5. arXiv:1207.0538  [pdf, ps, other

    stat.ME

    Efficient Estimators for Sequential and Resolution-Limited Inverse Problems

    Authors: Darren Homrighausen, Christopher R. Genovese

    Abstract: A common problem in the sciences is that a signal of interest is observed only indirectly, through smooth functionals of the signal whose values are then obscured by noise. In such inverse problems, the functionals dampen or entirely eliminate some of the signal's interesting features. This makes it difficult or even impossible to fully reconstruct the signal, even without noise. In this paper, we… ▽ More

    Submitted 2 July, 2012; originally announced July 2012.

  6. arXiv:1206.6128  [pdf, ps, other

    math.ST

    Leave-one-out cross-validation is risk consistent for lasso

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: The lasso procedure is ubiquitous in the statistical and signal processing literature, and as such, is the target of substantial theoretical and applied research. While much of this research focuses on the desirable properties that lasso possesses---predictive risk consistency, sign consistency, correct model selection---all of it has assumes that the tuning parameter is chosen in an oracle fashio… ▽ More

    Submitted 4 August, 2013; v1 submitted 26 June, 2012; originally announced June 2012.

    Comments: 15 pages, 0 figures

  7. Regularization Techniques for PSF-Matching Kernels. I. Choice of Kernel Basis

    Authors: A. C. Becker, D. Homrighausen, A. J. Connolly, C. R. Genovese, R. Owen, S. J. Bickerton, R. H. Lupton

    Abstract: We review current methods for building PSF-matching kernels for the purposes of image subtraction or coaddition. Such methods use a linear decomposition of the kernel on a series of basis functions. The correct choice of these basis functions is fundamental to the efficiency and effectiveness of the matching - the chosen bases should represent the underlying signal using a reasonably small number… ▽ More

    Submitted 13 February, 2012; originally announced February 2012.

    Comments: Submitted to MNRAS; 5 figures

  8. arXiv:1107.4340  [pdf, other

    stat.ML

    Spectral approximations in machine learning

    Authors: Darren Homrighausen, Daniel J. McDonald

    Abstract: In many areas of machine learning, it becomes necessary to find the eigenvector decompositions of large matrices. We discuss two methods for reducing the computational burden of spectral decompositions: the more venerable Nystom extension and a newly introduced algorithm based on random projections. Previous work has centered on the ability to reconstruct the original matrix. We argue that a more… ▽ More

    Submitted 21 July, 2011; originally announced July 2011.

    Comments: 11 pages, 4 figures

  9. Semi-supervised Learning for Photometric Supernova Classification

    Authors: Joseph W. Richards, Darren Homrighausen, Peter E. Freeman, Chad M. Schafer, Dovi Poznanski

    Abstract: We present a semi-supervised method for photometric supernova typing. Our approach is to first use the nonlinear dimension reduction technique diffusion map to detect structure in a database of supernova light curves and subsequently employ random forest classification on a spectroscopically confirmed training set to learn a model that can predict the type of each newly observed supernova. We demo… ▽ More

    Submitted 27 September, 2011; v1 submitted 30 March, 2011; originally announced March 2011.

    Comments: 16 pages, 11 figures, accepted for publication in MNRAS

  10. arXiv:1011.4059  [pdf, ps, other

    astro-ph.IM

    Image Coaddition with Temporally Varying Kernels

    Authors: Darren Homrighausen, Christopher Genovese, Andy Connolly, Andy Becker, Russell Owen

    Abstract: Large, multi-frequency imaging surveys, such as the Large Synaptic Survey Telescope (LSST), need to do near-real time analysis of very large datasets. This raises a host of statistical and computational problems where standard methods do not work. In this paper, we study a proposed method for combining stacks of images into a single summary image, sometimes referred to as a template. This task is… ▽ More

    Submitted 17 November, 2010; originally announced November 2010.