Skip to main content

Showing 1–16 of 16 results for author: Frostig, R

Searching in archive stat. Search in all archives.
.
  1. arXiv:2203.17193  [pdf, other

    cs.LG stat.ML

    Learning from many trajectories

    Authors: Stephen Tu, Roy Frostig, Mahdi Soltanolkotabi

    Abstract: We initiate a study of supervised learning from many independent sequences ("trajectories") of non-independent covariates, reflecting tasks in sequence modeling, control, and reinforcement learning. Conceptually, our multi-trajectory setup sits between two traditional settings in statistical learning theory: learning from independent examples and learning from a single auto-correlated sequence. Ou… ▽ More

    Submitted 31 January, 2023; v1 submitted 31 March, 2022; originally announced March 2022.

  2. arXiv:2106.01104  [pdf, other

    stat.ME stat.CO

    Filtrated Common Functional Principal Components for Multivariate Functional data

    Authors: Shuhao Jiao, Ron D. Frostig, Hernando Ombao

    Abstract: Local field potentials (LFPs) are signals that measure electrical activity in localized cortical regions from implanted tetrodes in the human or animal brain. The LFP signals are curves observed at multiple tetrodes which are implanted across a patch on the surface of the cortex. Hence, they can be treated as multi-group functional data, where the trajectories collected across temporal epochs from… ▽ More

    Submitted 26 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

  3. arXiv:2105.15183  [pdf, other

    cs.LG math.NA stat.ML

    Efficient and Modular Implicit Differentiation

    Authors: Mathieu Blondel, Quentin Berthet, Marco Cuturi, Roy Frostig, Stephan Hoyer, Felipe Llinares-López, Fabian Pedregosa, Jean-Philippe Vert

    Abstract: Automatic differentiation (autodiff) has revolutionized machine learning. It allows to express complex computations by composing elementary ones in creative ways and removes the burden of computing their derivatives by hand. More recently, differentiation of optimization problem solutions has attracted widespread attention with applications such as optimization layers, and in bi-level problems suc… ▽ More

    Submitted 12 October, 2022; v1 submitted 31 May, 2021; originally announced May 2021.

    Comments: V3: added more related work and Jacobian precision figure

  4. Break Point Detection for Functional Covariance

    Authors: Shuhao Jiao, Ron D. Frostig, Hernando Ombao

    Abstract: Many experiments record sequential trajectories where each trajectory consists of oscillations and fluctuations around zero. Such trajectories can be viewed as zero-mean functional data. When there are structural breaks (on the sequence of trajectories) in higher order moments, it is not always easy to spot these by mere visual inspection. Motivated by this challenging problem in brain signal anal… ▽ More

    Submitted 4 February, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

  5. arXiv:2004.00855  [pdf, other

    stat.ME

    Variation Pattern Classification of Functional Data

    Authors: Shuhao Jiao, Ron D. Frostig, Hernando Ombao

    Abstract: A new classification method for functional data is proposed in this paper. This work is motivated by the need to identify features that discriminate between neurological conditions on which local field potentials (LFPs) were recorded. Regardless of the condition, these local field potentials have zero mean and thus the first moments of these random processes do not have discriminating power. We pr… ▽ More

    Submitted 4 February, 2022; v1 submitted 2 April, 2020; originally announced April 2020.

  6. arXiv:1911.12295  [pdf, other

    stat.ME

    Modeling Spectral Properties in Stationary Processes of Varying Dimensions with Applications to Brain Local Field Potential Signals

    Authors: Raanju Ragavendar Sundararajan, Ron D. Frostig, Hernando Ombao

    Abstract: A common class of methods for analyzing of multivariate time series, stationary and nonstationary, decomposes the observed series into latent sources. Methods such as principal compoment analysis (PCA), independent component analysis (ICA) and Stationary Subspace Analysis (SSA) assume the observed multivariate process is generated by latent sources that are stationary or nonstationary. We develop… ▽ More

    Submitted 29 November, 2019; v1 submitted 27 November, 2019; originally announced November 2019.

    Comments: 34 pages

    MSC Class: 62M10; 62M15

  7. arXiv:1905.10360  [pdf, other

    cs.LG cs.DS stat.ML

    The advantages of multiple classes for reducing overfitting from test set reuse

    Authors: Vitaly Feldman, Roy Frostig, Moritz Hardt

    Abstract: Excessive reuse of holdout data can lead to overfitting. However, there is little concrete evidence of significant overfitting due to holdout reuse in popular multiclass benchmarks today. Known results show that, in the worst-case, revealing the accuracy of $k$ adaptively chosen classifiers on a data set of size $n$ allows to create a classifier with bias of $Θ(\sqrt{k/n})$ for any binary predicti… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

  8. arXiv:1811.03600  [pdf, other

    cs.LG stat.ML

    Measuring the Effects of Data Parallelism on Neural Network Training

    Authors: Christopher J. Shallue, Jaehoon Lee, Joseph Antognini, Jascha Sohl-Dickstein, Roy Frostig, George E. Dahl

    Abstract: Recent hardware developments have dramatically increased the scale of data parallelism available for neural network training. Among the simplest ways to harness next-generation hardware is to increase the batch size in standard mini-batch neural network training algorithms. In this work, we aim to experimentally characterize the effects of increasing the batch size on training time, as measured by… ▽ More

    Submitted 18 July, 2019; v1 submitted 8 November, 2018; originally announced November 2018.

    Journal ref: Journal of Machine Learning Research 20 (2019) 1-49

  9. arXiv:1809.09448  [pdf, other

    stat.ME

    Modeling Dependence via Copula of Functionals of Fourier Coefficients

    Authors: Charles Fontaine, Ron D. Frostig, Hernando Ombao

    Abstract: The goal of this paper is to develop a measure for characterizing complex dependence between stationary time series that cannot be captured by traditional measures such as correlation and coherence. Our approach is to use copula models of functionals of the Fourier coefficients which is a generalization of coherence. Here, we use standard parametric copula models with a single parameter both from… ▽ More

    Submitted 25 September, 2018; originally announced September 2018.

  10. arXiv:1809.08785  [pdf, other

    stat.AP

    Modeling non-linear spectral domain dependence using copulas with applications to rat local field potentials

    Authors: Charles Fontaine, Ron D. Frostig, Hernando Ombao

    Abstract: This paper intends to develop tools for characterizing non-linear spectral dependence between spontaneous brain signals. We use parametric copula models (both bivariate and vine models) applied on the magnitude of Fourier coefficients rather than using coherence. The motivation behind this work is an experiment on rats that studied the impact of stroke on the connectivity structure (dependence) be… ▽ More

    Submitted 24 September, 2018; originally announced September 2018.

  11. arXiv:1608.03100  [pdf, other

    stat.ML cs.LG

    Estimation from Indirect Supervision with Linear Moments

    Authors: Aditi Raghunathan, Roy Frostig, John Duchi, Percy Liang

    Abstract: In structured prediction problems where we have indirect supervision of the output, maximum marginal likelihood faces two computational obstacles: non-convexity of the objective and intractability of even a single gradient computation. In this paper, we bypass both obstacles for a class of what we call linear indirectly-supervised problems. Our approach is simple: we solve a linear system to estim… ▽ More

    Submitted 10 August, 2016; originally announced August 2016.

    Comments: 12 pages, 7 figures, extended and updated version of our paper appearing in ICML 2016

  12. arXiv:1602.06872  [pdf, ps, other

    cs.DS cs.LG stat.ML

    Principal Component Projection Without Principal Component Analysis

    Authors: Roy Frostig, Cameron Musco, Christopher Musco, Aaron Sidford

    Abstract: We show how to efficiently project a vector onto the top principal components of a matrix, without explicitly computing these components. Specifically, we introduce an iterative algorithm that provably computes the projection using few calls to any black-box routine for ridge regression. By avoiding explicit principal component analysis (PCA), our algorithm is the first with no runtime dependenc… ▽ More

    Submitted 26 November, 2019; v1 submitted 22 February, 2016; originally announced February 2016.

  13. arXiv:1602.05897  [pdf, other

    cs.LG cs.AI cs.CC cs.DS stat.ML

    Toward Deeper Understanding of Neural Networks: The Power of Initialization and a Dual View on Expressivity

    Authors: Amit Daniely, Roy Frostig, Yoram Singer

    Abstract: We develop a general duality between neural networks and compositional kernels, striving towards a better understanding of deep learning. We show that initial representations generated by common random initializations are sufficiently rich to express all functions in the dual kernel space. Hence, though the training objective is hard to optimize in the worst case, the initial weights form a good s… ▽ More

    Submitted 19 May, 2017; v1 submitted 18 February, 2016; originally announced February 2016.

  14. arXiv:1506.07512  [pdf, other

    stat.ML cs.DS cs.LG

    Un-regularizing: approximate proximal point and faster stochastic algorithms for empirical risk minimization

    Authors: Roy Frostig, Rong Ge, Sham M. Kakade, Aaron Sidford

    Abstract: We develop a family of accelerated stochastic algorithms that minimize sums of convex functions. Our algorithms improve upon the fastest running time for empirical risk minimization (ERM), and in particular linear least-squares regression, across a wide range of problem settings. To achieve this, we establish a framework based on the classical proximal point algorithm. Namely, we provide several a… ▽ More

    Submitted 24 June, 2015; originally announced June 2015.

  15. arXiv:1412.6606  [pdf, other

    stat.ML cs.LG

    Competing with the Empirical Risk Minimizer in a Single Pass

    Authors: Roy Frostig, Rong Ge, Sham M. Kakade, Aaron Sidford

    Abstract: In many estimation problems, e.g. linear and logistic regression, we wish to minimize an unknown objective given only unbiased samples of the objective function. Furthermore, we aim to achieve this using as few samples as possible. In the absence of computational constraints, the minimizer of a sample average of observed data -- commonly referred to as either the empirical risk minimizer (ERM) or… ▽ More

    Submitted 25 February, 2015; v1 submitted 20 December, 2014; originally announced December 2014.

  16. arXiv:1312.6205  [pdf, other

    stat.ML cs.LG

    Relaxations for inference in restricted Boltzmann machines

    Authors: Sida I. Wang, Roy Frostig, Percy Liang, Christopher D. Manning

    Abstract: We propose a relaxation-based approximate inference algorithm that samples near-MAP configurations of a binary pairwise Markov random field. We experiment on MAP inference tasks in several restricted Boltzmann machines. We also use our underlying sampler to estimate the log-partition function of restricted Boltzmann machines and compare against other sampling-based methods.

    Submitted 2 January, 2014; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: ICLR 2014 workshop track submission