Showing 1–50 of 58 results for author: Lafferty, J

  1. arXiv:2505.15927  [pdf, ps, other]

    stat.ML cs.LG

    CoT Information: Improved Sample Complexity under Chain-of-Thought Supervision

    Authors: Awni Altabaa, Omar Montasser, John Lafferty

    Abstract: Learning complex functions that involve multi-step reasoning poses a significant challenge for standard supervised learning from input-output examples. Chain-of-thought (CoT) supervision, which provides intermediate reasoning steps together with the final output, has emerged as a powerful empirical technique, underpinning much of the recent progress in the reasoning capabilities of large language… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2405.16727  [pdf, other]

    cs.LG

    Disentangling and Integrating Relational and Sensory Information in Transformer Architectures

    Authors: Awni Altabaa, John Lafferty

    Abstract: Relational reasoning is a central component of generally intelligent systems, enabling robust and data-efficient inductive generalization. Recent empirical evidence shows that many existing neural architectures, including Transformers, struggle with tasks requiring relational reasoning. In this work, we distinguish between two types of information: sensory information about the properties of indiv… ▽ More

    Submitted 2 October, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

    Comments: 27 pages, 11 figures

  3. arXiv:2402.08856  [pdf, other]

    cs.LG stat.ML

    Approximation of relation functions and attention mechanisms

    Authors: Awni Altabaa, John Lafferty

    Abstract: Inner products of neural network feature maps arise in a wide variety of machine learning frameworks as a method of modeling relations between inputs. This work studies the approximation properties of inner products of neural networks. It is shown that the inner product of a multi-layer perceptron with itself is a universal approximator for symmetric positive-definite relation functions. In the ca… ▽ More

    Submitted 15 June, 2024; v1 submitted 13 February, 2024; originally announced February 2024.

    Comments: 24 pages; added discussion on curse of dimensionality in v2

  4. arXiv:2310.03240  [pdf, other]

    cs.LG

    Learning Hierarchical Relational Representations through Relational Convolutions

    Authors: Awni Altabaa, John Lafferty

    Abstract: An evolving area of research in deep learning is the study of architectures and inductive biases that support the learning of relational feature representations. In this paper, we address the challenge of learning representations of hierarchical relations--that is, higher-order relational patterns among groups of objects. We introduce "relational convolutional networks", a neural architecture equi… ▽ More

    Submitted 26 September, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 31 pages

    Journal ref: Transactions on Machine Learning Research (TMLR), 2024

  5. arXiv:2309.06629  [pdf, other]

    cs.AI cs.NE

    The Relational Bottleneck as an Inductive Bias for Efficient Abstraction

    Authors: Taylor W. Webb, Steven M. Frankland, Awni Altabaa, Simon Segert, Kamesh Krishnamurthy, Declan Campbell, Jacob Russin, Tyler Giallanza, Zack Dulberg, Randall O'Reilly, John Lafferty, Jonathan D. Cohen

    Abstract: A central challenge for cognitive science is to explain how abstract concepts are acquired from limited experience. This has often been framed in terms of a dichotomy between connectionist and symbolic cognitive models. Here, we highlight a recently emerging line of work that suggests a novel reconciliation of these approaches, by exploiting an inductive bias that we term the relational bottleneck… ▽ More

    Submitted 1 May, 2024; v1 submitted 12 September, 2023; originally announced September 2023.

  6. arXiv:2304.00195  [pdf, other]

    stat.ML cs.LG

    Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

    Authors: Awni Altabaa, Taylor Webb, Jonathan Cohen, John Lafferty

    Abstract: An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor. At the core of the Abstractor is a variant of attention called relational cross-attention. The approach is motivated by an architectural inductive bias for relational learning that disentangles relational information from object-level features. This enables explicit rel… ▽ More

    Submitted 12 April, 2024; v1 submitted 31 March, 2023; originally announced April 2023.

    Comments: Published at ICLR 2024

  7. arXiv:2302.10392  [pdf, other]

    q-bio.NC cs.LG

    From seeing to remembering: Images with harder-to-reconstruct representations leave stronger memory traces

    Authors: Qi Lin, Zifan Li, John Lafferty, Ilker Yildirim

    Abstract: Much of what we remember is not due to intentional selection, but simply a by-product of perceiving. This raises a foundational question about the architecture of the mind: How does perception interface with and influence memory? Here, inspired by a classic proposal relating perceptual processing to memory durability, the level-of-processing theory, we present a sparse coding model for compressing… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  8. arXiv:2205.13614  [pdf, other]

    q-bio.NC cs.LG

    Emergent organization of receptive fields in networks of excitatory and inhibitory neurons

    Authors: Leon Lufkin, Ashish Puri, Ganlin Song, Xinyi Zhong, John Lafferty

    Abstract: Local patterns of excitation and inhibition that can generate neural waves are studied as a computational mechanism underlying the organization of neuronal tunings. Sparse coding algorithms based on networks of excitatory and inhibitory neurons are proposed that exhibit topographic maps as the receptive fields are adapted to input stimuli. Motivated by a leaky integrate-and-fire model of neural wa… ▽ More

    Submitted 26 May, 2022; originally announced May 2022.

  9. arXiv:2106.06044  [pdf, other]

    stat.ML cs.LG

    Convergence and Alignment of Gradient Descent with Random Backpropagation Weights

    Authors: Ganlin Song, Ruitu Xu, John Lafferty

    Abstract: Stochastic gradient descent with backpropagation is the workhorse of artificial neural networks. It has long been recognized that backpropagation fails to be a biologically plausible algorithm. Fundamentally, it is a non-local procedure -- updating one neuron's synaptic weights requires knowledge of synaptic weights or receptive fields of downstream neurons. This limits the use of artificial neura… ▽ More

    Submitted 22 December, 2021; v1 submitted 10 June, 2021; originally announced June 2021.

    Comments: 35 pages

  10. arXiv:2006.14781  [pdf, other]

    stat.ML cs.LG math.OC

    The huge Package for High-dimensional Undirected Graph Estimation in R

    Authors: Tuo Zhao, Han Liu, Kathryn Roeder, John Lafferty, Larry Wasserman

    Abstract: We describe an R package named huge which provides easy-to-use functions for estimating high dimensional undirected graphs from data. This package implements recent results in the literature, including Friedman et al. (2007), Liu et al. (2009, 2012) and Liu et al. (2010). Compared with the existing graph estimation package glasso, the huge package provides extra features: (1) instead of using Fort… ▽ More

    Submitted 25 June, 2020; originally announced June 2020.

    Comments: Published in JMLR in 2012

  11. arXiv:2005.09912  [pdf, other]

    math.ST stat.ML

    Model Repair: Robust Recovery of Over-Parameterized Statistical Models

    Authors: Chao Gao, John Lafferty

    Abstract: A new type of robust estimation problem is introduced where the goal is to recover a statistical model that has been corrupted after it has been estimated from data. Methods are proposed for "repairing" the model using only the design and not the response values used to fit the model in a supervised learning setting. Theory is developed which reveals that two important ingredients are necessary fo… ▽ More

    Submitted 20 May, 2020; originally announced May 2020.

  12. arXiv:1907.08653  [pdf, other]

    stat.ML cs.LG

    Surfing: Iterative optimization over incrementally trained deep networks

    Authors: Ganlin Song, Zhou Fan, John Lafferty

    Abstract: We investigate a sequential optimization procedure to minimize the empirical risk functional $f_{\hatθ}(x) = \frac{1}{2}\|G_{\hatθ}(x) - y\|^2$ for certain families of deep networks $G_θ(x)$. The approach is to optimize a sequence of objective functions that use network parameters obtained during different stages of the training process. When initialized with random parameters $θ_0$, we show that… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

  13. arXiv:1907.08646  [pdf, other]

    math.ST cs.LG stat.ML

    Fair quantile regression

    Authors: Dana Yang, John Lafferty, David Pollard

    Abstract: Quantile regression is a tool for learning conditional distributions. In this paper we study quantile regression in the setting where a protected attribute is unavailable when fitting the model. This can lead to "unfair" quantile estimators for which the effective quantiles are very different for the subpopulations defined by the protected attribute. We propose a procedure for adjusting the estim… ▽ More

    Submitted 19 July, 2019; originally announced July 2019.

  14. arXiv:1902.06034  [pdf, other]

    cs.IR cs.CL cs.LG stat.ML

    TopicEq: A Joint Topic and Mathematical Equation Model for Scientific Texts

    Authors: Michihiro Yasunaga, John Lafferty

    Abstract: Scientific documents rely on both mathematics and text to communicate ideas. Inspired by the topical correspondence between mathematical equations and word contexts observed in scientific texts, we propose a novel topic model that jointly generates mathematical equations and their surrounding text (TopicEq). Using an extension of the correlated topic model, the context is generated from a mixture… ▽ More

    Submitted 25 April, 2019; v1 submitted 15 February, 2019; originally announced February 2019.

    Comments: AAAI 2019

  15. arXiv:1805.06439  [pdf, other]

    stat.ML cs.LG

    Prediction Rule Reshaping

    Authors: Matt Bonakdarpour, Sabyasachi Chatterjee, Rina Foygel Barber, John Lafferty

    Abstract: Two methods are proposed for high-dimensional shape-constrained regression and classification. These methods reshape pre-trained prediction rules to satisfy shape constraints like monotonicity and convexity. The first method can be applied to any pre-trained prediction rule, while the second method deals specifically with random forests. In both cases, efficient algorithms are developed for comput… ▽ More

    Submitted 16 May, 2018; originally announced May 2018.

  16. arXiv:1803.01302  [pdf, other]

    stat.ML cs.LG math.ST

    Distributed Nonparametric Regression under Communication Constraints

    Authors: Yuancheng Zhu, John Lafferty

    Abstract: This paper studies the problem of nonparametric estimation of a smooth function with data distributed across multiple machines. We assume an independent sample from a white noise model is collected at each machine, and an estimator of the underlying true function needs to be constructed at a central machine. We place limits on the number of bits that each machine can use to transmit information to… ▽ More

    Submitted 23 June, 2018; v1 submitted 4 March, 2018; originally announced March 2018.

  17. arXiv:1710.00862  [pdf, other]

    stat.ME cs.SI math.ST stat.AP

    Testing for Global Network Structure Using Small Subgraph Statistics

    Authors: Chao Gao, John Lafferty

    Abstract: We study the problem of testing for community structure in networks using relations between the observed frequencies of small subgraphs. We propose a simple test for the existence of communities based only on the frequencies of three-node subgraphs. The test statistic is shown to be asymptotically normal under a null assumption of no community structure, and to have power approaching one under a c… ▽ More

    Submitted 16 October, 2017; v1 submitted 2 October, 2017; originally announced October 2017.

  18. arXiv:1704.06742  [pdf, other]

    stat.ME cs.SI math.ST

    Testing Network Structure Using Relations Between Small Subgraph Probabilities

    Authors: Chao Gao, John Lafferty

    Abstract: We study the problem of testing for structure in networks using relations between the observed frequencies of small subgraphs. We consider the statistics \begin{align*} T_3 & =(\text{edge frequency})^3 - \text{triangle frequency}\\ T_2 & =3(\text{edge frequency})^2(1-\text{edge frequency}) - \text{V-shape frequency} \end{align*} and prove a central limit theorem for $(T_2, T_3)$ under an Erdős-Rén… ▽ More

    Submitted 21 April, 2017; originally announced April 2017.
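
    The two statistics defined in the abstract above can be computed directly from an edge list. The following is a minimal sketch, not the authors' code: the function name `subgraph_statistics` is hypothetical, and it assumes that "V-shape frequency" means the fraction of node triples joined by exactly two edges. That reading is consistent with the term 3p²(1−p) for an Erdős-Rényi graph with edge probability p, but the truncated abstract does not confirm it.

```python
from itertools import combinations


def subgraph_statistics(n, edges):
    """Return (T2, T3) for a simple undirected graph on nodes 0..n-1.

    Assumption (not stated in the abstract): a "V-shape" is a node
    triple that is joined by exactly two edges.
    """
    if n < 3:
        raise ValueError("need at least three nodes to form a triple")

    # Store edges as unordered pairs; drop self-loops and duplicates.
    edge_set = {frozenset((u, v)) for u, v in edges if u != v}

    num_pairs = n * (n - 1) // 2
    num_triples = n * (n - 1) * (n - 2) // 6
    edge_freq = len(edge_set) / num_pairs

    triangles = 0
    v_shapes = 0
    for a, b, c in combinations(range(n), 3):
        # Number of edges present among the three possible pairs.
        k = sum(frozenset(p) in edge_set for p in ((a, b), (a, c), (b, c)))
        if k == 3:
            triangles += 1
        elif k == 2:
            v_shapes += 1

    triangle_freq = triangles / num_triples
    v_shape_freq = v_shapes / num_triples

    t3 = edge_freq ** 3 - triangle_freq
    t2 = 3 * edge_freq ** 2 * (1 - edge_freq) - v_shape_freq
    return t2, t3
```

    The triple loop costs O(n³) time, which is fine for small illustrative graphs; a large network would need a faster subgraph-counting routine.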

  19. arXiv:1607.08211  [pdf, other]

    stat.ME

    Selective Inference for Group-Sparse Linear Models

    Authors: Fan Yang, Rina Foygel Barber, Prateek Jain, John Lafferty

    Abstract: We develop tools for selective inference in the setting of group sparsity, including the construction of confidence intervals and p-values for testing selected groups of variables. Our main technical result gives the precise distribution of the magnitude of the projection of the data onto a given subspace, and enables us to develop inference procedures for a broad class of group-sparse selection m… ▽ More

    Submitted 27 July, 2016; originally announced July 2016.

  20. arXiv:1605.07596  [pdf, other]

    stat.ML

    Local Minimax Complexity of Stochastic Convex Optimization

    Authors: Yuancheng Zhu, Sabyasachi Chatterjee, John Duchi, John Lafferty

    Abstract: We extend the traditional worst-case, minimax analysis of stochastic convex optimization by introducing a localized form of minimax complexity for individual functions. Our main result gives function-specific lower and upper bounds on the number of stochastic subgradient evaluations needed to optimize either the function or its "hardest local alternative" to a given numerical precision. The bounds… ▽ More

    Submitted 26 May, 2016; v1 submitted 24 May, 2016; originally announced May 2016.

  21. arXiv:1605.07051  [pdf, other]

    stat.ML cs.LG

    Convergence Analysis for Rectangular Matrix Completion Using Burer-Monteiro Factorization and Gradient Descent

    Authors: Qinqing Zheng, John Lafferty

    Abstract: We address the rectangular matrix completion problem by lifting the unknown matrix to a positive semidefinite matrix in higher dimension, and optimizing a nonconvex objective over the semidefinite factor using a simple gradient descent scheme. With $O( μr^2 κ^2 n \max(μ, \log n))$ random observations of a $n_1 \times n_2$ $μ$-incoherent matrix of rank $r$ and condition number $κ$, where… ▽ More

    Submitted 21 November, 2016; v1 submitted 23 May, 2016; originally announced May 2016.

  22. arXiv:1602.08048  [pdf, other]

    math.ST

    Denoising Flows on Trees

    Authors: Sabyasachi Chatterjee, John Lafferty

    Abstract: We study the estimation of flows on trees, a structured generalization of isotonic regression. A tree flow is defined recursively as a positive flow value into a node that is partitioned into an outgoing flow to the children nodes, with some amount of the flow possibly leaking outside. We study the behavior of the least squares estimator for flows, and the associated minimax lower bounds. We chara… ▽ More

    Submitted 2 January, 2017; v1 submitted 25 February, 2016; originally announced February 2016.

  23. arXiv:1512.02956  [pdf, other]

    math.ST

    Adaptive Risk Bounds in Unimodal Regression

    Authors: Sabyasachi Chatterjee, John Lafferty

    Abstract: We study the statistical properties of the least squares estimator in unimodal sequence estimation. Although closely related to isotonic regression, unimodal regression has not been as extensively studied. We show that the unimodal least squares estimator is adaptive in the sense that the risk scales as a function of the number of values in the true underlying sequence. Such adaptivity properties… ▽ More

    Submitted 9 May, 2017; v1 submitted 9 December, 2015; originally announced December 2015.

  24. arXiv:1508.03090  [pdf, ps, other]

    math.NT

    Hida duality and the Iwasawa main conjecture

    Authors: Matthew J. Lafferty

    Abstract: The central result of this paper is a refinement of Hida's duality theorem between ordinary Lambda-adic modular forms and the universal ordinary Hecke algebra. Specifically, we give a necessary condition for this duality to be integral with respect to particular submodules of the space of ordinary Lambda-adic modular forms. This refinement allows us to give a simple proof that the universal ordinary… ▽ More

    Submitted 12 August, 2015; originally announced August 2015.

    Comments: 27 pages. Comments welcome

  25. arXiv:1506.06081  [pdf, other]

    stat.ML cs.LG

    A Convergent Gradient Descent Algorithm for Rank Minimization and Semidefinite Programming from Random Linear Measurements

    Authors: Qinqing Zheng, John Lafferty

    Abstract: We propose a simple, scalable, and fast gradient descent algorithm to optimize a nonconvex objective for the rank minimization problem and a closely related family of semidefinite programs. With $O(r^3 κ^2 n \log n)$ random measurements of a positive semidefinite $n \times n$ matrix of rank $r$ and condition number $κ$, our method is guaranteed to converge linearly to the global optimum.

    Submitted 24 March, 2016; v1 submitted 19 June, 2015; originally announced June 2015.

    Comments: Fix a minor error in Appendix E

  26. arXiv:1503.07368  [pdf, other]

    math.ST stat.ML

    Quantized Nonparametric Estimation over Sobolev Ellipsoids

    Authors: Yuancheng Zhu, John Lafferty

    Abstract: We formulate the notion of minimax estimation under storage or communication constraints, and prove an extension to Pinsker's theorem for nonparametric estimation over Sobolev ellipsoids. Placing limits on the number of bits used to encode any estimator, we give tight lower and upper bounds on the excess risk due to quantization in terms of the number of bits, the signal size, and the noise level.… ▽ More

    Submitted 11 April, 2017; v1 submitted 25 March, 2015; originally announced March 2015.

  27. arXiv:1411.1805  [pdf, other]

    math.ST stat.ML

    Faithful Variable Screening for High-Dimensional Convex Regression

    Authors: Min Xu, Minhua Chen, John Lafferty

    Abstract: We study the problem of variable selection in convex nonparametric regression. Under the assumption that the true regression function is convex and sparse, we develop a screening procedure to select a subset of variables that contains the relevant variables. Our approach is a two-stage quadratic programming method that estimates a sum of one-dimensional convex functions, followed by one-dimensiona… ▽ More

    Submitted 17 November, 2014; v1 submitted 6 November, 2014; originally announced November 2014.

  28. arXiv:1409.6833  [pdf, other]

    math.ST stat.ML

    Quantized Estimation of Gaussian Sequence Models in Euclidean Balls

    Authors: Yuancheng Zhu, John Lafferty

    Abstract: A central result in statistical theory is Pinsker's theorem, which characterizes the minimax rate in the normal means model of nonparametric estimation. In this paper, we present an extension to Pinsker's theorem where estimation is carried out under storage or communication constraints. In particular, we place limits on the number of bits used to encode an estimator, and analyze the excess risk i… ▽ More

    Submitted 24 September, 2014; originally announced September 2014.

    Comments: Appearing at NIPS 2014

  29. arXiv:1301.2286  [pdf]

    cs.LG stat.ML

    Iterative Markov Chain Monte Carlo Computation of Reference Priors and Minimax Risk

    Authors: John Lafferty, Larry A. Wasserman

    Abstract: We present an iterative Markov chain Monte Carlo algorithm for computing reference priors and minimax risk for general parametric families. Our approach uses MCMC techniques based on the Blahut-Arimoto algorithm for computing channel capacity in information theory. We give a statistical analysis of the algorithm, bounding the number of samples required for the stochastic algorithm to closely approximate th… ▽ More

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: Appears in Proceedings of the Seventeenth Conference on Uncertainty in Artificial Intelligence (UAI2001)

    Report number: UAI-P-2001-PG-293-300

  30. arXiv:1301.1919  [pdf, ps, other]

    stat.ML

    Nonparametric Reduced Rank Regression

    Authors: Rina Foygel, Michael Horrell, Mathias Drton, John Lafferty

    Abstract: We propose an approach to multivariate nonparametric regression that generalizes reduced rank regression for linear models. An additive model is estimated for each dimension of a $q$-dimensional response, with a shared $p$-dimensional predictor variable. To control the complexity of the model, we employ a functional form of the Ky-Fan or nuclear norm, resulting in a set of function estimates that… ▽ More

    Submitted 9 January, 2013; originally announced January 2013.

  31. arXiv:1301.0588  [pdf]

    cs.LG cs.IR stat.ML

    Expectation-Propogation for the Generative Aspect Model

    Authors: Thomas P. Minka, John Lafferty

    Abstract: The generative aspect model is an extension of the multinomial model for text that allows word probabilities to vary stochastically across documents. Previous results with aspect models have been promising, but hindered by the computational difficulty of carrying out inference and learning. This paper demonstrates that the simple variational methods of Blei et al (2001) can lead to inaccurate in… ▽ More

    Submitted 12 December, 2012; originally announced January 2013.

    Comments: Appears in Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence (UAI2002)

    Report number: UAI-P-2002-PG-352-359

  32. arXiv:1207.4172  [pdf]

    cs.LG stat.ML

    Variational Chernoff Bounds for Graphical Models

    Authors: Pradeep Ravikumar, John Lafferty

    Abstract: Recent research has made significant progress on the problem of bounding log partition functions for exponential family graphical models. Such bounds have associated dual parameters that are often used as heuristic estimates of the marginal probabilities required in inference and learning. However these variational estimates do not give rigorous bounds on marginal probabilities, nor do they give e… ▽ More

    Submitted 11 July, 2012; originally announced July 2012.

    Comments: Appears in Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence (UAI2004)

    Report number: UAI-P-2004-PG-462-469

  33. arXiv:1206.6488  [pdf]

    stat.ME cs.LG stat.ML

    The Nonparanormal SKEPTIC

    Authors: Han Liu, Fang Han, Ming Yuan, John Lafferty, Larry Wasserman

    Abstract: We propose a semiparametric approach, named nonparanormal skeptic, for estimating high dimensional undirected graphical models. In terms of modeling, we consider the nonparanormal family proposed by Liu et al (2009). In terms of estimation, we exploit nonparametric rank-based correlation coefficient estimators including the Spearman's rho and Kendall's tau. In high dimensional settings, we prove t… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  34. arXiv:1206.6450  [pdf]

    cs.LG stat.ML

    Conditional Sparse Coding and Grouped Multivariate Regression

    Authors: Min Xu, John Lafferty

    Abstract: We study the problem of multivariate regression where the data are naturally grouped, and a regression matrix is to be estimated for each group. We propose an approach in which a dictionary of low rank parameter matrices is estimated across groups, and a sparse linear combination of the dictionary elements is estimated to form a model within each group. We refer to the method as conditional sparse… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  35. arXiv:1206.6408  [pdf]

    stat.ME astro-ph.IM cs.LG

    Sequential Nonparametric Regression

    Authors: Haijie Gu, John Lafferty

    Abstract: We present algorithms for nonparametric regression in settings where the data are obtained sequentially. While traditional estimators select bandwidths that depend upon the sample size, for sequential data the effective sample size is dynamically changing. We propose a linear time algorithm that adjusts the bandwidth for each new data point, and show that the estimator achieves the optimal minimax… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  36. arXiv:1206.4669  [pdf]

    cs.LG stat.ML

    Sparse Additive Functional and Kernel CCA

    Authors: Sivaraman Balakrishnan, Kriti Puniyani, John Lafferty

    Abstract: Canonical Correlation Analysis (CCA) is a classical tool for finding correlations among the components of two random vectors. In recent years, CCA has been widely applied to the analysis of genomic data, where it is common for researchers to perform multiple assays on a single set of patient samples. Recent work has proposed sparse variants of CCA to address the high dimensionality of such data. H… ▽ More

    Submitted 18 June, 2012; originally announced June 2012.

    Comments: ICML2012

  37. arXiv:1202.2169  [pdf, other]

    stat.ML

    High Dimensional Semiparametric Gaussian Copula Graphical Models

    Authors: Han Liu, Fang Han, Ming Yuan, John Lafferty, Larry Wasserman

    Abstract: In this paper, we propose a semiparametric approach, named nonparanormal skeptic, for efficiently and robustly estimating high dimensional undirected graphical models. To achieve modeling flexibility, we consider Gaussian Copula graphical models (or the nonparanormal) as proposed by Liu et al. (2009). To achieve estimation robustness, we exploit nonparametric rank-based correlation coefficient est… ▽ More

    Submitted 27 July, 2012; v1 submitted 9 February, 2012; originally announced February 2012.

    Comments: 34 pages, 10 figures; the Annals of Statistics, 2012

  38. arXiv:1201.0794  [pdf, ps, other]

    stat.ML cs.LG stat.ME

    Sparse Nonparametric Graphical Models

    Authors: John Lafferty, Han Liu, Larry Wasserman

    Abstract: We present some nonparametric methods for graphical modeling. In the discrete case, where the data are binary or drawn from a finite alphabet, Markov random fields are already essentially nonparametric, since the cliques can take only a finite number of values. Continuous data are different. The Gaussian graphical model is the standard parametric model for continuous data, but it makes distributio… ▽ More

    Submitted 7 January, 2013; v1 submitted 3 January, 2012; originally announced January 2012.

    Comments: Published in Statistical Science (http://www.imstat.org/sts/) by the Institute of Mathematical Statistics (http://www.imstat.org), available at http://dx.doi.org/10.1214/12-STS391

    Report number: IMS-STS-STS391

    Journal ref: Statistical Science 2012, Vol. 27, No. 4, 519-537

  39. High-dimensional Ising model selection using ${\ell_1}$-regularized logistic regression

    Authors: Pradeep Ravikumar, Martin J. Wainwright, John D. Lafferty

    Abstract: We consider the problem of estimating the graph associated with a binary Ising Markov random field. We describe a method based on $\ell_1$-regularized logistic regression, in which the neighborhood of any given node is estimated by performing logistic regression subject to an $\ell_1$-constraint. The method is analyzed under high-dimensional scaling in which both the number of nodes $p$ and maximu… ▽ More

    Submitted 2 October, 2010; originally announced October 2010.

    Comments: Published in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org), available at http://dx.doi.org/10.1214/09-AOS691

    Report number: IMS-AOS-AOS691

    Journal ref: Annals of Statistics 2010, Vol. 38, No. 3, 1287-1319

  40. arXiv:1008.5211  [pdf, ps, other]

    stat.ML

    Union Support Recovery in Multi-task Learning

    Authors: Mladen Kolar, John Lafferty, Larry Wasserman

    Abstract: We sharply characterize the performance of different penalization schemes for the problem of selecting the relevant variables in the multi-task setting. Previous work focuses on the regression problem where conditions on the design matrix complicate the analysis. A clearer and simpler picture emerges by studying the Normal means model. This model, often used in the field of statistics, is a simpli… ▽ More

    Submitted 30 August, 2010; originally announced August 2010.

  41. arXiv:1006.3972  [pdf, ps, other]

    stat.ML stat.ME

    Graph-Valued Regression

    Authors: Han Liu, Xi Chen, John Lafferty, Larry Wasserman

    Abstract: Undirected graphical models encode in a graph $G$ the dependency structure of a random vector $Y$. In many applications, it is of interest to model $Y$ given another random vector $X$ as input. We refer to the problem of estimating the graph $G(x)$ of $Y$ conditioned on $X=x$ as ``graph-valued regression.'' In this paper, we propose a semiparametric method for estimating $G(x)$ that builds a tree… ▽ More

    Submitted 20 June, 2010; originally announced June 2010.

  42. arXiv:1001.1557  [pdf, ps, other]

    stat.ML

    Forest Density Estimation

    Authors: Han Liu, Min Xu, Haijie Gu, Anupam Gupta, John Lafferty, Larry Wasserman

    Abstract: We study graph estimation and density estimation in high dimensions, using a family of density estimators based on forest structured undirected graphical models. For density estimation, we do not assume the true distribution corresponds to a forest; rather, we form kernel density estimates of the bivariate and univariate marginals, and apply Kruskal's algorithm to estimate the optimal forest on he… ▽ More

    Submitted 20 October, 2010; v1 submitted 10 January, 2010; originally announced January 2010.

    Comments: Extended version of earlier paper titled "Tree density estimation"
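
    The abstract above mentions applying Kruskal's algorithm to pick a forest. The following is a minimal sketch of that step only, not the authors' implementation: the function name `max_weight_forest`, the `max_edges` cap, and the rule of ignoring non-positive weights are all assumptions made for illustration, and the pairwise weights are taken as given (the truncated abstract does not say how they are scored).

```python
def max_weight_forest(n, weighted_edges, max_edges=None):
    """Greedy (Kruskal-style) maximum-weight forest on nodes 0..n-1.

    weighted_edges: iterable of (weight, u, v) tuples.
    max_edges: optional cap on the number of edges kept (hypothetical
    knob; capping below n - 1 yields a forest rather than a tree).
    Returns the chosen edges as (u, v, weight), heaviest first.
    """
    parent = list(range(n))

    def find(x):
        # Union-find root lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    limit = n - 1 if max_edges is None else max_edges
    forest = []
    for w, u, v in sorted(weighted_edges, reverse=True):
        if len(forest) >= limit:
            break
        if w <= 0:
            # Assumption: edges with non-positive weight never help.
            break
        root_u, root_v = find(u), find(v)
        if root_u != root_v:
            # Joining two different components cannot create a cycle.
            parent[root_u] = root_v
            forest.append((u, v, w))
    return forest
```

    For example, `max_weight_forest(4, [(0.9, 0, 1), (0.8, 1, 2), (0.7, 0, 2), (0.1, 2, 3)])` keeps the edges (0, 1), (1, 2) and (2, 3) and skips (0, 2), which would close a cycle.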

  43. arXiv:0907.1013  [pdf, other]

    stat.ML

    Visualizing Topics with Multi-Word Expressions

    Authors: David M. Blei, John D. Lafferty

    Abstract: We describe a new method for visualizing topics, the distributions over terms that are automatically extracted from large text corpora using latent variable models. Our method finds significant $n$-grams related to a topic, which are then used to help understand and interpret the underlying distribution. Compared with the usual visualization, which simply lists the most probable topical terms, t… ▽ More

    Submitted 6 July, 2009; originally announced July 2009.

  44. arXiv:0903.0649  [pdf, ps, other]

    stat.ML

    The Nonparanormal: Semiparametric Estimation of High Dimensional Undirected Graphs

    Authors: Han Liu, John Lafferty, Larry Wasserman

    Abstract: Recent methods for estimating sparse undirected graphs for real-valued data in high dimensional problems rely heavily on the assumption of normality. We show how to use a semiparametric Gaussian copula--or "nonparanormal"--for high dimensional inference. Just as additive models extend linear models by replacing linear functions with a set of one-dimensional smooth functions, the nonparanormal ex… ▽ More

    Submitted 3 March, 2009; originally announced March 2009.

  45. arXiv:0804.4202  [pdf, ps, other]

    math.ST

    High-Dimensional Graphical Model Selection Using $\ell_1$-Regularized Logistic Regression

    Authors: Pradeep Ravikumar, Martin J. Wainwright, John D. Lafferty

    Abstract: We consider the problem of estimating the graph structure associated with a discrete Markov random field. We describe a method based on $\ell_1$-regularized logistic regression, in which the neighborhood of any given node is estimated by performing logistic regression subject to an $\ell_1$-constraint. Our framework applies to the high-dimensional setting, in which both the number of nodes $p$ a…

    Submitted 25 April, 2008; originally announced April 2008.

    Comments: Appeared as UC Berkeley, Department of Statistics, Technical Report

    Report number: Technical Report 750

  46. Rodeo: Sparse, greedy nonparametric regression

    Authors: John Lafferty, Larry Wasserman

    Abstract: We present a greedy method for simultaneously performing local bandwidth selection and variable selection in nonparametric regression. The method starts with a local linear estimator with large bandwidths, and incrementally decreases the bandwidth of variables for which the gradient of the estimator with respect to bandwidth is large. The method--called rodeo (regularization of derivative expect…

    Submitted 12 March, 2008; originally announced March 2008.

    Comments: Published at http://dx.doi.org/10.1214/009053607000000811 in the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOS-AOS0318

    MSC Class: 62G08 (Primary); 62G20 (Secondary)

    Journal ref: Annals of Statistics 2008, Vol. 36, No. 1, 28-63

  47. arXiv:0802.2758  [pdf, ps, other]

    stat.ML math.ST

    Time Varying Undirected Graphs

    Authors: Shuheng Zhou, John Lafferty, Larry Wasserman

    Abstract: Undirected graphs are often used to describe high dimensional distributions. Under sparsity conditions, the graph can be estimated using $\ell_1$ penalization methods. However, current methods assume that the data are independent and identically distributed. If the distribution, and hence the graph, evolves over time then the data are no longer identically distributed. In this paper, we show ho…

    Submitted 28 April, 2008; v1 submitted 20 February, 2008; originally announced February 2008.

    Comments: 12 pages, 3 figures, to appear in COLT 2008

    Journal ref: The 21st Annual Conference on Learning Theory (COLT 2008), Helsinki, Finland

  48. Correction: A correlated topic model of Science

    Authors: David M. Blei, John D. Lafferty

    Abstract: Correction to Annals of Applied Statistics 1 (2007) 17--35 [doi:10.1214/07-AOAS114]

    Submitted 10 December, 2007; originally announced December 2007.

    Comments: Published at http://dx.doi.org/10.1214/07-AOAS136 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS136

    Journal ref: Annals of Applied Statistics 2007, Vol. 1, No. 2, 634-634

  49. arXiv:0711.4555  [pdf, ps, other]

    math.ST

    Sparse Additive Models

    Authors: Pradeep Ravikumar, John Lafferty, Han Liu, Larry Wasserman

    Abstract: We present a new class of methods for high-dimensional nonparametric regression and classification called sparse additive models (SpAM). Our methods combine ideas from sparse linear modeling and additive nonparametric regression. We derive an algorithm for fitting the models that is practical and effective even when the number of covariates is larger than the sample size. SpAM is closely related…

    Submitted 8 April, 2008; v1 submitted 28 November, 2007; originally announced November 2007.

  50. A correlated topic model of Science

    Authors: David M. Blei, John D. Lafferty

    Abstract: Topic models, such as latent Dirichlet allocation (LDA), can be useful tools for the statistical analysis of document collections and other discrete data. The LDA model assumes that the words of each document arise from a mixture of topics, each of which is a distribution over the vocabulary. A limitation of LDA is the inability to model topic correlation even though, for example, a document abo…

    Submitted 31 August, 2007; v1 submitted 27 August, 2007; originally announced August 2007.

    Comments: Published at http://dx.doi.org/10.1214/07-AOAS114 in the Annals of Applied Statistics (http://www.imstat.org/aoas/) by the Institute of Mathematical Statistics (http://www.imstat.org)

    Report number: IMS-AOAS-AOAS114

    Journal ref: Annals of Applied Statistics 2007, Vol. 1, No. 1, 17-35