Skip to main content

Showing 1–15 of 15 results for author: Levin, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.01024  [pdf, other

    stat.ME

    Testing for Repeated Motifs and Hierarchical Structure in Stochastic Blockmodels

    Authors: Al-Fahad Al-Qadhi, Keith Levin, Vincent Lyzinski

    Abstract: The rise in complexity of network data in neuroscience, social networks, and protein-protein interaction networks has been accompanied by several efforts to model and understand these data at different scales. A key multiscale network modeling technique posits hierarchical structure in the network, and by treating networks as multiple levels of subdivisions with shared statistical properties we ca… ▽ More

    Submitted 2 March, 2025; originally announced March 2025.

  2. arXiv:2410.10772  [pdf, other

    stat.ME

    Peer effects in the linear-in-means model may be inestimable even when identified

    Authors: Alex Hayes, Keith Levin

    Abstract: Linear-in-means models are widely used to investigate peer effects. Identifying peer effects in these models is challenging, but conditions for identification are well-known. However, even when peer effects are identified, they may not be estimable, due to an asymptotic colinearity issue: as sample size increases, peer effects become more and more linearly dependent. We show that asymptotic coline… ▽ More

    Submitted 14 October, 2024; originally announced October 2024.

  3. arXiv:2212.12041  [pdf, other

    stat.ME

    Estimating network-mediated causal effects via principal components network regression

    Authors: Alex Hayes, Mark M. Fredrickson, Keith Levin

    Abstract: We develop a method to decompose causal effects on a social network into an indirect effect mediated by the network, and a direct effect independent of the social network. To handle the complexity of network structures, we assume that latent social groups act as causal mediators. We develop principal components network regression models to differentiate the social effect from the non-social effect… ▽ More

    Submitted 6 March, 2025; v1 submitted 22 December, 2022; originally announced December 2022.

    Comments: Updating to match published JMLR version

    Journal ref: Journal of Machine Learning Research 26 (2025): 1-99

  4. arXiv:2210.17519  [pdf, other

    stat.ME stat.AP

    Predicting Responses from Weighted Networks with Node Covariates in an Application to Neuroimaging

    Authors: Daniel Kessler, Keith Levin, Elizaveta Levina

    Abstract: We consider the setting where many networks are observed on a common node set, and each observation comprises edge weights of a network, covariates observed at each node, and an overall response. The goal is to use the edge weights and node covariates to predict the response while identifying an interpretable set of predictive features. Our motivating application is neuroimaging, where edge weight… ▽ More

    Submitted 22 August, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

  5. arXiv:2005.02151  [pdf, other

    cs.IR cs.LG math.ST stat.ML

    Vertex Nomination in Richly Attributed Networks

    Authors: Keith Levin, Carey E. Priebe, Vince Lyzinski

    Abstract: Vertex nomination is a lightly-supervised network information retrieval task in which vertices of interest in one graph are used to query a second graph to discover vertices of interest in the second graph. Similar to other information retrieval tasks, the output of a vertex nomination scheme is a ranked list of the vertices in the second graph, with the heretofore unknown vertices of interest ide… ▽ More

    Submitted 4 May, 2023; v1 submitted 29 April, 2020; originally announced May 2020.

    Comments: 46 pages, 5 figures

  6. arXiv:1910.00423  [pdf, other

    stat.ML cs.LG math.ST stat.ME

    Limit theorems for out-of-sample extensions of the adjacency and Laplacian spectral embeddings

    Authors: Keith Levin, Fred Roosta, Minh Tang, Michael W. Mahoney, Carey E. Priebe

    Abstract: Graph embeddings, a class of dimensionality reduction techniques designed for relational data, have proven useful in exploring and modeling network structure. Most dimensionality reduction methods allow out-of-sample extensions, by which an embedding can be applied to observations not present in the training set. Applied to graphs, the out-of-sample extension problem concerns how to compute the em… ▽ More

    Submitted 29 September, 2019; originally announced October 2019.

    Comments: Portions of this work originally appeared in ICML2018 as "Out-of-sample extension of graph adjacency spectral embedding" (accompanying technical report available at arXiv:1802.06307). This work extends the results of that earlier paper to a second graph embedding technique called the Laplacian spectral embedding and presents additional experiments

  7. arXiv:1907.10821  [pdf, other

    math.ST stat.ME

    Bootstrapping Networks with Latent Space Structure

    Authors: Keith Levin, Elizaveta Levina

    Abstract: A core problem in statistical network analysis is to develop network analogues of classical techniques. The problem of bootstrapping network data stands out as especially challenging, since typically one observes only a single network, rather than a sample. Here we propose two methods for obtaining bootstrap samples for networks drawn from latent space models. The first method generates bootstrap… ▽ More

    Submitted 11 October, 2021; v1 submitted 24 July, 2019; originally announced July 2019.

  8. arXiv:1906.07265  [pdf, other

    math.ST cs.LG eess.SP stat.ME stat.ML

    Recovering shared structure from multiple networks with unknown edge distributions

    Authors: Keith Levin, Asad Lodhia, Elizaveta Levina

    Abstract: In increasingly many settings, data sets consist of multiple samples from a population of networks, with vertices aligned across these networks. For example, brain connectivity networks in neuroscience consist of measures of interaction between brain regions that have been aligned to a common template. We consider the setting where the observed networks have a shared expectation, but may differ in… ▽ More

    Submitted 8 May, 2021; v1 submitted 12 June, 2019; originally announced June 2019.

  9. arXiv:1802.06307  [pdf, other

    stat.ML

    Out-of-sample extension of graph adjacency spectral embedding

    Authors: Keith Levin, Farbod Roosta-Khorasani, Michael W. Mahoney, Carey E. Priebe

    Abstract: Many popular dimensionality reduction procedures have out-of-sample extensions, which allow a practitioner to apply a learned embedding to observations not seen in the initial training sample. In this work, we consider the problem of obtaining an out-of-sample extension for the adjacency spectral embedding, a procedure for embedding the vertices of a graph into Euclidean space. We present two diff… ▽ More

    Submitted 17 February, 2018; originally announced February 2018.

  10. arXiv:1802.04960  [pdf, other

    stat.ML

    Vertex nomination: The canonical sampling and the extended spectral nomination schemes

    Authors: Jordan Yoder, Li Chen, Henry Pao, Eric Bridgeford, Keith Levin, Donniell Fishkind, Carey Priebe, Vince Lyzinski

    Abstract: Suppose that one particular block in a stochastic block model is of interest, but block labels are only observed for a few of the vertices in the network. Utilizing a graph realized from the model and the observed block labels, the vertex nomination task is to order the vertices with unobserved block labels into a ranked nomination list with the goal of having an abundance of interesting vertices… ▽ More

    Submitted 22 January, 2020; v1 submitted 14 February, 2018; originally announced February 2018.

  11. arXiv:1711.05610  [pdf, other

    stat.ML

    On consistent vertex nomination schemes

    Authors: Vince Lyzinski, Keith Levin, Carey E. Priebe

    Abstract: Given a vertex of interest in a network $G_1$, the vertex nomination problem seeks to find the corresponding vertex of interest (if it exists) in a second network $G_2$. A vertex nomination scheme produces a list of the vertices in $G_2$, ranked according to how likely they are judged to be the corresponding vertex of interest in $G_2$. The vertex nomination problem and related information retriev… ▽ More

    Submitted 9 December, 2018; v1 submitted 15 November, 2017; originally announced November 2017.

    Comments: 32 pages, 4 figures

  12. arXiv:1709.05454  [pdf, other

    stat.ME math.ST stat.ML

    Statistical inference on random dot product graphs: a survey

    Authors: Avanti Athreya, Donniell E. Fishkind, Keith Levin, Vince Lyzinski, Youngser Park, Yichen Qin, Daniel L. Sussman, Minh Tang, Joshua T. Vogelstein, Carey E. Priebe

    Abstract: The random dot product graph (RDPG) is an independent-edge random graph that is analytically tractable and, simultaneously, either encompasses or can successfully approximate a wide range of random graphs, from relatively simple stochastic block models to complex latent position graphs. In this survey paper, we describe a comprehensive paradigm for statistical inference on random dot product graph… ▽ More

    Submitted 16 September, 2017; originally announced September 2017.

    Comments: An expository survey paper on a comprehensive paradigm for inference for random dot product graphs, centered on graph adjacency and Laplacian spectral embeddings. Paper outlines requisite background; summarizes theory, methodology, and applications from previous and ongoing work; and closes with a discussion of several open problems

    MSC Class: 62FXX; 62GXX; 62HXX; 05CXX

    Journal ref: Journal of Machine Learning Research, 2018

  13. arXiv:1705.09355  [pdf, other

    stat.ME

    A central limit theorem for an omnibus embedding of multiple random graphs and implications for multiscale network inference

    Authors: Keith Levin, Avanti Athreya, Minh Tang, Vince Lyzinski, Youngser Park, Carey E. Priebe

    Abstract: Performing statistical analyses on collections of graphs is of import to many disciplines, but principled, scalable methods for multi-sample graph inference are few. Here we describe an "omnibus" embedding in which multiple graphs on the same vertex set are jointly embedded into a single space with a distinct representation for each graph. We prove a central limit theorem for this embedding and de… ▽ More

    Submitted 25 June, 2019; v1 submitted 25 May, 2017; originally announced May 2017.

    MSC Class: 62H12; 62H15; 05C80

  14. arXiv:1607.01369  [pdf, other

    stat.ML

    On the Consistency of the Likelihood Maximization Vertex Nomination Scheme: Bridging the Gap Between Maximum Likelihood Estimation and Graph Matching

    Authors: Vince Lyzinski, Keith Levin, Donniell E. Fishkind, Carey E. Priebe

    Abstract: Given a graph in which a few vertices are deemed interesting a priori, the vertex nomination task is to order the remaining vertices into a nomination list such that there is a concentration of interesting vertices at the top of the list. Previous work has yielded several approaches to this problem, with theoretical results in the setting where the graph is drawn from a stochastic block model (SBM… ▽ More

    Submitted 27 August, 2016; v1 submitted 5 July, 2016; originally announced July 2016.

  15. Laplacian Eigenmaps from Sparse, Noisy Similarity Measurements

    Authors: Keith Levin, Vince Lyzinski

    Abstract: Manifold learning and dimensionality reduction techniques are ubiquitous in science and engineering, but can be computationally expensive procedures when applied to large data sets or when similarities are expensive to compute. To date, little work has been done to investigate the tradeoff between computational resources and the quality of learned representations. We present both theoretical and e… ▽ More

    Submitted 16 August, 2016; v1 submitted 12 March, 2016; originally announced March 2016.