Skip to main content

Showing 1–10 of 10 results for author: Bridgeford, E

Searching in archive stat. Search in all archives.
.
  1. arXiv:2503.10710  [pdf, ps, other

    q-bio.OT stat.OT

    How causal perspectives can inform neuroscience data analysis

    Authors: Eric W. Bridgeford, Brian S. Caffo, Maya B. Mathur, Russell A. Poldrack

    Abstract: Over the past two decades, considerable strides have been made in advancing neuroscientific techniques, yet challenges remain in attributing causality to observed associations. This review addresses a fundamental issue in observational neuroscience studies and advocates for incorporating causal inference frameworks into standard practice. We systematically introduce necessary definitions and conce… ▽ More

    Submitted 4 September, 2025; v1 submitted 12 March, 2025; originally announced March 2025.

  2. arXiv:2307.13868  [pdf, other

    stat.ME cs.LG stat.ML

    Learning sources of variability from high-dimensional observational studies

    Authors: Eric W. Bridgeford, Jaewon Chung, Brian Gilbert, Sambit Panda, Adam Li, Cencheng Shen, Alexandra Badea, Brian Caffo, Joshua T. Vogelstein

    Abstract: Causal inference studies whether the presence of a variable influences an observed outcome. As measured by quantities such as the "average treatment effect," this paradigm is employed across numerous biological fields, from vaccine and drug development to policy interventions. Unfortunately, the majority of these methods are often limited to univariate outcomes. Our work generalizes causal estiman… ▽ More

    Submitted 28 November, 2023; v1 submitted 25 July, 2023; originally announced July 2023.

  3. arXiv:2011.14990  [pdf, other

    q-bio.NC stat.ME

    Multiscale Comparative Connectomics

    Authors: Vivek Gopalakrishnan, Jaewon Chung, Eric Bridgeford, Benjamin D. Pedigo, Jesús Arroyo, Lucy Upchurch, G. Allan Johnson, Nian Wang, Youngser Park, Carey E. Priebe, Joshua T. Vogelstein

    Abstract: The connectome, a map of the structural and/or functional connections in the brain, provides a complex representation of the neurobiological phenotypes on which it supervenes. This information-rich data modality has the potential to transform our understanding of the relationship between patterns in brain connectivity and neurological processes, disorders, and diseases. However, existing computati… ▽ More

    Submitted 2 December, 2024; v1 submitted 30 November, 2020; originally announced November 2020.

  4. arXiv:2005.11911  [pdf, other

    stat.AP math.ST

    Statistical Analysis of Data Repeatability Measures

    Authors: Zeyi Wang, Eric Bridgeford, Shangsi Wang, Joshua T. Vogelstein, Brian Caffo

    Abstract: The advent of modern data collection and processing techniques has seen the size, scale, and complexity of data grow exponentially. A seminal step in leveraging these rich datasets for downstream inference is understanding the characteristics of the data which are repeatable -- the aspects of the data that are able to be identified under a duplicated analysis. Conflictingly, the utility of traditi… ▽ More

    Submitted 28 July, 2024; v1 submitted 24 May, 2020; originally announced May 2020.

  5. arXiv:1907.02088  [pdf, other

    stat.CO cs.MS stat.ME stat.ML

    hyppo: A Multivariate Hypothesis Testing Python Package

    Authors: Sambit Panda, Satish Palaniappan, Junhao Xiong, Eric W. Bridgeford, Ronak Mehta, Cencheng Shen, Joshua T. Vogelstein

    Abstract: We introduce hyppo, a unified library for performing multivariate hypothesis testing, including independence, two-sample, and k-sample testing. While many multivariate independence tests have R packages available, the interfaces are inconsistent and most are not available in Python. hyppo includes many state of the art multivariate testing procedures. The package is easy-to-use and is flexible eno… ▽ More

    Submitted 12 September, 2024; v1 submitted 3 July, 2019; originally announced July 2019.

  6. arXiv:1904.05329  [pdf, other

    cs.SI stat.ML stat.OT

    GraSPy: Graph Statistics in Python

    Authors: Jaewon Chung, Benjamin D. Pedigo, Eric W. Bridgeford, Bijan K. Varjavand, Hayden S. Helm, Joshua T. Vogelstein

    Abstract: We introduce GraSPy, a Python library devoted to statistical inference, machine learning, and visualization of random graphs and graph populations. This package provides flexible and easy-to-use algorithms for analyzing and understanding graphs with a scikit-learn compliant API. GraSPy can be downloaded from Python Package Index (PyPi), and is released under the Apache 2.0 open-source license. The… ▽ More

    Submitted 14 August, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

    Journal ref: Journal of Machine Learning Research 20.158 (2019): 1-7

  7. On a 'Two Truths' Phenomenon in Spectral Graph Clustering

    Authors: Carey E. Priebe, Youngser Park, Joshua T. Vogelstein, John M. Conroy, Vince Lyzinski, Minh Tang, Avanti Athreya, Joshua Cape, Eric Bridgeford

    Abstract: Clustering is concerned with coherently grouping observations without any explicit concept of true groupings. Spectral graph clustering - clustering the vertices of a graph based on their spectral embedding - is commonly approached via K-means (or, more generally, Gaussian mixture model) clustering composed with either Laplacian or Adjacency spectral embedding (LSE or ASE). Recent theoretical resu… ▽ More

    Submitted 11 February, 2019; v1 submitted 23 August, 2018; originally announced August 2018.

    Journal ref: PNAS 116 (2019) 5995-6000

  8. arXiv:1802.04960  [pdf, other

    stat.ML

    Vertex nomination: The canonical sampling and the extended spectral nomination schemes

    Authors: Jordan Yoder, Li Chen, Henry Pao, Eric Bridgeford, Keith Levin, Donniell Fishkind, Carey Priebe, Vince Lyzinski

    Abstract: Suppose that one particular block in a stochastic block model is of interest, but block labels are only observed for a few of the vertices in the network. Utilizing a graph realized from the model and the observed block labels, the vertex nomination task is to order the vertices with unobserved block labels into a ranked nomination list with the goal of having an abundance of interesting vertices… ▽ More

    Submitted 22 January, 2020; v1 submitted 14 February, 2018; originally announced February 2018.

  9. arXiv:1709.01233  [pdf, other

    stat.ML

    Supervised Dimensionality Reduction for Big Data

    Authors: Joshua T. Vogelstein, Eric Bridgeford, Minh Tang, Da Zheng, Christopher Douville, Randal Burns, Mauro Maggioni

    Abstract: To solve key biomedical problems, experimentalists now routinely measure millions or billions of features (dimensions) per sample, with the hope that data science techniques will be able to build accurate data-driven inferences. Because sample sizes are typically orders of magnitude smaller than the dimensionality of these data, valid inferences require finding a low-dimensional representation tha… ▽ More

    Submitted 23 January, 2021; v1 submitted 5 September, 2017; originally announced September 2017.

    Comments: 6 figures

  10. Discovering and Deciphering Relationships Across Disparate Data Modalities

    Authors: Joshua T. Vogelstein, Eric Bridgeford, Qing Wang, Carey E. Priebe, Mauro Maggioni, Cencheng Shen

    Abstract: Understanding the relationships between different properties of data, such as whether a connectome or genome has information about disease status, is becoming increasingly important in modern biological datasets. While existing approaches can test whether two properties are related, they often require unfeasibly large sample sizes in real data scenarios, and do not provide any insight into how or… ▽ More

    Submitted 6 December, 2018; v1 submitted 16 September, 2016; originally announced September 2016.

    Journal ref: eLife 8, e41690, 2019