Showing 1–13 of 13 results for author: Nelson, B

  1. arXiv:2409.14256  [pdf, ps, other]

    stat.ME

    POI-SIMEX for Conditionally Poisson Distributed Biomarkers from Tissue Histology

    Authors: Aijun Yang, Phineas T. Hamilton, Brad H. Nelson, Julian J. Lum, Mary Lesperance, Farouk S. Nathoo

    Abstract: Covariate measurement error in regression analysis is an important issue that has been studied extensively under the classical additive and the Berkson error models. Here, we consider cases where covariates are derived from tumor tissue histology, and in particular tissue microarrays. In such settings, biomarkers are evaluated from tissue cores that are subsampled from a larger tissue section so t…

    Submitted 3 November, 2024; v1 submitted 21 September, 2024; originally announced September 2024.

    Comments: 18 pages, 2 figures

  2. arXiv:2312.02119  [pdf, other]

    cs.LG cs.AI cs.CL cs.CR stat.ML

    Tree of Attacks: Jailbreaking Black-Box LLMs Automatically

    Authors: Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Yaron Singer, Amin Karbasi

    Abstract: While Large Language Models (LLMs) display versatile functionality, they continue to generate harmful, biased, and toxic content, as demonstrated by the prevalence of human-designed jailbreaks. In this work, we present Tree of Attacks with Pruning (TAP), an automated method for generating jailbreaks that only requires black-box access to the target LLM. TAP utilizes an attacker LLM to iteratively…

    Submitted 31 October, 2024; v1 submitted 4 December, 2023; originally announced December 2023.

    Comments: Accepted for presentation at NeurIPS 2024. Code: https://github.com/RICommunity/TAP

  3. arXiv:2308.03565  [pdf, other]

    cs.CL stat.CO

    Topological Interpretations of GPT-3

    Authors: Tianyi Sun, Bradley Nelson

    Abstract: This is an experiential study of investigating a consistent method for deriving the correlation between sentence vector and semantic meaning of a sentence. We first used three state-of-the-art word/sentence embedding methods including GPT-3, Word2Vec, and Sentence-BERT, to embed plain text sentence strings into high dimensional spaces. Then we compute the pairwise distance between any possible com…

    Submitted 8 August, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

    Comments: 70 pages

  4. arXiv:2308.01796  [pdf, other]

    stat.CO cs.CG

    Greedy Matroid Algorithm And Computational Persistent Homology

    Authors: Tianyi Sun, Bradley Nelson

    Abstract: An important problem in computational topology is to calculate the homology of a space from samples. In this work, we develop a statistical approach to this problem by calculating the expected rank of an induced map on homology from a sub-sample to the full space. We develop a greedy matroid algorithm for finding an optimal basis for the image of the induced map, and investigate the relationship b…

    Submitted 3 August, 2023; originally announced August 2023.

    Comments: 35 pages

  5. arXiv:2302.02254  [pdf, other]

    stat.CO math.ST

    Getting to "rate-optimal" in ranking & selection

    Authors: Harun Avci, Barry L. Nelson, Andreas Wächter

    Abstract: In their 2004 seminal paper, Glynn and Juneja formally and precisely established the rate-optimal, probability-of-incorrect-selection, replication allocation scheme for selecting the best of k simulated systems. In the case of independent, normally distributed outputs this allocation has a simple form that depends in an intuitively appealing way on the true means and variances. Of course the means…

    Submitted 4 February, 2023; originally announced February 2023.

    Journal ref: Proceedings of the 2021 Winter Simulation Conference

  6. arXiv:2205.07362  [pdf, ps, other]

    cs.LG math.RT stat.ML

    What is an equivariant neural network?

    Authors: Lek-Heng Lim, Bradley J. Nelson

    Abstract: We explain equivariant neural networks, a notion underlying breakthroughs in machine learning from deep convolutional neural networks for computer vision to AlphaFold 2 for protein structure prediction, without assuming knowledge of equivariance or neural networks. The basic mathematical ideas are simple but are often obscured by engineering complications that come with practical realizations. We…

    Submitted 16 November, 2022; v1 submitted 15 May, 2022; originally announced May 2022.

    Comments: 8 pages, 3 figures

    ACM Class: I.2.6

  7. arXiv:2203.08980  [pdf, other]

    stat.ME eess.SY

    Stochastic Simulation Uncertainty Analysis to Accelerate Flexible Biomanufacturing Process Development

    Authors: Wei Xie, Russell R. Barton, Barry L. Nelson, Keqi Wang

    Abstract: Motivated by critical challenges and needs from biopharmaceuticals manufacturing, we propose a general metamodel-assisted stochastic simulation uncertainty analysis framework to accelerate the development of a simulation model with modular design for flexible production processes. There are often very limited process observations. Thus, there exist both simulation and model uncertainties in the sy…

    Submitted 3 September, 2022; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 32 pages, 3 figures. arXiv admin note: substantial text overlap with arXiv:2011.04207

  8. arXiv:2011.04207  [pdf, other]

    stat.ME

    Statistical Uncertainty Analysis for Stochastic Simulation

    Authors: Wei Xie, Barry L. Nelson, Russell R. Barton

    Abstract: When we use simulation to evaluate the performance of a stochastic system, the simulation often contains input distributions estimated from real-world data; therefore, there is both simulation and input uncertainty in the performance estimates. Ignoring either source of uncertainty underestimates the overall statistical error. Simulation uncertainty can be reduced by additional computation (e.g.,…

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 40 pages, 3 figures

  9. arXiv:1905.12200  [pdf, other]

    cs.LG math.AT stat.ML

    A Topology Layer for Machine Learning

    Authors: Rickard Brüel-Gabrielsson, Bradley J. Nelson, Anjan Dwaraknath, Primoz Skraba, Leonidas J. Guibas, Gunnar Carlsson

    Abstract: Topology applied to real world data using persistent homology has started to find applications within machine learning, including deep learning. We present a differentiable topology layer that computes persistent homology based on level set filtrations and edge-based filtrations. We present three novel applications: the topological layer can (i) regularize data reconstruction or the weights of mac…

    Submitted 24 April, 2020; v1 submitted 28 May, 2019; originally announced May 2019.

  10. arXiv:1705.10865  [pdf, other]

    stat.ML stat.AP

    Sparse canonical correlation analysis

    Authors: Xiaotong Suo, Victor Minden, Bradley Nelson, Robert Tibshirani, Michael Saunders

    Abstract: Canonical correlation analysis was proposed by Hotelling [6] and it measures linear relationship between two multidimensional variables. In high dimensional setting, the classical canonical correlation analysis breaks down. We propose a sparse canonical correlation analysis by adding l1 constraints on the canonical vectors and show how to solve it efficiently using linearized alternating direction…

    Submitted 2 June, 2017; v1 submitted 30 May, 2017; originally announced May 2017.

  11. arXiv:1512.06171  [pdf, other]

    stat.ME stat.CO stat.ML

    Regularized Estimation of Piecewise Constant Gaussian Graphical Models: The Group-Fused Graphical Lasso

    Authors: Alexander J. Gibberd, James D. B. Nelson

    Abstract: The time-evolving precision matrix of a piecewise-constant Gaussian graphical model encodes the dynamic conditional dependency structure of a multivariate time-series. Traditionally, graphical models are estimated under the assumption that data is drawn identically from a generating distribution. Introducing sparsity and sparse-difference inducing priors we relax these assumptions and propose a no…

    Submitted 31 October, 2017; v1 submitted 18 December, 2015; originally announced December 2015.

    Comments: 32 pages, 9 figures

    Journal ref: Journal of Computational and Graphical Statistics, 2017, Volume 26, Number 3, pp 623--634

  12. arXiv:1306.1066  [pdf, other]

    stat.ML cs.LG

    Bayesian Differential Privacy through Posterior Sampling

    Authors: Christos Dimitrakakis, Blaine Nelson, Zuhe Zhang, Aikaterini Mitrokotsa, Benjamin Rubinstein

    Abstract: Differential privacy formalises privacy-preserving mechanisms that provide access to a database. We pose the question of whether Bayesian inference itself can be used directly to provide private access to data, with no modification. The answer is affirmative: under certain conditions on the prior, sampling from the posterior distribution can be used to achieve a desired level of privacy and utilit…

    Submitted 23 December, 2016; v1 submitted 5 June, 2013; originally announced June 2013.

    Comments: 38 pages; An earlier version of this article was published in ALT 2014. This version has corrections and additional results

  13. arXiv:1206.6389  [pdf, other]

    cs.LG cs.CR stat.ML

    Poisoning Attacks against Support Vector Machines

    Authors: Battista Biggio, Blaine Nelson, Pavel Laskov

    Abstract: We investigate a family of poisoning attacks against Support Vector Machines (SVM). Such attacks inject specially crafted training data that increases the SVM's test error. Central to the motivation for these attacks is the fact that most learning algorithms assume that their training data comes from a natural or well-behaved distribution. However, this assumption does not generally hold in securi…

    Submitted 25 March, 2013; v1 submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)