Skip to main content

Showing 1–23 of 23 results for author: Gifford, D

.
  1. arXiv:2406.07908  [pdf, other

    cs.LG cs.AI stat.ML

    Ablation Based Counterfactuals

    Authors: Zheng Dai, David K Gifford

    Abstract: Diffusion models are a class of generative models that generate high-quality samples, but at present it is difficult to characterize how they depend upon their training data. This difficulty raises scientific and regulatory questions, and is a consequence of the complexity of diffusion models and their sampling process. To analyze this dependence, we introduce Ablation Based Counterfactuals (ABC),… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 11 pages, 7 figures, appendix included

  2. arXiv:2306.02174  [pdf, other

    stat.ML cs.AI cs.LG

    Training Data Attribution for Diffusion Models

    Authors: Zheng Dai, David K Gifford

    Abstract: Diffusion models have become increasingly popular for synthesizing high-quality samples based on training datasets. However, given the oftentimes enormous sizes of the training datasets, it is difficult to assess how training data impact the samples produced by a trained diffusion model. The difficulty of relating diffusion model inputs and outputs poses significant challenges to model explainabil… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 14 pages, 6 figures

  3. arXiv:2206.08336  [pdf, other

    q-bio.QM cs.LG

    Constrained Submodular Optimization for Vaccine Design

    Authors: Zheng Dai, David Gifford

    Abstract: Advances in machine learning have enabled the prediction of immune system responses to prophylactic and therapeutic vaccines. However, the engineering task of designing vaccines remains a challenge. In particular, the genetic variability of the human immune system makes it difficult to design peptide vaccines that provide widespread immunity in vaccinated populations. We introduce a framework for… ▽ More

    Submitted 26 January, 2023; v1 submitted 16 June, 2022; originally announced June 2022.

    Comments: 24 pages, 9 figures

  4. arXiv:2112.04033  [pdf, other

    cs.CV cs.LG stat.ML

    Image classifiers can not be made robust to small perturbations

    Authors: Zheng Dai, David K. Gifford

    Abstract: The sensitivity of image classifiers to small perturbations in the input is often viewed as a defect of their construction. We demonstrate that this sensitivity is a fundamental property of classifiers. For any arbitrary classifier over the set of $n$-by-$n$ images, we show that for all but one class it is possible to change the classification of all but a tiny fraction of the images in that class… ▽ More

    Submitted 9 August, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

    Comments: 8 pages, 2 figures

  5. arXiv:2103.03014  [pdf, other

    cs.LG cs.AI cs.CV

    Lost in Pruning: The Effects of Pruning Neural Networks beyond Test Accuracy

    Authors: Lucas Liebenwein, Cenk Baykal, Brandon Carter, David Gifford, Daniela Rus

    Abstract: Neural network pruning is a popular technique used to reduce the inference costs of modern, potentially overparameterized, networks. Starting from a pre-trained network, the process is as follows: remove redundant parameters, retrain, and repeat while maintaining the same test accuracy. The result is a model that is a fraction of the size of the original with comparable predictive performance (tes… ▽ More

    Submitted 4 March, 2021; originally announced March 2021.

    Comments: Published in MLSys 2021

  6. arXiv:2101.10902  [pdf, other

    q-bio.QM cs.LG

    Maximum n-times Coverage for Vaccine Design

    Authors: Ge Liu, Alexander Dimitrakakis, Brandon Carter, David Gifford

    Abstract: We introduce the maximum $n$-times coverage problem that selects $k$ overlays to maximize the summed coverage of weighted elements, where each element must be covered at least $n$ times. We also define the min-cost $n$-times coverage problem where the objective is to select the minimum set of overlays such that the sum of the weights of elements that are covered at least $n$ times is at least $τ$.… ▽ More

    Submitted 4 May, 2022; v1 submitted 24 January, 2021; originally announced January 2021.

    Comments: ICLR 2022

  7. arXiv:2005.02425  [pdf, other

    physics.ed-ph

    Epistemic stances toward group work in learning physics: Interactions between epistemology and social dynamics in a collaborative problem solving context

    Authors: Jessica R. Hoehn, Julian D. Gifford, Noah D. Finkelstein

    Abstract: As educators we often ask our physics students to work in groups---on tutorials, during in-class discussions, and on homework, projects, or exams. Researchers have documented the benefits of group work for students' conceptual mastery and problem solving skills, and have worked to optimize the productivity of group work by assigning roles and composing groups based on performance levels or gender.… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 21 pages, 4 figures, submitted to Physical Review Physics Education Research

  8. arXiv:2003.08907  [pdf, other

    cs.LG cs.CV stat.ML

    Overinterpretation reveals image classification model pathologies

    Authors: Brandon Carter, Siddhartha Jain, Jonas Mueller, David Gifford

    Abstract: Image classifiers are typically scored on their test set accuracy, but high accuracy can mask a subtle type of model failure. We find that high scoring convolutional neural networks (CNNs) on popular benchmarks exhibit troubling pathologies that allow them to display high accuracy even in the absence of semantically salient features. When a model provides a high-confidence decision without salient… ▽ More

    Submitted 7 December, 2021; v1 submitted 19 March, 2020; originally announced March 2020.

    Comments: NeurIPS 2021

  9. arXiv:2002.07916  [pdf, other

    cs.LG stat.ML

    Information Condensing Active Learning

    Authors: Siddhartha Jain, Ge Liu, David Gifford

    Abstract: We introduce Information Condensing Active Learning (ICAL), a batch mode model agnostic Active Learning (AL) method targeted at Deep Bayesian Active Learning that focuses on acquiring labels for points which have as much information as possible about the still unacquired points. ICAL uses the Hilbert Schmidt Independence Criterion (HSIC) to measure the strength of the dependency between a candidat… ▽ More

    Submitted 19 February, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

  10. arXiv:1906.07380  [pdf, other

    cs.LG stat.ML

    Maximizing Overall Diversity for Improved Uncertainty Estimates in Deep Ensembles

    Authors: Siddhartha Jain, Ge Liu, Jonas Mueller, David Gifford

    Abstract: The inaccuracy of neural network models on inputs that do not stem from the training data distribution is both problematic and at times unrecognized. Model uncertainty estimation can address this issue, where uncertainty estimates are often based on the variation in predictions produced by a diverse ensemble of models applied to the same input. Here we describe Maximize Overall Diversity (MOD), a… ▽ More

    Submitted 12 February, 2020; v1 submitted 18 June, 2019; originally announced June 2019.

    Comments: 10 pages, 3 figures

  11. arXiv:1810.03805  [pdf, other

    cs.LG stat.ML

    What made you do this? Understanding black-box decisions with sufficient input subsets

    Authors: Brandon Carter, Jonas Mueller, Siddhartha Jain, David Gifford

    Abstract: Local explanation frameworks aim to rationalize particular decisions made by a black-box prediction model. Existing techniques are often restricted to a specific type of predictor or based on input saliency, which may be undesirably sensitive to factors unrelated to the model's decision making process. We instead propose sufficient input subsets that identify minimal subsets of features whose obse… ▽ More

    Submitted 8 February, 2019; v1 submitted 9 October, 2018; originally announced October 2018.

    Comments: Published in AISTATS 2019; Equal contribution by first two authors

  12. Galaxy Cluster Mass Reconstruction Project - IV. Understanding the effects of imperfect membership on cluster mass estimation

    Authors: R. Wojtak, L. Old, G. A. Mamon, F. R. Pearce, R. de Carvalho, C. Sifón, M. E. Gray, R. A. Skibba, D. Croton, S. Bamford, D. Gifford, A. von der Linden, J. C. Muñoz-Cuartas, V. Müller, R. J. Pearson, E. Rozo, E. Rykoff, A. Saro, T. Sepp, E. Tempel

    Abstract: The primary difficulty in measuring dynamical masses of galaxy clusters from galaxy data lies in the separation between true cluster members from interloping galaxies along the line of sight. We study the impact of membership contamination and incompleteness on cluster mass estimates obtained with 25 commonly used techniques applied to nearly 1000 mock clusters. We show that all methods overestima… ▽ More

    Submitted 16 August, 2018; v1 submitted 8 June, 2018; originally announced June 2018.

    Comments: 18 pages, 11 figures, 3tables; accepted for publication in MNRAS

    Journal ref: MNRAS, 481, 324 (2018)

  13. Inferring Gravitational Potentials from Mass Densities in Cluster-sized Halos

    Authors: Christopher J. Miller, Alejo Stark, Daniel Gifford, Nicholas Kern

    Abstract: We use N-body simulations to quantify how the escape velocity in cluster-sized halos maps to the gravitational potential in a LambdaCDM universe. Using spherical density-potential pairs and the Poisson equation, we find that the matter density inferred gravitational potential profile predicts the escape velocity profile to within a few percent accuracy for group and cluster-sized halos (10^13 < M_… ▽ More

    Submitted 16 December, 2016; originally announced December 2016.

    Comments: Published in the Astrophysical Journal

    Journal ref: ApJ 2016, volume 822, page 41

  14. Stacking Caustic Masses from Galaxy Clusters

    Authors: Daniel Gifford, Nicholas Kern, Christopher J. Miller

    Abstract: Ongoing and future spectroscopic surveys will measure numerous galaxy redshifts within tens of thousands of galaxy clusters. However, the sampling within these clusters will be low, 15 < N < 50 per cluster. With such data, it will be difficult to achieve accurate and precise mass estimates for individual clusters using phase-space mass estimation techniques. We develop and test a new stacking algo… ▽ More

    Submitted 16 December, 2016; originally announced December 2016.

    Comments: Acceptable for publication, Astrophysical Journal

  15. On Escaping a Galaxy Cluster in an Accelerating Universe

    Authors: Alejo Stark, Christopher J. Miller, Daniel Gifford

    Abstract: We derive the escape velocity profile for an Einasto density field in an accelerating universe and demonstrate its physical viability by comparing theoretical expectations to both light-cone data generated from N-body simulations and archival data on 20 galaxy clusters. We demonstrate that the projection function ($g(β)$) is deemed physically viable only for the theoretical expectation that includ… ▽ More

    Submitted 17 November, 2016; originally announced November 2016.

    Comments: 10 pages, 5 figures, accepted to ApJ

    Journal ref: ApJ 830 109 (2016)

  16. Probing Theories of Gravity with Phase Space-Inferred Potentials of Galaxy Clusters

    Authors: Alejo Stark, Christopher J. Miller, Nicholas Kern, Daniel Gifford, Gong-Bo Zhao, Baojiu Li, Kazuya Koyama, Robert C. Nichol

    Abstract: Modified theories of gravity provide us with a unique opportunity to generate innovative tests of gravity. In Chameleon f(R) gravity, the gravitational potential differs from the weak-field limit of general relativity (GR) in a mass dependent way. We develop a probe of gravity which compares high mass clusters, where Chameleon effects are weak, to low mass clusters, where the effects can be strong… ▽ More

    Submitted 29 February, 2016; originally announced March 2016.

    Comments: 9 pages, 4 figures, accepted to PRD

    Journal ref: Phys.Rev.D93:084036,2016

  17. arXiv:1512.02800  [pdf, other

    astro-ph.CO astro-ph.GA

    The XMM Cluster Survey: evolution of the velocity dispersion -- temperature relation over half a Hubble time

    Authors: Susan Wilson, Matt Hilton, Philip J. Rooney, Caroline Caldwell, Scott T. Kay, Chris A. Collins, Ian G. McCarthy, A. Kathy Romer, Alberto Bermeo-Hernandez, Rebecca Bernstein, Luiz da Costa, Daniel Gifford, Devon Hollowood, Ben Hoyle, Tesla Jeltema, Andrew R. Liddle, Marcio A. G Maia, Robert G. Mann, Julian A. Mayers, Nicola Mehrtens, Christopher J. Miller, Robert C. Nichol, Ricardo Ogando, Martin Sahlén, Benjamin Stahl , et al. (4 additional authors not shown)

    Abstract: We measure the evolution of the velocity dispersion--temperature ($σ_{\rm v}$--$T_{\rm X}$) relation up to $z = 1$ using a sample of 38 galaxy clusters drawn from the \textit{XMM} Cluster Survey. This work improves upon previous studies by the use of a homogeneous cluster sample and in terms of the number of high redshift clusters included. We present here new redshift and velocity dispersion meas… ▽ More

    Submitted 3 August, 2016; v1 submitted 9 December, 2015; originally announced December 2015.

    Comments: Accepted to MNRAS (3 August 2016); Paper: 15 pages, 12 figures; Appendix A: 1 table; Appendix B: 34 Tables; Appendix C: 2 Figures

  18. arXiv:1511.04486  [pdf, other

    stat.ME q-bio.GN q-bio.QM

    Modeling Persistent Trends in Distributions

    Authors: Jonas Mueller, Tommi Jaakkola, David Gifford

    Abstract: We present a nonparametric framework to model a short sequence of probability distributions that vary both due to underlying effects of sequential progression and confounding noise. To distinguish between these two types of variation and estimate the sequential-progression effects, our approach leverages an assumption that these effects follow a persistent trend. This work is motivated by the rece… ▽ More

    Submitted 24 May, 2017; v1 submitted 13 November, 2015; originally announced November 2015.

    Comments: To appear in: Journal of the American Statistical Association

    Journal ref: Journal of the American Statistical Association, 113(523):1296-1310, 2018

  19. arXiv:1503.07188  [pdf, other

    astro-ph.CO astro-ph.GA

    A Multi-Wavelength Mass Analysis of RCS2 J232727.6-020437, a ~3x10$^{15}$M$_{\odot}$ Galaxy Cluster at z=0.7

    Authors: K. Sharon, M. D. Gladders, D. P. Marrone, H. Hoekstra, E. Rasia, H. Bourdin, D. Gifford, A. K. Hicks, C. Greer, T. Mroczkowski, L. F. Barrientos, M. Bayliss, J. E. Carlstrom, D. G. Gilbank, M. Gralla, J. Hlavacek-Larrondo, E. Leitch, P. Mazzotta, C. Miller, S. J. C. Muchovej, T. Schrabback, H. K. C. Yee

    Abstract: We present an initial study of the mass and evolutionary state of a massive and distant cluster, RCS2 J232727.6-020437. This cluster, at z=0.6986, is the richest cluster discovered in the RCS2 project. The mass measurements presented in this paper are derived from all possible mass proxies: X-ray measurements, weak-lensing shear, strong lensing, Sunyaev Zel'dovich effect decrement, the velocity di… ▽ More

    Submitted 3 November, 2015; v1 submitted 24 March, 2015; originally announced March 2015.

    Comments: 19 pages, 15 figures, submitted to ApJ on March 5, 2015; in press. Manuscript revised following the referee review

  20. arXiv:1502.07347  [pdf, other

    astro-ph.CO astro-ph.GA

    Galaxy Cluster Mass Reconstruction Project: II. Quantifying scatter and bias using contrasting mock catalogues

    Authors: L. Old, R. Wojtak, G. A. Mamon, R. A. Skibba, F. R. Pearce, D. Croton, S. Bamford, P. Behroozi, R. de Carvalho, J. C. Muñoz-Cuartas, D. Gifford, M. E. Gray, A. von der Linden, M. R. Merrifield, S. I. Muldrew, V. Müller, R. J. Pearson, T. J. Ponman, E. Rozo, E. Rykoff, A. Saro, T. Sepp, C. Sifón, E. Tempel

    Abstract: This article is the second in a series in which we perform an extensive comparison of various galaxy-based cluster mass estimation techniques that utilise the positions, velocities and colours of galaxies. Our aim is to quantify the scatter, systematic bias and completeness of cluster masses derived from a diverse set of 25 galaxy-based methods using two contrasting mock galaxy catalogues based on… ▽ More

    Submitted 25 February, 2015; originally announced February 2015.

    Comments: 25 pages, 19 figures, 7 tables. Accepted for publication in MNRAS

  21. Galaxy Cluster Mass Reconstruction Project: I. Methods and first results on galaxy-based techniques

    Authors: L. Old, R. A. Skibba, F. R. Pearce, D. Croton, S. I. Muldrew, J. C. Muñoz-Cuartas, D. Gifford, M. E. Gray, A. von der Linden, G. A. Mamon, M. R. Merrifield, V. Müller, R. J. Pearson, T. J. Ponman, A. Saro, T. Sepp, C. Sifón, E. Tempel, E. Tundo, Y. O. Wang, R. Wojtak

    Abstract: This paper is the first in a series in which we perform an extensive comparison of various galaxy-based cluster mass estimation techniques that utilise the positions, velocities and colours of galaxies. Our primary aim is to test the performance of these cluster mass estimation techniques on a diverse set of models that will increase in complexity. We begin by providing participating methods with… ▽ More

    Submitted 18 March, 2014; originally announced March 2014.

    Comments: 25 pages, 15 figures, 5 tables. Accepted for publication in MNRAS

  22. Velocity Anisotropy and Shape Bias in the Caustic Technique

    Authors: Daniel Gifford, Christopher J. Miller

    Abstract: We use the Millennium Simulation to quantify the statistical accuracy and precision of the escape velocity technique for measuring cluster-sized halo masses at z~0.1. We show that in 3D, one can measure nearly unbiased (<4%) halo masses (>1.5x10^14 M_solar h^-1) with 10-15% scatter. Line-of-sight projection effects increase the scatter to ~25%, where we include the known velocity anisotropies. The… ▽ More

    Submitted 28 June, 2013; originally announced July 2013.

    Comments: Published in ApJ Letters

    Journal ref: The Astrophysical Journal Letters, Volume 768, Issue 2, article id. L32, 5 pp. (2013)

  23. A Systematic Analysis of Caustic Methods for Galaxy Cluster Masses

    Authors: Daniel Gifford, Christopher J. Miller, Nicholas Kern

    Abstract: We quantify the expected observed statistical and systematic uncertainties of the escape velocity as a measure of the gravitational potential and total mass of galaxy clusters. We focus our attention on low redshift (z < 0.15) clusters, where large and deep spectroscopic datasets currently exist. Utilizing a suite of Millennium Simulation semi-analytic galaxy catalogs, we find that the dynamical m… ▽ More

    Submitted 28 June, 2013; originally announced July 2013.

    Comments: 14 pages, 10 figures, ApJ accepted