Skip to main content

Showing 1–8 of 8 results for author: Lewis, B

Searching in archive stat. Search in all archives.
.
  1. arXiv:2411.05020  [pdf, other

    cs.CY stat.AP

    Cast vote records: A database of ballots from the 2020 U.S. Election

    Authors: Shiro Kuriwaki, Mason Reece, Samuel Baltz, Aleksandra Conevska, Joseph R. Loffredo, Can Mutlu, Taran Samarth, Kevin E. Acevedo Jetter, Zachary Djanogly Garai, Kate Murray, Shigeo Hirano, Jeffrey B. Lewis, James M. Snyder Jr., Charles H. Stewart III

    Abstract: Ballots are the core records of elections. Electronic records of actual ballots cast (cast vote records) are available to the public in some jurisdictions. However, they have been released in a variety of formats and have not been independently evaluated. Here we introduce a database of cast vote records from the 2020 U.S. general election. We downloaded publicly available unstandardized cast vote… ▽ More

    Submitted 24 October, 2024; originally announced November 2024.

    Comments: 26 pages and appendix

  2. arXiv:2403.15291  [pdf, other

    stat.AP physics.soc-ph q-bio.PE

    Wastewater-based Epidemiology for COVID-19 Surveillance and Beyond: A Survey

    Authors: Chen Chen, Yunfan Wang, Gursharn Kaur, Aniruddha Adiga, Baltazar Espinoza, Srinivasan Venkatramanan, Andrew Warren, Bryan Lewis, Justin Crow, Rekha Singh, Alexandra Lorentz, Denise Toney, Madhav Marathe

    Abstract: The pandemic of COVID-19 has imposed tremendous pressure on public health systems and social economic ecosystems over the past years. To alleviate its social impact, it is important to proactively track the prevalence of COVID-19 within communities. The traditional way to estimate the disease prevalence is to estimate from reported clinical test data or surveys. However, the coverage of clinical t… ▽ More

    Submitted 23 September, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

  3. Privacy Violations in Election Results

    Authors: Shiro Kuriwaki, Jeffrey B. Lewis, Michael Morse

    Abstract: After an election, should election officials release a copy of each anonymous ballot? Some policymakers have championed public disclosure to counter distrust, but others worry that it might undermine ballot secrecy. We introduce the term vote revelation to refer to the linkage of a vote on an anonymous ballot to the voter's name in the public voter file, and detail how such revelation could theore… ▽ More

    Submitted 14 March, 2025; v1 submitted 8 August, 2023; originally announced August 2023.

    Comments: Published version in Science Advances

    Journal ref: Science Advances (2025), vol 11, issue 11, adt1512

  4. arXiv:2010.14491  [pdf, other

    cs.LG stat.AP

    Examining Deep Learning Models with Multiple Data Sources for COVID-19 Forecasting

    Authors: Lijing Wang, Aniruddha Adiga, Srinivasan Venkatramanan, Jiangzhuo Chen, Bryan Lewis, Madhav Marathe

    Abstract: The COVID-19 pandemic represents the most significant public health disaster since the 1918 influenza pandemic. During pandemics such as COVID-19, timely and reliable spatio-temporal forecasting of epidemic dynamics is crucial. Deep learning-based time series models for forecasting have recently gained popularity and have been successfully used for epidemic forecasting. Here we focus on the design… ▽ More

    Submitted 23 November, 2020; v1 submitted 27 October, 2020; originally announced October 2020.

  5. arXiv:1712.00546  [pdf, other

    stat.AP stat.CO

    Calibrating a Stochastic Agent Based Model Using Quantile-based Emulation

    Authors: Arindam Fadikar, Dave Higdon, Jiangzhuo Chen, Brian Lewis, Srini Venkatramanan, Madhav Marathe

    Abstract: In a number of cases, the Quantile Gaussian Process (QGP) has proven effective in emulating stochastic, univariate computer model output (Plumlee and Tuo, 2014). In this paper, we develop an approach that uses this emulation approach within a Bayesian model calibration framework to calibrate an agent-based model of an epidemic. In addition, this approach is extended to handle the multivariate natu… ▽ More

    Submitted 1 December, 2017; originally announced December 2017.

    Comments: 20 pages, 12 figures

    MSC Class: 62P47; 62G43; 62H46

  6. arXiv:1608.00451  [pdf, other

    stat.CO math.NA

    Numerical tolerance for spectral decompositions of random matrices

    Authors: Avanti Athreya, Michael Kane, Bryan Lewis, Zachary Lubberts, Vince Lyzinski, Youngser Park, Carey E. Priebe, Minh Tang

    Abstract: We precisely quantify the impact of statistical error in the quality of a numerical approximation to a random matrix eigendecomposition, and under mild conditions, we use this to introduce an optimal numerical tolerance for residual error in spectral decompositions of random matrices. We demonstrate that terminating an eigendecomposition algorithm when the numerical error and statistical error are… ▽ More

    Submitted 30 January, 2020; v1 submitted 1 August, 2016; originally announced August 2016.

    Comments: 20 pages, 2 figures

    MSC Class: 15; 62; 65

  7. arXiv:1512.07246  [pdf, other

    stat.CO

    Efficient Thresholded Correlation using Truncated Singular Value Decomposition

    Authors: James Baglama, Michael Kane, Bryan Lewis, Alex Poliakov

    Abstract: Efficiently computing a subset of a correlation matrix consisting of values above a specified threshold is important to many practical applications. Real-world problems in genomics, machine learning, finance other applications can produce correlation matrices too large to explicitly form and tractably compute. Often, only values corresponding to highly-correlated vectors are of interest, and those… ▽ More

    Submitted 11 March, 2016; v1 submitted 22 December, 2015; originally announced December 2015.

    Comments: 12 pages

  8. Scatter Matrix Concordance: A Diagnostic for Regressions on Subsets of Data

    Authors: Michael J. Kane, Bryan Lewis, Sekhar Tatikonda, Simon Urbanek

    Abstract: Linear regression models depend directly on the design matrix and its properties. Techniques that efficiently estimate model coefficients by partitioning rows of the design matrix are increasingly popular for large-scale problems because they fit well with modern parallel computing architectures. We propose a simple measure of {\em concordance} between a design matrix and a subset of its rows that… ▽ More

    Submitted 12 July, 2015; originally announced July 2015.