Skip to main content

Showing 1–7 of 7 results for author: Bowen, C M

Searching in archive stat. Search in all archives.
.
  1. arXiv:2412.11794  [pdf, other

    cs.HC cs.CR stat.AP

    But Can You Use It? Design Recommendations for Differentially Private Interactive Systems

    Authors: Liudas Panavas, Joshua Snoke, Erika Tyagi, Claire McKay Bowen, Aaron R. Williams

    Abstract: Accessing data collected by federal statistical agencies is essential for public policy research and improving evidence-based decision making, such as evaluating the effectiveness of social programs, understanding demographic shifts, or addressing public health challenges. Differentially private interactive systems, or validation servers, can form a crucial part of the data-sharing infrastructure.… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

  2. arXiv:2308.00872  [pdf, ps, other

    stat.ME

    Advancing Microdata Privacy Protection: A Review of Synthetic Data

    Authors: Jingchen Hu, Claire McKay Bowen

    Abstract: Synthetic data generation is a powerful tool for privacy protection when considering public release of record-level data files. Initially proposed about three decades ago, it has generated significant research and application interest. To meet the pressing demand of data privacy protection in a variety of contexts, the field needs more researchers and practitioners. This review provides a comprehe… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  3. arXiv:2110.12055  [pdf, other

    stat.AP

    A Feasibility Study of Differentially Private Summary Statistics and Regression Analyses with Evaluations on Administrative and Survey Data

    Authors: Andrés F. Barrientos, Aaron R. Williams, Joshua Snoke, Claire McKay Bowen

    Abstract: Federal administrative data, such as tax data, are invaluable for research, but because of privacy concerns, access to these data is typically limited to select agencies and a few individuals. An alternative to sharing microlevel data is to allow individuals to query statistics without directly accessing the confidential data. This paper studies the feasibility of using differentially private (DP)… ▽ More

    Submitted 30 June, 2023; v1 submitted 22 October, 2021; originally announced October 2021.

    Comments: Main: 30 pages, 3 figures, 3 tables; Supplemental: 26 pages, 14 figures, 12 tables; References: 7 pages

  4. arXiv:1911.12704  [pdf, other

    stat.AP cs.CR cs.LG

    Comparative Study of Differentially Private Synthetic Data Algorithms from the NIST PSCR Differential Privacy Synthetic Data Challenge

    Authors: Claire McKay Bowen, Joshua Snoke

    Abstract: Differentially private synthetic data generation offers a recent solution to release analytically useful data while preserving the privacy of individuals in the data. In order to utilize these algorithms for public policy decisions, policymakers need an accurate understanding of these algorithms' comparative performance. Correspondingly, data practitioners require standard metrics for evaluating t… ▽ More

    Submitted 12 October, 2020; v1 submitted 28 November, 2019; originally announced November 2019.

    Comments: 32 pages (27 main, 5 references), 3 figures, 9 tables

  5. arXiv:1803.06763  [pdf, other

    stat.AP

    Differentially Private Data Release via Statistical Election to Partition Sequentially

    Authors: Claire McKay Bowen, Fang Liu, Binyue Su

    Abstract: Differential Privacy (DP) formalizes privacy in mathematical terms and provides a robust concept for privacy protection. DIfferentially Private Data Synthesis (DIPS) techniques produce and release synthetic individual-level data in the DP framework. One key challenge to developing DIPS methods is preservation of the statistical utility of synthetic data, especially in high-dimensional settings. We… ▽ More

    Submitted 20 October, 2020; v1 submitted 18 March, 2018; originally announced March 2018.

    Comments: 24 pages, 7 figures

  6. Comparative Study of Differentially Private Data Synthesis Methods

    Authors: Claire McKay Bowen, Fang Liu

    Abstract: When sharing data among researchers or releasing data for public use, there is a risk of exposing sensitive information of individuals in the data set. Data synthesis (DS) is a statistical disclosure limitation technique for releasing synthetic data sets with pseudo individual records. Traditional DS techniques often rely on strong assumptions of a data intruder's behaviors and background knowledg… ▽ More

    Submitted 8 January, 2019; v1 submitted 2 February, 2016; originally announced February 2016.

    Comments: The main paper is the first 48 pages (8 pages of reference). The rest of the pages (49 - 67) contain the Supplemental Material

    Journal ref: Statistical Science 35 (2), 280-307, 2020

  7. arXiv:1409.0909  [pdf, other

    stat.ME

    Partitioning a Large Simulation as It Runs

    Authors: Kary Myers, Earl Lawrence, Michael Fugate, Claire McKay Bowen, Lawrence Ticknor, Jon Woodring, Joanne Wendelberger, Jim Ahrens

    Abstract: As computer simulations continue to grow in size and complexity, they present a particularly challenging class of big data problems. Many application areas are moving toward exascale computing systems, systems that perform $10^{18}$ FLOPS (FLoating-point Operations Per Second) --- a billion billion calculations per second. Simulations at this scale can generate output that exceeds both the storage… ▽ More

    Submitted 23 September, 2015; v1 submitted 2 September, 2014; originally announced September 2014.