Skip to main content

Showing 1–7 of 7 results for author: Vilhuber, L

.
  1. arXiv:2312.11283  [pdf, other

    stat.AP cs.CR econ.EM

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    Authors: John M. Abowd, Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros, Lars Vilhuber

    Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can veri… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  2. arXiv:2309.14581  [pdf

    stat.AP cs.CR econ.EM

    Assessing Utility of Differential Privacy for RCTs

    Authors: Soumya Mukherjee, Aratrika Mustafi, Aleksandra Slavković, Lars Vilhuber

    Abstract: Randomized control trials, RCTs, have become a powerful tool for assessing the impact of interventions and policies in many contexts. They are considered the gold-standard for inference in the biomedical fields and in many social sciences. Researchers have published an increasing number of studies that rely on RCTs for at least part of the inference, and these studies typically include the respons… ▽ More

    Submitted 25 September, 2023; originally announced September 2023.

    Comments: Submitted

  3. Reproducibility and Transparency versus Privacy and Confidentiality: Reflections from a Data Editor

    Authors: Lars Vilhuber

    Abstract: Transparency and reproducibility are often seen in opposition to privacy and confidentiality. Data that need to be kept confidential are seen as an impediment to reproducibility, and privacy would seem to inhibit transparency. I bring a more nuanced view to the discussion, and show, using examples from over 1,000 reproducibility assessments, that confidential data can very well be used in reproduc… ▽ More

    Submitted 23 May, 2023; originally announced May 2023.

  4. Teaching for large-scale Reproducibility Verification

    Authors: Lars Vilhuber, Hyuk Harry Son, Meredith Welch, David N. Wasser, Michael Darisse

    Abstract: We describe a unique environment in which undergraduate students from various STEM and social science disciplines are trained in data provenance and reproducible methods, and then apply that knowledge to real, conditionally accepted manuscripts and associated replication packages. We describe in detail the recruitment, training, and regular activities. While the activity is not part of a regular c… ▽ More

    Submitted 31 March, 2022; originally announced April 2022.

  5. Applying Data Synthesis for Longitudinal Business Data across Three Countries

    Authors: M. Jahangir Alam, Benoit Dostie, Jörg Drechsler, Lars Vilhuber

    Abstract: Data on businesses collected by statistical agencies are challenging to protect. Many businesses have unique characteristics, and distributions of employment, sales, and profits are highly skewed. Attackers wishing to conduct identification attacks often have access to much more information than for any individual. As a consequence, most disclosure avoidance mechanisms fail to strike an acceptable… ▽ More

    Submitted 24 July, 2020; originally announced August 2020.

  6. arXiv:2007.13275  [pdf, other

    econ.EM stat.ME

    Total Error and Variability Measures for the Quarterly Workforce Indicators and LEHD Origin-Destination Employment Statistics in OnTheMap

    Authors: Kevin L. McKinney, Andrew S. Green, Lars Vilhuber, John M. Abowd

    Abstract: We report results from the first comprehensive total quality evaluation of five major indicators in the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) Program Quarterly Workforce Indicators (QWI): total flow-employment, beginning-of-quarter employment, full-quarter employment, average monthly earnings of full-quarter employees, and total quarterly payroll. Beginning-of-quarte… ▽ More

    Submitted 26 July, 2020; originally announced July 2020.

  7. arXiv:1906.09353  [pdf, ps, other

    econ.TH cs.CR cs.DB

    Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods

    Authors: John M. Abowd, Ian M. Schmutte, William Sexton, Lars Vilhuber

    Abstract: With vast databases at their disposal, private tech companies can compete with public statistical agencies to provide population statistics. However, private companies face different incentives to provide high-quality statistics and to protect the privacy of the people whose data are used. When both privacy protection and statistical accuracy are public goods, private providers tend to produce at… ▽ More

    Submitted 21 June, 2019; originally announced June 2019.