Showing 1–18 of 18 results for author: Abowd, J M

  1. arXiv:2407.12775 [pdf]

    econ.GN

    Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics

    Authors: Kevin L. McKinney, John M. Abowd

    Abstract: We use place of birth information from the Social Security Administration linked to earnings data from the Longitudinal Employer-Household Dynamics Program and detailed race and ethnicity data from the 2010 Census to study how long-term earnings differentials vary by place of birth for different self-identified race and ethnicity categories. We focus on foreign-born persons from countries that are…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: CRIW Conference Race, Ethnicity, and Economic Statistics for the 21st Century, Spring 2024

  2. arXiv:2312.14191 [pdf, ps, other]

    cs.CR econ.EM stat.AP

    Noisy Measurements Are Important, the Design of Census Products Is Much More Important

    Authors: John M. Abowd

    Abstract: McCartan et al. (2023) call for "making differential privacy work for census data users." This commentary explains why the 2020 Census Noisy Measurement Files (NMFs) are not the best focus for that plea. The August 2021 letter from 62 prominent researchers asking for production of the direct output of the differential privacy system deployed for the 2020 Census signaled the engagement of the schol…

    Submitted 1 May, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    Journal ref: Harvard Data Science Review, Volume 6, Number 2 (Spring, 2024)

  3. arXiv:2312.11283 [pdf, other]

    stat.AP cs.CR econ.EM

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    Authors: John M. Abowd, Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros, Lars Vilhuber

    Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can veri…

    Submitted 18 December, 2023; originally announced December 2023.

  4. arXiv:2312.10863 [pdf, ps, other]

    cs.CR stat.CO

    Disclosure Avoidance for the 2020 Census Demographic and Housing Characteristics File

    Authors: Ryan Cumings-Menon, Robert Ashmead, Daniel Kifer, Philip Leclerc, Matthew Spence, Pavel Zhuravlev, John M. Abowd

    Abstract: In "The 2020 Census Disclosure Avoidance System TopDown Algorithm," Abowd et al. (2022) describe the concepts and methods used by the Disclosure Avoidance System (DAS) to produce formally private output in support of the 2020 Census data product releases, with a particular focus on the DAS implementation that was used to create the 2020 Census Redistricting Data (P.L. 94-171) Summary File. In this…

    Submitted 17 December, 2023; originally announced December 2023.

  5. arXiv:2310.09398 [pdf, other]

    cs.CR econ.EM stat.ME

    An In-Depth Examination of Requirements for Disclosure Risk Assessment

    Authors: Ron S. Jarmin, John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Nathan Goldschlag, Michael B. Hawes, Sallie Ann Keller, Daniel Kifer, Philip Leclerc, Jerome P. Reiter, Rolando A. Rodríguez, Ian Schmutte, Victoria A. Velkoff, Pavel Zhuravlev

    Abstract: The use of formal privacy to protect the confidentiality of responses in the 2020 Decennial Census of Population and Housing has triggered renewed interest and debate over how to measure the disclosure risks and societal benefits of the published data products. Following long-established precedent in economics and statistics, we argue that any proposal for quantifying disclosure risk should be bas…

    Submitted 13 October, 2023; originally announced October 2023.

    Comments: 47 pages, 1 table

    Journal ref: PNAS, October 13, 2023, Vol. 120, No. 43

  6. arXiv:2308.15445 [pdf]

    econ.EM

    Mixed-Effects Methods for Search and Matching Research

    Authors: John M. Abowd, Kevin L. McKinney

    Abstract: We study mixed-effects methods for estimating equations containing person and firm effects. In economics such models are usually estimated using fixed-effects methods. Recent enhancements to those fixed-effects methods include corrections to the bias in estimating the covariance matrix of the person and firm effects, which we also consider.

    Submitted 29 August, 2023; originally announced August 2023.

  7. arXiv:2303.00845 [pdf, ps, other]

    stat.AP cs.CR econ.EM

    $21^{st}$ Century Statistical Disclosure Limitation: Motivations and Challenges

    Authors: John M Abowd, Michael B Hawes

    Abstract: This chapter examines the motivations and imperatives for modernizing how statistical agencies approach statistical disclosure limitation for official data product releases. It discusses the implications for agencies' broader data governance and decision-making, and it identifies challenges that agencies will likely face along the way. In conclusion, the chapter proposes some principles and best p…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: Forthcoming CRC Handbook of Formally Private and Synthetic Data Approaches for Statistical Disclosure Control

  8. arXiv:2209.03310 [pdf, other]

    cs.CR stat.ME

    Bayesian and Frequentist Semantics for Common Variations of Differential Privacy: Applications to the 2020 Census

    Authors: Daniel Kifer, John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Philip Leclerc, Ashwin Machanavajjhala, William Sexton, Pavel Zhuravlev

    Abstract: The purpose of this paper is to guide interpretation of the semantic privacy guarantees for some of the major variations of differential privacy, which include pure, approximate, Rényi, zero-concentrated, and $f$ differential privacy. We interpret privacy-loss accounting parameters, frequentist semantics, and Bayesian semantics (including new results). The driving application is the interpretation…

    Submitted 7 September, 2022; originally announced September 2022.

  9. Measuring Race in US Economic Statistics: What Do We Know?

    Authors: Sonya Ravindranath Waddell, John M. Abowd, Camille Busette, Mark Hugo Lopez

    Abstract: This article is an edited transcript of the session of the same name at the 38th Annual NABE Economic Policy Conference: Policy Options for Sustainable and Inclusive Growth. The panelists are experts from government and private research organizations.

    Submitted 10 August, 2022; originally announced August 2022.

    Comments: Pre-publication version. Includes all information in the published version Bus Econ (2022)

  10. Confidentiality Protection in the 2020 US Census of Population and Housing

    Authors: John M Abowd, Michael B Hawes

    Abstract: In an era where external data and computational capabilities far exceed statistical agencies' own resources and capabilities, they face the renewed challenge of protecting the confidentiality of underlying microdata when publishing statistics in very granular form and ensuring that these granular data are used for statistical purposes only. Conventional statistical disclosure limitation methods ar…

    Submitted 27 December, 2022; v1 submitted 7 June, 2022; originally announced June 2022.

    Comments: Version 2 corrects a few transcription errors in Tables 2, 3 and 5. Version 3 adds final journal copy edits to the preprint

    Journal ref: Annual Review of Statistics and Its Application 2023 10:1

  11. arXiv:2204.08986 [pdf, other]

    cs.CR econ.EM stat.AP

    The 2020 Census Disclosure Avoidance System TopDown Algorithm

    Authors: John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Simson Garfinkel, Micah Heineck, Christine Heiss, Robert Johns, Daniel Kifer, Philip Leclerc, Ashwin Machanavajjhala, Brett Moran, William Sexton, Matthew Spence, Pavel Zhuravlev

    Abstract: The Census TopDown Algorithm (TDA) is a disclosure avoidance system using differential privacy for privacy-loss accounting. The algorithm ingests the final, edited version of the 2020 Census data and the final tabulation geographic definitions. The algorithm then creates noisy versions of key queries on the data, referred to as measurements, using zero-Concentrated Differential Privacy. Another ke…

    Submitted 19 April, 2022; originally announced April 2022.

  12. arXiv:2203.16654 [pdf, other]

    cs.CR

    Geographic Spines in the 2020 Census Disclosure Avoidance System

    Authors: Ryan Cumings-Menon, John M. Abowd, Robert Ashmead, Daniel Kifer, Philip Leclerc, Jeffrey Ocker, Michael Ratcliffe, Pavel Zhuravlev

    Abstract: The 2020 Census Disclosure Avoidance System (DAS) is a formally private mechanism that first adds independent noise to cross tabulations for a set of pre-specified hierarchical geographic units, which is known as the geographic spine. After post-processing these noisy measurements, DAS outputs a formally private database with fields indicating location in the standard census geographic spine, whic…

    Submitted 15 March, 2024; v1 submitted 30 March, 2022; originally announced March 2022.

  13. arXiv:2112.05822 [pdf]

    econ.GN stat.AP

    U.S. Long-Term Earnings Outcomes by Sex, Race, Ethnicity, and Place of Birth

    Authors: Kevin L. McKinney, John M. Abowd, Hubert P. Janicki

    Abstract: This paper is part of the Global Income Dynamics Project cross-country comparison of earnings inequality, volatility, and mobility. Using data from the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files we produce a uniform set of earnings statistics for the U.S. From 1998 to 2019, we find U.S. earnings inequality has increased and volatility has decreased. T…

    Submitted 10 December, 2021; originally announced December 2021.

    Comments: 77 pages, 42 figures

  14. arXiv:2008.00253 [pdf]

    econ.GN stat.AP

    Male Earnings Volatility in LEHD before, during, and after the Great Recession

    Authors: Kevin L. McKinney, John M. Abowd

    Abstract: This paper is part of a coordinated collection of papers on prime-age male earnings volatility. Each paper produces a similar set of statistics for the same reference population using a different primary data source. Our primary data source is the Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) infrastructure files. Using LEHD data from 1998 to 2016, we create a well-defined popula…

    Submitted 1 February, 2022; v1 submitted 1 August, 2020; originally announced August 2020.

    Comments: Revision submitted to JBES with figures included in the text and Appendix added

  15. arXiv:2007.13275 [pdf, other]

    econ.EM stat.ME

    Total Error and Variability Measures for the Quarterly Workforce Indicators and LEHD Origin-Destination Employment Statistics in OnTheMap

    Authors: Kevin L. McKinney, Andrew S. Green, Lars Vilhuber, John M. Abowd

    Abstract: We report results from the first comprehensive total quality evaluation of five major indicators in the U.S. Census Bureau's Longitudinal Employer-Household Dynamics (LEHD) Program Quarterly Workforce Indicators (QWI): total flow-employment, beginning-of-quarter employment, full-quarter employment, average monthly earnings of full-quarter employees, and total quarterly payroll. Beginning-of-quarte…

    Submitted 26 July, 2020; originally announced July 2020.

  16. arXiv:1906.09353 [pdf, ps, other]

    econ.TH cs.CR cs.DB

    Suboptimal Provision of Privacy and Statistical Accuracy When They are Public Goods

    Authors: John M. Abowd, Ian M. Schmutte, William Sexton, Lars Vilhuber

    Abstract: With vast databases at their disposal, private tech companies can compete with public statistical agencies to provide population statistics. However, private companies face different incentives to provide high-quality statistics and to protect the privacy of the people whose data are used. When both privacy protection and statistical accuracy are public goods, private providers tend to produce at…

    Submitted 21 June, 2019; originally announced June 2019.

  17. Issues Encountered Deploying Differential Privacy

    Authors: Simson L. Garfinkel, John M. Abowd, Sarah Powazek

    Abstract: When differential privacy was created more than a decade ago, the motivating example was statistics published by an official statistics agency. In attempting to transition differential privacy from the academy to practice, the U.S. Census Bureau has encountered many challenges unanticipated by differential privacy's creators. These challenges include obtaining qualified personnel and a suitable co…

    Submitted 6 September, 2018; originally announced September 2018.

  18. An Economic Analysis of Privacy Protection and Statistical Accuracy as Social Choices

    Authors: John M. Abowd, Ian M. Schmutte

    Abstract: Statistical agencies face a dual mandate to publish accurate statistics while protecting respondent privacy. Increasing privacy protection requires decreased accuracy. Recognizing this as a resource allocation problem, we propose an economic solution: operate where the marginal cost of increasing privacy equals the marginal benefit. Our model of production, from computer science, assumes data are…

    Submitted 20 August, 2018; originally announced August 2018.

    Comments: Forthcoming in American Economic Review