Skip to main content

Showing 1–8 of 8 results for author: Garfinkel, S

.
  1. arXiv:2402.04464  [pdf

    cs.AI cs.CY

    Ten Hard Problems in Artificial Intelligence We Must Get Right

    Authors: Gavin Leech, Simson Garfinkel, Misha Yagudin, Alexander Briand, Aleksandr Zhuravlev

    Abstract: We explore the AI2050 "hard problems" that block the promise of AI and cause AI risks: (1) developing general capabilities of the systems; (2) assuring the performance of AI systems and their training processes; (3) aligning system goals with human goals; (4) enabling great applications of AI in real life; (5) addressing economic disruptions; (6) ensuring the participation of all; (7) at the same… ▽ More

    Submitted 19 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: 75 + 19 pages

  2. arXiv:2312.11283  [pdf, other

    stat.AP cs.CR econ.EM

    The 2010 Census Confidentiality Protections Failed, Here's How and Why

    Authors: John M. Abowd, Tamara Adams, Robert Ashmead, David Darais, Sourya Dey, Simson L. Garfinkel, Nathan Goldschlag, Daniel Kifer, Philip Leclerc, Ethan Lew, Scott Moore, Rolando A. Rodríguez, Ramy N. Tadros, Lars Vilhuber

    Abstract: Using only 34 published tables, we reconstruct five variables (census block, sex, age, race, and ethnicity) in the confidential 2010 Census person records. Using the 38-bin age variable tabulated at the census block level, at most 20.1% of reconstructed records can differ from their confidential source on even a single value for these five variables. Using only published data, an attacker can veri… ▽ More

    Submitted 18 December, 2023; originally announced December 2023.

  3. arXiv:2208.01639  [pdf, ps, other

    cs.CR

    Sharpening Your Tools: Updating bulk_extractor for the 2020s

    Authors: Simson Garfinkel, Jonathan Stewart

    Abstract: Bulk_extractor is a high-performance digital forensics tool written in C++. Between 2018 and 2022 we updated the program from C++98 to C++17, performed a complete code refactoring, and adopted a unit test framework. The new version typically runs with 75\% more throughput than the previous version, which we attribute to improved multithreading. We provide lessons and recommendations for other digi… ▽ More

    Submitted 31 March, 2022; originally announced August 2022.

  4. arXiv:2204.08986  [pdf, other

    cs.CR econ.EM stat.AP

    The 2020 Census Disclosure Avoidance System TopDown Algorithm

    Authors: John M. Abowd, Robert Ashmead, Ryan Cumings-Menon, Simson Garfinkel, Micah Heineck, Christine Heiss, Robert Johns, Daniel Kifer, Philip Leclerc, Ashwin Machanavajjhala, Brett Moran, William Sexton, Matthew Spence, Pavel Zhuravlev

    Abstract: The Census TopDown Algorithm (TDA) is a disclosure avoidance system using differential privacy for privacy-loss accounting. The algorithm ingests the final, edited version of the 2020 Census data and the final tabulation geographic definitions. The algorithm then creates noisy versions of key queries on the data, referred to as measurements, using zero-Concentrated Differential Privacy. Another ke… ▽ More

    Submitted 19 April, 2022; originally announced April 2022.

  5. arXiv:2110.13239  [pdf, ps, other

    cs.CR

    An Uncertainty Principle is a Price of Privacy-Preserving Microdata

    Authors: John Abowd, Robert Ashmead, Ryan Cumings-Menon, Simson Garfinkel, Daniel Kifer, Philip Leclerc, William Sexton, Ashley Simpson, Christine Task, Pavel Zhuravlev

    Abstract: Privacy-protected microdata are often the desired output of a differentially private algorithm since microdata is familiar and convenient for downstream users. However, there is a statistical price for this kind of convenience. We show that an uncertainty principle governs the trade-off between accuracy for a population of interest ("sum query") vs. accuracy for its component sub-populations ("poi… ▽ More

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Preprint of NeurIPS 2021 paper

  6. Randomness Concerns When Deploying Differential Privacy

    Authors: Simson L. Garfinkel, Philip Leclerc

    Abstract: The U.S. Census Bureau is using differential privacy (DP) to protect confidential respondent data collected for the 2020 Decennial Census of Population & Housing. The Census Bureau's DP system is implemented in the Disclosure Avoidance System (DAS) and requires a source of random numbers. We estimate that the 2020 Census will require roughly 90TB of random bytes to protect the person and household… ▽ More

    Submitted 6 September, 2020; originally announced September 2020.

    Comments: 12 pages plus 2 pages bibliography

    Journal ref: 19th Workshop on Privacy in the Electronic Society (WPES'20), November 9, 2020, Virtual Event, USA

  7. arXiv:2004.00402  [pdf

    cs.OS

    A File System For Write-Once Media

    Authors: Simson L. Garfinkel, J. Spencer Love

    Abstract: A file system standard for use with write-once media such as digital compact disks is proposed. The file system is designed to work with any operating system and a variety of physical media. Although the implementation is simple, it provides a a full-featured and high-performance alternative to conventional file systems on traditional, multiple-write media such as magnetic disks.

    Submitted 30 March, 2020; originally announced April 2020.

    Comments: MIT Media Laboratory Tech Report, 1985

  8. Issues Encountered Deploying Differential Privacy

    Authors: Simson L. Garfinkel, John M. Abowd, Sarah Powazek

    Abstract: When differential privacy was created more than a decade ago, the motivating example was statistics published by an official statistics agency. In attempting to transition differential privacy from the academy to practice, the U.S. Census Bureau has encountered many challenges unanticipated by differential privacy's creators. These challenges include obtaining qualified personnel and a suitable co… ▽ More

    Submitted 6 September, 2018; originally announced September 2018.