Showing 1–43 of 43 results for author: Ho, D E

Searching in archive cs.
  1. arXiv:2506.17303 [pdf, ps, other]

    cs.CY

    The California Report on Frontier AI Policy

    Authors: Rishi Bommasani, Scott R. Singer, Ruth E. Appel, Sarah Cen, A. Feder Cooper, Elena Cryst, Lindsey A. Gailmard, Ian Klaus, Meredith M. Lee, Inioluwa Deborah Raji, Anka Reuel, Drew Spence, Alexander Wan, Angelina Wang, Daniel Zhang, Daniel E. Ho, Percy Liang, Dawn Song, Joseph E. Gonzalez, Jonathan Zittrain, Jennifer Tour Chayes, Mariano-Florentino Cuellar, Li Fei-Fei

    Abstract: The innovations emerging at the frontier of artificial intelligence (AI) are poised to create historic opportunities for humanity but also raise complex policy challenges. Continued progress in frontier AI carries the potential for profound advances in scientific discovery, economic productivity, and broader social well-being. As the epicenter of global AI innovation, California has a unique oppor…

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Authored by the Joint California Policy Working Group on AI Frontier Models

  2. arXiv:2506.13735 [pdf, ps, other]

    cs.CY

    Bias Delayed is Bias Denied? Assessing the Effect of Reporting Delays on Disparity Assessments

    Authors: Jennah Gosciak, Aparna Balagopalan, Derek Ouyang, Allison Koenecke, Marzyeh Ghassemi, Daniel E. Ho

    Abstract: Conducting disparity assessments at regular time intervals is critical for surfacing potential biases in decision-making and improving outcomes across demographic groups. Because disparity assessments fundamentally depend on the availability of demographic information, their efficacy is limited by the availability and consistency of available demographic identifiers. While prior work has considere…

    Submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2505.15216 [pdf, ps, other]

    cs.CR cs.AI cs.CL cs.LG

    BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

    Authors: Andy K. Zhang, Joey Ji, Celeste Menders, Riya Dulepet, Thomas Qin, Ron Y. Wang, Junrong Wu, Kyleen Liao, Jiliang Li, Jinghan Hu, Sara Hong, Nardos Demilew, Shivatmica Murgai, Jason Tran, Nishka Kacheria, Ethan Ho, Denis Liu, Lauren McLane, Olivia Bruvik, Dai-Rong Han, Seungwoo Kim, Akhil Vyas, Cuiyuanxiu Chen, Ryan Li, Weiran Xu, et al. (9 additional authors not shown)

    Abstract: AI agents have the potential to significantly alter the cybersecurity landscape. To help us understand this change, we introduce the first framework to capture offensive and defensive cyber-capabilities in evolving real-world systems. Instantiating this framework with BountyBench, we set up 25 systems with complex, real-world codebases. To capture the vulnerability lifecycle, we define three task…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 78 pages

  4. arXiv:2505.12546 [pdf, other]

    cs.CL cs.CY cs.LG

    Extracting memorized pieces of (copyrighted) books from open-weight language models

    Authors: A. Feder Cooper, Aaron Gokaslan, Amy B. Cyphert, Christopher De Sa, Mark A. Lemley, Daniel E. Ho, Percy Liang

    Abstract: Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expression. Drawing on adversarial ML and copyright law, we show that these polarized positions dramatically oversimplify the relationship between memorization and copyright. To do so, we leverage a recen…

    Submitted 18 May, 2025; originally announced May 2025.

  5. A Reasoning-Focused Legal Retrieval Benchmark

    Authors: Lucia Zheng, Neel Guha, Javokhir Arifov, Sarah Zhang, Michal Skreta, Christopher D. Manning, Peter Henderson, Daniel E. Ho

    Abstract: As the legal community increasingly examines the use of large language models (LLMs) for various legal applications, legal AI developers have turned to retrieval-augmented LLMs ("RAG" systems) to improve system performance and robustness. An obstacle to the development of specialized RAG systems is the lack of realistic legal RAG benchmarks which capture the complexity of both legal retrieval and…

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: CS&Law 2025. For data, see https://reglab.github.io/legal-rag-benchmarks/

  6. arXiv:2503.03888 [pdf, other]

    cs.CL

    AI for Scaling Legal Reform: Mapping and Redacting Racial Covenants in Santa Clara County

    Authors: Faiz Surani, Mirac Suzgun, Vyoma Raman, Christopher D. Manning, Peter Henderson, Daniel E. Ho

    Abstract: Legal reform can be challenging in light of the volume, complexity, and interdependence of laws, codes, and records. One salient example of this challenge is the effort to restrict and remove racially restrictive covenants, clauses in property deeds that historically barred individuals of specific races from purchasing homes. Despite the Supreme Court holding such racial covenants unenforceable in…

    Submitted 6 March, 2025; v1 submitted 12 February, 2025; originally announced March 2025.

    Comments: https://reglab.github.io/racialcovenants/

  7. arXiv:2502.01926 [pdf, other]

    cs.CY cs.CL

    Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs

    Authors: Angelina Wang, Michelle Phan, Daniel E. Ho, Sanmi Koyejo

    Abstract: Algorithmic fairness has conventionally adopted the mathematically convenient perspective of racial color-blindness (i.e., difference unaware treatment). However, we contend that in a range of important settings, group difference awareness matters. For example, differentiating between groups may be necessary in legal contexts (e.g., the U.S. compulsory draft applies to men but not women) and harm…

    Submitted 22 May, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: Accepted to ACL 2025 Main Conference

  8. arXiv:2501.04902 [pdf, other]

    cs.CY cs.HC

    Artificial Intelligence in Environmental Protection: The Importance of Organizational Context from a Field Study in Wisconsin

    Authors: Nicolas Rothbacher, Kit T. Rodolfa, Mihir Bhaskar, Erin Maneri, Christine Tsang, Daniel E. Ho

    Abstract: Advances in Artificial Intelligence (AI) have generated widespread enthusiasm for the potential of AI to support our understanding and protection of the environment. As such tools move from basic research to more consequential settings, such as regulatory enforcement, the human context of how AI is utilized, interpreted, and deployed becomes increasingly critical. Yet little work has systematicall…

    Submitted 8 January, 2025; originally announced January 2025.

  9. arXiv:2412.06966 [pdf, other]

    cs.LG cs.AI cs.CY

    Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

    Authors: A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen, Matthew Jagielski, Katja Filippova, Ken Ziyu Liu, Alexandra Chouldechova, Jamie Hayes, Yangsibo Huang, Niloofar Mireshghallah, Ilia Shumailov, Eleni Triantafillou, Peter Kairouz, Nicole Mitchell, Percy Liang, Daniel E. Ho, Yejin Choi, Sanmi Koyejo, Fernando Delgado, James Grimmelmann, Vitaly Shmatikov, Christopher De Sa, Solon Barocas, Amy Cyphert, Mark Lemley, et al. (10 additional authors not shown)

    Abstract: We articulate fundamental mismatches between technical methods for machine unlearning in Generative AI, and documented aspirations for broader impact that these methods could have for law and policy. These aspirations are both numerous and varied, motivated by issues that pertain to privacy, copyright, safety, and more. For example, unlearning is often invoked as a solution for removing the effect…

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: Presented at the 2nd Workshop on Generative AI and Law at ICML (July 2024)

  10. arXiv:2410.21195 [pdf, other]

    cs.CL cs.AI cs.CY

    Belief in the Machine: Investigating Epistemological Blind Spots of Language Models

    Authors: Mirac Suzgun, Tayfun Gur, Federico Bianchi, Daniel E. Ho, Thomas Icard, Dan Jurafsky, James Zou

    Abstract: As language models (LMs) become integral to fields like healthcare, law, and journalism, their ability to differentiate between fact, belief, and knowledge is essential for reliable decision-making. Failure to grasp these distinctions can lead to significant consequences in areas such as medical diagnosis, legal judgments, and dissemination of fake news. Despite this, current literature has largel…

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: https://github.com/suzgunmirac/belief-in-the-machine

  11. arXiv:2408.08926 [pdf, other]

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

    Authors: Andy K. Zhang, Neil Perry, Riya Dulepet, Joey Ji, Celeste Menders, Justin W. Lin, Eliot Jones, Gashon Hussein, Samantha Liu, Donovan Jasper, Pura Peetathawatchai, Ari Glenn, Vikram Sivashankar, Daniel Zamoshchin, Leo Glikbarg, Derek Askaryar, Mike Yang, Teddy Zhang, Rishi Alluri, Nathan Tran, Rinnara Sangpisit, Polycarpos Yiorkadjis, Kenny Osele, Gautham Raghupathi, Dan Boneh, et al. (2 additional authors not shown)

    Abstract: Language Model (LM) agents for cybersecurity that are capable of autonomously identifying vulnerabilities and executing exploits have potential to cause real-world impact. Policymakers, model providers, and researchers in the AI and cybersecurity communities are interested in quantifying the capabilities of such agents to help mitigate cyberrisk and investigate opportunities for penetration testin…

    Submitted 12 April, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: ICLR 2025 Oral

  12. arXiv:2407.16900 [pdf, other]

    cs.LG cs.AI cs.CY

    Regulating AI Adaptation: An Analysis of AI Medical Device Updates

    Authors: Kevin Wu, Eric Wu, Kit Rodolfa, Daniel E. Ho, James Zou

    Abstract: While the pace of development of AI has rapidly progressed in recent years, the implementation of safe and effective regulatory frameworks has lagged behind. In particular, the adaptive nature of AI models presents unique challenges to regulators as updating a model can improve its performance but also introduce safety risks. In the US, the Food and Drug Administration (FDA) has been a forerunner…

    Submitted 22 June, 2024; originally announced July 2024.

    Journal ref: CHIL 2024

  13. arXiv:2406.13847 [pdf, other]

    cs.CV

    Locating and measuring marine aquaculture production from space: a computer vision approach in the French Mediterranean

    Authors: Sebastian Quaade, Andrea Vallebueno, Olivia D. N. Alcabes, Kit T. Rodolfa, Daniel E. Ho

    Abstract: Aquaculture production -- the cultivation of aquatic plants and animals -- has grown rapidly since the 1990s, but sparse, self-reported and aggregate production data limits the effective understanding and monitoring of the industry's trends and potential risks. Building on a manual survey of aquaculture production from remote sensing imagery, we train a computer vision model to identify marine aqu…

    Submitted 19 June, 2024; originally announced June 2024.

  14. arXiv:2406.12165 [pdf, other]

    cs.CL

    Statistical Uncertainty in Word Embeddings: GloVe-V

    Authors: Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho

    Abstract: Static word embeddings are ubiquitous in computational social science applications and contribute to practical decision-making in a variety of fields including law and healthcare. However, assessing the statistical uncertainty in downstream conclusions drawn from word embedding statistics has remained challenging. When using only point estimates for embeddings, researchers have no streamlined way…

    Submitted 17 June, 2024; originally announced June 2024.

  15. arXiv:2405.20362 [pdf, other]

    cs.CL cs.CY

    Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

    Authors: Varun Magesh, Faiz Surani, Matthew Dahl, Mirac Suzgun, Christopher D. Manning, Daniel E. Ho

    Abstract: Legal practice has witnessed a sharp rise in products incorporating artificial intelligence (AI). Such tools are designed to assist with a wide range of core legal tasks, from search and summarization of caselaw to document drafting. But the large language models used in these tools are prone to "hallucinate," or make up false information, making their use risky in high-stakes domains. Recently, c…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Our dataset, tool outputs, and labels will be made available upon publication. This version of the manuscript (May 30, 2024) is updated to reflect an evaluation of Westlaw's AI-Assisted Research

  16. arXiv:2404.02127 [pdf, other]

    cs.CL cs.AI cs.LG

    LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain

    Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

    Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, the legal domain is underrepresented in typical instruction datasets (e.g., only 10 out of 1600+ tasks in Super-NaturalInstructions). To study whether instruction tuning on legal datasets is necessary for strong legal reasoning, we aggregate 58 annotated legal datasets and write instructi…

    Submitted 23 January, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at Findings of NAACL 2025

    MSC Class: 68T50 ACM Class: I.2

  17. arXiv:2403.07918 [pdf, other]

    cs.CY cs.AI cs.LG

    On the Societal Impact of Open Foundation Models

    Authors: Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

    Abstract: Foundation models are powerful technologies: how they are released publicly directly shapes their societal impact. In this position paper, we focus on open foundation models, defined here as those with broadly available model weights (e.g. Llama 2, Stable Diffusion XL). We identify five distinctive properties (e.g. greater customizability, poor monitoring) of open foundation models that lead to bo…

    Submitted 27 February, 2024; originally announced March 2024.

  18. arXiv:2402.02008 [pdf, other]

    cs.CL cs.AI

    How well do LLMs cite relevant medical references? An evaluation framework and analyses

    Authors: Kevin Wu, Eric Wu, Ally Cassasola, Angela Zhang, Kevin Wei, Teresa Nguyen, Sith Riantawan, Patricia Shi Riantawan, Daniel E. Ho, James Zou

    Abstract: Large language models (LLMs) are currently being used to answer medical questions across a variety of clinical domains. Recent top-performing commercial LLMs, in particular, are also capable of citing sources to support their responses. In this paper, we ask: do the sources that LLMs generate actually support the claims that they make? To answer this, we propose three contributions. First, as expe…

    Submitted 2 February, 2024; originally announced February 2024.

  19. arXiv:2401.01301 [pdf, other]

    cs.CL cs.AI cs.CY

    Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

    Authors: Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E. Ho

    Abstract: Do large language models (LLMs) know the law? These models are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of hallucinations -- textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations, documenting LLMs' varying performance across jurisdict…

    Submitted 21 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Journal ref: Journal of Legal Analysis 16, no. 1 (2024): 64-93

  20. arXiv:2310.01679 [pdf, other]

    cs.LG cs.CY stat.ML

    Estimating and Implementing Conventional Fairness Metrics With Probabilistic Protected Features

    Authors: Hadi Elzayn, Emily Black, Patrick Vossler, Nathanael Jo, Jacob Goldin, Daniel E. Ho

    Abstract: The vast majority of techniques to train fair models require access to the protected attribute (e.g., race, gender), either at train time or in production. However, in many important applications this protected attribute is largely unavailable. In this paper, we develop methods for measuring and reducing fairness violations in a setting with limited access to protected attribute labels. Specifical…

    Submitted 2 October, 2023; originally announced October 2023.

  21. arXiv:2309.17337 [pdf, other]

    cs.LG cs.AI cs.CY

    Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools

    Authors: Emily Black, Rakshit Naidu, Rayid Ghani, Kit T. Rodolfa, Daniel E. Ho, Hoda Heidari

    Abstract: While algorithmic fairness is a thriving area of research, in practice, mitigating issues of bias often gets reduced to enforcing an arbitrarily chosen fairness metric, either by enforcing fairness constraints during the optimization step, post-processing model outputs, or by manipulating the training data. Recent work has called on the ML community to take a more holistic approach to tackle fairn…

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: EAAMO'23 (Archival)

  22. arXiv:2308.11462 [pdf, other]

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia, et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc…

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  23. arXiv:2306.09237 [pdf, other]

    cs.CL cs.AI cs.LG

    One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial Support

    Authors: Ronja Stern, Vishvaksenan Rasiah, Veton Matoshi, Srinanda Brügger Bose, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

    Abstract: Recent strides in Large Language Models (LLMs) have saturated many Natural Language Processing (NLP) benchmarks, emphasizing the need for more challenging ones to properly assess LLM capabilities. However, domain-specific and multilingual benchmarks are rare because they require in-depth expertise to develop. Still, most public models are trained predominantly on English corpora, while other langu…

    Submitted 21 August, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    MSC Class: 68T50 ACM Class: I.2

  24. arXiv:2306.02069 [pdf, other]

    cs.CL cs.AI cs.LG

    MultiLegalPile: A 689GB Multilingual Legal Corpus

    Authors: Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho

    Abstract: Large, high-quality datasets are crucial for training Large Language Models (LLMs). However, so far, there are few datasets available for specialized critical domains such as law and the available ones are often only for the English language. We curate and release MultiLegalPile, a 689GB corpus in 24 languages from 17 jurisdictions. The MultiLegalPile corpus, which includes diverse legal data sour…

    Submitted 19 May, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2024

    MSC Class: 68T50 ACM Class: I.2

  25. Potential for allocative harm in an environmental justice data tool

    Authors: Benjamin Q. Huynh, Elizabeth T. Chin, Allison Koenecke, Derek Ouyang, Daniel E. Ho, Mathew V. Kiang, David H. Rehkopf

    Abstract: Neighborhood-level screening algorithms are increasingly being deployed to inform policy decisions. We evaluate one such algorithm, CalEnviroScreen - designed to promote environmental justice and used to guide hundreds of millions of dollars in public funding annually - assessing its potential for allocative harm. We observe the model to be sensitive to subjective model decisions, with 16% of trac…

    Submitted 12 April, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Nat Mach Intell 6, 187-194 (2024)

  26. arXiv:2303.02580 [pdf, other]

    stat.AP cs.CY

    Estimating Racial Disparities When Race is Not Observed

    Authors: Cory McCartan, Robin Fisher, Jacob Goldin, Daniel E. Ho, Kosuke Imai

    Abstract: The estimation of racial disparities in various fields is often hampered by the lack of individual-level racial information. In many cases, the law prohibits the collection of such information to prevent direct racial discrimination. As a result, analysts have frequently adopted Bayesian Improved Surname Geocoding (BISG) and its variants, which combine individual names and addresses with Census da…

    Submitted 16 April, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: 28 pages, 9 figures, plus references and appendices

  27. arXiv:2209.06120 [pdf, ps, other]

    cs.AI

    LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

    Authors: Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré

    Abstract: Can foundation models be guided to execute tasks involving legal reasoning? We believe that building a benchmark to answer this question will require sustained collaborative efforts between the computer science and legal communities. To that end, this short paper serves three purposes. First, we describe how IRAC -- a framework legal scholars use to distinguish different types of legal reasoning -- can…

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 tables

  28. arXiv:2208.11747 [pdf, other]

    cs.LG

    Entropy Regularization for Population Estimation

    Authors: Ben Chugg, Peter Henderson, Jacob Goldin, Daniel E. Ho

    Abstract: Entropy regularization is known to improve exploration in sequential decision-making problems. We show that this same mechanism can also lead to nearly unbiased and lower-variance estimates of the mean reward in the optimize-and-estimate structured bandit setting. Mean reward estimation (i.e., population estimation) tasks have recently been shown to be essential for public policy settings where le…

    Submitted 24 August, 2022; originally announced August 2022.

  29. Detecting Environmental Violations with Satellite Imagery in Near Real Time: Land Application under the Clean Water Act

    Authors: Ben Chugg, Nicolas Rothbacher, Alex Feng, Xiaoqi Long, Daniel E. Ho

    Abstract: This paper introduces a new, highly consequential setting for the use of computer vision for environmental sustainability. Concentrated Animal Feeding Operations (CAFOs) (aka intensive livestock farms or "factory farms") produce significant manure and pollution. Dumping manure in the winter months poses significant environmental risks and violates environmental law in many states. Yet the federal…

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM '22

  30. arXiv:2207.00220 [pdf, other]

    cs.CL cs.CY

    Pile of Law: Learning Responsible Data Filtering from the Law and a 256GB Open-Source Legal Dataset

    Authors: Peter Henderson, Mark S. Krass, Lucia Zheng, Neel Guha, Christopher D. Manning, Dan Jurafsky, Daniel E. Ho

    Abstract: One concern with the rise of large language models lies with their potential for significant harm, particularly from pretraining on biased, obscene, copyrighted, and private information. Emerging ethical approaches have attempted to filter pretraining material, but such approaches have been ad hoc and failed to take context into account. We offer an approach to filtering grounded in law, which has…

    Submitted 29 November, 2022; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Presented at NeurIPS Datasets & Benchmarks (2022)

  31. arXiv:2206.09875 [pdf, other]

    cs.LG cs.CY

    Algorithmic Fairness and Vertical Equity: Income Fairness with IRS Tax Audit Models

    Authors: Emily Black, Hadi Elzayn, Alexandra Chouldechova, Jacob Goldin, Daniel E. Ho

    Abstract: This study examines issues of algorithmic fairness in the context of systems that inform tax audit selection by the United States Internal Revenue Service (IRS). While the field of algorithmic fairness has developed primarily around notions of treating like individuals alike, we instead explore the concept of vertical equity -- appropriately accounting for relevant differences across individuals -…

    Submitted 20 June, 2022; originally announced June 2022.

  32. arXiv:2206.04737 [pdf, other]

    cs.CY

    Outsider Oversight: Designing a Third Party Audit Ecosystem for AI Governance

    Authors: Inioluwa Deborah Raji, Peggy Xu, Colleen Honigsberg, Daniel E. Ho

    Abstract: Much attention has focused on algorithmic audits and impact assessments to hold developers and users of algorithmic systems accountable. But existing algorithmic accountability policy approaches have neglected the lessons from non-algorithmic domains: notably, the importance of interventions that allow for the effective participation of third parties. Our paper synthesizes lessons from other field…

    Submitted 9 June, 2022; originally announced June 2022.

    Comments: Presented at 5th Annual ACM/AAAI AI Ethics and Society (AIES) conference

  33. arXiv:2204.11910 [pdf, other]

    cs.LG cs.CY

    Integrating Reward Maximization and Population Estimation: Sequential Decision-Making for Internal Revenue Service Audit Selection

    Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Kristen Altenburger, Alex Turk, John Guyton, Jacob Goldin, Daniel E. Ho

    Abstract: We introduce a new setting, optimize-and-estimate structured bandits. Here, a policy must select a batch of arms, each characterized by its own context, that would allow it to both maximize reward and maintain an accurate (ideally unbiased) population estimate of the reward. This setting is inherent to many public and private sector applications and often requires handling delayed feedback, small…

    Submitted 24 January, 2023; v1 submitted 25 April, 2022; originally announced April 2022.

    Comments: Accepted to the Thirty-Seventh AAAI Conference On Artificial Intelligence (AAAI), 2023

  34. arXiv:2112.10988 [pdf, other]

    cs.CV cs.LG

    Mapping industrial poultry operations at scale with deep learning and aerial imagery

    Authors: Caleb Robinson, Ben Chugg, Brandon Anderson, Juan M. Lavista Ferres, Daniel E. Ho

    Abstract: Concentrated Animal Feeding Operations (CAFOs) pose serious risks to air, water, and public health, but have proven to be challenging to regulate. The U.S. Government Accountability Office notes that a basic challenge is the lack of comprehensive location information on CAFOs. We use the USDA's National Agricultural Imagery Program (NAIP) 1m/pixel aerial imagery to detect poultry CAFOs across the…

    Submitted 21 December, 2021; originally announced December 2021.

  35. Beyond Ads: Sequential Decision-Making Algorithms in Law and Public Policy

    Authors: Peter Henderson, Ben Chugg, Brandon Anderson, Daniel E. Ho

    Abstract: We explore the promises and challenges of employing sequential decision-making algorithms -- such as bandits, reinforcement learning, and active learning -- in law and public policy. While such algorithms have well-characterized performance in the private sector (e.g., online advertising), the tendency to naively apply algorithms motivated by one domain, often online advertisements, can be called…

    Submitted 29 November, 2022; v1 submitted 13 December, 2021; originally announced December 2021.

    Comments: Version 1 presented at Causal Inference Challenges in Sequential Decision Making: Bridging Theory and Practice (2021), a NeurIPS 2021 Workshop; Version 2 presented at the 2nd ACM Symposium on Computer Science and Law (2022) (DOI: https://dl.acm.org/doi/10.1145/3511265.3550439)

  36. arXiv:2110.13306 [pdf, other]

    cs.LG

    Reconciling Risk Allocation and Prevalence Estimation in Public Health Using Batched Bandits

    Authors: Ben Chugg, Daniel E. Ho

    Abstract: In many public health settings, there is a perceived tension between allocating resources to known vulnerable areas and learning about the overall prevalence of the problem. Inspired by a door-to-door Covid-19 testing program we helped design, we combine multi-armed bandit strategies and insights from sampling theory to demonstrate how to recover accurate prevalence estimates while continuing to a…

    Submitted 25 October, 2021; originally announced October 2021.

    Comments: Published in Machine Learning in Public Health Workshop at NeurIPS 2021

  37. arXiv:2108.07258 [pdf, other]

    cs.LG cs.AI cs.CY

    On the Opportunities and Risks of Foundation Models

    Authors: Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, et al. (89 additional authors not shown)

    Abstract: AI is undergoing a paradigm shift with the rise of models (e.g., BERT, DALL-E, GPT-3) that are trained on broad data at scale and are adaptable to a wide range of downstream tasks. We call these models foundation models to underscore their critically central yet incomplete character. This report provides a thorough account of the opportunities and risks of foundation models, ranging from their cap…

    Submitted 12 July, 2022; v1 submitted 16 August, 2021; originally announced August 2021.

    Comments: Authored by the Center for Research on Foundation Models (CRFM) at the Stanford Institute for Human-Centered Artificial Intelligence (HAI). Report page with citation guidelines: https://crfm.stanford.edu/report.html

  38. Context-Aware Legal Citation Recommendation using Deep Learning

    Authors: Zihan Huang, Charles Low, Mengqiu Teng, Hongyi Zhang, Daniel E. Ho, Mark S. Krass, Matthias Grabmair

    Abstract: Lawyers and judges spend a large amount of time researching the proper legal authority to cite while drafting decisions. In this paper, we develop a citation recommendation tool that can help improve efficiency in the process of opinion drafting. We train four types of machine learning models, including a citation-list based method (collaborative filtering) and three context-based methods (text si…

    Submitted 20 June, 2021; originally announced June 2021.

    Comments: 10 pages published in Proceedings of ICAIL 2021; link to data here: https://reglab.stanford.edu/data/bva-case-citation-dataset ; code available here: https://github.com/TUMLegalTech/bva-citation-prediction

  39. Enhancing Environmental Enforcement with Near Real-Time Monitoring: Likelihood-Based Detection of Structural Expansion of Intensive Livestock Farms

    Authors: Ben Chugg, Brandon Anderson, Seiji Eicher, Sandy Lee, Daniel E. Ho

    Abstract: Much environmental enforcement in the United States has historically relied on either self-reported data or physical, resource-intensive, infrequent inspections. Advances in remote sensing and computer vision, however, have the potential to augment compliance monitoring by detecting early warning signs of noncompliance. We demonstrate a process for rapid identification of significant structural ex…

    Submitted 2 August, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Journal ref: International Journal of Applied Earth Observation and Geoinformation, Volume 103, 2021, 102463, ISSN 0303-2434

  40. arXiv:2104.08671  [pdf, other]

    cs.CL

    When Does Pretraining Help? Assessing Self-Supervised Learning for Law and the CaseHOLD Dataset

    Authors: Lucia Zheng, Neel Guha, Brandon R. Anderson, Peter Henderson, Daniel E. Ho

    Abstract: While self-supervised learning has made rapid advances in natural language processing, it remains unclear when researchers should engage in resource-intensive domain-specific pretraining (domain pretraining). The law, puzzlingly, has yielded few documented instances of substantial gains to domain pretraining in spite of the fact that legal language is widely seen to be unique. We hypothesize that…

    Submitted 5 July, 2021; v1 submitted 17 April, 2021; originally announced April 2021.

    Comments: ICAIL 2021. Code & data available at https://github.com/reglab/casehold

  41. arXiv:2103.09787  [pdf, other]

    cs.CV

    Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery

    Authors: Caleb Robinson, Anthony Ortiz, Juan M. Lavista Ferres, Brandon Anderson, Daniel E. Ho

    Abstract: Longitudinal studies are vital to understanding dynamic changes of the planet, but labels (e.g., buildings, facilities, roads) are often available only for a single point in time. We propose a general model, Temporal Cluster Matching (TCM), for detecting building changes in time series of remotely sensed imagery when footprint labels are observed only once. The intuition behind the model is that t…

    Submitted 29 June, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

    Comments: Published in ACM COMPASS 2021

  42. arXiv:2012.14285  [pdf]

    cs.CY cs.LG

    Affirmative Algorithms: The Legal Grounds for Fairness as Awareness

    Authors: Daniel E. Ho, Alice Xiang

    Abstract: While there has been a flurry of research in algorithmic fairness, what is less recognized is that modern antidiscrimination law may prohibit the adoption of such techniques. We make three contributions. First, we discuss how such approaches will likely be deemed "algorithmic affirmative action," posing serious legal risks of violating equal protection, particularly under the higher education juri…

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 12 pages, 3 figures

    Journal ref: 10/30/20 U. Chi. L. Rev. Online 143, https://lawreviewblog.uchicago.edu/2020/10/30/aa-ho-xiang/

  43. Leveraging Administrative Data for Bias Audits: Assessing Disparate Coverage with Mobility Data for COVID-19 Policy

    Authors: Amanda Coston, Neel Guha, Derek Ouyang, Lisa Lu, Alexandra Chouldechova, Daniel E. Ho

    Abstract: Anonymized smartphone-based mobility data has been widely adopted in devising and evaluating COVID-19 response strategies such as the targeting of public health resources. Yet little attention has been paid to measurement validity and demographic bias, due in part to the lack of documentation about which users are represented as well as the challenge of obtaining ground truth data on unique visits…

    Submitted 15 April, 2021; v1 submitted 13 November, 2020; originally announced November 2020.

    Journal ref: Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency. pp. 173-184