Skip to main content

Showing 1–50 of 77 results for author: Ho, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.17303  [pdf, ps, other

    cs.CY

    The California Report on Frontier AI Policy

    Authors: Rishi Bommasani, Scott R. Singer, Ruth E. Appel, Sarah Cen, A. Feder Cooper, Elena Cryst, Lindsey A. Gailmard, Ian Klaus, Meredith M. Lee, Inioluwa Deborah Raji, Anka Reuel, Drew Spence, Alexander Wan, Angelina Wang, Daniel Zhang, Daniel E. Ho, Percy Liang, Dawn Song, Joseph E. Gonzalez, Jonathan Zittrain, Jennifer Tour Chayes, Mariano-Florentino Cuellar, Li Fei-Fei

    Abstract: The innovations emerging at the frontier of artificial intelligence (AI) are poised to create historic opportunities for humanity but also raise complex policy challenges. Continued progress in frontier AI carries the potential for profound advances in scientific discovery, economic productivity, and broader social well-being. As the epicenter of global AI innovation, California has a unique oppor… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

    Comments: Authored by the Joint California Policy Working Group on AI Frontier Models

  2. arXiv:2506.13735  [pdf, ps, other

    cs.CY

    Bias Delayed is Bias Denied? Assessing the Effect of Reporting Delays on Disparity Assessments

    Authors: Jennah Gosciak, Aparna Balagopalan, Derek Ouyang, Allison Koenecke, Marzyeh Ghassemi, Daniel E. Ho

    Abstract: Conducting disparity assessments at regular time intervals is critical for surfacing potential biases in decision-making and improving outcomes across demographic groups. Because disparity assessments fundamentally depend on the availability of demographic information, their efficacy is limited by the availability and consistency of available demographic identifiers. While prior work has considere… ▽ More

    Submitted 16 June, 2025; originally announced June 2025.

  3. arXiv:2506.04252  [pdf, ps, other

    cs.AI cs.CL cs.LG

    A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy

    Authors: Yang Zhao, Chengxiao Dai, Dusit Niyato, Chuan Fu Tan, Keyi Xiang, Yueyang Wang, Zhiquan Yeo, Daren Tan Zong Loong, Jonathan Low Zhaozhi, Eugene H. Z. HO

    Abstract: Large language models (LLMs) hold promise for sustainable manufacturing, but often hallucinate industrial codes and emission factors, undermining regulatory and investment decisions. We introduce CircuGraphRAG, a retrieval-augmented generation (RAG) framework that grounds LLMs outputs in a domain-specific knowledge graph for the circular economy. This graph connects 117,380 industrial and waste en… ▽ More

    Submitted 1 June, 2025; originally announced June 2025.

  4. arXiv:2505.17860  [pdf, other

    cs.GR cs.CV cs.LG

    Multi-Person Interaction Generation from Two-Person Motion Priors

    Authors: Wenning Xu, Shiyu Fan, Paul Henderson, Edmond S. L. Ho

    Abstract: Generating realistic human motion with high-level controls is a crucial task for social understanding, robotics, and animation. With high-quality MOCAP data becoming more available recently, a wide range of data-driven approaches have been presented. However, modelling multi-person interactions still remains a less explored area. In this paper, we present Graph-driven Interaction Sampling, a metho… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    Comments: SIGGRAPH 2025 Conference Papers

    ACM Class: I.3.7

  5. arXiv:2505.15216  [pdf, ps, other

    cs.CR cs.AI cs.CL cs.LG

    BountyBench: Dollar Impact of AI Agent Attackers and Defenders on Real-World Cybersecurity Systems

    Authors: Andy K. Zhang, Joey Ji, Celeste Menders, Riya Dulepet, Thomas Qin, Ron Y. Wang, Junrong Wu, Kyleen Liao, Jiliang Li, Jinghan Hu, Sara Hong, Nardos Demilew, Shivatmica Murgai, Jason Tran, Nishka Kacheria, Ethan Ho, Denis Liu, Lauren McLane, Olivia Bruvik, Dai-Rong Han, Seungwoo Kim, Akhil Vyas, Cuiyuanxiu Chen, Ryan Li, Weiran Xu , et al. (9 additional authors not shown)

    Abstract: AI agents have the potential to significantly alter the cybersecurity landscape. To help us understand this change, we introduce the first framework to capture offensive and defensive cyber-capabilities in evolving real-world systems. Instantiating this framework with BountyBench, we set up 25 systems with complex, real-world codebases. To capture the vulnerability lifecycle, we define three task… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 78 pages

  6. arXiv:2505.12546  [pdf, other

    cs.CL cs.CY cs.LG

    Extracting memorized pieces of (copyrighted) books from open-weight language models

    Authors: A. Feder Cooper, Aaron Gokaslan, Amy B. Cyphert, Christopher De Sa, Mark A. Lemley, Daniel E. Ho, Percy Liang

    Abstract: Plaintiffs and defendants in copyright lawsuits over generative AI often make sweeping, opposing claims about the extent to which large language models (LLMs) have memorized plaintiffs' protected expression. Drawing on adversarial ML and copyright law, we show that these polarized positions dramatically oversimplify the relationship between memorization and copyright. To do so, we leverage a recen… ▽ More

    Submitted 18 May, 2025; originally announced May 2025.

  7. A Reasoning-Focused Legal Retrieval Benchmark

    Authors: Lucia Zheng, Neel Guha, Javokhir Arifov, Sarah Zhang, Michal Skreta, Christopher D. Manning, Peter Henderson, Daniel E. Ho

    Abstract: As the legal community increasingly examines the use of large language models (LLMs) for various legal applications, legal AI developers have turned to retrieval-augmented LLMs ("RAG" systems) to improve system performance and robustness. An obstacle to the development of specialized RAG systems is the lack of realistic legal RAG benchmarks which capture the complexity of both legal retrieval and… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: CS&Law 2025. For data, see https://reglab.github.io/legal-rag-benchmarks/

  8. arXiv:2503.03888  [pdf, other

    cs.CL

    AI for Scaling Legal Reform: Mapping and Redacting Racial Covenants in Santa Clara County

    Authors: Faiz Surani, Mirac Suzgun, Vyoma Raman, Christopher D. Manning, Peter Henderson, Daniel E. Ho

    Abstract: Legal reform can be challenging in light of the volume, complexity, and interdependence of laws, codes, and records. One salient example of this challenge is the effort to restrict and remove racially restrictive covenants, clauses in property deeds that historically barred individuals of specific races from purchasing homes. Despite the Supreme Court holding such racial covenants unenforceable in… ▽ More

    Submitted 6 March, 2025; v1 submitted 12 February, 2025; originally announced March 2025.

    Comments: https://reglab.github.io/racialcovenants/

  9. arXiv:2502.02028  [pdf, other

    cs.CL cs.AI

    Fine-tuning Language Models for Recipe Generation: A Comparative Analysis and Benchmark Study

    Authors: Anneketh Vij, Changhao Liu, Rahul Anil Nair, Theodore Eugene Ho, Edward Shi, Ayan Bhowmick

    Abstract: This research presents an exploration and study of the recipe generation task by fine-tuning various very small language models, with a focus on developing robust evaluation metrics and comparing across different language models the open-ended task of recipe generation. This study presents extensive experiments with multiple model architectures, ranging from T5-small (Raffel et al., 2023) and Smol… ▽ More

    Submitted 16 February, 2025; v1 submitted 4 February, 2025; originally announced February 2025.

    Comments: 18 pages, 10 figures,14 tables

  10. arXiv:2502.01926  [pdf, other

    cs.CY cs.CL

    Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs

    Authors: Angelina Wang, Michelle Phan, Daniel E. Ho, Sanmi Koyejo

    Abstract: Algorithmic fairness has conventionally adopted the mathematically convenient perspective of racial color-blindness (i.e., difference unaware treatment). However, we contend that in a range of important settings, group difference awareness matters. For example, differentiating between groups may be necessary in legal contexts (e.g., the U.S. compulsory draft applies to men but not women) and harm… ▽ More

    Submitted 22 May, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: Accepted to ACL 2025 Main Conference

  11. arXiv:2501.04902  [pdf, other

    cs.CY cs.HC

    Artificial Intelligence in Environmental Protection: The Importance of Organizational Context from a Field Study in Wisconsin

    Authors: Nicolas Rothbacher, Kit T. Rodolfa, Mihir Bhaskar, Erin Maneri, Christine Tsang, Daniel E. Ho

    Abstract: Advances in Artificial Intelligence (AI) have generated widespread enthusiasm for the potential of AI to support our understanding and protection of the environment. As such tools move from basic research to more consequential settings, such as regulatory enforcement, the human context of how AI is utilized, interpreted, and deployed becomes increasingly critical. Yet little work has systematicall… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

  12. arXiv:2412.06966  [pdf, other

    cs.LG cs.AI cs.CY

    Machine Unlearning Doesn't Do What You Think: Lessons for Generative AI Policy, Research, and Practice

    Authors: A. Feder Cooper, Christopher A. Choquette-Choo, Miranda Bogen, Matthew Jagielski, Katja Filippova, Ken Ziyu Liu, Alexandra Chouldechova, Jamie Hayes, Yangsibo Huang, Niloofar Mireshghallah, Ilia Shumailov, Eleni Triantafillou, Peter Kairouz, Nicole Mitchell, Percy Liang, Daniel E. Ho, Yejin Choi, Sanmi Koyejo, Fernando Delgado, James Grimmelmann, Vitaly Shmatikov, Christopher De Sa, Solon Barocas, Amy Cyphert, Mark Lemley , et al. (10 additional authors not shown)

    Abstract: We articulate fundamental mismatches between technical methods for machine unlearning in Generative AI, and documented aspirations for broader impact that these methods could have for law and policy. These aspirations are both numerous and varied, motivated by issues that pertain to privacy, copyright, safety, and more. For example, unlearning is often invoked as a solution for removing the effect… ▽ More

    Submitted 9 December, 2024; originally announced December 2024.

    Comments: Presented at the 2nd Workshop on Generative AI and Law at ICML (July 2024)

  13. Gla-AI4BioMed at RRG24: Visual Instruction-tuned Adaptation for Radiology Report Generation

    Authors: Xi Zhang, Zaiqiao Meng, Jake Lever, Edmond S. L. Ho

    Abstract: We introduce a radiology-focused visual language model designed to generate radiology reports from chest X-rays. Building on previous findings that large language models (LLMs) can acquire multimodal capabilities when aligned with pretrained vision encoders, we demonstrate similar potential with chest X-ray images. This integration enhances the ability of model to understand and describe chest X-r… ▽ More

    Submitted 6 December, 2024; originally announced December 2024.

    Comments: Accepted by BioNLP@ACL 2024

    Journal ref: Proceedings of the 23rd Workshop on Biomedical Natural Language Processing (BioNLP 2024) 2024

  14. arXiv:2412.01450  [pdf, other

    cs.AI cs.CV

    Artificial Intelligence for Geometry-Based Feature Extraction, Analysis and Synthesis in Artistic Images: A Survey

    Authors: Mridula Vijendran, Jingjing Deng, Shuang Chen, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Artificial Intelligence significantly enhances the visual art industry by analyzing, identifying and generating digitized artistic images. This review highlights the substantial benefits of integrating geometric data into AI models, addressing challenges such as high inter-class variations, domain gaps, and the separation of style from content by incorporating geometric information. Models not onl… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 56 pages, 8 tables, 1 figure (35 embedded images), Artificial Intelligence Review (AIR) 2024

  15. arXiv:2411.19378  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Libra: Leveraging Temporal Images for Biomedical Radiology Analysis

    Authors: Xi Zhang, Zaiqiao Meng, Jake Lever, Edmond S. L. Ho

    Abstract: Radiology report generation (RRG) requires advanced medical image analysis, effective temporal reasoning, and accurate text generation. While multimodal large language models (MLLMs) align with pre-trained vision encoders to enhance visual-language understanding, most existing methods rely on single-image analysis or rule-based heuristics to process multiple images, failing to fully leverage tempo… ▽ More

    Submitted 16 February, 2025; v1 submitted 28 November, 2024; originally announced November 2024.

    Comments: 30 pages, 5 figures, Adding Appendix

    ACM Class: I.2.10; J.3; I.5.4

  16. arXiv:2411.13774  [pdf, other

    cs.CV

    Segment Any Class (SAC): Multi-Class Few-Shot Semantic Segmentation via Class Region Proposals

    Authors: Hussni Mohd Zakir, Eric Tatt Wei Ho

    Abstract: The Segment-Anything Model (SAM) is a vision foundation model for segmentation with a prompt-driven framework. SAM generates class-agnostic masks based on user-specified instance-referring prompts. However, adapting SAM for automated segmentation -- where manual input is absent -- of specific object classes often requires additional model training. We present Segment Any Class (SAC), a novel, trai… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

    Comments: 8 pages, 2 figures, 3 tables

  17. arXiv:2410.21195  [pdf, other

    cs.CL cs.AI cs.CY

    Belief in the Machine: Investigating Epistemological Blind Spots of Language Models

    Authors: Mirac Suzgun, Tayfun Gur, Federico Bianchi, Daniel E. Ho, Thomas Icard, Dan Jurafsky, James Zou

    Abstract: As language models (LMs) become integral to fields like healthcare, law, and journalism, their ability to differentiate between fact, belief, and knowledge is essential for reliable decision-making. Failure to grasp these distinctions can lead to significant consequences in areas such as medical diagnosis, legal judgments, and dissemination of fake news. Despite this, current literature has largel… ▽ More

    Submitted 28 October, 2024; originally announced October 2024.

    Comments: https://github.com/suzgunmirac/belief-in-the-machine

  18. arXiv:2408.08926  [pdf, other

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

    Authors: Andy K. Zhang, Neil Perry, Riya Dulepet, Joey Ji, Celeste Menders, Justin W. Lin, Eliot Jones, Gashon Hussein, Samantha Liu, Donovan Jasper, Pura Peetathawatchai, Ari Glenn, Vikram Sivashankar, Daniel Zamoshchin, Leo Glikbarg, Derek Askaryar, Mike Yang, Teddy Zhang, Rishi Alluri, Nathan Tran, Rinnara Sangpisit, Polycarpos Yiorkadjis, Kenny Osele, Gautham Raghupathi, Dan Boneh , et al. (2 additional authors not shown)

    Abstract: Language Model (LM) agents for cybersecurity that are capable of autonomously identifying vulnerabilities and executing exploits have potential to cause real-world impact. Policymakers, model providers, and researchers in the AI and cybersecurity communities are interested in quantifying the capabilities of such agents to help mitigate cyberrisk and investigate opportunities for penetration testin… ▽ More

    Submitted 12 April, 2025; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: ICLR 2025 Oral

  19. arXiv:2408.07892  [pdf, other

    cs.CY

    Personhood credentials: Artificial intelligence and the value of privacy-preserving tools to distinguish who is real online

    Authors: Steven Adler, Zoë Hitzig, Shrey Jain, Catherine Brewer, Wayne Chang, Renée DiResta, Eddy Lazzarin, Sean McGregor, Wendy Seltzer, Divya Siddarth, Nouran Soliman, Tobin South, Connor Spelliscy, Manu Sporny, Varya Srivastava, John Bailey, Brian Christian, Andrew Critch, Ronnie Falcon, Heather Flanagan, Kim Hamilton Duffy, Eric Ho, Claire R. Leibowicz, Srikanth Nadhamuni, Alan Z. Rozenshtein , et al. (7 additional authors not shown)

    Abstract: Anonymity is an important principle online. However, malicious actors have long used misleading identities to conduct fraud, spread disinformation, and carry out other deceptive schemes. With the advent of increasingly capable AI, bad actors can amplify the potential scale and effectiveness of their operations, intensifying the challenge of balancing anonymity and trustworthiness online. In this p… ▽ More

    Submitted 17 January, 2025; v1 submitted 14 August, 2024; originally announced August 2024.

    Comments: 63 pages, 7 figures, 5 tables; minor additions to acknowledgments and wording changes for clarity; corrected typo; updated email address reference for author

  20. arXiv:2407.16900  [pdf, other

    cs.LG cs.AI cs.CY

    Regulating AI Adaptation: An Analysis of AI Medical Device Updates

    Authors: Kevin Wu, Eric Wu, Kit Rodolfa, Daniel E. Ho, James Zou

    Abstract: While the pace of development of AI has rapidly progressed in recent years, the implementation of safe and effective regulatory frameworks has lagged behind. In particular, the adaptive nature of AI models presents unique challenges to regulators as updating a model can improve its performance but also introduce safety risks. In the US, the Food and Drug Administration (FDA) has been a forerunner… ▽ More

    Submitted 22 June, 2024; originally announced July 2024.

    Journal ref: CHIL 2024

  21. arXiv:2406.18691  [pdf, other

    cs.CV

    Geometric Features Enhanced Human-Object Interaction Detection

    Authors: Manli Zhu, Edmond S. L. Ho, Shuang Chen, Longzhi Yang, Hubert P. H. Shum

    Abstract: Cameras are essential vision instruments to capture images for pattern detection and measurement. Human-object interaction (HOI) detection is one of the most popular pattern detection approaches for captured human-centric visual scenes. Recently, Transformer-based models have become the dominant approach for HOI detection due to their advanced network architectures and thus promising results. Howe… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: Accepted to IEEE TIM

  22. arXiv:2406.13847  [pdf, other

    cs.CV

    Locating and measuring marine aquaculture production from space: a computer vision approach in the French Mediterranean

    Authors: Sebastian Quaade, Andrea Vallebueno, Olivia D. N. Alcabes, Kit T. Rodolfa, Daniel E. Ho

    Abstract: Aquaculture production -- the cultivation of aquatic plants and animals -- has grown rapidly since the 1990s, but sparse, self-reported and aggregate production data limits the effective understanding and monitoring of the industry's trends and potential risks. Building on a manual survey of aquaculture production from remote sensing imagery, we train a computer vision model to identify marine aqu… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  23. arXiv:2406.12165  [pdf, other

    cs.CL

    Statistical Uncertainty in Word Embeddings: GloVe-V

    Authors: Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho

    Abstract: Static word embeddings are ubiquitous in computational social science applications and contribute to practical decision-making in a variety of fields including law and healthcare. However, assessing the statistical uncertainty in downstream conclusions drawn from word embedding statistics has remained challenging. When using only point estimates for embeddings, researchers have no streamlined way… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  24. arXiv:2405.20362  [pdf, other

    cs.CL cs.CY

    Hallucination-Free? Assessing the Reliability of Leading AI Legal Research Tools

    Authors: Varun Magesh, Faiz Surani, Matthew Dahl, Mirac Suzgun, Christopher D. Manning, Daniel E. Ho

    Abstract: Legal practice has witnessed a sharp rise in products incorporating artificial intelligence (AI). Such tools are designed to assist with a wide range of core legal tasks, from search and summarization of caselaw to document drafting. But the large language models used in these tools are prone to "hallucinate," or make up false information, making their use risky in high-stakes domains. Recently, c… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Our dataset, tool outputs, and labels will be made available upon publication. This version of the manuscript (May 30, 2024) is updated to reflect an evaluation of Westlaw's AI-Assisted Research

  25. arXiv:2404.05490  [pdf, other

    cs.CV

    Two-Person Interaction Augmentation with Skeleton Priors

    Authors: Baiyi Li, Edmond S. L. Ho, Hubert P. H. Shum, He Wang

    Abstract: Close and continuous interaction with rich contacts is a crucial aspect of human activities (e.g. hugging, dancing) and of interest in many domains like activity recognition, motion prediction, character animation, etc. However, acquiring such skeletal motion is challenging. While direct motion capture is expensive and slow, motion editing/generation is also non-trivial, as complex contact pattern… ▽ More

    Submitted 9 April, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

  26. arXiv:2404.02127  [pdf, other

    cs.CL cs.AI cs.LG

    LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain

    Authors: Joel Niklaus, Lucia Zheng, Arya D. McCarthy, Christopher Hahn, Brian M. Rosen, Peter Henderson, Daniel E. Ho, Garrett Honke, Percy Liang, Christopher Manning

    Abstract: Instruction tuning is an important step in making language models useful for direct user interaction. However, the legal domain is underrepresented in typical instruction datasets (e.g., only 10 out of 1600+ tasks in Super-NaturalInstructions). To study whether instruction tuning on legal datasets is necessary for strong legal reasoning, we aggregate 58 annotated legal datasets and write instructi… ▽ More

    Submitted 23 January, 2025; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: Accepted at Findings of NAACL 2025

    MSC Class: 68T50 ACM Class: I.2

  27. arXiv:2403.07918  [pdf, other

    cs.CY cs.AI cs.LG

    On the Societal Impact of Open Foundation Models

    Authors: Sayash Kapoor, Rishi Bommasani, Kevin Klyman, Shayne Longpre, Ashwin Ramaswami, Peter Cihon, Aspen Hopkins, Kevin Bankston, Stella Biderman, Miranda Bogen, Rumman Chowdhury, Alex Engler, Peter Henderson, Yacine Jernite, Seth Lazar, Stefano Maffulli, Alondra Nelson, Joelle Pineau, Aviya Skowron, Dawn Song, Victor Storchan, Daniel Zhang, Daniel E. Ho, Percy Liang, Arvind Narayanan

    Abstract: Foundation models are powerful technologies: how they are released publicly directly shapes their societal impact. In this position paper, we focus on open foundation models, defined here as those with broadly available model weights (e.g. Llama 2, Stable Diffusion XL). We identify five distinctive properties (e.g. greater customizability, poor monitoring) of open foundation models that lead to bo… ▽ More

    Submitted 27 February, 2024; originally announced March 2024.

  28. arXiv:2402.02008  [pdf, other

    cs.CL cs.AI

    How well do LLMs cite relevant medical references? An evaluation framework and analyses

    Authors: Kevin Wu, Eric Wu, Ally Cassasola, Angela Zhang, Kevin Wei, Teresa Nguyen, Sith Riantawan, Patricia Shi Riantawan, Daniel E. Ho, James Zou

    Abstract: Large language models (LLMs) are currently being used to answer medical questions across a variety of clinical domains. Recent top-performing commercial LLMs, in particular, are also capable of citing sources to support their responses. In this paper, we ask: do the sources that LLMs generate actually support the claims that they make? To answer this, we propose three contributions. First, as expe… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

  29. arXiv:2401.01301  [pdf, other

    cs.CL cs.AI cs.CY

    Large Legal Fictions: Profiling Legal Hallucinations in Large Language Models

    Authors: Matthew Dahl, Varun Magesh, Mirac Suzgun, Daniel E. Ho

    Abstract: Do large language models (LLMs) know the law? These models are increasingly being used to augment legal practice, education, and research, yet their revolutionary potential is threatened by the presence of hallucinations -- textual output that is not consistent with legal facts. We present the first systematic evidence of these hallucinations, documenting LLMs' varying performance across jurisdict… ▽ More

    Submitted 21 June, 2024; v1 submitted 2 January, 2024; originally announced January 2024.

    Journal ref: Journal of Legal Analysis 16, no. 1 (2024): 64-93

  30. arXiv:2312.13776  [pdf, other

    cs.CV

    Pose-based Tremor Type and Level Analysis for Parkinson's Disease from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Silvia Del Din, Hubert P. H. Shum

    Abstract: Purpose:Current methods for diagnosis of PD rely on clinical examination. The accuracy of diagnosis ranges between 73% and 84%, and is influenced by the experience of the clinical assessor. Hence, an automatic, effective and interpretable supporting system for PD symptom identification would support clinicians in making more robust PD diagnostic decisions. Methods: We propose to analyze Parkinson'… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

  31. arXiv:2310.18891  [pdf, other

    cs.HC cs.CY cs.RO eess.SY

    Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

    Authors: Luca Crosato, Kai Tian, Hubert P. H Shum, Edmond S. L. Ho, Yafei Wang, Chongfeng Wei

    Abstract: Interaction-aware Autonomous Driving (IAAD) is a rapidly growing field of research that focuses on the development of autonomous vehicles (AVs) that are capable of interacting safely and efficiently with human road users. This is a challenging task, as it requires the autonomous vehicle to be able to understand and predict the behaviour of human road users. In this literature review, the current s… ▽ More

    Submitted 30 October, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

  32. arXiv:2310.01679  [pdf, other

    cs.LG cs.CY stat.ML

    Estimating and Implementing Conventional Fairness Metrics With Probabilistic Protected Features

    Authors: Hadi Elzayn, Emily Black, Patrick Vossler, Nathanael Jo, Jacob Goldin, Daniel E. Ho

    Abstract: The vast majority of techniques to train fair models require access to the protected attribute (e.g., race, gender), either at train time or in production. However, in many important applications this protected attribute is largely unavailable. In this paper, we develop methods for measuring and reducing fairness violations in a setting with limited access to protected attribute labels. Specifical… ▽ More

    Submitted 2 October, 2023; originally announced October 2023.

  33. arXiv:2309.17337  [pdf, other

    cs.LG cs.AI cs.CY

    Toward Operationalizing Pipeline-aware ML Fairness: A Research Agenda for Developing Practical Guidelines and Tools

    Authors: Emily Black, Rakshit Naidu, Rayid Ghani, Kit T. Rodolfa, Daniel E. Ho, Hoda Heidari

    Abstract: While algorithmic fairness is a thriving area of research, in practice, mitigating issues of bias often gets reduced to enforcing an arbitrarily chosen fairness metric, either by enforcing fairness constraints during the optimization step, post-processing model outputs, or by manipulating the training data. Recent work has called on the ML community to take a more holistic approach to tackle fairn… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: EAAMO'23 (Archival)

  34. arXiv:2308.11462  [pdf, other

    cs.CL cs.AI cs.CY

    LegalBench: A Collaboratively Built Benchmark for Measuring Legal Reasoning in Large Language Models

    Authors: Neel Guha, Julian Nyarko, Daniel E. Ho, Christopher Ré, Adam Chilton, Aditya Narayana, Alex Chohlas-Wood, Austin Peters, Brandon Waldon, Daniel N. Rockmore, Diego Zambrano, Dmitry Talisman, Enam Hoque, Faiz Surani, Frank Fagan, Galit Sarfaty, Gregory M. Dickinson, Haggai Porat, Jason Hegland, Jessica Wu, Joe Nudell, Joel Niklaus, John Nay, Jonathan H. Choi, Kevin Tobia , et al. (15 additional authors not shown)

    Abstract: The advent of large language models (LLMs) and their adoption by the legal community has given rise to the question: what types of legal reasoning can LLMs perform? To enable greater study of this question, we present LegalBench: a collaboratively constructed legal reasoning benchmark consisting of 162 tasks covering six different types of legal reasoning. LegalBench was built through an interdisc… ▽ More

    Submitted 20 August, 2023; originally announced August 2023.

    Comments: 143 pages, 79 tables, 4 figures

  35. arXiv:2306.09237  [pdf, other

    cs.CL cs.AI cs.LG

    One Law, Many Languages: Benchmarking Multilingual Legal Reasoning for Judicial Support

    Authors: Ronja Stern, Vishvaksenan Rasiah, Veton Matoshi, Srinanda Brügger Bose, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho, Joel Niklaus

    Abstract: Recent strides in Large Language Models (LLMs) have saturated many Natural Language Processing (NLP) benchmarks, emphasizing the need for more challenging ones to properly assess LLM capabilities. However, domain-specific and multilingual benchmarks are rare because they require in-depth expertise to develop. Still, most public models are trained predominantly on English corpora, while other langu… ▽ More

    Submitted 21 August, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

    MSC Class: 68T50 ACM Class: I.2

  36. arXiv:2306.02069  [pdf, other

    cs.CL cs.AI cs.LG

    MultiLegalPile: A 689GB Multilingual Legal Corpus

    Authors: Joel Niklaus, Veton Matoshi, Matthias Stürmer, Ilias Chalkidis, Daniel E. Ho

    Abstract: Large, high-quality datasets are crucial for training Large Language Models (LLMs). However, so far, there are few datasets available for specialized critical domains such as law and the available ones are often only for the English language. We curate and release MultiLegalPile, a 689GB corpus in 24 languages from 17 jurisdictions. The MultiLegalPile corpus, which includes diverse legal data sour… ▽ More

    Submitted 19 May, 2024; v1 submitted 3 June, 2023; originally announced June 2023.

    Comments: Accepted to ACL 2024

    MSC Class: 68T50 ACM Class: I.2

  37. arXiv:2305.10589  [pdf, other

    cs.CV

    INCLG: Inpainting for Non-Cleft Lip Generation with a Multi-Task Image Processing Network

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: We present a software that predicts non-cleft facial images for patients with cleft lip, thereby facilitating the understanding, awareness and discussion of cleft lip surgeries. To protect patients privacy, we design a software framework using image inpainting, which does not require cleft lip images for training, thereby mitigating the risk of model leakage. We implement a novel multi-task archit… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  38. Potential for allocative harm in an environmental justice data tool

    Authors: Benjamin Q. Huynh, Elizabeth T. Chin, Allison Koenecke, Derek Ouyang, Daniel E. Ho, Mathew V. Kiang, David H. Rehkopf

    Abstract: Neighborhood-level screening algorithms are increasingly being deployed to inform policy decisions. We evaluate one such algorithm, CalEnviroScreen - designed to promote environmental justice and used to guide hundreds of millions of dollars in public funding annually - assessing its potential for allocative harm. We observe the model to be sensitive to subjective model decisions, with 16% of trac… ▽ More

    Submitted 12 April, 2023; v1 submitted 12 April, 2023; originally announced April 2023.

    Journal ref: Nat Mach Intell 6, 187-194 (2024)

  39. arXiv:2304.00858  [pdf, other

    cs.CV

    Focalized Contrastive View-invariant Learning for Self-supervised Skeleton-based Action Recognition

    Authors: Qianhui Men, Edmond S. L. Ho, Hubert P. H. Shum, Howard Leung

    Abstract: Learning view-invariant representation is a key to improving feature discrimination power for skeleton-based action recognition. Existing approaches cannot effectively remove the impact of viewpoint due to the implicit view-dependent representations. In this work, we propose a self-supervised framework called Focalized Contrastive View-invariant Learning (FoCoViL), which significantly suppresses t… ▽ More

    Submitted 3 April, 2023; originally announced April 2023.

  40. arXiv:2303.02580  [pdf, other

    stat.AP cs.CY

    Estimating Racial Disparities When Race is Not Observed

    Authors: Cory McCartan, Robin Fisher, Jacob Goldin, Daniel E. Ho, Kosuke Imai

    Abstract: The estimation of racial disparities in various fields is often hampered by the lack of individual-level racial information. In many cases, the law prohibits the collection of such information to prevent direct racial discrimination. As a result, analysts have frequently adopted Bayesian Improved Surname Geocoding (BISG) and its variants, which combine individual names and addresses with Census da… ▽ More

    Submitted 16 April, 2024; v1 submitted 4 March, 2023; originally announced March 2023.

    Comments: 28 pages, 9 figures, plus references and appendices

  41. arXiv:2212.08568  [pdf, other

    cs.CV cs.LG

    Biomedical image analysis competitions: The state of current participation practice

    Authors: Matthias Eisenmann, Annika Reinke, Vivienn Weru, Minu Dietlinde Tizabi, Fabian Isensee, Tim J. Adler, Patrick Godau, Veronika Cheplygina, Michal Kozubek, Sharib Ali, Anubha Gupta, Jan Kybic, Alison Noble, Carlos Ortiz de Solórzano, Samiksha Pachade, Caroline Petitjean, Daniel Sage, Donglai Wei, Elizabeth Wilden, Deepak Alapatt, Vincent Andrearczyk, Ujjwal Baid, Spyridon Bakas, Niranjan Balu, Sophia Bano , et al. (331 additional authors not shown)

    Abstract: The number of international benchmarking competitions is steadily increasing in various fields of machine learning (ML) research and practice. So far, however, little is known about the common practice as well as bottlenecks faced by the community in tackling the research questions posed. To shed light on the status quo of algorithm development in the specific field of biomedical imaging analysis,… ▽ More

    Submitted 12 September, 2023; v1 submitted 16 December, 2022; originally announced December 2022.

  42. arXiv:2209.06120  [pdf, ps, other

    cs.AI

    LegalBench: Prototyping a Collaborative Benchmark for Legal Reasoning

    Authors: Neel Guha, Daniel E. Ho, Julian Nyarko, Christopher Ré

    Abstract: Can foundation models be guided to execute tasks involving legal reasoning? We believe that building a benchmark to answer this question will require sustained collaborative efforts between the computer science and legal communities. To that end, this short paper serves three purposes. First, we describe how IRAC-a framework legal scholars use to distinguish different types of legal reasoning-can… ▽ More

    Submitted 13 September, 2022; originally announced September 2022.

    Comments: 13 pages, 7 tables

  43. arXiv:2209.02824  [pdf, other

    cs.CV cs.LG eess.IV

    CP-AGCN: Pytorch-based Attention Informed Graph Convolutional Network for Identifying Infants at Risk of Cerebral Palsy

    Authors: Haozheng Zhang, Edmond S. L. Ho, Hubert P. H. Shum

    Abstract: Early prediction is clinically considered one of the essential parts of cerebral palsy (CP) treatment. We propose to implement a low-cost and interpretable classification system for supporting CP prediction based on General Movement Assessment (GMA). We design a Pytorch-based attention-informed graph convolutional network to early identify infants at risk of CP from skeletal data extracted from RG… ▽ More

    Submitted 6 September, 2022; originally announced September 2022.

  44. arXiv:2208.11747  [pdf, other

    cs.LG

    Entropy Regularization for Population Estimation

    Authors: Ben Chugg, Peter Henderson, Jacob Goldin, Daniel E. Ho

    Abstract: Entropy regularization is known to improve exploration in sequential decision-making problems. We show that this same mechanism can also lead to nearly unbiased and lower-variance estimates of the mean reward in the optimize-and-estimate structured bandit setting. Mean reward estimation (i.e., population estimation) tasks have recently been shown to be essential for public policy settings where le… ▽ More

    Submitted 24 August, 2022; originally announced August 2022.

  45. Detecting Environmental Violations with Satellite Imagery in Near Real Time: Land Application under the Clean Water Act

    Authors: Ben Chugg, Nicolas Rothbacher, Alex Feng, Xiaoqi Long, Daniel E. Ho

    Abstract: This paper introduces a new, highly consequential setting for the use of computer vision for environmental sustainability. Concentrated Animal Feeding Operations (CAFOs) (aka intensive livestock farms or "factory farms") produce significant manure and pollution. Dumping manure in the winter months poses significant environmental risks and violates environmental law in many states. Yet the federal… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Accepted to CIKM '22

  46. arXiv:2208.08848  [pdf, other

    cs.CV

    A Two-stream Convolutional Network for Musculoskeletal and Neurological Disorders Prediction

    Authors: Manli Zhu, Qianhui Men, Edmond S. L. Ho, Howard Leung, Hubert P. H. Shum

    Abstract: Musculoskeletal and neurological disorders are the most common causes of walking problems among older people, and they often lead to diminished quality of life. Analyzing walking motion data manually requires trained professionals and the evaluations may not always be objective. To facilitate early diagnosis, recent deep learning-based methods have shown promising results for automated analysis, w… ▽ More

    Submitted 18 August, 2022; originally announced August 2022.

    Comments: Journal of Medical Systems

  47. arXiv:2208.01149  [pdf, other

    cs.CV

    A Feasibility Study on Image Inpainting for Non-cleft Lip Generation from Patients with Cleft Lip

    Authors: Shuang Chen, Amir Atapour-Abarghouei, Jane Kerby, Edmond S. L. Ho, David C. G. Sainsbury, Sophie Butterworth, Hubert P. H. Shum

    Abstract: A Cleft lip is a congenital abnormality requiring surgical repair by a specialist. The surgeon must have extensive experience and theoretical knowledge to perform surgery, and Artificial Intelligence (AI) method has been proposed to guide surgeons in improving surgical outcomes. If AI can be used to predict what a repaired cleft lip would look like, surgeons could use it as an adjunct to adjust th… ▽ More

    Submitted 1 August, 2022; originally announced August 2022.

    Comments: 4 pages, 2 figures, BHI 2022

  48. arXiv:2208.00774  [pdf, other

    cs.GR cs.CV

    Interaction Mix and Match: Synthesizing Close Interaction using Conditional Hierarchical GAN with Multi-Hot Class Embedding

    Authors: Aman Goel, Qianhui Men, Edmond S. L. Ho

    Abstract: Synthesizing multi-character interactions is a challenging task due to the complex and varied interactions between the characters. In particular, precise spatiotemporal alignment between characters is required in generating close interactions such as dancing and fighting. Existing work in generating multi-character interactions focuses on generating a single type of reactive motion for a given seq… ▽ More

    Submitted 4 August, 2022; v1 submitted 23 July, 2022; originally announced August 2022.

    Comments: Accepted to SCA 2022 (will be published in CGF)

  49. arXiv:2207.06828  [pdf, other

    cs.CV cs.LG

    Pose-based Tremor Classification for Parkinson's Disease Diagnosis from Video

    Authors: Haozheng Zhang, Edmond S. L. Ho, Xiatian Zhang, Hubert P. H. Shum

    Abstract: Parkinson's disease (PD) is a progressive neurodegenerative disorder that results in a variety of motor dysfunction symptoms, including tremors, bradykinesia, rigidity and postural instability. The diagnosis of PD mainly relies on clinical experience rather than a definite medical test, and the diagnostic accuracy is only about 73-84% since it is challenged by the subjective opinions or experience… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: MICCAI 2022

  50. Interaction-aware Decision-making for Automated Vehicles using Social Value Orientation

    Authors: Luca Crosato, Hubert P. H. Shum, Edmond S. L. Ho, Chongfeng Wei

    Abstract: Motion control algorithms in the presence of pedestrians are critical for the development of safe and reliable Autonomous Vehicles (AVs). Traditional motion control algorithms rely on manually designed decision-making policies which neglect the mutual interactions between AVs and pedestrians. On the other hand, recent advances in Deep Reinforcement Learning allow for the automatic learning of poli… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.