Showing 1–13 of 13 results for author: Siu, H C

  1. arXiv:2503.19607  [pdf, other]

    cs.HC cs.AI

    Enabling Rapid Shared Human-AI Mental Model Alignment via the After-Action Review

    Authors: Edward Gu, Ho Chit Siu, Melanie Platt, Isabelle Hurley, Jaime Peña, Rohan Paleja

    Abstract: In this work, we present two novel contributions toward improving research in human-machine teaming (HMT): 1) a Minecraft testbed to accelerate testing and deployment of collaborative AI agents and 2) a tool to allow users to revisit and analyze behaviors within an HMT episode to facilitate shared mental model development. Our browser-based Minecraft testbed allows for rapid testing of collaborati…

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: Accepted to the Cooperative Multi-Agent Systems Decision-making and Learning: Human-Multi-Agent Cognitive Fusion Workshop at AAAI 2025

  2. arXiv:2503.16507  [pdf, other]

    cs.HC cs.AI

    Fewer Than 1% of Explainable AI Papers Validate Explainability with Humans

    Authors: Ashley Suh, Isabelle Hurley, Nora Smith, Ho Chit Siu

    Abstract: This late-breaking work presents a large-scale analysis of explainable AI (XAI) literature to evaluate claims of human explainability. We collaborated with a professional librarian to identify 18,254 papers containing keywords related to explainability and interpretability. Of these, we find that only 253 papers included terms suggesting human involvement in evaluating an XAI technique, and just 1…

    Submitted 13 March, 2025; originally announced March 2025.

    Comments: Extended Abstracts of the CHI Conference on Human Factors in Computing Systems (CHI EA '25)

  3. arXiv:2503.15516  [pdf, other]

    cs.HC cs.AI

    In Pursuit of Predictive Models of Human Preferences Toward AI Teammates

    Authors: Ho Chit Siu, Jaime D. Peña, Yutai Zhou, Ross E. Allen

    Abstract: We seek measurable properties of AI agents that make them better or worse teammates from the subjective perspective of human collaborators. Our experiments use the cooperative card game Hanabi -- a common benchmark for AI-teaming research. We first evaluate AI agents on a set of objective metrics based on task performance, information theory, and game theory, which are measurable without human int…

    Submitted 31 January, 2025; originally announced March 2025.

  4. arXiv:2411.17861  [pdf, other]

    cs.LG cs.AI

    Accelerating Proximal Policy Optimization Learning Using Task Prediction for Solving Environments with Delayed Rewards

    Authors: Ahmad Ahmad, Mehdi Kermanshah, Kevin Leahy, Zachary Serlin, Ho Chit Siu, Makai Mann, Cristian-Ioan Vasile, Roberto Tron, Calin Belta

    Abstract: In this paper, we tackle the challenging problem of delayed rewards in reinforcement learning (RL). While Proximal Policy Optimization (PPO) has emerged as a leading Policy Gradient method, its performance can degrade under delayed rewards. We introduce two key enhancements to PPO: a hybrid policy architecture that combines an offline policy (trained on expert demonstrations) with an online PPO po…

    Submitted 4 December, 2024; v1 submitted 26 November, 2024; originally announced November 2024.

  5. arXiv:2409.18074  [pdf, other]

    math.NT math.AG math.DS

    On the number of quadratic polynomials with a given portrait

    Authors: Ho Chung Siu

    Abstract: Let $F$ be a number field. Given a quadratic polynomial $f_c(z) = z^2 + c \in F[z]$, we can construct a directed graph $Preper(f_c, F)$ (also called a portrait), whose vertices are $F$-rational preperiodic points for $f_c$, with an edge $α\to β$ if and only if $f_c(α) = β$. Poonen and Faber classified the portraits that occur for infinitely many $c$'s. Given a portrait $P$, we prove an asymptoti…

    Submitted 4 October, 2024; v1 submitted 26 September, 2024; originally announced September 2024.

    Comments: Comments welcome; v2: added references

  6. arXiv:2407.02632  [pdf, other]

    cs.HC cs.FL

    STL: Still Tricky Logic (for System Validation, Even When Showing Your Work)

    Authors: Isabelle Hurley, Rohan Paleja, Ashley Suh, Jaime D. Peña, Ho Chit Siu

    Abstract: As learned control policies become increasingly common in autonomous systems, there is increasing need to ensure that they are interpretable and can be checked by human stakeholders. Formal specifications have been proposed as ways to produce human-interpretable policies for autonomous systems that can still be learned from examples. Previous work showed that despite claims of interpretability, hu…

    Submitted 2 July, 2024; originally announced July 2024.

  7. arXiv:2406.02018  [pdf, other]

    cs.CL cs.AI cs.HC

    Why Would You Suggest That? Human Trust in Language Model Responses

    Authors: Manasi Sharma, Ho Chit Siu, Rohan Paleja, Jaime D. Peña

    Abstract: The emergence of Large Language Models (LLMs) has revealed a growing need for human-AI collaboration, especially in creative decision-making scenarios where trust and reliance are paramount. Through human studies and model evaluations on the open-ended News Headline Generation task from the LaMP benchmark, we analyze how the framing and presence of explanations affect user trust and model performa…

    Submitted 4 October, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

    Journal ref: ICML Humans, Algorithmic Decision-Making and Society: Modeling Interactions and Impact Workshop 2024

  8. arXiv:2403.14344  [pdf, other]

    cs.RO cs.HC

    Tell Me What You Want (What You Really, Really Want): Addressing the Expectation Gap for Goal Conveyance from Humans to Robots

    Authors: Kevin Leahy, Ho Chit Siu

    Abstract: Conveying human goals to autonomous systems (AS) occurs both when the system is being designed and when it is being operated. The design-step conveyance is typically mediated by robotics and AI engineers, who must appropriately capture end-user requirements and concepts of operations, while the operation-step conveyance is mediated by the design, interfaces, and behavior of the AI. However, commun…

    Submitted 21 March, 2024; originally announced March 2024.

    Comments: Presented at the End-User Development for Human-Robot Interaction (EUD4HRI) workshop at HRI 2024

  9. arXiv:2305.17258  [pdf, other]

    cs.AI cs.HC cs.RO

    STL: Surprisingly Tricky Logic (for System Validation)

    Authors: Ho Chit Siu, Kevin Leahy, Makai Mann

    Abstract: Much of the recent work developing formal methods techniques to specify or learn the behavior of autonomous systems is predicated on a belief that formal specifications are interpretable and useful for humans when checking systems. Though frequently asserted, this assumption is rarely tested. We performed a human experiment (N = 62) with a mix of people who were and were not familiar with formal m…

    Submitted 26 May, 2023; originally announced May 2023.

  10. arXiv:2108.11503  [pdf, other]

    cs.AI cs.CV cs.DC cs.PF

    Maneuver Identification Challenge

    Authors: Kaira Samuel, Vijay Gadepally, David Jacobs, Michael Jones, Kyle McAlpin, Kyle Palko, Ben Paulk, Sid Samsi, Ho Chit Siu, Charles Yee, Jeremy Kepner

    Abstract: AI algorithms that identify maneuvers from trajectory data could play an important role in improving flight safety and pilot training. AI challenges allow diverse teams to work together to solve hard problems and are an effective tool for developing AI solutions. AI challenges are also a key driver of AI computational requirements. The Maneuver Identification Challenge hosted at maneuver-id.mit.ed…

    Submitted 25 August, 2021; originally announced August 2021.

    Comments: 7 pages, 8 figures, 1 table, 33 references, accepted to IEEE HPEC 2021

  11. arXiv:2107.07630  [pdf, other]

    cs.AI cs.HC

    Evaluation of Human-AI Teams for Learned and Rule-Based Agents in Hanabi

    Authors: Ho Chit Siu, Jaime D. Pena, Edenna Chen, Yutai Zhou, Victor J. Lopez, Kyle Palko, Kimberlee C. Chang, Ross E. Allen

    Abstract: Deep reinforcement learning has generated superhuman AI in competitive games such as Go and StarCraft. Can similar learning techniques create a superior AI teammate for human-machine collaborative games? Will humans prefer AI teammates that improve objective team performance or those that improve subjective metrics of trust? In this study, we perform a single-blind evaluation of teams of humans an…

    Submitted 21 October, 2021; v1 submitted 15 July, 2021; originally announced July 2021.

    Comments: Accepted for publication at NeurIPS 2021

  12. arXiv:1907.09789  [pdf, other]

    astro-ph.IM cs.AI cs.NE

    Genetic Algorithms for Starshade Retargeting in Space-Based Telescopes

    Authors: Ho Chit Siu, Victor Pankratius

    Abstract: Future space-based telescopes will leverage starshades as components that can be independently positioned. Starshades will adjust the light coming in from exoplanet host stars and enhance the direct imaging of exoplanets and other phenomena. In this context, scheduling of space-based telescope observations is subject to a large number of dynamic constraints, including target observability, fuel, a…

    Submitted 23 July, 2019; originally announced July 2019.

  13. arXiv:1705.02080  [pdf, ps, other]

    math.NT

    Restriction of Hecke eigenforms to horocycles

    Authors: Ho Chung Siu, Kannan Soundararajan

    Abstract: We prove a sharp upper bound on the $L^2$-norm of Hecke eigenforms restricted to a horocycle, as the weight tends to infinity.

    Submitted 5 May, 2017; originally announced May 2017.