Showing 1–3 of 3 results for author: Do, H J

Searching in archive cs.
  1. arXiv:2507.02186  [pdf, ps, other]

    cs.HC

    EvalAssist: A Human-Centered Tool for LLM-as-a-Judge

    Authors: Zahra Ashktorab, Elizabeth M. Daly, Erik Miehling, Werner Geyer, Martin Santillan Cooper, Tejaswini Pedapati, Michael Desmond, Qian Pan, Hyo Jin Do

    Abstract: With the broad availability of large language models and their ability to generate vast outputs using varied prompts and configurations, determining the best output for a given task requires an intensive evaluation process, one where machine learning practitioners must decide how to assess the outputs and then carefully carry out the evaluation. This process is both time-consuming and costly. As p…

    Submitted 2 July, 2025; originally announced July 2025.

  2. arXiv:2405.20434  [pdf, other]

    cs.HC cs.AI

    Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

    Authors: Hyo Jin Do, Rachel Ostrand, Justin D. Weisz, Casey Dugan, Prasanna Sattigeri, Dennis Wei, Keerthiram Murugesan, Werner Geyer

    Abstract: While humans increasingly rely on large language models (LLMs), they are susceptible to generating inaccurate or false information, also known as "hallucinations". Technical advancements have been made in algorithms that detect hallucinated content by assessing the factuality of the model's responses and attributing sections of those responses to specific source documents. However, there is limite…

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Submitted to the Trust and Reliance in Evolving Human-AI Workflows (TREW) Workshop at CHI 2024

  3. arXiv:2403.14459  [pdf, other]

    cs.CL cs.AI

    Multi-Level Explanations for Generative Language Models

    Authors: Lucas Monteiro Paes, Dennis Wei, Hyo Jin Do, Hendrik Strobelt, Ronny Luss, Amit Dhurandhar, Manish Nagireddy, Karthikeyan Natesan Ramamurthy, Prasanna Sattigeri, Werner Geyer, Soumya Ghosh

    Abstract: Perturbation-based explanation methods such as LIME and SHAP are commonly applied to text classification. This work focuses on their extension to generative language models. To address the challenges of text as output and long text inputs, we propose a general framework called MExGen that can be instantiated with different attribution algorithms. To handle text output, we introduce the notion of s…

    Submitted 21 March, 2024; originally announced March 2024.