Skip to main content

Showing 1–5 of 5 results for author: Kelleher, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.00010  [pdf

    cs.CL cs.AI

    Jailbreak Detection in Clinical Training LLMs Using Feature-Based Predictive Models

    Authors: Tri Nguyen, Lohith Srikanth Pentapalli, Magnus Sieverding, Laurah Turner, Seth Overla, Weibing Zheng, Chris Zhou, David Furniss, Danielle Weber, Michael Gharib, Matt Kelleher, Michael Shukis, Cameron Pawlik, Kelly Cohen

    Abstract: Jailbreaking in Large Language Models (LLMs) threatens their safe use in sensitive domains like education by allowing users to bypass ethical safeguards. This study focuses on detecting jailbreaks in 2-Sigma, a clinical education platform that simulates patient interactions using LLMs. We annotated over 2,300 prompts across 158 conversations using four linguistic variables shown to correlate stron… ▽ More

    Submitted 21 April, 2025; originally announced May 2025.

  2. arXiv:2504.02111  [pdf, other

    cs.AI cs.CL cs.LG

    Exploring LLM Reasoning Through Controlled Prompt Variations

    Authors: Giannis Chatziveroglou, Richard Yun, Maura Kelleher

    Abstract: This study investigates the reasoning robustness of large language models (LLMs) on mathematical problem-solving tasks under systematically introduced input perturbations. Using the GSM8K dataset as a controlled testbed, we evaluate how well state-of-the-art models maintain logical consistency and correctness when confronted with four categories of prompt perturbations: irrelevant context, patholo… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  3. What Do Children and Parents Want and Perceive in Conversational Agents? Towards Transparent, Trustworthy, Democratized Agents

    Authors: Jessica Van Brummelen, Maura Kelleher, Mingyan Claire Tian, Nghi Hoang Nguyen

    Abstract: Historically, researchers have focused on analyzing WEIRD, adult perspectives on technology. This means we may not have technology developed appropriately for children and those from non-WEIRD countries. In this paper, we analyze children and parents from various countries' perspectives on an emerging technology: conversational agents. We aim to better understand participants' trust of agents, par… ▽ More

    Submitted 20 January, 2023; v1 submitted 16 September, 2022; originally announced September 2022.

    Comments: 18 pages, 9 figures, submitted to IDC 2023, for associated appendix: https://gist.github.com/jessvb/fa1d4c75910106d730d194ffd4d725d3

  4. arXiv:2209.05063  [pdf, other

    cs.HC

    Learning Affects Trust: Design Recommendations and Concepts for Teaching Children -- and Nearly Anyone -- about Conversational Agents

    Authors: Jessica Van Brummelen, Mingyan Claire Tian, Maura Kelleher, Nghi Hoang Nguyen

    Abstract: Research has shown that human-agent relationships form in similar ways to human-human relationships. Since children do not have the same critical analysis skills as adults (and may over-trust technology, for example), this relationship-formation is concerning. Nonetheless, little research investigates children's perceptions of conversational agents in-depth, and even less investigates how educatio… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 9 pages, 11 figures, submitted to EAAI at AAAI 2023, for associated appendix: https://gist.github.com/jessvb/e35bc0daf859c30f73008a1ad1b37824

  5. arXiv:1812.11901  [pdf, other

    cs.CV

    Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions

    Authors: Caleb Tung, Matthew R. Kelleher, Ryan J. Schlueter, Binhan Xu, Yung-Hsiang Lu, George K. Thiruvathukal, Yen-Kuang Chen, Yang Lu

    Abstract: Computer vision relies on labeled datasets for training and evaluation in detecting and recognizing objects. The popular computer vision program, YOLO ("You Only Look Once"), has been shown to accurately detect objects in many major image datasets. However, the images found in those datasets, are independent of one another and cannot be used to test YOLO's consistency at detecting the same object… ▽ More

    Submitted 31 December, 2018; originally announced December 2018.

    Comments: Submitted to MIPR 2019 (Accepted)