Skip to main content

Showing 1–4 of 4 results for author: Brower-Sinning, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.08575  [pdf, ps, other

    cs.SE cs.AI cs.LG

    Using Quality Attribute Scenarios for ML Model Test Case Generation

    Authors: Rachel Brower-Sinning, Grace A. Lewis, Sebastían Echeverría, Ipek Ozkaya

    Abstract: Testing of machine learning (ML) models is a known challenge identified by researchers and practitioners alike. Unfortunately, current practice for ML model testing prioritizes testing for model performance, while often neglecting the requirements and constraints of the ML-enabled system that integrates the model. This limited view of testing leads to failures during integration, deployment, and o… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: Paper accepted and presented in SAML 2024, the 3rd International Workshop on Software Architecture and Machine Learning, co-located with ICSA 2024, the 21st IEEE International Conference on Software Architecture

  2. arXiv:2310.09668  [pdf, other

    cs.CL cs.SE

    Beyond Testers' Biases: Guiding Model Testing with Knowledge Bases using LLMs

    Authors: Chenyang Yang, Rishabh Rustogi, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu

    Abstract: Current model testing work has mostly focused on creating test cases. Identifying what to test is a step that is largely ignored and poorly supported. We propose Weaver, an interactive tool that supports requirements elicitation for guiding model testing. Weaver uses large language models to generate knowledge bases and recommends concepts from them interactively, allowing testers to elicit requir… ▽ More

    Submitted 14 October, 2023; originally announced October 2023.

  3. arXiv:2211.06409  [pdf, other

    cs.AI cs.SE

    Capabilities for Better ML Engineering

    Authors: Chenyang Yang, Rachel Brower-Sinning, Grace A. Lewis, Christian Kästner, Tongshuang Wu

    Abstract: In spite of machine learning's rapid growth, its engineering support is scattered in many forms, and tends to favor certain engineering stages, stakeholders, and evaluation preferences. We envision a capability-based framework, which uses fine-grained specifications for ML model behaviors to unite existing efforts towards better ML engineering. We use concrete scenarios (model design, debugging, a… ▽ More

    Submitted 10 February, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

  4. arXiv:2209.03345  [pdf, other

    cs.SE

    Data Leakage in Notebooks: Static Detection and Better Processes

    Authors: Chenyang Yang, Rachel A Brower-Sinning, Grace A. Lewis, Christian Kästner

    Abstract: Data science pipelines to train and evaluate models with machine learning may contain bugs just like any other code. Leakage between training and test data can lead to overestimating the model's accuracy during offline evaluations, possibly leading to deployment of low-quality models in production. Such leakage can happen easily by mistake or by following poor practices, but may be tedious and cha… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.