Skip to main content

Showing 1–4 of 4 results for author: Bertucci, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2409.09011  [pdf, other

    cs.HC cs.AI cs.LG

    VAE Explainer: Supplement Learning Variational Autoencoders with Interactive Visualization

    Authors: Donald Bertucci, Alex Endert

    Abstract: Variational Autoencoders are widespread in Machine Learning, but are typically explained with dense math notation or static code examples. This paper presents VAE Explainer, an interactive Variational Autoencoder running in the browser to supplement existing static documentation (e.g., Keras Code Examples). VAE Explainer adds interactions to the VAE summary with interactive model inputs, latent sp… ▽ More

    Submitted 13 September, 2024; originally announced September 2024.

    Comments: 6 pages, 4 figures

  2. Zeno: An Interactive Framework for Behavioral Evaluation of Machine Learning

    Authors: Ángel Alexander Cabrera, Erica Fu, Donald Bertucci, Kenneth Holstein, Ameet Talwalkar, Jason I. Hong, Adam Perer

    Abstract: Machine learning models with high accuracy on test data can still produce systematic failures, such as harmful biases and safety issues, when deployed in the real world. To detect and mitigate such failures, practitioners run behavioral evaluation of their models, checking model outputs for specific types of inputs. Behavioral evaluation is important but challenging, requiring that practitioners d… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

  3. arXiv:2206.02039  [pdf, other

    cs.AI cs.LG

    Beyond Value: CHECKLIST for Testing Inferences in Planning-Based RL

    Authors: Kin-Ho Lam, Delyar Tabatabai, Jed Irvine, Donald Bertucci, Anita Ruangrotsakun, Minsuk Kahng, Alan Fern

    Abstract: Reinforcement learning (RL) agents are commonly evaluated via their expected value over a distribution of test scenarios. Unfortunately, this evaluation approach provides limited evidence for post-deployment generalization beyond the test distribution. In this paper, we address this limitation by extending the recent CheckList testing methodology from natural language processing to planning-based… ▽ More

    Submitted 7 June, 2022; v1 submitted 4 June, 2022; originally announced June 2022.

    Comments: This work will appear in the Proceedings of the 32nd International Conference on Automated Planning and Scheduling (ICAPS2022) https://icaps22.icaps-conference.org/papers

  4. arXiv:2205.06935  [pdf, other

    cs.HC cs.AI cs.LG

    DendroMap: Visual Exploration of Large-Scale Image Datasets for Machine Learning with Treemaps

    Authors: Donald Bertucci, Md Montaser Hamid, Yashwanthi Anand, Anita Ruangrotsakun, Delyar Tabatabai, Melissa Perez, Minsuk Kahng

    Abstract: In this paper, we present DendroMap, a novel approach to interactively exploring large-scale image datasets for machine learning (ML). ML practitioners often explore image datasets by generating a grid of images or projecting high-dimensional representations of images into 2-D using dimensionality reduction techniques (e.g., t-SNE). However, neither approach effectively scales to large datasets be… ▽ More

    Submitted 15 August, 2022; v1 submitted 13 May, 2022; originally announced May 2022.

    Comments: This paper has been accepted for the IEEE VIS 2022 Conference and will be published in the IEEE Transactions on Visualization and Computer Graphics