Skip to main content

Showing 1–7 of 7 results for author: Tosch, E

.
  1. arXiv:2212.06719  [pdf, other

    cs.CY cs.AI cs.PL

    Exploring Consequences of Privacy Policies with Narrative Generation via Answer Set Programming

    Authors: Chinmaya Dabral, Emma Tosch, Chris Martens

    Abstract: Informed consent has become increasingly salient for data privacy and its regulation. Entities from governments to for-profit companies have addressed concerns about data privacy with policies that enumerate the conditions for personal data storage and transfer. However, increased enumeration of and transparency in data privacy policies has not improved end-users' comprehension of how their data m… ▽ More

    Submitted 13 December, 2022; originally announced December 2022.

    Comments: 6 pages; will be presented as a long talk at ProLaLa 2023 (colocated with POPL 2023)

  2. PlanAlyzer: Assessing Threats to the Validity of Online Experiments

    Authors: Emma Tosch, Eytan Bakshy, Emery D. Berger, David D. Jensen, J. Eliot B. Moss

    Abstract: Online experiments are ubiquitous. As the scale of experiments has grown, so has the complexity of their design and implementation. In response, firms have developed software frameworks for designing and deploying online experiments. Ensuring that experiments in these frameworks are correctly designed and that their results are trustworthy---referred to as *internal validity*---can be difficult. C… ▽ More

    Submitted 30 September, 2019; originally announced September 2019.

    Comments: 30 pages, hella long

    Journal ref: OOPSLA 2019

  3. arXiv:1905.02825  [pdf, other

    cs.LG stat.ML

    Toybox: A Suite of Environments for Experimental Evaluation of Deep Reinforcement Learning

    Authors: Emma Tosch, Kaleigh Clary, John Foley, David Jensen

    Abstract: Evaluation of deep reinforcement learning (RL) is inherently challenging. In particular, learned policies are largely opaque, and hypotheses about the behavior of deep RL agents are difficult to test in black-box environments. Considerable effort has gone into addressing opacity, but almost no effort has been devoted to producing high quality environments for experimental evaluation of agent behav… ▽ More

    Submitted 7 May, 2019; originally announced May 2019.

  4. arXiv:1904.06312  [pdf, other

    cs.LG cs.AI stat.ML

    Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments

    Authors: Kaleigh Clary, Emma Tosch, John Foley, David Jensen

    Abstract: Reproducibility in reinforcement learning is challenging: uncontrolled stochasticity from many sources, such as the learning algorithm, the learned policy, and the environment itself have led researchers to report the performance of learned agents using aggregate metrics of performance over multiple random seeds for a single environment. Unfortunately, there are still pernicious sources of variabi… ▽ More

    Submitted 12 April, 2019; originally announced April 2019.

    Comments: NeurIPS 2018 Critiquing and Correcting Trends Workshop

  5. arXiv:1812.02868  [pdf, other

    cs.LG cs.AI stat.ML

    Measuring and Characterizing Generalization in Deep Reinforcement Learning

    Authors: Sam Witty, Jun Ki Lee, Emma Tosch, Akanksha Atrey, Michael Littman, David Jensen

    Abstract: Deep reinforcement-learning methods have achieved remarkable performance on challenging control tasks. Observations of the resulting behavior give the impression that the agent has constructed a generalized representation that supports insightful action decisions. We re-examine what is meant by generalization in RL, and propose several definitions based on an agent's performance in on-policy, off-… ▽ More

    Submitted 11 December, 2018; v1 submitted 6 December, 2018; originally announced December 2018.

  6. arXiv:1812.02850  [pdf, other

    cs.AI

    ToyBox: Better Atari Environments for Testing Reinforcement Learning Agents

    Authors: John Foley, Emma Tosch, Kaleigh Clary, David Jensen

    Abstract: It is a widely accepted principle that software without tests has bugs. Testing reinforcement learning agents is especially difficult because of the stochastic nature of both agents and environments, the complexity of state-of-the-art models, and the sequential nature of their predictions. Recently, the Arcade Learning Environment (ALE) has become one of the most widely used benchmark suites for d… ▽ More

    Submitted 25 January, 2019; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: NeurIPS Systems for ML Workshop

  7. arXiv:1406.5572  [pdf, other

    cs.PL cs.HC

    SurveyMan: Programming and Automatically Debugging Surveys

    Authors: Emma Tosch, Emery D. Berger

    Abstract: Surveys can be viewed as programs, complete with logic, control flow, and bugs. Word choice or the order in which questions are asked can unintentionally bias responses. Vague, confusing, or intrusive questions can cause respondents to abandon a survey. Surveys can also have runtime errors: inattentive respondents can taint results. This effect is especially problematic when deploying surveys in u… ▽ More

    Submitted 20 June, 2014; originally announced June 2014.

    Comments: Submitted version; accepted to OOPSLA 2014

    ACM Class: D.3.2; J.4; J.5