Skip to main content

Showing 1–2 of 2 results for author: Ensign, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.14008  [pdf, other

    cs.LG

    Investigating the Indirect Object Identification circuit in Mamba

    Authors: Danielle Ensign, AdriĆ  Garriga-Alonso

    Abstract: How well will current interpretability techniques generalize to future models? A relevant case study is Mamba, a recent recurrent architecture with scaling comparable to Transformers. We adapt pre-Mamba techniques to Mamba and partially reverse-engineer the circuit responsible for the Indirect Object Identification (IOI) task. Our techniques provide evidence that 1) Layer 39 is a key bottleneck, 2… ▽ More

    Submitted 21 July, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:1706.09847  [pdf, other

    cs.CY stat.ML

    Runaway Feedback Loops in Predictive Policing

    Authors: Danielle Ensign, Sorelle A. Friedler, Scott Neville, Carlos Scheidegger, Suresh Venkatasubramanian

    Abstract: Predictive policing systems are increasingly used to determine how to allocate police across a city in order to best prevent crime. Discovered crime data (e.g., arrest counts) are used to help update the model, and the process is repeated. Such systems have been empirically shown to be susceptible to runaway feedback loops, where police are repeatedly sent back to the same neighborhoods regardless… ▽ More

    Submitted 21 December, 2017; v1 submitted 29 June, 2017; originally announced June 2017.

    Comments: Extended version accepted to the 1st Conference on Fairness, Accountability and Transparency, 2018. Adds further treatment of reported as well as discovered incidents