Showing 1–1 of 1 results for author: Hauon, E

Search v0.5.6 released 2020-02-24

arXiv:2204.11073 [pdf, ps, other]

cs.LG

doi 10.1145/3459637.3482126

Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps

Authors: Oren Barkan, Edan Hauon, Avi Caciularu, Ori Katz, Itzik Malkiel, Omri Armstrong, Noam Koenigstein

Abstract: Transformer-based language models significantly advanced the state-of-the-art in many linguistic tasks. As this revolution continues, the ability to explain model predictions has become a major area of interest for the NLP community. In this work, we present Gradient Self-Attention Maps (Grad-SAM) - a novel gradient-based method that analyzes self-attention units and identifies the input elements… ▽ More Transformer-based language models significantly advanced the state-of-the-art in many linguistic tasks. As this revolution continues, the ability to explain model predictions has become a major area of interest for the NLP community. In this work, we present Gradient Self-Attention Maps (Grad-SAM) - a novel gradient-based method that analyzes self-attention units and identifies the input elements that explain the model's prediction the best. Extensive evaluations on various benchmarks show that Grad-SAM obtains significant improvements over state-of-the-art alternatives. △ Less

Submitted 23 April, 2022; originally announced April 2022.

Search v0.5.6 released 2020-02-24