Showing 1–1 of 1 results for author: Hauon, E
-
Grad-SAM: Explaining Transformers via Gradient Self-Attention Maps
Authors:
Oren Barkan,
Edan Hauon,
Avi Caciularu,
Ori Katz,
Itzik Malkiel,
Omri Armstrong,
Noam Koenigstein
Abstract:
Transformer-based language models significantly advanced the state-of-the-art in many linguistic tasks. As this revolution continues, the ability to explain model predictions has become a major area of interest for the NLP community. In this work, we present Gradient Self-Attention Maps (Grad-SAM) - a novel gradient-based method that analyzes self-attention units and identifies the input elements…
▽ More
Transformer-based language models significantly advanced the state-of-the-art in many linguistic tasks. As this revolution continues, the ability to explain model predictions has become a major area of interest for the NLP community. In this work, we present Gradient Self-Attention Maps (Grad-SAM) - a novel gradient-based method that analyzes self-attention units and identifies the input elements that explain the model's prediction the best. Extensive evaluations on various benchmarks show that Grad-SAM obtains significant improvements over state-of-the-art alternatives.
△ Less
Submitted 23 April, 2022;
originally announced April 2022.