Showing 1–1 of 1 results for author: Weill, Y
-
MetricBERT: Text Representation Learning via Self-Supervised Triplet Training
Authors:
Itzik Malkiel,
Dvir Ginzburg,
Oren Barkan,
Avi Caciularu,
Yoni Weill,
Noam Koenigstein
Abstract:
We present MetricBERT, a BERT-based model that learns to embed text under a well-defined similarity metric while simultaneously adhering to the ``traditional'' masked-language task. We focus on downstream tasks of learning similarities for recommendations where we show that MetricBERT outperforms state-of-the-art alternatives, sometimes by a substantial margin. We conduct extensive evaluations of…
▽ More
We present MetricBERT, a BERT-based model that learns to embed text under a well-defined similarity metric while simultaneously adhering to the ``traditional'' masked-language task. We focus on downstream tasks of learning similarities for recommendations where we show that MetricBERT outperforms state-of-the-art alternatives, sometimes by a substantial margin. We conduct extensive evaluations of our method and its different variants, showing that our training objective is highly beneficial over a traditional contrastive loss, a standard cosine similarity objective, and six other baselines. As an additional contribution, we publish a dataset of video games descriptions along with a test set of similarity annotations crafted by a domain expert.
△ Less
Submitted 13 August, 2022;
originally announced August 2022.