Showing 1–2 of 2 results for author: Kurita, H

  1. arXiv:2411.00680

    cs.CL cs.LG stat.ML

    Zipfian Whitening

    Authors: Sho Yokoi, Han Bao, Hiroto Kurita, Hidetoshi Shimodaira

    Abstract: The word embedding space in neural models is skewed, and correcting this can improve task performance. We point out that most approaches for modeling, correcting, and measuring the symmetry of an embedding space implicitly assume that the word frequencies are uniform; in reality, word frequencies follow a highly non-uniform distribution, known as Zipf's law. Surprisingly, simply performing PCA whi…

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024

  2. arXiv:2310.15921

    cs.CL

    Contrastive Learning-based Sentence Encoders Implicitly Weight Informative Words

    Authors: Hiroto Kurita, Goro Kobayashi, Sho Yokoi, Kentaro Inui

    Abstract: The performance of sentence encoders can be significantly improved through the simple practice of fine-tuning using contrastive loss. A natural question arises: what characteristics do models acquire during contrastive learning? This paper theoretically and experimentally shows that contrastive-based sentence encoders implicitly weight words based on information-theoretic quantities; that is, more…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: 16 pages, 6 figures, accepted to EMNLP 2023 Findings (short paper)