
Showing 1–13 of 13 results for author: Yamagiwa, H

  1. arXiv:2505.15428  [pdf, ps, other]

    cs.CL

    Likelihood Variance as Text Importance for Resampling Texts to Map Language Models

    Authors: Momose Oyama, Ryo Kishino, Hiroaki Yamagiwa, Hidetoshi Shimodaira

    Abstract: We address the computational cost of constructing a model map, which embeds diverse language models into a common space for comparison via KL divergence. The map relies on log-likelihoods over a large text set, making the cost proportional to the number of texts. To reduce this cost, we propose a resampling method that selects important texts with weights proportional to the variance of log-likeli…

    Submitted 21 May, 2025; originally announced May 2025.

  2. arXiv:2505.15353  [pdf, ps, other]

    cs.CL

    Revealing Language Model Trajectories via Kullback-Leibler Divergence

    Authors: Ryo Kishino, Yusuke Takase, Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira

    Abstract: A recently proposed method enables efficient estimation of the KL divergence between language models, including models with different architectures, by assigning coordinates based on log-likelihood vectors. To better understand the behavior of this metric, we systematically evaluate KL divergence across a wide range of conditions using publicly available language models. Our analysis covers compar…

    Submitted 21 May, 2025; originally announced May 2025.

  3. arXiv:2502.16173  [pdf, ps, other]

    cs.CL

    Mapping 1,000+ Language Models via the Log-Likelihood Vector

    Authors: Momose Oyama, Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira

    Abstract: To compare autoregressive language models at scale, we propose using log-likelihood vectors computed on a predefined text set as model features. This approach has a solid theoretical basis: when treated as model coordinates, their squared Euclidean distance approximates the Kullback-Leibler divergence of text-generation probabilities. Our method is highly scalable, with computational cost growing…

    Submitted 31 May, 2025; v1 submitted 22 February, 2025; originally announced February 2025.

    Comments: ACL 2025

  4. arXiv:2412.12569  [pdf, ps, other]

    cs.CL

    Quantifying Lexical Semantic Shift via Unbalanced Optimal Transport

    Authors: Ryo Kishino, Hiroaki Yamagiwa, Ryo Nagata, Sho Yokoi, Hidetoshi Shimodaira

    Abstract: Lexical semantic change detection aims to identify shifts in word meanings over time. While existing methods using embeddings from a diachronic corpus pair estimate the degree of change for target words, they offer limited insight into changes at the level of individual usage instances. To address this, we apply Unbalanced Optimal Transport (UOT) to sets of contextualized word embeddings, capturin…

    Submitted 31 May, 2025; v1 submitted 17 December, 2024; originally announced December 2024.

    Comments: ACL 2025

  5. arXiv:2409.19919  [pdf, other]

    cs.CL

    Understanding Higher-Order Correlations Among Semantic Components in Embeddings

    Authors: Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira

    Abstract: Independent Component Analysis (ICA) offers interpretable semantic components of embeddings. While ICA theory assumes that embeddings can be linearly decomposed into independent components, real-world data often do not satisfy this assumption. Consequently, non-independencies remain between the estimated components, which ICA cannot eliminate. We quantified these non-independencies using higher-or…

    Submitted 9 October, 2024; v1 submitted 29 September, 2024; originally announced September 2024.

    Comments: EMNLP 2024

  6. arXiv:2409.11253  [pdf, other]

    cs.CL

    Norm of Mean Contextualized Embeddings Determines their Variance

    Authors: Hiroaki Yamagiwa, Hidetoshi Shimodaira

    Abstract: Contextualized embeddings vary by context, even for the same token, and form a distribution in the embedding space. To analyze this distribution, we focus on the norm of the mean embedding and the variance of the embeddings. In this study, we first demonstrate that these values follow the well-known formula for variance in statistics and provide an efficient sequential computation method. Then, by…

    Submitted 17 December, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

    Comments: COLING 2025

  7. arXiv:2406.18094  [pdf, other]

    cs.CL

    Shimo Lab at "Discharge Me!": Discharge Summarization by Prompt-Driven Concatenation of Electronic Health Record Sections

    Authors: Yunzhen He, Hiroaki Yamagiwa, Hidetoshi Shimodaira

    Abstract: In this paper, we present our approach to the shared task "Discharge Me!" at the BioNLP Workshop 2024. The primary goal of this task is to reduce the time and effort clinicians spend on writing detailed notes in the electronic health record (EHR). Participants develop a pipeline to generate the "Brief Hospital Course" and "Discharge Instructions" sections from the EHR. Our approach involves a firs…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: BioNLP @ ACL2024

  8. arXiv:2406.10984  [pdf, other]

    cs.CL

    Revisiting Cosine Similarity via Normalized ICA-transformed Embeddings

    Authors: Hiroaki Yamagiwa, Momose Oyama, Hidetoshi Shimodaira

    Abstract: Cosine similarity is widely used to measure the similarity between two embeddings, while interpretations based on angle and correlation coefficient are common. In this study, we focus on the interpretable axes of embeddings transformed by Independent Component Analysis (ICA), and propose a novel interpretation of cosine similarity as the sum of semantic similarities over axes. The normalized ICA-t…

    Submitted 17 December, 2024; v1 submitted 16 June, 2024; originally announced June 2024.

    Comments: COLING 2025

  9. Predicting drug-gene relations via analogy tasks with word embeddings

    Authors: Hiroaki Yamagiwa, Ryoma Hashimoto, Kiwamu Arakane, Ken Murakami, Shou Soeda, Momose Oyama, Yihua Zhu, Mariko Okada, Hidetoshi Shimodaira

    Abstract: Natural language processing (NLP) is utilized in a wide range of fields, where words in text are typically transformed into feature vectors called embeddings. BioConceptVec is a specific example of embeddings tailored for biology, trained on approximately 30 million PubMed abstracts using models such as skip-gram. Generally, word embeddings are known to solve analogy tasks through simple vector ar…

    Submitted 27 May, 2025; v1 submitted 3 June, 2024; originally announced June 2024.

    Journal ref: Sci Rep 15, 17240 (2025)

  10. arXiv:2401.06112  [pdf, other]

    cs.CL

    Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings

    Authors: Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira

    Abstract: Word embedding is one of the most important components in natural language processing, but interpreting high-dimensional embeddings remains a challenging problem. To address this problem, Independent Component Analysis (ICA) is identified as an effective solution. ICA-transformed word embeddings reveal interpretable semantic axes; however, the order of these axes is arbitrary. In this study, we f…

    Submitted 9 October, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: EMNLP 2024 Findings (short)

  11. arXiv:2308.13779  [pdf, other]

    cs.CV

    Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation

    Authors: Hiroaki Yamagiwa, Yusuke Takase, Hiroyuki Kambe, Ryosuke Nakamoto

    Abstract: This paper proposes a novel zero-shot edge detection with SCESAME, which stands for Spectral Clustering-based Ensemble for Segment Anything Model Estimation, based on the recently proposed Segment Anything Model (SAM). SAM is a foundation model for segmentation tasks, and one of the interesting applications of SAM is Automatic Mask Generation (AMG), which generates zero-shot segmentation masks of…

    Submitted 18 November, 2023; v1 submitted 26 August, 2023; originally announced August 2023.

    Comments: 11 pages, accepted to WACV 2024 Workshop

  12. arXiv:2305.13175  [pdf, other]

    cs.CL

    Discovering Universal Geometry in Embeddings with ICA

    Authors: Hiroaki Yamagiwa, Momose Oyama, Hidetoshi Shimodaira

    Abstract: This study utilizes Independent Component Analysis (ICA) to unveil a consistent semantic structure within embeddings of words or images. Our approach extracts independent semantic components from the embeddings of a pre-trained model by leveraging anisotropic information that remains after the whitening process in Principal Component Analysis (PCA). We demonstrate that each embedding can be expres…

    Submitted 2 November, 2023; v1 submitted 22 May, 2023; originally announced May 2023.

    Comments: 29 pages, EMNLP 2023

  13. arXiv:2211.06229  [pdf, other]

    cs.CL

    Improving word mover's distance by leveraging self-attention matrix

    Authors: Hiroaki Yamagiwa, Sho Yokoi, Hidetoshi Shimodaira

    Abstract: Measuring the semantic similarity between two sentences is still an important task. The word mover's distance (WMD) computes the similarity via the optimal alignment between the sets of word embeddings. However, WMD does not utilize word order, making it challenging to distinguish sentences with significant overlaps of similar words, even if they are semantically very different. Here, we attempt t…

    Submitted 2 November, 2023; v1 submitted 11 November, 2022; originally announced November 2022.

    Comments: 24 pages, accepted to EMNLP 2023 Findings