Skip to main content

Showing 1–1 of 1 results for author: Shade, B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2305.02457  [pdf, other

    cs.CL cond-mat.stat-mech physics.soc-ph

    Quantifying the Dissimilarity of Texts

    Authors: Benjamin Shade, Eduardo G. Altmann

    Abstract: Quantifying the dissimilarity of two texts is an important aspect of a number of natural language processing tasks, including semantic information retrieval, topic classification, and document clustering. In this paper, we compared the properties and performance of different dissimilarity measures $D$ using three different representations of texts -- vocabularies, word frequency distributions, and… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: 16 pages, 4 figures, part of the Special Issue Novel Methods and Applications in Natural Language Processing

    Journal ref: Information 2023, 14, 271