Skip to main content

Showing 1–1 of 1 results for author: Mollberg, D E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2004.07776  [pdf, other

    cs.CL

    Kvistur 2.0: a BiLSTM Compound Splitter for Icelandic

    Authors: Jón Friðrik Daðason, David Erik Mollberg, Hrafn Loftsson, Kristín Bjarnadóttir

    Abstract: In this paper, we present a character-based BiLSTM model for splitting Icelandic compound words, and show how varying amounts of training data affects the performance of the model. Compounding is highly productive in Icelandic, and new compounds are constantly being created. This results in a large number of out-of-vocabulary (OOV) words, negatively impacting the performance of many NLP tools. Our… ▽ More

    Submitted 16 April, 2020; originally announced April 2020.

    Comments: Accepted at LREC 2020