Skip to main content

Showing 1–1 of 1 results for author: Ruohe, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2003.11562  [pdf, other

    cs.CL cs.LG cs.SD eess.AS stat.ML

    Finnish Language Modeling with Deep Transformer Models

    Authors: Abhilash Jain, Aku Ruohe, Stig-Arne Grönroos, Mikko Kurimo

    Abstract: Transformers have recently taken the center stage in language modeling after LSTM's were considered the dominant model architecture for a long time. In this project, we investigate the performance of the Transformer architectures-BERT and Transformer-XL for the language modeling task. We use a sub-word model setting with the Finnish language and compare it to the previous State of the art (SOTA) L… ▽ More

    Submitted 27 March, 2020; v1 submitted 14 March, 2020; originally announced March 2020.

    Comments: 4 pages