Showing 1–2 of 2 results for author: Auersperger, M

Search v0.5.6 released 2020-02-24

arXiv:2206.04751 [pdf, other]

cs.CL

Defending Compositionality in Emergent Languages

Authors: Michal Auersperger, Pavel Pecina

Abstract: Compositionality has traditionally been understood as a major factor in productivity of language and, more broadly, human cognition. Yet, recently, some research started to question its status, showing that artificial neural networks are good at generalization even without noticeable compositional behavior. We argue that some of these conclusions are too strong and/or incomplete. In the context of… ▽ More Compositionality has traditionally been understood as a major factor in productivity of language and, more broadly, human cognition. Yet, recently, some research started to question its status, showing that artificial neural networks are good at generalization even without noticeable compositional behavior. We argue that some of these conclusions are too strong and/or incomplete. In the context of a two-agent communication game, we show that compositionality indeed seems essential for successful generalization when the evaluation is done on a proper dataset. △ Less

Submitted 9 June, 2022; originally announced June 2022.

Comments: Accepted to NAACL SRW 22
arXiv:1907.12750 [pdf, ps, other]

cs.CL

English-Czech Systems in WMT19: Document-Level Transformer

Authors: Martin Popel, Dominik Macháček, Michal Auersperger, Ondřej Bojar, Pavel Pecina

Abstract: We describe our NMT systems submitted to the WMT19 shared task in English-Czech news translation. Our systems are based on the Transformer model implemented in either Tensor2Tensor (T2T) or Marian framework. We aimed at improving the adequacy and coherence of translated documents by enlarging the context of the source and target. Instead of translating each sentence independently, we split the d… ▽ More We describe our NMT systems submitted to the WMT19 shared task in English-Czech news translation. Our systems are based on the Transformer model implemented in either Tensor2Tensor (T2T) or Marian framework. We aimed at improving the adequacy and coherence of translated documents by enlarging the context of the source and target. Instead of translating each sentence independently, we split the document into possibly overlapping multi-sentence segments. In case of the T2T implementation, this "document-level"-trained system achieves a $+0.6$ BLEU improvement ($p<0.05$) relative to the same system applied on isolated sentences. To assess the potential effect document-level models might have on lexical coherence, we performed a semi-automatic analysis, which revealed only a few sentences improved in this aspect. Thus, we cannot draw any conclusions from this weak evidence. △ Less

Submitted 30 July, 2019; originally announced July 2019.

Comments: WMT19

Search v0.5.6 released 2020-02-24