Baselines and test data for cross-lingual inference

Agić, Željko; Schluter, Natalie

Computer Science > Computation and Language

arXiv:1704.05347 (cs)

[Submitted on 18 Apr 2017 (v1), last revised 2 Mar 2018 (this version, v2)]

Title:Baselines and test data for cross-lingual inference

Authors:Željko Agić, Natalie Schluter

View PDF

Abstract:The recent years have seen a revival of interest in textual entailment, sparked by i) the emergence of powerful deep neural network learners for natural language processing and ii) the timely development of large-scale evaluation datasets such as SNLI. Recast as natural language inference, the problem now amounts to detecting the relation between pairs of statements: they either contradict or entail one another, or they are mutually neutral. Current research in natural language inference is effectively exclusive to English. In this paper, we propose to advance the research in SNLI-style natural language inference toward multilingual evaluation. To that end, we provide test data for four major languages: Arabic, French, Spanish, and Russian. We experiment with a set of baselines. Our systems are based on cross-lingual word embeddings and machine translation. While our best system scores an average accuracy of just over 75%, we focus largely on enabling further research in multilingual inference.

Comments:	To appear at LREC 2018
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1704.05347 [cs.CL]
	(or arXiv:1704.05347v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1704.05347

Submission history

From: Zeljko Agic [view email]
[v1] Tue, 18 Apr 2017 14:12:37 UTC (43 KB)
[v2] Fri, 2 Mar 2018 18:24:49 UTC (38 KB)

Full-text links:

Access Paper:

view license

Current browse context:

< prev | next >

new | recent | 2017-04

Change to browse by:

cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Zeljko Agic
Natalie Schluter

export BibTeX citation

Computer Science > Computation and Language

Title:Baselines and test data for cross-lingual inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Baselines and test data for cross-lingual inference

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators