Impact of Training Dataset Size on Neural Answer Selection Models

Linjordet, Trond; Balog, Krisztian

Computer Science > Information Retrieval

arXiv:1901.10496 (cs)

[Submitted on 29 Jan 2019]

Title:Impact of Training Dataset Size on Neural Answer Selection Models

Authors:Trond Linjordet, Krisztian Balog

View PDF

Abstract:It is held as a truism that deep neural networks require large datasets to train effective models. However, large datasets, especially with high-quality labels, can be expensive to obtain. This study sets out to investigate (i) how large a dataset must be to train well-performing models, and (ii) what impact can be shown from fractional changes to the dataset size. A practical method to investigate these questions is to train a collection of deep neural answer selection models using fractional subsets of varying sizes of an initial dataset. We observe that dataset size has a conspicuous lack of effect on the training of some of these models, bringing the underlying algorithms into question.

Comments:	7 pages, 2 figures
Subjects:	Information Retrieval (cs.IR); Computation and Language (cs.CL)
Cite as:	arXiv:1901.10496 [cs.IR]
	(or arXiv:1901.10496v1 [cs.IR] for this version)
	https://doi.org/10.48550/arXiv.1901.10496

Submission history

From: Trond Linjordet [view email]
[v1] Tue, 29 Jan 2019 19:00:21 UTC (5,247 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.IR

< prev | next >

new | recent | 2019-01

Change to browse by:

cs
cs.CL

References & Citations

DBLP - CS Bibliography

listing | bibtex

Trond Linjordet
Krisztian Balog

export BibTeX citation

Computer Science > Information Retrieval

Title:Impact of Training Dataset Size on Neural Answer Selection Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Information Retrieval

Title:Impact of Training Dataset Size on Neural Answer Selection Models

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators