Showing 1–1 of 1 results for author: Souza, D

Search v0.5.6 released 2020-02-24

arXiv:2408.10359 [pdf, other]

econ.GN cs.CY

How Small is Big Enough? Open Labeled Datasets and the Development of Deep Learning

Authors: Daniel Souza, Aldo Geuna, Jeff Rodríguez

Abstract: We investigate the emergence of Deep Learning as a technoscientific field, emphasizing the role of open labeled datasets. Through qualitative and quantitative analyses, we evaluate the role of datasets like CIFAR-10 in advancing computer vision and object recognition, which are central to the Deep Learning revolution. Our findings highlight CIFAR-10's crucial role and enduring influence on the fie… ▽ More We investigate the emergence of Deep Learning as a technoscientific field, emphasizing the role of open labeled datasets. Through qualitative and quantitative analyses, we evaluate the role of datasets like CIFAR-10 in advancing computer vision and object recognition, which are central to the Deep Learning revolution. Our findings highlight CIFAR-10's crucial role and enduring influence on the field, as well as its importance in teaching ML techniques. Results also indicate that dataset characteristics such as size, number of instances, and number of categories, were key factors. Econometric analysis confirms that CIFAR-10, a small-but-sufficiently-large open dataset, played a significant and lasting role in technological advancements and had a major function in the development of the early scientific literature as shown by citation metrics. △ Less

Submitted 19 August, 2024; originally announced August 2024.

Search v0.5.6 released 2020-02-24