BBC-Oxford British Sign Language Dataset
Authors:
Samuel Albanie,
Gül Varol,
Liliane Momeni,
Hannah Bull,
Triantafyllos Afouras,
Himel Chowdhury,
Neil Fox,
Bencie Woll,
Rob Cooper,
Andrew McParland,
Andrew Zisserman
Abstract:
In this work, we introduce the BBC-Oxford British Sign Language (BOBSL) dataset, a large-scale video collection of British Sign Language (BSL). BOBSL is an extended and publicly released dataset based on the BSL-1K dataset introduced in previous work. We describe the motivation for the dataset, together with statistics and available annotations. We conduct experiments to provide baselines for the tasks of sign recognition, sign language alignment, and sign language translation. Finally, we describe several strengths and limitations of the data from the perspectives of machine learning and linguistics, note sources of bias present in the dataset, and discuss potential applications of BOBSL in the context of sign language technology. The dataset is available at https://www.robots.ox.ac.uk/~vgg/data/bobsl/.
Submitted 5 November, 2021;
originally announced November 2021.