Showing 1–4 of 4 results for author: de Gusmão, P P B

  1. arXiv:2209.15575  [pdf, other]

    cs.SD cs.LG eess.AS

    Match to Win: Analysing Sequences Lengths for Efficient Self-supervised Learning in Speech and Audio

    Authors: Yan Gao, Javier Fernandez-Marques, Titouan Parcollet, Pedro P. B. de Gusmao, Nicholas D. Lane

    Abstract: Self-supervised learning (SSL) has proven vital in speech and audio-related applications. The paradigm trains a general model on unlabeled data that can later be used to solve specific downstream tasks. This type of model is costly to train as it requires manipulating long input sequences that can only be handled by powerful centralised servers. Surprisingly, despite many attempts to increase trai…

    Submitted 22 November, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

  2. arXiv:2104.14297  [pdf, other]

    cs.SD cs.LG eess.AS

    End-to-End Speech Recognition from Federated Acoustic Models

    Authors: Yan Gao, Titouan Parcollet, Salah Zaiem, Javier Fernandez-Marques, Pedro P. B. de Gusmao, Daniel J. Beutel, Nicholas D. Lane

    Abstract: Training Automatic Speech Recognition (ASR) models under federated learning (FL) settings has attracted a lot of attention recently. However, the FL scenarios often presented in the literature are artificial and fail to capture the complexity of real FL systems. In this paper, we construct a challenging and realistic ASR federated experimental setup consisting of clients with heterogeneous data di…

    Submitted 9 July, 2021; v1 submitted 29 April, 2021; originally announced April 2021.

  3. arXiv:1911.09968  [pdf, other]

    cs.CV cs.LG eess.IV

    SelfVIO: Self-Supervised Deep Monocular Visual-Inertial Odometry and Depth Estimation

    Authors: Yasin Almalioglu, Mehmet Turan, Alp Eren Sari, Muhamad Risqi U. Saputra, Pedro P. B. de Gusmão, Andrew Markham, Niki Trigoni

    Abstract: In the last decade, numerous supervised deep learning approaches requiring large amounts of labeled data have been proposed for visual-inertial odometry (VIO) and depth map estimation. To overcome the data limitation, self-supervised learning has emerged as a promising alternative, exploiting constraints such as geometric and photometric consistency in the scene. In this study, we introduce a nove…

    Submitted 23 July, 2020; v1 submitted 22 November, 2019; originally announced November 2019.

    Comments: 15 pages, submitted to The IEEE Transactions on Robotics (T-RO) journal, under review

  4. arXiv:1909.08356  [pdf, other]

    eess.SP stat.CO

    Sensor Fusion for Magneto-Inductive Navigation

    Authors: Johan Wahlström, Manon Kok, Pedro Porto Buarque de Gusmao, Traian E. Abrudan, Niki Trigoni, Andrew Markham

    Abstract: Magneto-inductive navigation is an inexpensive and easily deployable solution to many of today's navigation problems. By utilizing very low frequency magnetic fields, magneto-inductive technology circumvents the problems with attenuation and multipath that often plague competing modalities. Using triaxial transmitter and receiver coils, it is possible to compute position and orientation estimates…

    Submitted 18 September, 2019; originally announced September 2019.