Skip to main content

Showing 1–1 of 1 results for author: Rodrigues, L B

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.08658  [pdf, other

    cs.SD cs.AI

    Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks

    Authors: Lucca Emmanuel Pineli Simões, Lucas Brandão Rodrigues, Rafaela Mota Silva, Gustavo Rodrigues da Silva

    Abstract: This paper presents the development and comparative evaluation of three voice command pipelines for controlling a Tello drone, using speech recognition and deep learning techniques. The aim is to enhance human-machine interaction by enabling intuitive voice control of drone actions. The pipelines developed include: (1) a traditional Speech-to-Text (STT) followed by a Large Language Model (LLM) app… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    ACM Class: I.2.7; I.2.10