Showing 1–1 of 1 results for author: Palle, D R

  1. arXiv:2309.08295  [pdf, other]

    eess.AS cs.CV cs.LG cs.SD

    A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism

    Authors: Ilya Gurvich, Ido Leichter, Dharmendar Reddy Palle, Yossi Asher, Alon Vinnikov, Igor Abramovski, Vishak Gopal, Ross Cutler, Eyal Krupka

    Abstract: We introduce a distinctive real-time, causal, neural network-based active speaker detection system optimized for low-power edge computing. This system drives a virtual cinematography module and is deployed on a commercial device. The system uses data originating from a microphone array and a 360-degree camera. Our network requires only 127 MFLOPs per participant, for a meeting with 14 participants…

    Submitted 15 September, 2023; originally announced September 2023.