Skip to main content

Showing 1–4 of 4 results for author: Leichter, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2309.08295  [pdf, other

    eess.AS cs.CV cs.LG cs.SD

    A Real-Time Active Speaker Detection System Integrating an Audio-Visual Signal with a Spatial Querying Mechanism

    Authors: Ilya Gurvich, Ido Leichter, Dharmendar Reddy Palle, Yossi Asher, Alon Vinnikov, Igor Abramovski, Vishak Gopal, Ross Cutler, Eyal Krupka

    Abstract: We introduce a distinctive real-time, causal, neural network-based active speaker detection system optimized for low-power edge computing. This system drives a virtual cinematography module and is deployed on a commercial device. The system uses data originating from a microphone array and a 360-degree camera. Our network requires only 127 MFLOPs per participant, for a meeting with 14 participants… ▽ More

    Submitted 15 September, 2023; originally announced September 2023.

  2. arXiv:1912.04979  [pdf, other

    eess.AS cs.CL cs.CV cs.SD eess.IV

    Advances in Online Audio-Visual Meeting Transcription

    Authors: Takuya Yoshioka, Igor Abramovski, Cem Aksoylar, Zhuo Chen, Moshe David, Dimitrios Dimitriadis, Yifan Gong, Ilya Gurvich, Xuedong Huang, Yan Huang, Aviv Hurvitz, Li Jiang, Sharon Koubi, Eyal Krupka, Ido Leichter, Changliang Liu, Partha Parthasarathy, Alon Vinnikov, Lingfeng Wu, Xiong Xiao, Wayne Xiong, Huaming Wang, Zhenghao Wang, Jun Zhang, Yong Zhao , et al. (1 additional authors not shown)

    Abstract: This paper describes a system that generates speaker-annotated transcripts of meetings by using a microphone array and a 360-degree camera. The hallmark of the system is its ability to handle overlapped speech, which has been an unsolved problem in realistic settings for over a decade. We show that this problem can be addressed by using a continuous speech separation approach. In addition, we desc… ▽ More

    Submitted 10 December, 2019; originally announced December 2019.

    Comments: To appear in Proc. IEEE ASRU Workshop 2019

  3. arXiv:1412.2873  [pdf, ps, other

    cs.CV

    Cancer Detection with Multiple Radiologists via Soft Multiple Instance Logistic Regression and $L_1$ Regularization

    Authors: Inna Stainvas, Alexandra Manevitch, Isaac Leichter

    Abstract: This paper deals with the multiple annotation problem in medical application of cancer detection in digital images. The main assumption is that though images are labeled by many experts, the number of images read by the same expert is not large. Thus differing with the existing work on modeling each expert and ground truth simultaneously, the multi annotation information is used in a soft manner.… ▽ More

    Submitted 9 December, 2014; originally announced December 2014.

    Comments: 20 pages, report

  4. arXiv:1403.5919  [pdf, other

    cs.CV

    SRA: Fast Removal of General Multipath for ToF Sensors

    Authors: Daniel Freedman, Eyal Krupka, Yoni Smolin, Ido Leichter, Mirko Schmidt

    Abstract: A major issue with Time of Flight sensors is the presence of multipath interference. We present Sparse Reflections Analysis (SRA), an algorithm for removing this interference which has two main advantages. First, it allows for very general forms of multipath, including interference with three or more paths, diffuse multipath resulting from Lambertian surfaces, and combinations thereof. SRA removes… ▽ More

    Submitted 24 March, 2014; originally announced March 2014.