Skip to main content

Showing 1–3 of 3 results for author: Punjabi, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.14240  [pdf, other

    cs.RO

    Effectively Rearranging Heterogeneous Objects on Cluttered Tabletops

    Authors: Kai Gao, Justin Yu, Tanay Sandeep Punjabi, Jingjin Yu

    Abstract: Effectively rearranging heterogeneous objects constitutes a high-utility skill that an intelligent robot should master. Whereas significant work has been devoted to the grasp synthesis of heterogeneous objects, little attention has been given to the planning for sequentially manipulating such objects. In this work, we examine the long-horizon sequential rearrangement of heterogeneous objects in a… ▽ More

    Submitted 30 June, 2023; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted by 2023 IROS - IEEE/RSJ International Conference on Intelligent Robots

  2. arXiv:2007.03900  [pdf, other

    eess.AS cs.CL cs.SD

    Streaming End-to-End Bilingual ASR Systems with Joint Language Identification

    Authors: Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Müller, Sergio Murillo, Ariya Rastrow, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann

    Abstract: Multilingual ASR technology simplifies model training and deployment, but its accuracy is known to depend on the availability of language information at runtime. Since language identity is seldom known beforehand in real-world scenarios, it must be inferred on-the-fly with minimum latency. Furthermore, in voice-activated smart assistant systems, language identity is also required for downstream pr… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

  3. arXiv:1912.00958  [pdf, other

    cs.CL cs.LG

    Language Model Bootstrapping Using Neural Machine Translation For Conversational Speech Recognition

    Authors: Surabhi Punjabi, Harish Arsikere, Sri Garimella

    Abstract: Building conversational speech recognition systems for new languages is constrained by the availability of utterances that capture user-device interactions. Data collection is both expensive and limited by the speed of manual transcription. In order to address this, we advocate the use of neural machine translation as a data augmentation technique for bootstrapping language models. Machine transla… ▽ More

    Submitted 2 December, 2019; originally announced December 2019.

    Comments: Accepted by IEEE ASRU workshop, 2019