Showing 1–15 of 15 results for author: Fox, D

  1. arXiv:2410.03930

    cs.CL cs.SD eess.AS

    Reverb: Open-Source ASR and Diarization from Rev

    Authors: Nishchal Bhandari, Danny Chen, Miguel Ángel del Río Fernández, Natalie Delworth, Jennifer Drexler Fox, Migüel Jetté, Quinten McNamara, Corey Miller, Ondřej Novotný, Ján Profant, Nan Qin, Martin Ratajczak, Jean-Philippe Robichaud

    Abstract: Today, we are open-sourcing our core speech recognition and diarization models for non-commercial use. We are releasing both a full production pipeline for developers as well as pared-down research models for experimentation. Rev hopes that these releases will spur research and innovation in the fast-moving domain of voice technology. The speech recognition models released today outperform all exi…

    Submitted 24 February, 2025; v1 submitted 4 October, 2024; originally announced October 2024.

  2. arXiv:2404.12308

    cs.RO cs.LG eess.SY

    ASID: Active Exploration for System Identification in Robotic Manipulation

    Authors: Marius Memmel, Andrew Wagenmaker, Chuning Zhu, Patrick Yin, Dieter Fox, Abhishek Gupta

    Abstract: Model-free control strategies such as reinforcement learning have shown the ability to learn control strategies without requiring an accurate model or simulator of the world. While this is appealing due to the lack of modeling requirements, such methods can be sample inefficient, making them impractical in many real-world domains. On the other hand, model-based control techniques leveraging accura…

    Submitted 26 June, 2024; v1 submitted 18 April, 2024; originally announced April 2024.

    Comments: Project website at https://weirdlabuw.github.io/asid

  3. arXiv:2309.15013

    cs.CL cs.SD eess.AS

    Updated Corpora and Benchmarks for Long-Form Speech Recognition

    Authors: Jennifer Drexler Fox, Desh Raj, Natalie Delworth, Quinn McNamara, Corey Miller, Migüel Jetté

    Abstract: The vast majority of ASR research uses corpora in which both the training and test data have been pre-segmented into utterances. In most real-world ASR use-cases, however, test audio is not segmented, leading to a mismatch between inference-time conditions and models trained on segmented utterances. In this paper, we re-release three standard ASR corpora - TED-LIUM 3, GigaSpeech, and VoxPopuli-en -…

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: Submitted to ICASSP 2024

  4. arXiv:2209.01250

    cs.CL cs.SD eess.AS

    Improving Contextual Recognition of Rare Words with an Alternate Spelling Prediction Model

    Authors: Jennifer Drexler Fox, Natalie Delworth

    Abstract: Contextual ASR, which takes a list of bias terms as input along with audio, has drawn recent interest as ASR use becomes more widespread. We are releasing contextual biasing lists to accompany the Earnings21 dataset, creating a public benchmark for this task. We present baseline results on this benchmark using a pretrained end-to-end ASR model from the WeNet toolkit. We show results for shallow fu…

    Submitted 2 September, 2022; originally announced September 2022.

  5. arXiv:2202.08883

    eess.AS cs.LG cs.SD

    Curriculum optimization for low-resource speech recognition

    Authors: Anastasia Kuznetsova, Anurag Kumar, Jennifer Drexler Fox, Francis Tyers

    Abstract: Modern end-to-end speech recognition models show astonishing results in transcribing audio signals into written text. However, conventional data feeding pipelines may be sub-optimal for low-resource speech recognition, which still remains a challenging task. We propose an automated curriculum learning approach to optimize the sequence of training examples based on both the progress of the model wh…

    Submitted 17 February, 2022; originally announced February 2022.

  6. arXiv:2109.10443

    cs.RO eess.SY

    Geometric Fabrics: Generalizing Classical Mechanics to Capture the Physics of Behavior

    Authors: Karl Van Wyk, Mandy Xie, Anqi Li, Muhammad Asif Rana, Buck Babich, Bryan Peele, Qian Wan, Iretiayo Akinola, Balakumar Sundaralingam, Dieter Fox, Byron Boots, Nathan D. Ratliff

    Abstract: Classical mechanical systems are central to controller design in energy shaping methods of geometric control. However, their expressivity is limited by position-only metrics and the intimate link between metric and geometry. Recent work on Riemannian Motion Policies (RMPs) has shown that shedding these restrictions results in powerful design tools, but at the expense of theoretical stability guara…

    Submitted 18 January, 2022; v1 submitted 21 September, 2021; originally announced September 2021.

  7. arXiv:2105.12189

    cs.LG cs.RO eess.SY

    Robust Value Iteration for Continuous Control Tasks

    Authors: Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg

    Abstract: When transferring a control policy from simulation to a physical system, the policy needs to be robust to variations in the dynamics to perform well. Commonly, the optimal policy overfits to the approximate model and the corresponding state-distribution, often resulting in failure to transfer underlying distributional shifts. In this paper, we present Robust Fitted Value Iteration, which uses dyna…

    Submitted 25 May, 2021; originally announced May 2021.

    Comments: Accepted Paper at Robotics: Science and Systems

  8. arXiv:2105.04682

    cs.LG cs.RO eess.SY

    Value Iteration in Continuous Actions, States and Time

    Authors: Michael Lutter, Shie Mannor, Jan Peters, Dieter Fox, Animesh Garg

    Abstract: Classical value iteration approaches are not applicable to environments with continuous states and actions. For such environments, the states and actions are usually discretized, which leads to an exponential increase in computational complexity. In this paper, we propose continuous fitted value iteration (cFVI). This algorithm enables dynamic programming for continuous states and actions with a k…

    Submitted 10 May, 2021; originally announced May 2021.

    Comments: Accepted at International Conference on Machine Learning (ICML) 2021

  9. arXiv:2103.11097

    eess.SY

    In-Field Gyroscope Autocalibration with Iterative Attitude Estimation

    Authors: Li Wang, Rob Duffield, Deborah Fox, Athena Hammond, Andrew J. Zhang, Wei Xing Zheng, Steven W. Su

    Abstract: This paper presents an efficient in-field calibration method tailored for low-cost triaxial MEMS gyroscopes often used in healthcare applications. Traditional calibration techniques are challenging to implement in clinical settings due to the unavailability of high-precision equipment. Unlike the auto-calibration approaches used for triaxial MEMS accelerometers, which rely on local gravity, gyrosc…

    Submitted 15 August, 2024; v1 submitted 20 March, 2021; originally announced March 2021.

  10. arXiv:2005.13143

    cs.RO cs.LG eess.SY

    Euclideanizing Flows: Diffeomorphic Reduction for Learning Stable Dynamical Systems

    Authors: Muhammad Asif Rana, Anqi Li, Dieter Fox, Byron Boots, Fabio Ramos, Nathan Ratliff

    Abstract: Robotic tasks often require motions with complex geometric structures. We present an approach to learn such motions from a limited number of human demonstrations by exploiting the regularity properties of human motions e.g. stability, smoothness, and boundedness. The complex motions are encoded as rollouts of a stable dynamical system, which, under a change of coordinates defined by a diffeomorphi…

    Submitted 21 September, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: 2nd Annual Conference on Learning for Dynamics and Control (L4DC) 2020 -- Revised Version

  11. arXiv:2005.10872

    cs.RO cs.AI cs.LG eess.SY

    Guided Uncertainty-Aware Policy Optimization: Combining Learning and Model-Based Strategies for Sample-Efficient Policy Learning

    Authors: Michelle A. Lee, Carlos Florensa, Jonathan Tremblay, Nathan Ratliff, Animesh Garg, Fabio Ramos, Dieter Fox

    Abstract: Traditional robotic approaches rely on an accurate model of the environment, a detailed description of how to perform the task, and a robust perception system to keep track of the current state. On the other hand, reinforcement learning approaches can operate directly from raw sensory inputs with only a reward signal to describe the task, but are extremely sample-inefficient and brittle. In this w…

    Submitted 26 May, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

    Journal ref: International Conference in Robotics and Automation 2020

  12. arXiv:1910.02646

    cs.RO cs.AI cs.LG eess.SY

    Riemannian Motion Policy Fusion through Learnable Lyapunov Function Reshaping

    Authors: Mustafa Mukadam, Ching-An Cheng, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: RMPflow is a recently proposed policy-fusion framework based on differential geometry. While RMPflow has demonstrated promising performance, it requires the user to provide sensible subtask policies as Riemannian motion policies (RMPs: a motion policy and an importance matrix function), which can be a difficult design problem in its own right. We propose RMPfusion, a variation of RMPflow, to addre…

    Submitted 8 October, 2019; v1 submitted 7 October, 2019; originally announced October 2019.

    Comments: Conference on Robot Learning (CoRL), 2019

  13. arXiv:1811.07049

    cs.RO eess.SY

    RMPflow: A Computational Graph for Automatic Motion Policy Generation

    Authors: Ching-An Cheng, Mustafa Mukadam, Jan Issac, Stan Birchfield, Dieter Fox, Byron Boots, Nathan Ratliff

    Abstract: We develop a novel policy synthesis algorithm, RMPflow, based on geometrically consistent transformations of Riemannian Motion Policies (RMPs). RMPs are a class of reactive motion policies designed to parameterize non-Euclidean behaviors as dynamical systems in intrinsically nonlinear task spaces. Given a set of RMPs designed for individual tasks, RMPflow can consistently combine these local polic…

    Submitted 5 April, 2019; v1 submitted 16 November, 2018; originally announced November 2018.

    Comments: WAFR 2018

  14. arXiv:1710.00489

    cs.RO cs.AI cs.CV cs.NE eess.SY

    SE3-Pose-Nets: Structured Deep Dynamics Models for Visuomotor Planning and Control

    Authors: Arunkumar Byravan, Felix Leeb, Franziska Meier, Dieter Fox

    Abstract: In this work, we present an approach to deep visuomotor control using structured deep dynamics models. Our deep dynamics model, a variant of SE3-Nets, learns a low-dimensional pose embedding for visuomotor control via an encoder-decoder structure. Unlike prior work, our dynamics model is structured: given an input scene, our network explicitly learns to segment salient parts and predict their pose…

    Submitted 2 October, 2017; originally announced October 2017.

    Comments: 8 pages, Initial submission to IEEE International Conference on Robotics and Automation (ICRA) 2018

  15. arXiv:1502.02860

    stat.ML cs.LG cs.RO eess.SY

    Gaussian Processes for Data-Efficient Learning in Robotics and Control

    Authors: Marc Peter Deisenroth, Dieter Fox, Carl Edward Rasmussen

    Abstract: Autonomous learning has been a promising direction in control and robotics for more than a decade since data-driven learning allows to reduce the amount of engineering knowledge, which is otherwise required. However, autonomous reinforcement learning (RL) approaches typically require many interactions with the system to learn controllers, which is a practical limitation in real systems, such as ro…

    Submitted 10 October, 2017; v1 submitted 10 February, 2015; originally announced February 2015.

    Comments: 20 pages, 29 figures; fixed a typo in equation on page 8

    Journal ref: IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, issue no 2, pages 408-423, February 2015