Showing 1–3 of 3 results for author: Shulman, D

Searching in archive cs.
  1. arXiv:2411.10064

    stat.ME cs.LG

    Adaptive Physics-Guided Neural Network

    Authors: David Shulman, Itai Dattner

    Abstract: This paper introduces an adaptive physics-guided neural network (APGNN) framework for predicting quality attributes from image data by integrating physical laws into deep learning models. The APGNN adaptively balances data-driven and physics-informed predictions, enhancing model accuracy and robustness across different environments. Our approach is evaluated on both synthetic and real-world datase…

    Submitted 15 November, 2024; originally announced November 2024.

  2. arXiv:2302.09566   

    cs.LG math.OC

    Optimization Methods in Deep Learning: A Comprehensive Overview

    Authors: David Shulman

    Abstract: In recent years, deep learning has achieved remarkable success in various fields such as image recognition, natural language processing, and speech recognition. The effectiveness of deep learning largely depends on the optimization methods used to train deep neural networks. In this paper, we provide an overview of first-order optimization methods such as Stochastic Gradient Descent, Adagrad, Adad…

    Submitted 24 April, 2023; v1 submitted 19 February, 2023; originally announced February 2023.

    Comments: We have decided to withdraw our manuscript 'Optimization Methods in Deep Learning: A Comprehensive Overview' from arXiv as we are now pursuing different research directions. We apologize for any inconvenience and appreciate your understanding

  3. arXiv:2104.02014

    cs.CL eess.AS

    SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition

    Authors: Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko

    Abstract: In the English speech-to-text (STT) machine learning task, acoustic models are conventionally trained on uncased Latin characters, and any necessary orthography (such as capitalization, punctuation, and denormalization of non-standard words) is imputed by separate post-processing models. This adds complexity and limits performance, as many formatting tasks benefit from semantic information present…

    Submitted 6 April, 2021; v1 submitted 5 April, 2021; originally announced April 2021.

    Comments: 5 pages, 1 figure. Submitted to INTERSPEECH 2021