Skip to main content

Showing 1–6 of 6 results for author: Makino, T

Searching in archive eess. Search in all archives.
.
  1. arXiv:2309.10330  [pdf, other

    physics.optics eess.SP

    Time Stretch with Continuous-Wave Lasers

    Authors: Tingyi Zhou, Yuta Goto, Takeshi Makino, Callen MacPhee, Yiming Zhou, Asad M. Madni, Hideaki Furukawa, Naoya Wada, Bahram Jalali

    Abstract: A single-shot measurement technique for ultrafast phenomena with high throughput enables the capture of rare events within a short time scale, facilitating the exploration of rare ultrafast processes. Photonic time stretch stands out as a highly effective method for both detecting rapid events and achieving remarkable speed in imaging and ranging applications. The current time stretch method relie… ▽ More

    Submitted 1 November, 2023; v1 submitted 19 September, 2023; originally announced September 2023.

  2. arXiv:2205.05586  [pdf, other

    eess.AS cs.CV cs.LG cs.SD

    End-to-End Multi-Person Audio/Visual Automatic Speech Recognition

    Authors: Otavio Braga, Takaki Makino, Olivier Siohan, Hank Liao

    Abstract: Traditionally, audio-visual automatic speech recognition has been studied under the assumption that the speaking face on the visual signal is the face matching the audio. However, in a more realistic setting, when multiple faces are potentially on screen one needs to decide which face to feed to the A/V ASR system. The present work takes the recent progress of A/V ASR one step further and consider… ▽ More

    Submitted 11 May, 2022; originally announced May 2022.

  3. arXiv:2011.14036  [pdf, other

    eess.IV cs.CV cs.CY cs.LG

    Differences between human and machine perception in medical diagnosis

    Authors: Taro Makino, Stanislaw Jastrzebski, Witold Oleszkiewicz, Celin Chacko, Robin Ehrenpreis, Naziya Samreen, Chloe Chhor, Eric Kim, Jiyon Lee, Kristine Pysarenko, Beatriu Reig, Hildegard Toth, Divya Awal, Linda Du, Alice Kim, James Park, Daniel K. Sodickson, Laura Heacock, Linda Moy, Kyunghyun Cho, Krzysztof J. Geras

    Abstract: Deep neural networks (DNNs) show promise in image-based medical diagnosis, but cannot be fully trusted since their performance can be severely degraded by dataset shifts to which human perception remains invariant. If we can better understand the differences between human and machine perception, we can potentially characterize and mitigate this effect. We therefore propose a framework for comparin… ▽ More

    Submitted 27 November, 2020; originally announced November 2020.

  4. arXiv:2009.09282  [pdf, other

    eess.IV cs.CV cs.LG

    Reducing false-positive biopsies with deep neural networks that utilize local and global information in screening mammograms

    Authors: Nan Wu, Zhe Huang, Yiqiu Shen, Jungkyu Park, Jason Phang, Taro Makino, S. Gene Kim, Kyunghyun Cho, Laura Heacock, Linda Moy, Krzysztof J. Geras

    Abstract: Breast cancer is the most common cancer in women, and hundreds of thousands of unnecessary biopsies are done around the world at a tremendous cost. It is crucial to reduce the rate of biopsies that turn out to be benign tissue. In this study, we build deep neural networks (DNNs) to classify biopsied lesions as being either malignant or benign, with the goal of using these networks as second reader… ▽ More

    Submitted 19 September, 2020; originally announced September 2020.

  5. arXiv:2008.01774  [pdf, other

    cs.LG cs.CV eess.IV

    An artificial intelligence system for predicting the deterioration of COVID-19 patients in the emergency department

    Authors: Farah E. Shamout, Yiqiu Shen, Nan Wu, Aakash Kaku, Jungkyu Park, Taro Makino, Stanisław Jastrzębski, Jan Witowski, Duo Wang, Ben Zhang, Siddhant Dogra, Meng Cao, Narges Razavian, David Kudlowitz, Lea Azour, William Moore, Yvonne W. Lui, Yindalon Aphinyanaphongs, Carlos Fernandez-Granda, Krzysztof J. Geras

    Abstract: During the coronavirus disease 2019 (COVID-19) pandemic, rapid and accurate triage of patients at the emergency department is critical to inform decision-making. We propose a data-driven approach for automatic prediction of deterioration risk using a deep neural network that learns from chest X-ray images and a gradient boosting model that learns from routine clinical variables. Our AI prognosis s… ▽ More

    Submitted 3 November, 2020; v1 submitted 4 August, 2020; originally announced August 2020.

  6. arXiv:1911.04890  [pdf, other

    eess.AS cs.CL cs.CV cs.LG cs.SD

    Recurrent Neural Network Transducer for Audio-Visual Speech Recognition

    Authors: Takaki Makino, Hank Liao, Yannis Assael, Brendan Shillingford, Basilio Garcia, Otavio Braga, Olivier Siohan

    Abstract: This work presents a large-scale audio-visual speech recognition system based on a recurrent neural network transducer (RNN-T) architecture. To support the development of such a system, we built a large audio-visual (A/V) dataset of segmented utterances extracted from YouTube public videos, leading to 31k hours of audio-visual training content. The performance of an audio-only, visual-only, and au… ▽ More

    Submitted 8 November, 2019; originally announced November 2019.

    Comments: Will be presented in 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2019)