Skip to main content

Showing 1–18 of 18 results for author: Kawahara, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2501.17173  [pdf, other

    astro-ph.IM cs.RO

    Model Evaluation of a Transformable CubeSat for Nonholonomic Attitude Reorientation Using a Drop Tower

    Authors: Yuki Kubo, Tsubasa Ando, Hirona Kawahara, Shu Miyata, Naoya Uchiyama, Kazutoshi Ito, Yoshiki Sugawara

    Abstract: This paper presents a design for a drop tower test to evaluate a numerical model for a structurally reconfigurable spacecraft with actuatable joints, referred to as a transformable spacecraft. A mock-up robot for a 3U-sized transformable spacecraft is designed to fit in a limited time and space of the microgravity environment available in the drop tower. The robot performs agile reorientation, ref… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

    Comments: 22 pages, 20 figures

  2. arXiv:2409.20516  [pdf, other

    eess.AS cs.SD eess.SP

    Proposal of protocols for speech materials acquisition and presentation assisted by tools based on structured test signals

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Kohei Yatabe

    Abstract: We propose protocols for acquiring speech materials, making them reusable for future investigations, and presenting them for subjective experiments. We also provide means to evaluate existing speech materials' compatibility with target applications. We built these protocols and tools based on structured test signals and analysis methods, including a new family of the Time-Stretched Pulse (TSP). Ov… ▽ More

    Submitted 30 September, 2024; originally announced September 2024.

    Comments: 6 pages 6 figures, accepted ORIENTAL COCOSDA 2024

    MSC Class: 68-04 ACM Class: J.2

  3. arXiv:2404.13418  [pdf, ps, other

    cs.HC eess.AS

    Interactive tools for making temporally variable, multiple-attributes, and multiple-instances morphing accessible: Flexible manipulation of divergent speech instances for explorational research and education

    Authors: Hideki Kawahara, Masanori Morise

    Abstract: We generalized a voice morphing algorithm capable of handling temporally variable, multiple-attributes, and multiple instances. The generalized morphing provides a new strategy for investigating speech diversity. However, excessive complexity and the difficulty of preparation have prevented researchers and students from enjoying its benefits. To address this issue, we introduced a set of interacti… ▽ More

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 5 pages, 7 figures, submitted to Acoustical Science and Technology of Acoustical Society of Japan

    MSC Class: 68-04 ACM Class: K.3.1

  4. arXiv:2309.02767  [pdf, ps, other

    cs.SD eess.AS

    Simultaneous Measurement of Multiple Acoustic Attributes Using Structured Periodic Test Signals Including Music and Other Sound Materials

    Authors: Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Tatsuya Kitamura

    Abstract: We introduce a general framework for measuring acoustic properties such as liner time-invariant (LTI) response, signal-dependent time-invariant (SDTI) component, and random and time-varying (RTV) component simultaneously using structured periodic test signals. The framework also enables music pieces and other sound materials as test signals by "safeguarding" them by adding slight deterministic "no… ▽ More

    Submitted 6 September, 2023; originally announced September 2023.

    Comments: 8 pages, 17 figures, accepted for APSIPA ASC 2023

    MSC Class: 68-04 ACM Class: J.2

  5. arXiv:2204.00911  [pdf, ps, other

    cs.SD eess.AS

    Measuring pitch extractors' response to frequency-modulated multi-component signals

    Authors: Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise

    Abstract: This article focuses on the research tool for investigating the fundamental frequencies of voiced sounds. We introduce an objective and informative measurement method of pitch extractors' response to frequency-modulated tones. The method uses a new test signal for acoustic system analysis. The test signal enables simultaneous measurement of the extractors' responses. They are the modulation freque… ▽ More

    Submitted 2 April, 2022; originally announced April 2022.

    Comments: 11 pages, 9 figures, The following article has been submitted to/accepted by The Acoustical Society of America. After it is published, it will be found at http://asa.scitation.org/journal/jas

    MSC Class: 94A12; 93C80; 42-08

  6. arXiv:2204.00902  [pdf, ps, other

    cs.SD eess.AS eess.SP

    An objective test tool for pitch extractors' response attributes

    Authors: Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise

    Abstract: We propose an objective measurement method for pitch extractors' responses to frequency-modulated signals. It enables us to evaluate different pitch extractors with unified criteria. The method uses extended time-stretched pulses combined by binary orthogonal sequences. It provides simultaneous measurement results consisting of the linear and the non-linear time-invariant responses and random and… ▽ More

    Submitted 24 June, 2022; v1 submitted 2 April, 2022; originally announced April 2022.

    Comments: 5 pages, 9 figures, submitted to Interspeech2022. arXiv admin note: text overlap with arXiv:2111.03629

    MSC Class: 94A12; 93C80; 42-08

  7. arXiv:2112.11373  [pdf, ps, other

    cs.SD eess.AS

    Safeguarding test signals for acoustic measurement using arbitrary sounds

    Authors: Hideki Kawahara, Kohei Yatabe

    Abstract: We propose a simple method to measure acoustic responses using any sounds by converting them suitable for measurement. This method enables us to use music pieces for measuring acoustic conditions. It is advantageous to measure such conditions without annoying test sounds to listeners. In addition, applying the underlying idea of simultaneous measurement of multiple paths provides practically valua… ▽ More

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: 4 pages, 10 figures, submitted to Acoustical Science and Technology

    MSC Class: 42-04; 42-08; 68-04

  8. arXiv:2111.03629   

    cs.SD cs.HC eess.AS eess.SP

    Objective measurement of pitch extractors' responses to frequency modulated sounds and two reference pitch extraction methods for analyzing voice pitch responses to auditory stimulation

    Authors: Hideki Kawahara, Kohei Yatabe, Ken-Ichi Sakakibara, Tatsuya Kitamura, Hideki Banno, Masanori Morise

    Abstract: We propose an objective measurement method for pitch extractors' responses to frequency-modulated signals. The method simultaneously measures the linear and the non-linear time-invariant responses and random and time-varying responses. It uses extended time-stretched pulses combined by binary orthogonal sequences. Our recent finding of involuntary voice pitch response to auditory stimulation while… ▽ More

    Submitted 27 June, 2022; v1 submitted 5 November, 2021; originally announced November 2021.

    Comments: ICASSP2022 rejected this. The substantially revised version was submitted to Interspeech2022 and accepted. It is arXiv:2204.00911

    MSC Class: 94A12; 93C80; 42-08

  9. arXiv:2109.11594  [pdf, ps, other

    cs.SD cs.HC eess.AS eess.SP

    Implementation of interactive tools for investigating fundamental frequency response of voiced sounds to auditory stimulation

    Authors: Hideki Kawahara, Toshie Matsui Kohei, Yatabe Ken-Ichi Sakakibara Minoru Tsuzaki Masanori Morise Toshio Irino

    Abstract: We introduced a measurement procedure for the involuntary response of voice fundamental-frequency to frequency modulated auditory stimulation. This involuntary response plays an essential role in voice fundamental frequency control while less investigated due to technical difficulties. This article introduces an interactive and real-time tool for investigating this response and supporting tools ad… ▽ More

    Submitted 23 September, 2021; originally announced September 2021.

    Comments: Accepted for APSIPA ASC 2021

    MSC Class: 91E45; 92-04; 94A11

  10. arXiv:2104.01444  [pdf, ps, other

    cs.SD eess.AS eess.SP

    Mixture of orthogonal sequences made from extended time-stretched pulses enables measurement of involuntary voice fundamental frequency response to pitch perturbation

    Authors: Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino

    Abstract: Auditory feedback plays an essential role in the regulation of the fundamental frequency of voiced sounds. The fundamental frequency also responds to auditory stimulation other than the speaker's voice. We propose to use this response of the fundamental frequency of sustained vowels to frequency-modulated test signals for investigating involuntary control of voice pitch. This involuntary response… ▽ More

    Submitted 3 April, 2021; originally announced April 2021.

    Comments: 5 pages, 9 figures, submitted to Interspeech2021

    MSC Class: 92C55

  11. arXiv:2010.13185  [pdf, ps, other

    cs.SD eess.AS

    Cascaded all-pass filters with randomized center frequencies and phase polarity for acoustic and speech measurement and data augmentation

    Authors: Hideki Kawahara, Kohei Yatabe

    Abstract: We introduce a new member of TSP (Time Stretched Pulse) for acoustic and speech measurement infrastructure, based on a simple all-pass filter and systematic randomization. This new infrastructure fundamentally upgrades our previous measurement procedure, which enables simultaneous measurement of multiple attributes, including non-linear ones without requiring extra filtering nor post-processing. O… ▽ More

    Submitted 12 February, 2021; v1 submitted 25 October, 2020; originally announced October 2020.

    Comments: 5 pages, 5 figures, Accepted ICASSP2021(Review comment by all reviewers: Very original)

    MSC Class: 68U06(Primary); 68T06; 68W06(Secondary)

  12. arXiv:2008.02439  [pdf, ps, other

    eess.AS cs.SD

    Simultaneous measurement of time-invariant linear and nonlinear, and random and extra responses using frequency domain variant of velvet noise

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Masanori Morise, Hideki Banno

    Abstract: We introduce a new acoustic measurement method that can measure the linear time-invariant response, the nonlinear time-invariant response, and random and time-varying responses simultaneously. The method uses a set of orthogonal sequences made from a set of unit FVNs (Frequency domain variant of Velvet Noise), a new member of the TSP (Time Stretched Pulse). FVN has a unique feature that other TSP… ▽ More

    Submitted 9 August, 2020; v1 submitted 5 August, 2020; originally announced August 2020.

    Comments: 10 pages, 15 figures, APSIPA ASC 2020

    MSC Class: 94A12 ACM Class: J.m

    Journal ref: 2020 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Auckland, New Zealand, 2020, pp. 174-183

  13. Frequency domain variant of Velvet noise and its application to acoustic measurements

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Mitsunori Mizumachi, Hideki Banno, Masanori Morise, Toshio Irino

    Abstract: We propose a new family of test signals for acoustic measurements such as impulse response, nonlinearity, and the effects of background noise. The proposed family complements difficulties in existing families, the Swept-Sine (SS), pseudo-random noise such as the maximum length sequence (MLS). The proposed family uses the frequency domain variant of the Velvet noise (FVN) as its building block. An… ▽ More

    Submitted 10 September, 2019; originally announced September 2019.

    Comments: 10 pages, 14 figures, APSIPA ASC 2019. arXiv admin note: text overlap with arXiv:1806.06812

    Journal ref: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Lanzhou, China, 2019, pp. 1523-1532

  14. arXiv:1909.03650  [pdf, ps, other

    cs.SD cs.HC eess.AS eess.SP

    Real-time and interactive tools for vocal training based on an analytic signal with a cosine series envelope

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Eri Haneishi, Kaori Hagiwara

    Abstract: We introduce real-time and interactive tools for assisting vocal training. In this presentation, we demonstrate mainly a tool based on real-time visualizer of fundamental frequency candidates to provide information-rich feedback to learners. The visualizer uses an efficient algorithm using analytic signals for deriving phase-based attributes. We start using these tools in vocal training for assist… ▽ More

    Submitted 9 September, 2019; originally announced September 2019.

    Comments: 4 pages, 6 figures, APSIPA ASC 2019

    Journal ref: 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), Lanzhou, China, 2019, pp. 907-910

  15. arXiv:1806.06812  [pdf, ps, other

    cs.SD eess.AS eess.SP

    Frequency domain variants of velvet noise and their application to speech processing and synthesis: with appendices

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda, Toshio Irino

    Abstract: We propose a new excitation source signal for VOCODERs and an all-pass impulse response for post-processing of synthetic sounds and pre-processing of natural sounds for data-augmentation. The proposed signals are variants of velvet noise, which is a sparse discrete signal consisting of a few non-zero (1 or -1) elements and sounds smoother than Gaussian white noise. One of the proposed variants, FV… ▽ More

    Submitted 18 June, 2018; originally announced June 2018.

    Comments: 11 pages, 20 figures, and 1 table, Interspeech 2018

  16. arXiv:1706.02964  [pdf, ps, other

    eess.AS cs.SD eess.SP

    A modulation property of time-frequency derivatives of filtered phase and its application to aperiodicity and fo estimation

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Masanori Morise, Hideki Banno, Tomoki Toda

    Abstract: We introduce a simple and linear SNR (strictly speaking, periodic to random power ratio) estimator (0dB to 80dB without additional calibration/linearization) for providing reliable descriptions of aperiodicity in speech corpus. The main idea of this method is to estimate the background random noise level without directly extracting the background noise. The proposed method is applicable to a wide… ▽ More

    Submitted 9 June, 2017; originally announced June 2017.

    Comments: 8 pages 9 figures, Submitted and accepted in Interspeech2017

    Journal ref: Proc. Interspeech 2017, pp.424-428

  17. arXiv:1702.06724  [pdf, ps, other

    eess.AS cs.SD eess.SP

    A new cosine series antialiasing function and its application to aliasing-free glottal source models for speech and singing synthesis

    Authors: Hideki Kawahara, Ken-Ichi Sakakibara, Hideki Banno, Masanori Morise, Tomoki Toda, Toshio Irino

    Abstract: We formulated and implemented a procedure to generate aliasing-free excitation source signals. It uses a new antialiasing filter in the continuous time domain followed by an IIR digital filter for response equalization. We introduced a cosine-series-based general design procedure for the new antialiasing function. We applied this new procedure to implement the antialiased Fujisaki-Ljungqvist model… ▽ More

    Submitted 8 June, 2017; v1 submitted 22 February, 2017; originally announced February 2017.

    Comments: Submitted to Interspeech 2017

    Journal ref: Proc. Interspeech 2017, pp.1358-1362

  18. arXiv:1605.07809  [pdf, ps, other

    cs.SD eess.AS eess.SP

    Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis

    Authors: Hideki Kawahara, Yannis Agiomyrgiannakis, Heiga Zen

    Abstract: This paper introduces a general and flexible framework for F0 and aperiodicity (additive non periodic component) analysis, specifically intended for high-quality speech synthesis and modification applications. The proposed framework consists of three subsystems: instantaneous frequency estimator and initial aperiodicity detector, F0 trajectory tracker, and F0 refinement and aperiodicity extractor.… ▽ More

    Submitted 22 July, 2016; v1 submitted 25 May, 2016; originally announced May 2016.

    Comments: Accepted for presentation in ISCA workshop SSW9

    Journal ref: 9th ISCA Speech Synthesis Workshop, 2016, pp.221-228