Skip to main content

Showing 1–9 of 9 results for author: Ueno, N

Searching in archive eess. Search in all archives.
.
  1. Sound Field Estimation: Theories and Applications

    Authors: Natsuki Ueno, Shoichi Koyama

    Abstract: The spatial information of sound plays a crucial role in various situations, ranging from daily activities to advanced engineering technologies. To fully utilize its potential, numerous research studies on spatial audio signal processing have been carried out in the literature. Sound field estimation is one of the key foundational technologies that can be applied to a wide range of acoustic signal… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: published in Foundations and Trends in Signal Processing, vol. 19, no. 1; see https://www.nowpublishers.com/article/Details/SIG-121

    Journal ref: Foundations and Trends in Signal Processing, vol. 19, no. 1, pp. 1-98, 2025

  2. arXiv:2501.05557  [pdf, other

    eess.AS cs.SD

    Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers

    Authors: Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono

    Abstract: Signal reconstruction from its mel-spectrogram is known as mel-spectrogram inversion and has many applications, including speech and foley sound synthesis. In this paper, we propose a mel-spectrogram inversion method based on a rigorous optimization algorithm. To reconstruct a time-domain signal with inverse short-time Fourier transform (STFT), both full-band STFT magnitude and phase should be pre… ▽ More

    Submitted 13 January, 2025; v1 submitted 9 January, 2025; originally announced January 2025.

    Comments: Accepted to ICASSP 2025

  3. arXiv:2408.14731  [pdf, other

    cs.SD eess.AS

    Physics-Informed Machine Learning For Sound Field Estimation

    Authors: Shoichi Koyama, Juliano G. C. Ribeiro, Tomohiko Nakamura, Natsuki Ueno, Mirco Pezzoli

    Abstract: The area of study concerning the estimation of spatial sound, i.e., the distribution of a physical quantity of sound such as acoustic pressure, is called sound field estimation, which is the basis for various applied technologies related to spatial audio processing. The sound field estimation problem is formulated as a function interpolation problem in machine learning in a simplified scenario. Ho… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Accepted to IEEE Signal Processing Magazine, Special Issue on Model-based and Data-Driven Audio Signal Processing

  4. arXiv:2307.12232  [pdf, other

    cs.SD eess.AS eess.SP

    Signal Reconstruction from Mel-spectrogram Based on Bi-level Consistency of Full-band Magnitude and Phase

    Authors: Yoshiki Masuyama, Natsuki Ueno, Nobutaka Ono

    Abstract: We propose an optimization-based method for reconstructing a time-domain signal from a low-dimensional spectral representation such as a mel-spectrogram. Phase reconstruction has been studied to reconstruct a time-domain signal from the full-band short-time Fourier transform (STFT) magnitude. The Griffin-Lim algorithm (GLA) has been widely used because it relies only on the redundancy of STFT and… ▽ More

    Submitted 23 July, 2023; originally announced July 2023.

    Comments: Accepted to IEEE WASPAA 2023

  5. arXiv:2303.13027  [pdf, other

    eess.AS cs.SD

    Weighted Pressure and Mode Matching for Sound Field Reproduction: Theoretical and Experimental Comparisons

    Authors: Shoichi Koyama, Keisuke Kimura, Natsuki Ueno

    Abstract: Two sound field reproduction methods, weighted pressure matching and weighted mode matching, are theoretically and experimentally compared. The weighted pressure and mode matching are a generalization of conventional pressure and mode matching, respectively. Both methods are derived by introducing a weighting matrix in the pressure and mode matching. The weighting matrix in the weighted pressure m… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted to Journal of Audio Engineering Society, Special Issue on Spatial Audio

  6. arXiv:2112.06774  [pdf, ps, other

    cs.SD eess.AS

    Mean-square-error-based secondary source placement in sound field synthesis with prior information on desired field

    Authors: Keisuke Kimura, Shoichi Koyama, Natsuki Ueno, Hiroshi Saruwatari

    Abstract: A method of optimizing secondary source placement in sound field synthesis is proposed. Such an optimization method will be useful when the allowable placement region and available number of loudspeakers are limited. We formulate a mean-square-error-based cost function, incorporating the statistical properties of possible desired sound fields, for general linear-least-squares-based sound field syn… ▽ More

    Submitted 9 December, 2021; originally announced December 2021.

    Comments: Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

  7. arXiv:2111.11045  [pdf, other

    eess.AS cs.SD

    Sound Field Reproduction With Weighted Mode Matching and Infinite-Dimensional Harmonic Analysis: An Experimental Evaluation

    Authors: Shoichi Koyama, Keisuke Kimura, Natsuki Ueno

    Abstract: Sound field reproduction methods based on numerical optimization, which aim to minimize the error between synthesized and desired sound fields, are useful in many practical scenarios because of their flexibility in the array geometry of loudspeakers. However, the reproduction performance of these methods in a practical environment has not been sufficiently investigated. We evaluate weighted mode m… ▽ More

    Submitted 22 November, 2021; originally announced November 2021.

    Comments: Accepted to International Conference on Immersive and 3D Audio (I3DA) 2021

  8. arXiv:2110.04972  [pdf, ps, other

    cs.SD eess.AS

    Kernel Learning For Sound Field Estimation With L1 and L2 Regularizations

    Authors: Ryosuke Horiuchi, Shoichi Koyama, Juliano G. C. Ribeiro, Natsuki Ueno, Hiroshi Saruwatari

    Abstract: A method to estimate an acoustic field from discrete microphone measurements is proposed. A kernel-interpolation-based method using the kernel function formulated for sound field interpolation has been used in various applications. The kernel function with directional weighting makes it possible to incorporate prior information on source directions to improve estimation accuracy. However, in prior… ▽ More

    Submitted 12 October, 2021; v1 submitted 10 October, 2021; originally announced October 2021.

    Comments: Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021

  9. arXiv:2106.10801  [pdf, other

    eess.AS cs.SD

    MeshRIR: A Dataset of Room Impulse Responses on Meshed Grid Points For Evaluating Sound Field Analysis and Synthesis Methods

    Authors: Shoichi Koyama, Tomoya Nishida, Keisuke Kimura, Takumi Abe, Natsuki Ueno, Jesper Brunnström

    Abstract: A new impulse response (IR) dataset called "MeshRIR" is introduced. Currently available datasets usually include IRs at an array of microphones from several source positions under various room conditions, which are basically designed for evaluating speech enhancement and distant speech recognition methods. On the other hand, methods of estimating or controlling spatial sound fields have been exten… ▽ More

    Submitted 23 July, 2021; v1 submitted 20 June, 2021; originally announced June 2021.

    Comments: Accepted to IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2021