Skip to main content

Showing 1–10 of 10 results for author: Byun, J

Searching in archive eess. Search in all archives.
.
  1. arXiv:2503.23108  [pdf, other

    eess.AS cs.LG cs.SD

    SupertonicTTS: Towards Highly Scalable and Efficient Text-to-Speech System

    Authors: Hyeongju Kim, Jinhyeok Yang, Yechan Yu, Seunghun Ji, Jacob Morton, Frederik Bous, Joon Byun, Juheon Lee

    Abstract: We present a novel text-to-speech (TTS) system, namely SupertonicTTS, for improved scalability and efficiency in speech synthesis. SupertonicTTS comprises three components: a speech autoencoder for continuous latent representation, a text-to-latent module leveraging flow-matching for text-to-latent mapping, and an utterance-level duration predictor. To enable a lightweight architecture, we employ… ▽ More

    Submitted 16 May, 2025; v1 submitted 29 March, 2025; originally announced March 2025.

    Comments: 21 pages, preprint

  2. arXiv:2410.22363  [pdf, other

    math.OC eess.SY

    Branch-and-bound algorithm for efficient reliability analysis of general coherent systems

    Authors: Ji-Eun Byun, Hyeuk Ryu, Daniel Straub

    Abstract: Branch and bound algorithms have been developed for reliability analysis of coherent systems. They exhibit a set of advantages; in particular, they can find a computationally efficient representation of a system failure or survival event, which can be re-used when the input probability distributions change over time or when new data is available. However, existing branch-and-bound algorithms can h… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

    Comments: Preprint for peer-reviewed article

    MSC Class: 60-08 ACM Class: G.3; I.5.2

  3. An empirical study on speech restoration guided by self supervised speech representation

    Authors: Jaeuk Byun, Youna Ji, Soo Whan Chung, Soyeon Choe, Min Seok Choi

    Abstract: Enhancing speech quality is an indispensable yet difficult task as it is often complicated by a range of degradation factors. In addition to additive noise, reverberation, clipping, and speech attenuation can all adversely affect speech quality. Speech restoration aims to recover speech components from these distortions. This paper focuses on exploring the impact of self-supervised speech represen… ▽ More

    Submitted 30 May, 2023; originally announced May 2023.

    Comments: To be presented at ICASSP 2023

  4. arXiv:2301.08078  [pdf, other

    cs.RO eess.SY

    Stable Contact Guaranteeing Motion/Force Control for an Aerial Manipulator on an Arbitrarily Tilted Surface

    Authors: Jeonghyun Byun, Byeongjun Kim, Changhyeon Kim, Donggeon David Oh, H. Jin Kim

    Abstract: This study aims to design a motion/force controller for an aerial manipulator which guarantees the tracking of time-varying motion/force trajectories as well as the stability during the transition between free and contact motions. To this end, we model the force exerted on the end-effector as the Kelvin-Voigt linear model and estimate its parameters by recursive least-squares estimator. Then, the… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: to be presented in 2023 IEEE International Conference on Robotics and Automations (ICRA), London, United Kingdom, 2023

  5. arXiv:2210.17327  [pdf, other

    eess.AS cs.LG cs.SD

    Diffusion-based Generative Speech Source Separation

    Authors: Robin Scheibler, Youna Ji, Soo-Whan Chung, Jaeuk Byun, Soyeon Choe, Min-Seok Choi

    Abstract: We propose DiffSep, a new single channel source separation method based on score-matching of a stochastic differential equation (SDE). We craft a tailored continuous time diffusion-mixing process starting from the separated sources and converging to a Gaussian distribution centered on their mixture. This formulation lets us apply the machinery of score-based generative modelling. First, we train a… ▽ More

    Submitted 2 November, 2022; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 2 tables. Submitted to ICASSP 2023

  6. Machine Learning-Based GPS Multipath Detection Method Using Dual Antennas

    Authors: Sanghyun Kim, Jungyun Byun, Kwansik Park

    Abstract: In urban areas, global navigation satellite system (GNSS) signals are often reflected or blocked by buildings, thus resulting in large positioning errors. In this study, we proposed a machine learning approach for global positioning system (GPS) multipath detection that uses dual antennas. A machine learning model that could classify GPS signal reception conditions was trained with several GPS mea… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: Submitted to ASCC 2022

  7. arXiv:2107.00353  [pdf, other

    cs.RO eess.SY

    Stability and Robustness Analysis of Plug-Pulling using an Aerial Manipulator

    Authors: Jeonghyun Byun, Dongjae Lee, Hoseong Seo, Inkyu Jang, Jeongjun Choi, H. Jin Kim

    Abstract: In this paper, an autonomous aerial manipulation task of pulling a plug out of an electric socket is conducted, where maintaining the stability and robustness is challenging due to sudden disappearance of a large interaction force. The abrupt change in the dynamical model before and after the separation of the plug can cause destabilization or mission failure. To accomplish aerial plug-pulling, we… ▽ More

    Submitted 5 July, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: to be presented in 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Prague, Czech Republic, 2021

  8. arXiv:2105.10967  [pdf, other

    eess.IV cs.CV

    FBI-Denoiser: Fast Blind Image Denoiser for Poisson-Gaussian Noise

    Authors: Jaeseok Byun, Sungmin Cha, Taesup Moon

    Abstract: We consider the challenging blind denoising problem for Poisson-Gaussian noise, in which no additional information about clean images or noise level parameters is available. Particularly, when only "single" noisy images are available for training a denoiser, the denoising performance of existing methods was not satisfactory. Recently, the blind pixelwise affine image denoiser (BP-AIDE) was propose… ▽ More

    Submitted 23 May, 2021; originally announced May 2021.

    Comments: CVPR 2021 camera ready version

  9. arXiv:1910.04397  [pdf, other

    eess.IV cs.CV

    BitNet: Learning-Based Bit-Depth Expansion

    Authors: Junyoung Byun, Kyujin Shim, Changick Kim

    Abstract: Bit-depth is the number of bits for each color channel of a pixel in an image. Although many modern displays support unprecedented higher bit-depth to show more realistic and natural colors with a high dynamic range, most media sources are still in bit-depth of 8 or lower. Since insufficient bit-depth may generate annoying false contours or lose detailed visual appearance, bit-depth expansion (BDE… ▽ More

    Submitted 10 October, 2019; originally announced October 2019.

    Comments: Accepted by ACCV 2018, Authors Byun and Shim contributed equally

  10. arXiv:1905.09396  [pdf, other

    cs.RO eess.SY

    Predictive Control for Chasing a Ground Vehicle using a UAV

    Authors: Jaeseung Byun, Karan P. Jain, Siddharth H. Nair, Haoyun Xu, Jiaming Zha

    Abstract: We propose a high-level planner for a multirotor to chase a ground vehicle, while simultaneously respecting various state and input constraints. Assuming a minimal kinematic model for the ground vehicle, we use data collected online to generate predictions for our planner within a model predictive control framework. Our solution is demonstrated, both via simulations and experiments on a stable qua… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.