Skip to main content

Showing 1–13 of 13 results for author: Eom, S

.
  1. arXiv:2501.10044  [pdf

    physics.optics physics.ins-det

    Approaching the quantum-limited precision in frequency-comb-based spectral interferometry for length measurements

    Authors: Yoon-Soo Jang, Heulbi Ahn, Sunghoon Eom, Jungjae Park, Jonghan Jin

    Abstract: Over the last two decades, frequency combs have brought breakthroughs in length metrology with traceability to length standards. In particular, frequency-comb-based spectral interferometry is regarded as a promising technology for next-generation length standards. However, to achieve this, the nanometer-level precision inherent in laser interferometer is required. Here, we report distance measurem… ▽ More

    Submitted 17 January, 2025; originally announced January 2025.

  2. arXiv:2408.08591  [pdf, other

    cs.CV

    Zero-Shot Dual-Path Integration Framework for Open-Vocabulary 3D Instance Segmentation

    Authors: Tri Ton, Ji Woo Hong, SooHwan Eom, Jun Yeop Shim, Junyeong Kim, Chang D. Yoo

    Abstract: Open-vocabulary 3D instance segmentation transcends traditional closed-vocabulary methods by enabling the identification of both previously seen and unseen objects in real-world scenarios. It leverages a dual-modality approach, utilizing both 3D point clouds and 2D multi-view images to generate class-agnostic object mask proposals. Previous efforts predominantly focused on enhancing 3D mask propos… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

    Comments: OpenSUN 3D: 2nd Workshop on Open-Vocabulary 3D Scene Understanding (CVPR 2024)

  3. arXiv:2407.16574  [pdf, other

    cs.CL

    TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback

    Authors: Eunseop Yoon, Hee Suk Yoon, SooHwan Eom, Gunsoo Han, Daniel Wontae Nam, Daejin Jo, Kyoung-Woon On, Mark A. Hasegawa-Johnson, Sungwoong Kim, Chang D. Yoo

    Abstract: Reinforcement Learning from Human Feedback (RLHF) leverages human preference data to train language models to align more closely with human essence. These human preference data, however, are labeled at the sequence level, creating a mismatch between sequence-level preference labels and tokens, which are autoregressively generated from the language model. Although several recent approaches have tri… ▽ More

    Submitted 8 December, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

    Comments: ACL2024 Findings

  4. arXiv:2403.11578  [pdf, other

    eess.AS

    AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition

    Authors: SooHwan Eom, Eunseop Yoon, Hee Suk Yoon, Chanwoo Kim, Mark Hasegawa-Johnson, Chang D. Yoo

    Abstract: In Automatic Speech Recognition (ASR) systems, a recurring obstacle is the generation of narrowly focused output distributions. This phenomenon emerges as a side effect of Connectionist Temporal Classification (CTC), a robust sequence learning tool that utilizes dynamic programming for sequence mapping. While earlier efforts have tried to combine the CTC loss with an entropy maximization regulariz… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  5. arXiv:2308.15273  [pdf, other

    cs.CV

    Cross-Modal Retrieval Meets Inference:Improving Zero-Shot Classification with Cross-Modal Retrieval

    Authors: Seongha Eom, Namgyu Ho, Jaehoon Oh, Se-Young Yun

    Abstract: Contrastive language-image pre-training (CLIP) has demonstrated remarkable zero-shot classification ability, namely image classification using novel text labels. Existing works have attempted to enhance CLIP by fine-tuning on downstream tasks, but these have inadvertently led to performance degradation on unseen classes, thus harming zero-shot generalization. This paper aims to address this challe… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

  6. arXiv:2308.08442  [pdf, other

    cs.CL cs.SD eess.AS

    Mitigating the Exposure Bias in Sentence-Level Grapheme-to-Phoneme (G2P) Transduction

    Authors: Eunseop Yoon, Hee Suk Yoon, Dhananjaya Gowda, SooHwan Eom, Daehyeok Kim, John Harvill, Heting Gao, Mark Hasegawa-Johnson, Chanwoo Kim, Chang D. Yoo

    Abstract: Text-to-Text Transfer Transformer (T5) has recently been considered for the Grapheme-to-Phoneme (G2P) transduction. As a follow-up, a tokenizer-free byte-level model based on T5 referred to as ByT5, recently gave promising results on word-level G2P conversion by representing each input character with its corresponding UTF-8 encoding. Although it is generally understood that sentence-level or parag… ▽ More

    Submitted 16 August, 2023; originally announced August 2023.

    Comments: INTERSPEECH 2023

  7. Programmable spectral shaping to improve the measurement precision of frequency comb mode-resolved spectral interferometric ranging

    Authors: Yoon-Soo Jang, Sunghoon Eom, Jungjae Park, Jonghan Jin

    Abstract: Comb-mode resolved spectral domain interferometry (CORE-SDI), which is capable of measuring length of kilometers or more with precision on the order of nanometers, is considered to be a promising technology for next-generation length standards, replacing laser displacement interferometers. In this study, we aim to improve the measurement precision of CORE-SDI using programmable spectral shaping. W… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

    Comments: 22 pages, 10 figures

    Journal ref: Optics & Laser Technology 170, 110324, 2024

  8. arXiv:2305.03864  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Revisiting contrast mechanism of lateral piezoresponse force microscopy

    Authors: Jaegyu Kim, Seongwoo Cho, Jiwon Yeom, Seongmun Eom, Seungbum Hong

    Abstract: Piezoresponse force microscopy (PFM) has been widely used for nanoscale analysis of piezoelectric properties and ferroelectric domains. Although PFM is useful because of its simple and nondestructive features, PFM measurements can be obscured by non-piezoelectric effects that could affect the PFM signals or lead to ferroelectric-like behaviors in non-ferroelectric materials. Many researches have a… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

    Comments: 38 pages, 18 figures

  9. arXiv:2303.12968  [pdf, other

    cs.HC

    Ambient Intelligence for Next-Generation AR

    Authors: Tim Scargill, Sangjun Eom, Ying Chen, Maria Gorlatova

    Abstract: Next-generation augmented reality (AR) promises a high degree of context-awareness - a detailed knowledge of the environmental, user, social and system conditions in which an AR experience takes place. This will facilitate both the closer integration of the real and virtual worlds, and the provision of context-specific content or adaptations. However, environmental awareness in particular is chall… ▽ More

    Submitted 24 March, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

    Comments: This is a preprint of a book chapter which will appear in the Springer Handbook of the Metaverse

  10. arXiv:2212.02059  [pdf, other

    cs.CV

    Region-Conditioned Orthogonal 3D U-Net for Weather4Cast Competition

    Authors: Taehyeon Kim, Shinhwan Kang, Hyeonjeong Shin, Deukryeol Yoon, Seongha Eom, Kijung Shin, Se-Young Yun

    Abstract: The Weather4Cast competition (hosted by NeurIPS 2022) required competitors to predict super-resolution rain movies in various regions of Europe when low-resolution satellite contexts covering wider regions are given. In this paper, we show that a general baseline 3D U-Net can be significantly improved with region-conditioned layers as well as orthogonality regularizations on 1x1x1 convolutional la… ▽ More

    Submitted 5 December, 2022; originally announced December 2022.

    Comments: workshop at NeurIPS 2022 Competition Track on Weather4Cast

  11. arXiv:2211.01692  [pdf, other

    cs.CL

    Data-efficient End-to-end Information Extraction for Statistical Legal Analysis

    Authors: Wonseok Hwang, Saehee Eom, Hanuhl Lee, Hai Jin Park, Minjoon Seo

    Abstract: Legal practitioners often face a vast amount of documents. Lawyers, for instance, search for appropriate precedents favorable to their clients, while the number of legal precedents is ever-growing. Although legal search engines can assist finding individual target documents and narrowing down the number of candidates, retrieved information is often presented as unstructured text and users have to… ▽ More

    Submitted 3 November, 2022; originally announced November 2022.

    Comments: NLLP workshop @ EMNLP 2022

  12. arXiv:1801.02782  [pdf, ps, other

    cs.IT

    UAV-Aided Wireless Communication Designs With Propulsion Energy Limitations

    Authors: Subin Eom, Hoon Lee, Junhee Park, Inkyu Lee

    Abstract: This paper studies unmanned aerial vehicle (UAV) aided wireless communication systems where a UAV supports uplink communications of multiple ground nodes (GNs) while flying over the area of the interest. In this system, the propulsion energy consumption at the UAV is taken into account so that the UAV's velocity and acceleration should not exceed a certain threshold. We formulate the minimum avera… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

    Comments: 24 pages, 7 figures

  13. arXiv:1801.02781  [pdf, ps, other

    cs.IT

    Minimum Throughput Maximization in UAV-Aided Wireless Powered Communication Networks

    Authors: Junhee Park, Hoon Lee, Subin Eom, Inkyu Lee

    Abstract: This paper investigates unmanned aerial vehicle (UAV)-aided wireless powered communication network (WPCN) systems where a mobile access point (AP) at the UAV serves multiple energy-constrained ground terminals (GTs). Specifically, the UAVs first charge the GTs by transmitting the wireless energy transfer (WET) signals in the downlink. Then, by utilizing the harvested wireless energy from the UAVs,… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

    Comments: 22 pages, 7 figures