Skip to main content

Showing 1–50 of 203 results for author: Choe, D

.
  1. arXiv:2507.06802  [pdf, ps, other

    cs.LG

    Speech Tokenizer is Key to Consistent Representation

    Authors: Wonjin Jung, Sungil Kang, Dong-Yeon Cho

    Abstract: Speech tokenization is crucial in digital speech processing, converting continuous speech signals into discrete units for various computational tasks. This paper introduces a novel speech tokenizer with broad applicability across downstream tasks. While recent advances in residual vector quantization (RVQ) have incorporated semantic elements, they often neglect critical acoustic features. We propo… ▽ More

    Submitted 9 July, 2025; originally announced July 2025.

  2. arXiv:2507.06261  [pdf, ps, other

    cs.CL cs.AI

    Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities

    Authors: Gheorghe Comanici, Eric Bieber, Mike Schaekermann, Ice Pasupat, Noveen Sachdeva, Inderjit Dhillon, Marcel Blistein, Ori Ram, Dan Zhang, Evan Rosen, Luke Marris, Sam Petulla, Colin Gaffney, Asaf Aharoni, Nathan Lintz, Tiago Cardal Pais, Henrik Jacobsson, Idan Szpektor, Nan-Jiang Jiang, Krishna Haridasan, Ahmed Omran, Nikunj Saunshi, Dara Bahri, Gaurav Mishra, Eric Chu , et al. (3278 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 2.X model family: Gemini 2.5 Pro and Gemini 2.5 Flash, as well as our earlier Gemini 2.0 Flash and Flash-Lite models. Gemini 2.5 Pro is our most capable model yet, achieving SoTA performance on frontier coding and reasoning benchmarks. In addition to its incredible coding and reasoning skills, Gemini 2.5 Pro is a thinking model that excels at multimodal unde… ▽ More

    Submitted 7 July, 2025; originally announced July 2025.

    Comments: 72 pages, 17 figures

  3. arXiv:2506.23384  [pdf, ps, other

    cs.FL

    Programmable Co-Transcriptional Splicing: Realizing Regular Languages via Hairpin Deletion

    Authors: Da-Jung Cho, Szilárd Zsolt Fazekas, Shinnosuke Seki, Max Wiedenhöft

    Abstract: RNA co-transcriptionality, where RNA is spliced or folded during transcription from DNA templates, offers promising potential for molecular programming. It enables programmable folding of nano-scale RNA structures and has recently been shown to be Turing universal. While post-transcriptional splicing is well studied, co-transcriptional splicing is gaining attention for its efficiency, though its u… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: 28 pages, 8 Figures, Accepted at the 31st International Conference on DNA Computing and Molecular Programming (2025)

    MSC Class: 92-10 ACM Class: F.4.3; J.3; F.1.3

  4. arXiv:2506.19183  [pdf

    cond-mat.mtrl-sci

    A Novel Analysis Framework for Microstructural Characterization of Ferroelectric Hafnia: Experimental Validation and Application

    Authors: Yoonsang Park, Jaeduck Jang, Hyangsook Lee, Kihong Kim, Kyooho Jung, Yunseong Lee, Jaewoo Lee, Eunji Yang, Sanghyun Jo, Sijung Yoo, Hyun Jae Lee, Donghoon Kim, Duk-Hyun Choe, Seunggeol Nam

    Abstract: Herein, we present a novel analysis framework for grain size profile of ferroelectric hafnia to tackle critical shortcomings inherent in the current microstructural analysis. We vastly enhanced visibility of grains with ion beam treatment and performed accurate grain segmentation using deep neural network (DNN). By leveraging our new method, we discovered unexpected discrepancies that contradict p… ▽ More

    Submitted 23 June, 2025; originally announced June 2025.

    Comments: 4 pages (2 pages are text rest are filled with figures)

  5. arXiv:2506.08240  [pdf, ps, other

    cs.LG

    Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations

    Authors: Dongkyu Cho, Rumi Chunara

    Abstract: Data augmentation is a promising tool for enhancing out-of-distribution generalization, where the key is to produce diverse, challenging variations of the source domain via costly targeted augmentations that maximize its generalization effect. Conversely, random augmentation is inexpensive but is deemed suboptimal due to its limited effect. In this paper, we revisit random augmentation and explore… ▽ More

    Submitted 27 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: 12 pages, 6 figures

  6. arXiv:2506.08228  [pdf, ps, other

    cs.LG cs.AI cs.RO

    Scaling Laws of Motion Forecasting and Planning -- A Technical Report

    Authors: Mustafa Baniodeh, Kratarth Goel, Scott Ettinger, Carlos Fuertes, Ari Seff, Tim Shen, Cole Gulino, Chenjie Yang, Ghassen Jerfel, Dokook Choe, Rui Wang, Vinutha Kallem, Sergio Casas, Rami Al-Rfou, Benjamin Sapp, Dragomir Anguelov

    Abstract: We study the empirical scaling laws of a family of encoder-decoder autoregressive transformer models on the task of joint motion forecasting and planning in the autonomous driving domain. Using a 500 thousand hours driving dataset, we demonstrate that, similar to language modeling, model performance improves as a power-law function of the total compute budget, and we observe a strong correlation b… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  7. arXiv:2505.21671  [pdf, ps, other

    cs.AI cs.DS cs.LG math.OC

    Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

    Authors: Davin Choo, Yuqi Pan, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson

    Abstract: We study a sequential decision-making problem on a $n$-node graph $G$ where each node has an unknown label from a finite set $\mathbfΣ$, drawn from a joint distribution $P$ that is Markov with respect to $G$. At each step, selecting a node reveals its label and yields a label-dependent reward. The goal is to adaptively choose nodes to maximize expected accumulated discounted rewards. We impose a f… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  8. arXiv:2505.20868  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech

    Authors: Nam-Gyu Kim, Deok-Hyeon Cho, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Recent advances in expressive text-to-speech (TTS) have introduced diverse methods based on style embedding extracted from reference speech. However, synthesizing high-quality expressive speech remains challenging. We propose Spotlight-TTS, which exclusively emphasizes style via voiced-aware style extraction and style direction adjustment. Voiced-aware style extraction focuses on voiced regions hi… ▽ More

    Submitted 29 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  9. arXiv:2505.19693  [pdf, ps, other

    cs.SD cs.AI eess.AS

    EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Speech emotion recognition predicts a speaker's emotional state from speech signals using discrete labels or continuous dimensions such as arousal, valence, and dominance (VAD). We propose EmoSphere-SER, a joint model that integrates spherical VAD region classification to guide VAD regression for improved emotion prediction. In our framework, VAD values are transformed into spherical coordinates t… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  10. arXiv:2505.19687  [pdf, ps, other

    cs.SD cs.AI eess.AS

    DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Cross-speaker emotion transfer in speech synthesis relies on extracting speaker-independent emotion embeddings for accurate emotion modeling without retaining speaker traits. However, existing timbre compression methods fail to fully separate speaker and emotion characteristics, causing speaker leakage and degraded synthesis quality. To address this, we propose DiEmo-TTS, a self-supervised distill… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  11. arXiv:2505.19252  [pdf, other

    cs.DS cs.AI cs.LG

    Learning-Augmented Online Bipartite Fractional Matching

    Authors: Davin Choo, Billy Jin, Yongho Shin

    Abstract: Online bipartite matching is a fundamental problem in online optimization, extensively studied both in its integral and fractional forms due to its theoretical significance and practical applications, such as online advertising and resource allocation. Motivated by recent progress in learning-augmented algorithms, we study online bipartite fractional matching when the algorithm is given advice in… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  12. arXiv:2505.13376  [pdf, ps, other

    cs.RO

    Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots

    Authors: Dan BW Choe, Sundhar Vinodh Sangeetha, Steven Emanuel, Chih-Yuan Chiu, Samuel Coogan, Shreyas Kousik

    Abstract: Increased robot deployment, such as in warehousing, has revealed a need for seamless collaboration among heterogeneous robot teams to resolve unforeseen conflicts. To address this challenge, we propose a novel, decentralized framework for robots to request and provide help. The framework begins with robots detecting conflicts using a Vision Language Model (VLM), then reasoning over whether help is… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  13. arXiv:2505.12745  [pdf, ps, other

    cs.LG cs.AI

    PEER pressure: Model-to-Model Regularization for Single Source Domain Generalization

    Authors: Dong Kyu Cho, Inwoo Hwang, Sanghack Lee

    Abstract: Data augmentation is a popular tool for single source domain generalization, which expands the source domain by generating simulated ones, improving generalization on unseen target domains. In this work, we show that the performance of such augmentation-based methods in the target domains universally fluctuates during training, posing challenges in model selection under realistic scenarios. We arg… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 21 pages, 9 figures, Accepted at CVPR 2025

  14. arXiv:2505.08799  [pdf, other

    cs.CR

    Measuring Security in 5G and Future Networks

    Authors: Loay Abdelrazek, Rim ElMalki, Filippo Rebecchi, Daniel Cho

    Abstract: In today's increasingly interconnected and fast-paced digital ecosystem, mobile networks, such as 5G and future generations such as 6G, play a pivotal role and must be considered as critical infrastructures. Ensuring their security is paramount to safeguard both individual users and the industries that depend on these networks. An essential condition for maintaining and improving the security post… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: Accepted and presented in IEEE Future Networks World Forum 2024 conference, This is a pre-print version

  15. arXiv:2505.08230  [pdf, ps, other

    cs.RO

    SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments

    Authors: Hogyun Kim, Jiwon Choi, Juwon Kim, Geonmo Yang, Dongjin Cho, Hyungtae Lim, Younggun Cho

    Abstract: Distributed LiDAR SLAM is crucial for achieving efficient robot autonomy and improving the scalability of mapping. However, two issues need to be considered when applying it in field environments: one is resource limitation, and the other is inter/intra-robot association. The resource limitation issue arises when the data size exceeds the processing capacity of the network or memory, especially wh… ▽ More

    Submitted 8 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 8 pages, 10 figures

  16. arXiv:2504.13490  [pdf, other

    cs.CV

    Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing

    Authors: Joowon Kim, Ziseok Lee, Donghyeon Cho, Sanghyun Jo, Yeonsung Jung, Kyungsu Kim, Eunho Yang

    Abstract: Despite recent advances in diffusion models, achieving reliable image generation and editing remains challenging due to the inherent diversity induced by stochastic noise in the sampling process. Instruction-guided image editing with diffusion models offers user-friendly capabilities, yet editing failures, such as background distortion, frequently occur. Users often resort to trial and error, adju… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  17. arXiv:2504.13354  [pdf, ps, other

    cs.FL

    A Formalization of Co-Transcriptional Splicing as an Operation on Formal Languages

    Authors: Da-Jung Cho, Szilárd Zsolt Fazekas, Shinnosuke Seki, Max Wiedenhöft

    Abstract: RNA co-transcriptionality is the process where RNA sequences are spliced while being transcribed from DNA templates. This process holds potential as a key tool for molecular programming. Co-transcriptional folding has been shown to be programmable for assembling nano-scale RNA structures, and recent advances have proven its Turing universality. While post-transcriptional splicing has been extensiv… ▽ More

    Submitted 17 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: 35 pages, 2 tables, 4 figures, Updated Long Version, Under revision review in Natural Computing

    MSC Class: 68Q45; 68Q17 ACM Class: F.4.3; F.1.3

  18. arXiv:2504.02193  [pdf, other

    cs.AI

    More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

    Authors: Yifan Wang, Runjin Chen, Bolian Li, David Cho, Yihe Deng, Ruqi Zhang, Tianlong Chen, Zhangyang Wang, Ananth Grama, Junyuan Hong

    Abstract: Aligning large language models (LLMs) with human values is an increasingly critical step in post-training. Direct Preference Optimization (DPO) has emerged as a simple, yet effective alternative to reinforcement learning from human feedback (RLHF). Synthetic preference data with its low cost and high quality enable effective alignment through single- or multi-model generated preference data. Our s… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  19. arXiv:2504.00137  [pdf, other

    eess.SY

    Performance analysis of metasurface-based spatial multimode transmission for 6G wireless communications

    Authors: Ju Yong Lee, Seung-Won Keum, Sang Min Oh, Dang-Oh Kim, Dong-Ho Cho

    Abstract: In 6th generation wireless communication technology, it is important to utilize space resources efficiently. Recently, holographic multiple-input multiple-output (HMIMO) and meta-surface technology have attracted attention as technologies that maximize space utilization for 6G mobile communications. However, studies on HMIMO communications are still in an initial stage and its fundamental limits a… ▽ More

    Submitted 31 March, 2025; originally announced April 2025.

  20. arXiv:2503.05841  [pdf

    math.AP physics.flu-dyn

    Low Mach number limit for the diffusion approximation model in radiation hydrodynamics at equilibrium-diffusion regime

    Authors: Kwang-Il Choe, Dae-Won Choe, Myong Chol Pak

    Abstract: The low Mach number limit for the compressible viscous diffusion approximation model arising in radiation hydrodynamics is rigorously justified. For the 3-D Cauchy problem, the solutions in an equilibrium diffusion regime are shown to converge to the solutions of an incompressible Navier-Stokes equations locally and globally in time as Mach number goes to zero, when the effect of the small tempera… ▽ More

    Submitted 6 March, 2025; originally announced March 2025.

    Comments: 26 pages

  21. arXiv:2502.07274  [pdf, other

    cs.LG cs.AI

    Memory Is Not the Bottleneck: Cost-Efficient Continual Learning via Weight Space Consolidation

    Authors: Dongkyu Cho, Taesup Moon, Rumi Chunara, Kyunghyun Cho, Sungmin Cha

    Abstract: Continual learning (CL) has traditionally emphasized minimizing exemplar memory usage, assuming that memory is the primary bottleneck. However, in modern computing environments-particularly those involving large foundation models-memory is inexpensive and abundant, while GPU time constitutes the main cost. This paper re-examines CL under a more realistic setting with sufficient exemplar memory, wh… ▽ More

    Submitted 20 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 23 pages, 11 figures

  22. arXiv:2502.04998  [pdf, other

    cs.AI

    On Sequential Fault-Intolerant Process Planning

    Authors: Andrzej Kaczmarczyk, Davin Choo, Niclas Boehmer, Milind Tambe, Haifeng Xu

    Abstract: We propose and study a planning problem we call Sequential Fault-Intolerant Process Planning (SFIPP). SFIPP captures a reward structure common in many sequential multi-stage decision problems where the planning is deemed successful only if all stages succeed. Such reward structures are different from classic additive reward structures and arise in important applications such as drug/material disco… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 20 pages; 7 figures

  23. arXiv:2501.07809  [pdf, other

    cs.LG cs.AI math.AP

    Conformal mapping Coordinates Physics-Informed Neural Networks (CoCo-PINNs): learning neural networks for designing neutral inclusions

    Authors: Daehee Cho, Hyeonmin Yun, Jaeyong Lee, Mikyoung Lim

    Abstract: We focus on designing and solving the neutral inclusion problem via neural networks. The neutral inclusion problem has a long history in the theory of composite materials, and it is exceedingly challenging to identify the precise condition that precipitates a general-shaped inclusion into a neutral inclusion. Physics-informed neural networks (PINNs) have recently become a highly successful approac… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

  24. arXiv:2501.06246  [pdf, other

    cs.CL cs.AI cs.DS

    A partition cover approach to tokenization

    Authors: Jia Peng Lim, Shawn Tan, Davin Choo, Hady W. Lauw

    Abstract: Tokenization is the process of encoding strings into tokens of a fixed vocabulary size, and is widely utilized in Natural Language Processing applications. The leading tokenization algorithm today is Byte-Pair Encoding (BPE), which formulates the tokenization problem as a compression problem and tackles it by performing sequences of merges. In this work, we formulate tokenization as an optimizatio… ▽ More

    Submitted 25 May, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

    Comments: under review

  25. arXiv:2412.19561  [pdf, other

    quant-ph

    Single-qubit quantum gate at an arbitrary speed

    Authors: Seongjin Ahn, Kichan Park, Daehee Cho, Mikyoung Lim, Taeyoung Choi, Andrey S. Moskalenko

    Abstract: Quantum information processing comprises physical processes, which obey the quantum speed limit (QSL): high speed requires strong driving. Single-qubit gates using Rabi oscillation, which is based on the rotating wave approximation (RWA), satisfy this bound in the form that the gate time $T$ is inversely proportional to the Rabi frequency $Ω$, characterizing the driving strength. However, if the g… ▽ More

    Submitted 27 December, 2024; originally announced December 2024.

    Comments: 11 pages, 4 figures

  26. arXiv:2412.06192  [pdf, other

    cs.RO

    PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal

    Authors: Jiwon Choi, Dongjin Cho, Gihyeon Lee, Hogyun Kim, Geonmo Yang, Joowan Kim, Younggun Cho

    Abstract: Maritime environments often present hazardous situations due to factors such as moving ships or buoys, which become obstacles under the influence of waves. In such challenging conditions, the ability to detect and track potentially hazardous objects is critical for the safe navigation of marine robots. To address the scarcity of comprehensive datasets capturing these dynamic scenarios, we introduc… ▽ More

    Submitted 19 December, 2024; v1 submitted 8 December, 2024; originally announced December 2024.

  27. arXiv:2411.13955  [pdf, other

    quant-ph

    A silicon-based ion trap chip protected from semiconductor charging

    Authors: Daun Chung, Kwangyeul Choi, Woojun Lee, Chiyoon Kim, Hosung Shon, Jeonghyun Park, Beomgeun Cho, Kyungmin Lee, Suhan Kim, Seungwoo Yoo, Eui Hwan Jung, Changhyun Jung, Jiyong Kang, Kyunghye Kim, Roberts Berkis, Tracy Northup, Dong-Il "Dan'' Cho, Taehyun Kim

    Abstract: Silicon-based ion trap chips can benefit from existing advanced fabrication technologies, such as multi-metal layer techniques for two-dimensional architectures and silicon photonics for the integration of on-chip optical components. However, the scalability of these technologies may be compromised by semiconductor charging, where photogenerated charge carriers produce electric potentials that dis… ▽ More

    Submitted 21 November, 2024; originally announced November 2024.

  28. arXiv:2411.12700  [pdf, other

    cs.LG cs.DS cs.IT stat.ML

    Learning multivariate Gaussians with imperfect advice

    Authors: Arnab Bhattacharyya, Davin Choo, Philips George John, Themis Gouleakis

    Abstract: We revisit the problem of distribution learning within the framework of learning-augmented algorithms. In this setting, we explore the scenario where a probability distribution is provided as potentially inaccurate advice on the true, unknown distribution. Our objective is to develop learning algorithms whose sample complexity decreases as the quality of the advice improves, thereby surpassing sta… ▽ More

    Submitted 31 January, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  29. arXiv:2411.08141  [pdf, ps, other

    math.ST stat.ML

    Probably approximately correct high-dimensional causal effect estimation given a valid adjustment set

    Authors: Davin Choo, Chandler Squires, Arnab Bhattacharyya, David Sontag

    Abstract: Accurate estimates of causal effects play a key role in decision-making across applications such as healthcare, economics, and operations. In the absence of randomized experiments, a common approach to estimating causal effects uses \textit{covariate adjustment}. In this paper, we study covariate adjustment for discrete distributions from the PAC learning perspective, assuming knowledge of a valid… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

  30. arXiv:2411.02625  [pdf, other

    cs.SD cs.AI eess.AS

    EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Emotional text-to-speech (TTS) technology has achieved significant progress in recent years; however, challenges remain owing to the inherent complexity of emotions and limitations of the available emotional speech datasets and models. Previous studies typically relied on limited emotional speech datasets or required extensive manual annotations, restricting their ability to generalize across diff… ▽ More

    Submitted 16 April, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Journal ref: Published in IEEE Transactions on Affective Computing 2025

  31. arXiv:2410.11894  [pdf, other

    eess.SY cs.LG eess.IV nlin.CD

    Automated Discovery of Operable Dynamics from Videos

    Authors: Kuang Huang, Dong Heon Cho, Boyuan Chen

    Abstract: Dynamical systems form the foundation of scientific discovery, traditionally modeled with predefined state variables such as the angle and angular velocity, and differential equations such as the equation of motion for a single pendulum. We introduce a framework that automatically discovers a low-dimensional and operable representation of system dynamics, including a set of compact state variables… ▽ More

    Submitted 23 April, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

  32. arXiv:2410.06583  [pdf, other

    cs.DS

    A short note about the learning-augmented secretary problem

    Authors: Davin Choo, Chun Kai Ling

    Abstract: We consider the secretary problem through the lens of learning-augmented algorithms. As it is known that the best possible expected competitive ratio is $1/e$ in the classic setting without predictions, a natural goal is to design algorithms that are 1-consistent and $1/e$-robust. Unfortunately, [FY24] provided hardness constructions showing that such a goal is not attainable when the candidates'… ▽ More

    Submitted 2 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  33. arXiv:2409.15784  [pdf

    physics.app-ph cond-mat.mtrl-sci cs.LG physics.optics

    Deep-learning real-time phase retrieval of imperfect diffraction patterns from X-ray free-electron lasers

    Authors: Sung Yun Lee, Do Hyung Cho, Chulho Jung, Daeho Sung, Daewoong Nam, Sangsoo Kim, Changyong Song

    Abstract: Machine learning is attracting surging interest across nearly all scientific areas by enabling the analysis of large datasets and the extraction of scientific information from incomplete data. Data-driven science is rapidly growing, especially in X-ray methodologies, where advanced light sources and detection technologies accumulate vast amounts of data that exceed meticulous human inspection capa… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    MSC Class: 68T07 ACM Class: J.2

  34. Signatures of Amorphous Shiba State in FeTe$_{0.55}$Se$_{0.45}$

    Authors: Jinwon Lee, Sanghun Lee, Andreas Kreisel, Jens Paaske, Brian M. Andersen, Koen M. Bastiaans, Damianos Chatzopoulos, Genda Gu, Doohee Cho, Milan P. Allan

    Abstract: The iron-based superconductor FeTe$_{0.55}$Se$_{0.45}$ is a peculiar material: it hosts a surface state with a Dirac dispersion, is a putative topological superconductor hosting Majorana modes in vortices, and has an unusually low Fermi energy. The superconducting state is generally thought to be characterized by three gaps in different bands, with the usual homogenous, spatially extended Bogoliub… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 6 pages, 4 figures

    Journal ref: Nano Letters 25, 4227-4233 (2025)

  35. arXiv:2407.21678  [pdf

    physics.app-ph

    Charged-impurity free printing-based diffusion doping in molybdenum disulfide field-effect transistors

    Authors: Inho Jeong, Jiwoo Yang, Juntae Jang, Daeheum Cho, Deok-Hwang Kwon, Jae-Keun Kim, Takhee Lee, Kyungjune Cho, Seungjun Chung

    Abstract: In practical electronic applications, where doping is crucial to exploit large-area two-dimensional (2D) semiconductors, surface charge transfer doping (SCTD) has emerged as a promising strategy to tailor their electrical characteristics. However, impurity scattering caused by resultant ionized dopants, after donating or withdrawing carriers, hinders transport in 2D semiconductor layers, limiting… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  36. arXiv:2407.03231  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Dimensionality Engineering of Magnetic Anisotropy from Anomalous Hall Effect in Synthetic SrRuO3 Crystals

    Authors: Seung Gyo Jeong, Seong Won Cho, Sehwan Song, Jin Young Oh, Do Gyeom Jeong, Gyeongtak Han, Hu Young Jeong, Ahmed Yousef Mohamed, Woo-suk Noh, Sungkyun Park, Jong Seok Lee, Suyoun Lee, Young-Min Kim, Deok-Yong Cho, Woo Seok Choi

    Abstract: Magnetic anisotropy in atomically thin correlated heterostructures is essential for exploring quantum magnetic phases for next-generation spintronics. Whereas previous studies have mostly focused on van der Waals systems, here, we investigate the impact of dimensionality of epitaxially-grown correlated oxides down to the monolayer limit on structural, magnetic, and orbital anisotropies. By designi… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 23 pages

    Journal ref: published 2024

  37. arXiv:2407.00927  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Learnability of Parameter-Bounded Bayes Nets

    Authors: Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis

    Abstract: Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $\mathbb{P}$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Baye… ▽ More

    Submitted 4 August, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures

  38. Origin of Distinct Insulating Domains in the Layered Charge Density Wave Material 1T-TaS2

    Authors: Hyungryul Yang, Byeongin Lee, Junho Bang, Sunghun Kim, Dirk Wulferding, Sung-Hoon Lee, Doohee Cho

    Abstract: Vertical charge order shapes the electronic properties in layered charge density wave (CDW) materials. Various stacking orders inevitably create nanoscale domains with distinct electronic structures inaccessible to bulk probes. Here, the stacking characteristics of bulk 1$T$-TaS$2$ are analyzed using scanning tunneling spectroscopy (STS) and density functional theory (DFT) calculations. It is obse… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 26 pages and 13 figures

  39. Charge ordered phases in the hole-doped triangular Mott insulator 4Hb-TaS2

    Authors: Junho Bang, Byeongin Lee, Hyungryul Yang, Sunghun Kim, Dirk Wulferding, Doohee Cho

    Abstract: 4Hb-TaS2 has been proposed to possess unconventional superconductivity with broken time reveral symmetry due to distinctive layered structure, featuring a heterojunction between a 2D triangular Mott insulator and a charge density wave metal. However, since a frustrated spin state in the correlated insulating layer is susceptible to charge ordering with carrier doping, it is required to investigate… ▽ More

    Submitted 17 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

    Comments: 18 pages, 6 figures

    Journal ref: Phys. Rev. B 109, 195170 (2024)

  40. EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressi… ▽ More

    Submitted 4 November, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Proceedings of Interspeech

  41. arXiv:2405.09784  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Online bipartite matching with imperfect advice

    Authors: Davin Choo, Themis Gouleakis, Chun Kai Ling, Arnab Bhattacharyya

    Abstract: We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted into ICML 2024

  42. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  43. arXiv:2403.15714  [pdf, ps, other

    math.AP

    Analytic asymptotic formulas for effective parameters of planar elastic composites

    Authors: Daehee Cho, Doosung Choi, Mikyoung Lim

    Abstract: We investigate the effective elastic properties of periodic dilute two-phase composites consisting of an homogeneous isotropic matrix and a periodic array of rigid inclusions. We assume the rigid inclusion in a unit cell is a simply connected, bounded domain so that there exists an exterior conformal mapping corresponding the inclusion. Recently, an analytical series solution method for the elasti… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  44. arXiv:2403.15713  [pdf, ps, other

    math.AP

    Geometric series solution for the plane elastostatic problem in the presence of a cavity

    Authors: Daehee Cho, Doosung Choi, Mikyoung Lim

    Abstract: This paper presents an analytic series solution method for the elastic inclusion problem in a two-dimensional unbounded isotropic medium with a cavity. Generalizing the work of Mattei and Lim \cite{Mattei:2021:EAS}, this study develops an analytic series solution method for the elastic inclusion problem to encompass a cavity problem. The central mathematical challenge tackled in this research is t… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  45. arXiv:2403.01519  [pdf, other

    math.AP

    Analytic shape recovery of an elastic inclusion from elastic moment tensors

    Authors: Daehee Cho, Mikyoung Lim

    Abstract: In this paper, we present an analytic non-iterative approach for recovering a planar isotropic elastic inclusion embedded in an unbounded medium from the elastic moment tensors (EMTs), which are coefficients for the multipole expansion of field perturbation caused by the inclusion. EMTs contain information about the inclusion's material and geometric properties and, as is well known, the inclusion… ▽ More

    Submitted 5 August, 2024; v1 submitted 3 March, 2024; originally announced March 2024.

    Comments: 22 pages, 3 figures

  46. Envy-Free House Allocation with Minimum Subsidy

    Authors: Davin Choo, Yan Hao Ling, Warut Suksompong, Nicholas Teh, Jian Zhang

    Abstract: House allocation refers to the problem where $m$ houses are to be allocated to $n$ agents so that each agent receives one house. Since an envy-free house allocation does not always exist, we consider finding such an allocation in the presence of subsidy. We show that computing an envy-free allocation with minimum subsidy is NP-hard in general, but can be done efficiently if $m$ differs from $n$ by… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Journal ref: Operations Research Letters, 54:107103 (2024)

  47. arXiv:2402.08229  [pdf, other

    cs.LG cs.DS stat.ME stat.ML

    Causal Discovery under Off-Target Interventions

    Authors: Davin Choo, Kirankumar Shiragur, Caroline Uhler

    Abstract: Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted into AISTATS 2024

  48. arXiv:2401.08095  [pdf, other

    cs.SD cs.AI eess.AS

    DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment

    Authors: Hyung-Seok Oh, Sang-Hoon Lee, Deok-Hyeon Cho, Seong-Whan Lee

    Abstract: Emotional voice conversion (EVC) involves modifying various acoustic characteristics, such as pitch and spectral envelope, to match a desired emotional state while preserving the speaker's identity. Existing EVC methods often rely on text transcriptions or time-alignment information and struggle to handle varying speech durations effectively. In this paper, we propose DurFlex-EVC, a duration-flexi… ▽ More

    Submitted 20 January, 2025; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 15 pages, 11 figures, 12 tables

    Journal ref: IEEE Transactions on Affective Computing, 2025, pp.1 - 15

  49. arXiv:2401.00265  [pdf, ps, other

    cond-mat.mtrl-sci

    An unconventional platform for two-dimensional Kagome flat bands on semiconductor surfaces

    Authors: Jae Hyuck Lee, GwanWoo Kim, Inkyung Song, Yejin Kim, Yeonjae Lee, Sung Jong Yoo, Deok-Yong Cho, Jun-Won Rhim, Jongkeun Jung, Gunn Kim, Changyoung Kim

    Abstract: In condensed matter physics, the Kagome lattice and its inherent flat bands have attracted considerable attention for their potential to host a variety of exotic physical phenomena. Despite extensive efforts to fabricate thin films of Kagome materials aimed at modulating the flat bands through electrostatic gating or strain manipulation, progress has been limited. Here, we report the observation o… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 7 pages, 4 figures

  50. arXiv:2312.08986  [pdf, other

    cond-mat.str-el cond-mat.mtrl-sci

    Melting of unidirectional charge density waves across twin domain boundaries in GdTe$_{3}$

    Authors: Sanghun Lee, Eunseo Kim, Junho Bang, Jongho Park, Changyoung Kim, Dirk Wulferding, Doohee Cho

    Abstract: Solids undergoing a transition from order to disorder experience the proliferation of topological defects. The melting process generates transient quantum states. However, their dynamical nature with femtosecond lifetime hinders exploration with atomic precision. Here, we suggest an alternative approach to the dynamical melting process by focusing on the interface created by competing degenerate q… ▽ More

    Submitted 14 December, 2023; originally announced December 2023.

    Journal ref: Nano Lett. 23, 11219 (2023)