Skip to main content

Showing 1–50 of 104 results for author: Choe, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.23384  [pdf, ps, other

    cs.FL

    Programmable Co-Transcriptional Splicing: Realizing Regular Languages via Hairpin Deletion

    Authors: Da-Jung Cho, Szilárd Zsolt Fazekas, Shinnosuke Seki, Max Wiedenhöft

    Abstract: RNA co-transcriptionality, where RNA is spliced or folded during transcription from DNA templates, offers promising potential for molecular programming. It enables programmable folding of nano-scale RNA structures and has recently been shown to be Turing universal. While post-transcriptional splicing is well studied, co-transcriptional splicing is gaining attention for its efficiency, though its u… ▽ More

    Submitted 29 June, 2025; originally announced June 2025.

    Comments: 28 pages, 8 Figures, Accepted at the 31st International Conference on DNA Computing and Molecular Programming (2025)

    MSC Class: 92-10 ACM Class: F.4.3; J.3; F.1.3

  2. arXiv:2506.08240  [pdf, ps, other

    cs.LG

    Dealing with the Evil Twins: Improving Random Augmentation by Addressing Catastrophic Forgetting of Diverse Augmentations

    Authors: Dongkyu Cho, Rumi Chunara

    Abstract: Data augmentation is a promising tool for enhancing out-of-distribution generalization, where the key is to produce diverse, challenging variations of the source domain via costly targeted augmentations that maximize its generalization effect. Conversely, random augmentation is inexpensive but is deemed suboptimal due to its limited effect. In this paper, we revisit random augmentation and explore… ▽ More

    Submitted 27 June, 2025; v1 submitted 9 June, 2025; originally announced June 2025.

    Comments: 12 pages, 6 figures

  3. arXiv:2506.08228  [pdf, ps, other

    cs.LG cs.AI cs.RO

    Scaling Laws of Motion Forecasting and Planning -- A Technical Report

    Authors: Mustafa Baniodeh, Kratarth Goel, Scott Ettinger, Carlos Fuertes, Ari Seff, Tim Shen, Cole Gulino, Chenjie Yang, Ghassen Jerfel, Dokook Choe, Rui Wang, Vinutha Kallem, Sergio Casas, Rami Al-Rfou, Benjamin Sapp, Dragomir Anguelov

    Abstract: We study the empirical scaling laws of a family of encoder-decoder autoregressive transformer models on the task of joint motion forecasting and planning in the autonomous driving domain. Using a 500 thousand hours driving dataset, we demonstrate that, similar to language modeling, model performance improves as a power-law function of the total compute budget, and we observe a strong correlation b… ▽ More

    Submitted 9 June, 2025; originally announced June 2025.

  4. arXiv:2505.21671  [pdf, ps, other

    cs.AI cs.DS cs.LG math.OC

    Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing

    Authors: Davin Choo, Yuqi Pan, Tonghan Wang, Milind Tambe, Alastair van Heerden, Cheryl Johnson

    Abstract: We study a sequential decision-making problem on a $n$-node graph $G$ where each node has an unknown label from a finite set $\mathbfΣ$, drawn from a joint distribution $P$ that is Markov with respect to $G$. At each step, selecting a node reveals its label and yields a label-dependent reward. The goal is to adaptively choose nodes to maximize expected accumulated discounted rewards. We impose a f… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

  5. arXiv:2505.20868  [pdf, ps, other

    cs.SD cs.AI eess.AS

    Spotlight-TTS: Spotlighting the Style via Voiced-Aware Style Extraction and Style Direction Adjustment for Expressive Text-to-Speech

    Authors: Nam-Gyu Kim, Deok-Hyeon Cho, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Recent advances in expressive text-to-speech (TTS) have introduced diverse methods based on style embedding extracted from reference speech. However, synthesizing high-quality expressive speech remains challenging. We propose Spotlight-TTS, which exclusively emphasizes style via voiced-aware style extraction and style direction adjustment. Voiced-aware style extraction focuses on voiced regions hi… ▽ More

    Submitted 29 June, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  6. arXiv:2505.19693  [pdf, ps, other

    cs.SD cs.AI eess.AS

    EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Speech emotion recognition predicts a speaker's emotional state from speech signals using discrete labels or continuous dimensions such as arousal, valence, and dominance (VAD). We propose EmoSphere-SER, a joint model that integrates spherical VAD region classification to guide VAD regression for improved emotion prediction. In our framework, VAD values are transformed into spherical coordinates t… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  7. arXiv:2505.19687  [pdf, ps, other

    cs.SD cs.AI eess.AS

    DiEmo-TTS: Disentangled Emotion Representations via Self-Supervised Distillation for Cross-Speaker Emotion Transfer in Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Cross-speaker emotion transfer in speech synthesis relies on extracting speaker-independent emotion embeddings for accurate emotion modeling without retaining speaker traits. However, existing timbre compression methods fail to fully separate speaker and emotion characteristics, causing speaker leakage and degraded synthesis quality. To address this, we propose DiEmo-TTS, a self-supervised distill… ▽ More

    Submitted 26 May, 2025; originally announced May 2025.

    Comments: Proceedings of Interspeech 2025

  8. arXiv:2505.19252  [pdf, other

    cs.DS cs.AI cs.LG

    Learning-Augmented Online Bipartite Fractional Matching

    Authors: Davin Choo, Billy Jin, Yongho Shin

    Abstract: Online bipartite matching is a fundamental problem in online optimization, extensively studied both in its integral and fractional forms due to its theoretical significance and practical applications, such as online advertising and resource allocation. Motivated by recent progress in learning-augmented algorithms, we study online bipartite fractional matching when the algorithm is given advice in… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  9. arXiv:2505.13376  [pdf, ps, other

    cs.RO

    Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots

    Authors: Dan BW Choe, Sundhar Vinodh Sangeetha, Steven Emanuel, Chih-Yuan Chiu, Samuel Coogan, Shreyas Kousik

    Abstract: Increased robot deployment, such as in warehousing, has revealed a need for seamless collaboration among heterogeneous robot teams to resolve unforeseen conflicts. To address this challenge, we propose a novel, decentralized framework for robots to request and provide help. The framework begins with robots detecting conflicts using a Vision Language Model (VLM), then reasoning over whether help is… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

  10. arXiv:2505.12745  [pdf, ps, other

    cs.LG cs.AI

    PEER pressure: Model-to-Model Regularization for Single Source Domain Generalization

    Authors: Dong Kyu Cho, Inwoo Hwang, Sanghack Lee

    Abstract: Data augmentation is a popular tool for single source domain generalization, which expands the source domain by generating simulated ones, improving generalization on unseen target domains. In this work, we show that the performance of such augmentation-based methods in the target domains universally fluctuates during training, posing challenges in model selection under realistic scenarios. We arg… ▽ More

    Submitted 19 May, 2025; originally announced May 2025.

    Comments: 21 pages, 9 figures, Accepted at CVPR 2025

  11. arXiv:2505.08799  [pdf, other

    cs.CR

    Measuring Security in 5G and Future Networks

    Authors: Loay Abdelrazek, Rim ElMalki, Filippo Rebecchi, Daniel Cho

    Abstract: In today's increasingly interconnected and fast-paced digital ecosystem, mobile networks, such as 5G and future generations such as 6G, play a pivotal role and must be considered as critical infrastructures. Ensuring their security is paramount to safeguard both individual users and the industries that depend on these networks. An essential condition for maintaining and improving the security post… ▽ More

    Submitted 9 May, 2025; originally announced May 2025.

    Comments: Accepted and presented in IEEE Future Networks World Forum 2024 conference, This is a pre-print version

  12. arXiv:2505.08230  [pdf, ps, other

    cs.RO

    SKiD-SLAM: Robust, Lightweight, and Distributed Multi-Robot LiDAR SLAM in Resource-Constrained Field Environments

    Authors: Hogyun Kim, Jiwon Choi, Juwon Kim, Geonmo Yang, Dongjin Cho, Hyungtae Lim, Younggun Cho

    Abstract: Distributed LiDAR SLAM is crucial for achieving efficient robot autonomy and improving the scalability of mapping. However, two issues need to be considered when applying it in field environments: one is resource limitation, and the other is inter/intra-robot association. The resource limitation issue arises when the data size exceeds the processing capacity of the network or memory, especially wh… ▽ More

    Submitted 8 June, 2025; v1 submitted 13 May, 2025; originally announced May 2025.

    Comments: 8 pages, 10 figures

  13. arXiv:2504.13490  [pdf, other

    cs.CV

    Early Timestep Zero-Shot Candidate Selection for Instruction-Guided Image Editing

    Authors: Joowon Kim, Ziseok Lee, Donghyeon Cho, Sanghyun Jo, Yeonsung Jung, Kyungsu Kim, Eunho Yang

    Abstract: Despite recent advances in diffusion models, achieving reliable image generation and editing remains challenging due to the inherent diversity induced by stochastic noise in the sampling process. Instruction-guided image editing with diffusion models offers user-friendly capabilities, yet editing failures, such as background distortion, frequently occur. Users often resort to trial and error, adju… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

  14. arXiv:2504.13354  [pdf, ps, other

    cs.FL

    A Formalization of Co-Transcriptional Splicing as an Operation on Formal Languages

    Authors: Da-Jung Cho, Szilárd Zsolt Fazekas, Shinnosuke Seki, Max Wiedenhöft

    Abstract: RNA co-transcriptionality is the process where RNA sequences are spliced while being transcribed from DNA templates. This process holds potential as a key tool for molecular programming. Co-transcriptional folding has been shown to be programmable for assembling nano-scale RNA structures, and recent advances have proven its Turing universality. While post-transcriptional splicing has been extensiv… ▽ More

    Submitted 17 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: 35 pages, 2 tables, 4 figures, Updated Long Version, Under revision review in Natural Computing

    MSC Class: 68Q45; 68Q17 ACM Class: F.4.3; F.1.3

  15. arXiv:2504.02193  [pdf, other

    cs.AI

    More is Less: The Pitfalls of Multi-Model Synthetic Preference Data in DPO Safety Alignment

    Authors: Yifan Wang, Runjin Chen, Bolian Li, David Cho, Yihe Deng, Ruqi Zhang, Tianlong Chen, Zhangyang Wang, Ananth Grama, Junyuan Hong

    Abstract: Aligning large language models (LLMs) with human values is an increasingly critical step in post-training. Direct Preference Optimization (DPO) has emerged as a simple, yet effective alternative to reinforcement learning from human feedback (RLHF). Synthetic preference data with its low cost and high quality enable effective alignment through single- or multi-model generated preference data. Our s… ▽ More

    Submitted 2 April, 2025; originally announced April 2025.

  16. arXiv:2502.07274  [pdf, other

    cs.LG cs.AI

    Memory Is Not the Bottleneck: Cost-Efficient Continual Learning via Weight Space Consolidation

    Authors: Dongkyu Cho, Taesup Moon, Rumi Chunara, Kyunghyun Cho, Sungmin Cha

    Abstract: Continual learning (CL) has traditionally emphasized minimizing exemplar memory usage, assuming that memory is the primary bottleneck. However, in modern computing environments-particularly those involving large foundation models-memory is inexpensive and abundant, while GPU time constitutes the main cost. This paper re-examines CL under a more realistic setting with sufficient exemplar memory, wh… ▽ More

    Submitted 20 May, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

    Comments: 23 pages, 11 figures

  17. arXiv:2502.04998  [pdf, other

    cs.AI

    On Sequential Fault-Intolerant Process Planning

    Authors: Andrzej Kaczmarczyk, Davin Choo, Niclas Boehmer, Milind Tambe, Haifeng Xu

    Abstract: We propose and study a planning problem we call Sequential Fault-Intolerant Process Planning (SFIPP). SFIPP captures a reward structure common in many sequential multi-stage decision problems where the planning is deemed successful only if all stages succeed. Such reward structures are different from classic additive reward structures and arise in important applications such as drug/material disco… ▽ More

    Submitted 7 February, 2025; originally announced February 2025.

    Comments: 20 pages; 7 figures

  18. arXiv:2501.07809  [pdf, other

    cs.LG cs.AI math.AP

    Conformal mapping Coordinates Physics-Informed Neural Networks (CoCo-PINNs): learning neural networks for designing neutral inclusions

    Authors: Daehee Cho, Hyeonmin Yun, Jaeyong Lee, Mikyoung Lim

    Abstract: We focus on designing and solving the neutral inclusion problem via neural networks. The neutral inclusion problem has a long history in the theory of composite materials, and it is exceedingly challenging to identify the precise condition that precipitates a general-shaped inclusion into a neutral inclusion. Physics-informed neural networks (PINNs) have recently become a highly successful approac… ▽ More

    Submitted 13 January, 2025; originally announced January 2025.

  19. arXiv:2501.06246  [pdf, other

    cs.CL cs.AI cs.DS

    A partition cover approach to tokenization

    Authors: Jia Peng Lim, Shawn Tan, Davin Choo, Hady W. Lauw

    Abstract: Tokenization is the process of encoding strings into tokens of a fixed vocabulary size, and is widely utilized in Natural Language Processing applications. The leading tokenization algorithm today is Byte-Pair Encoding (BPE), which formulates the tokenization problem as a compression problem and tackles it by performing sequences of merges. In this work, we formulate tokenization as an optimizatio… ▽ More

    Submitted 25 May, 2025; v1 submitted 8 January, 2025; originally announced January 2025.

    Comments: under review

  20. arXiv:2412.06192  [pdf, other

    cs.RO

    PoLaRIS Dataset: A Maritime Object Detection and Tracking Dataset in Pohang Canal

    Authors: Jiwon Choi, Dongjin Cho, Gihyeon Lee, Hogyun Kim, Geonmo Yang, Joowan Kim, Younggun Cho

    Abstract: Maritime environments often present hazardous situations due to factors such as moving ships or buoys, which become obstacles under the influence of waves. In such challenging conditions, the ability to detect and track potentially hazardous objects is critical for the safe navigation of marine robots. To address the scarcity of comprehensive datasets capturing these dynamic scenarios, we introduc… ▽ More

    Submitted 19 December, 2024; v1 submitted 8 December, 2024; originally announced December 2024.

  21. arXiv:2411.12700  [pdf, other

    cs.LG cs.DS cs.IT stat.ML

    Learning multivariate Gaussians with imperfect advice

    Authors: Arnab Bhattacharyya, Davin Choo, Philips George John, Themis Gouleakis

    Abstract: We revisit the problem of distribution learning within the framework of learning-augmented algorithms. In this setting, we explore the scenario where a probability distribution is provided as potentially inaccurate advice on the true, unknown distribution. Our objective is to develop learning algorithms whose sample complexity decreases as the quality of the advice improves, thereby surpassing sta… ▽ More

    Submitted 31 January, 2025; v1 submitted 19 November, 2024; originally announced November 2024.

  22. arXiv:2411.02625  [pdf, other

    cs.SD cs.AI eess.AS

    EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vector

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Seong-Whan Lee

    Abstract: Emotional text-to-speech (TTS) technology has achieved significant progress in recent years; however, challenges remain owing to the inherent complexity of emotions and limitations of the available emotional speech datasets and models. Previous studies typically relied on limited emotional speech datasets or required extensive manual annotations, restricting their ability to generalize across diff… ▽ More

    Submitted 16 April, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

    Journal ref: Published in IEEE Transactions on Affective Computing 2025

  23. arXiv:2410.11894  [pdf, other

    eess.SY cs.LG eess.IV nlin.CD

    Automated Discovery of Operable Dynamics from Videos

    Authors: Kuang Huang, Dong Heon Cho, Boyuan Chen

    Abstract: Dynamical systems form the foundation of scientific discovery, traditionally modeled with predefined state variables such as the angle and angular velocity, and differential equations such as the equation of motion for a single pendulum. We introduce a framework that automatically discovers a low-dimensional and operable representation of system dynamics, including a set of compact state variables… ▽ More

    Submitted 23 April, 2025; v1 submitted 13 October, 2024; originally announced October 2024.

  24. arXiv:2410.06583  [pdf, other

    cs.DS

    A short note about the learning-augmented secretary problem

    Authors: Davin Choo, Chun Kai Ling

    Abstract: We consider the secretary problem through the lens of learning-augmented algorithms. As it is known that the best possible expected competitive ratio is $1/e$ in the classic setting without predictions, a natural goal is to design algorithms that are 1-consistent and $1/e$-robust. Unfortunately, [FY24] provided hardness constructions showing that such a goal is not attainable when the candidates'… ▽ More

    Submitted 2 November, 2024; v1 submitted 9 October, 2024; originally announced October 2024.

  25. arXiv:2409.15784  [pdf

    physics.app-ph cond-mat.mtrl-sci cs.LG physics.optics

    Deep-learning real-time phase retrieval of imperfect diffraction patterns from X-ray free-electron lasers

    Authors: Sung Yun Lee, Do Hyung Cho, Chulho Jung, Daeho Sung, Daewoong Nam, Sangsoo Kim, Changyong Song

    Abstract: Machine learning is attracting surging interest across nearly all scientific areas by enabling the analysis of large datasets and the extraction of scientific information from incomplete data. Data-driven science is rapidly growing, especially in X-ray methodologies, where advanced light sources and detection technologies accumulate vast amounts of data that exceed meticulous human inspection capa… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

    MSC Class: 68T07 ACM Class: J.2

  26. arXiv:2407.00927  [pdf, ps, other

    cs.LG cs.CC stat.ML

    Learnability of Parameter-Bounded Bayes Nets

    Authors: Arnab Bhattacharyya, Davin Choo, Sutanu Gayen, Dimitrios Myrisiotis

    Abstract: Bayes nets are extensively used in practice to efficiently represent joint probability distributions over a set of random variables and capture dependency relations. In a seminal paper, Chickering et al. (JMLR 2004) showed that given a distribution $\mathbb{P}$, that is defined as the marginal distribution of a Bayes net, it is $\mathsf{NP}$-hard to decide whether there is a parameter-bounded Baye… ▽ More

    Submitted 4 August, 2024; v1 submitted 30 June, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures

  27. EmoSphere-TTS: Emotional Style and Intensity Modeling via Spherical Emotion Vector for Controllable Emotional Text-to-Speech

    Authors: Deok-Hyeon Cho, Hyung-Seok Oh, Seung-Bin Kim, Sang-Hoon Lee, Seong-Whan Lee

    Abstract: Despite rapid advances in the field of emotional text-to-speech (TTS), recent studies primarily focus on mimicking the average style of a particular emotion. As a result, the ability to manipulate speech emotion remains constrained to several predefined labels, compromising the ability to reflect the nuanced variations of emotion. In this paper, we propose EmoSphere-TTS, which synthesizes expressi… ▽ More

    Submitted 4 November, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Proceedings of Interspeech

  28. arXiv:2405.09784  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Online bipartite matching with imperfect advice

    Authors: Davin Choo, Themis Gouleakis, Chun Kai Ling, Arnab Bhattacharyya

    Abstract: We study the problem of online unweighted bipartite matching with $n$ offline vertices and $n$ online vertices where one wishes to be competitive against the optimal offline algorithm. While the classic RANKING algorithm of Karp et al. [1990] provably attains competitive ratio of $1-1/e > 1/2$, we show that no learning-augmented method can be both 1-consistent and strictly better than $1/2$-robust… ▽ More

    Submitted 23 May, 2024; v1 submitted 15 May, 2024; originally announced May 2024.

    Comments: Accepted into ICML 2024

  29. arXiv:2404.01954  [pdf, other

    cs.CL cs.AI

    HyperCLOVA X Technical Report

    Authors: Kang Min Yoo, Jaegeun Han, Sookyo In, Heewon Jeon, Jisu Jeong, Jaewook Kang, Hyunwook Kim, Kyung-Min Kim, Munhyong Kim, Sungju Kim, Donghyun Kwak, Hanock Kwak, Se Jung Kwon, Bado Lee, Dongsoo Lee, Gichang Lee, Jooho Lee, Baeseong Park, Seongjin Shin, Joonsang Yu, Seolki Baek, Sumin Byeon, Eungsup Cho, Dooseok Choe, Jeesung Han , et al. (371 additional authors not shown)

    Abstract: We introduce HyperCLOVA X, a family of large language models (LLMs) tailored to the Korean language and culture, along with competitive capabilities in English, math, and coding. HyperCLOVA X was trained on a balanced mix of Korean, English, and code data, followed by instruction-tuning with high-quality human-annotated datasets while abiding by strict safety guidelines reflecting our commitment t… ▽ More

    Submitted 13 April, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    Comments: 44 pages; updated authors list and fixed author names

  30. Envy-Free House Allocation with Minimum Subsidy

    Authors: Davin Choo, Yan Hao Ling, Warut Suksompong, Nicholas Teh, Jian Zhang

    Abstract: House allocation refers to the problem where $m$ houses are to be allocated to $n$ agents so that each agent receives one house. Since an envy-free house allocation does not always exist, we consider finding such an allocation in the presence of subsidy. We show that computing an envy-free allocation with minimum subsidy is NP-hard in general, but can be done efficiently if $m$ differs from $n$ by… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

    Journal ref: Operations Research Letters, 54:107103 (2024)

  31. arXiv:2402.08229  [pdf, other

    cs.LG cs.DS stat.ME stat.ML

    Causal Discovery under Off-Target Interventions

    Authors: Davin Choo, Kirankumar Shiragur, Caroline Uhler

    Abstract: Causal graph discovery is a significant problem with applications across various disciplines. However, with observational data alone, the underlying causal graph can only be recovered up to its Markov equivalence class, and further assumptions or interventions are necessary to narrow down the true graph. This work addresses the causal discovery problem under the setting of stochastic interventions… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

    Comments: Accepted into AISTATS 2024

  32. arXiv:2401.08095  [pdf, other

    cs.SD cs.AI eess.AS

    DurFlex-EVC: Duration-Flexible Emotional Voice Conversion Leveraging Discrete Representations without Text Alignment

    Authors: Hyung-Seok Oh, Sang-Hoon Lee, Deok-Hyeon Cho, Seong-Whan Lee

    Abstract: Emotional voice conversion (EVC) involves modifying various acoustic characteristics, such as pitch and spectral envelope, to match a desired emotional state while preserving the speaker's identity. Existing EVC methods often rely on text transcriptions or time-alignment information and struggle to handle varying speech durations effectively. In this paper, we propose DurFlex-EVC, a duration-flexi… ▽ More

    Submitted 20 January, 2025; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 15 pages, 11 figures, 12 tables

    Journal ref: IEEE Transactions on Affective Computing, 2025, pp.1 - 15

  33. arXiv:2312.02819  [pdf, other

    cs.CV

    Deterministic Guidance Diffusion Model for Probabilistic Weather Forecasting

    Authors: Donggeun Yoon, Minseok Seo, Doyi Kim, Yeji Choi, Donghyeon Cho

    Abstract: Weather forecasting requires not only accuracy but also the ability to perform probabilistic prediction. However, deterministic weather forecasting methods do not support probabilistic predictions, and conversely, probabilistic models tend to be less accurate. To address these challenges, in this paper, we introduce the \textbf{\textit{D}}eterministic \textbf{\textit{G}}uidance \textbf{\textit{D}}… ▽ More

    Submitted 5 December, 2023; originally announced December 2023.

    Comments: 16 pages

  34. arXiv:2310.19261  [pdf, other

    cs.LG

    Diversify & Conquer: Outcome-directed Curriculum RL via Out-of-Distribution Disagreement

    Authors: Daesol Cho, Seungjae Lee, H. Jin Kim

    Abstract: Reinforcement learning (RL) often faces the challenges of uninformed search problems where the agent should explore without access to the domain knowledge such as characteristics of the environment or external rewards. To tackle these challenges, this work proposes a new approach for curriculum RL called Diversify for Disagreement & Conquer (D2C). Unlike previous curriculum learning methods, D2C r… ▽ More

    Submitted 30 October, 2023; originally announced October 2023.

  35. arXiv:2310.17330  [pdf, other

    cs.LG cs.AI

    CQM: Curriculum Reinforcement Learning with a Quantized World Model

    Authors: Seungjae Lee, Daesol Cho, Jonghae Park, H. Jin Kim

    Abstract: Recent curriculum Reinforcement Learning (RL) has shown notable progress in solving complex tasks by proposing sequences of surrogate tasks. However, the previous approaches often face challenges when they generate curriculum goals in a high-dimensional space. Thus, they usually rely on manually specified goal spaces. To alleviate this limitation and improve the scalability of the curriculum, we p… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023

  36. arXiv:2310.06333  [pdf, ps, other

    cs.LG cs.DS math.PR math.ST stat.ML

    Learning bounded-degree polytrees with known skeleton

    Authors: Davin Choo, Joy Qiping Yang, Arnab Bhattacharyya, Clément L. Canonne

    Abstract: We establish finite-sample guarantees for efficient proper learning of bounded-degree polytrees, a rich class of high-dimensional probability distributions and a subclass of Bayesian networks, a widely-studied type of graphical model. Recently, Bhattacharyya et al. (2021) obtained finite-sample guarantees for recovering tree-structured Bayesian networks, i.e., 1-polytrees. We extend their results… ▽ More

    Submitted 21 January, 2024; v1 submitted 10 October, 2023; originally announced October 2023.

    Comments: Fixed some typos. Added some discussions. Accepted to ALT 2024

  37. arXiv:2306.05781  [pdf, other

    cs.LG cs.AI cs.DS stat.ME stat.ML

    Adaptivity Complexity for Causal Graph Discovery

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Causal discovery from interventional data is an important problem, where the task is to design an interventional strategy that learns the hidden ground truth causal graph $G(V,E)$ on $|V| = n$ nodes while minimizing the number of performed interventions. Most prior interventional strategies broadly fall into two categories: non-adaptive and adaptive. Non-adaptive strategies decide on a single fixe… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted into UAI 2023

  38. arXiv:2305.19588  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    Active causal structure learning with advice

    Authors: Davin Choo, Themis Gouleakis, Arnab Bhattacharyya

    Abstract: We introduce the problem of active causal structure learning with advice. In the typical well-studied setting, the learning algorithm is given the essential graph for the observational distribution and is asked to recover the underlying causal directed acyclic graph (DAG) $G^*$ while minimizing the number of interventions made. In our setting, we are additionally given side information about… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: Accepted into ICML 2023

  39. arXiv:2305.09943  [pdf, other

    cs.LG cs.AI cs.RO

    Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum

    Authors: Jigang Kim, Daesol Cho, H. Jin Kim

    Abstract: While reinforcement learning (RL) has achieved great success in acquiring complex skills solely from environmental interactions, it assumes that resets to the initial state are readily available at the end of each episode. Such an assumption hinders the autonomous learning of embodied agents due to the time-consuming and cumbersome workarounds for resetting in the physical world. Hence, there has… ▽ More

    Submitted 8 June, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: ICML 2023, first two authors contributed equally

  40. arXiv:2305.09858  [pdf, other

    cs.IR cs.AI cs.CL cs.LG

    Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs

    Authors: Jiao Chen, Luyi Ma, Xiaohan Li, Nikhil Thakurdesai, Jianpeng Xu, Jason H. D. Cho, Kaushiki Nag, Evren Korpeoglu, Sushant Kumar, Kannan Achan

    Abstract: Knowledge Graphs (KGs) play a crucial role in enhancing e-commerce system performance by providing structured information about entities and their relationships, such as complementary or substitutable relations between products or product types, which can be utilized in recommender systems. However, relation labeling in KGs remains a challenging task due to the dynamic nature of e-commerce domains… ▽ More

    Submitted 16 May, 2023; originally announced May 2023.

  41. arXiv:2305.08269  [pdf, other

    cs.CC cs.CG cs.DS

    The Sharp Power Law of Local Search on Expanders

    Authors: Simina Brânzei, Davin Choo, Nicholas Recker

    Abstract: Local search is a powerful heuristic in optimization and computer science, the complexity of which was studied in the white box and black box models. In the black box model, we are given a graph $G = (V,E)$ and oracle access to a function $f : V \to \mathbb{R}$. The local search problem is to find a vertex $v$ that is a local minimum, i.e. with $f(v) \leq f(u)$ for all $(u,v) \in E$, using as few… ▽ More

    Submitted 15 August, 2023; v1 submitted 14 May, 2023; originally announced May 2023.

  42. arXiv:2305.04445  [pdf, other

    cs.LG cs.AI cs.DS stat.ML

    New metrics and search algorithms for weighted causal DAGs

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Recovering causal relationships from data is an important problem. Using observational data, one can typically only recover causal graphs up to a Markov equivalence class and additional assumptions or interventional data are needed for complete recovery. In this work, under some standard assumptions, we study causal graph discovery via adaptive interventions with node-dependent interventional cost… ▽ More

    Submitted 29 May, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: Accepted into ICML 2023

  43. arXiv:2305.01905  [pdf, other

    cs.CV

    Localization using Multi-Focal Spatial Attention for Masked Face Recognition

    Authors: Yooshin Cho, Hanbyel Cho, Hyeong Gwon Hong, Jaesung Ahn, Dongmin Cho, JungWoo Chang, Junmo Kim

    Abstract: Since the beginning of world-wide COVID-19 pandemic, facial masks have been recommended to limit the spread of the disease. However, these masks hide certain facial attributes. Hence, it has become difficult for existing face recognition systems to perform identity verification on masked faces. In this context, it is necessary to develop masked Face Recognition (MFR) for contactless biometric reco… ▽ More

    Submitted 7 September, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

    Comments: Accepted at FG 2023 - InterID Workshop

  44. arXiv:2304.06818  [pdf, other

    cs.CV

    Soundini: Sound-Guided Diffusion for Natural Video Editing

    Authors: Seung Hyun Lee, Sieun Kim, Innfarn Yoo, Feng Yang, Donghyeon Cho, Youngseo Kim, Huiwen Chang, Jinkyu Kim, Sangpil Kim

    Abstract: We propose a method for adding sound-guided visual effects to specific regions of videos with a zero-shot setting. Animating the appearance of the visual effect is challenging because each frame of the edited video should have visual changes while maintaining temporal consistency. Moreover, existing video editing solutions focus on temporal consistency across frames, ignoring the visual style vari… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

  45. arXiv:2301.11741  [pdf, other

    cs.LG cs.AI cs.RO

    Outcome-directed Reinforcement Learning by Uncertainty & Temporal Distance-Aware Curriculum Goal Generation

    Authors: Daesol Cho, Seungjae Lee, H. Jin Kim

    Abstract: Current reinforcement learning (RL) often suffers when solving a challenging exploration problem where the desired outcomes or high rewards are rarely observed. Even though curriculum RL, a framework that solves complex tasks by proposing a sequence of surrogate tasks, shows reasonable results, most of the previous works still have difficulty in proposing curriculum due to the absence of a mechani… ▽ More

    Submitted 20 February, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

    Comments: ICLR 2023 Spotlight. First two authors contributed equally

  46. arXiv:2301.03180  [pdf, other

    cs.LG cs.DS stat.ML

    Subset verification and search algorithms for causal DAGs

    Authors: Davin Choo, Kirankumar Shiragur

    Abstract: Learning causal relationships between variables is a fundamental task in causal inference and directed acyclic graphs (DAGs) are a popular choice to represent the causal relationships. As one can recover a causal graph only up to its Markov equivalence class from observations, interventions are often used for the recovery task. Interventions are costly in general and it is important to design algo… ▽ More

    Submitted 13 February, 2024; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted into AISTATS 2023 (https://aistats.org/aistats2023/accepted.html)

  47. arXiv:2211.16465  [pdf, other

    cs.HC

    "I Want to Figure Things Out": Supporting Exploration in Navigation for People with Visual Impairments

    Authors: Gaurav Jain, Yuanyang Teng, Dong Heon Cho, Yunhao Xing, Maryam Aziz, Brian A. Smith

    Abstract: Navigation assistance systems (NASs) aim to help visually impaired people (VIPs) navigate unfamiliar environments. Most of today's NASs support VIPs via turn-by-turn navigation, but a growing body of work highlights the importance of exploration as well. It is unclear, however, how NASs should be designed to help VIPs explore unfamiliar environments. In this paper, we perform a qualitative study t… ▽ More

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: To appear in the Proceedings of the ACM on Human-Computer Interaction, CSCW1, April 2023 issue. To be presented at CSCW 2023

  48. arXiv:2211.13291  [pdf, ps, other

    cs.LG cs.DS math.PR math.ST

    Learning and Testing Latent-Tree Ising Models Efficiently

    Authors: Davin Choo, Yuval Dagan, Constantinos Daskalakis, Anthimos Vardis Kandiros

    Abstract: We provide time- and sample-efficient algorithms for learning and testing latent-tree Ising models, i.e. Ising models that may only be observed at their leaf nodes. On the learning side, we obtain efficient algorithms for learning a tree-structured Ising model whose leaf node distribution is close in Total Variation Distance, improving on the results of prior work. On the testing side, we provide… ▽ More

    Submitted 10 July, 2023; v1 submitted 23 November, 2022; originally announced November 2022.

  49. arXiv:2211.07077  [pdf, other

    cs.CV

    IFQA: Interpretable Face Quality Assessment

    Authors: Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong

    Abstract: Existing face restoration models have relied on general assessment metrics that do not consider the characteristics of facial regions. Recent works have therefore assessed their methods using human studies, which is not scalable and involves significant effort. This paper proposes a novel face-centric metric based on an adversarial framework where a generator simulates face restoration and a discr… ▽ More

    Submitted 16 November, 2022; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: WACV 2023, Code: https://github.com/VCLLab/IFQA

  50. arXiv:2210.07760  [pdf, other

    cs.CV

    Lightweight Alpha Matting Network Using Distillation-Based Channel Pruning

    Authors: Donggeun Yoon, Jinsun Park, Donghyeon Cho

    Abstract: Recently, alpha matting has received a lot of attention because of its usefulness in mobile applications such as selfies. Therefore, there has been a demand for a lightweight alpha matting model due to the limited computational resources of commercial portable devices. To this end, we suggest a distillation-based channel pruning method for the alpha matting networks. In the pruning step, we remove… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: Accepted by ACCV2022