
Showing 1–50 of 418 results for author: Choi, D

  1. arXiv:2507.04329  [pdf, ps, other]

    cs.CL

    No Language Data Left Behind: A Comparative Study of CJK Language Datasets in the Hugging Face Ecosystem

    Authors: Dasol Choi, Woomyoung Park, Youngsook Song

    Abstract: Recent advances in Natural Language Processing (NLP) have underscored the crucial role of high-quality datasets in building large language models (LLMs). However, while extensive resources and analyses exist for English, the landscape for East Asian languages - particularly Chinese, Japanese, and Korean (CJK) - remains fragmented and underexplored, despite these languages together serving over 1.6…

    Submitted 6 July, 2025; originally announced July 2025.

  2. arXiv:2507.04327  [pdf, ps, other]

    cs.LG

    TinyProto: Communication-Efficient Federated Learning with Sparse Prototypes in Resource-Constrained Environments

    Authors: Gyuejeong Lee, Daeyoung Choi

    Abstract: Communication efficiency in federated learning (FL) remains a critical challenge for resource-constrained environments. While prototype-based FL reduces communication overhead by sharing class prototypes-mean activations in the penultimate layer-instead of model parameters, its efficiency decreases with larger feature dimensions and class counts. We propose TinyProto, which addresses these limitat…

    Submitted 6 July, 2025; originally announced July 2025.

  3. arXiv:2507.04310  [pdf, ps, other]

    cs.LG cs.DC

    Heterogeneous Federated Learning with Prototype Alignment and Upscaling

    Authors: Gyuejeong Lee, Jihwan Shin, Daeyoung Choi

    Abstract: Heterogeneity in data distributions and model architectures remains a significant challenge in federated learning (FL). Various heterogeneous FL (HtFL) approaches have recently been proposed to address this challenge. Among them, prototype-based FL (PBFL) has emerged as a practical framework that only shares per-class mean activations from the penultimate layer. However, PBFL approaches often suff…

    Submitted 6 July, 2025; originally announced July 2025.

  4. arXiv:2507.01308  [pdf, ps, other]

    cs.RO cs.CV

    LANet: A Lane Boundaries-Aware Approach For Robust Trajectory Prediction

    Authors: Muhammad Atta ur Rahman, Dooseop Choi, KyoungWook Min

    Abstract: Accurate motion forecasting is critical for safe and efficient autonomous driving, enabling vehicles to predict future trajectories and make informed decisions in complex traffic scenarios. Most of the current designs of motion prediction models are based on the major representation of lane centerlines, which limits their capability to capture critical road environments and traffic rules and const…

    Submitted 1 July, 2025; originally announced July 2025.

    Comments: Accepted at the 17th IEEE International Conference on Advanced Computational Intelligence (ICACI 2025)

  5. arXiv:2506.13921  [pdf, ps, other]

    math.OC physics.class-ph physics.space-ph

    A Study on Effective Initial Guess Finding Method Based on Bézier Curves: Orbit Determination Applications

    Authors: Daegyun Choi, Sungwook Yang, Henzeh Leeghim, Donghoon Kim

    Abstract: In celestial mechanics, proper orbits related to missions are obtained by solving two-point boundary value problems. Since a selection method of initial value affects the convergence of the solution, developing an effective method to find an initial guess is required. In this work, Bézier curves, which can describe complicated curves and surfaces, are utilized to find the initial guess. First, the…

    Submitted 16 June, 2025; originally announced June 2025.

    Comments: 10 pages, 4 figures, 4 tables, 2019 AAS/AIAA Astrodynamics Specialist Conference

  6. arXiv:2506.11344  [pdf, ps, other]

    cs.CL

    Do We Still Need Audio? Rethinking Speaker Diarization with a Text-Based Approach Using Multiple Prediction Models

    Authors: Peilin Wu, Jinho D. Choi

    Abstract: We present a novel approach to Speaker Diarization (SD) by leveraging text-based methods focused on Sentence-level Speaker Change Detection within dialogues. Unlike audio-based SD systems, which are often challenged by audio quality and speaker similarity, our approach utilizes the dialogue transcript alone. Two models are developed: the Single Prediction Model (SPM) and the Multiple Prediction Mo…

    Submitted 12 June, 2025; originally announced June 2025.

  7. arXiv:2506.03575  [pdf, ps, other]

    physics.optics

    Brillouin lasers in Bragg grating microresonators

    Authors: Ryan L. Russell, Moritz Merklein, Choon Kong Lai, Cong Tinh Bui, Alvaro Casas-Bedoya, Duk-Yong Choi, Stephen J. Madden, Benjamin J. Eggleton

    Abstract: Chip-scale coherent light sources are required in applications spanning metrology and sensing to telecommunications. Brillouin lasers (BLs) offer a route to ultra-coherent optical sources in compact microresonators with free spectral range (FSR) matched to the Brillouin frequency shift (BFS). However, BFS - FSR matching typically facilitates cascaded Brillouin scattering, constraining achievable B…

    Submitted 4 June, 2025; originally announced June 2025.

  8. arXiv:2506.01360  [pdf, ps, other]

    cs.LG

    RDB2G-Bench: A Comprehensive Benchmark for Automatic Graph Modeling of Relational Databases

    Authors: Dongwon Choi, Sunwoo Kim, Juyeon Kim, Kyungho Kim, Geon Lee, Shinhwan Kang, Myunghwan Kim, Kijung Shin

    Abstract: Relational databases (RDBs) are composed of interconnected tables, where relationships between them are defined through foreign keys. Recent research on applying machine learning to RDBs has explored graph-based representations of RDBs, where rows of tables are modeled as nodes, and foreign key relationships are modeled as edges. RDB-to-graph modeling helps capture cross-table dependencies, ultima…

    Submitted 2 June, 2025; originally announced June 2025.

    Comments: Code and datasets are in https://github.com/chlehdwon/RDB2G-Bench

  9. arXiv:2506.01206  [pdf, other]

    cs.CL cs.AI

    Mamba Drafters for Speculative Decoding

    Authors: Daewon Choi, Seunghyuk Oh, Saket Dingliwal, Jihoon Tack, Kyuyoung Kim, Woomin Song, Seojin Kim, Insu Han, Jinwoo Shin, Aram Galstyan, Shubham Katiyar, Sravan Babu Bodapati

    Abstract: Speculative decoding has emerged as a promising approach to accelerating large language model (LLM) generation using a fast drafter while maintaining alignment with the target model's distribution. However, existing approaches face a trade-off: external drafters offer flexibility but can suffer from slower drafting, while self-speculation methods use drafters tailored to the target model but requi…

    Submitted 1 June, 2025; originally announced June 2025.

  10. arXiv:2506.00481  [pdf, other]

    cs.CL cs.AI

    PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings

    Authors: Junseo Kim, Jongwook Han, Dongmin Choi, Jongwook Yoon, Eun-Ju Lee, Yohan Jo

    Abstract: Visual persuasion, which uses visual elements to influence cognition and behaviors, is crucial in fields such as advertising and political communication. With recent advancements in artificial intelligence, there is growing potential to develop persuasive systems that automatically generate persuasive images tailored to individuals. However, a significant bottleneck in this area is the lack of com…

    Submitted 31 May, 2025; originally announced June 2025.

    Comments: ACL 2025 Main. Code and dataset are released at: https://github.com/holi-lab/PVP_Personalized_Visual_Persuasion

  11. arXiv:2505.16348  [pdf, other]

    cs.CL

    Embodied Agents Meet Personalization: Exploring Memory Utilization for Personalized Assistance

    Authors: Taeyoon Kwon, Dongwook Choi, Sunghwan Kim, Hyojun Kim, Seungjun Moon, Beong-woo Kwak, Kuan-Hao Huang, Jinyoung Yeo

    Abstract: Embodied agents empowered by large language models (LLMs) have shown strong performance in household object rearrangement tasks. However, these tasks primarily focus on single-turn interactions with simplified instructions, which do not truly reflect the challenges of providing meaningful assistance to users. To provide personalized assistance, embodied agents must understand the unique semantics…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: Work in progress

  12. arXiv:2505.16267  [pdf, ps, other]

    physics.optics

    Rapid adiabatic couplers with arbitrary split ratios for broadband DWDM interleaver application

    Authors: Daehan Choi, Woo-Joo Kim, Young-Ik Sohn

    Abstract: We experimentally demonstrate compact and broadband rapid adiabatic couplers (RACs) with arbitrary power split ratios, achieved through the combination of translational offset and waveguide width control. Fabricated RACs of four different target split ratios show power splitting within $\pm$3% of the design target over a 160 nm wavelength range. Using these RACs, we implement an 8-channel dense…

    Submitted 22 May, 2025; originally announced May 2025.

    Comments: 4 pages, 4 figures

  13. arXiv:2505.15685  [pdf, ps, other]

    cs.RO

    From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems

    Authors: Xiuchao Sui, Daiying Tian, Qi Sun, Ruirui Chen, Dongkyu Choi, Kenneth Kwok, Soujanya Poria

    Abstract: Foundation models (FMs) are increasingly used to bridge language and action in embodied agents, yet the operational characteristics of different FM integration strategies remain under-explored -- particularly for complex instruction following and versatile action generation in changing environments. This paper examines three paradigms for building robotic systems: end-to-end vision-language-action…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 17 pages, 13 figures

  14. arXiv:2505.15367  [pdf, ps, other]

    cs.CV cs.AI cs.CL

    Better Safe Than Sorry? Overreaction Problem of Vision Language Models in Visual Emergency Recognition

    Authors: Dasol Choi, Seunghyun Lee, Youngsook Song

    Abstract: Vision-Language Models (VLMs) have shown capabilities in interpreting visual content, but their reliability in safety-critical everyday life scenarios remains insufficiently explored. We introduce VERI (Visual Emergency Recognition Dataset), a diagnostic benchmark comprising 200 images organized into 100 contrastive pairs. Each emergency scene is paired with a visually similar but safe counterpart…

    Submitted 6 July, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

  15. arXiv:2505.15277  [pdf, other]

    cs.CL

    Web-Shepherd: Advancing PRMs for Reinforcing Web Agents

    Authors: Hyungjoo Chae, Sunghwan Kim, Junhee Cho, Seungone Kim, Seungjun Moon, Gyeom Hwangbo, Dongha Lim, Minjin Kim, Yeonjun Hwang, Minju Gwak, Dongwook Choi, Minseok Kang, Gwanhoon Im, ByeongUng Cho, Hyojun Kim, Jun Hee Han, Taeyoon Kwon, Minju Kim, Beong-woo Kwak, Dongjin Kang, Jinyoung Yeo

    Abstract: Web navigation is a unique domain that can automate many repetitive real-life tasks and is challenging as it requires long-horizon sequential decision making beyond typical multimodal large language model (MLLM) tasks. Yet, specialized reward models for web navigation that can be utilized during both training and test-time have been absent until now. Despite the importance of speed and cost-effect…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Work in progress

  16. arXiv:2505.15160  [pdf, other]

    cs.CV

    Lossless Token Merging Even Without Fine-Tuning in Vision Transformers

    Authors: Jaeyeon Lee, Dong-Wan Choi

    Abstract: Although Vision Transformers (ViTs) have become the standard architecture in computer vision, their massive sizes lead to significant computational overhead. Token compression techniques have attracted considerable attention to address this issue, but they often suffer from severe information loss, requiring extensive additional training to achieve practical performance. In this paper, we propose…

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: Under Review

  17. arXiv:2505.12686  [pdf, other]

    cs.LG cs.SD eess.AS

    RoVo: Robust Voice Protection Against Unauthorized Speech Synthesis with Embedding-Level Perturbations

    Authors: Seungmin Kim, Sohee Park, Donghyun Kim, Jisu Lee, Daeseon Choi

    Abstract: With the advancement of AI-based speech synthesis technologies such as Deep Voice, there is an increasing risk of voice spoofing attacks, including voice phishing and fake news, through unauthorized use of others' voices. Existing defenses that inject adversarial perturbations directly into audio signals have limited effectiveness, as these perturbations can easily be neutralized by speech enhance…

    Submitted 19 May, 2025; originally announced May 2025.

  18. arXiv:2505.10079  [pdf, other]

    cond-mat.mes-hall quant-ph

    Electron spin resonance with scanning tunneling microscopy: a tool for an on-surface quantum platform of identical qubits

    Authors: Deung-Jang Choi, Soo-hyon Phark, Andreas J. Heinrich, Nicolás Lorente

    Abstract: Integration of electron spin resonance (ESR) in a scanning tunneling microscope (STM) has enabled an all-electrical control of atomic and molecular spins on solid surfaces with atomic-scale precision and energy resolution beyond thermal limitations. Further, coherent manipulation and detection of individual spins in an ESR-STM establishes a powerful quantum platform, allowing for the implementatio…

    Submitted 15 May, 2025; originally announced May 2025.

  19. arXiv:2505.09428  [pdf, ps, other]

    quant-ph cond-mat.mes-hall

    Unraveling spin entanglement using quantum gates with scanning tunneling microscopy-driven electron spin resonance

    Authors: Eric D. Switzer, Jose Reina-Gálvez, Géza Giedke, Talat S. Rahman, Christoph Wolf, Deung-Jang Choi, Nicolás Lorente

    Abstract: Quantum entanglement is a fundamental resource for quantum information processing, and its controlled generation and detection remain key challenges in scalable quantum architectures. Here, we numerically demonstrate the deterministic generation of entangled spin states in a solid-state platform by implementing quantum gates via electron spin resonance combined with scanning tunneling microscopy (…

    Submitted 14 May, 2025; originally announced May 2025.

  20. arXiv:2505.08835  [pdf, other]

    cs.CR cs.AI cs.CV

    Robustness Analysis against Adversarial Patch Attacks in Fully Unmanned Stores

    Authors: Hyunsik Na, Wonho Lee, Seungdeok Roh, Sohee Park, Daeseon Choi

    Abstract: The advent of convenient and efficient fully unmanned stores equipped with artificial intelligence-based automated checkout systems marks a new era in retail. However, these systems have inherent artificial intelligence security vulnerabilities, which are exploited via adversarial patch attacks, particularly in physical environments. This study demonstrated that adversarial patches can severely di…

    Submitted 13 May, 2025; originally announced May 2025.

  21. Phantom Domain Finite Element Method: A novel approach for heterogeneous materials

    Authors: Tianlong He, Philippe Karamian-Surville, Daniel Choï

    Abstract: In this paper, we introduce the Phantom Domain Finite Element Method (PDFEM), a novel computational approach tailored for the efficient analysis of heterogeneous and composite materials. Inspired by fictitious domain methods, this method employs a structured mesh to discretize the entire material domain while utilizing separate, independent meshes for the inclusions. These inclusion meshes are cou…

    Submitted 4 May, 2025; originally announced May 2025.

  22. arXiv:2505.01015  [pdf, ps, other]

    cs.CL cs.AI

    Value Portrait: Assessing Language Models' Values through Psychometrically and Ecologically Valid Items

    Authors: Jongwook Han, Dongmin Choi, Woojung Song, Eun-Ju Lee, Yohan Jo

    Abstract: The importance of benchmarks for assessing the values of language models has been pronounced due to the growing need of more authentic, human-aligned responses. However, existing benchmarks rely on human or machine annotations that are vulnerable to value-related biases. Furthermore, the tested scenarios often diverge from real-world contexts in which models are commonly used to generate text and…

    Submitted 11 June, 2025; v1 submitted 2 May, 2025; originally announced May 2025.

    Comments: This paper has been accepted for publication at ACL 2025

    ACM Class: I.2.7

  23. arXiv:2505.00043  [pdf, ps, other]

    q-bio.QM

    EchoNet-Quality: Denoising Echocardiograms via Deep Generative Modeling of Ultrasound Noise

    Authors: David Choi, Milos Vukadinovic, Bryan He, Christina Binder, Yuki Sahashi, David Ouyang

    Abstract: Echocardiography (echo), or cardiac ultrasound, is the most widely used imaging modality for cardiac form and function due to its relatively low cost, rapid acquisition time, and non-invasive nature. However, ultrasound acquisitions are often limited by artifacts and noise that hinder diagnostic interpretation in clinical settings. Existing methodologies for denoising echos consist solely of tradi…

    Submitted 19 June, 2025; v1 submitted 29 April, 2025; originally announced May 2025.

  24. arXiv:2504.21851  [pdf, other]

    cs.CL cs.AI

    TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments

    Authors: Sichang Tu, Abigail Powers, Stephen Doogan, Jinho D. Choi

    Abstract: Objectives: While Large Language Models (LLMs) have been widely used to assist clinicians and support patients, no existing work has explored dialogue systems for standard diagnostic interviews and assessments. This study aims to bridge the gap in mental healthcare accessibility by developing an LLM-powered dialogue system that replicates clinician behavior. Materials and Methods: We introduce TRU…

    Submitted 30 April, 2025; originally announced April 2025.

    Comments: 5 figures, 4 tables

  25. arXiv:2504.20566  [pdf, ps, other]

    cs.LG cs.AI

    Inclusive Training Separation and Implicit Knowledge Interaction for Balanced Online Class-Incremental Learning

    Authors: Shunjie Wen, Thomas Heinis, Dong-Wan Choi

    Abstract: Online class-incremental learning (OCIL) focuses on gradually learning new classes (called plasticity) from a stream of data in a single-pass, while concurrently preserving knowledge of previously learned classes (called stability). The primary challenge in OCIL lies in maintaining a good balance between the knowledge of old and new classes within the continually updated model. Most existing metho…

    Submitted 29 April, 2025; originally announced April 2025.

    Comments: Under review

  26. arXiv:2504.18474  [pdf, other]

    cs.CL

    Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions

    Authors: James D. Finch, Yasasvi Josyula, Jinho D. Choi

    Abstract: In task-oriented dialogue (TOD) systems, Slot Schema Induction (SSI) is essential for automatically identifying key information slots from dialogue data without manual intervention. This paper presents a novel state-of-the-art (SoTA) approach that formulates SSI as a text generation task, where a language model incrementally constructs and refines a slot schema over a stream of dialogue data. To d…

    Submitted 25 April, 2025; originally announced April 2025.

    Comments: Accepted (B) to TACL 2025

  27. arXiv:2504.13969  [pdf, other]

    cs.HC cs.AI cs.CY

    Tinker Tales: Interactive Storytelling Framework for Early Childhood Narrative Development and AI Literacy

    Authors: Nayoung Choi, Peace Cyebukayire, Jinho D. Choi

    Abstract: This paper presents Tinker Tales, an interactive storytelling framework in the format of a board game, designed to support both narrative development and AI literacy in early childhood. The framework integrates tangible and speech-based interactions with AI through NFC chip-attached pawns and tokens, along with a speaker and microphone. Children select and define key story elements-such as charact…

    Submitted 22 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

  28. arXiv:2504.13439  [pdf, ps, other]

    cs.CL

    D-GEN: Automatic Distractor Generation and Evaluation for Reliable Assessment of Generative Model

    Authors: Grace Byun, Jinho D. Choi

    Abstract: Evaluating generative models with open-ended generation is challenging due to inconsistencies in response formats. Multiple-choice (MC) evaluation mitigates this issue, but generating high-quality distractors is time-consuming and labor-intensive. We introduce D-GEN, the first open-source distractor generator model that transforms open-ended data into an MC format. To evaluate distractor quality,…

    Submitted 12 June, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: ACL 2025 Findings

    Journal ref: ACL 2025 Findings

  29. arXiv:2504.13425  [pdf, other]

    cs.CL

    Secure Multifaceted-RAG for Enterprise: Hybrid Knowledge Retrieval with Security Filtering

    Authors: Grace Byun, Shinsun Lee, Nayoung Choi, Jinho D. Choi

    Abstract: Existing Retrieval-Augmented Generation (RAG) systems face challenges in enterprise settings due to limited retrieval scope and data security risks. When relevant internal documents are unavailable, the system struggles to generate accurate and complete responses. Additionally, using closed-source Large Language Models (LLMs) raises concerns about exposing proprietary information. To address these…

    Submitted 17 April, 2025; originally announced April 2025.

  30. arXiv:2504.12870  [pdf, other]

    eess.AS

    CST-former: Multidimensional Attention-based Transformer for Sound Event Localization and Detection in Real Scenes

    Authors: Yusun Shul, Dayun Choi, Jung-Woo Choi

    Abstract: Sound event localization and detection (SELD) is a task for the classification of sound events and the identification of direction of arrival (DoA) utilizing multichannel acoustic signals. For effective classification and localization, a channel-spectro-temporal transformer (CST-former) was suggested. CST-former employs multidimensional attention mechanisms across the spatial, spectral, and tempor…

    Submitted 17 April, 2025; originally announced April 2025.

    Comments: 12 pages, 10 figures, Submitted to IEEE/ACM Transactions on Audio, Speech, and Language Processing

  31. arXiv:2504.02877  [pdf, other]

    cs.CL

    Revisiting Funnel Transformers for Modern LLM Architectures with Comprehensive Ablations in Training and Inference Configurations

    Authors: DongHyun Choi, Lucas Spangher, Chris Hidey, Peter Grabowski, Ramy Eskander

    Abstract: Transformer-based Large Language Models, which suffer from high computational costs, advance so quickly that techniques proposed to streamline earlier iterations are not guaranteed to benefit more modern models. Building upon the Funnel Transformer proposed by Dai and Le (2020), which progressively compresses intermediate representations, we investigate the impact of funneling in contemporary Gemm…

    Submitted 1 April, 2025; originally announced April 2025.

  32. arXiv:2503.22968  [pdf, ps, other]

    cs.CE cs.AI cs.CL

    Redefining Evaluation Standards: A Unified Framework for Evaluating the Korean Capabilities of Language Models

    Authors: Hanwool Lee, Dasol Choi, Sooyong Kim, Ilgyun Jung, Sangwon Baek, Guijin Son, Inseon Hwang, Naeun Lee, Seunghyeok Hong

    Abstract: Recent advancements in Korean large language models (LLMs) have driven numerous benchmarks and evaluation methods, yet inconsistent protocols cause up to 10 p.p performance gaps across institutions. Overcoming these reproducibility gaps does not mean enforcing a one-size-fits-all evaluation. Rather, effective benchmarking requires diverse experimental approaches and a framework robust enough to su…

    Submitted 29 June, 2025; v1 submitted 29 March, 2025; originally announced March 2025.

  33. arXiv:2503.22194  [pdf, other]

    cs.CV cs.LG

    ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation

    Authors: Yunhong Min, Daehyeon Choi, Kyeongmin Yeo, Jihyun Lee, Minhyuk Sung

    Abstract: We introduce ORIGEN, the first zero-shot method for 3D orientation grounding in text-to-image generation across multiple objects and diverse categories. While previous work on spatial grounding in image generation has mainly focused on 2D positioning, it lacks control over 3D orientation. To address this, we propose a reward-guided sampling approach using a pretrained discriminative model for 3D o…

    Submitted 28 May, 2025; v1 submitted 28 March, 2025; originally announced March 2025.

    Comments: Project Page: https://origen2025.github.io

  34. arXiv:2503.13936  [pdf, other]

    cond-mat.str-el

    Time-domain identification of distinct mechanisms for competing charge density waves in a rare-earth tritelluride

    Authors: Yifan Su, B. Q. Lv, Alfred Zong, Aaron Müller, Sambuddha Chattopadhyay, Pavel E. Dolgirev, Anisha G. Singh, Joshua A. W. Straquadine, Dongsung Choi, Doron Azoury, Masataka Mogi, Ian R. Fisher, Eugene Demler, Nuh Gedik

    Abstract: Understanding the origin of phase transitions and the interactions between distinct phases remains a central task in condensed matter physics. Charge density wave (CDW) systems provide an ideal platform for investigating these phenomena. While the dominant CDW phases in many materials can be explained through Fermi surface nesting or electron-phonon interactions, certain CDW phase transitions rema…

    Submitted 25 March, 2025; v1 submitted 18 March, 2025; originally announced March 2025.

  35. Expandora: Broadening Design Exploration with Text-to-Image Model

    Authors: DaEun Choi, Kihoon Son, Hyunjoon Jung, Juho Kim

    Abstract: Broad exploration of references is critical in the visual design process. While text-to-image (T2I) models offer efficiency and customization of exploration, they often limit support for divergence in exploration. We conducted a formative study (N=6) to investigate the limitations of current interaction with the T2I model for broad exploration and found that designers struggle to articulate explor…

    Submitted 2 March, 2025; originally announced March 2025.

    Comments: Accepted to CHI'25 LBW

  36. arXiv:2502.21270  [pdf, ps, other]

    math.AG math.QA

    Conformal Block Divisors for Discrete Series Virasoro VOA $\text{Vir}_{2k+1,2}$

    Authors: Daebeom Choi

    Abstract: In this work, we study a family of vector bundles on the moduli space of curves constructed from representations of $\text{Vir}_{2k+1,2}$, a family of vertex operator algebras derived from the Virasoro Lie algebra. Using the relationship between rank and degree, we characterize their asymptotic behavior, demonstrating that their first Chern classes are nef on $\overline{\rm{M}}_{g,n}$ in many case…

    Submitted 28 February, 2025; originally announced February 2025.

    Comments: 37 pages. Comments Welcome!

    MSC Class: 14H10; 17B69 (primary); 81R10 (secondary)

  37. arXiv:2502.19703  [pdf, ps, other]

    math.AG

    Singularities and syzygies of secant varieties of smooth projective varieties

    Authors: Doyoung Choi, Justin Lacini, Jinhyung Park, John Sheridan

    Abstract: We study the higher secant varieties of a smooth projective variety embedded in projective space. We prove that when the variety is a surface and the embedding line bundle is sufficiently positive, these varieties are normal with Du Bois singularities and the syzygies of their defining ideals are linear to the expected order. We show that the cohomology of the structure sheaf of the surface comple…

    Submitted 26 February, 2025; originally announced February 2025.

    Comments: 77 pages. Comments welcome

  38. arXiv:2502.19596  [pdf]

    cs.AI cs.IR

    Reference-Aligned Retrieval-Augmented Question Answering over Heterogeneous Proprietary Documents

    Authors: Nayoung Choi, Grace Byun, Andrew Chung, Ellie S. Paek, Shinsun Lee, Jinho D. Choi

    Abstract: Proprietary corporate documents contain rich domain-specific knowledge, but their overwhelming volume and disorganized structure make it difficult even for employees to access the right information when needed. For example, in the automotive industry, vehicle crash-collision tests, each costing hundreds of thousands of dollars, produce highly detailed documentation. However, retrieving relevant co…

    Submitted 16 June, 2025; v1 submitted 26 February, 2025; originally announced February 2025.

    MSC Class: H.3

  39. arXiv:2502.14800  [pdf]

    cond-mat.mtrl-sci cond-mat.mes-hall physics.optics

    Discovery of transient topological crystalline order in optically driven SnSe

    Authors: Masataka Mogi, Dongsung Choi, Kyoung Hun Oh, Diana Golovanova, Yufei Zhao, Yifan Su, Zongqi Shen, Doron Azoury, Haoyu Xia, Batyr Ilyas, Tianchuang Luo, Noriaki Kida, Taito Osaka, Tadashi Togashi, Binghai Yan, Nuh Gedik

    Abstract: Ultrafast optical excitation provides a powerful route for accessing emergent quantum phases far from equilibrium, enabling transient light-induced phenomena such as magnetism, ferroelectricity, and superconductivity. However, extending this approach to induce topological phases, especially in conventional semiconductors, remains challenging. Here, we report the observation of a thermally inaccess…

    Submitted 16 May, 2025; v1 submitted 20 February, 2025; originally announced February 2025.

    Comments: 27 pages, 5 figures

  40. arXiv:2502.08861  [pdf, other]

    quant-ph cond-mat.mes-hall

    Two-dimensional Si spin qubit arrays with multilevel interconnects

    Authors: Sieu D. Ha, Edwin Acuna, Kate Raach, Zachery T. Bloom, Teresa L. Brecht, James M. Chappell, Maxwell D. Choi, Justin E. Christensen, Ian T. Counts, Dominic Daprano, J. P. Dodson, Kevin Eng, David J. Fialkow, Christina A. C. Garcia, Wonill Ha, Thomas R. B. Harris, nathan holman, Isaac Khalaf, Justine W. Matten, Christi A. Peterson, Clifford E. Plesha, Matthew J. Ruiz, Aaron Smith, Bryan J. Thomas, Samuel J. Whiteley, et al. (4 additional authors not shown)

    Abstract: The promise of quantum computation is contingent upon physical qubits with both low gate error rate and broad scalability. Silicon-based spins are a leading qubit platform, but demonstrations to date have not utilized fabrication processes capable of extending arrays in two dimensions while maintaining complete control of individual spins. Here, we implement an interconnect process, common in semi…

    Submitted 12 February, 2025; originally announced February 2025.

  41. arXiv:2502.08474  [pdf, other]

    cs.LG cs.AI cs.CV

    Training-Free Restoration of Pruned Neural Networks

    Authors: Keonho Lee, Minsoo Kim, Dong-Wan Choi

    Abstract: Although network pruning has been highly popularized to compress deep neural networks, its resulting accuracy heavily depends on a fine-tuning process that is often computationally expensive and requires the original data. However, this may not be the case in real-world scenarios, and hence a few recent works attempt to restore pruned networks without any expensive retraining process. Their strong…

    Submitted 6 February, 2025; originally announced February 2025.

    Comments: Under Review in TNNLS since May 2022

  42. arXiv:2502.03984  [pdf, other]

    cs.CL cs.AI

    PGB: One-Shot Pruning for BERT via Weight Grouping and Permutation

    Authors: Hyemin Lim, Jaeyeon Lee, Dong-Wan Choi

    Abstract: Large pretrained language models such as BERT suffer from slow inference and high memory usage, due to their huge size. Recent approaches to compressing BERT rely on iterative pruning and knowledge distillation, which, however, are often too complicated and computationally intensive. This paper proposes a novel semi-structured one-shot pruning method for BERT, called…

    Submitted 6 February, 2025; originally announced February 2025.

  43. arXiv:2502.00196  [pdf, other]

    cs.CV cs.AI cs.CL

    DermaSynth: Rich Synthetic Image-Text Pairs Using Open Access Dermatology Datasets

    Authors: Abdurrahim Yilmaz, Furkan Yuceyalcin, Ece Gokyayla, Donghee Choi, Ozan Erdem, Ali Anil Demircali, Rahmetullah Varol, Ufuk Gorkem Kirabali, Gulsum Gencoglan, Joram M. Posma, Burak Temelkuran

    Abstract: A major barrier to developing vision large language models (LLMs) in dermatology is the lack of a large image--text pair dataset. We introduce DermaSynth, a dataset comprising 92,020 synthetic image--text pairs curated from 45,205 images (13,568 clinical and 35,561 dermatoscopic) for dermatology-related clinical tasks. Leveraging state-of-the-art LLMs, using Gemini 2.0, we used clinically relate…

    Submitted 4 March, 2025; v1 submitted 31 January, 2025; originally announced February 2025.

    Comments: 12 pages, 4 figures

  44. arXiv:2501.16769  [pdf, ps, other]

    cs.CV

    Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models

    Authors: Muhammad Atta ur Rahman, Dooseop Choi, Seung-Ik Lee, KyoungWook Min

    Abstract: Open-vocabulary semantic segmentation attempts to classify and outline objects in an image using arbitrary text labels, including those unseen during training. Self-supervised learning resolves numerous visual and linguistic processing problems when effectively trained. This study investigates simple yet efficient methods for adapting previously learned foundation models for open-vocabulary semant…

    Submitted 1 July, 2025; v1 submitted 28 January, 2025; originally announced January 2025.

    Comments: Accepted at the 17th IEEE International Conference on Advanced Computational Intelligence (ICACI 2025)

  45. arXiv:2501.16726  [pdf, other]

    cs.IT cs.AI cs.NI

    Bridging Neural Networks and Wireless Systems with MIMO-OFDM Semantic Communications

    Authors: Hanju Yoo, Dongha Choi, Yonghwi Kim, Yoontae Kim, Songkuk Kim, Chan-Byoung Chae, Robert W. Heath Jr

    Abstract: Semantic communications aim to enhance transmission efficiency by jointly optimizing source coding, channel coding, and modulation. While prior research has demonstrated promising performance in simulations, real-world implementations often face significant challenges, including noise variability and nonlinear distortions, leading to performance gaps. This article investigates these challenges in…

    Submitted 28 January, 2025; originally announced January 2025.

    Comments: 7 pages, 5 figures

  46. arXiv:2501.11055  [pdf, ps, other]

    math.AG

    Singularities of the nested Hilbert scheme of points of length 3, 4

    Authors: Doyoung Choi

    Abstract: We show that the projection morphism $X^{[3,4]} \longrightarrow X^{[3]}$ is flat even if it has a reducible fiber. After showing that the fiber of the residual morphism $\textrm{res}_{3,4} :X^{[3,4]} \longrightarrow X$ has rational singularities, we conclude that $X^{[3,4]}$ has canonical Gorenstein singularities. As a corollary, we specify the singularities of several nested Hilbert schemes.

    Submitted 19 January, 2025; originally announced January 2025.

    Comments: 15 pages

    MSC Class: Primary: 14C05; Secondary: 14E18

  47. arXiv:2501.05712  [pdf, other]

    cs.CL

    Multi-Step Reasoning in Korean and the Emergent Mirage

    Authors: Guijin Son, Hyunwoo Ko, Dasol Choi

    Abstract: We introduce HRMCR (HAE-RAE Multi-Step Commonsense Reasoning), a benchmark designed to evaluate large language models' ability to perform multi-step reasoning in culturally specific contexts, focusing on Korean. The questions are automatically generated via templates and algorithms, requiring LLMs to integrate Korean cultural knowledge into sequential reasoning steps. Consistent with prior observa…

    Submitted 12 March, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

    Comments: C3NLP @ NAACL 2025

  48. arXiv:2501.03441  [pdf, other]

    cs.CL

    Finding A Voice: Evaluating African American Dialect Generation for Chatbot Technology

    Authors: Sarah E. Finch, Ellie S. Paek, Sejung Kwon, Ikseon Choi, Jessica Wells, Rasheeta Chandler, Jinho D. Choi

    Abstract: As chatbots become increasingly integrated into everyday tasks, designing systems that accommodate diverse user populations is crucial for fostering trust, engagement, and inclusivity. This study investigates the ability of contemporary Large Language Models (LLMs) to generate African American Vernacular English (AAVE) and evaluates the impact of AAVE usage on user experiences in chatbot applicati…

    Submitted 6 January, 2025; originally announced January 2025.

  49. arXiv:2501.02448  [pdf, other]

    cs.CL

    Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap

    Authors: Hyunwoo Ko, Guijin Son, Dasol Choi

    Abstract: Large language models (LLMs) demonstrate exceptional performance on complex reasoning tasks. However, despite their strong reasoning capabilities in high-resource languages (e.g., English and Chinese), a significant performance gap persists in other languages. To investigate this gap in Korean, we introduce HRM8K, a benchmark comprising 8,011 English-Korean parallel bilingual math problems. Throug…

    Submitted 31 January, 2025; v1 submitted 5 January, 2025; originally announced January 2025.

    Comments: 18 pages, 14 figures, 9 tables

  50. arXiv:2501.00271  [pdf, ps, other]

    math-ph math.RT

    Generalized finite and affine $W$-algebras in type $A$

    Authors: Dong Jun Choi, Alexander Molev, Uhi Rinn Suh

    Abstract: We construct a new family of affine $W$-algebras $W^k(λ,μ)$ parameterized by partitions $λ$ and $μ$ associated with the centralizers of nilpotent elements in $\mathfrak{gl}_N$. The new family unifies a few known classes of $W$-algebras. In particular, for the column-partition $λ$ we recover the affine $W$-algebras $W^k(\mathfrak{gl}_N,f)$ of Kac, Roan and Wakimoto, associated with nilpotent elemen…

    Submitted 30 December, 2024; originally announced January 2025.

    Comments: 29 pages