Skip to main content

Showing 1–50 of 515 results for author: Koo, J

.
  1. arXiv:2505.22960  [pdf, ps, other

    cs.AI cs.LG

    Revisiting Multi-Agent Debate as Test-Time Scaling: A Systematic Study of Conditional Effectiveness

    Authors: Yongjin Yang, Euiin Yi, Jongwoo Ko, Kimin Lee, Zhijing Jin, Se-Young Yun

    Abstract: The remarkable growth in large language model (LLM) capabilities has spurred exploration into multi-agent systems, with debate frameworks emerging as a promising avenue for enhanced problem-solving. These multi-agent debate (MAD) approaches, where agents collaboratively present, critique, and refine arguments, potentially offer improved reasoning, robustness, and diverse perspectives over monolith… ▽ More

    Submitted 28 May, 2025; originally announced May 2025.

    Comments: Preprint, under review

  2. arXiv:2505.20770  [pdf, ps, other

    cs.SD cs.MM eess.AS

    Can Large Language Models Predict Audio Effects Parameters from Natural Language?

    Authors: Seungheon Doh, Junghyun Koo, Marco A. Martínez-Ramírez, Wei-Hsiang Liao, Juhan Nam, Yuki Mitsufuji

    Abstract: In music production, manipulating audio effects (Fx) parameters through natural language has the potential to reduce technical barriers for non-experts. We present LLM2Fx, a framework leveraging Large Language Models (LLMs) to predict Fx parameters directly from textual descriptions without requiring task-specific training or fine-tuning. Our approach address the text-to-effect parameter predictio… ▽ More

    Submitted 27 May, 2025; originally announced May 2025.

    Comments: Submitted to WASPAA 2025

  3. arXiv:2505.19427  [pdf, ps, other

    cs.LG cs.AI

    WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

    Authors: Sihan Chen, Dan Zhao, Jongwoo Ko, Colby Banbury, Huiping Zhuang, Luming Liang, Tianyi Chen

    Abstract: The growing computational demands of large language models (LLMs) make efficient inference and activation strategies increasingly critical. While recent approaches, such as Mixture-of-Experts (MoE), leverage selective activation but require specialized training, training-free sparse activation methods offer broader applicability and superior resource efficiency through their plug-and-play design.… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

  4. arXiv:2505.19401  [pdf, ps, other

    eess.AS

    Stack Less, Repeat More: A Block Reusing Approach for Progressive Speech Enhancement

    Authors: Jangyeon Kim, Ui-Hyeop Shin, Jaehyun Ko, Hyung-Min Park

    Abstract: This paper presents an efficient speech enhancement (SE) approach that reuses a processing block repeatedly instead of conventional stacking. Rather than increasing the number of blocks for learning deep latent representations, repeating a single block leads to progressive refinement while reducing parameter redundancy. We also minimize domain transformation by keeping an encoder and decoder shall… ▽ More

    Submitted 25 May, 2025; originally announced May 2025.

    Comments: Accepted to Interspeech 2025

  5. arXiv:2505.18601  [pdf, ps, other

    cs.CL cs.AI

    Flex-Judge: Think Once, Judge Anywhere

    Authors: Jongwoo Ko, Sungnyun Kim, Sungwoo Cho, Se-Young Yun

    Abstract: Human-generated reward signals are critical for aligning generative models with human preferences, guiding both training and inference-time evaluations. While large language models (LLMs) employed as proxy evaluators, i.e., LLM-as-a-Judge, significantly reduce the costs associated with manual annotations, they typically require extensive modality-specific training data and fail to generalize well… ▽ More

    Submitted 24 May, 2025; originally announced May 2025.

    Comments: The code is available at https://github.com/jongwooko/flex-judge

  6. arXiv:2505.15598  [pdf, ps, other

    math.CT math.AT

    Limits of $(\infty, 1)$-categories with structure and their lax morphisms

    Authors: Joanna Ko

    Abstract: Riehl and Verity have established that for a quasi-category $A$ that admits limits, and a homotopy coherent monad on $A$ which does not preserve limits, the Eilenberg-Moore object still admits limits; this can be interpreted as a completeness result involving lax morphisms. We generalise their result to different models for $(\infty, 1)$-categories, with an abundant variety of structures. For inst… ▽ More

    Submitted 21 May, 2025; originally announced May 2025.

    Comments: 56 pages

    MSC Class: 18N60

  7. arXiv:2505.11315  [pdf, other

    cs.SD cs.LG eess.AS

    Improving Inference-Time Optimisation for Vocal Effects Style Transfer with a Gaussian Prior

    Authors: Chin-Yun Yu, Marco A. Martínez-Ramírez, Junghyun Koo, Wei-Hsiang Liao, Yuki Mitsufuji, György Fazekas

    Abstract: Style Transfer with Inference-Time Optimisation (ST-ITO) is a recent approach for transferring the applied effects of a reference audio to a raw audio track. It optimises the effect parameters to minimise the distance between the style embeddings of the processed audio and the reference. However, this method treats all possible configurations equally and relies solely on the embedding space, which… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

    Comments: Submitted to WASPAA 2025

  8. arXiv:2505.10871  [pdf, other

    cs.CR cs.AI cs.CY

    Optimal Allocation of Privacy Budget on Hierarchical Data Release

    Authors: Joonhyuk Ko, Juba Ziani, Ferdinando Fioretto

    Abstract: Releasing useful information from datasets with hierarchical structures while preserving individual privacy presents a significant challenge. Standard privacy-preserving mechanisms, and in particular Differential Privacy, often require careful allocation of a finite privacy budget across different levels and components of the hierarchy. Sub-optimal allocation can lead to either excessive noise, re… ▽ More

    Submitted 16 May, 2025; originally announced May 2025.

  9. arXiv:2505.09163  [pdf, ps, other

    math.NT

    Inverse limits of CM points on certain Shimura varieties

    Authors: Ho Yun Jung, Ja Kyung Koo, Dong Hwa Shin

    Abstract: Let $N$ be a positive integer, and let $D\equiv0$ or $1\Mod{4}$ be a negative integer. We define the sets $\mathcal{CM}(D,\,Y_1(N)^\pm)$ and $\mathcal{CM}(D,\,Y(N)^\pm)$ as subsets of the Shimura varieties $Y_1(N)^\pm$ and $Y(N)^\pm$, respectively, consisting of CM points of discriminant $D$ that are primitive modulo $N$. By using the theory of definite form class groups, we show that the inverse… ▽ More

    Submitted 14 May, 2025; originally announced May 2025.

    MSC Class: 11R37; 11E57; 11G18

  10. arXiv:2505.06544  [pdf, ps, other

    eess.SP cs.NE

    Event-based Neural Spike Detection Using Spiking Neural Networks for Neuromorphic iBMI Systems

    Authors: Chanwook Hwang, Biyan Zhou, Ye Ke, Vivek Mohan, Jong Hwan Ko, Arindam Basu

    Abstract: Implantable brain-machine interfaces (iBMIs) are evolving to record from thousands of neurons wirelessly but face challenges in data bandwidth, power consumption, and implant size. We propose a novel Spiking Neural Network Spike Detector (SNN-SPD) that processes event-based neural data generated via delta modulation and pulse count modulation, converting signals into sparse events. By leveraging t… ▽ More

    Submitted 10 May, 2025; originally announced May 2025.

    Comments: 4 pages, 2 figures, to be published in 2025 IEEE International Symposium on Circuits and Systems (ISCAS) proceedings

  11. Scratch Copilot: Supporting Youth Creative Coding with AI

    Authors: Stefania Druga, Amy J. Ko

    Abstract: Creative coding platforms like Scratch have democratized programming for children, yet translating imaginative ideas into functional code remains a significant hurdle for many young learners. While AI copilots assist adult programmers, few tools target children in block-based environments. Building on prior research \cite{druga_how_2021,druga2023ai, druga2023scratch}, we present Cognimates Scratch… ▽ More

    Submitted 6 May, 2025; originally announced May 2025.

    Comments: 5 figures, 14 pages

  12. arXiv:2504.15558  [pdf, other

    math.ST

    Dynamical mean-field analysis of adaptive Langevin diffusions: Replica-symmetric fixed point and empirical Bayes

    Authors: Zhou Fan, Justin Ko, Bruno Loureiro, Yue M. Lu, Yandi Shen

    Abstract: In many applications of statistical estimation via sampling, one may wish to sample from a high-dimensional target distribution that is adaptively evolving to the samples already seen. We study an example of such dynamics, given by a Langevin diffusion for posterior sampling in a Bayesian linear regression model with i.i.d. regression design, whose prior continuously adapts to the Langevin traject… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  13. arXiv:2504.15556  [pdf, ps, other

    math.ST math.PR

    Dynamical mean-field analysis of adaptive Langevin diffusions: Propagation-of-chaos and convergence of the linear response

    Authors: Zhou Fan, Justin Ko, Bruno Loureiro, Yue M. Lu, Yandi Shen

    Abstract: Motivated by an application to empirical Bayes learning in high-dimensional regression, we study a class of Langevin diffusions in a system with random disorder, where the drift coefficient is driven by a parameter that continuously adapts to the empirical distribution of the realized process up to the current time. The resulting dynamics take the form of a stochastic interacting particle system h… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

  14. arXiv:2504.14914  [pdf, other

    astro-ph.IM astro-ph.GA

    K-DRIFT Preparation: Experimental Verification of an Observation Strategy for Accurate Dark-Sky Flats

    Authors: Woowon Byun, Kwang-Il Seon, Jongwan Ko

    Abstract: Despite its scientific importance, the low-surface-brightness universe has yet to be fully explored due to various systematic uncertainties that affect the achievable surface-brightness limit. Reducing these uncertainties requires very accurate data processing. The dark-sky flat is a widely used calibration frame for accurate flat-field correction, generated by combining the sky background from sc… ▽ More

    Submitted 21 April, 2025; originally announced April 2025.

    Comments: 22 pages, 15 figures, Accepted for publication in PASP

  15. arXiv:2504.14735  [pdf, other

    cs.SD eess.AS

    DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions

    Authors: Chin-Yun Yu, Marco A. Martínez-Ramírez, Junghyun Koo, Ben Hayes, Wei-Hsiang Liao, György Fazekas, Yuki Mitsufuji

    Abstract: This study introduces a novel and interpretable model, DiffVox, for matching vocal effects in music production. DiffVox, short for ``Differentiable Vocal Fx", integrates parametric equalisation, dynamic range control, delay, and reverb with efficient differentiable implementations to enable gradient-based optimisation for parameter estimation. Vocal presets are retrieved from two datasets, compris… ▽ More

    Submitted 20 April, 2025; originally announced April 2025.

    Comments: Submitted to DAFx 2025

  16. arXiv:2504.14123  [pdf, other

    cs.AI cs.CL cs.CV

    Bayesian Principles Improve Prompt Learning In Vision-Language Models

    Authors: Mingyu Kim, Jongwoo Ko, Mijung Park

    Abstract: Prompt learning is a popular fine-tuning method for vision-language models due to its efficiency. It requires a small number of additional learnable parameters while significantly enhancing performance on target tasks. However, most existing methods suffer from overfitting to fine-tuning data, yielding poor generalizability. To address this, we propose a new training objective function based on a… ▽ More

    Submitted 18 April, 2025; originally announced April 2025.

    Comments: AISTATS2025

  17. arXiv:2504.02480  [pdf, other

    cs.CV cs.AI cs.LG

    Graph Attention-Driven Bayesian Deep Unrolling for Dual-Peak Single-Photon Lidar Imaging

    Authors: Kyungmin Choi, JaKeoung Koo, Stephen McLaughlin, Abderrahim Halimi

    Abstract: Single-photon Lidar imaging offers a significant advantage in 3D imaging due to its high resolution and long-range capabilities, however it is challenging to apply in noisy environments with multiple targets per pixel. To tackle these challenges, several methods have been proposed. Statistical methods demonstrate interpretability on the inferred parameters, but they are often limited in their abil… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

  18. arXiv:2503.23371  [pdf, other

    cs.CL cs.AI

    FeRG-LLM : Feature Engineering by Reason Generation Large Language Models

    Authors: Jeonghyun Ko, Gyeongyun Park, Donghoon Lee, Kyunam Lee

    Abstract: One of the key tasks in machine learning for tabular data is feature engineering. Although it is vital for improving the performance of models, it demands considerable human expertise and deep domain knowledge, making it labor-intensive endeavor. To address this issue, we propose a novel framework, \textbf{FeRG-LLM} (\textbf{Fe}ature engineering by \textbf{R}eason \textbf{G}eneration \textbf{L}arg… ▽ More

    Submitted 30 March, 2025; originally announced March 2025.

    Comments: Accepted to NAACL 2025 Findings

  19. arXiv:2503.21721  [pdf, other

    cs.CV

    Evaluating Text-to-Image Synthesis with a Conditional Fréchet Distance

    Authors: Jaywon Koo, Jefferson Hernandez, Moayed Haji-Ali, Ziyan Yang, Vicente Ordonez

    Abstract: Evaluating text-to-image synthesis is challenging due to misalignment between established metrics and human preferences. We propose cFreD, a metric based on the notion of Conditional Fréchet Distance that explicitly accounts for both visual fidelity and text-prompt alignment. Existing metrics such as Inception Score (IS), Fréchet Inception Distance (FID) and CLIPScore assess either image quality o… ▽ More

    Submitted 27 March, 2025; originally announced March 2025.

  20. arXiv:2503.19559  [pdf, other

    astro-ph.IM hep-ex

    Combined Annual Modulation Dark Matter Search with COSINE-100 and ANAIS-112

    Authors: N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. França, C. Ha, I. S. Hahn, S. J. Hollick, S. B. Hong, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (49 additional authors not shown)

    Abstract: The annual modulation signal, claimed to be consistent with dark matter as observed by DAMA/LIBRA in a sodium-iodide based detector, has persisted for over two decades. COSINE-100 and ANAIS-112 were designed to test the claim directly using the same target material. COSINE-100, located at Yangyang Underground Laboratory in South Korea, and ANAIS-112, located at Canfranc Underground Laboratory in S… ▽ More

    Submitted 25 March, 2025; originally announced March 2025.

    Comments: 6 pages, 4 figures, 3 tables

  21. arXiv:2503.16924  [pdf, other

    cs.CV

    Optimized Minimal 3D Gaussian Splatting

    Authors: Joo Chan Lee, Jong Hwan Ko, Eunbyung Park

    Abstract: 3D Gaussian Splatting (3DGS) has emerged as a powerful representation for real-time, high-performance rendering, enabling a wide range of applications. However, representing 3D scenes with numerous explicit Gaussian primitives imposes significant storage and memory overhead. Recent studies have shown that high-quality rendering can be achieved with a substantially reduced number of Gaussians when… ▽ More

    Submitted 21 March, 2025; originally announced March 2025.

    Comments: Project page: https://maincold2.github.io/omg/

  22. arXiv:2503.16814  [pdf, ps, other

    cs.LG cs.CL

    Understanding Bias Reinforcement in LLM Agents Debate

    Authors: Jihwan Oh, Minchan Jeong, Jongwoo Ko, Se-Young Yun

    Abstract: Large Language Models $($LLMs$)$ solve complex problems using training-free methods like prompt engineering and in-context learning, yet ensuring reasoning correctness remains challenging. While self-correction methods such as self-consistency and self-refinement aim to improve reliability, they often reinforce biases due to the lack of effective feedback mechanisms. Multi-Agent Debate $($MAD$)$ h… ▽ More

    Submitted 28 May, 2025; v1 submitted 20 March, 2025; originally announced March 2025.

    Comments: ICML 2025

  23. arXiv:2503.07067  [pdf, other

    cs.CL cs.AI cs.LG

    DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs

    Authors: Jongwoo Ko, Tianyi Chen, Sungnyun Kim, Tianyu Ding, Luming Liang, Ilya Zharkov, Se-Young Yun

    Abstract: Despite the success of distillation in large language models (LLMs), most prior work applies identical loss functions to both teacher- and student-generated data. These strategies overlook the synergy between loss formulations and data types, leading to a suboptimal performance boost in student models. To address this, we propose DistiLLM-2, a contrastive approach that simultaneously increases the… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: The code will be available soon at https://github.com/jongwooko/distillm-2

  24. arXiv:2503.01708  [pdf, other

    math.ST math.PR

    Pseudo-Maximum Likelihood Theory for High-Dimensional Rank One Inference

    Authors: Curtis Grant, Aukosh Jagannath, Justin Ko

    Abstract: We develop a pseudo-likelihood theory for rank one matrix estimation problems in the high dimensional limit. We prove a variational principle for the limiting pseudo-maximum likelihood which also characterizes the performance of the corresponding pseudo-maximum likelihood estimator. We show that this variational principle is universal and depends only on four parameters determined by the correspon… ▽ More

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: 52 pages, 2 figures

  25. arXiv:2503.01107  [pdf, other

    cs.CV

    VideoHandles: Editing 3D Object Compositions in Videos Using Video Generative Priors

    Authors: Juil Koo, Paul Guerrero, Chun-Hao Paul Huang, Duygu Ceylan, Minhyuk Sung

    Abstract: Generative methods for image and video editing use generative models as priors to perform edits despite incomplete information, such as changing the composition of 3D objects shown in a single image. Recent methods have shown promising composition editing results in the image setting, but in the video setting, editing methods have focused on editing object's appearance and motion, or camera motion… ▽ More

    Submitted 26 March, 2025; v1 submitted 2 March, 2025; originally announced March 2025.

    Comments: Project page: https://videohandles.github.io

  26. arXiv:2502.17105  [pdf, other

    cs.CV cs.AI

    SFLD: Reducing the content bias for AI-generated Image Detection

    Authors: Seoyeon Gye, Junwon Ko, Hyounguk Shon, Minchan Kwon, Junmo Kim

    Abstract: Identifying AI-generated content is critical for the safe and ethical use of generative AI. Recent research has focused on developing detectors that generalize to unknown generators, with popular methods relying either on high-level features or low-level fingerprints. However, these methods have clear limitations: biased towards unseen content, or vulnerable to common image degradations, such as J… ▽ More

    Submitted 24 February, 2025; originally announced February 2025.

    Comments: IEEE/CVF WACV 2025, Oral

  27. arXiv:2502.08939  [pdf, other

    cs.SD cs.AI

    TokenSynth: A Token-based Neural Synthesizer for Instrument Cloning and Text-to-Instrument

    Authors: Kyungsu Kim, Junghyun Koo, Sungho Lee, Haesun Joung, Kyogu Lee

    Abstract: Recent advancements in neural audio codecs have enabled the use of tokenized audio representations in various audio generation tasks, such as text-to-speech, text-to-audio, and text-to-music generation. Leveraging this approach, we propose TokenSynth, a novel neural synthesizer that utilizes a decoder-only transformer to generate desired audio tokens from MIDI tokens and CLAP (Contrastive Language… ▽ More

    Submitted 12 February, 2025; originally announced February 2025.

    Comments: 5 pages, 1 figure, to be published in ICASSP 2025

  28. arXiv:2502.07842  [pdf, other

    cs.AR cs.AI cs.LG

    Column-wise Quantization of Weights and Partial Sums for Accurate and Efficient Compute-In-Memory Accelerators

    Authors: Jiyoon Kim, Kang Eun Jeon, Yulhwa Kim, Jong Hwan Ko

    Abstract: Compute-in-memory (CIM) is an efficient method for implementing deep neural networks (DNNs) but suffers from substantial overhead from analog-to-digital converters (ADCs), especially as ADC precision increases. Low-precision ADCs can reduce this overhead but introduce partial-sum quantization errors degrading accuracy. Additionally, low-bit weight constraints, imposed by cell limitations and the n… ▽ More

    Submitted 13 March, 2025; v1 submitted 11 February, 2025; originally announced February 2025.

  29. arXiv:2502.07834  [pdf, other

    cs.AR cs.AI cs.LG

    MEMHD: Memory-Efficient Multi-Centroid Hyperdimensional Computing for Fully-Utilized In-Memory Computing Architectures

    Authors: Do Yeong Kang, Yeong Hwan Oh, Chanwook Hwang, Jinhee Kim, Kang Eun Jeon, Jong Hwan Ko

    Abstract: The implementation of Hyperdimensional Computing (HDC) on In-Memory Computing (IMC) architectures faces significant challenges due to the mismatch between highdimensional vectors and IMC array sizes, leading to inefficient memory utilization and increased computation cycles. This paper presents MEMHD, a Memory-Efficient Multi-centroid HDC framework designed to address these challenges. MEMHD intro… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted to appear at DATE 2025

  30. arXiv:2502.07820  [pdf, other

    cs.AR cs.AI

    Low-Rank Compression for IMC Arrays

    Authors: Kang Eun Jeon, Johnny Rhe, Jong Hwan Ko

    Abstract: In this study, we address the challenge of low-rank model compression in the context of in-memory computing (IMC) architectures. Traditional pruning approaches, while effective in model size reduction, necessitate additional peripheral circuitry to manage complex dataflows and mitigate dislocation issues, leading to increased area and energy overheads. To circumvent these drawbacks, we propose lev… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

    Comments: Accepted to appear at DATE'25 (Lyon, France)

  31. arXiv:2502.04362  [pdf, other

    cs.CL cs.AI

    LLMs can be easily Confused by Instructional Distractions

    Authors: Yerin Hwang, Yongil Kim, Jahyun Koo, Taegwan Kang, Hyunkyung Bae, Kyomin Jung

    Abstract: Despite the fact that large language models (LLMs) show exceptional skill in instruction following tasks, this strength can turn into a vulnerability when the models are required to disregard certain instructions. Instruction-following tasks typically involve a clear task description and input text containing the target data to be processed. However, when the input itself resembles an instruction,… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 8 pages

  32. arXiv:2502.01031  [pdf, other

    cs.LG cs.SI

    DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous Relaxation

    Authors: Junghun Lee, Hyunju Kim, Fanchen Bu, Jihoon Ko, Kijung Shin

    Abstract: In social networks, people influence each other through social links, which can be represented as propagation among nodes in graphs. Influence minimization (IMIN) is the problem of manipulating the structures of an input graph (e.g., removing edges) to reduce the propagation among nodes. IMIN can represent time-critical real-world applications, such as rumor blocking, but IMIN is theoretically dif… ▽ More

    Submitted 2 February, 2025; originally announced February 2025.

    Comments: Accepted to AAAI'25

  33. arXiv:2501.13665  [pdf, other

    hep-ex

    Limits on WIMP dark matter with NaI(Tl) crystals in three years of COSINE-100 data

    Authors: G. H. Yu, N. Carlin, J. Y. Cho, J. J. Choi, S. Choi, A. C. Ezeribe, L. E. Franca, C. Ha, I. S. Hahn, S. J. Hollick, E. J. Jeon, H. W. Joo, W. G. Kang, M. Kauer, B. H. Kim, H. J. Kim, J. Kim, K. W. Kim, S. H. Kim, S. K. Kim, W. K. Kim, Y. D. Kim, Y. H. Kim, Y. J. Ko, D. H. Lee , et al. (34 additional authors not shown)

    Abstract: We report limits on WIMP dark matter derived from three years of data collected by the COSINE-100 experiment with NaI(Tl) crystals, achieving an improved energy threshold of 0.7 keV. This lowered threshold enhances sensitivity in the sub-GeV mass range, extending the reach for direct detection of low-mass dark matter. Although no excess of WIMP-like events was observed, the increased sensitivity e… ▽ More

    Submitted 23 January, 2025; originally announced January 2025.

  34. arXiv:2501.07824  [pdf, other

    cs.CL cs.AI cs.LG

    Real-time Verification and Refinement of Language Model Text Generation

    Authors: Joonho Ko, Jinheon Baek, Sung Ju Hwang

    Abstract: Large language models (LLMs) have shown remarkable performance across a wide range of natural language tasks. However, a critical challenge remains in that they sometimes generate factually incorrect answers. To address this, while many previous work has focused on identifying errors in their generation and further refining them, they are slow in deployment since they are designed to verify the re… ▽ More

    Submitted 13 April, 2025; v1 submitted 13 January, 2025; originally announced January 2025.

  35. arXiv:2501.00645  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    SoundBrush: Sound as a Brush for Visual Scene Editing

    Authors: Kim Sung-Bin, Kim Jun-Seong, Junseok Ko, Yewon Kim, Tae-Hyun Oh

    Abstract: We propose SoundBrush, a model that uses sound as a brush to edit and manipulate visual scenes. We extend the generative capabilities of the Latent Diffusion Model (LDM) to incorporate audio information for editing visual scenes. Inspired by existing image-editing works, we frame this task as a supervised learning problem and leverage various off-the-shelf models to construct a sound-paired visual… ▽ More

    Submitted 31 December, 2024; originally announced January 2025.

    Comments: AAAI 2025

  36. arXiv:2412.07475  [pdf, other

    math.CT

    Enhanced 2-categorical structures, two-dimensional limit sketches and the symmetry of internalisation

    Authors: Nathanael Arkor, John Bourke, Joanna Ko

    Abstract: Many structures of interest in two-dimensional category theory have aspects that are inherently strict. This strictness is not a limitation, but rather plays a fundamental role in the theory of such structures. For instance, a monoidal fibration is - crucially - a strict monoidal functor, rather than a pseudo or lax monoidal functor. Other examples include monoidal double categories, double fibrat… ▽ More

    Submitted 10 December, 2024; originally announced December 2024.

    Comments: 49 pages

    MSC Class: 18C10; 18C30; 18C40; 18D20; 18M65; 18N10

  37. arXiv:2412.07454  [pdf, other

    cs.LG cs.AI

    Tazza: Shuffling Neural Network Parameters for Secure and Private Federated Learning

    Authors: Kichang Lee, Jaeho Jin, JaeYeon Park, Songkuk Kim, JeongGil Ko

    Abstract: Federated learning enables decentralized model training without sharing raw data, preserving data privacy. However, its vulnerability towards critical security threats, such as gradient inversion and model poisoning by malicious clients, remain unresolved. Existing solutions often address these issues separately, sacrificing either system robustness or model accuracy. This work introduces Tazza, a… ▽ More

    Submitted 3 February, 2025; v1 submitted 10 December, 2024; originally announced December 2024.

    Comments: 27 pages, 18 figures

    MSC Class: 68T07 ACM Class: I.2.11

  38. arXiv:2412.02280  [pdf, other

    cs.AI cs.CV

    AH-OCDA: Amplitude-based Curriculum Learning and Hopfield Segmentation Model for Open Compound Domain Adaptation

    Authors: Jaehyun Choi, Junwon Ko, Dong-Jae Lee, Junmo Kim

    Abstract: Open compound domain adaptation (OCDA) is a practical domain adaptation problem that consists of a source domain, target compound domain, and unseen open domain. In this problem, the absence of domain labels and pixel-level segmentation labels for both compound and open domains poses challenges to the direct application of existing domain adaptation and generalization methods. To address this issu… ▽ More

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: WACV 2025

  39. arXiv:2412.02237  [pdf, other

    cs.CV cs.AI

    Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models

    Authors: Jungwon Park, Jungmin Ko, Dongnam Byun, Jangwon Suh, Wonjong Rhee

    Abstract: Recent text-to-image diffusion models leverage cross-attention layers, which have been effectively utilized to enhance a range of visual generative tasks. However, our understanding of cross-attention layers remains somewhat limited. In this study, we introduce a mechanistic interpretability approach for diffusion models by constructing Head Relevance Vectors (HRVs) that align with human-specified… ▽ More

    Submitted 24 February, 2025; v1 submitted 3 December, 2024; originally announced December 2024.

    Comments: Accepted by ICLR 2025

  40. arXiv:2411.16312  [pdf, other

    cs.CV

    EPS: Efficient Patch Sampling for Video Overfitting in Deep Super-Resolution Model Training

    Authors: Yiying Wei, Hadi Amirpour, Jong Hwan Ko, Christian Timmerer

    Abstract: Leveraging the overfitting property of deep neural networks (DNNs) is trending in video delivery systems to enhance quality within bandwidth limits. Existing approaches transmit overfitted super-resolution (SR) model streams for low-resolution (LR) bitstreams, which are used to reconstruct high-resolution (HR) videos at the decoder. Although these approaches show promising results, the huge comput… ▽ More

    Submitted 25 November, 2024; originally announced November 2024.

  41. arXiv:2411.12220  [pdf, other

    cs.LG cs.AI cs.CR

    DeTrigger: A Gradient-Centric Approach to Backdoor Attack Mitigation in Federated Learning

    Authors: Kichang Lee, Yujin Shin, Jonghyuk Yun, Songkuk Kim, Jun Han, JeongGil Ko

    Abstract: Federated Learning (FL) enables collaborative model training across distributed devices while preserving local data privacy, making it ideal for mobile and embedded systems. However, the decentralized nature of FL also opens vulnerabilities to model poisoning attacks, particularly backdoor attacks, where adversaries implant trigger patterns to manipulate model predictions. In this paper, we propos… ▽ More

    Submitted 3 February, 2025; v1 submitted 18 November, 2024; originally announced November 2024.

    Comments: 21 pages

    MSC Class: 68T07 ACM Class: I.2.11

  42. arXiv:2411.08117  [pdf, other

    astro-ph.GA

    Tracing the Formation History of Intrahalo Light with Horizon Run 5

    Authors: Hyungjin Joo, M. James Jee, Juhan Kim, Jaehyun Lee, Jongwan Ko, Changbom Park, Jihye Shin, Owain Snaith, Christophe Pichon, Brad Gibson, Yonghwi Kim

    Abstract: We investigate the formation history of intrahalo light (IHL) using the high-resolution (~1 kpc), large-scale (~Gpc) cosmological hydrodynamical simulation, Horizon Run 5 (HR5). IHL particles are identified by carefully considering both their binding energies and positions with respect to the tidal radii of individual galaxies. By analyzing more than 1,200 galaxy groups and clusters with… ▽ More

    Submitted 12 November, 2024; originally announced November 2024.

    Comments: Submitted to ApJ, 14 pages, 11 figures

  43. arXiv:2411.05256  [pdf, other

    physics.ins-det hep-ex

    Radiopurity measurements of liquid scintillator for the COSINE-100 Upgrade

    Authors: J. Kim, C. Ha, S. H. Kim, W. K. Kim, Y. D. Kim, Y. J. Ko, E. K. Lee, H. Lee, H. S. Lee, I. S. Lee, J. Lee, S. H. Lee, S. M. Lee, Y. J. Lee, G. H. Yu

    Abstract: A new 2,400 L liquid scintillator has been produced for the COSINE-100 Upgrade, which is under construction at Yemilab for the next COSINE dark matter experiment phase. The linear-alkyl-benzene-based scintillator is designed to serve as a veto for NaI(Tl) crystal targets and a separate platform for rare event searches. We measured using a sample consisting of a custom-made 445 mL cylindrical Teflo… ▽ More

    Submitted 7 November, 2024; originally announced November 2024.

  44. arXiv:2411.02824  [pdf, other

    cs.LG eess.SY

    Layer-Adaptive State Pruning for Deep State Space Models

    Authors: Minseon Gwak, Seongrok Moon, Joohwan Ko, PooGyeon Park

    Abstract: Due to the lack of state dimension optimization methods, deep state space models (SSMs) have sacrificed model capacity, training search space, or stability to alleviate computational costs caused by high state dimensions. In this work, we provide a structured pruning method for SSMs, Layer-Adaptive STate pruning (LAST), which reduces the state dimension of each layer in minimizing model-level outp… ▽ More

    Submitted 31 January, 2025; v1 submitted 5 November, 2024; originally announced November 2024.

    Comments: NeurIPS 2024, Added missing arXiv information for one reference

  45. arXiv:2411.01974  [pdf, other

    cond-mat.dis-nn cs.IT cs.LG

    On the phase diagram of extensive-rank symmetric matrix denoising beyond rotational invariance

    Authors: Jean Barbier, Francesco Camilli, Justin Ko, Koki Okajima

    Abstract: Matrix denoising is central to signal processing and machine learning. Its statistical analysis when the matrix to infer has a factorised structure with a rank growing proportionally to its dimension remains a challenge, except when it is rotationally invariant. In this case the information theoretic limits and an efficient Bayes-optimal denoising algorithm, called rotational invariant estimator [… ▽ More

    Submitted 14 March, 2025; v1 submitted 4 November, 2024; originally announced November 2024.

  46. arXiv:2411.00521  [pdf, other

    astro-ph.GA

    Initial Mass Functions of Young Stellar Clusters from the Gemini Spectroscopic Survey of Nearby Galaxies I. Young Massive Clusters in the Antennae galaxies

    Authors: Jae-Rim Koo, Hyun-Jeong Kim, Beomdu Lim

    Abstract: The stellar initial mass function (IMF) is a key parameter to understand the star formation process and the integrated properties of stellar populations in remote galaxies. We present a spectroscopic study of young massive clusters (YMCs) in the starburst galaxies NGC 4038/39. The integrated spectra of seven YMCs obtained with GMOS-S attached to the 8.2-m Gemini South telescope reveal the spectral… ▽ More

    Submitted 1 November, 2024; originally announced November 2024.

    Comments: 18 pages, 9 figures, accepted for publication in AJ

  47. arXiv:2410.22815  [pdf, other

    cs.LG cs.AI cs.DC

    Towards Robust and Efficient Federated Low-Rank Adaptation with Heterogeneous Clients

    Authors: Jabin Koo, Minwoo Jang, Jungseul Ok

    Abstract: Federated fine-tuning for Large Language Models (LLMs) has recently gained attention due to the heavy communication overhead of transmitting large model updates. Low Rank Adaptation (LoRA) has been proposed as a solution, yet its application in federated learning is complicated by discordance in aggregation. Existing methods addressing this discordance often suffer from performance degradation at… ▽ More

    Submitted 30 October, 2024; originally announced October 2024.

  48. arXiv:2410.19503  [pdf, other

    cs.CL

    SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models

    Authors: Jahyun Koo, Yerin Hwang, Yongil Kim, Taegwan Kang, Hyunkyung Bae, Kyomin Jung

    Abstract: Despite the success of Large Language Models (LLMs), they still face challenges related to high inference costs and memory requirements. To address these issues, Knowledge Distillation (KD) has emerged as a popular method for model compression, with student-generated outputs (SGOs) as training data being particularly notable for reducing the mismatch between training and inference. However, SGOs o… ▽ More

    Submitted 22 April, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

    Comments: NAACL 2025 Findings

  49. The Most Massive Early-type Galaxies Exhibit Tidal Features More Frequently in Lower-density Environments

    Authors: Yongmin Yoon, Jae-Woo Kim, Jongwan Ko

    Abstract: The most massive early-type galaxies (ETGs) are known to form through numerous galaxy mergers. Thus, it is intriguing to study whether their formation in low-density environments, where nearby companions are almost absent, is associated with mergers, which are directly traced by tidal features. Using the 436 most massive ETGs with $M_\mathrm{star}>10^{11.2}\,M_{\odot}$ at $z<0.04$, we determine th… ▽ More

    Submitted 23 October, 2024; originally announced October 2024.

    Comments: 16 pages, 10 figures, published on October 18 in ApJ

    Journal ref: The Astrophysical Journal, Volume 974, Issue 2, id. 299, 13 pp. (2024)

  50. arXiv:2410.09362  [pdf, other

    cs.LG cs.AI

    SeRA: Self-Reviewing and Alignment of Large Language Models using Implicit Reward Margins

    Authors: Jongwoo Ko, Saket Dingliwal, Bhavana Ganesh, Sailik Sengupta, Sravan Bodapati, Aram Galstyan

    Abstract: Direct alignment algorithms (DAAs), such as direct preference optimization (DPO), have become popular alternatives for Reinforcement Learning from Human Feedback (RLHF) due to their simplicity, efficiency, and stability. However, the preferences used in DAAs are usually collected before the alignment training begins and remain unchanged (off-policy). This can lead to two problems where the policy… ▽ More

    Submitted 12 October, 2024; originally announced October 2024.